CORO6 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | CORO6 , coronin 6 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 2183448; HomoloGene: 104099; GeneCards: CORO6; OMA:CORO6 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Coronin-6 also known as coronin-like protein E (Clipin-E) is a protein that in humans is encoded by the CORO6 gene.
Coronin-6 belongs to the coronin family which is an actin binding protein. [5] [6] Human CORO6 gene is located on chromosome 17 on the cytogenetic band 17 p11.2. [7] Gene CORO6 is well conserved across domain of eukaryote organisms from fungus to animal. [8]
Based on the EST profile, CORO6 expressed in high level at the larynx, nerve and muscle. CORO6 has also been shown to be expressed in high levels in the breast (mammary gland) tumor. During the human development stage, the higher level of CORO6 expressed at blastocyst and adult. [9]
Alternative mRNAs are shown aligned from 5' to 3' on a virtual genome where introns have been shrunk to a minimal length. Exon size is proportional to length, intron height reflects the number of cDNAs supporting each intron. Introns of the same color are identical, of different colors are different. 'Good proteins' are pink, partial or not-good proteins are yellow, uORFs are green. 5' cap or3' poly A flags show completeness of the transcript . CORO6 contains 21 distinct gt-ag introns. Transcription produces 10 alternatively spliced mRNA. There are 3 probable alternative promoters, and validated alternative polyadenylation sites. [10]
CORO6 protein sequence contains WD-40 repeats. WD40 domain is a structural motif found in Eukaryotes and cover variety of functions, such as adaptor or regulatory modules in signal transduction, pre-mRNA processing and cytoskeletal assembly. It usually terminating at WD dipeptide at its C-terminus and is about 40 residues long, so called WD40. [11]
The structure of CORO6 is predicted by using Phyre2 program. It is similar to the crystal structure of murine coronin-1. 390 residues ( 83% of CORO6 protein sequence) have been modelled with 100.0% confidence by the single highest scoring template. Image coloured by rainbow N → C terminus
Human proteins which are the paralogs to CORO6, CORO1A, CORO1B, CORO1C, CORO2A, CORO2B, CORO7
The table compared Homo sapiens protein CORO6 to its paralogs
Name of paralogs | CORO6 | CORO1A | CORO1B | CORO2A | CORO2B | CORO7 |
---|---|---|---|---|---|---|
Accession number | NP_116243 | NP_009005 | NP_065174 | NP_438171 | NP_006082 | NP_078811 |
Sequence length | 472 aa | 461 aa | 489 aa | 525 aa | 480 aa | 925 aa |
Sequence identity | 67% | 67% | 45% | 45% | 32% | |
Sequence similarity | 81% | 80% | 64% | 63% | 49% | |
By comparing its paralogs we found that CORO1A and CORO1B are most related to CORO6.
CORO6 is highly conserved throughout the organisms from vertebrate to fungus, the organisms listed in the table are some representatives.
Genus and species (Orthologs comparison) | Homo sapiens | Pan troglodytes | Canis familiaris | Anolis carolinensis | Danio rerio | Saccharomyces cerevisiae | Plasmodium falciparum |
---|---|---|---|---|---|---|---|
Common name | Human | Chimpanzee | Dog | Lizard | Zebrafish | Baker's Yeast | Malaria parasite |
Date of divergence from human lineage | 6.3 MYA | 94.2 MYA | 269 MYA | 400.1 MYA | 1215.8 MYA | 1381.2 MYA | |
Accession number | NP_116243 | XP_001137660 | XP_548302 | XP_0003227217 | NP_956690 | NP_013533 | XP_001350896 |
Sequence length | 472 aa | 471 aa | 472 aa | 471 aa | 436 aa | 651 aa | 602 aa |
Sequence identity to human | 96% | 98% | 83% | 78% | 42% | 31% | |
Sequence similarity to human | 97% | 99% | 90% | 90% | 62% | 52% |
There are several clinical studies about that have been performed by using microarray indicating that CORO6 is positively related to allergic nasal epithelium response to house dust mite allergen in vitro. [12]
Coronin-1C is a protein that in humans is encoded by the CORO1C gene.
Coiled-coil domain-containing protein 113 also known as HSPC065, GC16Pof6842 and GC16P044152, is a protein that in humans is encoded by the CCDC113 gene. The human CCDC113 gene is located on chromosome 16q21 and encodes 5,304 base pairs of mRNA and 377 amino acids.
Family with Sequence Similarity 203, Member B (FAM203B) is a protein encoded by the FAM203B gene (8q24.3) in humans. While FAM203B is only found in humans and possibly non-human primates, its paralog, FAM203A, is highly conserved. The FAM203B protein contains two conserved domains of unknown function, DUF383 and DUF384, and no transmembrane domains. This protein has no known function yet, although the homolog of FAM203A in Caenorhabditis elegans (Y54H5A.2) is thought to help regulate the actin cytoskeleton.
Coiled-coil domain containing 94 (CCDC94) is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.
NBEAL1 is a protein that in humans is encoded by the NBEAL1 gene. It is found on chromosome 2q33.2 of Homo sapiens.
Coiled-coil domain-containing protein 138, also known as CCDC138, is a human protein encoded by the CCDC138 gene. The exact function of CCDC138 is unknown.
Cilia And Flagella Associated Protein 206 (CFAP206) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs (trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38
Family with sequence similarity 167, member A is a protein in humans that is encoded by the FAM167A gene located on chromosome 8. FAM167A and its paralogs are protein encoding genes containing the conserved domain DUF3259, a protein of unknown function. FAM167A has many orthologs in which the domain of unknown function is highly conserved.
WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.
Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.
Retrotransposon Gag Like 6 is a protein encoded by the RTL6 gene in humans. RTL6 is a member of the Mart family of genes, which are related to Sushi-like retrotransposons and were derived from fish and amphibians. The RTL6 protein is localized to the nucleus and has a predicted leucine zipper motif that is known to bind nucleic acids in similar proteins, such as LDOC1.
WD repeat containing protein 53 (WDR53) is a protein encoded by the WDR53 gene that has been identified in the human genome by the Human Genome Project but has, at the moment, lacked experimental procedures to understand the function. It is located on chromosome 3 at location 3q29 in Homo sapiens. It has short up and down stream untranslated regions as well as WD40 repeat regions which have been linked to various functions.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.
Coiled-coil domain containing 121 (CCDC121) is a protein encoded by the CCDC121 gene in humans. CCDC121 is located on the minus strand of chromosome 2 and encodes three protein isoforms. All isoforms of CCDC121 contain a domain of unknown function referred to as DUF4515 or pfam14988.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
KIAA2013, also known as Q8IYS2 or MGC33867, is a single-pass transmembrane protein encoded by the KIAA2013 gene in humans. The complete function of KIAA2013 has not yet been fully elucidated.
Transmembrane protein 104 (TMEM104) is a protein that in humans is encoded by the TMEM104 gene. The aliases of TMEM104 are FLJ00021 and FLJ20255. Humans have a 163,255 base pair long gene coding sequence, 4703 base pair long mRNA, and 496 amino acid long protein sequence. In Eukaryotes, the TMEM104 gene is conserved.
C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1. This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.