Rai14 | |||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||
Aliases | 1700008J19Rik1700020L11RikAnkycorbinNorpegmKIAA1334retinoic acid induced 14 | ||||||||||||||||||||||||
External IDs | HomoloGene: 9199 GeneCards: | ||||||||||||||||||||||||
| |||||||||||||||||||||||||
Orthologs | |||||||||||||||||||||||||
Species | Human | Mouse | |||||||||||||||||||||||
Entrez |
| ||||||||||||||||||||||||
Ensembl |
| ||||||||||||||||||||||||
UniProt |
| ||||||||||||||||||||||||
RefSeq (mRNA) |
| ||||||||||||||||||||||||
RefSeq (protein) |
| ||||||||||||||||||||||||
Location (UCSC) | Chr 15: 10.57 – 10.71 Mb | n/a | |||||||||||||||||||||||
PubMed search | [1] | n/a | |||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||
|
Ankycorbin is an ankyrin repeat and coiled-coil domain containing protein that in humans is encoded by the RAI14 gene. It is expressed in a variety of human tissues and is thought to play a role in actin regulation of ectoplasmic specialization, establishment of sperm polarity and sperm adhesion. It may also promote the integrity of Sertoli cell tight junctions at the blood testis barrier.
RAI14 contains 15 exons and is located on the plus strand of chromosome number 5 at position 5p13.2. It spans 5,068 base pairs from position 34,656,328 to 34,832,612. [2]
Genes TTC23L and LOC105374721 neighbor RAI14 on chromosome 5.
RAI14 is expressed within a wide range of human tissues. Some areas of the highest expression by TPM (transcripts per million) include tissues of the endometrium, smooth muscle, cervix, cervix, testis, and spleen. Within the human brain, RAI14 expression is abundant in the area around the brain stem and medulla. The highest expression levels came from the myelencephalon, a region of the embryonic brain that would later become the medulla oblongata. [3]
RAI14 does not have any paralogs.
Most of the entire sequence of the ankycorbin protein is well conserved between the majority of mammals and vertebrates. [4] More distantly related orthologs, such as sharks, begin to lose similarity and become less conserved in the C-terminus end of the protein. Ankycorbin is not well conserved within invertebrates, such as coral, at all. The latter half of the protein is largely missing and the only portion that retains some semblance of conservation is the area near the N-terminus that is associated with the ankyrin repeat domain.
Genus and Species | Common Name | Date of Divergence (from Homosapiens) | NCBI Accession Number | Sequence Length (Amino Acids) | Sequence Identity |
---|---|---|---|---|---|
Homo sapiens | Human | NP_001138992.1 | 980 | 100% | |
Castor canadensis | North American Beaver | 88 MYA | XP_020026070.1 | 980 | 92% |
Delphinapterus leucas | Beluga Whale | 94 MYA | XP_022455298.1 | 980 | 90% |
Pelecanus crispus | Dalmatian pelican | 320 MYA | XP_009485865.1 | 977 | 69% |
Python bivittatus | Burmese python | 320 MYA | XP_007432514.1 | 992 | 66% |
Xenopus laevis | African clawed frog | 353 MYA | XP_018100022.1 | 971 | 53% |
Rhincodon typus | Whale Shark | 465 MYA | XP_020373853.1 | 719 | 46% |
Stylophora pistillata | Coral | 685 MYA | XP_022783988.1 | 653 | 31% |
Ankycorbin has four different isoforms (A, B, C, and D) with most isoforms containing one or more splice variants that mainly differ in the 5' UTR. Isoform D is the longest known isoform at 983 amino acids in length, however isoform A (980 amino acids) is the most predominant form of the protein. [5]
The RAI14 protein has a predicted weight of 110.0 kDal and an isoelectric point of 5.87. It's also predicted to have glutamine rich region between nucleotides 914-978 and is otherwise rich in lycine, glutamate, and serine while being poor in glycine and phenylalanine. [6] It is a soluble protein. [7]
Ankycorbin contains an ankyrin repeat region located near the N-terminus [8] that consists of 6 ankyrin repeats. The protein is predominantly composed of alpha helices and coiled-coil domains with no predicted beta-sheet structures.
RAI14 is a highly phosphorylated protein with hundreds of predicted phosphorylation sites. It is predicted to be a nuclear protein and contains 3 nuclear localization signals.
RAI14 is predicted to play a role in cell-cell adhesion sites, particularly the cell-cell interactions of Sertoli cells from within the testis. Sertoli cells are involved in the creation of the blood testis barrier and provides a specialized, protected environment within the seminiferous tubules of the testis for germ cell development. [9]
TSBP1 is a protein that in humans is encoded by the TSBP1 gene. C6orf10 is an open reading frame on chromosome 6 containing a protein that is ubiquitously expressed at low levels in the adult genome and may play a role during fetal development. C6orf10 has been found to be linked to both neurodegenerative and autoimmune diseases in adults. Expression of this gene is highest in the testis but is also seen in other tissue types such as the brain, lens of the eye and the medulla. TSBP1 was previously known as C6orf10.
Coiled-coil domain containing 94 (CCDC94), is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.
NBEAL1 is a protein that in humans is encoded by the NBEAL1 gene. It is found on chromosome 2q33.2 of Homo sapiens.
Coiled-coil domain-containing protein 138, also known as CCDC138, is a human protein encoded by the CCDC138 gene. The exact function of CCDC138 is unknown.
Cilia And Flagella Associated Protein 206 (CFAP206) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs (trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38
Family with sequence similarity 63, member A is a protein that, in humans, is encoded by the FAM63A gene. It is located on the minus strand of chromosome 1 at locus 1q21.3.
Intermediate filament family orphan 1 is a protein that in humans is encoded by the IFFO1 gene. IFFO1 has uncharacterized function and a weight of 61.98 kDa. IFFO1 proteins play an important role in the cytoskeleton and the nuclear envelope of most eukaryotic cell types.
FAM76A is a protein that in Homo sapiens is encoded by the FAM76A gene. Notable structural characteristics of FAM76A include an 83 amino acid coiled coil domain as well as a four amino acid poly-serine compositional bias. FAM76A is conserved in most chordates but it is not found in other deuterostrome phlya such as echinodermata, hemichordata, or xenacoelomorpha—suggesting that FAM76A arose sometime after chordates in the evolutionary lineage. Furthermore, FAM76A is not found in fungi, plants, archaea, or bacteria. FAM76A is predicted to localize to the nucleus and may play a role in regulating transcription.
The coiled-coil domain containing 142 (CCDC142) is a gene which in humans encodes the CCDC142 protein. The CCDC142 gene is located on chromosome 2, spans 4339 base pairs and contains 9 exons. The gene codes for the coiled-coil domain containing protein 142 (CCDC142), whose function is not yet well understood. There are two known isoforms of CCDC142. CCDC142 proteins produced from these transcripts range in size from 743 to 665 amino acids and contain signals suggesting protein movement between the cytosol and nucleus. Homologous CCDC142 genes are found in many animals including vertebrates and invertebrates but not fungus, plants, protists, archea, or bacteria. Although the function of this protein is not well understood, it contains a coiled-coil domain and a RINT1_TIP1 motif located within the coiled-coil domain.
PROSER1 is a protein that in humans is encoded by the PROSER1 gene.
FAM221B is a protein that in humans is encoded by the FAM221B gene . FAM221B is also known by the alias C9orf128, is expressed at low level, and is defined by 17 GenBank accessions . It is predicted to function in transcription regulation as a transcription factor.
BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.
SMCO4 is a human gene that encodes the FN5 protein. The FN5 protein is a single-pass transmembrane protein, which means that one end of the protein will remain in the cytoplasm, while the other end of the protein is exposed to the interior of the cell.
Transmembrane Protein 217 is a protein encoded by the gene TMEM217. TMEM217 has been found to have expression correlated with the lymphatic system and endothelial tissues and has been predicted to have a function linked to the cytoskeleton.
Coiled-coil domain containing 166 is a protein that in humans is encoded by the CCDC166 gene. Its function is currently unknown. It contains a coiled-coil domain, hence the current origin of its name. It is primarily expressed in the testes.
Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
CAP-Gly Domain Containing Linker Protein Family Member 4 is a protein that in humans is encoded by the CLIP4 gene. In terms of conserved domains, the CLIP4 gene contains primarily ankyrin repeats and the eponymous CAP-Gly domains. The structure of the CLIP4 protein is largely made up of coil, with alpha helices dominating the rest of the protein. CLIP4 mRNA expression occurs largely in the adrenal cortex and atrioventricular node. The literature encompassing CLIP4's conserved domains and paralogs points toward microtubule regulation as a possible function of CLIP4.
C12Orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12Orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.