EVA1C (Eva-1 Homolog C) is a transmembrane protein in humans (Homo sapiens) that is encoded by the EVA1C gene on chromosome 21. The EVA1C protein is thought to be involved in heparin binding. In addition, the gene is thought to be associated with diseases such as X-Linked Intellectual Disability-Short Stature-Overweight Syndrome.
B18, B19, C21orf63, C21orf64, FAM176C, PRED34, and SUE21 are aliases of the EVA1C gene. [2]
EVA1C is located on the plus strand of chromosome 21 (21q22.11). [2] The span of the EVA1C gene is 103,394 bases (chr21:33,784,314-33,887,707). [2]
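The stated span follows directly from the coordinates. The short sketch below reproduces the arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is the convention under which the quoted figures agree); the function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive genomic interval."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    # Both endpoints are counted, hence the +1.
    return end - start + 1


# Coordinates quoted in the text for EVA1C on chromosome 21.
EVA1C_START = 33_784_314
EVA1C_END = 33_887_707

print(span_length(EVA1C_START, EVA1C_END))  # 103394
```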
EVA1C has 9 isoforms with EVA1C isoform X1 being the longest. [3] This isoform is 441 amino acids in length. [3]
EVA1C RNA is most highly expressed in the prostate, lungs, uterus, and heart. [3] It is also highly expressed in the human stomach at 20 weeks postnatal, whereas at 11 weeks postnatal it is most highly expressed in the heart. [3] Overall, EVA1C RNA appears to be most highly expressed in respiratory organs and in male and female reproductive organs. [3] EVA1C is expressed at low levels in the human brain. [3] In the house mouse (Mus musculus) brain, however, Allen Brain Atlas data show the highest EVA1C expression in the periaqueductal gray region of the midbrain. [6]
The human (Homo sapiens) EVA1C protein has an isoelectric point (pI) of 6.5 and a molecular weight of 49 kDa. [8] Compared with its paralogs, EVA1A and EVA1B, EVA1C has the highest molecular weight and isoelectric point. [8] This indicates that EVA1C is the largest of the three proteins.
The amino acid composition of EVA1C was determined using EMBL-EBI SAPS. [9] EVA1C contains all 20 amino acids, with cysteine (C) present in high amounts. [9] The net charge of EVA1C was found to be lower than average. [9] EVA1C has a negative charge cluster spanning amino acids 369 to 389, which coincides with the disordered region. [9] The transmembrane region was found to contain a major hydrophobic segment. [9]
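To illustrate what a composition analysis of this kind reports, the sketch below computes per-residue fractions and a crude net charge for a protein sequence. It is not the SAPS algorithm: the net charge here simply counts lysine and arginine as +1 and aspartate and glutamate as -1, and the sequence `toy` is a made-up example, not the EVA1C sequence.

```python
from collections import Counter


def composition(seq: str) -> dict[str, float]:
    """Return the fraction of each amino acid present in seq."""
    counts = Counter(seq)
    total = len(seq)
    return {aa: counts[aa] / total for aa in sorted(counts)}


def net_charge(seq: str) -> int:
    """Crude net charge: (K + R) minus (D + E). A simplification."""
    positive = sum(seq.count(aa) for aa in "KR")
    negative = sum(seq.count(aa) for aa in "DE")
    return positive - negative


# Hypothetical toy sequence for illustration only (not EVA1C).
toy = "MCCKDEEDLLCA"

print(composition(toy)["C"])  # 0.25
print(net_charge(toy))        # -3
```

A cysteine-rich, net-negative toy sequence was chosen so that the output mirrors the qualitative description above; real values would require the actual isoform sequence.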
EVA1C is predicted to have six post-translational modifications, two of each type. [11] The glycosylation sites lie in the first half of the protein, while the phosphorylation and ubiquitylation sites lie in the second half. [11]
EVA1C has been shown to interact with AMN1, USE1, SLITRK3, ROBO3, FLRT3, DONSON, and POFUT2. [12]
The orthologs of EVA1C were identified using NCBI HomoloGene and sorted by median date of divergence from humans, as estimated by TimeTree. Sequence identity to the human protein was calculated using the EMBOSS Needle tool. [13] [14] [15] [16] By divergence date, the most distantly related EVA1C ortholog is that of a cartilaginous fish, the thorny skate (Amblyraja radiata). [16] Its sequence identity to the human protein is 45%. Mammals had an identity range of 87.8-98.6%, birds (Aves) 51.3-63.4%, reptiles (Reptilia) 55.1-62%, amphibians 51.7-58%, and bony fish 40.3-41.3%. [15]
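For readers unfamiliar with the identity figures quoted above, the sketch below shows one common way to compute percent identity from a finished pairwise alignment: identical columns divided by total alignment length, gaps included. That definition is an assumption here, and the two aligned strings are invented for illustration; they are not EVA1C sequences, and the code does not perform the alignment itself.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns where both sequences share a residue.

    Inputs are two rows of an existing pairwise alignment, with '-'
    marking gaps. The denominator is the full alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical aligned fragments (illustration only).
row_a = "MKT-LLCA"
row_b = "MKTALICA"

print(percent_identity(row_a, row_b))  # 75.0
```

Six of the eight columns match, so the gap column and the L/I mismatch each lower the score.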
The paralogs of EVA1C are EVA1A (Eva-1 Homolog A) and EVA1B (Eva-1 Homolog B). [17] [18] The thorny skate (Amblyraja radiata) was found to have the most distant ortholog of EVA1A, EVA1B, and EVA1C. [14] [19] [20] Humans and the thorny skate are estimated to have diverged 464 million years ago. [16]
The EVA1C gene is located in the Down syndrome critical region of chromosome 21. [21] Down syndrome results from an extra copy of chromosome 21. [21] People with Down syndrome experience neurological and muscle impairments. [21] A study of orthologs of chromosome 21 genes in the roundworm Caenorhabditis elegans found that the EVA1C ortholog was among those required for neuromuscular behaviors. [21] These results suggest that EVA1C may be one of the genes underlying the phenotypes of Down syndrome. [21]