ERICH5 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | ERICH5 , C8orf47, glutamate rich 5 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 2447772; HomoloGene: 52129; GeneCards: ERICH5; OMA:ERICH5 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Glutamate rich protein 5 is a protein in humans encoded by the ERICH5 gene, also known as chromosome 8 open reading frame 47 (C8orf47).
The ERICH5 gene is located on human chromosome 8 at 8q22.2 and spans 29 kb on the plus strand of the DNA. [5] ERICH5 contains three exons and two introns. [6] ERICH5 is also known as C8orf47 and glutamate rich 5. [5]
The ERICH5 protein has two isofroms. The longest isofrom, isofrom 1, spans 1,550 base pairs and is composed of 374 amino acids. The second isoform lacks the third and final exon and is 596 base pairs long. [6]
ERICH5 contains one conserved domain, a domain of unknown function called DUF4573. [7] ERICH5 is predicted to contain two highly conserved motifs, an APC/C binding motif and the LIG_FHA_2 motif. [8] The APC/C motif spans amino acid 222-226 and serves as a binding site for the anaphase-promoting complex. [9] The LIG_FHA_2 motif is involved in the cell checkpoint pathway and is found in many proteins localized in the nucleus that regulate cell cycle. [10]
ERICH5 is predicted to undergo several post-translational modifications including phosphorylation, O-glycosylation, and sumoylation. [11] [12] [13] Many of the phosphorylation sites and O-glycosylation sites were predicted at the same amino acid. The post translational modifications shown are those conserved among ERICH5 orthologs. Several kinases were predicted to phosphorylate ERICH5 including PKC, cdc2, CKI, PKA, DNAPK, ATM, EGFR, and CKII. [11]
Post-translational modification | Amino acid location |
Phosphorylation | S4, S5, S27, S32, S49, T50, S58, S100, T101, T104, T138, S169, S227, S234, T239, T248, S274, T289, S307, Y320, T340, T346 |
O-Glycosylation | S5, S27, S32, S49, T50, S58, S100, T101, T104, T138, S169, S227, S234, T239, T248, S274, T289, S307 |
Sumoylation | K121, K131, K211, K251, K343, K360 |
ERICH5 was predicted to contain three alpha helices and two beta sheets as well as regions of random coils. [14]
ERICH5 was predicted to be localized in the nucleus. [16]
ERICH5 was predicted to interact with several proteins through yeast two-hybrid screening and affinity chromatography. [15] Several of the proteins ERICH5 was predicted to interact with were also localized in the nucleus. [15]
ERICH5 shows elevated levels of expression in the fetal liver, liver, pancreas, and retina compared to other tissues.
ERICH5 shows increased expression in Alcoholic Hepatitis.
There are no known paralogs of ERICH5. [19]
True orthologs of ERICH5 have only been identified among mammalian species. The most distantly related mammalian ortholog is in Monodelphis domestica, or the gray short-tailed opossum. [20]
Species name | Common names | MYA | % similarity | % identity | Accession # | Protein length |
Acinonyx jubatus | Cheetah | 96 | 71% | 65% | XP_014937513.1 | 358 |
Cerathotherium | White rhino | 96 | 71% | 66% | XP_004431343.2 | 358 |
Trichechus manatus latirostris | Manatee | 105 | 68% | 59% | XP_004370879.3 | 342 |
Monodelphis | Gray short-tailed opossum | 159 | 40% | 28% | XP_007488209.1 | 478 |
ERICH5 has distant orthologs among birds and reptiles. These distant orthologs contain only the third exon of ERICH5. [20]
Species name | Common name | MYA | % similarity | % identity | Accession # | Protein length |
Gallus gallus | Chicken | 312 | 80% | 64% | XP_004940017.1 | 207 |
Chysemys picta belli | Painted turtle | 312 | 74% | 59% | XP_008166572.1 | 442 |
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.
CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
Chromosome X Open Reading Frame 38 (CXorf38) is a protein which, in humans, is encoded by the CXorf38 gene. CXorf38 appears in multiple studies regarding the escape of X chromosome inactivation.
Testis expressed 55 (TEX55) is a human protein that is encoded by the C3orf30 gene located on the forward strand of human chromosome three, open reading frame 30 (3q13.32). TEX55 is also known as Testis-specific conserved, cAMP-dependent type II PK anchoring protein (TSCPA), and uncharacterized protein C3orf30.
Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.
ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.
Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
C4orf19 is a protein which in humans is encoded by the C4orf19 gene.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.