C19orf44 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C19orf44 , chromosome 19 open reading frame 44, Chromosome 19 open reading frame 44 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1919504 HomoloGene: 12975 GeneCards: C19orf44 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. [5] C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein (and gene) exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, [6] but also has significant expression in the thyroid and parathyroid. [7] Other names for this protein include: LOC84167. [8]
The entire gene is 25,416 base pairs in length, [9] and has an unprocessed mRNA that is 3,446 nucleotides in length. [6] It contains 10 exons that code for a 657 amino acid protein. There are 7 splice variants that exist for C19orf44. [10]
C19orf44 is located on the nineteenth chromosome on 19p13.11. [6]
C19orf44 has a molecular weight of 71,343 Da, [11] and an isoelectric point of 5.52. [12] The amino acid sequence for C19orf44 was found to be serine rich using tools on EMBL-EBI. [13] Additionally, there is a domain of unknown function (DUF) located from amino acid 474 to 641. [14]
C19orf44 has experimentally determined phosphorylation sites at the S114 and S213 positions. [14] Other predicted post-translational modifications were found using tools on ExPASy [15] and are shown in the protein illustration below. N-terminal acetylation is predicted at S3. There is also a predicted sumoylation motif from amino acid 212 to 221.
C19orf44 is predicted to be localized in the nucleus or cytosol. [17]
C19orf44 is shown to be expressed at low levels in various tissues throughout the body as shown by NCBI's EST Profile. [18] It most highly expressed in the testis and ovary, [6] but also has significant expression in the thyroid and parathyroid. [7] C19orf44 is expressed in all stages of development, except for in infants. There is an increased expression of C19orf44 in a developing fetus. [18]
Orthologs of C19orf44 have been found in most mammals and a select few other vertebrates and invertebrates. Multiple sequence alignments using ClustalW [19] provided evidence that the DUF in C19orf44 is highly conserved in its orthologs. The table below represents a small selection of the orthologs found using NCBI Blast. [20]
Genus and Species | Common Name | Accession Number (from NCBI [21] ) | Divergence (MYA) [22] | Sequence Identity (%) [23] |
---|---|---|---|---|
Rhinopithecus roxellana | Golden Snub-nosed Monkey | XP_010359783.1 | 29 | 86.9 |
Orcinus orca | Killer Whale | XP_004277754.1 | 96 | 83.2 |
Sus scrofa | Wild Boar | XP_005661251.2 | 96 | 60.1 |
Monodelphis domestica | Opossum | XP_007489796.1 | 159 | 45.5 |
Chelonia mydas | Green Sea Turtle | XP_007072179.1 | 312 | 35.2 |
Astyanax mexicanus | Mexican Tetra | XP_007246256.2 | 435 | 28.2 |
Mizuhopecten yessoensis | Scallop | XP_021343742.1 | 797 | 24.4 |
There are no paralogs for C19orf44 in Homo sapiens.
C19orf44 has been found to interact with various proteins from the two-hybrid screening method. Interactions with Hsp90 co-chaperone (CDC37), [24] and spermatid associated protein (SPERT) [25] have been found.
C9orf64 is a gene located on chromosome 9, that in humans encodes the protein queuosine salvage protein. The function and biological process of the queuosine salvage protein is not well understood by the scientific community, but some evidence from orthologs indicates it may be involved in tRNA processing. The most common mRNA contains 4 coding exons, and it has 2 additional alternatively spliced exons. C9orf64 has been found in 5 different splice variants.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
C14orf180 is found on chromosome 14 in humans: 14q32.33. It consists of 1832 bp and 160 amino acids post translation. There is a total number of 6 exons. C14orf180 is also known as NRAC, C14orf77, and Chromosome 14 Open Reading Frame 180.
Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.
Chromosome 3 open reading frame 38 (C3orf38) is a protein which in humans is encoded by the C3orf38 gene.
C4orf19 is a protein which in humans is encoded by the C4orf19 gene.
C12orf54 is a protein in humans that is encoded by the C12orf54 gene.