C3orf38

Last updated
C3orf38
Identifiers
Aliases C3orf38 , chromosome 3 open reading frame 38
External IDs MGI: 1914859 HomoloGene: 27867 GeneCards: C3orf38
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_173824

NM_026273

RefSeq (protein)

NP_776185

NP_080549

Location (UCSC) Chr 3: 88.15 – 88.17 Mb Chr 16: 64.57 – 64.59 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Chromosome 3 open reading frame 38 (C3orf38) is a protein which in humans is encoded by the C3orf38 gene.

Contents

Gene

Figure depicting human chromosome 3 and the 3p11.1 location at which the C3orf38 gene is found. Image derived from GeneCards. C3orf38-gene.png
Figure depicting human chromosome 3 and the 3p11.1 location at which the C3orf38 gene is found. Image derived from GeneCards.

The C3orf38 gene is located on chromosome 3 (3p11.1) on the forward strand. [5] It spans 18,771 bases from chr3:88,149,959-88,168,729. [5] It contains 3 exons. [6] Common aliases for this gene are MGC26717, LOC285237, and FLJ54270. [7] Some of the genes neighboring C3orf38 include ZNF654, CGGBP1, and LOC105377202. [8]

Transcripts

C3orf38 Transcripts
Protein NameGene IDTranscript AccessionLength (nt)Length (aa)
uncharacterized protein C3orf38285237 NM_173824.4 2414329
uncharacterized protein C3orf38 isoform X1285237 XM_005264745.5 2356328

Protein

Multiple sequence alignment of C3orf38 protein in humans and various orthologs showing DUF conservation. MSA created using BoxShade tools. DUF4518 conservation MSA.png
Multiple sequence alignment of C3orf38 protein in humans and various orthologs showing DUF conservation. MSA created using BoxShade tools.

The C3orf38 protein is 329 amino acids in length. [9] A large domain of unknown function, DUF4518, encompasses majority of the C3orf38 protein. [9] This domain is a part of the protein family pfam15008, which is thought to be involved in apoptosis regulation. [10] This pfam15008 is the only member of the cl20886 superfamily. [10] While the C3orf38 protein does not have any abnormal amino acid abundance as a whole, the DUF4518 has a high abundance of histidines and a low abundance of serines, according to compositional analysis. [11] The predicted molecular weight of the entire C3orf38 protein is 37.0 kD and the isoelectric point is 6.01. [12] The DUF4518 contained inside the C3orf38 protein has a predicted molecular weight of 31 kD and an isoelectric point of 6.49. [12]

Regulation

Gene Level Regulation

There have been a number of potential promoters identified for the C3orf38 gene, which are described in the table below. [13]

Potential Promoters for the C3orf38 Gene [13]
PromoterStartEndLength (bp)Transcripts
GXP_20311888148634881500461413GXT_23216585, GXT_22791246, GXT_2803824, GXT_26239186
GXP_979596288148768881498071040no transcript assigned; promoter based on comparative genomics
GXP_979596388148794881500271234no transcript assigned; promoter based on comparative genomics
GXP_319483688149604881506431040GXT_24485561

The C3orf38 gene exhibits ubiquitous expression in human tissues. [14]

Normal human tissue expression profiling (HG-U95E) for the C3orf38 gene. Data pictured is captured from the NCBI GEO database. Normal human tissue expression profile of C3orf38.png
Normal human tissue expression profiling (HG-U95E) for the C3orf38 gene. Data pictured is captured from the NCBI GEO database.

Protein Level Regulation

The C3orf38 protein is expected to be found with the highest confidence in the cytoplasm. [15] This finding is supported by examination of an array of C3orf38 orthologs. [15]

There are several well conserved post translation modification sites found amongst the human C3orf38 protein and its orthologs, which are depicted in the table below. [16] Majority of these PTMs are PKC phosphorylation sites. [16] Additionally, two confirmed active sites are located in the C3orf38 protein. The first is an aldehyde dehydrogenases glutamic acid active site located from amino acids 1-8. [16] The second site is a eukaryotic thiol (cysteine) proteases histidine active site located from amino acids 227-237. [16]

Predicted cellular localization of C3orf38 in humans and several orthologs. Localization predictions gathered from PSORT II Prediction tool. Ortholog table with cellular localizations.pdf
Predicted cellular localization of C3orf38 in humans and several orthologs. Localization predictions gathered from PSORT II Prediction tool.
Conserved Post Translational Modification Sites
PTMProtein Location (aa)
Myristyl site235-240
PKC phosphorylation site34-36
PKC phosphorylation site86-88
PKC phosphorylation site199-201
PKC phosphorylation site265-267

Homology/evolution

Orthologs for the C3orf38 protein can be found in mammals, reptiles, birds, amphibians, fish, and invertebrates using BLAST searches. [17] A selection of these orthologs can be found in the ortholog table below. There are no paralogs. [17] Additionally, by comparing sequences of C3orf38 protein with cytochrome C and fibrinogen alpha proteins, a moderate rate of evolution was determined for the C3orf38 protein.

C3orf38 Ortholog Table [17] [18] [19]
Genus, speciesCommon NameTaxonomic GroupDivergence Date (MYA)Accession NumberSequence Length (aa)Sequence Identity (%)Sequence Similarity (%)
Mammals Homo sapiensHumanPrimates0NP_776185.2329100100
Pan paniscusBonoboPrimates6.7XP_003831564.132999.499.7
Puma concolorPumaCarnivora96XP_025769652.134879.886.6
Reptiles Mauremys reevesiiReeve's TurtleTestudines312XP_039379932.131555.770.5
Chelonoidis abingdoniiAbingdon Island Giant TortoiseTestudines312XP_032650981.130455.469.9
Birds Strigops habroptilaKakapoPsittaciformes312XP_030327387.130952.166.3
Taeniopygia guttataZebra FinchPasseriformes312XP_002190058.53065163.9
Gallus gallusChickenGalliformes312XP_004938363.231244.259.9
Amphibians Rhinatrema bivittatumTwo-Lined CaecilianGymnophiona351.8XP_029434832.128949.764.5
Bufo bufoCommon ToadAnura351.8XP_040279187.128943.962.1
Xenopus tropicalisTropical Clawed FrogAnura351.8XP_017946806.126138.654.8
Fish Chelmon rostratusCopperband ButterflyfishPerciformes435XP_041807133.130242.758.2
Coregonus clupeaformisLake WhitefishSalmoniformes435XP_041700482.130842.460.6
Carcharodon carchariasGreat White SharkLamniformes473XP_041066710.13084559.8
Amblyraja radiataThorny SkateRajiformes473XP_032888490.138232.546.5
Invertebrates Lytechinus variegatusSea UrchinTemnopleuroida684XP_041465399.131236.448.3
Patiria miniataBat StarValvatida684XP_038067113.129434.146.2
Cryptotermes secundusTermiteBlattodea797XP_023724689.129630.148
Crassostrea virginicaEastern OysterOstreidae797XP_022335568.134029.646.5
Diabrotica virgiferaWestern Corn RootwormColeoptera797XP_028133096.128426.943.6
Acropora milleporaBranching Stony CoralScleractinia824XP_029194133.128832.650.9
Figure showing an evolution rate graph comparing the C3orf38, cytochrome C, and fibrinogen alpha proteins. Noting the cytochrome C to be a relatively slow-evolving protein and the fibrinogen alpha to be a relatively fast-evolving protein, it is clear that C3orf38 protein evolves at a comparatively moderate rate. Evolution Rate Graph.jpg
Figure showing an evolution rate graph comparing the C3orf38, cytochrome C, and fibrinogen alpha proteins. Noting the cytochrome C to be a relatively slow-evolving protein and the fibrinogen alpha to be a relatively fast-evolving protein, it is clear that C3orf38 protein evolves at a comparatively moderate rate.

Function

Although investigation into the function of the C3orf38 gene is ongoing, a couple studies have granted valuable insights into its role. One study has identified C3orf38 as a candidate proapoptotic gene. [20] Another study identified C3orf38 as a top candidate tumor suppressor gene (TSG). [21]

Interacting proteins

Of the various proteins C3orf38 protein interacts with, two are particularly interesting seeing as C3orf38 is a candidate proapoptotic and tumor suppressor gene. First, BAG family molecular chaperone regulator 4 (BAG4) is an anti-apoptotic protein that is known to interact with a number of apoptosis and growth-related proteins. [22] Second, DnaJ Heat Shock Protein Family Member B4 (DNAJB4) is a member of the heat shock protein-40 family (Hsp40), a molecular chaperone, and a tumor suppressor (specifically for colorectal carcinoma). [23]

Related Research Articles

<span class="mw-page-title-main">C9orf64</span> Protein-coding gene in the species Homo sapiens

C9orf64 is a gene located on chromosome 9, that in humans encodes the protein queuosine salvage protein. The function and biological process of the queuosine salvage protein is a queuosine-nucleotide N-glycosylase/hydrolase (QNG1) that releases queuine from Q-5'-monophosphate, and this activity is required for the salvage of queuine from exogenous Queuosine by S. pombe and HeLa cells. Some evidence from orthologs indicates it may be involved in tRNA processing and recycling. The most common mRNA contains 4 coding exons, and it has 2 additional alternatively spliced exons. C9orf64 has been found in 5 different splice variants.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.

<span class="mw-page-title-main">C12orf60</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

<span class="mw-page-title-main">C20orf202</span>

C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">C12orf29</span> Protein-coding gene in the species Homo sapiens

C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.

<span class="mw-page-title-main">C12orf50</span> Protein encoding gene C12orf50

Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

<span class="mw-page-title-main">C5orf22</span> Protein-coding gene in the species Homo sapiens

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

<span class="mw-page-title-main">Chromosome 12 open reading frame 71</span> Protein encoded in humans by c12orf71 gene

Chromosome 12 open reading frame 71 (c12orf71) is a protein which in humans is encoded by c12orf71 gene. The protein is also known by the alias LOC728858.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000179021 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000059920 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 "C3orf38". www.genecards.org. Archived from the original on 2011-11-29. Retrieved 2021-09-30.
  6. "Homo sapiens chromosome 3 open reading frame 38 (C3orf38), mRNA". 2021-04-16.{{cite journal}}: Cite journal requires |journal= (help)
  7. "AceView: Gene:C3orf38, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2021-09-30.
  8. "C3orf38 chromosome 3 open reading frame 38 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-17.
  9. 1 2 "uncharacterized protein C3orf38 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-09-30.
  10. 1 2 "CDD Conserved Protein Domain Family: DUF4518". www.ncbi.nlm.nih.gov. Retrieved 2021-12-17.
  11. "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2021-12-17.
  12. 1 2 "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2021-12-17.
  13. 1 2 "Genomatix Software Suite". Archived from the original on 2012-01-14.
  14. 1 2 "2928464 - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-18.
  15. 1 2 3 "PSORT II Prediction". psort.hgc.jp. Retrieved 2021-12-18.
  16. 1 2 3 4 "Motif Scan". myhits.sib.swiss. Retrieved 2021-12-18.
  17. 1 2 3 "Protein BLAST: search protein databases using a protein query". blast.ncbi.nlm.nih.gov. Retrieved 2021-12-17.
  18. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2021-12-17.
  19. "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2021-12-17.
  20. Park, Kyung Mi; Kang, Eunju; Jeon, Yeo-Jin; Kim, Nayoung; Kim, Nam-Soon; Yoo, Hyang-Sook; Yeom, Young Il; Kim, Soo Jung (2007-04-30). "Identification of novel regulators of apoptosis using a high-throughput cell-based screen". Molecules and Cells. 23 (2): 170–174. ISSN   1016-8478. PMID   17464193.
  21. Cody, Neal A. L.; Shen, Zhen; Ripeau, Jean-Sebastien; Provencher, Diane M.; Mes-Masson, Anne-Marie; Chevrette, Mario; Tonin, Patricia N. (2009). "Characterization of the 3p12.3-pcen region associated with tumor suppression in a novel ovarian cancer cell line model genetically modified by chromosome 3 fragment transfer". Molecular Carcinogenesis. 48 (12): 1077–1092. doi:10.1002/mc.20535. ISSN   1098-2744. PMID   19347865. S2CID   10259832.
  22. "BAG4". www.genecards.org. Archived from the original on 2011-11-27. Retrieved 2021-12-18.
  23. "DNAJB4". www.genecards.org. Archived from the original on 2021-12-18. Retrieved 2021-12-18.