DGLUCY

Identifiers
Aliases DGLUCY , C14orf159, chromosome 14 open reading frame 159, D-glutamate cyclase
External IDs MGI: 2444813 HomoloGene: 11798 GeneCards: DGLUCY
Orthologs
Species Human Mouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_145448
NM_001361017
NM_001361020
NM_001361023

RefSeq (protein)

NP_663423
NP_001347946
NP_001347949
NP_001347952

Location (UCSC) Chr 14: 91.06 – 91.23 Mb Chr 12: 100.75 – 100.86 Mb
PubMed search [3] [4]

DGLUCY (D-glutamate cyclase) is a protein that in humans is encoded by the DGLUCY gene. [5]

Orthologs

The human gene, DGLUCY, is highly conserved in mammals and birds. [6] Orthologs gathered from BLAST and BLAT searches show that the human DGLUCY mRNA sequence is conserved with a sequence identity of 98% in chimpanzees, 88% in mice, and 81% in platypus and chicken. [7] [8] The following table lists orthologs gathered from BLAST searches. Sequence alignments were performed using blastn to derive the sequence identity, score, and E-value between the human C14orf159 variant 1 mRNA and each ortholog.

| Genus and species | Common name | NCBI accession number | Sequence length (bp) | Sequence identity | Score | E-value |
|---|---|---|---|---|---|---|
| Homo sapiens | Human | NM_001102366 | 3164 | 100% | | 0 |
| Pan troglodytes | Chimpanzee | XM_510121 | 2974 | 98% | 4281 | 0 |
| Mus musculus | Mouse | NM_145448 | 3231 | 88% | 495 | 0 |
| Ornithorhynchus anatinus | Platypus | XM_00154336.1 | 1962 | 81% | 217 | 0 |
| Gallus gallus | Chicken | XM_421319 | 3389 | 81% | 50 | 0 |

Human C14orf159 orthologs: mRNA

The protein encoded by the human DGLUCY gene is highly conserved among mammals, birds, amphibians, fish, tunicates, cnidarians, and echinoderms. No protein orthologs have been found in nematodes, arthropods, fungi, protists, plants, bacteria, or archaea. Fungi and bacteria do, however, contain the DUF1445 conserved domain, which is found in human C14orf159 and its orthologs. BLAST and BLAT searches were used to find orthologs of the C14orf159 protein. The following table lists protein orthologs of the human protein, with sequence identity, sequence similarity, scores, and E-values derived from blastp sequence comparisons. [9]

| Genus and species | Common name | NCBI accession number | Sequence length (amino acids) | Sequence identity | Sequence similarity | Score | E-value |
|---|---|---|---|---|---|---|---|
| Homo sapiens | Human | NP_001095839.1 | 564 | 100% | 100% | | 0 |
| Pan troglodytes | Chimpanzee | XP_510121.2 | 724 | 557/621 (89%) | 561/621 (90%) | 1109 | 0 |
| Ailuropoda melanoleuca | Panda | EFB15996.1 | 585 | 413/585 (70%) | 461/585 (78%) | 824 | 0 |
| Rattus norvegicus | Rat | XP_343096.2 | 618 | 423/618 (68%) | 470/618 (76%) | 774 | 0 |
| Mus musculus | Mouse | NP_663423.2 | 617 | 414/623 (66%) | 468/621 (75%) | 796 | 0 |
| Equus caballus | Horse | XP_001916913.1 | 581 | 390/585 (66%) | 433/585 (74%) | 728 | 6E-115 |
| Ornithorhynchus anatinus | Platypus | XP_001514386.1 | 653 | 358/628 (57%) | 443/628 (70%) | 696 | 0 |
| Gallus gallus | Chicken | XP_421319.2 | 617 | 330/614 (53%) | 414/614 (67%) | 630 | 0 |
| Xenopus tropicalis | Western clawed frog | CAJ82045.1 | 616 | 302/611 (49%) | 399/611 (65%) | 582 | 1E-170 |
| Danio rerio | Zebrafish | AAI244131.1 | 621 | 284/607 (46%) | 386/607 (63%) | 530 | 6E-155 |
| Branchiostoma floridae | Lancelet | XP_002612376.1 | 615 | 237/611 (38%) | 334/611 (54%) | 397 | 6E-115 |
| Ciona intestinalis | Vase tunicate | XP_001173256 | 486 | 161/501 (32%) | 241/501 (48%) | 244 | 5E-69 |
| Strongylocentrotus purpuratus | California purple sea urchin | XP_782739.1 | 631 | 9/33 (27%) | 15/33 (45%) | 320 | 5E-87 |
| Nematostella vectensis | Starlet sea anemone | XP_001637867 | 529 | 134/501 (26%) | 211/501 (42%) | 120 | 1E-31 |

Human C14orf159 orthologs: protein
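
The identity and similarity columns of the protein table pair a count of matching positions with the alignment length, followed by a percentage. As a minimal sketch, the percentages can be reproduced from those counts in Python. The helper name `percent` is hypothetical, and the use of truncation rather than rounding is an assumption inferred from the table's own figures (for example, 557/621 is listed as 89%), not a statement about how BLAST computes them.

```python
def percent(matches: int, aligned: int) -> int:
    """Return matches/aligned as a whole-number percentage, truncated (assumed)."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return (100 * matches) // aligned


# (identity, similarity) counts copied from the protein ortholog table.
rows = {
    "Pan troglodytes": ((557, 621), (561, 621)),
    "Mus musculus": ((414, 623), (468, 621)),
    "Equus caballus": ((390, 585), (433, 585)),
    "Strongylocentrotus purpuratus": ((9, 33), (15, 33)),
}

for species, (identity, similarity) in rows.items():
    print(f"{species}: identity {percent(*identity)}%, "
          f"similarity {percent(*similarity)}%")
```

The denominator is the length of the aligned region, not the full protein length, so the sea urchin figures describe only a 33-residue alignment.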

Post-translational modification

The protein product of the DGLUCY gene is predicted [5] and has been found [10] [11] to be translocated to the mitochondrion.

Post-translational modifications are predicted for the DGLUCY protein. All predicted sites in human DGLUCY were compared with orthologs using multiple sequence alignments to assess the likelihood of modification. [12] [13] [14] [15] [16]

Regulation

Estrogen receptor alpha, in the presence of estradiol, binds to the DGLUCY gene and likely regulates its expression. [17]

References

  1. GRCh38: Ensembl release 89: ENSG00000133943. Ensembl, May 2017.
  2. GRCm38: Ensembl release 89: ENSMUSG00000021185. Ensembl, May 2017.
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Entrez Gene: C14orf159 chromosome 14 open reading frame 159".
  6. BLAST. NCBI. Accessed 19 April 2010. http://blast.ncbi.nlm.nih.gov/Blast.cgi
  7. UCSC Genome Browser website, BLAT. Accessed 10 April 2010.
  8. BLAST. NCBI. Accessed 19 April 2010.
  9. Blastp. NCBI. http://blast.ncbi.nlm.nih.gov/Blast.cgi
  10. Mehrle A, Rosenfelder H. "RZPD CloneID DKFZp686J0759". LifeDB: Database for Localization, Interaction, Functional assays and Expression of Proteins. German Cancer Research Center.
  11. Wiemann S, Arlt D, Huber W, Wellenreuther R, Schleeger S, Mehrle A, Bechtel S, Sauermann M, Korf U, Pepperkok R, Sültmann H, Poustka A (October 2004). "From ORFeome to Biology: A Functional Genomics Pipeline". Genome Res. 14 (10B): 2136–44. doi:10.1101/gr.2576704. PMC 528930. PMID 15489336.
  12. Gupta R, Brunak S (2002). "Prediction of glycosylation across the human proteome and the correlation to protein function". Pacific Symposium on Biocomputing. 7: 310–322. http://www.cbs.dtu.dk/services/YinOYang/
  13. Emanuelsson O, Brunak S, von Heijne G, Nielsen H (2007). "Locating proteins in the cell using TargetP, SignalP, and related tools". Nature Protocols. 2: 953–971. http://www.cbs.dtu.dk/services/SignalP/
  14. Gupta R, Jung E, Gooley AA, Williams KL, Brunak S, Hansen J (1999). "Scanning the available Dictyostelium discoideum proteome for O-linked GlcNAc glycosylation sites using neural networks". Glycobiology. 9 (10): 1009–22. http://www.cbs.dtu.dk/services/DictyOGlyc/
  15. Johansen MB, Kiemer L, Brunak S (2006). "Analysis and prediction of mammalian protein glycation". Glycobiology. 16: 844–853. http://www.cbs.dtu.dk/services/NetGlycate/
  16. Sulfinator. Expasy tools. 2010. http://expasy.org/tools/sulfinator/
  17. Creekmore AL, Ziegler YS, Bonéy JL, Nardulli AM (March 2007). "Estrogen receptor α regulates expression of the breast cancer 1 associated ring domain 1 (BARD1) gene through intronic DNA sequence". Mol. Cell. Endocrinol. 267 (1–2): 106–15. doi:10.1016/j.mce.2007.01.001. PMC 1933484. PMID 17275994.
