DOP1B

Last updated
DOP1B
Identifiers
Aliases DOP1B , 21orf5, C21orf5, dopey family member 2, DOPEY2, DOP1 leucine zipper like protein B
External IDs OMIM: 604803 MGI: 1917278 HomoloGene: 21068 GeneCards: DOP1B
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_005128
NM_001320714

NM_026700
NM_027293
NM_001357118

RefSeq (protein)

NP_001307643
NP_005119

NP_080976
NP_081569
NP_001344047

Location (UCSC) Chr 21: 36.16 – 36.29 Mb Chr 16: 93.51 – 93.61 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

DOP1B is a human gene located just above the Down Syndrome chromosomal region (DSCR) located at 21p22.2 sub-band. [5] [6] [7] Although the exact function of this gene is not yet fully understood, it has been proven to play a role in multiple biological processes, and its over-expression (triplication) has been linked to multiple facets of the Down Syndrome phenotype, most notably mental retardation. [6]

Contents

Gene

The DOP1B gene is located on human chromosome 21, at chromosome band 21q22.12. [5] This band is located in open reading frame 5, hence the alias C21orf5. DOP1B gene is composed of 137,493 bases making up 37 exons and 39 distinct gt-ag introns, all located between CBR3 and KIAA0136 genes. [6] [8]

Transcription produces 10 unique mRNAs, 8 alternatively spliced variants, and 2 unspliced forms. [8] These unique mRNAs differ by varying truncation of the 3' and 5' ends, as well as the presence of 3 cassette exons. [8] These mRNA variants range from 7691bp (mRNA variant DOPEY2.aAug10) to 315bp (mRNA variant DOPEY2.jAug10-unspliced) and are further described in Table 1 below. [9]

The mRNA expressed and levels of expression differ based on the location and tissue type in the body, but overall has been found to be expressed ubiquitously. [6] The highest expression has been found in differentiating, rather than proliferating, tissue zones. [5] Transcript was identified with the highest confidence in the erythroleukemia, placental cells and overall in the brain, and at a medium confidence level in the perirhinal cortex, medial temporal lobe, colon, as well in the salivary and adrenal glands. [8]

Protein

Of the ten mRNAs produced, six of them are translated into viable proteins. Please see table above for more details. [9] The largest, having a molecular weight of 258230 Da, and highest expressed protein, DOP1B.a, is composed of 2298 amino acids that make up an N-terminal domain, seven transmembrane domains, and a C-terminal coiled coil stretch that forms a leucine-like zipper domain. [8] Like other leucine zippers domains, DOP1B's C-terminal is hypothesized to be involved in multiple protein-protein and transcription factor interactions. [6] This indicates that DOP1B might act as a transcription co-activator; however, further research must be done to fully understand the precise physiological function. [6]

Protein Interactions

Very little work has been done on understanding the intricacies of the protein interactions; however, STRING has identified direct links with three proteins: MON2, TRIP12, and HECTD1. [10] DOP1B is also indirectly associated with the following proteins: ARL16, ATP9A, ARL1, ATP9B, UBE3A, HERC5, HERC4, HACE1, UBE3C, and UBR5. [10] See Figure 2 for interactions.

Figure 2. Overview of direct protein interactions that have been experimentally determined. These interactions and relationships include gene fusions, co-occurrences, co-expressions, and homologies. DOPEY2 protein.interactions.jpg
Figure 2. Overview of direct protein interactions that have been experimentally determined. These interactions and relationships include gene fusions, co-occurrences, co-expressions, and homologies.

Homology

Phylogenesis suggest that DOP1B can be traced back to a common ancestor of animals and fungi due to its highly conserved C-terminal domain DOP1B has 84 known orthologs and 158 speciation nodes in the gene tree. [11] The most similar orthologs being in the chimpanzee (Pan troglodytes), dog (Canis familiaris), cow (Bos Taurus), as well as the rat and mouse (Rattus norvegicus and Mus musculus). [11]

The only known paralog is DOPEY1. [11]

Sub-cellular localization

Gene Ontology (GO) has traced the DOP1B protein to 5 main areas: the Golgi membrane, the trans-Golgi network, cytosol, and extracellular endosome. [8] COMPARTMENTS localization data places the highest confidence of localization to the extracellular exosome and the Golgi membrane. [12]

Figure 1: Description of mRNA and Protein Variants: [9]

mRNA

Variant

Spliced mRNA LengthProtein Length5' UTR3' UTRUnspliced pre-mRNA LengthNumber of ExonsTissue-mRNA Expression (no strict specificity implied)
aAug107691 bp2298 aa85 bp709 bp129746 bp37ubiquitous
bAug102173 bp332aa1174 bp15610 bp6 carcinoid, lung, colon, colon tumor, RER+
cAug10742 bp222 aa74 bp49789 bp6breast, t-lymphocytes
dAug10623 bp145 aa188 bp43664 bp4lung
eAug10345 bp114 aa9925 bp3spleen
fAug10571 bp110 aa241 bp1322 bp2thalamus
gAug10549 bpnon-coding11 bp499 bp794 bp2spleen
hAug- unspliced543 bpnon-coding377 bp543 bp1stomach
iAug10514 bpnon-coding205 bp204 bp7377 bp2 thyroid gland
jAug10-

unspliced

315 bpnon-coding165 bp315 bp1marrow

Function

As mentioned previously, the specific function and however, its function can be largely inferred through the study of similar genes. DOP1B has been found to be involved in the following processes: multicellular organism development in cell differentiation and developmental patterning, cognition, as well as endoplasmic reticulum organization and Golgi to endosome transport. [5] [6] [13] [14]

Cell differentiation and patterning

The DOP1B ortholog, pad-1, in C. elegans, was found to have a role in cell differentiation and patterning. In an experiment where the pad-1 was silenced using RNA-mediated interference, the phenotype of the injected worm's offspring was fetal lethality. [5] The reason being: most of the embryonic tissues did not undergo appropriate cell patterning during gastrulation. [5] Abnormally positioned cells lead to misinformation of organs; the failed morphogenesis of embryo. [5] A similar observation was made in the inactivation of the Dop1 gene, the DOP1B ortholog, in S. cerevisiae . [6] The inactivation lead to abnormal cell positioning and subsequent death. Overexpression of the N-terminal in S. cerevisiae also resulted in a loss of proper growth polarity and abnormal asexual reproductive patterning. [6] This function was further supported by the function of the ortholog DopA in A. nidulans , which similarly codes for a 207kDa protein that also contains leucine zipper-like domains. [15] Its inactivation revealed its role directing alternations in cell division timing, growth polarity, as well as cell-specific gene expression, ultimately affecting organogenesis and cell differentiation. [15]

Endoplasmic reticulum and golgi transport

Dop1, an ortholog of DOP1B, in S. cerevisiae was found to play an essential role in membrane organization. [13] It was found that it forms a complex with another protein, Mon2, which recruits the pool of Dop1 from the Golgi. [13] In a Mon2 knockout model, Dop1 was mislocalized, and in turn resulted in defective cycling between endosomes and the Golgi. [13] In a Dop1 knockout model, severe defects in the endoplasmic reticulum organization. [13] This Dop1 and Mon2 complex was also linked to traffic in the enocytic pathway. [13]

Clinical significance

Cognition

DOP1B has been identified as a CNV region in Alzheimer's disease subjects, and its triplication has been tied to various phenotypic aspects of Down Syndrome. [14]

Down syndrome

DOP1B has been associated with the Down Syndrome phenotype. [6] When DOP1B was overexpressed in mice, abnormal lamination patterns of cortical cells was observed, as well as altered cortical, hippocampal, and cerebellar cells, regions that play key roles in memory and learning. [6] These changes are similar to those observed in Down Syndrome patients. [6] It is because of this that C21orf15 is now being studied as a new candidate gene for the intellectual disability phenotype in Down Syndrome. [6]

See also

Related Research Articles

<span class="mw-page-title-main">Chromosome 21</span> Human chromosome

Chromosome 21 is one of the 23 pairs of chromosomes in humans. Chromosome 21 is both the smallest human autosome and chromosome, with 45 million base pairs representing about 1.5 percent of the total DNA in cells. Most people have two copies of chromosome 21, while those with three copies of chromosome 21 have Down syndrome, also called "trisomy 21".

<span class="mw-page-title-main">Chromosome 7</span> Human chromosome

Chromosome 7 is one of the 23 pairs of chromosomes in humans, who normally have two copies of this chromosome. Chromosome 7 spans about 160 million base pairs and represents between 5 and 5.5 percent of the total DNA in cells.

<span class="mw-page-title-main">Sterol regulatory element-binding protein 2</span> Protein-coding gene in the species Homo sapiens

Sterol regulatory element-binding protein 2 (SREBP-2) also known as sterol regulatory element binding transcription factor 2 (SREBF2) is a protein that in humans is encoded by the SREBF2 gene.

<span class="mw-page-title-main">GABPA</span> Protein-coding gene in the species Homo sapiens

GA-binding protein alpha chain is a protein that in humans is encoded by the GABPA gene.

<span class="mw-page-title-main">DYRK1A</span> Protein-coding gene in the species Homo sapiens

Dual specificity tyrosine-phosphorylation-regulated kinase 1A is an enzyme that in humans is encoded by the DYRK1A gene. Alternative splicing of this gene generates several transcript variants differing from each other either in the 5' UTR or in the 3' coding region. These variants encode for at least five different isoforms.

<span class="mw-page-title-main">KCNJ6</span> Protein-coding gene in the species Homo sapiens

G protein-activated inward rectifier potassium channel 2 is a protein that in humans is encoded by the KCNJ6 gene. Mutation in KCNJ6 gene has been proposed to be the cause of Keppen-Lubinsky Syndrome (KPLBS).

<span class="mw-page-title-main">BACH1</span> Protein-coding gene in the species Homo sapiens

Transcription regulator protein BACH1 is a protein that in humans is encoded by the BACH1 gene.

<span class="mw-page-title-main">SIM2</span> Protein-coding gene in the species Homo sapiens

Single-minded homolog 2 is a protein that in humans is encoded by the SIM2 gene. It plays a major role in the development of the central nervous system midline as well as the construction of the face and head.

<span class="mw-page-title-main">CHAF1B</span> Protein-coding gene in the species Homo sapiens

Chromatin assembly factor 1 subunit B is a protein that in humans is encoded by the CHAF1B gene.

<span class="mw-page-title-main">Nucleobindin 1</span> Protein-coding gene in the species Homo sapiens

Nucleobindin-1 (NUCB1), also known as calnuc, is a protein that in humans is encoded by the NUCB1 gene.

<span class="mw-page-title-main">PCNT</span> Protein-coding gene in the species Homo sapiens

Pericentrin (kendrin), also known as PCNT and pericentrin-B (PCNTB), is a protein which in humans is encoded by the PCNT gene on chromosome 21. This protein localizes to the centrosome and recruits proteins to the pericentriolar matrix (PCM) to ensure proper centrosome and mitotic spindle formation, and thus, uninterrupted cell cycle progression. This gene is implicated in many diseases and disorders, including congenital disorders such as microcephalic osteodysplastic primordial dwarfism type II (MOPDII) and Seckel syndrome.

<span class="mw-page-title-main">SIM1</span> Genetic protein

Single-minded homolog 1, also known as class E basic helix-loop-helix protein 14 (bHLHe14), is a protein that in humans is encoded by the SIM1 gene.

<span class="mw-page-title-main">BLZF1</span> Protein-coding gene in the species Homo sapiens

Golgin-45 is a protein that in humans is encoded by the BLZF1 gene.

<span class="mw-page-title-main">Carbohydrate-responsive element-binding protein</span> Protein-coding gene in the species Homo sapiens

Carbohydrate-responsive element-binding protein (ChREBP) also known as MLX-interacting protein-like (MLXIPL) is a protein that in humans is encoded by the MLXIPL gene. The protein name derives from the protein's interaction with carbohydrate response element sequences of DNA.

<span class="mw-page-title-main">TCP10L</span> Protein-coding gene in the species Homo sapiens

T-complex protein 10A homolog 2 is a protein that in humans is encoded by the TCP10L gene. It is located next to CFAP298.

<span class="mw-page-title-main">RWDD2B</span> Protein-coding gene in the species Homo sapiens

RWD domain-containing protein 2B is a protein that in humans is encoded by the RWDD2B gene.

<span class="mw-page-title-main">LZTR1</span> Protein-coding gene in the species Homo sapiens

Leucine-zipper-like transcriptional regulator 1 is a protein that in humans is encoded by the LZTR1 gene.

<span class="mw-page-title-main">C7orf26</span> Human protein-encoding gene on chromosome 7

c7orf26 is a gene in humans that encodes a protein known as c7orf26. Based on properties of c7orf26 and its conservation over a long period of time, its suggested function is targeted for the cytoplasm and it is predicted to play a role in regulating transcription.

<span class="mw-page-title-main">CCDC188</span> Protein found in humans

CCDC188 or coiled-coil domain containing protein is a protein that in humans is encoded by the CCDC188 gene.

<span class="mw-page-title-main">TEDDM1</span> Protein-coding gene in the species Homo sapiens

Transmembrane epididymal protein 1 is a transmembrane protein encoded by the TEDDM1 gene. TEDDM1 is also commonly known as TMEM45C and encodes 273 amino acids that contains six alpha-helix transmembrane regions. The protein contains a 118 amino acid length family of unknown function. While the exact function of TEDDM1 is not understood, it is predicted to be an integral component of the plasma membrane.


References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000142197 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000022946 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 4 5 6 7 Guipponi M, Brunschwig K, Chamoun Z, Scott HS, Shibuya K, Kudoh J, Delezoide AL, El Samadi S, Chettouh Z, Rossier C, Shimizu N, Mueller F, Delabar JM, Antonarakis SE (August 2000). "C21orf5, a novel human chromosome 21 gene, has a Caenorhabditis elegans ortholog (pad-1) required for embryonic patterning". Genomics. 68 (1): 30–40. doi:10.1006/geno.2000.6250. PMID   10950924.
  6. 1 2 3 4 5 6 7 8 9 10 11 12 13 Rachidi M, Lopes C, Costantine M, Delabar JM (2005-01-01). "C21orf5, a new member of Dopey family involved in morphogenesis, could participate in neurological alterations and mental retardation in Down syndrome". DNA Research. 12 (3): 203–10. doi: 10.1093/dnares/dsi004 . PMID   16303751.
  7. Shapiro BL (1999). "The Down Syndrome critical region". The Molecular Biology of Down Syndrome. Vol. 57. pp. 41–60. doi:10.1007/978-3-7091-6380-1_3. ISBN   978-3-211-83377-3. PMID   10666667.{{cite book}}: |journal= ignored (help)
  8. 1 2 3 4 5 6 Consortium GO. "AmiGO 2: Term Details for "molecular_function" (GO:0003674)". amigo.geneontology.org. Retrieved 2017-11-29.
  9. 1 2 3 Thierry-Mieg D, Thierry-Mieg J. "AceView: Gene:DOPEY2, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. NCBI/NLM/NIH. Retrieved 2017-11-30.
  10. 1 2 3 "26 items (Homo sapiens) - STRING network view". string-db.org. Retrieved 2017-11-29.
  11. 1 2 3 "Gene: DOPEY2 (ENSG00000142197) - Gene tree - Homo sapiens - Ensembl genome browser 90". www.ensembl.org. Retrieved 2017-11-29.
  12. "COMPARTMENTS - DOPEY2". compartments.jensenlab.org. Retrieved 2017-11-30.
  13. 1 2 3 4 5 6 Gillingham AK, Whyte JR, Panic B, Munro S (January 2006). "Mon2, a relative of large Arf exchange factors, recruits Dop1 to the Golgi apparatus". The Journal of Biological Chemistry. 281 (4): 2273–80. doi: 10.1074/jbc.M510176200 . PMID   16301316.
  14. 1 2 Swaminathan S, Huentelman MJ, Corneveaux JJ, Myers AJ, Faber KM, Foroud T, Mayeux R, Shen L, Kim S, Turk M, Hardy J, Reiman EM, Saykin AJ (2012). "Analysis of copy number variation in Alzheimer's disease in a cohort of clinically characterized and neuropathologically verified individuals". PLOS ONE. 7 (12): e50640. Bibcode:2012PLoSO...750640S. doi: 10.1371/journal.pone.0050640 . PMC   3515604 . PMID   23227193.
  15. 1 2 Pascon RC, Miller BL (June 2000). "Morphogenesis in Aspergillus nidulans requires Dopey (DopA), a member of a novel family of leucine zipper-like proteins conserved from yeast to humans". Molecular Microbiology. 36 (6): 1250–64. doi: 10.1046/j.1365-2958.2000.01950.x . PMID   10931277. S2CID   27402390.

Further reading