DOP1B | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | DOP1B , 21orf5, C21orf5, dopey family member 2, DOPEY2, DOP1 leucine zipper like protein B | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 604803 MGI: 1917278 HomoloGene: 21068 GeneCards: DOP1B | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
DOP1B is a human gene located just above the Down Syndrome chromosomal region (DSCR) located at 21p22.2 sub-band. [5] [6] [7] Although the exact function of this gene is not yet fully understood, it has been proven to play a role in multiple biological processes, and its over-expression (triplication) has been linked to multiple facets of the Down Syndrome phenotype, most notably mental retardation. [6]
The DOP1B gene is located on human chromosome 21, at chromosome band 21q22.12. [5] This band is located in open reading frame 5, hence the alias C21orf5. DOP1B gene is composed of 137,493 bases making up 37 exons and 39 distinct gt-ag introns, all located between CBR3 and KIAA0136 genes. [6] [8]
Transcription produces 10 unique mRNAs, 8 alternatively spliced variants, and 2 unspliced forms. [8] These unique mRNAs differ by varying truncation of the 3' and 5' ends, as well as the presence of 3 cassette exons. [8] These mRNA variants range from 7691bp (mRNA variant DOPEY2.aAug10) to 315bp (mRNA variant DOPEY2.jAug10-unspliced) and are further described in Table 1 below. [9]
The mRNA expressed and levels of expression differ based on the location and tissue type in the body, but overall has been found to be expressed ubiquitously. [6] The highest expression has been found in differentiating, rather than proliferating, tissue zones. [5] Transcript was identified with the highest confidence in the erythroleukemia, placental cells and overall in the brain, and at a medium confidence level in the perirhinal cortex, medial temporal lobe, colon, as well in the salivary and adrenal glands. [8]
Of the ten mRNAs produced, six of them are translated into viable proteins. Please see table above for more details. [9] The largest, having a molecular weight of 258230 Da, and highest expressed protein, DOP1B.a, is composed of 2298 amino acids that make up an N-terminal domain, seven transmembrane domains, and a C-terminal coiled coil stretch that forms a leucine-like zipper domain. [8] Like other leucine zippers domains, DOP1B's C-terminal is hypothesized to be involved in multiple protein-protein and transcription factor interactions. [6] This indicates that DOP1B might act as a transcription co-activator; however, further research must be done to fully understand the precise physiological function. [6]
Very little work has been done on understanding the intricacies of the protein interactions; however, STRING has identified direct links with three proteins: MON2, TRIP12, and HECTD1. [10] DOP1B is also indirectly associated with the following proteins: ARL16, ATP9A, ARL1, ATP9B, UBE3A, HERC5, HERC4, HACE1, UBE3C, and UBR5. [10] See Figure 2 for interactions.
Phylogenesis suggest that DOP1B can be traced back to a common ancestor of animals and fungi due to its highly conserved C-terminal domain DOP1B has 84 known orthologs and 158 speciation nodes in the gene tree. [11] The most similar orthologs being in the chimpanzee (Pan troglodytes), dog (Canis familiaris), cow (Bos Taurus), as well as the rat and mouse (Rattus norvegicus and Mus musculus). [11]
Gene Ontology (GO) has traced the DOP1B protein to 5 main areas: the Golgi membrane, the trans-Golgi network, cytosol, and extracellular endosome. [8] COMPARTMENTS localization data places the highest confidence of localization to the extracellular exosome and the Golgi membrane. [12]
Figure 1: Description of mRNA and Protein Variants: [9]
mRNA Variant | Spliced mRNA Length | Protein Length | 5' UTR | 3' UTR | Unspliced pre-mRNA Length | Number of Exons | Tissue-mRNA Expression (no strict specificity implied) |
---|---|---|---|---|---|---|---|
aAug10 | 7691 bp | 2298 aa | 85 bp | 709 bp | 129746 bp | 37 | ubiquitous |
bAug10 | 2173 bp | 332aa | 1174 bp | 15610 bp | 6 | carcinoid, lung, colon, colon tumor, RER+ | |
cAug10 | 742 bp | 222 aa | 74 bp | 49789 bp | 6 | breast, t-lymphocytes | |
dAug10 | 623 bp | 145 aa | 188 bp | 43664 bp | 4 | lung | |
eAug10 | 345 bp | 114 aa | 9925 bp | 3 | spleen | ||
fAug10 | 571 bp | 110 aa | 241 bp | 1322 bp | 2 | thalamus | |
gAug10 | 549 bp | non-coding | 11 bp | 499 bp | 794 bp | 2 | spleen |
hAug- unspliced | 543 bp | non-coding | 377 bp | 543 bp | 1 | stomach | |
iAug10 | 514 bp | non-coding | 205 bp | 204 bp | 7377 bp | 2 | thyroid gland |
jAug10- unspliced | 315 bp | non-coding | 165 bp | 315 bp | 1 | marrow |
As mentioned previously, the specific function and however, its function can be largely inferred through the study of similar genes. DOP1B has been found to be involved in the following processes: multicellular organism development in cell differentiation and developmental patterning, cognition, as well as endoplasmic reticulum organization and Golgi to endosome transport. [5] [6] [13] [14]
The DOP1B ortholog, pad-1, in C. elegans, was found to have a role in cell differentiation and patterning. In an experiment where the pad-1 was silenced using RNA-mediated interference, the phenotype of the injected worm's offspring was fetal lethality. [5] The reason being: most of the embryonic tissues did not undergo appropriate cell patterning during gastrulation. [5] Abnormally positioned cells lead to misinformation of organs; the failed morphogenesis of embryo. [5] A similar observation was made in the inactivation of the Dop1 gene, the DOP1B ortholog, in S. cerevisiae . [6] The inactivation lead to abnormal cell positioning and subsequent death. Overexpression of the N-terminal in S. cerevisiae also resulted in a loss of proper growth polarity and abnormal asexual reproductive patterning. [6] This function was further supported by the function of the ortholog DopA in A. nidulans , which similarly codes for a 207kDa protein that also contains leucine zipper-like domains. [15] Its inactivation revealed its role directing alternations in cell division timing, growth polarity, as well as cell-specific gene expression, ultimately affecting organogenesis and cell differentiation. [15]
Dop1, an ortholog of DOP1B, in S. cerevisiae was found to play an essential role in membrane organization. [13] It was found that it forms a complex with another protein, Mon2, which recruits the pool of Dop1 from the Golgi. [13] In a Mon2 knockout model, Dop1 was mislocalized, and in turn resulted in defective cycling between endosomes and the Golgi. [13] In a Dop1 knockout model, severe defects in the endoplasmic reticulum organization. [13] This Dop1 and Mon2 complex was also linked to traffic in the enocytic pathway. [13]
DOP1B has been identified as a CNV region in Alzheimer's disease subjects, and its triplication has been tied to various phenotypic aspects of Down Syndrome. [14]
DOP1B has been associated with the Down Syndrome phenotype. [6] When DOP1B was overexpressed in mice, abnormal lamination patterns of cortical cells was observed, as well as altered cortical, hippocampal, and cerebellar cells, regions that play key roles in memory and learning. [6] These changes are similar to those observed in Down Syndrome patients. [6] It is because of this that C21orf15 is now being studied as a new candidate gene for the intellectual disability phenotype in Down Syndrome. [6]
Chromosome 21 is one of the 23 pairs of chromosomes in humans. Chromosome 21 is both the smallest human autosome and chromosome, with 45 million base pairs representing about 1.5 percent of the total DNA in cells. Most people have two copies of chromosome 21, while those with three copies of chromosome 21 have Down syndrome, also called "trisomy 21".
Chromosome 7 is one of the 23 pairs of chromosomes in humans, who normally have two copies of this chromosome. Chromosome 7 spans about 160 million base pairs and represents between 5 and 5.5 percent of the total DNA in cells.
Sterol regulatory element-binding protein 2 (SREBP-2) also known as sterol regulatory element binding transcription factor 2 (SREBF2) is a protein that in humans is encoded by the SREBF2 gene.
GA-binding protein alpha chain is a protein that in humans is encoded by the GABPA gene.
Dual specificity tyrosine-phosphorylation-regulated kinase 1A is an enzyme that in humans is encoded by the DYRK1A gene. Alternative splicing of this gene generates several transcript variants differing from each other either in the 5' UTR or in the 3' coding region. These variants encode for at least five different isoforms.
G protein-activated inward rectifier potassium channel 2 is a protein that in humans is encoded by the KCNJ6 gene. Mutation in KCNJ6 gene has been proposed to be the cause of Keppen-Lubinsky Syndrome (KPLBS).
Transcription regulator protein BACH1 is a protein that in humans is encoded by the BACH1 gene.
Single-minded homolog 2 is a protein that in humans is encoded by the SIM2 gene. It plays a major role in the development of the central nervous system midline as well as the construction of the face and head.
Chromatin assembly factor 1 subunit B is a protein that in humans is encoded by the CHAF1B gene.
Nucleobindin-1 (NUCB1), also known as calnuc, is a protein that in humans is encoded by the NUCB1 gene.
Pericentrin (kendrin), also known as PCNT and pericentrin-B (PCNTB), is a protein which in humans is encoded by the PCNT gene on chromosome 21. This protein localizes to the centrosome and recruits proteins to the pericentriolar matrix (PCM) to ensure proper centrosome and mitotic spindle formation, and thus, uninterrupted cell cycle progression. This gene is implicated in many diseases and disorders, including congenital disorders such as microcephalic osteodysplastic primordial dwarfism type II (MOPDII) and Seckel syndrome.
Single-minded homolog 1, also known as class E basic helix-loop-helix protein 14 (bHLHe14), is a protein that in humans is encoded by the SIM1 gene.
Golgin-45 is a protein that in humans is encoded by the BLZF1 gene.
Carbohydrate-responsive element-binding protein (ChREBP) also known as MLX-interacting protein-like (MLXIPL) is a protein that in humans is encoded by the MLXIPL gene. The protein name derives from the protein's interaction with carbohydrate response element sequences of DNA.
T-complex protein 10A homolog 2 is a protein that in humans is encoded by the TCP10L gene. It is located next to CFAP298.
RWD domain-containing protein 2B is a protein that in humans is encoded by the RWDD2B gene.
Leucine-zipper-like transcriptional regulator 1 is a protein that in humans is encoded by the LZTR1 gene.
c7orf26 is a gene in humans that encodes a protein known as c7orf26. Based on properties of c7orf26 and its conservation over a long period of time, its suggested function is targeted for the cytoplasm and it is predicted to play a role in regulating transcription.
CCDC188 or coiled-coil domain containing protein is a protein that in humans is encoded by the CCDC188 gene.
Transmembrane epididymal protein 1 is a transmembrane protein encoded by the TEDDM1 gene. TEDDM1 is also commonly known as TMEM45C and encodes 273 amino acids that contains six alpha-helix transmembrane regions. The protein contains a 118 amino acid length family of unknown function. While the exact function of TEDDM1 is not understood, it is predicted to be an integral component of the plasma membrane.
{{cite book}}
: |journal=
ignored (help)