ZC3H12B

Last updated
ZC3H12B
Identifiers
Aliases ZC3H12B , CXorf32, MCPIP2, zinc finger CCCH-type containing 12B
External IDs OMIM: 300889 MGI: 2442133 HomoloGene: 19395 GeneCards: ZC3H12B
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001010888

NM_001034907

RefSeq (protein)

NP_001010888

n/a

Location (UCSC) Chr X: 65.37 – 65.51 Mb Chr X: 94.76 – 94.98 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

ZC3H12B, also known as CXorf32 or MCPIP2, is a protein encoded by gene ZC3H12B located on chromosome Xq12 in humans.

Contents

Gene

The ZC3H12B gene is composed of 19,709 base pairs (bp) and contains 5 exons. It is located on the X chromosome at q12 on the plus strand.

ZC3H12B contains a ribonuclease domain, as well as a CCCH-type zinc finger domain. Ribonucleases (RNases) degrade RNA and are involved in the RNA maturation process. They are also a line of defense against viral RNA (D'Alessio and Riordan 1997). CCCH-type zinc fingers are associated with mRNA destabilization. CCCH-type zinc fingers have been shown to turn over mRNA without the removal of the PolyA tail (Lai and Blackshear 2001). ZC3H12B and its paralogs ZC3H12A, ZC3H12C and ZC3H12D all contain CCCH-type zinc finger domains, which have been associated with cell cycle and growth phase transitions in eukaryotes (InterPro).

Promoter

Genomatix ElDorado program predicted a 601 bp promoter upstream of the ZC3H12B gene with multiple transcription factor binding sites including nuclear factor of activated T-cells and ribonucleoprotein associated zinc finger protein MOK-2 (also known as ZNF239).

mRNA

ZC3H12B contains 7273 bp mRNA. There is only one predicted transcript by Aceview. No folding patterns have been predicted (Mfold). There are for introns excised from ZC3H12B.

Protein

ZC3H12B is a probable ribonuclease containing CCCH-type zinc finger domain and ribonuclease domains. The 836 amino acid protein has a predicted molecular weight of 94.2 kdal. It does not contain a signal peptide or a transmembrane region. PSORTII predicted 65.2% probability of nuclear location. CCCH-type zinc fingers and ribonucleases are presumably located in the nucleus for RNA cleaving and specifically, RNA hairpin cleaving (Boysen and Hearn 2008).

Structural characteristics

The protein secondary structure is a mixture of alpha helices and beta strands. The two domains identified so far are the ribonuclease and CCCH-type zinc finger domains.

Shown below is a conserved domain of ZC3H12B paralog, Mcpip1 (or ZC3H12A). In a BLAST structure comparison, there was an 82% identity match with 24% query coverage, with a predicted e-value of 2e-118. 82% identity match is enough to make comparisons of ZC3H12B and Mcpip1 (ZC3H12A) zinc finger conserved domain, which are both predicted to be composed of beta strands and alpha helices.

Post-translational modification

Phobius program predicted non-cytoplasmic protein location. NetPhos 2.0. predicted 63 phosphorylation sites in ZC3H12B, which are marked on the conceptual translation. YinOYang1.2. predicted three 0-Beta-GlcNAc attachment sites, which are competing with phosphorylation sites. 0-Beta-GlcNAc is presumably the only type of glycosylation occurring in the nucleus and/or cytoplasm of cells. There is a notable link between antigen activation by lymphocytes and dynamic 0-B-Glycosylation in nuclear proteins (Hart and Akimoto). NetNGlyc predicted glycosylation sites; however, these sites were excluded because the protein is likely nuclear and would not undergo this form of glycosylation. There were no predicted acetylation sites at the N-terminus of the protein. This is unusual because approximately 85% of human proteins are acetylated at the N terminus for synthesis, stabilization and localization of proteins (Van Damm et al.). No positive, negative or mixed charge clusters present. No hydrophobic segments detected (SAPS SDSC Biology Workbench). MitoProtII did not detect any mitochondria export signals. These post-translational tests suggest the protein is located in the nucleus and undergoes dynamic phosphorylation and 0-Beta-GlcNAc modifications.

Evolution

Select domains of ZC3H12B are conserved in most vertebrates, arthropods and annelids. There are not conserved domains in domains bacteria or archaea. There were not significantly conserved domains in yeasts, plants or protists.

Paralogs

There are three paralogs of ZC3H12B which are in the same CCCH-type zinc finger family, all which maintain greater than 50% identity to ZC3H12B based on BLAST analysis (NCBI).

NameSpeciesNCBI accession numberLength (AA)Protein identity
ZC3H12BHomo sapiens NM_001010888.3 836aa100%
ZC3H12AHomo sapiens NM_025079.2 599aa68%
ZC3H12CHomo sapiens NM_033390.1 883aa53%
ZC3H12DHomo sapiens NM_207360.2 527aa61%

Orthologs

ZC3H12B is conserved in mammals, birds, insects and nematodes (BLAST). See the table below for the summary of orthologs of ZC3H12B in humans.

SpeciesSpecies common nameDivergence (MYA)NCBI accession number (protein)Length (amino acids)Protein identitySimilarity
Homo sapiensHumann/a NP_001010888.3 836aa100%100%
Pan paniscusChimpanzee6.3 XP_003816967.1 836aa99%99%
Pongo abeliiOrangutan15.7 XP_002831786.1 836aa99%99%
Macaca mulattaRhesus monkey29 XP_002806307.1 836aa99%99%
Callithrix jacchusMarmoset42.6 XP_002762992.2 836aa98%98%
Mus musculusMouse92.3 NP_001030079.2 835aa91%94%
Sus scrofaPig94.2 XP_003360389.1 836aa93%96%
Gallus gallusChicken296 XP_003641177.1 837aa77%85%
Chrysemys picta belliiPainted turtle296 XP_005279572.1 838aa78%86%
Oryzias latipesMedaka400.1 XP_004076599.1 845aa67%77%
Gadus morhuaAtlantic cod400.1 AFK76491.1 842aa29%44%
Danio rerioZebrafish400.1 XP_001342172.3 982aa68%77%
Petromyzon marinusLamprey535.7 ABO21295.1 222aa44%58%
Branchiostoma floridaeLancelet713.2 XP_002598834.1 492aa66%79%
Ciona intestinalisVase tunicate722.5 XP_002125834.1 863aa54%66%
Strongylocentrotus purpuratusPurple sea urchin742.9 XP_787030.3 974aa58%72%
Aplysiomorpha californicaSea hare782.7 XP_005113312.1 1269aa51%69%
Drosophila grimshawiHawaii fruit fly782.7 XP_001994140.1 548aa51%69%
Anopheles gambiaeMosquito782.7 XP_321880 637aa59%75%
Apis mellifera Honey bee782.7 XP_397264 652aa58%73%
Caenorhabditis elegansRound worm (nematode)937.5 NP_491985.4 634aa46%64%

Expression and function

Microarrays in normal tissue expression profiling showed increased expression of the gene in the pancreas, prostate, brain, spinal cord and thymus (GEO). Unlike its paralogs, it is not expressed in macrophage-activated tissues, which indicates the paralogous relationship to the inflammatory response (Liang et al. 2008). ZC3H12B is expressed transiently in brain, thymus and testis tissues (EST).

Interaction

Predicted interactions by Ingenuity Systems showed no drug targeting molecules in pathway and no known drug targets. The listed top functions and diseases were cancer, organismal injury and abnormalities, reproductive system disease. Several miRNA interactions were predicted. The predicted miRNA targets have yet to be matched to the ZC3H12B sequence and it is unclear whether there is an interaction between the two. Tests such as Forster Resonance Energy Transfer (FRET), co-immunoprecipitation, two-hybrid screening, hydropathic complementarity, cluster-microarray and ChiP could be used in the future to test for new protein/chromatin interactions with ZC3H12B.

Clinical significance

Deletions of the Xq12 locus has resulted in several disorders such as androgen insensitivity, susceptibility to prostate cancer, spinal and bulbar muscular atrophy of Kennedy and mental retardation; however, no link has been found between these diseases and ZC3H12B (NCBI).

Related Research Articles

<span class="mw-page-title-main">KIAA1109</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein KIAA1109 is a protein that in humans is encoded by the KIAA1109 gene.

<span class="mw-page-title-main">RNF128</span> Protein-coding gene in the species Homo sapiens

E3 ubiquitin-protein ligase RNF128 is an enzyme that in humans is encoded by the RNF128 gene.

<span class="mw-page-title-main">QRICH1</span> Protein-coding gene in the species Homo sapiens

QRICH1, also known as Glutamine-rich protein 1, is a protein that in humans is encoded by the QRICH1 gene. One notable feature of this protein is that it contains a Caspase Activation Recruitment Domain, also known as a CARD domain. As a result of having this domain, QRICH1 is believed to be involved in apoptotic, inflammatory, and host-immune response pathways.

<span class="mw-page-title-main">Protein FAM46B</span> Protein-coding gene in the species Homo sapiens

Protein FAM46B also known as family with sequence similarity 46 member B is a protein that in humans is encoded by the FAM46B gene. FAM46B contains one protein domain of unknown function, DUF1693. Yeast two-hybrid screening has identified three proteins that physically interact with FAM46B. These are ATX1, PEPP2 and DAZAP2.

<span class="mw-page-title-main">FAM214A</span> Protein-coding gene in the species Homo sapiens

Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.

<span class="mw-page-title-main">CCDC94</span> Protein found in humans

Coiled-coil domain containing 94 (CCDC94) is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.

<span class="mw-page-title-main">EVI5L</span> Protein-coding gene in the species Homo sapiens

EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.

<span class="mw-page-title-main">Transmembrane protein 255A</span> Mammalian protein found in Homo sapiens

Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.

Zinc Finger Protein 800 or ZNF800 is a protein that in humans is encoded by the ZNF800 gene. The specific function of ZNF800 is not yet well understood by the scientific community.

Forkhead-associated domain containing protein 1 (FHAD1) is a protein encoded by the FHAD1 gene.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

LOC101928193 is a protein which in humans is encoded by the LOC101928193 gene. There are no known aliases for this gene or protein. Similar copies of this gene, called orthologs, are known to exist in several different species across mammals, amphibians, fish, mollusks, cnidarians, fungi, and bacteria. The human LOC101928193 gene is located on the long (q) arm of chromosome 9 with a cytogenic location at 9q34.2. The molecular location of the gene is from base pair 133,189,767 to base pair 133,192,979 on chromosome 9 for an mRNA length of 3213 nucleotides. The gene and protein are not yet well understood by the scientific community, but there is data on its genetic makeup and expression. The LOC101928193 protein is targeted for the cytoplasm and has the highest level of expression in the thyroid, ovary, skin, and testes in humans.

<span class="mw-page-title-main">ZC3H11B</span>

ZC3H11B also known as zinc finger CCCH-type containing protein 11B is a protein in humans that is encoded by the ZC3H11B gene. The zc3h11b gene is located on chromosome 1, on the long arm, in band 4 section 1. This protein is also known as ZC3HDC11B. The zc3h11b gene is a total of 5,134 base pairs long, and the protein is 805 amino acids in length. The zc3h11b gene has 2 exons in total.

<span class="mw-page-title-main">C16orf90</span> Protein-coding gene in the species Homo sapiens

C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.

<span class="mw-page-title-main">ZNF337</span> Protein-coding gene in the species Homo sapiens

ZNF337, also known as zinc finger protein 337, is a protein that in humans is encoded by the ZNF337 gene. The ZNF337 gene is located on human chromosome 20 (20p11.21). Its protein contains 751 amino acids, has a 4,237 base pair mRNA and contains 6 exons total. In addition, alternative splicing results in multiple transcript variants. The ZNF337 gene encodes a zinc finger domain containing protein, however, this gene/protein is not yet well understood by the scientific community. The function of this gene has been proposed to participate in a processes such as the regulation of transcription (DNA-dependent), and proteins are expected to have molecular functions such as DNA binding, metal ion binding, zinc ion binding, which would be further localized in various subcellular locations. While there are no commonly associated or known aliases, an important paralog of this gene is ZNF875

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">C12orf50</span> Protein-coding gene in humans

Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.

<span class="mw-page-title-main">GPATCH2L</span> It is Wikipedia article of unknown gene called "GPATCH2L".

GPATCH2L is a protein that is encoded by the GPATCH2L human gene located at 14q24.3. In humans, the length of mRNA in GPATCH2L (NM_017926) is 14,021 base pairs and the gene spans bases is 62,422 nt between chr14: 76,151,922 - 76,214,343. GPATCH2L is on the positive strand. IFT43 is the gene directly before GPATCH2L on the positive strand and LOC105370575 is the uncharacterized gene on the negative strand, which is approximately one and a half the size of GPATCH2L. Known aliases for GPATCH2L contain C14orf118, FLJ20689, FLJ10033, and KIAA1152. GPATCH2L produces 28 distinct introns, 17 different mRNAs, 14 alternatively spliced variants, and 3 unspliced forms. It has 5 probable alternative promoters, 7 validated polyadenylation sites, and 6 predicted promoters of varying lengths.

<span class="mw-page-title-main">ZNF821</span> Zinc Finger 821

Zinc Finger Protein 821, also known as ZNF821, is a protein encoded by the ZNF821 gene. This gene is located on the 16th chromosome and is expressed highly in the testes, moderately expressed in the brain and low expression in 23 other tissues. The protein encoded is 412 amino acids long with 2 Zinc Finger motifs and a 23 amino acid long STPR domain.

<span class="mw-page-title-main">UBALD1</span> Human Gene/Protein

UBALD1 is a protein encoded by the UBALD1 gene, located on chromosome 16 in humans. UBALD1 has high ubiquitous tissue expression and localizes in the nucleus and cytoplasm. UBALD1 is conserved in animals, including invertebrates. An alias for UBALD1 is FAM100A.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000102053 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000035045 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.

Further reading