Trinucleotide repeat containing 18

Last updated
TNRC18
Identifiers
Aliases TNRC18 , CAGL79, TNRC18A, trinucleotide repeat containing 18
External IDs MGI: 3648294 HomoloGene: 45603 GeneCards: TNRC18
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001013722
NM_001080495

NM_001122730
NM_178242

RefSeq (protein)

NP_001073964

NP_001116202
NP_839973

Location (UCSC) Chr 7: 5.31 – 5.43 Mb Chr 5: 142.72 – 142.82 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Trinucleotide repeat containing 18 is a protein that in humans is encoded by the TNRC18 gene. [5]

Contents

Function

The exact function of TNRC18 is not yet well understood by the scientific community. The protein sequence provided by the National Center for Biotechnology Information (NCBI) database includes a Bromo Adjacent Homology (BAH) Domain within TNRC18. [6] BAH domains are often found in chromatin-associated proteins and assist in the silencing of genes. [7]

Gene

According to the UCSC Genome Browser, TNRC18 is located within Chromosome 7 in humans (chr7: 5,306,800-5,423,546). There are 29 introns and 30 exons listed. Directly preceding TNRC18 is SLC29A4 and immediately following is AC092171.4. [8] SLC29A4 encodes the plasma membrane monoamine transporter in humans.

GeneCards lists five aliases for TNRC18, Long CAG Trinucleotide Repeat-Containing Gene 79 Protein, Trinucleotide Repeat-Containing Gene 18 Protein, CAGL79, KIAA1856, and TNRC18A. Additionally, TNRC18 has two paralogs, BAH Domain And Coiled-Coil Containing 1 (BAHCC1) and Bromo Adjacent Homology Domain Containing 1 (BAHD1). [9]

mRNA and Isoforms

The NCBI gene page for TNRC18 lists 9 different protein isoforms across 12 transcript variant mRNA sequences. [10] TNRC18 isoform X7 is encoded by mRNA transcript variants X7-X10. Additionally, isoforms X8 and X9 are produced by variants X11 and X12 respectively.

TNRC18 Isoform variations
IsoformmRNAProteinmRNA length (bp)Protein length (aa)
TNRC18 NM_001080495 NP_001073964 105862968
TNRC18 isoform X1 XM_017012728 XP_016868217 109022979
TNRC18 isoform X2 XM_017012730 XP_016868219 104242978
TNRC18 isoform X3 XM_017012731 XP_016868220 102992936
TNRC18 isoform X4 XM_017012732 XP_016868221 102962935
TNRC18 isoform X5 XM_017012733 XP_016868222 102842931
TNRC18 isoform X6 XM_017012734 XP_016868223 113192929
TNRC18 isoform X7 XM_017012735 XP_016868224 103122905
TNRC18 isoform X7 XM_017012736 XP_016868225 108992905
TNRC18 isoform X7 XM_017012737 XP_016868226 104722905
TNRC18 isoform X7 XM_017012738 XP_016868227 101942905
TNRC18 isoform X8 XM_017012739 XP_016868228 68162189
TNRC18 isoform X9 XM_017012740 XP_016868229 68392094

Protein

The protein sequence provided by NCBI lists human TNRC18 having a length of 2968 amino acids. [6] The Compute pI/Mw tool program by ExPASy [11] predicts the isoelectric point and molecular weight for the TNRC18 to be 8.88 and 315 kDa respectively. Additionally, the NCBI protein sequence for TNRC18 contains nine phosphorylation sites on TNRC18, eight phosphoserines and one phosphothreonine. There is a large serine repeat upstream of the BAH site located from amino acid positions 2604–2670. The BAH site is located on position 2816–2960.

The predicted secondary structure for TNRC18 consists of 32.61% alpha helix, 6.74% extended strand, and 60.55% random coil. This was found using the GOR4 program available at PRABI-Lyon-Gerland with the NCBI protein sequence for TNRC18. [12] [13]

Expression

RNA sequencing of TNRC18 tissue samples found ubiquitous gene expression. Most prominent expression was observed within the colon, kidney, and prostate tissue samples. In fetal human tissue samples, notable expression was found in the stomach, lung, and brain. RNA sequencing data was acquired though the TNRC18 gene expression page found on NCBI. [14]

The Human Protein Atlas shows highest RNA expression of TNRC18 in the brain, endocrine tissue, and muscle tissue. Additionally, the highest protein expression is observed in the brain, endocrine tissue, lung, gastrointestinal tract, and male and female specific tissues. Conversely, there is no protein expression in the eye or blood tissue, yet ubiquitous RNA expression for TNRC18. [15]

TNRC18 expression in mouse brain can be found below. Noteworthy expression is observed in the olfactory bulb, isocortex, and cerebellar cortex shown in color. This image and brain atlas information is provided by the Allen Institute Brain Atlas. [16] [17]

Expression of TNRC18 in Mus musculus brain. Expression is observed in the olfactory bulb, isocortex, and cerebellar cortex (see color). This image is provided by the Allen Institute Brain Atlas. TNRC18 Mouse Expression.png
Expression of TNRC18 in Mus musculus brain. Expression is observed in the olfactory bulb, isocortex, and cerebellar cortex (see color). This image is provided by the Allen Institute Brain Atlas.

Homology

NCBI Protein BLAST search for reference proteins lists the following orthologs for human TNRC18. The table is ordered first by increasing estimated date of divergence from humans in millions of years (MYA) and then by highest-to-lowest sequence identity with humans. Date of divergence information was acquired from TimeTree [18] and sequence identify and similarity percentages were found by a pairwise sequence alignment using the European Bioinformatics Institute (EMBL-EBI) EMBOSS Needle program. [19]

Orthologs for Homo sapiens TNRC18 gene
SpeciesCommon nameNCBI Protein AccessionDate of Divergence (MYA)Sequence Identity with Humans (%)Sequence Similarity with Humans (%)Length (aa)
Homo SapiensHuman NP_001073964 01001002968
Pongo abelii Sumatran orangutan XP_024097037 15.7698.698.92964
Nomascus leucogenys Northern white-cheeked gibbon XP_030652734 19.898.498.82965
Ictidomys tridecemlineatus Thirteen-lined ground squirrel XP_021575654 9085.388.22924
Rattus norvegicus Brown Rat NP_001100593 9081.485.62900
Acinonyx jubatus Cheetah XP_026898111 9687.589.92972
Orcinus orca Killer whale XP_012388176 9687.389.82967
Lynx canadensis Canada lynx XP_030156810 9687.389.82966
Enhydra lutris kenyoni Sea otter XP_022356606 9686.889.62999
Odocoileus virginianus texanus White-tailed deer XP_020730376 9686.289.12939
Ursus arctos horribilis Grizzy Bear XP_026371323 9682.1853111
Haliaeetus leucocephalus Bald Eagle XP_010568620 31258.769.42928
Apteryx rowi Okarito kiwi XP_025939070 31258.769.32932
Pogona vitticeps Central bearded dragon XP_020658091 31254.765.72943
Python bivittatus Burmese python XP_015744874 31253.6652872
Perca flavescens Yellow perch XP_028454308 43539.150.53044
Amphiprion ocellaris Ocellaris clownfish XP_023150301 43539503071
Branchiostoma belcheri Lancelet XP_019646059 68425.636.52799
Saccoglossus kowalevskii Acorn worm XP_002738509 68423.634.83174
Crassostrea virginica Eastern oyster XP_022319473 7972131.62200

Predicted post-translational modifications and motifs

The following post-translational modifications and motifs are predicted for TNRC18 and found on the ExPASy Proteomics page. [20] Exception to GPS-MSP methylation program which is found on The Cuckoo Workgroup site. [21] This list is not conclusive of the total post-translational modifications or motifs associated with TNRC18 and is solely based on software predictions.

Of the predicted post-translational modifications, there are 92 O-Linked β-N-acetylglucosamine (O-ß-GlcNAc) sites with a high scoring threshold (>=0.5), 23 Sumoylation sites, two palmitoylation sites, one methylation site, and 52 glycation sites. Additionally, GPS 5.0 predicted 22,317 phosphorylation sites on TNRC18. The program was used to confirm the nine phosphorylation sites found on the NCBI protein page for TNRC18.

Predicted post-translational modifications and motifs for TNRC18
ProgramPredicted post-translational modificationAmino Acid location on protein
YinOYang O-ß-GlcNAc 80, 185, 199, 352, 416, 626, 627, 640, 788, 991, 995, 1033, 1038, 1533, 1753, 1956, 2023, 2368, 2404, 2510, 2557, 2559–2573, 2611, 2614–2667, 2721, 2892
GPS-SUMO (SUMOsp) Sumoylation 238-242, 467, 620, 652, 858–862, 1159, 1258, 1461, 1544, 1629, 1638, 1704, 1743, 1885–1889, 1893, 1898, 1899, 2098, 2213–2217, 2259–2263, 2463, 2542, 2964-2968
CSS-Palm Palmitoylation 284, 1196
GPS-MSP Methylation 2332
GPS 5.0 Phosphorylation Phosphorylation 263, 611, 1127, 1136, 1540, 1857, 1863, 2146, 2771
NetGlycate Glycation 156, 197, 270, 272, 429, 492, 548, 609, 652. 692, 749, 755, 938, 988, 1058, 1059, 1131, 1370, 1461, 1470, 1503, 1554, 1558, 1577, 1615, 1618, 1791, 1797, 1893, 1895,

1898, 1899, 1933, 1967, 1978, 2028, 2091, 2301, 2315, 2328, 2388, 2438, 2475, 2519, 2702, 2709, 2720, 2750, 2801, 2816, 2857, 2869

Eukaryotic Linear Motif (ELM) Coiled-Coil region 916-949, 1481-1516

Clinical significance

Shen et al. observed circTNRC18 inhibiting miR-762 activity within pre-eclampsia (PE) placenta tissue samples. [22] The inhibition of miR-762 by circTNRC18 resulted in elevated Grhl2 protein levels. PE placenta samples were observed to have lower miR-762 levels and higher Grhl2 levels which was attributed to overexpression of circTNRC18. Shen et al. conclude that circTNRC18 was upregulated in PE placentas when compared with normal pregnancy placentas.

Chu et al. found that from 19 CpG sites linked with glomerular filtration rate (eGFR), 5 were also linked with renal fibrosis and DNA methylation occurrences in the kidney cortex of chronic kidney disease (CKD) patients. [23] Chu et. note that reduced eGFR is a defining feature of (CKD). These 5 CpG sites were found in proteins TNRC18, PTPN6/PHB2, ANKRD11, PQLC2, and PRPF8. Chu et al. conclude that epigenetic variation may be associated with CKD.

Related Research Articles

<span class="mw-page-title-main">C11orf49</span> Protein-coding gene in the species Homo sapiens

C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.

LCHN is a protein that in humans is encoded by the KIAA1147 gene located on chromosome 7. It is likely part of the tripartite DENN domain family of proteins that often function as Rab-GEFs to regulate vesicular trafficking. Both the mRNA and protein have been shown to be upregulated following ischemic stroke, and to be produced at altered levels in patients with FTD-ALS, however the gene's contribution to these states is not well understood.

<span class="mw-page-title-main">C17orf53</span>

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

<span class="mw-page-title-main">ZCCHC18</span> Protein-coding gene in the species Homo sapiens

Zinc finger CCHC-type containing 18 (ZCCHC18) is a protein that in humans is encoded by ZCCHC18 gene. It is also known as Smad-interacting zinc finger protein 2 (SIZN2), para-neoplastic Ma antigen family member 7b (PNMA7B), and LOC644353. Other names such as zinc finger, CCHC domain containing 12 pseudogene 1, P0CG32, ZCC18_HUMAN had been used to describe this protein.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

<span class="mw-page-title-main">C9orf25</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">LRRIQ3</span> Protein-coding gene in the species Homo sapiens

LRRIQ3, which is also known as LRRC44, is a protein that in humans is encoded by the LRRIQ3 gene. It is predominantly expressed in the testes, and is linked to a number of diseases.

<span class="mw-page-title-main">C16orf86</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

<span class="mw-page-title-main">C22orf23</span> Protein-coding gene in the species Homo sapiens

C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.

<span class="mw-page-title-main">C20orf202</span>

C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.

<span class="mw-page-title-main">ZNF337</span> Protein-coding gene in the species Homo sapiens

ZNF337, also known as zinc finger protein 337, is a protein that in humans is encoded by the ZNF337 gene. The ZNF337 gene is located on human chromosome 20 (20p11.21). Its protein contains 751 amino acids, has a 4,237 base pair mRNA and contains 6 exons total. In addition, alternative splicing results in multiple transcript variants. The ZNF337 gene encodes a zinc finger domain containing protein, however, this gene/protein is not yet well understood by the scientific community. The function of this gene has been proposed to participate in a processes such as the regulation of transcription (DNA-dependent), and proteins are expected to have molecular functions such as DNA binding, metal ion binding, zinc ion binding, which would be further localized in various subcellular locations. While there are no commonly associated or known aliases, an important paralog of this gene is ZNF875

<span class="mw-page-title-main">CLIP4</span> Protein

CAP-Gly Domain Containing Linker Protein Family Member 4 is a protein that in humans is encoded by the CLIP4 gene. In terms of conserved domains, the CLIP4 gene contains primarily ankyrin repeats and the eponymous CAP-Gly domains. The structure of the CLIP4 protein is largely made up of coil, with alpha helices dominating the rest of the protein. CLIP4 mRNA expression occurs largely in the adrenal cortex and atrioventricular node. The literature encompassing CLIP4's conserved domains and paralogs points toward microtubule regulation as a possible function of CLIP4.

<span class="mw-page-title-main">Fam89A</span> Human protein and gene

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">SMIM19</span> Protein-coding gene in the species Homo sapiens

SMIM19, also known as Small Integral Membrane Protein 19, encodes the SMIM19 protein. SMIM19 is a confirmed single-pass transmembrane protein passing from outside to inside, 5' to 3' respectively. SMIM19 has ubiquitously high to medium expression with among varied tissues or organs. The validated function of SMIM19 remains under review because of on sub-cellular localization uncertainty. However, all linked proteins research to interact with SMIM19 are associated with the endoplasmic reticulum (ER), presuming SMIM19 ER association

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

<span class="mw-page-title-main">FAM120AOS</span> Protein-coding gene in the species Homo sapiens

FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.

<span class="mw-page-title-main">GPATCH2L</span> It is Wikipedia article of unknown gene called "GPATCH2L".

GPATCH2L is a protein that is encoded by the GPATCH2L human gene located at 14q24.3. In humans, the length of mRNA in GPATCH2L (NM_017926) is 14,021 base pairs and the gene spans bases is 62,422 nt between chr14: 76,151,922 - 76,214,343. GPATCH2L is on the positive strand. IFT43 is the gene directly before GPATCH2L on the positive strand and LOC105370575 is the uncharacterized gene on the negative strand, which is approximately one and a half the size of GPATCH2L. Known aliases for GPATCH2L contain C14orf118, FLJ20689, FLJ10033, and KIAA1152. GPATCH2L produces 28 distinct introns, 17 different mRNAs, 14 alternatively spliced variants, and 3 unspliced forms. It has 5 probable alternative promoters, 7 validated polyadenylation sites, and 6 predicted promoters of varying lengths.

<span class="mw-page-title-main">CXorf65</span> Gene on human chromosome X

Human uncharacterized protein CXorf65 is encoded by the gene CXorf65, which is located on the minus strand of chromosome X. Its transcript is 834 nucleotides long and consists of 6 exons. The translated protein is 183 amino acids in length. with a molecular weight of 21.3 kDa

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000182095 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000039477 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Entrez Gene: Trinucleotide repeat containing 18" . Retrieved 2016-10-22.
  6. 1 2 "trinucleotide repeat-containing gene 18 protein [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-05-01.
  7. "InterPro". www.ebi.ac.uk. Retrieved 2020-05-01.
  8. "Human hg38 chr7:5,306,800-5,423,546 UCSC Genome Browser v397". genome.ucsc.edu. Retrieved 2020-05-03.
  9. "TNRC18 Gene - GeneCards | TNC18 Protein | TNC18 Antibody". www.genecards.org. Retrieved 2020-05-01.
  10. "TNRC18 trinucleotide repeat containing 18 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-05-01.
  11. "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2020-05-01.
  12. NPS@: Network Protein Sequence Analysis TIBS 2000 March Vol. 25, No 3 [291]:147-150 Combet C., Blanchet C., Geourjon C. and Deléage G.
  13. "NPS@ : GOR4 secondary structure prediction". npsa-prabi.ibcp.fr. Retrieved 2020-05-03.
  14. "TNRC18 Gene Expression - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-05-03.
  15. "Tissue expression of TNRC18 - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2020-05-03.
  16. "Interactive Atlas Viewer :: Atlas Viewer". atlas.brain-map.org. Retrieved 2020-05-03.
  17. 1 2 "Experiment 73424495 Tnrc18 - RP_051019_04_A01 - sagittal". Allen Institute Brain Atlas. © 2020 Allen Institute for Brain Science. Allen Human Brain Atlas. Available from: human.brain-map.org. Retrieved 3 May 2020.{{cite web}}: CS1 maint: url-status (link)
  18. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2020-05-03.
  19. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2020-05-03.
  20. "ExPASy: SIB Bioinformatics Resource Portal - Categories". www.expasy.org. Retrieved 2020-05-03.
  21. "GPS-MSP - Methyl-group Specific Predictor 1.0". msp.biocuckoo.org. Retrieved 2020-05-03.
  22. Shen XY, Zheng LL, Huang J, Kong HF, Chang YJ, Wang F, Xin H (November 2019). "CircTRNC18 inhibits trophoblast cell migration and epithelial-mesenchymal transition by regulating miR-762/Grhl2 pathway in pre-eclampsia". RNA Biology. 16 (11): 1565–1573. doi:10.1080/15476286.2019.1644591. PMC   6779405 . PMID   31354028.
  23. Chu AY, Tin A, Schlosser P, Ko YA, Qiu C, Yao C, et al. (November 2017). "Epigenome-wide association studies identify DNA methylation associated with kidney function". Nature Communications. 8 (1): 1286. doi:10.1038/s41467-017-01297-7. PMC   5668367 . PMID   29097680.