SERTM2

Last updated

SERTM2, also known as the Serine Rich And Transmembrane Domain Containing 2, is a protein which in humans is encoded by the SERTM2 gene. The SERTM2 protein is a transmembrane protein located in the intracellular membrane and active in membrane-bound organelles. [1] [2] SERTM2 expression has been linked to metastatic prostate tumors, prostate carcinomas and renal cell carcinomas. [3] [4]

Contents

Gene

SERTM2 labelled at location Xq23 SERTM2 Chromosome Label.jpg
SERTM2 labelled at location Xq23

The SERTM2 gene in humans is located on the positive strand of the X chromosome (Xq23), spanning 10,755 base pairs. [5] The SERTM2 gene has three total exons. There is one known transcript or isoform that spans 4,612 base pairs. [6]

Aliases

SERTM2 is also known as:

Protein

Predicted tertiary structure prediction of the SERTM2 protein in humans, Microcaecilia unicolor, alligators, and wombats. Blue = high confidence, yellow = low confidence. Made using the AlphaFold Protein Structure Database. HasSERTM2Protein.png
Predicted tertiary structure prediction of the SERTM2 protein in humans, Microcaecilia unicolor, alligators, and wombats. Blue = high confidence, yellow = low confidence. Made using the AlphaFold Protein Structure Database.

The SERTM2 protein is 90 amino acids long. This protein has a predicted molecular weight of 10 kDa and an isoelectric point of 6. [11] [12] The human SERTM2 protein structure contains two topological domains: extracellular and cytoplasmic. [13] These domains are connected by a transmembrane domain within a confirmed alpha helix. [8] [9] [10] [12] The human protein contains a disordered region at the tail of the protein. [12] Despite having serine-rich in its common name, the protein was not found to have abundance of serine or any other amino acid when compared to other human proteins. [11]

Post-translational modifications

Diagram of human SERTM2 protein highlighting its transmembrane domain. The N-terminus is extracellular, and the C-terminus is intracellular. Made using Protter. Problem Set 4 2022.docx (1).jpg
Diagram of human SERTM2 protein highlighting its transmembrane domain. The N-terminus is extracellular, and the C-terminus is intracellular. Made using Protter.

The human SERTM2 protein has one confirmed post-translational modification at the 11th position. [6] The asparagine at that position undergoes N-linked glycosylation, or the attachment of an oligosaccharide to a nitrogen atom on the asparagine side chain. [15]

Expression

RNA-sequencing and human tissue profiling has found that SERTM2 is expressed primarily in the endometrium prostate, and liver of humans at moderate level. [6] SERTM2 is found to be upregulated in cardiac progenitor cells compared to mesoderm cells and in fetal cells versus adult heart tissue using RNA-sequencing data. [7] Using knockout and overexpression experiments, it was found that both the knockout and overexpression of SERTM2 results in low cardiomyocyte yield, suggesting that expression must be carefully regulated during cellular differentiation for normal cardiac development to occur and resulted in the nickname CARDEL (Cardiac Development Long non-coding RNA). [7]

Homologs and evolution

The human SERTM2 has no paralogs. SERTM2 orthologs are found in mammals, birds, reptiles, amphibians, and some fish. [13] The earliest known SERTM2 gene appeared 462 million years ago in the catshark, a cartilaginous fish. The gene is hard to find in fish, with only two other known appearances in the tiger barb and the Chinese sucker fish, two bony fish. SERTM2 became more established in amphibians 352 million years ago, and its orthologs are found throughout modern reptiles, birds, mammals, and primates. [12]

Table 1:Human serine-rich and transmembrane-domain containing 2 (SERTM2) gene orthologs. Orthologs are sorted first by date of divergence from the human gene, then by similarity to the human sequence. [12]

Common NameScientific NameAccession NumberTaxonomical GroupSequence Length (amino acids)Date of Divergence

(MYA)

% identical
PrimataHumanHomo sapiens NP_001341402.1 Primates 90-100
Ring-tailed lemur Lemur catta XP_045393689.1 Primates 907493
Beluga whale Delphinapterus leucas XP_030615360.1 Cetacea 909492
Mouse Mus musculus NP_001341422.1 Rodentia 898791
Big brown bat Eptesicus fuscus XP_054573025.1 Chiroptera 909481
Common wombat Vombatus ursinus XP_027691215.1 Marsupial 9016081
Aves Blue tit Cyanistes caeruleus XP_023773484.1 Aves 9131976
Chicken Gallus gallus XP_046795767.1 Aves 9231973
Reptilia Alligator Alligator mississippiensis XP_059588794.1 Crocodilia 9231979
Burmese python Python bivittatus XP_025020345.1 Squamata 9231975
Softshell turtle Pelodiscus sinensis XP_025033828.1 Testudines 9231960
Amphibians Microcaecilia unicolor Microcaecilia unicolor XP_030065343.1 Gymnophiona 9135268
Two-lined caecilians Rhinatrema bivittatum XP_029463498.1 Gymnophiona 9335267
Common frog Rana temporaria XP_040179805.1 Anura 9235270
Fish/Sharks Tiger barb Puntigrus tetrazona XP_043094501.1 Osteichthyes 10342924
Chinese sucker fish Myxocyprinus asiaticus XP_051542736.1 Osteichthyes 10842921
Catshark Scyliorhinus canicula XP_038632174.1 Chondrichthyes 8946242

Clinical significance

Metastatic tumors in the prostate have been shown to have 3-fold more expression of SERTM2 than primary tumors, suggesting that overexpression of SERTM2 may be linked to the metastatic nature of prostate tumors. [3] SERTM2 overexpression has been observed in tumor microenvironment of androgen receptor pathway-positive adenocarcinoma of the prostate (ARPC). [4] In comparison to ARPC, SERTM2 expression is lower in the tumor microenvironment of neuroendocrine prostate carcinomas (NEPC), a more severe type of prostate cancer. [4]

Related Research Articles

<span class="mw-page-title-main">TMPRSS2</span> Protein-coding gene in the species Homo sapiens

Transmembrane protease, serine 2 is an enzyme that in humans is encoded by the TMPRSS2 gene. It belongs to the TMPRSS family of proteins, whose members are transmembrane proteins which have a serine protease activity. The TMPRSS2 protein is found in high concentration in the cell membranes of epithelial cells of the lung and of the prostate, but also in the heart, liver and gastrointestinal tract.

<span class="mw-page-title-main">TMEPAI</span> Protein-coding gene in the species Homo sapiens

Transmembrane prostate androgen-induced protein is a protein that in humans is encoded by the PMEPA1 gene.

<span class="mw-page-title-main">FAM83A</span> Protein-coding gene in the species Homo sapiens

Protein FAM83A also known as tumor antigen BJ-TSA-9 is a protein that in humans is encoded by the FAM83A gene.

<span class="mw-page-title-main">QSER1</span> Protein-coding gene in the species Homo sapiens

Glutamine Serine Rich Protein 1 or QSER1 is a protein encoded by the QSER1 gene.

<span class="mw-page-title-main">TM6SF2</span> Protein-coding gene in the species Homo sapiens

TM6SF2 is the Transmembrane 6 superfamily 2 human gene which codes for a protein by the same name. This gene is otherwise called KIAA1926. Its exact function is currently unknown.

<span class="mw-page-title-main">Proser2</span> Protein-coding gene in the species Homo sapiens

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

<span class="mw-page-title-main">FAM76A</span> Protein-coding gene in the species Homo sapiens

FAM76A is a protein that in Homo sapiens is encoded by the FAM76A gene. Notable structural characteristics of FAM76A include an 83 amino acid coiled coil domain as well as a four amino acid poly-serine compositional bias. FAM76A is conserved in most chordates but it is not found in other deuterostrome phlya such as echinodermata, hemichordata, or xenacoelomorpha—suggesting that FAM76A arose sometime after chordates in the evolutionary lineage. Furthermore, FAM76A is not found in fungi, plants, archaea, or bacteria. FAM76A is predicted to localize to the nucleus and may play a role in regulating transcription.

<span class="mw-page-title-main">VXN</span> Protein-coding gene in the species Homo sapiens

Vexin is a protein encoded by VXN gene. VXN is found to be highly expressed in regions of the brain and spinal cord.

Chromosome 19 open reading frame 18 (c19orf18) is a protein which in humans is encoded by the c19orf18 gene. The gene is exclusive to mammals and the protein is predicted to have a transmembrane domain and a coiled coil stretch. This protein has a function that is not yet fully understood by the scientific community.

<span class="mw-page-title-main">C1orf185</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">SBK3</span> Protein-coding gene in the species Homo sapiens

SH3 Domain Binding Kinase Family Member 3 is an enzyme that in humans is encoded by the SBK3 gene. SBK3 is a member of the serine/threonine protein kinase family. The SBK3 protein is known to exhibit transferase activity, especially phosphotransferase activity, and tyrosine kinase activity. It is well-conserved throughout mammalian organisms and has two paralogs: SBK1 and SBK2.

C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.

<span class="mw-page-title-main">OCEL1</span> Protein-coding gene in the species Homo sapiens

OCEL1, also called Occludin//ELL Domain Containing 1, is a protein encoding gene located at chromosome 19p13.11 in the human genome. Other aliases for the gene include FLJ22709, FWP009, and S863-9. The function of OCEL1 has not yet been identified.

<span class="mw-page-title-main">GPATCH2L</span> It is Wikipedia article of unknown gene called "GPATCH2L".

GPATCH2L is a protein that is encoded by the GPATCH2L human gene located at 14q24.3. In humans, the length of mRNA in GPATCH2L (NM_017926) is 14,021 base pairs and the gene spans bases is 62,422 nt between chr14: 76,151,922 - 76,214,343. GPATCH2L is on the positive strand. IFT43 is the gene directly before GPATCH2L on the positive strand and LOC105370575 is the uncharacterized gene on the negative strand, which is approximately one and a half the size of GPATCH2L. Known aliases for GPATCH2L contain C14orf118, FLJ20689, FLJ10033, and KIAA1152. GPATCH2L produces 28 distinct introns, 17 different mRNAs, 14 alternatively spliced variants, and 3 unspliced forms. It has 5 probable alternative promoters, 7 validated polyadenylation sites, and 6 predicted promoters of varying lengths.

CKLF-like MARVEL transmembrane domain-containing 5 (CMTM5), previously termed chemokine-like factor superfamily 5, designates any one of the six protein isoforms encoded by six different alternative splices of its gene, CMTM5; CMTM5-v1 is the most studied of these isoforms. The CMTM5 gene is located in band 11.2 on the long arm of chromosome 14.

<span class="mw-page-title-main">TMEM248</span> Transmembrane protein 248/TMEM248 gene

Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.

<span class="mw-page-title-main">TMEM61</span> Protein and coding gene in humans

Transmembrane protein 61 (TMEM61) is a protein that is encoded by the TMEM61 gene in humans. It is located on the first chromosome in humans and is highly expressed in the intestinal regions predominantly the kidney, adrenal gland and pituitary tissues. The protein, unlike other transmembrane protein in the region does not promote cancer growth. However, the TMEM61 protein when inhibited by secondary factors restricts normal activity in the kidney. The human protein shares many Orthologs and has been prevalent on Earth for millions of years.

<span class="mw-page-title-main">CCDC177</span> Protein-coding gene in Homo sapiens

Coiled-Coil Domain Containing 177 (CCDC177) is a protein, which in humans, is encoded by the gene CCDC177. It is composed of a coiled helical domain that spans half of the protein. CCDC177 deletions are associated with intellectual disability and congenital heart defects.

<span class="mw-page-title-main">WDR88</span> Human gene

WDR88 is a protein, which in humans, is encoded by the gene WDR88. It consists of seven WD40 repeats, which form a seven-bladed beta-propeller. Mutations within the WDR88 gene are associated with a variety of cancers, as well as schizophrenia and fungal infections.

References

  1. Alliance of Genome Resources. "SERTM2" . Retrieved 28 September 2023.
  2. Watanabe, Ryuta; Miura, Noriyoshi; Kurata, Mie; Kitazawa, Riko; Kikugawa, Tadahiko; Saika, Takashi (January 2023). "Spatial Gene Expression Analysis Reveals Characteristic Gene Expression Patterns of De Novo Neuroendocrine Prostate Cancer Coexisting with Androgen Receptor Pathway Prostate Cancer". International Journal of Molecular Sciences. 24 (10): 8955. doi: 10.3390/ijms24108955 . ISSN   1422-0067. PMC   10219300 . PMID   37240308.
  3. 1 2 Chandran, Uma R.; Ma, Changqing; Dhir, Rajiv; Bisceglia, Michelle; Lyons-Weiler, Maureen; Liang, Wenjing; Michalopoulos, George; Becich, Michael; Monzon, Federico A. (2007-04-12). "Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process". BMC Cancer. 7 (1): 64. doi: 10.1186/1471-2407-7-64 . ISSN   1471-2407. PMC   1865555 . PMID   17430594.
  4. 1 2 3 Watanabe, Ryuta; Miura, Noriyoshi; Kurata, Mie; Kitazawa, Riko; Kikugawa, Tadahiko; Saika, Takashi (2023-05-18). "Spatial Gene Expression Analysis Reveals Characteristic Gene Expression Patterns of De Novo Neuroendocrine Prostate Cancer Coexisting with Androgen Receptor Pathway Prostate Cancer". International Journal of Molecular Sciences. 24 (10): 8955. doi: 10.3390/ijms24108955 . ISSN   1422-0067. PMC   10219300 . PMID   37240308.
  5. 1 2 3 GeneCards (Aug 2, 2023). "SERTM2 Gene - Serine Rich And Transmembrane Domain Containing 2" . Retrieved 28 September 2023.
  6. 1 2 3 National Library of Medicine. "Serine rich and transmembrane domain containing 2 (SERTM2) [Homo sapiens (human)], Gene" . Retrieved 28 September 2023.
  7. 1 2 3 Pereira, Isabela T.; Gomes-Júnior, Rubens; Hansel-Frose, Aruana; Liu, Man; Soliman, Hossam A.N.; Chan, Sunny S.K.; Dudley, Samuel C.; Kyba, Michael; Dallagiovanna, Bruno (19 February 2023). "Cardiac Development Long non-coding RNA (CARDEL) is activated during human heart development and contributes to cardiac specification and homeostasis". doi:10.1101/2023.02.19.529122. S2CID   257052580.
  8. 1 2 "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2023-12-14.
  9. 1 2 Jumper, John; Evans, Richard; Pritzel, Alexander; Green, Tim; Figurnov, Michael; Ronneberger, Olaf; Tunyasuvunakool, Kathryn; Bates, Russ; Žídek, Augustin; Potapenko, Anna; Bridgland, Alex; Meyer, Clemens; Kohl, Simon A. A.; Ballard, Andrew J.; Cowie, Andrew (August 2021). "Highly accurate protein structure prediction with AlphaFold". Nature. 596 (7873): 583–589. Bibcode:2021Natur.596..583J. doi:10.1038/s41586-021-03819-2. ISSN   1476-4687. PMC   8371605 . PMID   34265844.
  10. 1 2 Varadi, Mihaly; Anyango, Stephen; Deshpande, Mandar; Nair, Sreenath; Natassia, Cindy; Yordanova, Galabina; Yuan, David; Stroe, Oana; Wood, Gemma; Laydon, Agata; Žídek, Augustin; Green, Tim; Tunyasuvunakool, Kathryn; Petersen, Stig; Jumper, John (2022-01-07). "AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models". Nucleic Acids Research. 50 (D1): D439–D444. doi:10.1093/nar/gkab1061. ISSN   0305-1048. PMC   8728224 . PMID   34791371.
  11. 1 2 "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2023-12-07.
  12. 1 2 3 4 5 "Home - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-10-23.
  13. 1 2 SERTM2 (SERTM2 - serine rich and transmembrane domain containing 2) [https://www.ncbi.nlm.nih.gov/gene/401613]
  14. Wollscheid Lab (2018). Protter [Computer Software]. https://wlab.ethz.ch/protter/
  15. Lowenthal, Mark S.; Davis, Kiersta S.; Formolo, Trina; Kilpatrick, Lisa E.; Phinney, Karen W. (2016-07-01). "Identification of novel N-glycosylation sites at non-canonical protein consensus motifs". Journal of Proteome Research. 15 (7): 2087–2101. doi:10.1021/acs.jproteome.5b00733. ISSN   1535-3893. PMC   5100817 . PMID   27246700.