LSMEM2

Identifiers
Aliases: LSMEM2, C3orf45, leucine rich single-pass membrane protein 2
External IDs: MGI: 3612240; HomoloGene: 45163; GeneCards: LSMEM2

Orthologs (human / mouse)
RefSeq (mRNA): human NM_001304385, NM_153215; mouse NM_001081244
RefSeq (protein): human NP_001291314, NP_694947; mouse n/a
Location (UCSC): human Chr 3: 50.28 – 50.29 Mb; mouse n/a
PubMed search: human [2]; mouse [3]

Leucine rich single-pass membrane protein 2 (LSMEM2) is a leucine-rich, single-pass membrane protein that in humans is encoded by the LSMEM2 gene (also known as C3orf45). [4] The LSMEM2 protein is conserved in mammals, birds, and reptiles. [5] In humans, LSMEM2 is highly expressed in the heart, skeletal muscle and tongue. [6] [4]

Gene

LSMEM2 is also known as C3orf45. [4] It is located at human chromosome locus 3p21 on the plus strand, spanning bases 50,277,907 to 50,288,116. [4] This gene is 1,434 base pairs long and has four exons. [4] Nearby genes include SEMA3B and IFRD2. [4]
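
The genomic span implied by the coordinates above can be checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption; the text does not say). The result is considerably larger than 1,434 bp, so that figure may describe the spliced transcript rather than the whole locus; this reading is a guess and is not confirmed by the cited source.

```python
# Coordinates quoted in the text (plus strand).
# Assumption: 1-based, inclusive positions.
START = 50_277_907
END = 50_288_116


def span_length(start: int, end: int) -> int:
    """Length in base pairs of a 1-based, inclusive genomic interval."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


locus_bp = span_length(START, END)
print(f"Genomic span: {locus_bp:,} bp (~{locus_bp / 1000:.1f} kb)")
```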

mRNA

LSMEM2 has two transcript variants, which encode isoforms 1 and 2. [4] The two encoded proteins differ by a single amino acid. Variant 2 uses an alternate in-frame splice site in the 5' coding region compared with variant 1. [4] As a result, variant 1 is three base pairs longer at the junction of exons 2 and 3, and isoform 1 is one amino acid longer than isoform 2. [7]

Protein

The LSMEM2 protein has two isoforms. [8] Isoform 1 has an additional alanine after amino acid 57; otherwise the two isoforms are identical. [7] The protein has a predicted molecular weight of 17.8 kDa and a predicted isoelectric point (pI) of 5.7. [9] LSMEM2 is predicted to have one transmembrane region, which is 50% leucine and therefore considered leucine rich. [10] The N-terminus is predicted to be cytosolic (intracellular), while the C-terminus is predicted to be lumenal (extracellular). [11] It contains one annotated domain, domain of unknown function 4714 (DUF4714), spanning amino acids 13 to 161. [12]
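
The leucine content quoted for the transmembrane region is a simple residue fraction. A minimal sketch of that calculation follows; the segment shown is a made-up placeholder, not the LSMEM2 sequence, and the function name `residue_fraction` is hypothetical.

```python
from collections import Counter


def residue_fraction(segment: str, residue: str = "L") -> float:
    """Fraction of positions in `segment` occupied by `residue` (one-letter code)."""
    seg = segment.upper()
    if not seg:
        raise ValueError("segment must not be empty")
    return Counter(seg)[residue.upper()] / len(seg)


# Hypothetical 20-residue hydrophobic segment, for illustration only.
toy_tm = "LLAVLLGALVLAILGLVLAL"
print(f"Leucine fraction: {residue_fraction(toy_tm):.0%}")
```

Applied to the real predicted transmembrane segment, a value of 0.5 would correspond to the 50% figure given above.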

Figure: Predicted primary sequence, regions and post-translational modifications of the LSMEM2 protein.
Figure: Predicted intracellular/cytoplasmic, transmembrane, and extracellular/lumenal regions of the human LSMEM2 protein.

Post-translational Modifications

LSMEM2 is predicted to have an acetylation site and a palmitoylation site near the N-terminus of the protein. [15] [16] It is also predicted to have several phosphorylation and O-GlcNAc sites throughout the predicted intracellular/cytosolic region. [17] [18] A predicted N-glycosylation site spans amino acids 155 to 157 in the probable extracellular/lumenal region. [19]
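
An N-glycosylation site spanning three residues matches the classic sequon N-X-S/T, where X is any residue except proline. The sketch below scans a sequence for that motif. It only illustrates the pattern and does not reproduce the NetNGlyc prediction method; the test sequence is invented and is not LSMEM2.

```python
import re
from typing import List, Tuple

# N-X-[S/T] with X != P. The lookahead lets overlapping motifs be reported.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(protein: str) -> List[Tuple[int, str]]:
    """Return (1-based start position, tripeptide) for each N-X-S/T sequon."""
    return [(m.start() + 1, m.group(1)) for m in SEQUON.finditer(protein.upper())]


# Hypothetical sequence: NGS and NLT are sequons, NPT is excluded by the proline.
toy = "MKNGSAANPTQNLT"
print(find_sequons(toy))
```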

Figure: Schematic illustration displaying the predicted regions, domains, and post-translational modifications of the LSMEM2 protein.

Structure

The secondary and tertiary structures of LSMEM2 have not been determined experimentally. The secondary structure is predicted to be largely alpha-helical in the transmembrane and lumenal/extracellular regions. [22] The structure of the cytoplasmic/intracellular region remains unclear. A tertiary structure of the human LSMEM2 protein has been predicted with the I-TASSER software. [23]

Figure: Predicted tertiary structure of the human LSMEM2 protein by I-TASSER. The structure is colored in the order of the rainbow from the N-terminus to the C-terminus.

Homology

Paralogs

LSMEM2 has no known paralogs. [25]

Orthologs

LSMEM2 has 168 known orthologs, 131 of them in mammals; the remainder are found in birds and reptiles. [5] The protein is conserved in mammals, with a sequence similarity (chemically similar residues) of 71.3% even in the distantly related platypus. [25] The table below displays features of select orthologs of LSMEM2 of varying evolutionary distance. The predicted transmembrane domain of LSMEM2 is highly conserved in its orthologs. [26]

| Genus and Species | Common Name | Accession Number [27] | Length (amino acids) | Sequence Identity [28] | Sequence Similarity [25] | Date of Divergence (million years ago) [29] |
|---|---|---|---|---|---|---|
| Homo sapiens | Human | NP_001291314.1 | 163 | 100% | 100% | 0 |
| Acinonyx jubatus | Cheetah | XP_014932576 | 149 | 84.11% | 85.30% | 105 |
| Ornithorhynchus anatinus | Platypus | XP_028906032 | 172 | 67.97% | 71.30% | 177 |
| Gallus gallus | Chicken | XP_015148980.1 | 159 | 50.00% | 41.20% | 312 |
| Chrysemys picta | Painted turtle | XP_005308817 | 172 | 39.53% | 44.10% | 312 |
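
Sequence identity values such as those in the table are derived from pairwise alignments. The sketch below shows one common way to compute percent identity from two already-aligned sequences, counting identical columns over the full alignment length with gap columns included. This denominator convention is an assumption; the aligned strings are invented examples, and the function is not the EMBOSS Needle implementation.

```python
def percent_identity(aln_a: str, aln_b: str) -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Both inputs must be rows of the same alignment (equal length, '-' for gaps).
    Gap columns count toward the length but never toward the matches.
    """
    if len(aln_a) != len(aln_b) or not aln_a:
        raise ValueError("aligned sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(aln_a, aln_b) if a == b and a != "-")
    return 100.0 * matches / len(aln_a)


# Hypothetical aligned rows, for illustration only.
toy_a = "MKT-LLAV"
toy_b = "MKSALL-V"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```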


Evolution

LSMEM2 is estimated to have emerged about 312 million years ago (MYA). [29] It evolves at an intermediate rate compared with a quickly evolving protein, fibronectin, and a slowly evolving protein, cytochrome c. [30] Its sequence is predicted to change by 1% every 3.9 million years. [28] [29]
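
A rate of this kind is typically estimated by relating sequence divergence to divergence time. The sketch below applies a Poisson correction, m = -100 × ln(identity fraction), to the identity values and divergence dates listed in the ortholog table. The correction and the per-ortholog ratio are assumptions about how such an estimate could be made. They are not necessarily the method behind the 3.9 million year figure, and the output is not expected to match it exactly.

```python
import math

# (common name, % identity to human, divergence in MYA), taken from the ortholog table.
ORTHOLOGS = [
    ("Cheetah", 84.11, 105),
    ("Platypus", 67.97, 177),
    ("Chicken", 50.00, 312),
    ("Painted turtle", 39.53, 312),
]


def corrected_divergence(identity_pct: float) -> float:
    """Poisson-corrected divergence in percent: m = -100 * ln(identity fraction)."""
    if not 0.0 < identity_pct <= 100.0:
        raise ValueError("identity must be in (0, 100]")
    return -100.0 * math.log(identity_pct / 100.0)


for name, identity, mya in ORTHOLOGS:
    m = corrected_divergence(identity)
    print(f"{name:15s} m = {m:5.1f}%  ->  {mya / m:.1f} MY per 1% change")
```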

Expression

According to RNA-sequencing and microarray data, LSMEM2 is highly expressed in the human heart and skeletal muscle. [4] [31] It is also highly expressed in the heart during human fetal development. [4]

Regulation of Expression

The promoter region for LSMEM2 is predicted by ElDorado (Genomatix) to be the 2,328 base pairs directly upstream of the LSMEM2 gene. [32] A notable transcription factor predicted to bind this promoter is Brachyury (encoded by the TBXT gene), a mesoderm developmental factor. [33] This transcription factor is involved in regulating the development of the notochord. [34]

Function

LSMEM2 has been predicted to be involved in mitochondrial ATP synthesis coupled proton transport. [35] However, its function remains poorly understood.

Interacting Proteins

LSMEM2 is predicted to interact with MEP1B, DEFA6, CYP3A43, TBC1D29, KLHL23, ZNF551, C5orf24, CWH43, and PDIA2. [36]

Clinical Significance

LSMEM2 was found to be down-regulated in the myotubes of patients with facioscapulohumeral muscular dystrophy (FSHD). [37] LSMEM2 was also predicted to be involved in the pathway for sepsis-induced myopathy, although more research is required to determine its exact role. [38]

References

  1. GRCh38: Ensembl release 89: ENSG00000179564 - Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "LSMEM2 leucine rich single-pass membrane protein 2 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-04-30.
  5. "LSMEM2 orthologs". NCBI. Retrieved 2020-05-01.
  6. "LSMEM2 protein expression summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2021-08-24.
  7. "leucine-rich single-pass membrane protein 2 isoform 1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-04-30.
  8. "leucine-rich single-pass membrane protein 2 isoform 2 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-04-30.
  9. "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2020-04-30.
  10. "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2020-04-30.
  11. "TMpred Server". embnet.vital-it.ch. Retrieved 2020-04-30.
  12. "MOTIF: Searching Protein Sequence Motifs". www.genome.jp. Retrieved 2020-04-30.
  13. "Homo sapiens leucine rich single-pass membrane protein 2 (LSMEM2), transcript variant 2, mRNA". 2019-05-31.
  14. "Protter - interactive protein feature visualization". wlab.ethz.ch. Retrieved 2020-05-03.
  15. "CSS-Palm - Palmitoylation Site Prediction". csspalm.biocuckoo.org. Retrieved 2020-05-03.
  16. "NetAcet 1.0 Server". www.cbs.dtu.dk. Retrieved 2020-05-03.
  17. "YinOYang 1.2 Server". www.cbs.dtu.dk. Retrieved 2020-05-03.
  18. "NetPhos 3.1 Server". www.cbs.dtu.dk. Retrieved 2020-05-03.
  19. "NetNGlyc 1.0 Server". www.cbs.dtu.dk. Retrieved 2020-05-01.
  20. "leucine-rich single-pass membrane protein 2 isoform 2 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-05-03.
  21. "ExPASy: SIB Bioinformatics Resource Portal - Categories". www.expasy.org. Retrieved 2020-05-03.
  22. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2020-04-30.
  23. "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2020-05-03.
  24. "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2020-05-03.
  25. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2020-05-01.
  26. "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2020-05-01.
  27. "Home - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-04-30.
  28. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2020-05-01.
  29. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2020-05-01.
  30. "Home - HomoloGene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-05-01.
  31. "GEO Profile Links for Gene (Select 132228) - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-05-01.
  32. "Genomatix: Annotation & Analysis". www.genomatix.de. Retrieved 2020-05-01.
  33. "Genomatix: MatInspector Input". www.genomatix.de. Retrieved 2020-05-03.
  34. "TBXT gene". Genetics Home Reference. Retrieved 2020-05-03.
  35. "ARCHS4". amp.pharm.mssm.edu. Retrieved 2020-05-03.
  36. "LSMEM2 protein (human) - STRING interaction network". string-db.org. Retrieved 2020-05-03.
  37. Dmitriev P, Bou Saada Y, Dib C, Ansseau E, Barat A, Hamade A, et al. (October 2016). "DUX4-induced constitutive DNA damage and oxidative stress contribute to aberrant differentiation of myoblasts from FSHD patients" (PDF). Free Radical Biology & Medicine. 99: 244–258. doi:10.1016/j.freeradbiomed.2016.08.007. PMID 27519269. S2CID 24609856.
  38. Ning YL, Yang ZQ, Xian SX, Lin JZ, Lin XF, Chen WT (February 2020). "Bioinformatics Analysis Identifies Hub Genes and Molecular Pathways Involved in Sepsis-Induced Myopathy". Medical Science Monitor. 26: e919665. doi:10.12659/MSM.919665. PMC 7009723. PMID 32008037.