LENG9

Last updated

Leukocyte Receptor Cluster Member 9 (LENG 9) is an uncharacterized protein encoded by the LENG9 gene. [1] [2] In humans, LENG9 is predicted to play a role in fertility and reproductive disorders associated with female endometrium structures. [3] [4]

Contents

Gene

Location

Gene neighborhood of LENG9 on chromosome 19. Geneneighborhood.png
Gene neighborhood of LENG9 on chromosome 19.

LENG9 is located at 19q13.42 on chromosome 19, spanning the sense strand (-) from 54,461,796 bp to 54,463,711 bp. [5] The LENG9 gene is 1,930 base pairs in length and contains one exon. [1] [2]

Gene Neighborhood

Genes LENG8-AS1 and CDC42EP5 neighbor LENG9 on chromosome 19. [2] CDC42EP5 extends over the same region of LENG9 while LENG8-AS1 is located to the left of both genes. [5] TTYH1 and LENG8 are also found in the same gene neighborhood but are located on the opposite strand.

Expression

LENG9 expression in various tissues GEO Profile LENG9.png
LENG9 expression in various tissues

LENG9 is highly expressed (75-100%) in skeletal muscles and part of fetal liver tissues while ubiquitous expression of LENG9 is moderate (50-75%) in all other tissues observed. [6] [7] Human expression of LENG9 is observed in the cervix, lung, and placenta of adults. [8] The gene is also expressed in disease states including lung tumors and primitive neuroectodermal tumors, usually found in children or young adults. However, LENG9 is not expressed during the juvenile stage of development. [8]

Promoter

The promoter region is predicted to be 1101 base pairs in length. [9] The transcriptional start site found in this region is located 119 bp upstream of the start codon [1] as well as an in-frame stop codon at 1087 bp to 1089 bp. [5]

mRNA Transcript

Splice Variants

In humans, LENG9 has two mRNA unspliced transcript variants. [5] Variant (1) is the longest and most conserved transcript of the gene and is made up of one exon that is composed of 1,919 bp.

Protein

General Properties

LENG9 is 501 amino acids in length, with a predicted molecular weight of 53.2 kDa. [10] The isoelectric point of LENG9 protein is predicted to be 7.7. [11] No known transmembrane sequences were found for LENG9. [12]

Composition

Predicted tertiary structure of LENG9 protein. LENG9 Tertiary Structure.png
Predicted tertiary structure of LENG9 protein.

Analysis of the LENG9 protein was performed against the "human" database, [10] which indicated a higher frequency of alanine and proline amino acids than of that of a normal human protein. Inversely, an abnormally lower frequency of aspartate, isoleucine, methionine, asparagine, serine, and tyrosine amino acids were detected.

Structure

The secondary structure of LENG9 is predicted to be composed of alpha-helices and beta-sheets throughout the sequence. [14] [13] [15] The tertiary structure of LENG9 is displayed in the image to the right.

Sub-cellular localization

A strong signal peptide detected in the mitochondrion region (0.788) suggests that the LENG9 protein localizes in the mitochondrial matrix. [12] However, further analysis among other mammal orthologs predicted sub-cellular localization in the cytoplasm and nuclear localization for Danio rerio. [12] [16]

Post Translational Modifications

Predicted phosphorylation sites of LENG9. Netphos-3.1b.Sequence.gif
Predicted phosphorylation sites of LENG9.

LENG9 is predicted to undergo post-translational modifications such as phosphorylation, N-terminal acetylation, sumoylation, and C-terminal Glycosylphosphatidylinositol (GPI) anchor modification.

Phosphorylation

LENG9 contains numerous phosphorylation sites distributed in the protein sequence, as show in the diagram to the right. These sites include casein kinase 2 (CK2), cAMP-dependent protein kinase (PKA), protein kinase C (PKC), ataxia telangiectasia mutated kinase (ATM), cyclin-dependent kinase 5 (CDK5), and casein kinase 1 (CK1). [18]

N-terminal Acetylation

There is one predicted N-terminal acetylation site found in the protein LENG9 at the serine amino acid position 3. [19]

Sumoylation

There are two predicted sumoylation sites within LENG9 at position 82 and 452 on lysine residues. [20]

C-Terminal GPI Anchor Modification

A C-terminal GPI modification site was found on the glycine residue at position 486. [21]

Domains and Motifs

There are 3 conserved domains in LENG9. The metal-ion binding zinc finger domain ZnF_C3H1 [22] is found from amino acid 46 to 61. LENG9 also has a domain of unknown function belonging to the domain family DUF504, that spans from amino acid 109 to 160. The last conserved domain spans from amino acid 320 to 500 and is known as AKAP7 2'5' RNA ligase-like domain (AKAP7_NLS). [5] [23]

Conserved WD40 repeats are found in LENG9, spanning from amino acid 98 to 159. [13] [24] This motif is characterized by beta-propeller structures in the tertiary structure.

PTM, domain, and motif schematic of LENG9 gene. LENG9 Motif-Domain.png
PTM, domain, and motif schematic of LENG9 gene.

Protein Interactions

LENG9 is found to interact with the C9ORF41 protein that encodes methyltransferase, involved in converting carnosine to anserine present in skeletal muscle. Another interactor is CDC5L, [26] which is a positive regulator for the cell cycle for phase G2 to M transition. [27] FOXS1 is another interacting protein that functions as a transcriptional repressor to suppress promoters such as FASLG, FOXO3, and FOXO4. [28]

Clinical Significance

Disease Association

The LENG9 gene was found to be up-regulated and expressed in endothelial endometrium (hEECs) tissues during various stages of the menstrual/reproductive cycle when transfected with the RNA gene, miR-30d. [3] [29] As ectopic over-expression of miR-30d in hEECs is observed to affect cancer-associated genes, LENG9 is predicted to play a role in reproductive and endocrine system disorders.

Fertility

Analysis of endometrial receptivity using miRNA of receptive and prereceptive endometrium from fertile women also indicated significant up-regulation of miR-30d. [4] Consequently, the induced expression of LENG9 from miR-30d transfection suggests a possible relationship between the LENG9 gene and female fertility functions.

Homology

Change of evolution rate for LENG9 compared to fibrinogen and cytochrome C. Evolution LENG9.png
Change of evolution rate for LENG9 compared to fibrinogen and cytochrome C.

Evolution

Comparison of the LENG9 protein was conducted against fibrinogen and cytochrome C to observe the rate of evolutionary change. The relatively fast rate of change in LENG9 compared to that of other proteins suggests that the gene is adaptive for vital cell structures and functions.

Paralogs

There are no known paralogs for LENG9. [2]

Orthologs

LENG9 is highly conserved in mammals and bony fish such as the zebrafish. [24] It is also conserved a in few reptiles and amphibians. [31] The gene is not present in invertebrates, birds, bacteria, or fungi [32]

Genus and SpeciesCommon NameOrderDivergence

from Human

Lineage (MYA)

NCBI Accession

Number

Sequence Length

(bp)

Percent Identity

to Human

Percent Similarity to Human
Homo sapiens HumanPrimates0NP_945339.2501100%100%
Pan troglodytes Chimpanzee6.65XP_003316725.147994.4%94.4%
Gorilla gorilla gorilla Gorilla9.06XP_018870227.145889.4%89.8%
Macaca mulatta Rhesus Macaque29.44XP_014980381.147680.7%83.5%
Ictidomys tridecemlineatus Thirteen-Lined SquirrelRodentia90XP_005341645.149062.1%68.2%
Peromyscus maniculatus bairdii Deer Mouse 90XP_006995933.148856.6%63.5%
Ceratotherium simum simum Southern White RhinocerosPerissodactyla96XP_014649760.152858.3%64.1%
Equus caballus Horse96XP_001488865.250658.0%63.5%
Odobenus rosmarus divergens WalrusCarnivora94XP_004415925.152759.3%62.5%
Panthera pardus Leopard96XP_019292295.154055.1%61.0%
Bos taurus CattleArtiodactyla96XP_005192875.153553.6%58.7%
Ovis aries Sheep96XP_014955819.166941.1%47.1%
Alligator mississippiensis American AlligatorCrocodilla312XP_014459626.145239.0%46.5%
Thamnophis sirtalis Common Garter SnakeSquamata312XP_013921249.150532.6%45.5%
Xenopus laevis African Clawed FrogAnura352XP_018080040.143130.9%40.6%
Nanorana parkeri High Himalaya Frog352XP_018430267.140129.4%39.3%
Lates calcarifer BarramundiPerciformes435XP_018538114.156531.3%38.2%
Nothobranchius furzeri Tortoise KillfishCyprinodontiformes435XP_015805834.153730.3%39.4%
Danio Rerio Zebrafish435XP_005157957.154227.9%37.5%
Poecilia formosa Amazon Molly435XP_007550061.156926.7%37.5%

Related Research Articles

C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).

CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.

<span class="mw-page-title-main">Glutamate rich 5</span> Protein-coding gene in the species Homo sapiens

Glutamate rich protein 5 is a protein in humans encoded by the ERICH5 gene, also known as chromosome 8 open reading frame 47 (C8orf47).

Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.

<span class="mw-page-title-main">CRACD-like protein</span>

CRACD-like protein. previously known as KIAA1211L is a protein that in humans is encoded by the CRACDL gene. It is highly expressed in the cerebral cortex of the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, CRACDL is associated with certain mental disorders and various cancers.

<span class="mw-page-title-main">TMCO4</span> Protein-coding gene in the species Homo sapiens

Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">C15orf39</span>

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

<span class="mw-page-title-main">Chromosome 9 open reading frame 43</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.

<span class="mw-page-title-main">C1orf122</span> Protein-coding gene in the species Homo sapiens

C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.

<span class="mw-page-title-main">C7orf50</span> Mammalian protein found in Homo sapiens

C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.

<span class="mw-page-title-main">Fam89A</span> Human protein and gene

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

<span class="mw-page-title-main">TMEM221</span> Protein

Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">KIAA2013</span> Protein-coding gene in the species Homo sapiens

KIAA2013, also known as Q8IYS2 or MGC33867, is a single-pass transmembrane protein encoded by the KIAA2013 gene in humans. The complete function of KIAA2013 has not yet been fully elucidated.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

References

  1. 1 2 3 4 "LENG9 leukocyte receptor cluster member 9 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-06.
  2. 1 2 3 4 Database, GeneCards Human Gene. "LENG9 Gene - GeneCards | LENG9 Protein | LENG9 Antibody". www.genecards.org. Retrieved 2017-04-30.
  3. 1 2 Moreno-Moya, Juan Manuel; Vilella, Felipe; Martínez, Sebastián; Pellicer, Antonio; Simón, Carlos (2014-06-01). "The transcriptomic and proteomic effects of ectopic overexpression of miR-30d in human endometrial epithelial cells". MHR: Basic Science of Reproductive Medicine. 20 (6): 550–566. doi: 10.1093/molehr/gau010 . ISSN   1360-9947. PMID   24489115.
  4. 1 2 Altmäe, Signe; Martinez-Conejero, Jose A.; Esteban, Francisco J.; Ruiz-Alonso, Maria; Stavreus-Evers, Anneli; Horcajadas, Jose A.; Salumets, Andres (2012-08-17). "MicroRNAs miR-30b, miR-30d, and miR-494 Regulate Human Endometrial Receptivity". Reproductive Sciences. 20 (3): 308–317. doi:10.1177/1933719112453507. PMC   4077381 . PMID   22902743.
  5. 1 2 3 4 5 "LENG9 leukocyte receptor cluster member 9 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-02-19.
  6. 1 2 "49016057 - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-06.
  7. "Leukocyte receptor cluster (LRC) member 9 (LENG9)". www.ncbi.nlm.nih.gov. Retrieved 2017-05-06.
  8. 1 2 Group, Schuler. "EST Profile - Hs.590976". www.ncbi.nlm.nih.gov. Retrieved 2017-05-06.
  9. "Genomatix: Annotation & Analysis". www.genomatix.de. Retrieved 2017-04-30.
  10. 1 2 "SAPS". Biology Workbench. Retrieved May 6, 2017.[ permanent dead link ]
  11. "PI" . Retrieved May 6, 2017.[ permanent dead link ]
  12. 1 2 3 "TargetP 1.1 Server". www.cbs.dtu.dk. Retrieved 2017-04-30.
  13. 1 2 3 "I-TASSER results". zhanglab.ccmb.med.umich.edu. Retrieved 2017-05-06.[ permanent dead link ]
  14. "Phyre 2 Results for Undefined". www.sbg.bio.ic.ac.uk. Retrieved 2017-04-30.[ permanent dead link ]
  15. "CHOFAS". Biology Workbench. Retrieved May 6, 2017.[ permanent dead link ]
  16. "PSORT II Prediction". psort.hgc.jp. Retrieved 2017-05-06.
  17. "NetPhos 3.1 Server". www.cbs.dtu.dk. Retrieved 2017-05-07.
  18. "NetPhos 3.1 Server". www.cbs.dtu.dk. Retrieved 2017-05-06.
  19. "NetAcet 1.0 Server". www.cbs.dtu.dk. Retrieved 2017-05-06.
  20. "SUMOplot™ Analysis Program | Abgent".
  21. "GPI Prediction Server". mendel.imp.ac.at. Retrieved 2017-05-06.
  22. "SMART: ZnF_C3H1 domain annotation". smart.embl.de. Retrieved 2017-05-06.
  23. "leukocyte receptor cluster member 9 isoform 1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-06.
  24. 1 2 "CLUSTALW". Biology Workbench. Retrieved May 6, 2017.[ permanent dead link ]
  25. Castro, Edouard de. "PROSITE". prosite.expasy.org. Retrieved 2017-05-06.
  26. Huttlin, Edward L.; Ting, Lily; Bruckner, Raphael J.; Gebreab, Fana; Gygi, Melanie P.; Szpyt, John; Tam, Stanley; Zarraga, Gabriela; Colby, Greg (2015-07-16). "The BioPlex Network: A Systematic Exploration of the Human Interactome". Cell. 162 (2): 425–440. doi:10.1016/j.cell.2015.06.043. ISSN   1097-4172. PMC   4617211 . PMID   26186194.
  27. Llères, David; Denegri, Marco; Biggiogera, Marco; Ajuh, Paul; Lamond, Angus I. (2010-06-01). "Direct interaction between hnRNP-M and CDC5L/PLRG1 proteins affects alternative splice site choice". EMBO Reports. 11 (6): 445–451. doi:10.1038/embor.2010.64. ISSN   1469-3178. PMC   2892320 . PMID   20467437.
  28. Li, Xu; Wang, Wenqi; Wang, Jiadong; Malovannaya, Anna; Xi, Yuanxin; Li, Wei; Guerra, Rudy; Hawke, David H.; Qin, Jun (2015-01-21). "Proteomic analyses reveal distinct chromatin-associated and soluble transcription factor complexes". Molecular Systems Biology. 11 (1): 775. doi:10.15252/msb.20145504. ISSN   1744-4292. PMC   4332150 . PMID   25609649.
  29. Database, GeneCards Human Gene. "MIR30D Gene - GeneCards | MIR30D RNA Gene". www.genecards.org. Retrieved 2017-05-07.
  30. "Protein BLAST: search protein databases using a protein query". blast.ncbi.nlm.nih.gov. Retrieved 2017-05-07.
  31. "Human BLAT Search". genome.ucsc.edu. Retrieved 2017-05-07.
  32. "Protein BLAST: search protein databases using a protein query". blast.ncbi.nlm.nih.gov. Retrieved 2017-05-06.