VXN

Last updated
VXN
Identifiers
Aliases VXN , chromosome 8 open reading frame 46, C8orf46, vexin
External IDs MGI: 1924232 HomoloGene: 17666 GeneCards: VXN
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_152765

NM_178399

RefSeq (protein)

NP_689978

NP_848486

Location (UCSC) Chr 8: 66.49 – 66.52 Mb Chr 1: 9.67 – 9.7 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Vexin is a protein encoded by VXN gene. [5] VXN is found to be highly expressed in regions of the brain and spinal cord.

Contents

Gene

Location

VXN is found along the plus strand of chromosome 8. [6] The entire gene is 58,522 bp long. [6] VXN is flanked by alcohol dehydrogenase iron containing 1 and Myb proto-oncogene like 1. [5]

Gene neighborhood of VXN along the forward strand of chromosome 8. Gene Neighborhood C8orf46.png
Gene neighborhood of VXN along the forward strand of chromosome 8.

Homology

Paralogs

No human paralogs for VXN have been identified [5]

Orthologs

Phylogentic tree of select orthologs of the gene VXN. Estimated date of divergence is shown. Phylogenetic Tree C8orf46.png
Phylogentic tree of select orthologs of the gene VXN. Estimated date of divergence is shown.

Vexin is found in all classes of vertebrates, including mammals, birds, fish, reptiles and amphibians. [5] The most distant ortholog of VXN is in Callorhinchus milli, which diverged from the human version of the gene an estimated 482.9 million years ago. [7] The gene has not been found in any plants, fungi or single celled organisms. [5]

Homologous domains

The N-terminus and C-terminus are highly conserved regions across both distant and close orthologs. The orthologs of vexin all show conservation of the SH3 protein domain family as well as a domain of unknown function (DUF4648).

mRNA

Splice variants

VXN does not have any alternative mRNA splice variants. The mature mRNA is approximately 3,741 base pairs in length and contains six exons. [6]

Protein

General properties

Location of the domain of unknown function and nuclear localization signal along vexin. DUF4648 C8orf46.png
Location of the domain of unknown function and nuclear localization signal along vexin.

Vexin is 207 amino acids long, which equates to a molecular weight of 22.6 kdal. [6] The isoelectric point of the protein is 10.42 which indicates the pH of the protein is basic. [8] Vexin does contain a domain of unknown function (DUF4648) and is a part of the SH3 domain family, which is known to bind to proline-rich ligands. [5] The secondary and tertiary structure of this protein is not well known.

Composition

Vexin is considered rich in arginine, and poor in phenylalanine compared to the composition of the average human protein. [8] Vexin does contain several regions of positively charged runs and has a high concentration of basic amino acids. [8]

Post-translational modifications

Vexin is predicted to undergo several types of post translational modifications. With a high degree of certainty, it is predicted that vexin undergoes lysine glycation, O-glycosylation, serine, threonine and tyrosine phosphorylation, SUMOylation and initial methionine acetylation. [9]

Type of ModificationAmino Acid PositionImpact on Protein [10]
Glycation of Epsilon Amino Groups of Lysine Lys33, Lys41, Lys124, Lys152. Lys153, Lys193Impairs enzymatic function of protein.
Initial Methionine Acetylation Met1Mediates protein stability, sorting and localization.
O-glycosylation sitesSer25, Ser90, Ser97, Ser102, Ser113, Ser122, Ser126, Ser128 Ser130, Ser148, Ser194, Thr78, Thr101, Thr125, Thr134, Thr155Regulates transcription and translation factors.
Phosphorylation sitesSer22, Ser25, Ser26, Ser34, Ser35, Ser97, Ser122, Ser126, Ser130, Ser194, Thr78, Thr83, Thr138, Tyr50, Tyr158, Tyr196Regulates protein function, cell signaling and enzymatic functions of protein
SUMOylation sitesLys141, Lys195Plays a role in nuclear-cytosolic transport, acts as binding site.

Subcellular location

Conceptual translation of VXN depicts predicted post-translational modification sites. C8orf46 Conceptual Translation.png
Conceptual translation of VXN depicts predicted post-translational modification sites.

Vexin is predicted to be a nuclear protein, given the classical nuclear localization signal found at amino acids Lys191 to Lys193. [9] Vexin does not contain any transmembrane domains or signal peptides suggesting that it is an intracellular protein. [9]

Expression

Image from Allen Brain Atlas shows the areas of elevated expression of VXN in the brain. Allen Brain2.png
Image from Allen Brain Atlas shows the areas of elevated expression of VXN in the brain.

VXN has shown to be ubiquitously expressed in the body. The gene is expressed in 13 different types of tissue throughout the body, with the brain, spinal cord and nerves showing elevated expression of the gene. [12] Specifically, the isocortex and hippocampal formation areas of the brain show high levels of expression. In addition to healthy tissue, vexin is also found in several disease states. These disease states include chondrosarcoma, glioma, kidney tumors, liver tumors, and germ cell tumors. [12] VXN is only expressed in infants and adults. [12]

Clinical significance

VXN has been associated with breast cancer in humans. The gene has been researched in connection with estrogen receptor 1- enhancer (ESR1), whose expression determines if a breast cancer patient receives endocrine therapy. [13] It is predicted that VXN has ESR1 enhancer regions that become hypermethylated and promote acquired endocrine resistance in breast cancer. [13]

Related Research Articles

<span class="mw-page-title-main">TFAP2C</span> Protein-coding gene in the species Homo sapiens

Transcription factor AP-2 gamma also known as AP2-gamma is a protein that in humans is encoded by the TFAP2C gene. AP2-gamma is a member of the activating protein 2 family of transcription factors.

<span class="mw-page-title-main">Morn repeat containing 1</span> Protein-coding gene in the species Homo sapiens

MORN1 containing repeat 1, also known as Morn1, is a protein that in humans is encoded by the MORN1 gene.

<span class="mw-page-title-main">SH3D21</span> Protein-coding gene in the species Homo sapiens

SH3D21 is a nuclear protein that is encoded by the SH3D21 gene. In humans, this gene is located on chromosome 1 p34.3. The human mRNA transcript is 2527 base pairs and the final protein product is 756 amino acids. While the exact function of this protein remains unknown, due to the presence of three SH3 domains, it has been implicated in protein-protein interactions.

<span class="mw-page-title-main">Proser2</span> Protein-coding gene in the species Homo sapiens

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

<span class="mw-page-title-main">C11orf52</span> Protein-coding gene in the species Homo sapiens

C11orf52 is an uncharacterized protein that in homo sapiens is encoded by the C11orf52 gene.

<span class="mw-page-title-main">C12orf60</span> Protein-coding gene in humans

Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.

<span class="mw-page-title-main">Transmembrane protein 255A</span> Mammalian protein found in Homo sapiens

Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">SHLD1</span> Protein-coding gene in the species Homo sapiens

SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.

<span class="mw-page-title-main">TMEM171</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.

<span class="mw-page-title-main">SKIDA1</span> Protein-coding gene in the species Homo sapiens

Ski/Dach domain-containing protein 1 is a protein that in humans is encoded by the SKIDA1 gene. It is also known as C10orf140 and DLN-1. It has orthologs in vertebrates. It has two domains: the Ski/Sno/Dac domain and a domain of unknown function, DUF4854. It is associated with multiple types of cancer, like leukemia, ovarian cancer, and colon cancer. It's predicted to be a nuclear protein. It may interact with PRC2.

<span class="mw-page-title-main">C22orf23</span> Protein-coding gene in the species Homo sapiens

C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.

<span class="mw-page-title-main">TMEM81</span> Protein-coding gene in the species Homo sapiens

Transmembrane Protein 81 or TMEM81 is a protein that in humans is encoded by the TMEM81 gene. TMEM81 is a poorly-characterized transmembrane protein which contains an extracellular immunoglobulin domain.

<span class="mw-page-title-main">TMEM101</span>

Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.

<span class="mw-page-title-main">ZNF821</span> Zinc Finger 821

Zinc Finger Protein 821, also known as ZNF821, is a protein encoded by the ZNF821 gene. This gene is located on the 16th chromosome and is expressed highly in the testes, moderately expressed in the brain and low expression in 23 other tissues. The protein encoded is 412 amino acids long with 2 Zinc Finger motifs and a 23 amino acid long STPR domain.

<span class="mw-page-title-main">TBC1D30</span> Protein-coding gene in the species Homo sapiens

TBC1D30 is a gene in the human genome that encodes the protein of the same name. This protein has two domains, one of which is involved in the processing of the Rab protein. Much of the function of this gene is not yet known, but it is expressed mostly in the brain and adrenal cortex.

<span class="mw-page-title-main">Transmembrane protein 89</span> Human gene

Transmembrane protein 89 (TMEM89) is a protein that in humans is encoded by the TMEM89 gene.

<span class="mw-page-title-main">TMEM248</span> Transmembrane protein 248/TMEM248 gene

Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.

<span class="mw-page-title-main">MROH9</span> Mammalian gene

Maestro heat-like repeat-containing protein family member 9 (MROH9) is a protein which in humans is encoded by the MROH9 gene. The word ‘maestro’ itself is an acronym, standing for male-specific transcription in the developing reproductive organs (MRO). MRO genes belong to the MROH family, which includes MROH9.

<span class="mw-page-title-main">CCDC184</span> Protein found in humans

Coiled-coil domain-containing 184 (CCDC184) is a protein which, in humans, is encoded by the CCDC184 gene

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000169085 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000067879 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 4 5 6 "VXN vexin [Homo sapiens (human) ]". National Center for Biotechnology Information. Retrieved 2021-01-06.
  6. 1 2 3 4 "VXN Gene - Vexin". GeneCards. Retrieved 2016-04-25.
  7. "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2016-05-09.
  8. 1 2 3 "SDSC Biology Workbench".[ permanent dead link ]
  9. 1 2 3 "ExPASy: SIB Bioinformatics Resource Portal - Home". expasy.org. Retrieved 2016-04-25.
  10. "Overview of Post-Translational Modification". Thermo Fisher Scientific. Retrieved 2016-05-09.
  11. "ISH Data :: Allen Brain Atlas: Developing Mouse Brain". developingmouse.brain-map.org. Retrieved 2016-05-09.
  12. 1 2 3 "EST Profile - Hs.268869". National Center for Biotechnology Information. Retrieved 2016-05-09.
  13. 1 2 Stone A, Zotenko E, Locke WJ, Korbie D, Millar EK, Pidsley R, et al. (July 2015). "DNA methylation of oestrogen-regulated enhancers defines endocrine sensitivity in breast cancer". Nature Communications. 6: 7758. doi:10.1038/ncomms8758. PMC   4510968 . PMID   26169690.

Further reading