VXN | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | VXN , chromosome 8 open reading frame 46, C8orf46, vexin | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1924232 HomoloGene: 17666 GeneCards: VXN | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Vexin is a protein encoded by VXN gene. [5] VXN is found to be highly expressed in regions of the brain and spinal cord.
VXN is found along the plus strand of chromosome 8. [6] The entire gene is 58,522 bp long. [6] VXN is flanked by alcohol dehydrogenase iron containing 1 and Myb proto-oncogene like 1. [5]
No human paralogs for VXN have been identified [5]
Vexin is found in all classes of vertebrates, including mammals, birds, fish, reptiles and amphibians. [5] The most distant ortholog of VXN is in Callorhinchus milli, which diverged from the human version of the gene an estimated 482.9 million years ago. [7] The gene has not been found in any plants, fungi or single celled organisms. [5]
The N-terminus and C-terminus are highly conserved regions across both distant and close orthologs. The orthologs of vexin all show conservation of the SH3 protein domain family as well as a domain of unknown function (DUF4648).
VXN does not have any alternative mRNA splice variants. The mature mRNA is approximately 3,741 base pairs in length and contains six exons. [6]
Vexin is 207 amino acids long, which equates to a molecular weight of 22.6 kdal. [6] The isoelectric point of the protein is 10.42 which indicates the pH of the protein is basic. [8] Vexin does contain a domain of unknown function (DUF4648) and is a part of the SH3 domain family, which is known to bind to proline-rich ligands. [5] The secondary and tertiary structure of this protein is not well known.
Vexin is considered rich in arginine, and poor in phenylalanine compared to the composition of the average human protein. [8] Vexin does contain several regions of positively charged runs and has a high concentration of basic amino acids. [8]
Vexin is predicted to undergo several types of post translational modifications. With a high degree of certainty, it is predicted that vexin undergoes lysine glycation, O-glycosylation, serine, threonine and tyrosine phosphorylation, SUMOylation and initial methionine acetylation. [9]
Type of Modification | Amino Acid Position | Impact on Protein [10] |
---|---|---|
Glycation of Epsilon Amino Groups of Lysine | Lys33, Lys41, Lys124, Lys152. Lys153, Lys193 | Impairs enzymatic function of protein. |
Initial Methionine Acetylation | Met1 | Mediates protein stability, sorting and localization. |
O-glycosylation sites | Ser25, Ser90, Ser97, Ser102, Ser113, Ser122, Ser126, Ser128 Ser130, Ser148, Ser194, Thr78, Thr101, Thr125, Thr134, Thr155 | Regulates transcription and translation factors. |
Phosphorylation sites | Ser22, Ser25, Ser26, Ser34, Ser35, Ser97, Ser122, Ser126, Ser130, Ser194, Thr78, Thr83, Thr138, Tyr50, Tyr158, Tyr196 | Regulates protein function, cell signaling and enzymatic functions of protein |
SUMOylation sites | Lys141, Lys195 | Plays a role in nuclear-cytosolic transport, acts as binding site. |
Vexin is predicted to be a nuclear protein, given the classical nuclear localization signal found at amino acids Lys191 to Lys193. [9] Vexin does not contain any transmembrane domains or signal peptides suggesting that it is an intracellular protein. [9]
VXN has shown to be ubiquitously expressed in the body. The gene is expressed in 13 different types of tissue throughout the body, with the brain, spinal cord and nerves showing elevated expression of the gene. [12] Specifically, the isocortex and hippocampal formation areas of the brain show high levels of expression. In addition to healthy tissue, vexin is also found in several disease states. These disease states include chondrosarcoma, glioma, kidney tumors, liver tumors, and germ cell tumors. [12] VXN is only expressed in infants and adults. [12]
VXN has been associated with breast cancer in humans. The gene has been researched in connection with estrogen receptor 1- enhancer (ESR1), whose expression determines if a breast cancer patient receives endocrine therapy. [13] It is predicted that VXN has ESR1 enhancer regions that become hypermethylated and promote acquired endocrine resistance in breast cancer. [13]
Transcription factor AP-2 gamma also known as AP2-gamma is a protein that in humans is encoded by the TFAP2C gene. AP2-gamma is a member of the activating protein 2 family of transcription factors.
MORN1 containing repeat 1, also known as Morn1, is a protein that in humans is encoded by the MORN1 gene.
SH3D21 is a nuclear protein that is encoded by the SH3D21 gene. In humans, this gene is located on chromosome 1 p34.3. The human mRNA transcript is 2527 base pairs and the final protein product is 756 amino acids. While the exact function of this protein remains unknown, due to the presence of three SH3 domains, it has been implicated in protein-protein interactions.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
C11orf52 is an uncharacterized protein that in homo sapiens is encoded by the C11orf52 gene.
Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.
Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.
Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.
Ski/Dach domain-containing protein 1 is a protein that in humans is encoded by the SKIDA1 gene. It is also known as C10orf140 and DLN-1. It has orthologs in vertebrates. It has two domains: the Ski/Sno/Dac domain and a domain of unknown function, DUF4854. It is associated with multiple types of cancer, like leukemia, ovarian cancer, and colon cancer. It's predicted to be a nuclear protein. It may interact with PRC2.
C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.
Transmembrane Protein 81 or TMEM81 is a protein that in humans is encoded by the TMEM81 gene. TMEM81 is a poorly-characterized transmembrane protein which contains an extracellular immunoglobulin domain.
Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.
Zinc Finger Protein 821, also known as ZNF821, is a protein encoded by the ZNF821 gene. This gene is located on the 16th chromosome and is expressed highly in the testes, moderately expressed in the brain and low expression in 23 other tissues. The protein encoded is 412 amino acids long with 2 Zinc Finger motifs and a 23 amino acid long STPR domain.
TBC1D30 is a gene in the human genome that encodes the protein of the same name. This protein has two domains, one of which is involved in the processing of the Rab protein. Much of the function of this gene is not yet known, but it is expressed mostly in the brain and adrenal cortex.
Transmembrane protein 89 (TMEM89) is a protein that in humans is encoded by the TMEM89 gene.
Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.
Maestro heat-like repeat-containing protein family member 9 (MROH9) is a protein which in humans is encoded by the MROH9 gene. The word ‘maestro’ itself is an acronym, standing for male-specific transcription in the developing reproductive organs (MRO). MRO genes belong to the MROH family, which includes MROH9.
Coiled-coil domain-containing 184 (CCDC184) is a protein which, in humans, is encoded by the CCDC184 gene