FAM199X

Location of FAM199X on the X chromosome.

Family with sequence similarity 199, X-linked (FAM199X) is a protein that in humans is encoded by the FAM199X gene. [1] The gene has orthologs in most vertebrates, including most mammals, birds, amphibians, and fish, with some more distant homologs in invertebrates. [2] In humans, the gene is commonly expressed in the brain and thyroid. [2] It has been linked to some genetic disorders, such as Pelizaeus–Merzbacher disease, and to some cancers, such as stomach cancer, but its role in those diseases is not yet well understood. [3]


Gene

FAM199X is located on the plus strand of the long arm of the X chromosome at Xq22.2. It spans approximately 30,000 bases and contains six exons. [1] [3] The gene lies next to an enhancer, LOC130068517, also known as ATAC-STARR-seq lymphoblastoid active region 29826. [4]

Mouse expression of FAM199X within tissues at E14.5

Expression

Expression is ubiquitous and high across many tissues, at fairly consistent levels. The gene has its highest expression in the cerebellum of the brain, followed by tissues related to hormone secretion, the thyroid, prostate, and kidney. [1] These results were checked against distant orthologs of FAM199X, which had similar expression profiles. Expression was especially high in the cerebellum, thalamus, and epididymis, and below expected levels in adipose, bladder, heart, liver, fetal lung, skin, ileum, and stomach tissue. [5] Within the cell, FAM199X is expressed in the nucleus and endoplasmic reticulum. Within the promoter sequence, there were six eQTLs, half of which were related to the thyroid or respiratory system. [6]

FAM199X Expression Profile





Protein Localization

FAM199X is localized within the nucleus and cytoplasm. [7]

mRNA

Four transcript variants of FAM199X produce two protein isoforms. The four transcript variants are FAM199X-X1 variant 1 with 7498 nucleotides, FAM199X-X1 variant 2 with 7495 nucleotides, FAM199X-X2 variant 3 with 7179 nucleotides, and FAM199X-X2 variant 4 with 7171 nucleotides. [8] There are six exons in FAM199X-X1 variants and five exons in FAM199X-X2 variants. [1]

FAM199X has two isoforms, each with six exons and each produced by two transcript variants. Isoform X1 is 345 amino acids long, while isoform X2 is 205 amino acids long. [9]

The 3' untranslated region (UTR) of FAM199X is unusually long, spanning 6124 nucleotides. [1]

Isoform Table
| Transcript variant | Accession # (mRNA) | Length (nt) | Exons | Protein isoform | Accession # (protein) | Length (aa) | Isoelectric point (pI) |
|---|---|---|---|---|---|---|---|
| Variant 1 | XM_005262079.4 | 7495 | 6 | Isoform X1 | XP_005262136.1 | 345 | 4.84 |
| Variant 2 | XM_054326467.1 | 7498 | 6 | Isoform X1 | XP_054182442.1 | 345 | 4.84 |
| Variant 3 | XM_047441826.1 | 7179 | 6 | Isoform X2 | XP_047297782.1 | 205 | 9.07 |
| Variant 4 | XM_054326468.1 | 7171 | 6 | Isoform X2 | XP_054182443.1 | 205 | 9.07 |
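The lengths in the isoform table give a rough sense of how much of each transcript is untranslated. The following is a minimal sketch, assuming the coding sequence is three nucleotides per residue plus one stop codon; the function names are hypothetical, and the figures are the table values, not new measurements.

```python
# Transcript lengths (nt) and protein lengths (aa) copied from the isoform table.
VARIANTS = {
    "Variant 1": {"mrna_nt": 7495, "protein_aa": 345},
    "Variant 2": {"mrna_nt": 7498, "protein_aa": 345},
    "Variant 3": {"mrna_nt": 7179, "protein_aa": 205},
    "Variant 4": {"mrna_nt": 7171, "protein_aa": 205},
}


def cds_length_nt(protein_aa: int) -> int:
    """Coding sequence length: one codon per residue plus one stop codon (assumption)."""
    return 3 * (protein_aa + 1)


def untranslated_fraction(mrna_nt: int, protein_aa: int) -> float:
    """Fraction of the transcript outside the coding sequence (5' and 3' UTRs combined)."""
    return (mrna_nt - cds_length_nt(protein_aa)) / mrna_nt


for name, v in VARIANTS.items():
    cds = cds_length_nt(v["protein_aa"])
    frac = untranslated_fraction(v["mrna_nt"], v["protein_aa"])
    print(f"{name}: CDS {cds} nt, {frac:.1%} of the transcript is untranslated")
```

Under these assumptions, well over four fifths of every variant is untranslated, which is consistent with the long 3' UTR described above.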

Evolutionary History

Homologs

FAM199X has several highly conserved orthologs among mammals, birds, reptiles, amphibians, and fish, and less conserved orthologs in invertebrates, including cephalochordates and arachnids. The most distant ortholog detected is in the common house spider, Parasteatoda tepidariorum.

Paralogs

FAM199X has no paralogs.

Evolution

FAM199X appears to have arisen at least 708 million years ago, the estimated date at which the lineage of the most distant known ortholog, in Parasteatoda tepidariorum, diverged from the human lineage. FAM199X has evolved slowly, with a rate of protein divergence close to that of cytochrome c, a highly conserved protein.

Comparing FAM199X to a fast-evolving protein, Fibrinogen alpha, and a slow-evolving protein, Cytochrome C.
Orthologs
| Genus and species | Common name | Taxonomy | Date of divergence (MYA) | Accession # | Sequence length (aa) | Identity (%) | Similarity (%) |
|---|---|---|---|---|---|---|---|
| Homo sapiens | Human | Primates: Great Apes | 0 | NM_207318.4 | 388 | 100 | 100 |
| Macaca mulatta | Indochinese rhesus macaque | Primates: Old World Monkey | 28.8 | NP_001180862.1 | 388 | 100 | 100 |
| Plecturocebus cupreus | Coppery titi monkey | Primates: New World Monkey | 43 | KAL0588686.1 | 423 | 99 | 100 |
| Mus musculus | House mouse | Mammals: Rodent | 87 | NP_666373.1 | 388 | 98 | 97 |
| Pteropus vampyrus | Large flying fox | Mammals: Chiroptera/Megabat | 94 | XP_011379288.1 | 388 | 99 | 99 |
| Ornithorhynchus anatinus | Platypus | Mammals: Monotremes | 180 | XP_028923647.1 | 390 | 93 | 95 |
| Chelydra serpentina | Common snapping turtle | Reptiles: Testudines/Turtles | 319 | KAG6937128.1 | 381 | 90 | 92 |
| Eublepharis macularius | Leopard gecko | Reptiles: Squamata | 319 | XP_054853091.1 | 384 | 87 | 89 |
| Gallus gallus | Red junglefowl | Birds: Aves/Galliformes | 319 | XP_003641135.2 | 381 | 90 | 92 |
| Ranitomeya variabilis | Zimmerman's poison frog | Amphibian: Anura | 352 | XP_077141317.1 | 375 | 89 | 92 |
| Erpetoichthys calabaricus | Reedfish | Fish: Ray-finned | 429 | XP_028671530.1 | 377 | 85 | 89 |
| Collichthys lucidus | Spinyhead croaker | Fish: Ray-finned | 429 | TKS72713.1 | 396 | 78 | 83 |
| Leucoraja erinaceus | Little skate | Fish: Cartilaginous | 462 | XP_055499985.1 | 378 | 81 | 86 |
| Pristis pectinata | Smalltooth sawfish | Fish: Cartilaginous | 462 | XP_051876775.1 | 378 | 81 | 86 |
| Lethenteron reissneri | Asiatic brook lamprey | Jawless Vertebrate: Petromyzontida | 563 | XP_061419915.1 | 441 | 59 | 69 |
| Branchiostoma lanceolatum | Common lancelet | Invertebrate: Cephalochordata | 581 | CAH1254916.1 | 356 | 33 | 51 |
| Nematostella vectensis | Starlet sea anemone | Invertebrate: Cnidaria | 685 | XP_001632934.1 | 344 | 31 | 44 |
| Ixodes scapularis | Deer tick | Invertebrate: Arachnida | 686 | | 353 | 31 | 49 |
| Magallana gigas | Pacific oyster | Invertebrate: Mollusk | 708 | XP_011447698.3 | 317 | 30 | 47 |
| Parasteatoda tepidariorum | Common house spider | Invertebrate: Arachnida | 708 | XP_042905439.1 | 273 | 28 | 44 |
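The identity values in the orthologs table can be turned into a divergence estimate of the kind plotted against cytochrome c and fibrinogen alpha. The sketch below assumes a simple Poisson-style correction for multiple substitutions, m = 100 · ln(1/q), where q is the fraction of identical residues. The article does not state which formula was actually used, so the function name and the correction itself are assumptions made for illustration.

```python
import math

# (species, divergence date in MYA, identity % to the human protein),
# a subset of rows copied from the orthologs table.
ORTHOLOGS = [
    ("Mus musculus", 87, 98),
    ("Ornithorhynchus anatinus", 180, 93),
    ("Gallus gallus", 319, 90),
    ("Ranitomeya variabilis", 352, 89),
    ("Erpetoichthys calabaricus", 429, 85),
    ("Leucoraja erinaceus", 462, 81),
    ("Lethenteron reissneri", 563, 59),
    ("Branchiostoma lanceolatum", 581, 33),
    ("Parasteatoda tepidariorum", 708, 28),
]


def corrected_divergence(identity_percent: float) -> float:
    """Estimated substitutions per 100 residues, corrected for repeated hits.

    Assumed formula (not from the source): m = 100 * ln(1 / q),
    where q is the fraction of identical residues.
    """
    if not 0 < identity_percent <= 100:
        raise ValueError("identity must be greater than 0 and at most 100")
    q = identity_percent / 100.0
    return 100.0 * math.log(1.0 / q)


for species, mya, identity in ORTHOLOGS:
    m = corrected_divergence(identity)
    print(f"{species:28s} {mya:>4} MYA  identity {identity:>3}%  m = {m:6.1f}")
```

Plotting m against the divergence date for FAM199X and for reference proteins would give the kind of comparison described in the Evolution section; the slope of each line is a rough proxy for how fast the protein changes.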

Protein

The protein contains 388 amino acids. FAM199X has a molecular weight of about 43 kDa and an isoelectric point of 4.95. [10] [11] There are two protein isoforms of FAM199X, FAM199X-X1 and FAM199X-X2. [12] FAM199X-X1 is 345 amino acids long with a molecular weight of 38.61 kDa, and FAM199X-X2 is 205 amino acids long with a molecular weight of 22.8 kDa. [12] [13] FAM199X contains a protein motif associated with cytomegalovirus protein US29. [14] Found within FAM199X are a cleavage site for N-arginine dibasic convertase and recognition sites for MAPK and BRCA1. N-arginine dibasic convertase is an enzyme located in the brain that converts prohormones to hormones, but it has not been extensively studied. [15] Both MAPK and BRCA1 have been implicated in cancer; BRCA1 acts as a tumor suppressor, and mutations in it can increase the risk of some cancers. [16] [17] FAM199X is also rich in serine, with a serine content two standard deviations above that of other human proteins. [18]
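The reported molecular weights can be sanity-checked against sequence length. The sketch below assumes the common rule of thumb of roughly 110 Da per amino acid residue, which is an approximation and not the method used by the cited tools; the function name is hypothetical.

```python
# Rule-of-thumb average mass of one residue in a protein chain (assumption, not from the source).
AVERAGE_RESIDUE_MASS_DA = 110.0

# (label, length in amino acids, molecular weight in kDa as reported in the text)
REPORTED = [
    ("FAM199X (388 aa)", 388, 43.0),
    ("Isoform X1", 345, 38.61),
    ("Isoform X2", 205, 22.8),
]


def approx_mass_kda(length_aa: int) -> float:
    """Approximate molecular weight in kDa from chain length alone."""
    return length_aa * AVERAGE_RESIDUE_MASS_DA / 1000.0


for label, length_aa, reported_kda in REPORTED:
    estimate = approx_mass_kda(length_aa)
    print(f"{label}: estimated {estimate:.1f} kDa, reported {reported_kda} kDa")
```

The estimates land within about 1 kDa of the reported values, so the lengths and weights quoted above are mutually consistent. An exact figure would require the actual sequence and a residue-by-residue calculation.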

Post-translational modifications

FAM199X has several regions of interest, including a disordered region, a NET domain, sites of N-linked glycosylation, N-myristoylation, and C-mannosylation, and two proven phosphorylation sites. NET stands for N-terminal extra-terminal domain. This domain is thought to be related to bromodomain proteins and to be used for protein binding. [19] N-linked glycosylation is the attachment of an oligosaccharide, generally to membrane-associated or secreted proteins, which is consistent with FAM199X being secreted. [20] N-myristoylation is the attachment of a fatty acid to the protein, which could provide a site for association with the plasma membrane. [21] C-mannosylation has many roles, including intercellular transport and structural stability. [22]

FAM199X protein tertiary structure with labeled regions and secondary structure. The coloration depicts charge, with blue signifying basic amino acids, red signifying acidic amino acids, purple signifying partially positive amino acids, and grey signifying neutral amino acids.

Tertiary structure

The tertiary structure of FAM199X shows a globular region with alpha helices and beta strands within the first 300 amino acids, whereas the last 88 amino acids form a large arm with a possible protein-binding domain, which is formed by an alpha helix within the most conserved region of the FAM199X protein.

Protein Interaction

FAM199X associates with three proteins of note: WRD5, P, and M. [23] Proteins P and M are viral proteins, while WRD5 is a high-scoring interaction partner related to the kidney and brain. [24] Proteins P and M are related to influenza and SARS-CoV-2. [25] There is also evidence that FAM199X associates with Nipah virus.

Variants

No variants have been found to be pathogenic. The majority of variants are uncategorized and occur at very low frequency. [26]

Clinical Significance

It has been suggested that FAM199X could be involved in various diseases and viral infections, including influenza, SARS-CoV-2, Nipah virus, and cancer. FAM199X was speculated to play a role in Pelizaeus–Merzbacher disease, but no supporting results were found. It has also been suggested that FAM199X could be secreted both through the conventional secretory pathway associated with the endocrine system and through unconventional secretion.

References

  1. "FAM199X family with sequence similarity 199, X-linked [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-09-20.
  2. "FAM199X orthologs". NCBI. Retrieved 2025-09-20.
  3. GeneCards Human Gene Database. "FAM199X Gene - GeneCards | F199X Protein | F199X Antibody". www.genecards.org. Archived from the original on 2022-11-20. Retrieved 2025-09-20.
  4. "LOC130068517 ATAC-STARR-seq lymphoblastoid active region 29826 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-09-20.
  5. "GDS3834 / 7695". www.ncbi.nlm.nih.gov. Retrieved 2025-12-04.
  6. "GTEx Portal". gtexportal.org. Retrieved 2025-12-04.
  7. "69122276003B26AC87C8AF2D expired". services.healthtech.dtu.dk. Retrieved 2025-12-04.
  8. "Genes for Homo sapiens (human)". NCBI. Retrieved 2025-09-20.
  9. "Genes for Homo sapiens (human)". NCBI. Retrieved 2025-10-17.
  10. EMBL-EBI (European Bioinformatics Institute). "Job Dispatcher homepage | EMBL-EBI". www.ebi.ac.uk. Retrieved 2025-12-01.
  11. Sigrist, Christian J A; Cuche, Béatrice A; de Castro, Edouard; Coudert, Elisabeth; Redaschi, Nicole; Bridge, Alan (2025-11-20). "The PROSITE database for protein families, domains, and sites". Nucleic Acids Research: gkaf1188. doi:10.1093/nar/gkaf1188. ISSN 0305-1048. PMID 41263099.
  12. "protein FAM199X [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-09-20.
  13. "Protein Molecular Weight". www.bioinformatics.org. Retrieved 2025-09-20.
  14. "genome.jp". www.genome.jp. Retrieved 2025-12-04.
  15. Pierotti, A R; Prat, A; Chesneau, V; Gaudoux, F; Leseney, A M; Foulon, T; Cohen, P (1994-06-21). "N-arginine dibasic convertase, a metalloendopeptidase as a prototype of a class of processing enzymes". Proceedings of the National Academy of Sciences. 91 (13): 6078–6082. Bibcode:1994PNAS...91.6078P. doi:10.1073/pnas.91.13.6078. ISSN 0027-8424.
  16. "Mitogen-Activated Protein Kinase - an overview | ScienceDirect Topics". www.sciencedirect.com. Retrieved 2025-12-04.
  17. Rubio, Marisa (2025-02-16). "What Is BRCA1? About the BRCA1 Mutation and More | BCRF". Breast Cancer Research Foundation. Retrieved 2025-12-04.
  18. EMBL-EBI (European Bioinformatics Institute). "Job Dispatcher homepage | EMBL-EBI". www.ebi.ac.uk. Retrieved 2025-12-04.
  19. Lin, Yi-Jan; Umehara, Takashi; Inoue, Makoto; Saito, Kohei; Kigawa, Takanori; Jang, Moon-Kyoo; Ozato, Keiko; Yokoyama, Shigeyuki; Padmanabhan, Balasundaram; Güntert, Peter (2008). "Solution structure of the extraterminal domain of the bromodomain-containing protein BRD4". Protein Science. 17 (12): 2174–2179. doi:10.1110/ps.037580.108. ISSN 1469-896X. PMC 2590908. PMID 18815416.
  20. "UniProt". UniProt. Retrieved 2025-12-04.
  21. Wang, Bin; Dai, Tong; Sun, Wenhuan; Wei, Yujun; Ren, Jiang; Zhang, Long; Zhang, Mengdi; Zhou, Fangfang (April 2021). "Protein N-myristoylation: functions and mechanisms in control of innate immunity". Cellular & Molecular Immunology. 18 (4): 878–888. doi:10.1038/s41423-021-00663-2. ISSN 2042-0226. PMC 7966921. PMID 33731917.
  22. Minakata, Shiho; Manabe, Shino; Inai, Yoko; Ikezaki, Midori; Nishitsuji, Kazuchika; Ito, Yukishige; Ihara, Yoshito (2021-08-30). "Protein C-Mannosylation and C-Mannosyl Tryptophan in Chemical Biology and Medicine". Molecules. 26 (17): 5258. doi:10.3390/molecules26175258. ISSN 1420-3049. PMC 8433626. PMID 34500691.
  23. "STRING: functional protein association networks". string-db.org. Retrieved 2025-12-04.
  24. PubChem. "PDPK1 - 3-phosphoinositide dependent protein kinase 1 (human)". pubchem.ncbi.nlm.nih.gov. Retrieved 2025-12-04.
  25. Zhang, Zhikuan; Nomura, Norimichi; Muramoto, Yukiko; Ekimoto, Toru; Uemura, Tomoko; Liu, Kehong; Yui, Moeko; Kono, Nozomu; Aoki, Junken; Ikeguchi, Mitsunori; Noda, Takeshi; Iwata, So; Ohto, Umeharu; Shimizu, Toshiyuki (2022-08-05). "Structure of SARS-CoV-2 membrane protein essential for virus assembly". Nature Communications. 13 (1): 4399. Bibcode:2022NatCo..13.4399Z. doi:10.1038/s41467-022-32019-3. ISSN 2041-1723.
  26. "VCV004253412.1 - ClinVar - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-12-04.