FAM166B

Last updated

Family with Sequence Similarity 166, member B, or FAM166B, is an uncharacterized protein in humans that is encoded by the FAM166B gene.

Contents

Gene

The FAM166B gene is located on the short arm of chromosome 9 at 9p13.3 on the minus strand. [1] The genomic sequence spans 2,069 base pairs from 35563899 to 35561830. Gene neighbors are RUSC2, RPS29P17, and TESK1.

Location and Neighborhood of FAM166B on Chromosome 9 FAM166B Gene Neighbordhood.png
Location and Neighborhood of FAM166B on Chromosome 9

Expression

FAM166B is expressed 0.5 times higher than average in humans. [2] FAM166B is highly expressed in the adrenal gland, fallopian tube, and respiratory epithelial tissues. It is weakly to moderately expressed in skeletal muscle and heart muscle. [3] [4] [5]

Promoter

FAM166B is predicted to have a promoter that spans 680 bp and includes the 5' UTR. [6]

mRNA

In humans, FAM166B has 10 transcript variants, which are all spliced. [2] FAM166B transcript variant 1 is 1,092 bp in length and contains 6 total exons. The accession number for this variant is NM_001164310. [7]

Protein

The amino acid sequence is 275 amino acids in length and contains 3 DUF 2475 regions. [8] The three DUF2475 regions are located from amino acids 15 to 80, 174 to 234, and 234 to 261.The predicted molecular weight is 30.6 kdal with the predicted isoelectric point of 8.414. [9] It is known to have a higher than normal proline composition compared to other human proteins at 12.4%. The protein has a negative charged region from residues 141 to 172. [10]

1    MAVASTFIPGLNPQNPHYIPGYTGHCPLLRFSVGQTYGQVTGQLLRGPPGLAWPPVHRTLLPPIRPPRSP 71   EVPRESLPVRRGQERLSSSMIPGYTGFVPRAQFIFAKNCSQVWAEALSDFTHLHEKQGSEELPKEAKGRK 141  DTEKDQVPEPEGQLEEPTLEVVEQASPYSMDDRDPRKFFMSGFTGYVPCARFLFGSSFPVLTNQALQEFG 211  QKHSPGSAQDPKHLPPLPRTYPQNLGLLPNYGGYVPGYKFQFGHTFGHLTHDALGLSTFQKQLLA 

Post-Translational Modifications

FAM166B is predicted to have 12 phosphorylation, 3 sumoylation, and 1 acetylation sites. [11] [12] [13] FAM166 has no predicted signal peptide sequences. [14]

Structure

FAM166B is predicted to be composed mostly of coils with short interspersed regions of alpha helices and beta sheets. [15] There are no predicted transmembrane domains and this is consistent through orthologs. [16]

Subcellular Localization

However, the intracellular location of FAM166B is unknown. [17] [18] The average hydrophobicity of the protein is -0.519272, which suggests that it is a soluble protein. [16]

Homology

Orthologs

FAM166B has a number of orthologs in mammals, birds, reptiles, fish, and some invertebrates. The table below lists a number of FAM166B orthologs that were found using BLAST. [19] The table descending exhibits the diversity of species with FAM166B orthologs in descending order of identity.

Scientific NameCommon NameProtein Accession NumberSequence Length (aa)IdentitySimilarity
Homo SapiensHumanNP_001157782.1275
Camelus FerusBactrian CamelXP_006187942.127983%87%
Bos TaurusDomestic CowXP_005210130.130381%87%
Felis catusDomestic CatXP_003995654.128081%86%
Tursiops truncatusCommon Bottlenose DolphinXP_004312901.127480%86%
Elephantulus edwardiiCape Elephant ShrewXP_006887001.127580%86%
Mus MusculusMouseXP_006538075.127375%82%
Dasypus novemcinctusNine-banded armadilloXP_004457403.128675%82%
Equus caballusHorseXP_001914783.230075%80%
Monodelphis domesticaGray short-tailed opossumXP_007498869.129257%68%
Chelonia mydasGreen TurtleXP_007055984.125147%62%
Tinamus guttatusWhite-throated TinamuXP_010220019.130745%55%
Python bivittatusBurmese PythonXP_007427141.134042%56%
Xenopus tropicalisWestern Clawed FrogNP_001106452.130642%55%
Anolis carolinesisCarolina AnoleXP_003228316.131440%53%
Danio rerioZebrafishNP_001076489.229939%52%
Poecilia formosaAmazon MollyXP_007566989.126234%50%
Ciona intestinalisVase TunicateXP_002129379.133030%42%
Picoides pubescensDowny WoodpeckerXP_009895146.129626%41%
Hydra vulgarisFreshwater PolypXP_002162128.128226%41%
Saccoglossus kowalevskiiAcorn WormXP_002739701.133224%41%
Stronglyocentrotus purpuratusPurple Sea UrchinXP_786484.333324%40%

Paralogs

FAM166B has one paralog, FAM166A, which spans 317 aa and has a 25% identity. [20] The accession number for FAM166A is NP_001001710.

Clinical Significance

Diseases

Currently, FAM166 is not associated within a human disease or condition. Despite being located on Spastic paraplegia 46, a locus on chromosome 9, that is known to cause an autosomal-recessive disease called hereditary spastic paraplegia (HSP), FAM166B was determined not to be the gene responsible for the disease due to its frequency in the population controls. [21] FAM166B was excluded from a patent looking for genes that are prognosis predictors for classic Hodgkin's lymphoma (cHL). [22]

Related Research Articles

<span class="mw-page-title-main">KIAA1109</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein KIAA1109 is a protein that in humans is encoded by the KIAA1109 gene.

<span class="mw-page-title-main">Tetratricopeptide repeat 39A</span> Protein-coding gene in the species Homo sapiens

Tetratricopeptide repeat 39A is a human protein encoded by the TTC39A gene. TTC39A is also known as DEME-6, KIAA0452, and c1orf34. The function of TTC39A is currently not well understood. The main feature within tetratricopeptide repeat 39A is the domain of unknown function 3808 (DUF3808), spanning almost the entire protein. KIAA0452 can also be seen as an isoform of TTC39A because of differences in genome sequence, but overlap in DUF domain.

<span class="mw-page-title-main">QSER1</span> Protein-coding gene in the species Homo sapiens

Glutamine Serine Rich Protein 1 or QSER1 is a protein encoded by the QSER1 gene.

<span class="mw-page-title-main">FAM214A</span> Protein-coding gene in the species Homo sapiens

Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.

<span class="mw-page-title-main">Fam78b</span> Protein-coding gene in the species Homo sapiens

Family with Sequence Similarity 78-Member B (FAM78B) is a protein of unknown function in humans that is encoded by the FAM78B gene (1q24.1). It has orthologous genes and predicted proteins in vertebrates and several invertebrates, but not in arthropods. It has a nuclear localization signal in the protein sequence and a miRNA target region in the mRNA sequence.

<span class="mw-page-title-main">KIAA1841</span> Protein-coding gene in the species Homo sapiens

KIAA1841 is a gene in humans that encodes a protein known as KIAA1841. KIAA1841 is targeted for the nucleus and it predicted to play a role in regulating transcription.

NHL Repeat Containing Protein 2, or NHLRC2, is a protein encoded by the NHLRC2 gene.

<span class="mw-page-title-main">Proser2</span> Protein-coding gene in the species Homo sapiens

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

<span class="mw-page-title-main">Transmembrane protein 268</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 268 is a protein that in humans is encoded by TMEM268 gene. The protein is a transmembrane protein of 342 amino acids long with eight alternative splice variants. The protein has been identified in organisms from the common fruit fly to primates. To date, there has been no protein expression found in organisms simpler than insects.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.

<span class="mw-page-title-main">C10orf67</span> Protein-coding gene in the species Homo sapiens

Chromosome 10 open reading frame 67 (C10orf67), also known as C10orf115, LINC01552, and BA215C7.4, is an un-characterized human protein-coding gene. Several studies indicate a possible link between genetic polymorphisms of this and several other genes to chronic inflammatory barrier diseases such as Crohn's Disease and sarcoidosis.

<span class="mw-page-title-main">C21orf62</span> Protein-coding gene in the species Homo sapiens

C21orf62 is a protein that, in humans, is encoded by the C21orf62 gene. C21orf62 is found on human chromosome 21, and it is thought to be expressed in tissues of the brain and reproductive organs. Additionally, C21orf62 is highly expressed in ovarian surface epithelial cells during normal regulation, but is not expressed in cancerous ovarian surface epithelial cells.

BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.

Transmembrane Protein 217 is a protein encoded by the gene TMEM217. TMEM217 has been found to have expression correlated with the lymphatic system and endothelial tissues and has been predicted to have a function linked to the cytoskeleton.

UPF0575 protein C19orf67 is a protein which in humans is encoded by the C19orf67 gene. Orthologs of C19orf67 are found in many mammals, some reptiles, and most jawed fish. The protein is expressed at low levels throughout the body with the exception of the testis and breast tissue. Where it is expressed, the protein is predicted to be localized in the nucleus to carry out a function. The highly conserved and slowly evolving DUFF3314 region is predicted to form numerous alpha helices and may be vital to the function of the protein.

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C8orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

<span class="mw-page-title-main">C7orf26</span> Human protein-encoding gene on chromosome 7

c7orf26 is a gene in humans that encodes a protein known as c7orf26. Based on properties of c7orf26 and its conservation over a long period of time, its suggested function is targeted for the cytoplasm and it is predicted to play a role in regulating transcription.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

References

  1. "NCBI Gene".
  2. 1 2 "AceView, NCBI".
  3. "NCBI UniGene: FAM166B".
  4. "Human Protein Atlas: FAM166B".
  5. Gaudet, P; Argout-Put, G; Cusin, I; et al. (December 3, 2013). "neXtProt: Organizing Protein Knowledge in the Context of Human Proteome Projects". Journal of Proteome Research. 12 (1): 293–298. doi:10.1021/pr300830v. PMID   23205526.
  6. "Genomatix ElDorado".
  7. "NCBI Nucleotide". 30 June 2018.
  8. "NCBI Protein: FAM166B" . Retrieved 28 Feb 2015.
  9. "Biology Workbench 3.2".[ permanent dead link ]
  10. Brendel, V; Nourbakhsh, I.R.; Blaisdell, B.E. (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proc. Natl. Acad. Sci. U.S.A. 89 (6): 2002–2006. Bibcode:1992PNAS...89.2002B. doi: 10.1073/pnas.89.6.2002 . PMC   48584 . PMID   1549558.
  11. "Net Phos 2.0".
  12. "NetAcet".
  13. "SUMOplot".
  14. "SignalP".
  15. "Biology Workbench 3.2: PELE".[ permanent dead link ]
  16. 1 2 "SOSUI".
  17. "The Human Protein Atlas".
  18. "COMPARTMENTS: FAM166B".
  19. "NCBI BLAST".
  20. "NCBI Protein: FAM166A".
  21. Martin, E; Schule, R; Smets, K; et al. (2013). "Loss of Function of Glucocerebrosidase GBA2 is responsible for Motor Neuron Defects in Hereditary Spastic Paraplegia". American Journal of Human Genetics. 92 (2): 238–244. doi:10.1016/j.ajhg.2012.11.021. PMC   3567271 . PMID   23332916.
  22. Gascoyne, R; Steidl, C; Scott, D. "Predicting prognosis in Classic Hodgkin Lymphoma".{{cite journal}}: Cite journal requires |journal= (help)