FAM71F2

Last updated
GARIN1A
Identifiers
Aliases GARIN1A , FAM137B, family with sequence similarity 71 member F2, GARI, FAM71F2, golgi associated RAB2 interactor 1A
External IDs MGI: 2141439 HomoloGene: 52974 GeneCards: GARIN1A
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001012454
NM_001128926
NM_001290254
NM_001290255

Contents

NM_001101486

RefSeq (protein)

NP_001012457
NP_001122398
NP_001277183
NP_001277184

NP_001094956

Location (UCSC) Chr 7: 128.67 – 128.69 Mb Chr 6: 29.28 – 29.29 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

FAM71F2 or Family with Sequence Similarity 71 member F2 is a protein that in humans is encoded by the Family with Sequence Similarity 71 member F2 gene. This gene is highly active in the reproductive tissues, specifically the testis, and may serve as a potential biomarker for determining metastatic testicular cancer.

Gene

Location

Figure 1. Location of the gene FAM71F2, marked with a blue rectangle, on Chromosome 7. Neighboring genes in the area are labelled as well. Chromosome Diagram.png
Figure 1. Location of the gene FAM71F2, marked with a blue rectangle, on Chromosome 7. Neighboring genes in the area are labelled as well.

FAM71F2 gene is located on chromosome 7 in humans (7q32.1), [5] [6] starting at 128,671,636 and ending at 128,702,262 on the positive strand. [6] The gene paralog FAM71F1 and the gene LINC01000 directly neighbor FAM71F2 on chromosome 7.

Size of gene

The gene spans 30,627 base pairs [6] and codes for 12 exons. [5]

Common aliases

FAM71F2 is also referred to as family with sequence similarity 137 member B, FAM137B. [6]

mRNA

FAM71F2 has 14 transcript variants. [5] Isoform a is the longest of the mRNA transcripts and spans 5,775 base pairs that translates into a 309 amino acids sequence. [5] It codes for 5 exons. [5] Other alternative splice isoforms are labelled in the diagram in Figure 2.

Figure 2. Diagram of the 14 annotated transcript variants for FAM71F2 with the protein isoform names on the left. Annotated transcripts.png
Figure 2. Diagram of the 14 annotated transcript variants for FAM71F2 with the protein isoform names on the left.

At the first splice site, Isoform b, found in most reptiles having FAM71F2 protein, deletes the following nine amino acids and picks up at the valine amino acid at location 61 in humans. Isoform c uses an alternative downstream start site to Isoform a and adds another exon between the first and second exons of Isoform a.

General properties

The molecular weight of FAM71F2 is 34.5 kilodaltons. [7] The isoelectric point is 6.15. [8]

Domains and motifs

FAM71F2 protein contains only one domain, named domain of unknown function, DUF3699. [5] [6] This domain is located from amino acids 114-185 on the human FAM71F2 protein. [5] This domain family is found only in eukaryotes and approximately 71 amino acids in length. [5] There is also a potential R-2 mitochondrial pre-sequence cleavage site [9] that would signal the protein to the mitochondria. These sites are labelled in Figure 4 below.

Secondary structure

The secondary structure of FAM71F2 contains alpha helices and beta sheets. [10] [11] These structures are identified in the generated image of the FAM71F2 protein in Figure 3. Highly conserved amino acid residues, such as the Val61-Thr62-Lys63 sequence where the reptiles and isoform b pick up in the second exon, are labelled on this figure as well.

Figure 3. Diagram of the FAM71F2 protein with secondary structures and highly conserved amino acid residues. Alpha helices are labelled in orange, beta sheets in cyan, the N-terminus is magenta, and the C-terminus is green. FAM71F2 Protein.png
Figure 3. Diagram of the FAM71F2 protein with secondary structures and highly conserved amino acid residues. Alpha helices are labelled in orange, beta sheets in cyan, the N-terminus is magenta, and the C-terminus is green.

Post-translational modifications

FAM71F2 has seven highly conserved phosphorylated sites. There is one acetylation site [13] and one N-glycosylation site, [14] playing a role in stabilizing the protein.

Figure 4. Schematic diagram of FAM71F2 protein. The DUF3699 domain is labelled as the large, grey hexagonal shape. A potential R-2 mitochondrial targeting sequence shown at 5' end is labelled with a red rectangle. Conserved phosphorylation sites are labelled with the red stop sign images and a 5' acetylation site on the second amino acid is labelled with a grey stop sign image. A N-terminus glycosylation site on Met98 is shown as a thin, green rectangle and two cysteine bonds are labelled with the green brackets. Schematic Diagram.png
Figure 4. Schematic diagram of FAM71F2 protein. The DUF3699 domain is labelled as the large, grey hexagonal shape. A potential R-2 mitochondrial targeting sequence shown at 5' end is labelled with a red rectangle. Conserved phosphorylation sites are labelled with the red stop sign images and a 5' acetylation site on the second amino acid is labelled with a grey stop sign image. A N-terminus glycosylation site on Met98 is shown as a thin, green rectangle and two cysteine bonds are labelled with the green brackets.

Sub-cellular localization

FAM71F2 protein stays in the cytoplasm of cells, [9] but may have localization in the nucleus and mitochondria. [9]

Expression

Tissue expression pattern

FAM71F2 is highly expressed in reproductive structures, such as the testis, epididymis, uterus and ovaries. [5] [16] There is some expression in the brain and connective tissue as well. [17] As development stages progress, the number of gene transcripts increases and are at highest expression levels in adults. [17] In the mouse, during spermatogenesis and development of the testis, gene transcript levels of FAM71F2 increase dramatically. [18]

Cellular expression

FAM71F2 protein expression has been detected in the cytoplasm of Leydig cells and in epididymis cells of the male testis and is also detected in the cytoplasm in ovarian follicles. [16]

Figure 4. Immunohistological staining of FAM71F2 in the human testis. Immunohistological Stain.png
Figure 4. Immunohistological staining of FAM71F2 in the human testis.

Expression level

FAM71F2 is moderately expressed in comparison to other proteins in the human, with an average protein expression level of 8.47 part per million. [19]

Clinical significance

FAM71F2 is repressed in males with non-obstructive azoospermia [20] and teratozoospermia, [21] or abnormalities in sperm morphology and quantity. These diseases lead to fertility issues. In addition, FAM71F2 gene expression is up-regulated with Dopamine receptor D1 expression in testicular cancer patients, and may be an important biomarker for metastatic forms of this cancer. [22] [23] [20]

Homology

FAM71F2 has 91 orthologs in other animal species. [5] Its evolutionary history goes as far back as the reptiles, and its most distant relative is the homolog in the west Indian Ocean coelacanth. [24] [25] [26] The time of divergence between eight orthologs from the human FAM71F2 is shown in Figure 5. It is not found in birds or in Gallus gallus (chicken). [25] [5] FAM71F2 has six paralogs in humans: FAM71A, FAM71B, FAM71C, FAM71D, FAM71E1, and FAM71F1. [5]

Figure 5. Date of Divergence of FAM71F2 from the human ortholog. Date of Divergence for FAM71F2.png
Figure 5. Date of Divergence of FAM71F2 from the human ortholog.

Related Research Articles

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">FAM71E1</span> Mammalian protein found in Homo sapiens

FAM71E1, also known as Family With Sequence Similarity 71 Member E1, is a protein that in humans is encoded by the FAM71E1 gene. It is thought to be ubiquitously expressed at low levels throughout the body, and it is conserved in vertebrates, particularly mammals and some reptiles. The protein is localized to the nucleus and can be exported to the cytoplasm.

<span class="mw-page-title-main">TMEM44</span> Protein-coding gene in the species Homo sapiens

TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.

<span class="mw-page-title-main">TEX9</span> Protein-coding gene in the species Homo sapiens

Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities.

<span class="mw-page-title-main">C16orf86</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

<span class="mw-page-title-main">CDV3 (gene)</span> Human protein-coding gene

Protein CDV3 homolog also known as carnitine deficiency-associated gene expressed in ventricle 3 is a protein that in humans is encoded by the CDV3 gene.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

Lysine-rich nucleolar protein 1 (KNOP1) is a protein which in human's is encoded by the KNOP1 gene. Aliases for KNOP1 include TSG118, C16orf88, and FAM191A.

<span class="mw-page-title-main">C5orf22</span> Protein-coding gene in the species Homo sapiens

Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).

<span class="mw-page-title-main">TEKTIP1</span> Gene

TEKTIP1, also known as tektin-bundle interacting protein 1, is a protein that in humans is encoded by the TEKTIP1 gene.

<span class="mw-page-title-main">Chromosome 12 open reading frame 71</span> Protein encoded in humans by c12orf71 gene

Chromosome 12 open reading frame 71 (c12orf71) is a protein which in humans is encoded by c12orf71 gene. The protein is also known by the alias LOC728858.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">C10orf53</span> Human gene

C10orf53 is a protein that in humans is encoded by the C10orf53 gene. The gene is located on the positive strand of the DNA and is 30,611 nucleotides in length. The protein is 157 amino acids and the gene has 3 exons. C10orf53 orthologs are found in mammals, birds, reptiles, amphibians, fish, and invertebrates. It is primarily expressed in the testes and at very low levels in the cerebellum, liver, placenta, and trachea.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

<span class="mw-page-title-main">SCRN3</span> Protein-coding gene in the species Homo sapiens

Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.

<span class="mw-page-title-main">LRRC74A</span> Protein-coding gene

Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000205085 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000079652 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 4 5 6 7 8 9 10 11 12 NCBI (National Center for Biotechnology Information) entry on FAM71F2 [https://www.ncbi.nlm.nih.gov/gene/346653]
  6. 1 2 3 4 5 Database, GeneCards Human Gene. "FAM71F2 Gene - GeneCards | F71F2 Protein | F71F2 Antibody". www.genecards.org. Retrieved 2017-05-07.
  7. Kramer, Jack (1990). "AASTATS". Biology WorkBench 3.2.[ permanent dead link ]
  8. Toldo, Dr. Luca. "PI: Isoelectric point determination".[ permanent dead link ]
  9. 1 2 3 "PSORT II Prediction". psort.hgc.jp. Retrieved 2017-05-07.
  10. "Phyre 2 Results for FAM71F2__". www.sbg.bio.ic.ac.uk. Retrieved 2017-05-07.[ permanent dead link ]
  11. Pappas, Georgios Jr. "PELE Protein Structure Prediction". Biology Workbench.[ permanent dead link ]
  12. "UCSF Chimera Home Page". www.cgl.ucsf.edu. Retrieved 2017-05-08.
  13. Kiemer, Lars (2004). "NetAcet 1.0 Server". www.cbs.dtu.dk. Retrieved 2017-05-08.
  14. Gupta, R. (2004). "Prediction of N-glycosylation sites in human proteins". NetNGlyc.
  15. Castro, Edouard de. "PROSITE". prosite.expasy.org. Retrieved 2017-05-08.
  16. 1 2 3 "Tissue expression of FAM71F2 - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2017-05-07.
  17. 1 2 Group, Schuler. "EST Profile - Hs.445236". www.ncbi.nlm.nih.gov. Retrieved 2017-05-08.
  18. "4778451 - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-08.
  19. "FAM71F2 protein abundance in PaxDb". pax-db.org. Retrieved 2017-05-08.
  20. 1 2 Kurpisz MK (2014). "Novel gene biomarkers of spermatogenesis-potential for spermatogenesis assessment and treatment monitoring". Fertility and Sterility. 102 (3): e349. doi: 10.1016/j.fertnstert.2014.07.1179 .
  21. "38158770 - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-08.
  22. Shanmugalingam T, Soultati A, Chowdhury S, Rudman S, Van Hemelrijck M (October 2013). "Global incidence and outcome of testicular cancer". Clinical Epidemiology. 5: 417–27. doi: 10.2147/CLEP.S34430 . PMC   3804606 . PMID   24204171.
  23. Ruf CG, Linbecker M, Port M, Riecke A, Schmelz HU, Wagner W, Meineke V, Abend M (July 2012). "Predicting metastasized seminoma using gene expression". BJU International. 110 (2 Pt 2): E14–20. doi:10.1111/j.1464-410X.2011.10778.x. PMID   22243760. S2CID   205546085.
  24. San Diego Supercomputer Center. "Biology Workbench". Archived from the original on 2003-08-11.
  25. 1 2 Kent, Jim. "BLAT Search Genome". UCSC Genome Browswer.
  26. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2017-05-08.