GARIN1A | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | GARIN1A , FAM137B, family with sequence similarity 71 member F2, GARI, FAM71F2, golgi associated RAB2 interactor 1A | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 2141439 HomoloGene: 52974 GeneCards: GARIN1A | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
FAM71F2 or Family with Sequence Similarity 71 member F2 is a protein that in humans is encoded by the Family with Sequence Similarity 71 member F2 gene. This gene is highly active in the reproductive tissues, specifically the testis, and may serve as a potential biomarker for determining metastatic testicular cancer.
FAM71F2 gene is located on chromosome 7 in humans (7q32.1), [5] [6] starting at 128,671,636 and ending at 128,702,262 on the positive strand. [6] The gene paralog FAM71F1 and the gene LINC01000 directly neighbor FAM71F2 on chromosome 7.
The gene spans 30,627 base pairs [6] and codes for 12 exons. [5]
FAM71F2 is also referred to as family with sequence similarity 137 member B, FAM137B. [6]
FAM71F2 has 14 transcript variants. [5] Isoform a is the longest of the mRNA transcripts and spans 5,775 base pairs that translates into a 309 amino acids sequence. [5] It codes for 5 exons. [5] Other alternative splice isoforms are labelled in the diagram in Figure 2.
At the first splice site, Isoform b, found in most reptiles having FAM71F2 protein, deletes the following nine amino acids and picks up at the valine amino acid at location 61 in humans. Isoform c uses an alternative downstream start site to Isoform a and adds another exon between the first and second exons of Isoform a.
The molecular weight of FAM71F2 is 34.5 kilodaltons. [7] The isoelectric point is 6.15. [8]
FAM71F2 protein contains only one domain, named domain of unknown function, DUF3699. [5] [6] This domain is located from amino acids 114-185 on the human FAM71F2 protein. [5] This domain family is found only in eukaryotes and approximately 71 amino acids in length. [5] There is also a potential R-2 mitochondrial pre-sequence cleavage site [9] that would signal the protein to the mitochondria. These sites are labelled in Figure 4 below.
The secondary structure of FAM71F2 contains alpha helices and beta sheets. [10] [11] These structures are identified in the generated image of the FAM71F2 protein in Figure 3. Highly conserved amino acid residues, such as the Val61-Thr62-Lys63 sequence where the reptiles and isoform b pick up in the second exon, are labelled on this figure as well.
FAM71F2 has seven highly conserved phosphorylated sites. There is one acetylation site [13] and one N-glycosylation site, [14] playing a role in stabilizing the protein.
FAM71F2 protein stays in the cytoplasm of cells, [9] but may have localization in the nucleus and mitochondria. [9]
FAM71F2 is highly expressed in reproductive structures, such as the testis, epididymis, uterus and ovaries. [5] [16] There is some expression in the brain and connective tissue as well. [17] As development stages progress, the number of gene transcripts increases and are at highest expression levels in adults. [17] In the mouse, during spermatogenesis and development of the testis, gene transcript levels of FAM71F2 increase dramatically. [18]
FAM71F2 protein expression has been detected in the cytoplasm of Leydig cells and in epididymis cells of the male testis and is also detected in the cytoplasm in ovarian follicles. [16]
FAM71F2 is moderately expressed in comparison to other proteins in the human, with an average protein expression level of 8.47 part per million. [19]
FAM71F2 is repressed in males with non-obstructive azoospermia [20] and teratozoospermia, [21] or abnormalities in sperm morphology and quantity. These diseases lead to fertility issues. In addition, FAM71F2 gene expression is up-regulated with Dopamine receptor D1 expression in testicular cancer patients, and may be an important biomarker for metastatic forms of this cancer. [22] [23] [20]
FAM71F2 has 91 orthologs in other animal species. [5] Its evolutionary history goes as far back as the reptiles, and its most distant relative is the homolog in the west Indian Ocean coelacanth. [24] [25] [26] The time of divergence between eight orthologs from the human FAM71F2 is shown in Figure 5. It is not found in birds or in Gallus gallus (chicken). [25] [5] FAM71F2 has six paralogs in humans: FAM71A, FAM71B, FAM71C, FAM71D, FAM71E1, and FAM71F1. [5]
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
FAM71E1, also known as Family With Sequence Similarity 71 Member E1, is a protein that in humans is encoded by the FAM71E1 gene. It is thought to be ubiquitously expressed at low levels throughout the body, and it is conserved in vertebrates, particularly mammals and some reptiles. The protein is localized to the nucleus and can be exported to the cytoplasm.
TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.
Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities.
Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.
Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
Protein CDV3 homolog also known as carnitine deficiency-associated gene expressed in ventricle 3 is a protein that in humans is encoded by the CDV3 gene.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
Lysine-rich nucleolar protein 1 (KNOP1) is a protein which in human's is encoded by the KNOP1 gene. Aliases for KNOP1 include TSG118, C16orf88, and FAM191A.
Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).
TEKTIP1, also known as tektin-bundle interacting protein 1, is a protein that in humans is encoded by the TEKTIP1 gene.
Chromosome 12 open reading frame 71 (c12orf71) is a protein which in humans is encoded by c12orf71 gene. The protein is also known by the alias LOC728858.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
C10orf53 is a protein that in humans is encoded by the C10orf53 gene. The gene is located on the positive strand of the DNA and is 30,611 nucleotides in length. The protein is 157 amino acids and the gene has 3 exons. C10orf53 orthologs are found in mammals, birds, reptiles, amphibians, fish, and invertebrates. It is primarily expressed in the testes and at very low levels in the cerebellum, liver, placenta, and trachea.
Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.
Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.
Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.