FAM71E2

FAM71E2, also known as Family With Sequence Similarity 71 Member E2, is a protein that, in humans, is encoded by the FAM71E2 gene. [1] Aliases include C19orf16, Protein FAM71E2, Chromosome 19 open reading frame 16, and Putative Protein FAM71E2. The gene is primarily conserved in mammals, but it is also conserved in two reptile species. [2]

Gene

Location

FAM71E2 is located on the minus strand of chromosome 19 at 19q13.42 and extends from 55,354,908 bp to 55,363,252 bp. The gene is 8,353 bp long and has 11 exons. [3] [1]

FAM71E2 gene

Gene Neighborhood

These genes are closest to FAM71E2 on the human genome: [3]

Transcript

mRNA variants

Two alternatively spliced mRNA variants are produced during transcription: aAUG10 and bAUG10. Both have validated alternative polyadenylation sites. [8] However, no isoforms of FAM71E2 are known.

Stem loops

5' UTR folding FAM71E2
3' UTR folding FAM71E2

Conserved stem loop regions were found in both the 5' and 3' UTRs of closely related orthologs. [9] [10] No conserved stem loops were found in distantly related orthologs.

Protein

Properties

Visualization of domains and motifs on the FAM71E2 gene
Secondary structure of FAM71E2

FAM71E2 is 922 amino acids long, with an isoelectric point (pI) of about 10 and a molecular weight (Mw) of about 100,000 Da. The protein has four different domains: DUF3699, PRK14951, PHA03247, and BASP1. [2] The structure consists of 8 alpha helices and 1 beta sheet. [11]
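
As a rough sanity check, a protein's mass can be approximated from its length using the common rule of thumb of about 110 Da per amino acid residue. The sketch below applies that estimate to the 922-residue length given above; the constant and the function name are illustrative assumptions, not values or methods taken from the cited sources.

```python
# Rule-of-thumb estimate only. The 110 Da average residue mass is an
# assumption for illustration, not a value reported by the cited tools.
AVERAGE_RESIDUE_MASS_DA = 110


def estimate_molecular_weight_da(n_residues: int) -> int:
    """Approximate protein mass in daltons from its length in residues."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA


fam71e2_length = 922  # amino acids, as stated in the article
estimate = estimate_molecular_weight_da(fam71e2_length)
print(f"Estimated mass: {estimate:,} Da (~{estimate / 1000:.0f} kDa)")
```

The estimate lands near 100,000 Da, in line with the molecular weight quoted above. An exact value would require the actual amino acid composition.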

Localization

The protein is predicted to localize to the nucleus. [12] Nuclear localization is conserved in all orthologs.

Gene regulation

Promoter

The promoter of FAM71E2 is located between 55,363,152 bp and 55,364,260 bp on the minus strand and is 1,109 bp long. [13] This promoter was selected because its expression is mainly in the testes and it has high CAGE values.
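
The promoter length follows directly from its coordinates, and the same arithmetic shows how far the promoter reaches into the gene itself. The sketch below assumes the coordinates are 1-based and inclusive; the `Interval` type and `overlap_bp` helper are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    """Genomic interval; coordinates assumed 1-based and inclusive."""
    start: int
    end: int

    def length(self) -> int:
        return self.end - self.start + 1


def overlap_bp(a: Interval, b: Interval) -> int:
    """Number of base pairs shared by two intervals (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Coordinates as given in the article.
promoter = Interval(55_363_152, 55_364_260)
gene = Interval(55_354_908, 55_363_252)

print(promoter.length())           # 1109
print(overlap_bp(promoter, gene))  # 101
```

Because the gene lies on the minus strand, its 5' end is at the higher coordinate, so under these assumptions the promoter covers the upstream region plus the first 101 bp of the gene.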

Transcription factor binding sites

Multiple transcription factor binding sites were found for FAM71E2. Sites such as those for SOX11 and estrogen response elements were selected for their relevance to potential gene function.

Expression

FAM71E2 is primarily expressed in male tissues, particularly the testis. [14] [3] There is also lower expression in the brain, mammary gland, prostate, and thymus. [15] Expression has also been detected in both tumor and normal breast (mammary gland) tissue.

Metaphase II stage oocytes matured in vivo

Metaphase II stage oocytes matured in vivo

The graph is from a study analyzing metaphase II stage oocytes matured in vivo. The goal of the study was to identify genes and deduced pathways from human oocytes that can help explain oogenesis, folliculogenesis, fertilization, and embryonic development. [16] The control consisted of RNA from 10 different normal human tissues: skeletal muscle, kidney, lung, colon, liver, spleen, breast, brain, heart, and stomach. The results indicate that expression of FAM71E2 in oocytes is very low compared with that in normal adult tissue from various parts of the body. The Human Protein Atlas supports these observations, since it shows no expression during the earliest phase of development (embryoid body). However, the Human Protein Atlas also shows very minimal expression in the fetus.

Estrogen receptor alpha-silenced MCF7 breast cancer cells

Estrogen receptor alpha-silenced MCF7 breast cancer cells

This study indicates that there is a very slight decrease in FAM71E2 expression in estrogen receptor knockdown samples. [17] This study may also support the Human Protein Atlas information stating that FAM71E2 has slight expression in breast (mammary gland) tumors.

Neural transcription factor SOX11 depletion effect on mantle cell lymphoma cell line

Neural transcription factor SOX11 depletion effect on mantle cell lymphoma cell line

This study examined mantle cell lymphoma cells depleted of the transcription factor SOX11. Notably, FAM71E2 expression is higher in the SOX11-depleted cells than in the control, even though SOX11 transcription factor binding sites are present for FAM71E2. It is possible that these binding sites exist but are not actively used. Further research on this topic should be conducted.

Homology

Paralogs

FAM71E2 has many paralogs, especially within the FAM71 family. The paralogs are sorted by similarity. The paralogs in the table were selected based on their E-value and relevance to the FAM71 family. E-value range: 0 to 3E-11. Similarity range: 100% to 51%.

Select Paralogs of FAM71E2
Protein | E-value | Similarity (%)
FAM71C | 8.00E-22 | 56
FAM71D | 3.00E-21 | 52
HSD-51 | 5.00E-21 | 55
FAM71B | 5.00E-21 | 55
FAM71A | 2.00E-20 | 55
FAM71F1 | 1.00E-11 | 51
FAM71F2 | 3.00E-11 | 51
FAM71E1 | 2.00E-11 | 51
Evolution chart FAM71E2

Orthologs

Select Orthologs of FAM71E2
Genus and species | Common name | Accession | Sequence length | Percent identity | Percent similarity | Date of divergence (MYA)
Homo sapiens | Humans | NP_001138874.1 | 922 | 100 | 100 | 0
Papio anubis | Olive baboon | XP_003916175.2 | 865 | 79.54 | 82 | 28.1
Galeopterus variegatus | Sunda flying lemur | XP_008589520.1 | 876 | 60.74 | 70 | 82
Ictidomys tridecemlineatus | Thirteen-lined ground squirrel | XP_021576662.1 | 1028 | 55.11 | 70 | 88
Vulpes vulpes | Red fox | XP_004777071.1 | 826 | 54.23 | 65 | 94
Pteropus vampyrus | Large flying fox | XP_023378500.1 | 937 | 52.92 | 64 | 94
Vicugna pacos | Alpaca | XP_006216382.1 | 865 | 53.7 | 64 | 94
Chrysemys picta bellii | Western painted turtle | XP_008174220.1 | 217 | 50 | 67 | 320
Pogona vitticeps | Central bearded dragon | XP_020634337.1 | 689 | 33.67 | 55 | 320
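
The ortholog table suggests that percent identity to the human protein falls as divergence time increases. One way to quantify that trend is a Pearson correlation, sketched below. The numeric values are one reading of the table's identity and divergence columns, and the `pearson` helper is written here for illustration rather than taken from any cited tool.

```python
from math import sqrt

# (species, percent identity, divergence in MYA), as read from the table.
# The column split is an assumption about how the table should be parsed.
orthologs = [
    ("Homo sapiens", 100.0, 0.0),
    ("Papio anubis", 79.54, 28.1),
    ("Galeopterus variegatus", 60.74, 82.0),
    ("Ictidomys tridecemlineatus", 55.11, 88.0),
    ("Vulpes vulpes", 54.23, 94.0),
    ("Pteropus vampyrus", 52.92, 94.0),
    ("Vicugna pacos", 53.7, 94.0),
    ("Chrysemys picta bellii", 50.0, 320.0),
    ("Pogona vitticeps", 33.67, 320.0),
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


identity = [row[1] for row in orthologs]
divergence = [row[2] for row in orthologs]
r = pearson(divergence, identity)
print(f"Pearson r (divergence vs. identity): {r:.2f}")
```

A negative coefficient is what the table leads one to expect. With only nine species, several of which share the same divergence date, the value is descriptive rather than a statistical claim.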

Interacting proteins

Several proteins are predicted to interact with FAM71E2. One protein interaction program predicted that NOTCH2NL, P60369, ALB, and MTUS2 interact with FAM71E2. [18] NOTCH2NL might have a role in the Notch signaling pathway as well as in regulating neutrophil differentiation. P60369 is a hair keratin-associated protein. ALB functions as a regulator of the colloidal osmotic pressure of blood, as well as a major zinc transporter. The main function of MTUS2 is to bind microtubules.

Another protein interaction program predicted that BOD1L2, FAM200A, CCT8L2, OR9G1, and AMPD3 interact with FAM71E2. [19] BOD1L2 may have a role in biorientation via mitotic spindles. CCT8L2 assists in protein folding upon ATP hydrolysis. OR9G1 functions as an odorant receptor. AMPD3 functions in energy metabolism. FAM200A has no known function.

Future research

Based on expression data, there are several topics that can be explored to learn more about the exact function of FAM71E2.

References

  1. "Gene - Golgi Associated RAB2 Interactor Family Member 5B". www.genecards.org. Retrieved 2019-05-12.
  2. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2019-05-12.
  3. "National Center for Biotechnology Information". www.ncbi.nlm.nih.gov. Retrieved 2019-05-12.
  4. "Gene - Transmembrane Protein 190". www.genecards.org. Retrieved 2019-05-12.
  5. "Gene - Cytochrome C Oxidase Subunit 6B2". www.genecards.org. Retrieved 2019-05-12.
  6. "Gene - Lysine Methyltransferase 5B". www.genecards.org. Retrieved 2019-05-12.
  7. "Gene - Interleukin 11". www.genecards.org. Retrieved 2019-05-12.
  8. "AceView: Gene:FAM71E2, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTs". www.ncbi.nlm.nih.gov. Retrieved 2019-05-12.
  9. "The Mfold Web Server | mfold.rit.albany.edu". unafold.rna.albany.edu. Retrieved 2019-05-12.
  10. "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2019-05-12.
  11. "Phyre 2 Results for Undefined". www.sbg.bio.ic.ac.uk. Retrieved 2019-05-12.
  12. "PSORT: Protein Subcellular Localization Prediction Tool". www.genscript.com. Retrieved 2019-05-12.
  13. "Genomatix - NGS Data Analysis & Personalized Medicine". www.genomatix.de. Retrieved 2019-05-12.
  14. "Tissue expression of FAM71E2 - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2019-05-12.
  15. "EST Profile - Hs.528319". www.ncbi.nlm.nih.gov. Retrieved 2019-05-12.
  16. "52811124 - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-05-12.
  17. "77625175 - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-05-12.
  18. "Results - mentha: the interactome browser". mentha.uniroma2.it. Retrieved 2019-05-12.
  19. "FAM71E2 protein (human) - STRING interaction network". string-db.org. Retrieved 2019-05-12.