FAM46C

Last updated
TENT5C
Identifiers
Aliases TENT5C , family with sequence similarity 46 member C, terminal nucleotidyltransferase 5C, FAM46C
External IDs OMIM: 613952 MGI: 1921895 HomoloGene: 56783 GeneCards: TENT5C
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_017709

NM_001142952

RefSeq (protein)

NP_060179

NP_001136424

Location (UCSC) Chr 1: 117.61 – 117.63 Mb Chr 3: 100.36 – 100.4 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Protein FAM46C also known as family with sequence similarity 46, member C is a protein that, in humans, is encoded by the FAM46C gene at locus 1p12 spanning base pairs from 118,148,556 to 118,171,011.

Contents

Summary

FAM46C is a protein of unknown function consisting of 391 amino acid residues that are translated from an mRNA consisting of 5720 base pairs. FAM46C was initially sequenced as part of the Full-length long Japan genomic sequencing project. FAM46C is found on chromosome 1 at the locus 1p12 FAM46C contains one domain of unknown function, DUF1693, and as such has been placed in the DUF1693 protein family. This protein family has been established as a part of the Nucleotidyltransferase superfamily [5] and contains 4 nematode prion-like proteins. FAM46C was placed into group XXV of the nucleotidyltransferase superfamily [5] along with 3 other Homo sapiens FAM46 proteins (A, B, D).

It has been suggested that FAM46C may be functionally involved with the Type 1 interferon response. [6] Deletion and/ or mutation of FAM46C has been associated with impaired overall survival in Myeloma patients. [7]

Gene

Homo sapiens FAM46C spans 22,456 bases on chromosome 1. Microarray data suggests that human FAM46C displays elevated expression levels in bone marrow, CD71+ early erythroid, various B-cells, T-cells, and lymphocytes, as well as all tissues associated with testes. [8]

Evolution and homology

Homo sapiens FAM46C is highly conserved in close orthologs with only small changes in protein AA sequence when comparing to other mammals.

FAM46C and specifically the DUF1693 is traceable throughout the known metazoans, with a distant homolog found in Trichoplax adhaerens, a member of the basal multicellular organismal group Placozoa.

Paralogs

Homo sapiens FAM46C is paralogous to 3 separate known FAM46 proteins, all of which contain DUF1693.

This table lists human FAM46C paralogs, indicating relevant NCBI accession numbers (current as of May 2013), as well as nucleotide and protein alignment scores calculated using the ALIGN algorithm and cross-checked with NCBI BLAST search. FAM46C Paralog Table.png
This table lists human FAM46C paralogs, indicating relevant NCBI accession numbers (current as of May 2013), as well as nucleotide and protein alignment scores calculated using the ALIGN algorithm and cross-checked with NCBI BLAST search.
Orthologs of FAM46C are listed in order of date of divergence from the human lineage, as determined by a search of the publicly available Timetree database. Note that all values are considered approximate and are used solely as bioinformatic data compiled via free public databases of published research. Ortholog table.png
Orthologs of FAM46C are listed in order of date of divergence from the human lineage, as determined by a search of the publicly available Timetree database. Note that all values are considered approximate and are used solely as bioinformatic data compiled via free public databases of published research.

Orthologs

There are numerous FAM46C orthologs present throughout the current catalog of multicellular organisms, all of which exhibit impressive conservation of domain of unknown function 1693. The most notable ortholog is taken to be TRIADDRAFT14293, a gene of the species Trichoplax adhaerens. This ortholog provides evidence that FAM46C is a member of a group of proteins containing highly conserved amino acid residues, and as such it is thought to provide some highly important function, as would be expected when considering its possible nucleotidyltransferase activity.

Homologous domains

This PHYRE2 FAM46C structure prediction was annotated with brackets to demonstrate regions of conserved structure prediction through the Trichoplax ortholog. Note that many of the structures seem to have originated as shorter segments of what exists in human FAM46C 9if we are to take high confidence predictions as most probable structures). PHYRE2 FAM46C prediction.png
This PHYRE2 FAM46C structure prediction was annotated with brackets to demonstrate regions of conserved structure prediction through the Trichoplax ortholog. Note that many of the structures seem to have originated as shorter segments of what exists in human FAM46C 9if we are to take high confidence predictions as most probable structures).

In order to determine possible conserved structures, a predictive approach was utilized with regards to comparison of FAM46C with the most distant homolog available, that of Trichoplax adhearens. PHYRE2 [9] was used to predict protein secondary structure of human FAM46C as well as trichoplax TRIADDRAFT-14293. We are able to visualize possible structures predicted with high confidence in both the human gene, as well as the placozoan gene. This preliminary prediction offers some insight into important structures, most probably of catalytic and/or binding function.

Phylogeny

FAM46C Ortholog Unrooted Tree FAM46C Ortholog Unrooted Tree.png
FAM46C Ortholog Unrooted Tree

Based on multiple sequence alignments generated by ClustalW an unrooted phylogenetic tree was generated for select FAM46C orthologs in order to demonstrate the diverse occurrences of FAM46C-like genes throughout the current evolutionary catalog. For posterity, many orthologs were omitted (see ortholog table above; many were omitted from this table as well).

Protein structure

"Homo sapiens" FAM46C encodes a 391 amino acid protein with no known isoforms. "Homo sapiens" FAM46C has not been crystallized and its secondary structure has not yet been determined as of May 2013. FAM46C has a predicted isoelectric point of 5.338. The protein contains one domain of unknown function, DUF1693 (Pfam: PF07984)

A Leucine zipper pattern was found beginning at Lys113 and ending at Lys134. This could help distinguish nuclear proteins from non-nuclear proteins, however all other subsets of analyses using PSORTII [10] have predicted that FAM46C is strictly a cytosolic protein.

No other significant protein structural motifs have been predicted.

Protein-protein interactions

FAM46C has been experimentally shown to interact physically with at least 4 separate proteins, with other interactions that have been predicted [11]

ProteinEvidenceAccessionDatabase
DAZAP22-hybridAAR11454MINT
TRIP62-hybridCAA05080MINT
PLK42-hybridNM_014264MINT
AP2B12-hybridNM_001030006MINT

Related Research Articles

<span class="mw-page-title-main">Protein FAM46B</span> Protein-coding gene in the species Homo sapiens

Protein FAM46B also known as family with sequence similarity 46 member B is a protein that in humans is encoded by the FAM46B gene. FAM46B contains one protein domain of unknown function, DUF1693. Yeast two-hybrid screening has identified three proteins that physically interact with FAM46B. These are ATX1, PEPP2 and DAZAP2.

<span class="mw-page-title-main">FAM203B</span> Protein-coding gene in the species Homo sapiens

Family with Sequence Similarity 203, Member B (FAM203B) is a protein encoded by the FAM203B gene (8q24.3) in humans. While FAM203B is only found in humans and possibly non-human primates, its paralog, FAM203A, is highly conserved. The FAM203B protein contains two conserved domains of unknown function, DUF383 and DUF384, and no transmembrane domains. This protein has no known function yet, although the homolog of FAM203A in Caenorhabditis elegans (Y54H5A.2) is thought to help regulate the actin cytoskeleton.

<span class="mw-page-title-main">FAM214A</span> Protein-coding gene in the species Homo sapiens

Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.

<span class="mw-page-title-main">CCDC94</span> Protein-coding gene in the species Homo sapiens

Coiled-coil domain containing 94 (CCDC94), is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.

<span class="mw-page-title-main">FAM149A</span> Protein-coding gene in the species Homo sapiens

Family with sequence similarity 149, member A is a protein that in humans is encoded by the FAM149A gene. It is well conserved in primates, dog, cow, mouse, rat, and chicken. It has one paralog, FAM149B.

<span class="mw-page-title-main">FAM167A</span> Protein-coding gene in the species Homo sapiens

Family with sequence similarity 167, member A is a protein in humans that is encoded by the FAM167A gene located on chromosome 8. FAM167A and its paralogs are protein encoding genes containing the conserved domain DUF3259, a protein of unknown function. FAM167A has many orthologs in which the domain of unknown function is highly conserved.

<span class="mw-page-title-main">FAM63A</span> Protein-coding gene in the species Homo sapiens

Family with sequence similarity 63, member A is a protein that, in humans, is encoded by the FAM63A gene. It is located on the minus strand of chromosome 1 at locus 1q21.3.

<span class="mw-page-title-main">ABHD18</span> Protein-coding gene in the species Homo sapiens

ABHD18 is a protein that in Homo sapiens is encoded by the ABHD18 gene.

C6orf222 is a protein that in humans is encoded by the C6orf222 gene (6p21.31). C6orf222 is conserved in mammals, birds and reptiles with the most distant ortholog being the green sea turtle, Chelonia mydas. The C6orf222 protein contains one mammalian conserved domain: DUF3293. The protein is also predicted to contain a BH3 domain, which has predicted conservation in distant orthologs from the clade Aves.

<span class="mw-page-title-main">FAM210B</span> Protein-coding gene in the species Homo sapiens

FAM210B is a gene that which in Homo sapiens encodes the protein FAM210B. It has been conserved throughout evolutionary history, and is highly expressed in multiple tissues within the human body. FAM210B's primary location is the endoplasmic reticulum.

<span class="mw-page-title-main">Fam221b</span> Protein-coding gene in the species Mus musculus

FAM221B is a protein that in humans is encoded by the FAM221B gene . FAM221B is also known by the alias C9orf128, is expressed at low level, and is defined by 17 GenBank accessions . It is predicted to function in transcription regulation as a transcription factor.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">FAM222A</span> Protein-coding gene in the species Homo sapiens

Family with sequence similarity 222 member A or Aggregatin is a protein of unknown function. In humans it is encoded by the gene FAM222A. Aggregatin's cellular function is not well understood, however it has been implicated in Alzheimer's Disease.

<span class="mw-page-title-main">LSMEM2</span> Protein-coding gene in the species Homo sapiens

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

<span class="mw-page-title-main">CCDC121</span> Protein-coding gene in the species Homo sapiens

Coiled-coil domain containing 121 (CCDC121) is a protein encoded by the CCDC121 gene in humans. CCDC121 is located on the minus strand of chromosome 2 and encodes three protein isoforms. All isoforms of CCDC121 contain a domain of unknown function referred to as DUF4515 or pfam14988.

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000183508 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000044468 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 Kuchta K, Knizewski L, Wyrwicz LS, Rychlewski L, Ginalski K (December 2009). "Comprehensive classification of nucleotidyltransferase fold proteins: identification of novel families and their representatives in human". Nucleic Acids Res. 37 (22): 7701–14. doi:10.1093/nar/gkp854. PMC   2794190 . PMID   19833706.
  6. Schoggins JW, Wilson SJ, Panis M, Murphy MY, Jones CT, Bieniasz P, Rice CM (April 2011). "A diverse range of gene products are effectors of the type I interferon antiviral response". Nature. 472 (7344): 481–5. Bibcode:2011Natur.472..481S. doi:10.1038/nature09907. PMC   3409588 . PMID   21478870.
  7. Boyd KD, Ross FM, Walker BA, Wardell CP, Tapper WJ, Chiecchio L, Dagrada G, Konn ZJ, Gregory WM, Jackson GH, Child JA, Davies FE, Morgan GJ (December 2011). "Mapping of chromosome 1p deletions in myeloma identifies FAM46C at 1p12 and CDKN2C at 1p32.3 as being genes in regions associated with adverse survival". Clin. Cancer Res. 17 (24): 7776–84. doi:10.1158/1078-0432.CCR-11-1791. PMC   5751883 . PMID   21994415.
  8. "BioGPS - your Gene Portal System".
  9. Kelley LA, Sternberg MJ (2009). "Protein structure prediction on the Web: a case study using the Phyre server" (PDF). Nat Protoc. 4 (3): 363–71. doi:10.1038/nprot.2009.2. hdl: 10044/1/18157 . PMID   19247286. S2CID   12497300.
  10. Horton P, Nakai K (1997). "Better prediction of protein cellular localization sites with the k nearest neighbors classifier". Proc Int Conf Intell Syst Mol Biol. 5: 147–52. PMID   9322029.
  11. "FAM46C PSICQUIC View" . Retrieved 11 May 2013.