C16orf58

Last updated
RUSF1
Identifiers
Aliases RUSF1 , RUS, chromosome 16 open reading frame 58, C16orf58, RUS family member 1
External IDs MGI: 2384572; HomoloGene: 11232; GeneCards: RUSF1; OMA:RUSF1 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_022744

NM_145590
NM_001360882

RefSeq (protein)

NP_073581

NP_663565
NP_001347811

Location (UCSC) Chr 16: 31.49 – 31.51 Mb Chr 7: 127.87 – 127.9 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Chromosome 16 open reading frame 58, or C16orf58, also known as FLJ13638 is a protein which in humans is encoded by the C16orf58 gene. [5] The gene itself is 18892 bp long, with mRNA of 2760 bp, and a protein sequence of 468 amino acids. There is a conserved domain of unknown, DUF647. No function has been determined for this gene yet, but it is predicted that it resides in the endoplasmic reticulum in the cytoplasm. [6]

Contents

Species distribution

C16orf58 has very interesting conservation in that it has orthologs back through plants and fungi. However, it has not been found in reptiles, birds, or amphibians. The below table shows some, but not all, orthologs which were found using BLAST. [7]

SpeciesOrganism Common NameNCBI AccessionSequence IdentityE-valueLength (AAs)Gene Common Name
Homo sapiens Human NP_073581 100%0.0468C16orf58
Equus Caballus Horse XP_001495510 85%0.0468PREDICTED: similar to UPF0420 protein C16orf58
Canis familiaris Dog XP_547054 85%0.0485similar to CG10338-PA
Mus musculus Mouse Q91W34 81%0.0466cDNA sequence BC017158
Monodelphis domestica Opossum XP_001370394 65%3e−160466PREDICTED: hypothetical protein
Danio rerio Zebrafish NP_001103923 53%4e−112432hypothetical protein LOC555936
Drosophila melanogaster Fly NP_609897 40%3e−69395CG10338
Arabidopsis thaliana Thale Cress AAF81284 37%2e−68403Contains similarity to CG10338 gene product from Drosophila melanogaster
Gallus gallus Chicken NP_989823 25%0.361434protein tyrosine phosphatase, receptor type, U
Xenopus tropicalis Frog AAI22058 31%3.4268Stk19 protein
Saccharomyces cerevisiae Yeast EDZ73379 25%0.211578YDL140Cp-like protein
Caenorhabditis elegans Nematode NP_502300 19%3.0414hypothetical protein M18.6

Protein Interactions

Though the function is still unknown, C16orf58 has been shown to interact with three different proteins:

Structure

Although there are several sites that will give predictions on protein structure, C16orf58 does not have a known structure yet. That being said there is at least one transmembrane domain, if not more. Within the protein structure there are several extended areas with uncharged amino acids, these could be possible transmembrane domains, or hydrophobic cores. [6] The below shows the charge of each of the amino acids in the protein sequence, + for positive, - for negative and 0 for uncharged. Note the large segments of uncharged amino acids appear bolded. These stretches of uncharged amino acids are conserved back through distant orthologs.

      1  00—000-00 000-00000- 0+00+000-0 0000-0000+ 00000+0000 +0-0+-00-0       61  0000000000 0000000000 000-0000-0 000000-000 0000000000 0000000000      121  0000+00000 0000000+-0 00000+0000 00+00+0-00 0+00+000-0 00-00000-0      181  0000000000 000000000+ 0000000000 +00000000+ +0000-000+ -000-00000      241  0000000000 0000000000 0000000000 000000+00+ 0000-000-0 +0+000+000      301  0+0-00-000 00+0-00000 0000000000 0000+00000 0-00000-00 0-000000-0      361  0000000000 0+000+000+ 0000000000 000-00000- 0—0+0+0+0 00++-00000      421  +-00-00-00 00+00+000- 000+0-+000 -00-0+0000 000-++00

Related Research Articles

<span class="mw-page-title-main">XBP1</span> Protein-coding gene in the species Homo sapiens

X-box binding protein 1, also known as XBP1, is a protein which in humans is encoded by the XBP1 gene. The XBP1 gene is located on chromosome 22 while a closely related pseudogene has been identified and localized to chromosome 5. The XBP1 protein is a transcription factor that regulates the expression of genes important to the proper functioning of the immune system and in the cellular stress response.

<span class="mw-page-title-main">AGPAT2</span> Protein-coding gene in the species Homo sapiens

1-acyl-sn-glycerol-3-phosphate acyltransferase beta is an enzyme that in humans is encoded by the AGPAT2 gene.

<span class="mw-page-title-main">REEP5</span> Protein-coding gene in the species Homo sapiens

Receptor expression-enhancing protein 5 is a protein that in humans is encoded by the REEP5 gene. Receptor Expression Enhancing Protein is a protein encoded for in Humans by the REEP5 gene.

<span class="mw-page-title-main">ITFG3</span> Protein-coding gene in the species Homo sapiens

Protein ITFG3 also known as family with sequence similarity 234 member A (FAM234A) is a protein that in humans is encoded by the ITFG3 gene. Here, the gene is explored as encoded by mRNA found in Homo sapiens. The FAM234A gene is conserved in mice, rats, chickens, zebrafish, dogs, cows, frogs, chimpanzees, and rhesus monkeys. Orthologs of the gene can be found in at least 220 organisms including the tropical clawed frog, pandas, and Chinese hamsters. The gene is located at 16p13.3 and has a total of 19 exons. The mRNA has a total of 3224 bp and the protein has 552 aa. The molecular mass of the protein produced by this gene is 59660 Da. It is expressed in at least 27 tissue types in humans, with the greatest presence in the duodenum, fat, small intestine, and heart.

<span class="mw-page-title-main">TSR3</span> Hypothetical human protein

TSR3, or TSR3 Ribosome Maturation Factor, is a hypothetical human protein found on chromosome 16. Its protein is 312 amino acids long and its cDNA has 1214 base pairs. It was previously designated C16orf42.

<span class="mw-page-title-main">Coiled-coil domain-containing protein 135</span> Protein found in humans

Coiled-coil domain-containing protein 135, also known as CCDC135, is a protein that in humans is encoded by the CCDC135 gene.

<span class="mw-page-title-main">TMEM242</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 242 (TMEM242) is a protein that in humans is encoded by the TMEM242 gene. The tmem242 gene is located on chromosome 6, on the long arm, in band 2 section 5.3. This protein is also commonly called C6orf35, BM033, and UPF0463 Transmembrane Protein C6orf35. The tmem242 gene is 35,238 base pairs long, and the protein is 141 amino acids in length. The tmem242 gene contains 4 exons. The function of this protein is not well understood by the scientific community. This protein contains a DUF1358 domain.

Uncharacterized LOC644249 gene., also known as RP11-195B21.3, is about 1058 base pairs long and is found in Homo sapiens on chromosome 9q12. More specifically, the sequence is located on Chromosome: 9; NC_000009.11(67977457..67987991 bp). This gene’s protein product is the “coiled-coil domain-containing protein 29” which is 291 amino acids long and may contain a conserved domain in the superfamily, pfam 12001. In particular, this conserved domain contains the domain of unknown function DUF3496 which is about 110 amino acids long, functionally uncharacterized, and found in eukaryotes. Other possible motifs for the protein product exist but the DUF3496 remains the most likely. This protein may play a role as a transmembrane protein.

<span class="mw-page-title-main">FAM214A</span> Protein-coding gene in the species Homo sapiens

Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.

<span class="mw-page-title-main">CCDC47</span> Protein-coding gene in humans

Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein CCDC47. The gene has several aliases including GK001 and MSTP041. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.

<span class="mw-page-title-main">Transmembrane protein 134</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 134 is a protein encoded by the TMEM134 gene. TMEM134 does not have any other known aliases. There are two transmembrane domains and a domain of unknown function (DUF872). Evolutionary, the majority of the organisms that have this gene are primates and mammals, although there are some organisms dating back to Drosophila and C. elegans. Through current research, there has not been any confirmed function of TMEM134.

Seipin is a homo-oligomeric integral membrane protein in the endoplasmic reticulum (ER) that concentrates at junctions with cytoplasmic lipid droplets (LDs). Alternatively, seipin can be referred to as Berardinelli–Seip congenital lipodystrophy type 2 protein (BSCL2), and it is encoded by the corresponding gene of the same name, i.e. BSCL2. At protein level, seipin is expressed in cortical neurons in the frontal lobes, as well as motor neurons in the spinal cord. It is highly expressed in areas like the brain, testis and adipose tissue. Seipin's function is still unclear but it has been localized close to lipid droplets, and cells knocked out in seipin have anomalous droplets. Hence, recent evidence suggests that seipin plays a crucial role in lipid droplet biogenesis.

<span class="mw-page-title-main">C11orf86</span> Protein-coding gene in the species Homo sapiens

Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.

<span class="mw-page-title-main">FAM210B</span> Protein-coding gene in the species Homo sapiens

FAM210B is a gene that which in Homo sapiens encodes the protein FAM210B. It has been conserved throughout evolutionary history, and is highly expressed in multiple tissues within the human body. FAM210B's primary location is the endoplasmic reticulum.

<span class="mw-page-title-main">TMCO4</span> Protein-coding gene in the species Homo sapiens

Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.

<span class="mw-page-title-main">Gram domain containing 1b</span> Protein-coding gene in the species Homo sapiens

GRAM domain containing 1B, also known as GRAMD1B, Aster-B and KIAA1201, is a cholesterol transport protein that is encoded by the GRAMD1B gene. It contains a transmembrane region and two domains of known function; the GRAM domain and a VASt domain. It is anchored to the endoplasmic reticulum. This highly conserved gene is found in a variety of vertebrates and invertebrates. Homologs are found in yeast.

<span class="mw-page-title-main">TMEM44</span> Protein-coding gene in the species Homo sapiens

TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.

<span class="mw-page-title-main">SMCO3</span> Protein-coding gene in the species Homo sapiens

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

<span class="mw-page-title-main">Melanocortin 2 receptor accessory protein</span> Protein-coding gene in the species Homo sapiens

Melanocortin 2 receptor accessory protein is a transmembrane accessory protein that in humans is encoded by the MRAP gene located in chromosome 21q22.11. Alternate splicing of the MRAP mRNA generates two functionally isoforms MRAP-α and MRAP-β.

<span class="mw-page-title-main">SMIM19</span> Protein-coding gene in the species Homo sapiens

SMIM19, also known as Small Integral Membrane Protein 19, encodes the SMIM19 protein. SMIM19 is a confirmed single-pass transmembrane protein passing from outside to inside, 5' to 3' respectively. SMIM19 has ubiquitously high to medium expression with among varied tissues or organs. The validated function of SMIM19 remains under review because of on sub-cellular localization uncertainty. However, all linked proteins research to interact with SMIM19 are associated with the endoplasmic reticulum (ER), presuming SMIM19 ER association

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000140688 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000030780 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Entrez Gene: C16orf58" . Retrieved 6 May 2009.
  6. 1 2 "SDSC Biology Workbench". San Diego Supercomputer Center. Retrieved 2009-05-07.
  7. "BLAST: Basic Local Alignment Search Tool". National Center for Biotechnology Information, United States National Institutes of Health. Retrieved 2009-05-07.
  8. "STRING: functional protein association networks". EMBL.de. Retrieved 2009-05-07.
  9. "Entrez Gene: MVD mevalonate (diphospho) decarboxylase" . Retrieved 6 May 2009.
  10. 1 2 "mint database". Archived from the original on 2006-05-06. Retrieved 2009-05-07.
  11. "Entrez Gene: BSCL2 Bernardinelli-Seip congenital lipodystrophy 2 (seipin)" . Retrieved 6 May 2009.
  12. "Entrez Gene: TSC22D4 TSC22 domain family, member 4" . Retrieved 6 May 2009.