C13orf46

Last updated
C13orf46
Identifiers
Aliases C13orf46 , uncharacterized LOC100507747, chromosome 13 open reading frame 46
External IDs HomoloGene: 139853 GeneCards: C13orf46
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001365455

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC) Chr 13: 113.95 – 113.97 Mb n/a
PubMed search [2] n/a
Wikidata
View/Edit Human

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. [3] In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

Contents

Gene

An alternative name for C13orf46 is LOC100507747. [4] C13orf46 spans 47,563 base pairs, contains 11 exons, and is on the minus strand of chromosome 13 at 13q34. [5] [6]

Graphical representation of the location of the human C13orf46 gene. The red arrow pointing to the left represents the span of the gene C13orf46. Gene Neighborhood of C13orf46.png
Graphical representation of the location of the human C13orf46 gene. The red arrow pointing to the left represents the span of the gene C13orf46.

Gene neighbors

The neighboring genes around C13orf46 include LINC00454, LINC00452, SWINGN, RASA3, and LOC124903221. [7]

LINC00454 and LINC00452 (Long Intergenic NonProtein Coding RNA 454 & 452) are both long non-coding RNAs (lncRNA) that regulate epigenetic gene expression, chromatin remodeling, and levels of gene transcription and translation. [7] [8] [9] Both LINC00454 and LINC00452 expression are restricted to the testis. LINC00454 has been associated with Factor X Deficiency while LINC00452 has been found to promote ovarian carcinogenesis. [10] [11]

SWINGN (SWI/SNF Complex Interacting GAS6 Enhancer Non-Coding RNA) is also a lncRNA that neighbors C13orf46. [12] SWINGN regulates the activation of the GAS6 (Growth Arrest Specific 6) oncogene, by interacting with matrix associated and actin dependent regulators of chromatin. [13]

The RASA3 (RAS p21 Protein Activator 3) gene encodes the Ras GTPase activating protein. This protein binds inositol 1,3,4,5-tetrakisphosphate to stimulate the activity of Ras p21 and negatively regulates the Ras signaling pathway. [14] RASA3 is most highly expressed in fat, lymph nodes, and the spleen. The encoded protein is localized to the cell membrane.

mRNA

Eight different transcript variants have been identified for C13orf46. [7] These transcript variants are alternatively spliced to include variations of 11 different exons. Depending on the different transcript variant that is translated, 5 different possible protein isoforms are encoded by C13orf46. The most common protein product encoded by C13orf46 is isoform 1, which is 212 amino acids long. [15]

Table of Variants and Exons [3]
Transcript VariantmRNA Length (nt)Protein IsoformProtein Length (aa)Molecular Weight (kDA)Exon 1 (bp)Exon 2 (bp)Exon 3 (bp)Exon 4 (bp)Exon 5 (bp)Exon 6 (bp)Exon 7 (bp)Exon 8 (bp)Exon 9 (bp)Exon 10 (bp)Exon 11 (bp)
C13orf46transcript variant 1 3786 C13orf46 protein isoform 1 21223.4269521664848683135
C13orf46transcript variant X1 26461 C13orf46 protein isoform X1 62466.73269252192021020
C13orf46 transcript variant X2 26389 C13orf46 protein isoform X1 62466.73269252209031965
C13orf46 transcript variant X3 25642 C13orf46 protein isoform X1 62466.73269252197452376
C13orf46 transcript variant X4 26573 C13orf46 protein isoform X2 58762.8326925223052
C13orf46 transcript variant X5 29437 C13orf46 protein isoform X3 21223.426952166484868333725449
C13orf46 transcript variant X6 961 C13orf46 protein isoform X4 19221.126952166484868310
C13orf46 transcript variant X7 833 C13orf46 protein isoform X5 17518.9269521664824652

Protein

The primary protein isoform of the C13orf46 gene consists of 212 amino acids. [15] [16] The longest encoded isoform, known as C13orf46 protein isoform X1, is 624 amino acids long. [17] Other protein isoforms encoded by the C13orf46 gene are similar to either of these two versions of the C13orf46 protein. Varying forms of the primary 212 amino acid protein is encoded by transcript variants 1, X5, X6, and X7. Variations of the longest C13orf46 protein isoform are encoded by transcript variants X1, X2, X3, and X4.

Protein isoform 1

Properties and composition

C13orf46 Isoform 1 has a theoretical isoelectric point of 4.84 and a predicted molecular weight of 23.4 kDA. [18] Higher relative amounts of glutamic acid (15.1%) and aspartic acid (7.5%) are found within this isoform, while the amino acids phenylalanine (0.9%) and threonine (0.5%) are found to be less abundant within the protein composition. [19] C13orf46 Isoform 1 also has a glutamic acid rich region where multiple glutamic acid and lysine doublets are present, some of which occur side by side. A total of 14 multiplets are found within the protein overall, 12 of which are charged. [19] C13orf46 Isoform 1 is not predicted to contain any charge clusters, hydrophobic segments, or transmembrane segments.

Structure

iTASSER predicted tertiary structure of C13orf46 Isoform 1. Tertiary Structure Isoform 1 ITASSER.png
iTASSER predicted tertiary structure of C13orf46 Isoform 1.

C13orf46 Protein Isoform 1 is predicted to consist of 7 alpha helices and 2 beta strands in addition to regions of random free coils. [21] [20]

Domains and motifs

C13orf46 Isoform 1 has two identified disordered regions spanning between amino acid residues 1 through 148 and 168 to 190. [22] In addition, C13orf46 Isoform 1 has a glutamic acid rich region spanning along amino acid residues 109 to 191. [19]

Regulation and post translational modifications

C13orf46 Isoform 1 is predicted to undergo several post-translational modifications such as phosphorylation, [23] [24] [25] O-GlcNAcylation, [24] mucin type GalNAc O-glycosylation, [24] palmitoylation, [26] [27] and sumoylation. [28] PKA, PKC, CKII, PKG, GSK3, cdc2, RSK, and ATM are kinases that are predicted to bind and phosphorylate the human C13orf46 Isoform 1. There is also one predicted phosphoprotein-binding phosphosite on the protein. [29]

Schematic illustration of C13orf46 Isoform 1 annotated with relevant predicted post translational modifications. Red pentagons represent predicted relevant O-glycosylation sites, yellow spheres represent significant phosphorylation sites, and pink trapezoids represent predicted sumoylation sites. The predicted significant site of palmitoylation is represented by a purple diamond, while the phosphoprotein-binding phosphosite is depicted with a blue hexagon. A glutamic acid rich region within C13orf46 Isoform 1 is shown as a blue domain between amino acids 109 to 191 Wiki Annotated Post Translational Modifications for Isoform 1.png
Schematic illustration of C13orf46 Isoform 1 annotated with relevant predicted post translational modifications. Red pentagons represent predicted relevant O-glycosylation sites, yellow spheres represent significant phosphorylation sites, and pink trapezoids represent predicted sumoylation sites. The predicted significant site of palmitoylation is represented by a purple diamond, while the phosphoprotein-binding phosphosite is depicted with a blue hexagon. A glutamic acid rich region within C13orf46 Isoform 1 is shown as a blue domain between amino acids 109 to 191
C13orf46 Isoform 1 annotated tertiary structure prediction from iTASSER with highest confidence score. Glutamic acid within the glutamic acid rich region is shown as spheres. The magenta highlights the N-terminal UBR box recognition site for E3 ubiquitin ligases. The green highlights show selected phosphorylation sites that are most conserved and relevant significant predictions based on scores and cross referencing predictions between site tools. The sections highlighted in yellow indicate locations within the protein where O-linked glycosylation may occur. These sections were selected based on comparisons between site prediction tools, score values, and comparisons with orthologs. Annotated iTASSER of Isoform 1 with Highlighted glutamic acid.png
C13orf46 Isoform 1 annotated tertiary structure prediction from iTASSER with highest confidence score. Glutamic acid within the glutamic acid rich region is shown as spheres. The magenta highlights the N-terminal UBR box recognition site for E3 ubiquitin ligases. The green highlights show selected phosphorylation sites that are most conserved and relevant significant predictions based on scores and cross referencing predictions between site tools. The sections highlighted in yellow indicate locations within the protein where O-linked glycosylation may occur. These sections were selected based on comparisons between site prediction tools, score values, and comparisons with orthologs.

Protein isoform X1

Properties

C13orf46 Isoform X1 has a theoretical isoelectric point of 9.33 and a predicted molecular weight of 66.7 kDA. [18] C13orf46 Isoform X1 protein contains much higher relative amounts of serine (18.4%) and leucine (18.8%) compared to other human proteins and also has high amounts of proline (14.4%). [19] Roughly equal amounts of serine and leucine are found within the protein. C13orf46 Isoform X1 protein is also composed of lower than usual amounts of glutamic acid (1.3%), phenylalanine (0.3%), and lysine (0.5%) and also has low amounts of valine (2.4%). [19] Asparagine is not found within the C13orf46 Isoform X1 protein. Within this isoform, 100 amino acid multiplets are found, 5 of which are charged. No charge clusters, hydrophobic segments or transmembrane domains are predicted within the protein. [19]

Structure

C13orf46 Isoform X1 is predicted to consist of a combination of alpha helices, beta sheets, and free random coil regions. [33] There are 22 predicted alpha helices and 18 predicted beta sheets within the predicted structure of C13orf46 Isoform X1.

iTASSER predicted tertiary structure of C13orf46 Isoform X1 Isoform X1 Tertiary iTASSER.png
iTASSER predicted tertiary structure of C13orf46 Isoform X1
Logo of 26 repeats within C13orf46 Isoform X1 Logo of Repeats.png
Logo of 26 repeats within C13orf46 Isoform X1

C13orf46 Isoform X1 contains a series of 26 repeats, which vary in sequence structure and length. [19] [35] Out of the 26 identified repeat sequences, 14 sequences consisted of 20 amino acids, while 5 of the repeats consisted of 21 amino acids, 3 repeats consisted of 22 amino acids, and 4 repeats were 23 amino acids long. [36] Each repeat sequence beings with either the amino acid methionine, isoleucine, or leucine. The main sequence structure of the amino acids within the 26 repeats is MLLLSTGCSSSPPDAPPLHQ. [34] An alignment of the 26 repeats indicate that the most conserved part of the repeat sequence occurs in the middle of the sequences with a triplet of the amino acid serine. [36] [37]

Logo the 14 repeats out of all 26 repeats within C13orf46 Isoform X1 consisting of only 20 amino acids, illuminating the main internal structure of the 26 repeats. Baseline Sequence of 26 Repeats.png
Logo the 14 repeats out of all 26 repeats within C13orf46 Isoform X1 consisting of only 20 amino acids, illuminating the main internal structure of the 26 repeats.

Domains and motifs

C13orf46 Isoform X1 has a predicted a dimerization domain between amino acids residues 69 to 87. [38]

Regulation and post translational modification

C13orf46 Isoform X1 is predicted to undergo several post-translational modifications such as phosphorylation, [23] [24] [25] O-GlcNAcylation, [24] mucin type GalNAc O-glycosylation, [24] palmitoylation, [26] [27] and sumoylation. [28] The human C13orf46 Isoform X1 protein also has 11 predicted PPBD-specific binding phosphosites. [29] The most conserved phosphorylation sites occur on the third serine of 23 out of 26 repeats. PKC, PKG, PKA, p38MAPK, GSK3, DNAPK, CKI, cdk5, CKII, and cdc2 are kinases predicted to bind and phosphorylate the human C13orf46 Isoform X1 protein. [24] Predicted phosphorylated sites are also predicted to be sites where O-glycosylation can occur. [24]

Schematic diagram of predicted significant post translational modifications of C13orf46 Isoform X1. The 26 repeats are shown in alternating color. Predicted phosphorylation sites of CKII are represented by yellow spheres. Three repeats that do not have a predicted to be phosphosite by CKII but are predicted O-GlcNAcylated and/or GalNAc O-glycosylated are depicted as pentagons. Two sites that are only predicted to be O-GlcNAcylated are shown in light pink, while the site predicted to have the potential to be both O-GlcNAcylated and GalNAc O-glycosylated is illustrated in red. A predicted dimerization domain is highlighted by a pink box. One significant s-palmitoylation site predicted on the protein is represented by a purple diamond. Wiki Annotated Post Translational Modifications for Isoform X1.png
Schematic diagram of predicted significant post translational modifications of C13orf46 Isoform X1. The 26 repeats are shown in alternating color. Predicted phosphorylation sites of CKII are represented by yellow spheres. Three repeats that do not have a predicted to be phosphosite by CKII but are predicted O-GlcNAcylated and/or GalNAc O-glycosylated are depicted as pentagons. Two sites that are only predicted to be O-GlcNAcylated are shown in light pink, while the site predicted to have the potential to be both O-GlcNAcylated and GalNAc O-glycosylated is illustrated in red. A predicted dimerization domain is highlighted by a pink box. One significant s-palmitoylation site predicted on the protein is represented by a purple diamond.
Annotated tertiary structure of C13orf46 depicting charge and hydrophobic regions within the protein. The serine's that are predicted to be phosphorylated by CKII are illustrated in green. The dimerization domain predicted between amino acids 69 to 87 are depicted as spheres. The 26 repeats are highlighted and shown in alternating color of blue and purple. Annotated Tertiary Structure of Isoform X1.png
Annotated tertiary structure of C13orf46 depicting charge and hydrophobic regions within the protein. The serine's that are predicted to be phosphorylated by CKII are illustrated in green. The dimerization domain predicted between amino acids 69 to 87 are depicted as spheres. The 26 repeats are highlighted and shown in alternating color of blue and purple.

Protein interactions

C13orf46 protein isoform X1 has several predicted S-phase cyclin binding sites, in addition to MAPK and p38 interacting motifs. [40]

Expression

RNA sequencing shows the expression of C13orf46 is most observed in the lungs, prostate, pancreas, and stomach at intermediate levels. [7] [41] C13orf46 also has lower expression levels in the bone marrow, spleen, thyroid, lymph node, gall bladder, and thymus.

Cellular localization

C13orf46 Isoform 1 is predicted to be mostly localized within the nucleus. [42] [24] This protein isoform may also be localized on the cell membrane. [24] C13orf46 Isoform X1 is predicted to be mostly localized within the nucleus [42] [24] or cytoplasm. [24]

Homology

Orthologs

The C13orf46 gene has orthologs to the human C13orf46 isoform 1 protein and C13orf46 isoform X1 protein, found within primates, mammals, birds, reptiles, fish, and invertebrates. [43]

Isoform 1

Orthologs to the human C13orf46 isoform 1 protein are only known to be found in primates and mammals, suggesting that this part of the C13orf46 gene encoding the C13orf46 isoform 1 protein appeared around 99 million years ago. [43]

Unrooted phylogenetic tree of C13orf46 isoform 1 evolutionary history. Circles indicate species groups of like taxons. The red circle highlights primates, the yellow circle indicates carnivores, and the purple circle indicates rodents. Phylogenetic Tree Isoform 1.png
Unrooted phylogenetic tree of C13orf46 isoform 1 evolutionary history. Circles indicate species groups of like taxons. The red circle highlights primates, the yellow circle indicates carnivores, and the purple circle indicates rodents.
Table of Orthologs to Human Protein C13orf46 Isoform 1 [43]
Genus and SpeciesCommon NameTaxonomic GroupMedian Date of Divergence (mya)Accession #Sequence Length (aa)Sequence Identity (%)Sequence Similarity (%)
Homo sapiens HumanPrimates0 NP_001352384.1 212100.0%100.0%
Pan paniscus BonoboPrimates6.4 XP_034792262.1 21298.1%98.1%
Gorilla gorilla gorilla Western Lowland GorillaPrimates8.6 XP_030857272.1 21295.3%98.1%
Papio anubis Olive BaboonPrimates28.9 XP_021785522.1 21288.7%92.5%
Cercocebus atys Sooty MangabeyPrimates28.9 XP_011913555.1 19287.3%91.0%
Macaca mulatta Rhesus MacaquePrimates28.9 XP_014977020.1 19279.2%82.5%
Ursus arctos Brown BearCarnivora87 XP_048071403.1 22259.2%71.3%
Callorhinus ursinus Northern Fur SealCarnivora87 XP_025730354.1 18457.5%65.6%
Lontra canadensis Northern River OtterCarnivora87 XP_032736869.1 23246.4%53.6%
Odobenus rosmarus divergens Pacific WalrusCarnivora87 XP_004412327.1 31036.4%41.7%
Loxodonta africana African Bush ElephantProboscidea87 XP_010591994.1 21462.6%73.4%
Choloepus didactylus Two-Toed SlothPilosa87 XP_037662557.1 21460.7%73.8%
Orycteropus afer afer AardvarkTubulidentata87 XP_007940592.1 21460.0%69.3%
Castor canadensis North American BeaverRodentia87 XP_020020073.1 21759.6%72.0%
Pteropus giganteus Indian Flying FoxChiroptera94 XP_039734682.1 21367.6%76.5%
Eptesicus fuscus Big Brown BatChiroptera94 XP_028004567.1 21465.9%75.2%
Trichechus manatus latirostris Antillean ManateeSirenia94 XP_023589319.1 21463.1%74.3%
Balaenoptera musculus Blue WhaleCetacea94 XP_036687016.1 20757.3%69.0%
Urocitellus parryii Arctic Ground SquirrelRodentia94 XP_026237314.1 21661.9%72.0%
Sciurus carolinensis Eastern Gray SquirrelRodentia94 XP_047409299.1 23855.6%66.1%
Ictidomys tridecemlineatus Thirteen-Lined Ground SquirrelRodentia94 XP_013221671.2 27649.3%57.6%
Chinchilla lanigera Long-Tailed ChinchillaRodentia99 XP_005373979.1 21755.5%66.4%
Arvicola amphibius European Water VoleRodentia99 XP_038185081.1 23753.1%65.1%
Mesocricetus auratus Golden HamsterRodentia99 XP_005082676.1 23751.0%61.8%
Arvicanthis niloticus African Grass RatRodentia99 XP_034376776.1 24150.0%61.9%

Isoform X1

Predicted orthologs to the human C13orf46 isoform X1 protein are found in primates, mammals, birds, reptiles, fish, and as far as back as invertebrates of the bacterial phylum Legionella . [43]

Table of Predicted Orthologs to Human Protein C13orf46 Isoform X1 [43]
Genus and SpeciesCommon NameTaxonomic GroupMedian Date of Divergence (mya)Accession #Sequence Length (aa)Sequence Identity (%)Sequence Similarity (%)
Homo sapiens HumanPrimates0 XP_047285937.1 624100.0%100.0%
Pan troglodytes ChimpanzeePrimates6.4 XP_024209271.1 72054.9%61.0%
Microtus ochrogaster Prairie VoleRodentia87 KAH0512811.1 93610.3%15.8%
Phoca vitulina European Harbour SealCarnivora94 XP_032285971.1 51018.8%30.1%
Orcinus orca Killer WhaleCetacea94 XP_049556886.1 34814.2%19.3%
Myotis davidii Whiskered BatChiroptera94 ELK34143.1 53026.1%33.6%
Phasianus colchicus Ring-Necked PheasantGalliformes319 XP_031464934.1 49923.2%33.5%
Corvus hawaiiensis Hawaiian CrowPasseriformes319 XP_048182949.1 31617.3%23.4%
Hirundo rustica Barn SwallowPasseriformes319 XP_039927228.1 118510.0%14.9%
Pelodiscus sinensis Chinese Soft-Shelled TurtleTestudines319 XP_025042872.1 55417.3%26.4%
Rana temporaria Grass FrogAnura353 XP_040201915.1 114712.9%19.7%
Bufo bufo Common ToadAnura353 XP_040296088.1 25912.3%19.7%
Lithobates catesbeianus American BullfrogAnura353 PIO00716.1 24512.0%18.5%
Larimichthys crocea Large Yellow CroakerPerciformes431 KAE8277666.1 47828.9%37.9%
Coregonus clupeaformis Lake WhitefishSalmoniformes431 XP_041725148.2 60927.0%26.0%
Austrofundulus limnaeus KillifishCyprinodontiformes431 XP_013856594.1 24422.7%25.4%
Oncorhynchus tshawytscha Chinook Blackmouth SalmonSalmoniformes431 XP_042158955.1 71423.5%26.2%
Salmo salar Atlantic SalmonSalmoniformes431 XP_045562793.1 32419.9%20.9%
Prochilodus magdalenae Columbian Freshwater FishCharaciformes431 KAI4891011.1 38818.6%25.4%
Oncorhynchus mykiss Rainbow TroutSalmoniformes431 XP_036845983.1 33218.5%27.3%
Chiloscyllium punctatum Brownbanded Bamboo SharkOrectolobiformes464 GCC17506.1 62526.7%26.8%
Biomphalaria glabrata Freshwater SnailBasommatophora694 KAI8768938.1 30814.4%18.9%
Bulinus truncatus Freshwater SnailBasommatophora694 KAH9489149.1 87910.3%30.7%
Owenia fusiformis Bristle WormCanalipalpata694 CAH1787814.1 22414.3%15.2%
Legionella fallonii LegionellaLegionellales3036 WP_045095679.1 69515.2%30.7%

Paralogs

Human C13orf46 isoform X1 protein has one predicted paralog among mucins, specifically mucin-1. [43] Mucins play a role in creating protective mucus barriers on epithelial tissues. [45] The MUC1 gene is located on chromosome 1 at 1q22, contains 11 exons, and has 22 different isoforms. [46] Mucins are highly O-glycosylated and contain tandem repeat domains abundant with proline, serine, and threonine. [47] Surrounding the repeat domains are cysteine rich regions. Mucin genes do not always share a common ancestry, are prone to convergent evolution, and are grouped based on their functionality instead of common evolutionary history. [48]

Related Research Articles

Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.

<span class="mw-page-title-main">C10orf67</span> Protein-coding gene in the species Homo sapiens

Chromosome 10 open reading frame 67 (C10orf67), also known as C10orf115, LINC01552, and BA215C7.4, is an un-characterized human protein-coding gene. Several studies indicate a possible link between genetic polymorphisms of this and several other genes to chronic inflammatory barrier diseases such as Crohn's Disease and sarcoidosis.

<span class="mw-page-title-main">C6orf62</span>

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">C15orf39</span>

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

<span class="mw-page-title-main">C1orf122</span> Protein-coding gene in the species Homo sapiens

C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.

<span class="mw-page-title-main">C7orf50</span>

C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">LSMEM2</span>

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">C6orf136</span>

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

<span class="mw-page-title-main">C11orf98</span>

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">C12orf50</span> Protein encoding gene C12orf50

Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.

<span class="mw-page-title-main">TEKTIP1</span> Gene

TEKTIP1, also known as tektin-bundle interacting protein 1, is a protein that in humans is encoded by the TEKTIP1 gene.

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

<span class="mw-page-title-main">LRRC74A</span> Protein-coding gene

Leucine-rich Repeat-containing Protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

References

  1. 1 2 3 ENSG00000283199 GRCh38: Ensembl release 89: ENSG00000283302, ENSG00000283199 - Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. 1 2 3 "C13orf46 chromosome 13 open reading frame 46 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-15.
  4. "GEO Profile Links for Gene (Select 100507747) - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-17.
  5. "Homo sapiens chromosome 13, GRCh38.p14 Primary Assembly". 2022-04-06.
  6. "C13orf46 Gene - CM046 Protein | CM046 Antibody". GeneCards. Retrieved 2022-12-14.
  7. 1 2 3 4 "C13orf46 chromosome 13 open reading frame 46 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-14.
  8. "LINC00452 long intergenic non-protein coding RNA 452 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-15.
  9. Zhang X, Wang W, Zhu W, Dong J, Cheng Y, Yin Z, Shen F (November 2019). "Mechanisms and Functions of Long Non-Coding RNAs at Multiple Regulatory Levels". International Journal of Molecular Sciences. 20 (22): 5573. doi: 10.3390/ijms20225573 . PMC   6888083 . PMID   31717266.
  10. "Alliance of Genome Resources". www.alliancegenome.org. Retrieved 2022-12-15.
  11. Yang J, Wang WG, Zhang KQ (November 2020). "LINC00452 promotes ovarian carcinogenesis through increasing ROCK1 by sponging miR-501-3p and suppressing ubiquitin-mediated degradation". Aging. 12 (21): 21129–21146. doi:10.18632/aging.103758. PMC   7695380 . PMID   33168781.
  12. "SWINGN SWI/SNF complex interacting GAS6 enhancer non-coding RNA [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-15.
  13. Grossi E, Raimondi I, Goñi E, González J, Marchese FP, Chapaprieta V, et al. (February 2020). "A lncRNA-SWI/SNF complex crosstalk controls transcriptional activation at specific promoter regions". Nature Communications. 11 (1): 936. Bibcode:2020NatCo..11..936G. doi:10.1038/s41467-020-14623-3. PMC   7028943 . PMID   32071317.
  14. "RASA3 RAS p21 protein activator 3 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-15.
  15. 1 2 "UniProt". www.uniprot.org. Retrieved 2022-12-15.
  16. "uncharacterized protein C13orf46 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-15.
  17. "uncharacterized protein C13orf46 isoform X1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-15.
  18. 1 2 "Expasy - Compute pI/Mw tool". web.expasy.org. Retrieved 2022-12-15.
  19. 1 2 3 4 5 6 7 "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-15.
  20. 1 2 "I-TASSER results". seq2fun.dcmb.med.umich.edu. Retrieved 2022-12-15.
  21. "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2022-12-15.
  22. "uncharacterized protein C13orf46 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-15.
  23. 1 2 "GPS 5.0 - Kinase-specific Phosphorylation Site Prediction". gps.biocuckoo.cn. Retrieved 2022-12-15.
  24. 1 2 3 4 5 6 7 8 9 10 11 12 "Services". healthtech.dtu.dk. Retrieved 2022-12-15.
  25. 1 2 "Motif Scan". myhits.sib.swiss. Retrieved 2022-12-15.
  26. 1 2 "GPS-Palm: A Graphic Presentation System for Palmitoylation Site Prediction". gpspalm.biocuckoo.cn. Retrieved 2022-12-15.
  27. 1 2 "GPS-Lipid - Prediction of Lipid Modifications (S-Palmitoylation, N-Myristoylation, S-Farnesylation, S-Geranylgeranylation)". lipid.biocuckoo.org. Retrieved 2022-12-15.
  28. 1 2 "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumosp.biocuckoo.org. Retrieved 2022-12-15.
  29. 1 2 "GPS-PBS - PPBDs–specific binding p-site prediction". pbs.biocuckoo.cn. Retrieved 2022-12-15.
  30. "IBS: Illustrator for Biological Sequences". ibs.biocuckoo.org. Retrieved 2022-12-15.
  31. "I-TASSER results". seq2fun.dcmb.med.umich.edu. Retrieved 2022-12-15.
  32. "iCn3D: Web-based 3D Structure Viewer". www.ncbi.nlm.nih.gov. Retrieved 2022-12-15.
  33. 1 2 "I-TASSER results". seq2fun.dcmb.med.umich.edu. Retrieved 2022-12-15.
  34. 1 2 3 "WebLogo - Create Sequence Logos". weblogo.berkeley.edu. Retrieved 2022-12-15.
  35. "Dotlet JS". dotlet.vital-it.ch. Retrieved 2022-12-15.
  36. 1 2 "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-15.
  37. "Multiple Sequence Alignment - CLUSTALW". www.genome.jp. Retrieved 2022-12-15.
  38. "Motif Scan". myhits.sib.swiss. Retrieved 2022-12-15.
  39. "IBS: Illustrator for Biological Sequences". ibs.biocuckoo.org. Retrieved 2022-12-15.
  40. "ELM - Search the ELM resource". elm.eu.org. Retrieved 2022-12-17.
  41. "Tissue expression of C13orf46 - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-12-16.
  42. 1 2 "PSORT II Prediction". psort.hgc.jp. Retrieved 2022-12-15.
  43. 1 2 3 4 5 6 "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2022-12-16.
  44. "Phylogeny.fr: Home". www.phylogeny.fr. Retrieved 2022-12-17.
  45. Reznik N, Gallo AD, Rush KW, Javitt G, Fridmann-Sirkis Y, Ilani T, et al. (October 2022). "Intestinal mucin is a chaperone of multivalent copper". Cell. 185 (22): 4206–4215.e11. doi: 10.1016/j.cell.2022.09.021 . PMID   36206754. S2CID   245671675.
  46. "MUC1 mucin 1, cell surface associated [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16.
  47. Pinzón Martín S, Seeberger PH, Varón Silva D (2019). "Mucins and Pathogenic Mucin-Like Molecules Are Immunomodulators During Infection and Targets for Diagnostics and Vaccines". Frontiers in Chemistry. 7: 710. Bibcode:2019FrCh....7..710V. doi: 10.3389/fchem.2019.00710 . PMC   6817596 . PMID   31696111.
  48. Pajic P, Shen S, Qu J, May AJ, Knox S, Ruhl S, Gokcumen O (August 2022). "A mechanism of gene evolution generating mucin function". Science Advances. 8 (34): eabm8757. Bibcode:2022SciA....8M8757P. doi:10.1126/sciadv.abm8757. PMC   9417175 . PMID   36026444.