C6orf163 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C6orf163 , chromosome 6 open reading frame 163 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 2684982 HomoloGene: 79759 GeneCards: C6orf163 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
C6orf163 is a human protein encoded by the C6orf163 gene.
C6orf163 is a 20.6 kb gene encoded on the plus strand of chromosome 6 (6q15). C6orf163 is predicted to be part of a readthrough locus with its neighboring genes on the plus strand, SMIM8 (small integral membrane protein 8, also known as C6orf162), LINC01590 (long intergenic non-coding RNA 1590), and CFAP206 (cilia and flagella associated protein 206). [5]
C6orf163 has been observed to be near-ubiquitously expressed at low levels in RNA-seq datasets. It is expressed most highly in the testes. [5] There are 4 isoforms of C6orf163, the most common of which has 5 exons. The splice variants differ by truncation on the 5' end. [6] An additional unspliced mRNA variant has been identified, but it does not appear to code for a protein. [6]
Throughout early development, C6orf163 is expressed at moderate levels in many tissues. Its expression is highest at 10 weeks gestational time and decreases as development progresses. [7]
The human C6orf163 protein is 329 amino acids long and has a molecular weight of 38 kDa. [8] Its predicted isoelectric point is 6.49. [9] According to the structural prediction from Alphafold, [10] it mainly consists of a long alpha helical region, which is a relatively rare structure in human proteins. [11] The long alpha helical structure is well conserved among orthologs.
C6orf163 contains a predicted leucine zipper motif from amino acids 247 to 269. This motif is typically involved in DNA binding, and is commonly found in transcription factors and other regulatory proteins. Leucine zippers form dimers to bind DNA, so the presence of this motif suggests that C6orf163 may exist as a dimer.
C6orf163 has been experimentally found to undergo phosphorylation at 7 different residues and ubiquitination at 1 residue. [12] [13] Additionally, it has been computationally predicted to undergo sumoylation, [14] lycine acetylation, [15] and mucin-type O-GlcNac glycosylation. [16]
C6orf163 has been found to interact with the protein DRC6 [17] (Dynein regulatory complex subunit 6, also known as F-box and leucine rich repeat protein 13), which is a ubiquitin ligase that forms part of the SCF-type E3 ubiquitin ligase complex. DRC6 has been found to be involved in regulation of ciliary and flagellar motility. [18]
C6orf163 has a nuclear localization signal from amino acids 310 to 316. Antibody staining has shown C6orf163 to be localized to the nucleus and cytoplasm. [19] [20]
The C6orf163 protein is highly conserved among animals. Orthologs of C6orf163 have been identified in mammals, birds, reptiles, amphibians, fish, and some invertebrates including mollusks, echinoderms, and lancelets. The most distant c6orf163 ortholog identified is in the Japanese mud snail, Batillaria attramentaria . This ortholog has 23% sequence identity with the human protein. [21]
Homo sapiens C6orf163 has no known paralogs in humans.
Genus and species | Common name | Accession number | Length (aa) | Sequence identity (%) | Date of divergence (MYA) |
Homo sapiens | human | NP_001010868.2 | 328 | 100 | 0 |
Mus musculus | mouse | NP_001028427.1 | 328 | 74 | 87 |
Egretta garzetta | little egret | XP_009647262.2 | 330 | 47 | 319 |
Crocodylus porosus | saltwater crocodile | XP_019393256.1 | 350 | 43 | 319 |
Geotrypetes seraphini | gaboon caecilian | XP_033792248.1 | 313 | 33 | 353 |
Scyliorhinus canicula | small-spotted catshark | XP_038654713.1 | 330 | 27 | 464 |
Branchiostoma floridae | Florida lancelet | XP_035665671.1 | 297 | 29 | 556 |
Batillaria attramentaria | Japanese mud snail | KAG5690851.1 | 278 | 23 | 680 |
A genome-wide association study analyzing genetic predictors of long-term treatment outcome for bipolar disorder showed that SNPs near C6orf163 were associated with the total number of manic and depressive episodes during follow up treatment and the number of depressive episodes during follow up, suggesting that C6orf163 may be involved in susceptibility to bipolar disorder. [22]
DEP Domain Containing Protein 1B also known as XTP1, XTP8, HBV XAg-Transactivated Protein 8, [formerly referred to as BRCC3] is a human protein encoded by a gene of similar name located on chromosome 5.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Glutamate rich protein 5 is a protein in humans encoded by the ERICH5 gene, also known as chromosome 8 open reading frame 47 (C8orf47).
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
C16orf82 is a protein that, in humans, is encoded by the C16orf82 gene. C16orf82 encodes a 2285 nucleotide mRNA transcript which is translated into a 154 amino acid protein using a non-AUG (CUG) start codon. The gene has been shown to be largely expressed in the testis, tibial nerve, and the pituitary gland, although expression has been seen throughout a majority of tissue types. The function of C16orf82 is not fully understood by the scientific community.
Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
Transmembrane protein 39B (TMEM39B) is a protein that in humans is encoded by the gene TMEM39B. TMEM39B is a multi-pass membrane protein with eight transmembrane domains. The protein localizes to the plasma membrane and vesicles. The precise function of TMEM39B is not yet well-understood by the scientific community, but differential expression is associated with survival of B cell lymphoma, and knockdown of TMEM39B is associated with decreased autophagy in cells infected with the Sindbis virus. Furthermore, the TMEM39B protein been found to interact with the SARS-CoV-2 ORF9C protein. TMEM39B is expressed at moderate levels in most tissues, with higher expression in the testis, placenta, white blood cells, adrenal gland, thymus, and fetal brain.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
C15orf54 is a protein in humans that is encoded by the C6orf54 gene. This gene is mostly conserved in mammals, primarily primates. While the function of the gene is currently unknown, the gene has shown high expression in the prostate, thymus, appendix, bone marrow, and lungs.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
C4orf19 is a protein which in humans is encoded by the C4orf19 gene.
Transmembrane epididymal protein 1 is a transmembrane protein encoded by the TEDDM1 gene. TEDDM1 is also commonly known as TMEM45C and encodes 273 amino acids that contains six alpha-helix transmembrane regions. The protein contains a 118 amino acid length family of unknown function. While the exact function of TEDDM1 is not understood, it is predicted to be an integral component of the plasma membrane.
Transmembrane Protein 269 (TMEM269) is a protein which in humans is encoded by the TMEM269 gene.
Chromosome 12 open reading frame 71 (c12orf71) is a protein which in humans is encoded by c12orf71 gene. The protein is also known by the alias LOC728858.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.