SKIDA1 | Identifiers
---|---
Aliases | SKIDA1, C10orf140, DLN-1, SKI/DACH domain containing 1
External IDs | MGI: 1919918, HomoloGene: 66327, GeneCards: SKIDA1
Ski/Dach domain-containing protein 1 is a protein that in humans is encoded by the SKIDA1 gene. [5] It is also known as C10orf140 and DLN-1. It has orthologs in vertebrates. It has two domains: the Ski/Sno/Dac domain and a domain of unknown function, DUF4584. It is associated with multiple types of cancer, including leukemia, ovarian cancer, and colon cancer. [6] [7] It is predicted to be a nuclear protein. [8] It may interact with Polycomb Repressive Complex 2 (PRC2). [9] [10]
SKIDA1 has orthologs in vertebrate species. The species least related to humans with a SKIDA1 ortholog is the lancelet Branchiostoma belcheri. The clades Amphibia and Chondrichthyes each have at least two species with SKIDA1, but SKIDA1 is not found throughout these clades. No orthologs have been found in lungfish or in invertebrates other than the lancelet. [11]
SKIDA1 shares the Ski/Sno/Dac domain with Ski oncogene (Ski), Ski-like protein (Sno), and dachshund (Dac). [12] It shares DUF4584 with Elongin BC Polycomb Repressive Complex 2 associated Protein (EPOP). [5]
In humans, SKIDA1 is located on the reverse strand of chromosome 10 at locus 10p12.31. It contains five exons. [5]
There is no consensus on whether humans have one or two SKIDA1 isoforms. NCBI Gene lists one, while UniProt lists two. [13] [14] It is possible that isoform 2 is recorded in NCBI Gene as DLN-1 (accession BAE93016.1). Isoform 1 is 908 amino acids long, while isoform 2 is 827 amino acids long; isoform 2 is missing amino acids 240-318 from isoform 1. [14] Isoform 1 is predicted to weigh 98 kDa and have an isoelectric point of 8.7, while isoform 2 is predicted to weigh 90 kDa and have an isoelectric point of 7.6. [15]
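As a rough plausibility check on the predicted masses, a protein's mass can be approximated from its length alone. The sketch below assumes a rule-of-thumb average of about 110 Da per residue; this constant and the function name `approx_mass_kda` are illustrative assumptions, not the method behind the cited predictions, which depend on the actual amino acid composition.

```python
# Rule-of-thumb average mass of one amino acid residue, in daltons.
# This is an assumption for illustration; real predictions use the sequence.
AVG_RESIDUE_MASS_DA = 110.0


def approx_mass_kda(length_aa: int) -> float:
    """Approximate a protein's mass in kDa from its length in residues."""
    return length_aa * AVG_RESIDUE_MASS_DA / 1000.0


# Isoform lengths as stated in the text above.
isoform_lengths = {"isoform 1": 908, "isoform 2": 827}

for name, length in isoform_lengths.items():
    print(f"{name}: {length} aa, roughly {approx_mass_kda(length):.0f} kDa")
```

The estimates (roughly 100 kDa and 91 kDa) fall close to the predicted 98 kDa and 90 kDa, which is about as much agreement as a length-only approximation can be expected to give.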
Other mammalian species also have multiple isoforms of SKIDA1, including carnivorans, rodents, and primates. The number of isoforms each species has varies: cheetahs have five recorded isoforms, chimpanzees have three recorded, and brown rats have two recorded. [16]
Human SKIDA1 contains two poly-alanine regions, one poly-histidine region, and one poly-glutamic acid region. [5] It is unknown whether these regions have any function. The poly-alanine and poly-histidine regions are not highly conserved among orthologs; for example, while they are found in the house mouse ortholog, they are not found in the western lowland gorilla ortholog. [17] [18] The poly-glutamic acid region is more conserved, and a shortened form of it is found in species as distantly related to humans as the tire track eel. [19]
SKIDA1 contains two domains: Ski/Sno/Dac and DUF4584. The Ski/Sno/Dac domain is at the N-terminal end of the protein and is also found in the proteins Ski, Ski-like protein, and dachshund. [12] It is potentially a DNA-binding domain. [20]
The other domain, DUF4584, is also found in EPOP, near its C-terminus. However, the DUF4584 found in EPOP is roughly a fifth the size of that in SKIDA1. The C-termini of SKIDA1 (amino acids 844-908) and EPOP (amino acids 313-379) have 52% identity. The C-terminus of EPOP binds to the SUZ12 subunit of Polycomb Repressive Complex 2 (PRC2), suggesting that the C-terminus of SKIDA1 may bind it as well. [9]
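A percent identity figure like the one above is computed from a pairwise alignment. The sketch below shows one common way to do so, assuming identity is counted over aligned columns where neither sequence has a gap; other definitions use a different denominator, and the source does not say which was used. The function name `percent_identity` and the toy sequences are hypothetical and are not the SKIDA1 or EPOP sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over gap-free columns of a pairwise alignment.

    Both inputs must already be aligned (equal length, '-' marks a gap).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0  # columns where both sequences have a residue
    matches = 0   # compared columns where the residues are identical
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == "-" or res_b == "-":
            continue  # skip columns containing a gap
        compared += 1
        if res_a == res_b:
            matches += 1

    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy alignment for illustration only (made-up residues).
toy_a = "MKT-LLA"
toy_b = "MRTALLA"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```

In the toy alignment, five of the six gap-free columns match. Tools that report identity over the full alignment length, gaps included, would give a lower number for the same pair.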
In humans, five potential promoters are predicted. Two align with the second half of the mRNA transcript, suggesting they are not used or only produce an incomplete polypeptide. [21]
The promoter that aligns best with the start of the mRNA transcript is potentially bound by many transcription factors, including Transcription factor II B, Nuclear factor Y, Early growth response 1, and Krueppel-like factor 6. [21] It does not contain a TATA box.
SKIDA1 is regulated by microRNAs. miR-93 binds to the SKIDA1 3'-UTR. [22] Multiple microRNAs are predicted to bind to the SKIDA1 3'-UTR, including miR-130, miR-301, miR-454, and miR-494. [23]
SKIDA1 is SUMOylated at five sites. [24] Additional sites are predicted to be SUMOylated. [25] [26] SKIDA1 is also predicted to be phosphorylated and O-GlcNAcylated. [27] [28]
SKIDA1 is predicted to be localized primarily in the nucleus and less so in the cytosol. [8]
SKIDA1 is expressed at high levels in the brain, thyroid, and testes. It is expressed at medium to low levels in adipose tissue, lymph nodes, and skeletal muscle. [29] [30] [31] [32] In mice, it has medium-to-high expression in the olfactory bulb, retina, and salivary gland. [29]
SKIDA1 expression changes during organism development. Expression is low in the zygote, peaks during embryonic development, and is low after birth. In the house mouse, it is expressed most during organogenesis. [33] In the fetus, its expression is low in the liver but not in other organs. [34] Expression in the adult liver is much higher. In contrast, SKIDA1 expression in the fetal brain is higher than in the adult brain. [32]
In the African clawed frog, SKIDA1 is expressed faintly in the marginal zone of gastrulae. During neurulation, it is expressed in the brain and cranial neural crest. During the tailbud stage, SKIDA1 expression increases in sensory placodes. By the end of the tailbud stage, neural expression has faded except in the olfactory organ. [35]
SKIDA1 is predicted to function primarily in the nucleus and also in the cytosol. [8]
SKIDA1 knockouts in mice have significant differences from wild-type mice in the skeletal, neurological, reproductive, and immune systems. Other significant differences include affected hearing, an enlarged thymus, and increased pre-weaning mortality. [36] Some, but not all, of these effects were found in heterozygous knockouts.
SKIDA1 expression is associated with multiple types of cancer. It is over-expressed in epithelial ovarian cancer cells. [37] Its expression is altered by various cancer-treatment compounds: human alpha-lactalbumin made lethal to tumor cells; oleate salts; metformin; and aspirin. [citation needed] In cancer cell lines, altered expression is associated with resistance to dasatinib and docetaxel, which are used to treat cancer. [38] [39]
Altered methylation of SKIDA1 is associated with human pancreatic cancer, rheumatoid arthritis, and lupus erythematosus. [40] [41] Additionally, SKIDA1 is expressed less in women with Down syndrome compared to their identical twins without Down syndrome. [42] Its expression is dramatically reduced in brains affected by untreated HIV-1-associated neurocognitive disorders (HAND) in comparison to healthy brains and brains affected by HAND but treated with antiretrovirals. [43]