SH3D21 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | SH3D21 , C1orf113, SH3 domain containing 21 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1914188 HomoloGene: 12057 GeneCards: SH3D21 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
SH3D21 is a nuclear protein that is encoded by the SH3D21 gene. In humans, this gene is located on chromosome 1 p34.3. [5] The human mRNA transcript is 2527 base pairs and the final protein product is 756 amino acids. [6] While the exact function of this protein remains unknown, due to the presence of three SH3 domains, it has been implicated in protein-protein interactions. [7]
SH3D21 is expressed in low levels in most tissue. [8] Microarray analysis has shown SH3D21 expression to be decreased in TP63 knockout mice. [9] SH3D21 has been shown to be expressed highly in the superior cervical ganglion, the dorsal root ganglia and the trigeminal ganglion. [8] [10] Transcription of SH3D21 is known to be upregulated in the presence of testosterone. [11]
SH3D21 contains three SH3 domains. [7] [12] [13] These domains are located near the N-terminus of the protein. In humans, these SH3 domains have a common amino acid sequence Asp-Glu-Leu. This sequence motif is also conserved in other species. SH3D21 has been found to interact with Adenylate Kinase 2, Artemin, and Importin 13. [5] The human protein has two isoforms and no paralogs. [6] The second isoform is 645 amino acids long and is identical to the first isoform, except it is missing the first 111 amino acids. [14] Due to this, the second isoform is missing the first, and half of the second, N-terminal SH3 domain. [14] Secondary structure analysis of SH3D21 indicates a long alpha helical structure near the C-terminus. [15] [16] The purpose of this structure is unknown. SH3D21 is predicted to have many phosphorylation sites and multiple sumoylation sites throughout the entirety of the protein. [17] [18]
The function of this gene is still unclear. However, research has linked SH3D21 expression changes to male infertility and Ataxia Telangiectasia. [19] [20] Further studies have implicated the chromosomal region of 1p34.3 in Intracranial Aneurysm and as a negative prognosis sign in colorectal cancer. [21] [22] These studies do not, however, directly mention SH3D21.
SH3D21 is well-conserved in mammals. BLAST analysis found distant orthologs in Osteichthyes with a max identity of 28%. [23] Sequence identity was calculated using available sequence data and ALIGN software. [24]
Species | Species common name | NCBI Accession Number (Protein) | Length (aa) | Sequence Identity |
---|---|---|---|---|
Homo sapiens | Human | NP_001156002 | 756aa | 100% |
Gorilla gorilla | Gorilla | XP_004025512 | 761 aa | 97.1% |
Pongo abelii | Orangutan | XP_002811093 | 755aa | 94.9% |
Macaca mulatta | Macaques | XP_001110607 | 755aa | 91.4% |
Papia anubir | Olive Baboon | XP_003891645/ | 761aa | 91.2% |
Saimiri boliviensis | Black Capped Squirrel Monkey | XP_003308029 | 650aa | 82.0% |
Bos taurus | Cattle | NP_001156006 | 676aa | 58.70% |
Cavia porcellus | Guinea pig | XP_003471528 | 658aa | 52.60% |
Oreochromis niloticus | Nile Talapia | XP_003450596 | 505aa | 28.1% |
SOGA2, also known as Suppressor of glucose autophagy associated 2 or CCDC165, is a protein that in humans is encoded by the SOGA2 gene. SOGA2 has two human paralogs, SOGA1 and SOGA3. In humans, the gene coding sequence is 151,349 base pairs long, with an mRNA of 6092 base pairs, and a protein sequence of 1586 amino acids. The SOGA2 gene is conserved in gorilla, baboon, galago, rat, mouse, cat, and more. There is distant conservation seen in organisms such as zebra finches and anoles. SOGA2 is ubiquitously expressed in humans, with especially high expression in brain, colon, pituitary gland, small intestine, spinal cord, testis and fetal brain.
Coiled-coil domain-containing protein 113 also known as HSPC065, GC16Pof6842 and GC16P044152, is a protein that in humans is encoded by the CCDC113 gene. The human CCDC113 gene is located on chromosome 16q21 and encodes 5,304 base pairs of mRNA and 377 amino acids.
MORN1 containing repeat 1, also known as Morn1, is a protein that in humans is encoded by the MORN1 gene.
UPF0172 protein FAM158A, also known as c14orf122 or CGI112, is a protein that in humans is encoded by the FAM158A gene located on chromosome 14q11.2.
Transmembrane protein 131-like, alternatively named uncharacterized protein KIAA0922, is an integral transmembrane protein encoded by the human gene KIAA0922 that is significantly conserved in eukaryotes, at least through protists. Although the function of this gene is not yet fully elucidated, initial microarray evidence suggests that it may be involved in immune responses. Furthermore, its paralog, prolyl endopeptidase (PREP) whose function is known, provides clues as to the function of TMEM131L.
Alpha-1-B glycoprotein is a 54.3 kDa protein in humans that is encoded by the A1BG gene. The protein encoded by this gene is a plasma glycoprotein of unknown function. The protein shows sequence similarity to the variable regions of some immunoglobulin supergene family member proteins. Patients who have pancreatic ductal adenocarcinoma show an overexpression of A1BG in pancreatic juice.
ARMH3 or Armadillo Like Helical Domain Containing 3, also known as UPF0668 and c10orf76, is a protein that in humans is encoded by the ARMH3 gene. Its function is not currently known, but experimental evidence has suggested that it may be involved in transcriptional regulation. The protein contains a conserved proline-rich motif, suggesting that it may participate in protein-protein interactions via an SH3-binding domain, although no such interactions have been experimentally verified. The well-conserved gene appears to have emerged in Fungi approximately 1.2 billion years ago. The locus is alternatively spliced and predicted to yield five protein variants, three of which contain a protein domain of unknown function, DUF1741.
Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.
Tetratricopeptide repeat protein 39B is a protein that in humans is encoded by the TTC39B gene. TTC39B is also known as C9orf52 or FLJ33868. The main feature within tetratricopeptide repeat 39B is the domain of unknown function 3808 (DUF3808), spanning the majority of the protein.
Family with sequence similarity 167, member A is a protein in humans that is encoded by the FAM167A gene located on chromosome 8. FAM167A and its paralogs are protein encoding genes containing the conserved domain DUF3259, a protein of unknown function. FAM167A has many orthologs in which the domain of unknown function is highly conserved.
DEP Domain Containing Protein 1B also known as XTP1, XTP8, HBV XAg-Transactivated Protein 8, [formerly referred to as BRCC3] is a human protein encoded by a gene of similar name located on chromosome 5.
FAM71F2 or Family with Sequence Similarity 71 member F2 is a protein that in humans is encoded by the Family with Sequence Similarity 71 member F2 gene. This gene is highly active in the reproductive tissues, specifically the testis, and may serve as a potential biomarker for determining metastatic testicular cancer.
Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence.
C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.
Family with Sequence Similarity 155 Member B is a protein in humans that is encoded by the FAM155B gene. It belongs to a family of proteins whose function is not yet well understood by the scientific community. It is a transmembrane protein that is highly expressed in the heart, thyroid, and brain.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
Coiled-Coil Domain Containing 190, also known as C1orf110, the Chromosome 1 Open Reading Frame 110, MGC48998 and CCDC190, is found to be a protein coding gene widely expressed in vertebrates. RNA-seq gene expression profile shows that this gene selectively expressed in different organs of human body like lung brain and heart. The expression product of c1orf110 is often called Coiled-coil domain-containing protein 190 with a size of 302 aa. It may get the name because a coiled-coil domain is found from position 14 to 72. At least 6 spliced variants of its mRNA and 3 isoforms of this protein can be identified, which is caused by alternative splicing in human.
Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.