CCDC144A | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | CCDC144A , coiled-coil domain containing 144A | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | HomoloGene: 108248; GeneCards: CCDC144A; OMA:CCDC144A - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Coiled-coil domain-containing protein 144A is a protein that in humans is encoded by the CCDC144A gene. [3] An alias of this gene is called KIAA0565. There are four members of the CCDC family: CCDC 144A, 144B, 144C and putative CCDC 144 N-terminal like proteins. [4]
This gene has a nucleotide sequence that is 5140 bp long, and it encodes 641 amino acids. [5] It is found on the short arm, plus (forward) strand of chromosome 17 at p11.2. [6] [7] The mRNA for the CCDC144A gene has 3 alternative splicing isoforms named A2RUR9-1, A2RUR9-2, AND A2RUR9-3, but there is no experimental confirmation available yet. [8]
This protein for this gene is also known as coiled coil domain containing 144A (CCDC144A) protein. It consists of 641 amino acids. [9] This protein weighs 75.8 kDa and has an isoelectric point of 6.357. [10] This protein localizes near the nucleus, [11] and is a soluble protein with a hydrophobicity of -1.021842. [12] This protein is also non-secretory [13] and has 10 potential serine and 3 potential threonine phosphorylation sites. [14] There are no tyrosine sulfation sites, [15] but there are a few potential sumoylation sites on this protein. [16] [17] Also, this protein is predicted to be non-myristoylated [18] and does not contain a signal peptide. [13] [19]
This protein has a domain of unknown function (DUF) 3496, which has been conserved in eukaryotes. [20] The DUF3496 domain is found from amino acids 547-622. [9] CCDC144A, an alias of this gene, indicates that there should be a coiled coil domain within the protein. Coiled coils are structural motifs in proteins in which 2 more alpha helices are coiled together, and they usually contain a heptad repeat, hxxhcxc, or hydrophobic (h) and charge (c) amino acid residues. [7] The 5' and 3' untranslated regions of the nucleotide sequence of this gene are rich in stem-loop structures. [21] In place of a coiled coil, a leucine zipper was found. [11] Residues from 478-499, "LHNTRDALGRESLILERVQRDL", are the residues that form the leucine zipper pattern. [11] The structure of this protein consists of mostly alpha helices, with some random coils. [22]
Number | Species |
---|---|
1 | Nine-banded armadillo |
2 | Cow |
3 | Flying fox |
4 | Mouse eared bat |
5 | Chimpanzee |
6 | Treeshrew |
7 | House mouse |
8 | Chinese hamster |
9 | Naked mole rat |
10 | Rhesus monkey |
11 | Crab-eating macaque |
12 | Human KIAA0565 |
13 | Platypus |
14 | Western clawed frog |
15 | Pufferfish |
16 | Carolina anole |
17 | Zebra finch |
Orthologs of KIAA0565 protein have been identified mostly in mammals, but some birds, reptiles, amphibians, and fish as well. [23]
Protein name | Genus and species | Common name | Ortholog space | Query cover (%) | Max identity (%) | Accession number |
---|---|---|---|---|---|---|
CCDC 144A | Macaca fasicularis | Crab-eating macaque | 0 | 97 | 86 | EHH57800.1 [9] |
CCDC 144A, Partial | Macaca mulatta | Rhesus monkey | 0 | 97 | 86 | EHH24608.1 [9] |
ANKRD 26 | Pan troglodytes | Common chimpanzee | 2e-160 | 96 | 67 | JAA07196.1 [9] |
ANKRD 26, Predicted | Dasypus novemcinctus | Nine-banded armadillo | 1e-158 | 96 | 65 | XP_004470808.1 [9] |
ANKRD 26 | Myotis davidii | Mouse eared bat | 2e-154 | 96 | 64 | ELK35935.1 [9] |
ANKRD 26 | Bos taurus | Cow | 2e-157 | 96 | 63 | NP_001107239.1 [9] |
ANKRD 26 | Tupaia chinensis | Treeshrew | 3e-147 | 96 | 62 | ELW73004.1 [9] |
ANKRD 26 | Cricetulus griseus | Chinese hamster | 1e-145 | 96 | 60 | EGW08323.1 [9] |
ANKRD 26 | Heterocephalus glaber | Naked mole rat | 2e-138 | 96 | 59 | EHB01988.1 [9] |
ANKRD 26 | Mus musculus | House mouse | 4e-141 | 96 | 57 | NP_001074581.1 [9] |
ANKRD 26, Partial | Pteropus alecto | Black flying fox | 2e-171 | 97 | 51 | ELK03279.1 [9] |
ANKRD 26-Like, Predicted | Ornithorhynchus anatinus | Platypus | 2e-108 | 96 | 51 | XP_001509663.2 [9] |
ANKRD 26-Like, Predicted | Taeniopygia guttata | Zebra finch | 3e-88 | 92 | 45 | XP_004177264.1 [9] |
ANKRD 26-Like, Predicted | Anolis carolinensis | Carolina anole | 2e-75 | 97 | 44 | XP_003221333.1 [9] |
ANKRD 26, Predicted | Xenopus tropicalis | Western clawed frog | 2e-78 | 98 | 44 | XP_002935004.1 [9] |
Unnamed Protein Product | Tetraodon nigroviridis | Pufferfish | 1e-28 | 98 | 34 | CAF98417.1 [9] |
This gene has been linked to Smith-Magenis Syndrome (SMS), which is also known as chromosome 17p11.2 deletion syndrome, [24] chromosome 17p deletion syndrome, [25] deletion 17p syndrome, [25] partial monosomy 17p, [25] and deletion abnormality. [26] [27]
There may potentially be two proteins that interact with KIAA0565, and they are ubiquitin specific peptidase 32 (USP32) and ubiquitin specific peptidase 25 (USP25). [28]
This protein has been shown to have relatively low expression in all tissues. [29]
A leucine zipper is a common three-dimensional structural motif in proteins. They were first described by Landschulz and collaborators in 1988 when they found that an enhancer binding protein had a very characteristic 30-amino acid segment and the display of these amino acid sequences on an idealized alpha helix revealed a periodic repetition of leucine residues at every seventh position over a distance covering eight helical turns. The polypeptide segments containing these periodic arrays of leucine residues were proposed to exist in an alpha-helical conformation and the leucine side chains from one alpha helix interdigitate with those from the alpha helix of a second polypeptide, facilitating dimerization.
Transport and golgi organization 2 homolog (TANGO2) also known as chromosome 22 open reading frame 25 (C22orf25) is a protein that in humans is encoded by the TANGO2 gene.
CXorf26, also known as MGC874, is a well conserved human gene found on the plus strand of the short arm of the X chromosome. The exact function of the gene is poorly understood, but the polysaccharide biosynthesis domain that spans a major portion of the protein product, as well as the yeast homolog, YPL225, offer insights into its possible function.
Coiled-Coil Domain Containing 11, also known as CCDC11 is a protein, that is encoded by CCDC11 gene located at chromosome 18 in humans.
Uncharacterized LOC644249 gene., also known as RP11-195B21.3, is about 1058 base pairs long and is found in Homo sapiens on chromosome 9q12. More specifically, the sequence is located on Chromosome: 9; NC_000009.11(67977457..67987991 bp). This gene’s protein product is the “coiled-coil domain-containing protein 29” which is 291 amino acids long and may contain a conserved domain in the superfamily, pfam 12001. In particular, this conserved domain contains the domain of unknown function DUF3496 which is about 110 amino acids long, functionally uncharacterized, and found in eukaryotes. Other possible motifs for the protein product exist but the DUF3496 remains the most likely. This protein may play a role as a transmembrane protein.
Glutamine Serine Rich Protein 1 or QSER1 is a protein encoded by the QSER1 gene.
Coiled coil domain containing protein 120 (CCDC120), also known as JM11 protein, is a protein that, in humans, is encoded by the CCDC120 gene. The function of CCDC120 has not been formally identified but structural components, conservation, and interactions can be identified computationally.
Transmembrane protein 33 is a protein that in humans, is encoded by the TMEM33 gene, also known as SHINC3. Another name for the TMEM33 protein is DB83.
WW and C2 domain containing 2 (WWC2) is a protein that in humans is encoded by the WWC2 gene (4q35.1). Though function of WWC2 remains unknown, it has been predicted that WWC2 may play a role in cancer.
Coiled Coil Domain Containing protein 42B, also known as CCDC42B, is a protein encoded by the protein-coding gene CCDC42B.
Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein PAT complex subunit CCDC47. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.
Family with sequence similarity 167, member A is a protein in humans that is encoded by the FAM167A gene located on chromosome 8. FAM167A and its paralogs are protein encoding genes containing the conserved domain DUF3259, a protein of unknown function. FAM167A has many orthologs in which the domain of unknown function is highly conserved.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
C14orf93 is a protein that is encoded in humans by the C14orf93 gene. It is a globular protein with a conserved C-terminus that is localized to the nucleus. While expressed relatively highly in all tissues except nervous tissue, it is expressed particularly highly in T cells and other immune tissues.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
Transmembrane Protein 217 is a protein encoded by the gene TMEM217. TMEM217 has been found to have expression correlated with the lymphatic system and endothelial tissues and has been predicted to have a function linked to the cytoskeleton.
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
KRBA1 is a protein that in humans is encoded by the KRBA1 gene. It is located on the plus strand of chromosome 7 from 149,411,872 to 149,431,664. It is also commonly known under two other aliases: KIAA1862 and KRAB A Domain Containing 1 gene and encodes the KRBA1 protein in humans. The KRBA family of genes is understood to encode different transcriptional repressor proteins
CCDC188 or coiled-coil domain containing protein is a protein that in humans is encoded by the CCDC188 gene.