CCDC144A

Identifiers
Aliases: CCDC144A, coiled-coil domain containing 144A
External IDs: HomoloGene: 108248; GeneCards: CCDC144A

Orthologs (human; mouse: n/a)
RefSeq (mRNA): NM_014695, NM_001382000
RefSeq (protein): NP_055510, NP_001368929
Location (UCSC): Chr 17: 16.69 – 16.78 Mb
PubMed search: [2]

Coiled-coil domain-containing protein 144A is a protein that in humans is encoded by the CCDC144A gene. [3] The gene is also known by the alias KIAA0565. There are four members of the CCDC144 family: CCDC144A, CCDC144B, CCDC144C, and the putative CCDC144 N-terminal-like protein. [4]


Gene

The gene's nucleotide sequence is 5140 bp long and encodes a protein of 641 amino acids. [5] It is located on the plus (forward) strand of the short arm of chromosome 17, at 17p11.2. [6] [7] The mRNA for the CCDC144A gene has three alternatively spliced isoforms, named A2RUR9-1, A2RUR9-2, and A2RUR9-3, but none has been experimentally confirmed yet. [8]

Protein

The protein encoded by this gene is also known as coiled-coil domain containing 144A (CCDC144A) protein. It consists of 641 amino acids. [9] It weighs 75.8 kDa and has an isoelectric point of 6.357. [10] The protein localizes near the nucleus [11] and is soluble, with a hydrophobicity of -1.021842. [12] It is non-secretory [13] and has 10 potential serine and 3 potential threonine phosphorylation sites. [14] There are no tyrosine sulfation sites, [15] but there are a few potential sumoylation sites. [16] [17] The protein is also predicted to be non-myristoylated [18] and does not contain a signal peptide. [13] [19]

Structure

This protein has a domain of unknown function (DUF) 3496, which is conserved in eukaryotes. [20] The DUF3496 domain spans amino acids 547-622. [9] The name CCDC144A indicates that there should be a coiled-coil domain within the protein. Coiled coils are structural motifs in which two or more alpha helices are coiled together; they usually contain a heptad repeat, hxxhcxc, of hydrophobic (h) and charged (c) amino acid residues. [7] The 5' and 3' untranslated regions of the nucleotide sequence of this gene are rich in stem-loop structures. [21] In place of a coiled coil, a leucine zipper was found. [11] Residues 478-499, "LHNTRDALGRESLILERVQRDL", form the leucine zipper pattern. [11] The structure of this protein consists mostly of alpha helices, with some random coils. [22]
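The leucine zipper claim can be checked directly against the quoted residues. The sketch below is a minimal illustration, assuming the conventional leucine zipper pattern L-x(6)-L-x(6)-L-x(6)-L (a leucine at every seventh position); the constant and function names are hypothetical and chosen only for this example, and a pattern match is not evidence that the segment actually dimerizes.

```python
import re

# Residues 478-499 of the CCDC144A protein, as quoted in the text.
SEGMENT = "LHNTRDALGRESLILERVQRDL"
SEGMENT_START = 478  # protein coordinate of the first residue in SEGMENT

# Conventional leucine zipper pattern (assumed here): L-x(6)-L-x(6)-L-x(6)-L
LEUCINE_ZIPPER = re.compile(r"L.{6}L.{6}L.{6}L")


def leucine_positions(segment: str, start: int) -> list[int]:
    """Protein coordinates of leucines found at every 7th residue of the segment."""
    return [start + i for i in range(0, len(segment), 7) if segment[i] == "L"]


match = LEUCINE_ZIPPER.fullmatch(SEGMENT)
positions = leucine_positions(SEGMENT, SEGMENT_START)
print(match is not None, positions)
```

The segment is 22 residues long, which is exactly the length of four leucines separated by three six-residue spacers, so the pattern covers residues 478-499 end to end.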

Evolution

Figure: Unrooted phylogenetic tree of KIAA0565 and orthologous proteins. The numbers in the tree correspond to the following species:

  1. Nine-banded armadillo
  2. Cow
  3. Flying fox
  4. Mouse eared bat
  5. Chimpanzee
  6. Treeshrew
  7. House mouse
  8. Chinese hamster
  9. Naked mole rat
  10. Rhesus monkey
  11. Crab-eating macaque
  12. Human KIAA0565
  13. Platypus
  14. Western clawed frog
  15. Pufferfish
  16. Carolina anole
  17. Zebra finch

Orthologs of the KIAA0565 protein have been identified mostly in mammals, but also in some birds, reptiles, amphibians, and fish. [23]

Potential Orthologs

| Protein name | Genus and species | Common name | E-value | Query cover (%) | Max identity (%) | Accession number |
|---|---|---|---|---|---|---|
| CCDC 144A | Macaca fascicularis | Crab-eating macaque | 0 | 97 | 86 | EHH57800.1 [9] |
| CCDC 144A, Partial | Macaca mulatta | Rhesus monkey | 0 | 97 | 86 | EHH24608.1 [9] |
| ANKRD 26 | Pan troglodytes | Common chimpanzee | 2e-160 | 96 | 67 | JAA07196.1 [9] |
| ANKRD 26, Predicted | Dasypus novemcinctus | Nine-banded armadillo | 1e-158 | 96 | 65 | XP_004470808.1 [9] |
| ANKRD 26 | Myotis davidii | Mouse eared bat | 2e-154 | 96 | 64 | ELK35935.1 [9] |
| ANKRD 26 | Bos taurus | Cow | 2e-157 | 96 | 63 | NP_001107239.1 [9] |
| ANKRD 26 | Tupaia chinensis | Treeshrew | 3e-147 | 96 | 62 | ELW73004.1 [9] |
| ANKRD 26 | Cricetulus griseus | Chinese hamster | 1e-145 | 96 | 60 | EGW08323.1 [9] |
| ANKRD 26 | Heterocephalus glaber | Naked mole rat | 2e-138 | 96 | 59 | EHB01988.1 [9] |
| ANKRD 26 | Mus musculus | House mouse | 4e-141 | 96 | 57 | NP_001074581.1 [9] |
| ANKRD 26, Partial | Pteropus alecto | Black flying fox | 2e-171 | 97 | 51 | ELK03279.1 [9] |
| ANKRD 26-Like, Predicted | Ornithorhynchus anatinus | Platypus | 2e-108 | 96 | 51 | XP_001509663.2 [9] |
| ANKRD 26-Like, Predicted | Taeniopygia guttata | Zebra finch | 3e-88 | 92 | 45 | XP_004177264.1 [9] |
| ANKRD 26-Like, Predicted | Anolis carolinensis | Carolina anole | 2e-75 | 97 | 44 | XP_003221333.1 [9] |
| ANKRD 26, Predicted | Xenopus tropicalis | Western clawed frog | 2e-78 | 98 | 44 | XP_002935004.1 [9] |
| Unnamed Protein Product | Tetraodon nigroviridis | Pufferfish | 1e-28 | 98 | 34 | CAF98417.1 [9] |

[23]
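As a rough illustration of how sequence identity falls off with evolutionary distance in the table above, the following sketch groups the listed orthologs by vertebrate class and reports the identity range for each group. The identity values are transcribed from the table; the class labels are an added annotation rather than part of the source data, and the function name is hypothetical.

```python
# (common name, max identity %, vertebrate class) transcribed from the ortholog table.
# The class labels are an added annotation, not part of the source table.
ORTHOLOGS = [
    ("Crab-eating macaque", 86, "mammal"),
    ("Rhesus monkey", 86, "mammal"),
    ("Common chimpanzee", 67, "mammal"),
    ("Nine-banded armadillo", 65, "mammal"),
    ("Mouse eared bat", 64, "mammal"),
    ("Cow", 63, "mammal"),
    ("Treeshrew", 62, "mammal"),
    ("Chinese hamster", 60, "mammal"),
    ("Naked mole rat", 59, "mammal"),
    ("House mouse", 57, "mammal"),
    ("Black flying fox", 51, "mammal"),
    ("Platypus", 51, "mammal"),
    ("Zebra finch", 45, "bird"),
    ("Carolina anole", 44, "reptile"),
    ("Western clawed frog", 44, "amphibian"),
    ("Pufferfish", 34, "fish"),
]


def identity_range_by_class(rows):
    """Map each class to the (lowest, highest) max-identity percentage among its rows."""
    ranges = {}
    for _name, identity, cls in rows:
        low, high = ranges.get(cls, (identity, identity))
        ranges[cls] = (min(low, identity), max(high, identity))
    return ranges


ranges = identity_range_by_class(ORTHOLOGS)
for cls, (low, high) in ranges.items():
    print(f"{cls}: {low}-{high}% identity")
```

Twelve of the sixteen listed sequences are mammalian, which matches the statement that orthologs were identified mostly in mammals.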

Clinical significance

This gene has been linked to Smith-Magenis Syndrome (SMS), which is also known as chromosome 17p11.2 deletion syndrome, [24] chromosome 17p deletion syndrome, [25] deletion 17p syndrome, [25] partial monosomy 17p, [25] and deletion abnormality. [26] [27]

Interacting proteins

Two proteins may interact with KIAA0565: ubiquitin specific peptidase 32 (USP32) and ubiquitin specific peptidase 25 (USP25). [28]

Expression

This protein has been shown to have relatively low expression in all tissues. [29]


References

  1. GRCh38: Ensembl release 89: ENSG00000170160 - Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. "NCBI: Gene".
  4. "NeXtProt".
  5. "NCBI".
  6. "NCBI: Gene".
  7. "GeneCards".
  8. "GenBank: The Human Gene Compendium".
  9. "NCBI: Protein".
  10. "Biology Workbench". [permanent dead link]
  11. "PSORTII".
  12. "SOSUI Hydrophobicity". Archived from the original on 2004-03-18. Retrieved 2013-05-11.
  13. "ExPASy: SignalP".
  14. "ExPASy: NetPhos".
  15. "ExPASy: Sulfinator".
  16. "ExPASy: SUMOplot".
  17. "ExPASy: SUMOsp".
  18. "ExPASy: Myristoylator".
  19. "ExPASy: NetNGlyc".
  20. "The European Bioinformatics Institute".
  21. "MFOLD".
  22. "PELE: Biology Workbench".
  23. "BLASTp".
  24. "NIH Rare Diseases".
  25. "Genetics Home Reference".
  26. "Unified Medical Language System".
  27. "MalaCards".
  28. "Search Tool for the Retrieval of Interacting Genes/Proteins".
  29. "GEO Profiles".