CCDC180

Last updated
CCDC180
Identifiers
Aliases
External IDs GeneCards:
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

XM_011733563

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC)n/an/a
PubMed search [1] n/a
Wikidata
View/Edit Human

Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. [2] This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.

Contents

Gene

The location of CCDC180 on human chromosome 9 within locus 9q22.33 is marked with a line. Gene location of CCDC180.jpg
The location of CCDC180 on human chromosome 9 within locus 9q22.33 is marked with a line.

Locus

CCDC180 is located on chromosome 9 at the locus 9q22.33. [2]

Common aliases

CCDC180 is also known by the aliases KIAA1529, BDAG1 (Behçet's Disease Associated Gene 1), and C9orf174. [2]

Gene features

The CCDC180 gene is 71,221 bases long. It contains 37 exons and is oriented on the forward strand of the chromosome. [3]

mRNA

There are no known isoforms or alternative splicing variants of the CCDC180 mRNA. [3]

Protein

General features

CCDC180 contains 1,701 amino acids [4] and has a molecular weight of 197.3 kDa. The isoelectric point (pI) is 5.74. The low pI is attributed to a relatively high concentration of glutamic acid when compared to other human proteins at 12.9%. CCDC180 also contains a relatively low concentration of glycine when compared to the average human protein at 3.5%. [5]

Domains

Here are shown the major domains present within the protein CCDC180: Domains of Unknown Function 4455 and 4456, two coiled-coil domains, and a glutamic acid rich domain. Diagram of Domains in CCDC180.jpg
Here are shown the major domains present within the protein CCDC180: Domains of Unknown Function 4455 and 4456, two coiled-coil domains, and a glutamic acid rich domain.

CCDC180 contains two domains of unknown function (DUFs): DUF4455 and DUF4456. There are also two coiled-coil regions which overlap with the DUFs. There is a region of low complexity that is very rich in glutamic acid.

A model of CCDC180 secondary and tertiary structure predicted by the University of Michigan I-TASSER server Predicted Structure of CCDC180.gif
A model of CCDC180 secondary and tertiary structure predicted by the University of Michigan I-TASSER server

Secondary and tertiary structure

The secondary structure of CCDC180 is predicted to be almost completely composed of alpha helices, with only a few predicted beta sheets. [7] The tertiary structure is not completely characterized as yet, but a model predicted by the I-TASSER server at the University of Michigan is pictured.

Translation of the mRNA and protein sequence for the human protein CCDC180, including domains, secondary structure, exon splice sites, and post-translation modification sites Conceptual Translation of CCDC180.jpg
Translation of the mRNA and protein sequence for the human protein CCDC180, including domains, secondary structure, exon splice sites, and post-translation modification sites

Post-translational modifications

CCDC180 is predicted to undergo a variety of post-translational modifications:

ModificationPositionContext
Serine Phosphorylation195KARESENTI
Serine Phosphorylation627LRQQSDKET
Serine Phosphorylation680SSALSQYFF
Serine Phosphorylation734RSEESISSG
Serine Phosphorylation961NELDSELEL
Serine Phosphorylation1069VTQVSLRSF
Serine Phosphorylation1087KLRYSNIEF
Serine Phosphorylation1105GGNFSPKEI
Serine Phosphorylation1381QPENSGKKA
Serine Phosphorylation1396TSAGSFTPH
Serine Phosphorylation1526KFFTSKVEI
Serine Phosphorylation1649LAGLSLKEE
Serine Phosphorylation1663IERGSRKWP
Threonine Phosphorylation521WKAFTEEEA
Threonine Phosphorylation1621DEVVTIDDV
Threonine Phosphorylation1690SSISTTKTT
Tyrosine Phosphorylation345EKTSYLMRP
Tyrosine Phosphorylation650MKSRYECFH
Tyrosine Phosphorylation1141LENEYLDQA
Tyrosine Phosphorylation1447AEEFYRKEK
Tyrosine Phosphorylation1485QANKYHNSC
Sumoylation89ERSVTLKSGRIPMM
Sumoylation137REKERAKREKARES
Sumoylation355DTWKALKKEALLQS
Sumoylation492VGALQGKVEEDLEL
Sumoylation1590LAGLSLKEESEKPL
Serine O-linked β-N-acetylglucosamine1635KQKLSMLIRR

Subcellular localization

CCDC180 is predicted to localize to the nucleus, and it contains four nuclear localization sequences. [12]

Expression

GEO Profile for the expression of the human protein CCDC180 in normal tissues CCDC180 GEO Profile.png
GEO Profile for the expression of the human protein CCDC180 in normal tissues

CCDC180 is expressed ubiquitously at low levels throughout the body, and the highest expression is consistently seen to be in the testes. Other replicated tissues of high expression include the trachea and eye. [13] [14]

Regulation of expression

Transcriptional regulation

Transcription of CCDC180 is predicted to be regulated by a 664 base pair promoter region, with the ID GXP_1829211. This prediction is supported by the transcripts GXT_23217882, GXT_24495001, GXT_24495002, and GXT_24495003. Transcription factors predicted to bind to this promoter region are described below. [15]

Interacting proteins

The following proteins have been shown to interact with CCDC180 in yeast two-hybrid assays. [16]

ProteinFunction
Y box binding protein 1 DNA and RNA binding, transcriptional and translational regulation [17]
Mitotic checkpoint serine/threonine kinase Assists in spindle assembly during mitosis [18]
B-cell CLL/Lymphoma 10 Induces apoptosis and activates NF-κB [19]
Neuroblastoma RAS viral oncogene homolog Regulates cell division [20]
Erb-B2 receptor tyrosine kinase 2 Stabilizes ligand binding in other EGF receptor family members [21]
Retinoblastoma 1 Negative regulation of cell cycle progression [22]
Proto-oncogene tyrosine-protein kinase Src Signal transduction [23]
Mutated in colorectal cancers Negative regulation of cell cycle progression [24]
Catenin alpha 1 Associates with cadherins [25]
mutL homolog 1 DNA repair [26]
Postmeitotic segregation increased 2 DNA mismatch repair [27]
Phosphatase and tensin homolog Tumor suppression [28]
Protein tyrosine phosphatase, non-receptor type 12 Signal transduction [29]
Mothers against decapentaplegic homolog 4 Transcription regulation [30]
Platelet-derived growth factor receptor-like Unknown, possible tumor suppression [30]
Serine/threonine kinase 11 Regulates cell polarity [31]
Cycline-dependent kinase inhibitor 2A Stabilizes p53 [32]
Folliculin Unknown, possible tumor suppression [33]
DLC Rho GTPase activating protein Regulates small GTP-binding proteins [34]
mutL homolog 3 DNA repair [35]

Clinical significance

A single-nucleotide polymorphism (SNP) in the gene that leads to a single amino acid change (S995C) has been shown in a genome-wide association study to be significantly associated with Behçet's disease, and this designation led to the alias Behcet's disease-associated gene 1 (BDAG1). [36] The role of CCDC180 in the disease phenotype is unknown.

Homology

There are no paralogs in humans for this gene, but there are orthologs in a wide variety of organisms, extending back to single-celled green algae. CCDC180 is not conserved in bacteria, archaea, plants, fungi, or protists. The following table includes a subset of species containing protein orthologs of CCDC180. It is not exhaustive, but it indicates the variety of species containing orthologs of CCDC180.

Genus and SpeciesCommon NameDivergence from

Humans [37]

Accession #Sequence

Length

% Identity% Similarity
Homo sapiens Human- NP_065944.2 1701--
Pan paniscus Bonobo6.6 mya XP_008972301.1 170399%99%
Capra hircus Goat97.5 mya XP_013821462.1 174670%83%
Physeter cotodon Sperm whale97.5 mya XP_007131156.1 174472%84%
Struthio camelus Ostrich320.5 mya XP_009664045.1 160539%58%
Apteryx australis Brown kiwi320.5 mya XP_013797236.1 160640%60%
Alligator sinensis Chinese alligator320.5 mya XP_006029881.1 155840%59%
Gekko japonicus Gecko320.5 mya XP_015266758.1 163840%58%
Thamnophis sirtalis Garter snake320.5 mya XP_013926700.1 55641%56%
Chelonia mydas Green sea turtle320.5 mya XP_007061172.1 163245%68%
Salmo salar Atlantic salmon429.6 mya XP_014027541.1 148838%54%
Lepisosteus oculatus Spotted gar429.6 mya XP_015222467.1 148040%59%
Ciona intestinalis Sea squirt733.0 mya XP_002123678.2 157132%51%
Branchiostoma floridae Lancelet733.0 mya XP_002609423.1 151533%50%
Saccoglossus kowalevskii Acorn worm747.8 mya XP_002742433.1 152333%53%
Priapulida caudatus Priapulid worm847.0 mya XP_014672086.1 129328%46%
Crassostrea gigas Pacific oyster847.0 mya XP_011430927.1 114433%51%
Lottia gigantea Owl limpet847.0 mya XP_009044533.1 88634%52%
Lingula anatina Brachiopod847.0 mya XP_013409374.1 152335%53%
Chlamydomonas reinhardtii Chlamydomonas1513.9 mya XP_001694909.1 154420%40%
Salpingoeca rosetta Choanoflagellate1724.7 mya XP_004997848.1 151424%49%

Evolutionary history

The amino acid changes per 100 (m) in a selection of orthologs of CCDC180 versus time of divergence of the species from human in millions of years. This is compared to Cytochrom C and Fibrinogen to indicate the relatively high speed of evolution of the CCDC180 protein. Speed of evolution of CCDC180.jpg
The amino acid changes per 100 (m) in a selection of orthologs of CCDC180 versus time of divergence of the species from human in millions of years. This is compared to Cytochrom C and Fibrinogen to indicate the relatively high speed of evolution of the CCDC180 protein.

CCDC180 is a relatively quickly-evolving gene compared to other well-known genes. There are no known family members, splice variants or isoforms, or evidence of gene duplications in the history of the gene.

Related Research Articles

<span class="mw-page-title-main">WWC2</span> Protein-coding gene in the species Homo sapiens

WW and C2 domain containing 2 (WWC2) is a protein that in humans is encoded by the WWC2 gene (4q35.1). Though function of WWC2 remains unknown, it has been predicted that WWC2 may play a role in cancer.

<span class="mw-page-title-main">FAM63A</span> Protein-coding gene in the species Homo sapiens

Family with sequence similarity 63, member A is a protein that, is encoded by the FAM63A gene in humans,. It is located on the minus strand of chromosome 1 at locus 1q21.3.

KIAA0753 is a protein that in humans is encoded by the gene KIAA0753. The gene is located on chromosome 17p13.1, on the reverse strand spanning bases 6578141 to 6641744. The KIAA0753 gene contains 18 exons, 19 introns, and has no known aliases.

C20orf96 is a protein-coding gene in humans. It codes for an unknown protein known as uncharacterized protein C20orf96, predicted to be a nuclear protein. The function and biological processes of the gene is not well understood by the scientific community yet.

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

<span class="mw-page-title-main">ANKRD24</span> Protein-coding gene in the species Homo sapiens

Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.

TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">TMEM176B</span> Protein-coding gene in the species Homo sapiens

Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.

Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.

Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue.

<span class="mw-page-title-main">TMCO4</span> Protein-coding gene in the species Homo sapiens

Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.

The Family with sequence similarity 149 member B1 is an uncharacterized protein encoded by the human FAM149B1 gene, with one alias KIAA0974. The protein resides in the nucleus of the cell. The predicted secondary structure of the gene contains multiple alpha-helices, with a few beta-sheet structures. The gene is conserved in mammals, birds, reptiles, fish, and some invertebrates. The protein encoded by this gene contains a DUF3719 protein domain, which is conserved across its orthologues. The protein is expressed at slightly below average levels in most human tissue types, with high expression in brain, kidney, and testes tissues, while showing relatively low expression levels in pancreas tissues.

<span class="mw-page-title-main">C17orf53</span>

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C15orf39</span>

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

<span class="mw-page-title-main">TMEM44</span> Protein-coding gene in the species Homo sapiens

TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.

<span class="mw-page-title-main">TEDC2</span> Protein-coding gene in the species Homo sapiens

Tubulin epsilon and delta complex 2 (TEDC2), also known as Chromosome 16 open reading frame 59 (C16orf59), is a protein that in humans is encoded by the TEDC2 gene. Its NCBI accession number is NP_079384.2.

<span class="mw-page-title-main">TMEM221</span> Protein

Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.

C3orf56 is a protein encoding gene found on chromosome 3. Although, the structure and function of the protein is not well understood, it is known that the C3orf56 protein is exclusively expressed in metaphase II of oocytes and degrades as the oocyte develops towards the blastocyst stage. Degradation of the C3orf56 protein suggests that this gene plays a role in the progression from maternal to embryonic genome and in embryonic genome activation.

References

  1. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  2. 1 2 3 "www.genecards.org/cgi-bin/carddisp.pl?gene=CCDC180&keywords=CCDC180". www.genecards.org. Retrieved 2016-05-06.
  3. 1 2 "CCDC180 coiled-coil domain containing 180 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  4. "coiled-coil domain-containing protein 180 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  5. "SDSC Biology Workbench" . Retrieved 2016-05-09.[ permanent dead link ]
  6. "I-TASSER results". zhanglab.ccmb.med.umich.edu. Archived from the original on 2016-08-13. Retrieved 2016-05-10.
  7. Kelley L. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2016-05-09.
  8. "NetPhos 2.0 Server". www.cbs.dtu.dk. Retrieved 2016-05-09.
  9. "ExPASy - Sulfinator tool". web.expasy.org. Retrieved 2016-05-09.
  10. "SUMOplot™ Analysis Program | Abgent". www.abgent.com. Retrieved 2016-05-09.
  11. "YinOYang 1.2 Server". www.cbs.dtu.dk. Retrieved 2016-05-09.
  12. "PSORT II Prediction". psort.hgc.jp. Retrieved 2016-05-10.
  13. 1 2 "Home - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-10.
  14. "Home - EST - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-10.
  15. "Genomatix: Gene2Promoter Result". www.genomatix.de. Retrieved 2016-05-10.[ permanent dead link ]
  16. IntAct. "www.ebi.ac.uk/intact/". www.ebi.ac.uk. Retrieved 2016-05-10.
  17. "www.genecards.org/cgi-bin/carddisp.pl?gene=YBX1". www.genecards.org. Retrieved 2016-05-10.
  18. "BUB1 - Mitotic checkpoint serine/threonine-protein kinase BUB1 - Homo sapiens (Human) - BUB1 gene & protein". www.uniprot.org. Retrieved 2016-05-10.
  19. "www.genecards.org/cgi-bin/carddisp.pl?gene=BCL10". www.genecards.org. Retrieved 2016-05-10.
  20. Reference GH. "NRAS". Genetics Home Reference. Retrieved 2016-05-10.
  21. "www.genecards.org/cgi-bin/carddisp.pl?gene=ERBB2". www.genecards.org. Retrieved 2016-05-10.
  22. "www.genecards.org/cgi-bin/carddisp.pl?gene=RB1". www.genecards.org. Retrieved 2016-05-10.
  23. "SRC - Proto-oncogene tyrosine-protein kinase Src - Homo sapiens (Human) - SRC gene & protein". www.uniprot.org. Retrieved 2016-05-10.
  24. "www.genecards.org/cgi-bin/carddisp.pl?gene=MCC". www.genecards.org. Retrieved 2016-05-10.
  25. "www.genecards.org/cgi-bin/carddisp.pl?gene=CTNNA1". www.genecards.org. Retrieved 2016-05-10.
  26. Reference GH. "MLH1". Genetics Home Reference. Retrieved 2016-05-10.
  27. "www.genecards.org/cgi-bin/carddisp.pl?gene=PMS2". www.genecards.org. Retrieved 2016-05-10.
  28. "PTEN phosphatase and tensin homolog [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-10.
  29. "PTPN12 protein tyrosine phosphatase, non-receptor type 12 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-10.
  30. 1 2 "www.genecards.org/cgi-bin/carddisp.pl?gene=SMAD4". www.genecards.org. Retrieved 2016-05-10.
  31. "STK11 serine/threonine kinase 11 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-10.
  32. "www.genecards.org/cgi-bin/carddisp.pl?gene=CDKN2A". www.genecards.org. Retrieved 2016-05-10.
  33. "www.genecards.org/cgi-bin/carddisp.pl?gene=FLCN". www.genecards.org. Retrieved 2016-05-10.
  34. "www.genecards.org/cgi-bin/carddisp.pl?gene=DLC1". www.genecards.org. Retrieved 2016-05-10.
  35. "www.genecards.org/cgi-bin/carddisp.pl?gene=MLH3". www.genecards.org. Retrieved 2016-05-10.
  36. Vieira AR, McHenry TG, Daack-Hirsch S, Murray JC, Marazita ML (September 2008). "Candidate gene/loci studies in cleft lip/palate and dental anomalies finds novel susceptibility genes for clefts". Genetics in Medicine. 10 (9): 668–74. doi:10.1097/GIM.0b013e3181833793. PMC   2734954 . PMID   18978678.
  37. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2016-05-10.