CCDC78 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | CCDC78 , C16orf25, CNM4, JFP10, hsCcdc78, coiled-coil domain containing 78 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 614666 MGI: 2685784 HomoloGene: 105408 GeneCards: CCDC78 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Coiled-coil domain-containing 78 (CCDC78) is a protein in humans encoded by the CCDC78 gene. It has several aliases including C16orf25, FLJ34512, CNM4, and JFP10. [5] It is located on the (-) strand on chromosome 16 (16p13.3). Its gene neighborhood includes NARFL (also on the minus strand), HAGHL, FAM173A, and METRN. The CCDC78 gene is 10,892 base pairs long, and the protein contains 438 amino acids. [6] The protein weighs approximately 4.852 KDal. [7] There are several isoforms, including one indicated with a unique congenital myopathy. [8] Several expression profiles show it has ubiquitous expression at moderate levels. Although no paralogs exist several orthologs do.
The function of this gene is currently unknown. There is evidence that CCDC78 plays a role in skeletal muscle contraction. This is supported by structural similarities to other muscle proteins and by localization assays. CCDC78's predicted structure was similar to that of tropomyosin (see below). [9] The gene product is found primarily in the perinuclear region, the sarcolemmal membrane, and in the reticular pattern of the sarcoplasm. However, localization assays predict it to also be found in the cytoplasm. [8]
General Properties: [6]
Transcript Variants: There are 13 known alternative splicing patterns. [5] These can be seen in the adjacent image. One of these is indicated in disease. [8]
General Properties: [7]
When looking at EST profiles in humans, CCDC78 seems to show ubiquitous expression at moderate levels. [6]
Predicted post-translational modification: Phosphorylation of several serine residues has been predicted by using tools at ExPasy. [10]
Secondary structure of CCDC78 was predicted using the protein secondary structure prediction tool PELE. As would be expected with a coiled-coil domain containing protein, there are several α-helices. [7] The model was predicted to be 98% accurate to 65% of the protein. The predicted image can be seen below. This predicted model is closely related to tropomyosin - a contractile protein. [9]
Only one protein has been found to interact with CCDC78. An analysis performed from IntAct showed an interaction between CCDC78 and dAK1_1 in Yersinia pestis. [11]
CCDC78 has no known paralogs in the human genome. However, it has several orthologs in other organisms. Orthologs can be found throughout the animal kingdom. CCDC78 is highly conserved in mammals. [7] The coiled-coil domain is highly conserved throughout all orthologs, demonstrating the importance of these domains.
A mutation in this gene has been shown to cause a unique congenital myopathy. [8] This mutation is caused by alternative splicing - a 222 bp in-frame insertion. A group of researchers from the University of Michigan analyzed a family with a dominantly inherited congenital myopathy. After linkage analysis followed by whole-exome capture and next-generation sequencing, they found CCDC78 to be present in affected individuals and absent in >10,000 controls. [8] They then successfully modeled this congenital myopathy in zebrafish. CCDC78 has also been associated with an immune response to Hepatitis B. [12]
Transmembrane and Tetratricopeptide repeat containing 4 is a protein that in humans is encoded by the TMTC4 gene. This protein crosses the plasma membrane 10 times, and resides in the ER lumen and cytosol. The predicted structure of the TMTC4 protein is a series of alpha-helices.
Uncharacterized protein KIAA0895-like also known as LOC653319, is a protein that in humans is encoded by the KIAA0895L gene.
Uncharacterized LOC644249 gene., also known as RP11-195B21.3, is about 1058 base pairs long and is found in Homo sapiens on chromosome 9q12. More specifically, the sequence is located on Chromosome: 9; NC_000009.11(67977457..67987991 bp). This gene’s protein product is the “coiled-coil domain-containing protein 29” which is 291 amino acids long and may contain a conserved domain in the superfamily, pfam 12001. In particular, this conserved domain contains the domain of unknown function DUF3496 which is about 110 amino acids long, functionally uncharacterized, and found in eukaryotes. Other possible motifs for the protein product exist but the DUF3496 remains the most likely. This protein may play a role as a transmembrane protein.
Coiled-coil domain-containing protein 144A is a protein that in humans is encoded by the CCDC144A gene. An alias of this gene is called KIAA0565. There are four members of the CCDC family: CCDC 144A, 144B, 144C and putative CCDC 144 N-terminal like proteins.
Family with Sequence Similarity 203, Member B (FAM203B) is a protein encoded by the FAM203B gene (8q24.3) in humans. While FAM203B is only found in humans and possibly non-human primates, its paralog, FAM203A, is highly conserved. The FAM203B protein contains two conserved domains of unknown function, DUF383 and DUF384, and no transmembrane domains. This protein has no known function yet, although the homolog of FAM203A in Caenorhabditis elegans (Y54H5A.2) is thought to help regulate the actin cytoskeleton.
Coiled-coil domain containing 94 (CCDC94) is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.
Family with sequence similarity 149, member A is a protein that in humans is encoded by the FAM149A gene. It is well conserved in primates, dog, cow, mouse, rat, and chicken. It has one paralog, FAM149B.
Coiled Coil Domain Containing protein 42B, also known as CCDC42B, is a protein encoded by the protein-coding gene CCDC42B.
Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein CCDC47. The gene has several aliases including GK001 and MSTP041. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.
FAM71D, also known as chromosome 14 open reading frame 54 (C14orf54), is a protein that in humans is encoded by the FAM71D gene on Chromosome 14. Orthologs of FAM71D reach as far back in evolution to Reptiles, however, it is closer in homology to primates than any other orthologs. FAM71D has 6 paralogs: FAM71A, FAM71B, FAM71C, FAM71E1, FAM71F1, and FAM71F2 which encode a protein of unknown function.
GPATCH11 is a protein that in humans is encoded by the G-patch domain containing protein 11 gene. The gene has four transcript variants encoding two functional protein isoforms and is expressed in most human tissues. The protein has been found to interact with several other proteins, including two from a splicing pathway. In addition, GPATCH11 has orthologs in all taxa of the eukarya domain.
Vexin is a protein encoded by VXN gene. VXN is found to be highly expressed in regions of the brain and spinal cord.
PROSER1 is a protein that in humans is encoded by the PROSER1 gene.
C14orf93 is a protein that is encoded in humans by the C14orf93 gene. It is a globular protein with a conserved C-terminus that is localized to the nucleus. While expressed relatively highly in all tissues except nervous tissue, it is expressed particularly highly in T cells and other immune tissues.
Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.
Proline-rich basic protein 1(PROB1) is a protein encoded by the PROB1 gene located on human chromosome 5, open reading frame 65. PROB1 is also known as C5orf65 and weakly similar to basic proline-rich protein.
Transmembrane protein 171 (TMEM171) is a protein that in humans is encoded by the TMEM171 gene.
C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.
WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.
MIPOL1 , also known as CCDC193 , is a protein that in humans is encoded by the MIPOL1 gene. Mutation of this gene is associated with mirror-image polydactyly in humans, which is a rare genetic condition characterized by mirror-image duplication of digits.
{{cite web}}
: Missing or empty |title=
(help)[ permanent dead link ]