WWC2 | |||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||
Aliases | WWC2 , BOMB, WW and C2 domain containing 2 | ||||||||||||||||||||||||
External IDs | MGI: 1261872 HomoloGene: 32618 GeneCards: WWC2 | ||||||||||||||||||||||||
| |||||||||||||||||||||||||
| |||||||||||||||||||||||||
| |||||||||||||||||||||||||
Orthologs | |||||||||||||||||||||||||
Species | Human | Mouse | |||||||||||||||||||||||
Entrez | |||||||||||||||||||||||||
Ensembl | |||||||||||||||||||||||||
UniProt | |||||||||||||||||||||||||
RefSeq (mRNA) | |||||||||||||||||||||||||
RefSeq (protein) | |||||||||||||||||||||||||
Location (UCSC) | Chr 4: 183.1 – 183.32 Mb | Chr 8: 47.82 – 47.99 Mb | |||||||||||||||||||||||
PubMed search | [3] | [4] | |||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||
|
WW and C2 domain containing 2 (WWC2) is a protein that in humans is encoded by the WWC2 gene (4q35.1). Though function of WWC2 remains unknown, it has been predicted that WWC2 may play a role in cancer.
Locus
The human gene WWC2 is found on chromosome 4 at band 4q35.1. The gene is found on the plus strand of the chromosome and is 8,822 base pairs long. The gene contains 23 exons. The WWC2 locus is quite complex and appears to produce several proteins with no sequence overlap [5]
Aliases
A common alias of the gene is BH3-Only Member B (BOMB) [6]
Paralogs
There are two paralogs of WWC2 found in humans, WWC1 and WWC3. WWC1 is located on chromosome 5 and is a probable regulator of the Hippo signaling pathway that plays a role in tumor suppression by restricting proliferation and promoting apoptosis. [7] WWC3 is located on chromosome X and not much is known about its function.
Sequence | Genus/species | Accession # | Seq. length | Seq. identity |
---|---|---|---|---|
WWC2 | Homo sapiens | NP_079225 | 1192 aa | 100% |
KIBRA (WWC1) | Homo sapiens | AA015881 | 1113 aa | 49.7% |
WWC3 | Homo sapiens | NP_056506 | 1092 aa | 41.2% |
Orthologs
WWC2 is highly conserved in Mammalia, Aves, Reptilia, and Amphibia, as well as the rare coelacanth, which is more closely related to lungfish, reptiles, and mammals than ray finned fish. WWC2 is conserved in some Actinopterygii, Gastropoda, and Bivalvia. However, WWC2 is not well conserved in Insecta.
Genus/Species | Common name | Date of divergence | Accession # | Seq. identity |
---|---|---|---|---|
Homo sapiens | Human | N/A | NP_079225 | 100% |
Pan troglodytes | Chimpanzee | 6.1 MYA | XP_003310624 | 99% |
Heterocephalus glaber | Naked mole rat | 91 MYA | EHB18748 | 88% |
Mus musculus | Mouse | 91 MYA | NP_598552 | 86% |
Orcinus orca | Killer whale | 97.4 MYA | XP_004281794 | 90% |
Bos mutus | Yak | 97.4 MYA | XP_005903227 | 84% |
Alligator mississippiensis | Alligator | 324.5 MYA | XP_006269678 | 79.2% |
Pelodiscus sinensis | Chinese soft-shelled turtle | 324.5 MYA | XP_006130219 | 79% |
Anas platyrhynchos | Mallard | 324.5 MYA | EOA93642 | 78% |
Falco peregrinus | Peregrine falcon | 324.5 MYA | XP_005230882 | 77% |
Ficedula albicollis | Collared flycatcher | 324.5 MYA | XP_005045160 | 76% |
Xenopus (Silurana) tropicalis | Western clawed frog | 361.2 MYA | NP_001004872 | 71% |
Ophiophagus hannah | King cobra | 362.2 MYA | ETE71408 | 71% |
Latimeria chalumnae | Coelacanth | 430 MYA | XP_005989542 | 72% |
Takifugu rubripes | Pufferfish | 454.6 MYA | XP_003973883 | 55% |
Danio rerio | Zebrafish | 454.6 MYA | XP_689275 | 53% |
Xiphophorus maculatus | Southern platyfish | 454.6 MYA | XP_005800442 | 51% |
Aplysia californica | California sea hare (slug) | 782.7 MYA | XP_005096216 | 51% |
Crassostrea gigas | Pacific oyster | 910 MYA | EKC42771 | 39% |
Anopheles darlingi | Mosquito | 910 MYA | ETN67979 | 34% |
Drosophila melanogaster | Fruit fly | 910 MYA | AAF55090.2 | 28.9% |
Primary sequence
The gene encodes a protein also called WWC2 which is 1,192 amino acids long. The molecular weight of the protein is 133.9 kilodaltons. [8] The protein is serine rich with no charge clusters, hydrophobic segments or transmembrane domains. The isoelectric point is 5.23800 [9]
Domains and motifs
WWC2 is a member of the WWC protein family [10] which consists of a WW domain and a C2 domain. WWC2 contains two WW domains and one C2 domain. WWC2 also contains two domains of unknown function, DUF342 and DUF444. A leucine zipper is located at position 854.
Post translational modifications
The WWC2 protein is predicted to be highly phosphorylated. [11] There are 89 predicted sites of serine phosphorylation, 17 predicted sites of threonine phosphorylation, and 11 predicted sites of tyrosine phosphorylation. These numbers were relatively consistent in orthologous proteins.
It is also predicted that p38 mitogen-activated protein kinases and glycogen synthase kinase 3 bind at position T3, and casein kinase 2 binds at positions S13 and T50. [12]
Expression
WWC2 is expressed at a low level, and is tissue specific to the uterus, thyroid, lung, and liver. WWC2 expression is found to be elevated in the blastocyst and fetal stages of development.
Transcript variants
Many transcript variants exist for WWC2. Those that change a highly conserved amino acid residue, or surround a highly conserved amino acid residue are listed below:
SNP | Allele | Protein residue | Amino acid position |
---|---|---|---|
rs200024780 | A to G | Tyr (T) to Cys (C) | 470 |
rs191286964 | C to T | Arg (R) to Cys (C) | 1082 |
rs139606516 | G to T | Arg (R) to Leu (L) | 1082 |
rs149738870 | A to G | Asn (N) to Ser (S) | 1084 |
Transcription factors
Transcription factors with highest matrix scores that bind to sequences within the promoter (ID GXP_1499160) are shown below:
Proteins
Potential interacting proteins include: YWHAZ, YWHAQ, RUVBL1, and REPS1.
While the exact function of WWC2 remains unknown, several mutations and variants of WWC2 have been researched in disease. A novel missense mutation in WWC2 was analyzed in Restless Leg Syndrome, but was not identified as a candidate gene. [13] One study examined the role of Drosophila KIBRA (WWC1) in the Expanded-Hippo-Warts signaling cascade, which is involved with tumor suppression. The study stated that copy number aberration, translocation, and point mutations of WWC2, as well as other genes, should be further investigated in human cancers. [14] WWC2 alias, BOMB, was researched in a grant suggesting that BOMB, along with two other genes (APOL6 and APOL1) promoted cell death in p53-null HCT116 cells.
Platelet-derived growth factor receptor beta is a protein that in humans is encoded by the PDGFRB gene.
UPF0687 protein C20orf27 is a protein that in humans is encoded by the C20orf27 gene. It is expressed in the majority of the human tissues. One study on this protein revealed its role in regulating cell cycle, apoptosis, and tumorigenesis via promoting the activation of NFĸB pathway.
Transmembrane protein 229b is a protein that in humans is encoded by the TMEM229b gene.
QRICH1, also known as Glutamine-rich protein 1, is a protein that in humans is encoded by the QRICH1 gene. One notable feature of this protein is that it contains a Caspase Activation Recruitment Domain, also known as a CARD domain. As a result of having this domain, QRICH1 is believed to be involved in apoptotic, inflammatory, and host-immune response pathways.
CXorf26, also known as MGC874, is a well conserved human gene found on the plus strand of the short arm of the X chromosome. The exact function of the gene is poorly understood, but the polysaccharide biosynthesis domain that spans a major portion of the protein product, as well as the yeast homolog, YPL225, offer insights into its possible function.
Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.
Transmembrane protein 134 is a protein encoded by the TMEM134 gene. TMEM134 does not have any other known aliases. There are two transmembrane domains and a domain of unknown function (DUF872). Evolutionary, the majority of the organisms that have this gene are primates and mammals, although there are some organisms dating back to Drosphila and C. elegans. Through current research, there has not been any confirmed function of TMEM134.
EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.
Family with sequence similarity 63, member A is a protein that, in humans, is encoded by the FAM63A gene. It is located on the minus strand of chromosome 1 at locus 1q21.3.
Intermediate filament family orphan 1 is a protein that in humans is encoded by the IFFO1 gene. IFFO1 has uncharacterized function and a weight of 61.98 kDa. IFFO1 proteins play an important role in the cytoskeleton and the nuclear envelope of most eukaryotic cell types.
KIAA0753 is a protein that in humans is encoded by the gene KIAA0753. The gene is located on chromosome 17p13.1, on the reverse strand spanning bases 6578141 to 6641744. The KIAA0753 gene contains 18 exons, 19 introns, and has no known aliases.
C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.
Chromosome 12 Open Reading Frame 42 (C12orf42) is a protein-encoding gene in Homo sapiens.
Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.
Transmembrane protein 254 is a transmembrane protein that is encoded by the TMEM254 gene, it is predicted to have many orthologs across eukaryotes.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
Tubulin epsilon and delta complex 2 (TEDC2), also known as Chromosome 16 open reading frame 59 (C16orf59), is a protein that in humans is encoded by the TEDC2 gene. Its NCBI accession number is NP_079384.2.
Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.
Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.
Transmembrane protein 247 is a multi-pass transmembrane protein of unknown function found in Homo sapiens encoded by the TMEM247 gene. Notable in the protein are two transmembrane regions near the c-terminus of the translated polypeptide. Transmembrane protein 247 has been found to be expressed almost entirely in the testes.