POTEB | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | POTEB , A26B1, CT104.5, POTE-15, POTE15, POTE ankyrin domain family member B, POTEB2 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 608912 HomoloGene: 136720 GeneCards: POTEB | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
POTE ankyrin domain family, member B is a protein in humans that is encoded by the POTEB gene. [3] (Prostate, Ovary, Testes Expressed ankyrin domain family member B).It is most likely involved in mediating protein-protein interaction via its 5 ankyrin domains. [4] POTEB is most probably aids in intracellular signaling, but is not likely to be a secreted or nuclear protein. [4] POTEB's function is likely to be regulated via 17 potential phosphorylation sites. [5] There is currently no evidence to suggest that POTEB has nuclear localization signals. [6]
POTEB is located at 15q11.2 on chromosome 15 in humans and is transcribed from the reverse DNA strand. POTEB is also known as POTEB3 and POTE15. [7] The POTEB gene is 47,547 base pairs in length and is composed of 11 exons. [7]
The POTEB gene can be transcribed to create four potential mRNAs. However, only one of these mRNAs, possessing all 11 exons, is capable of being translated to the POTEB protein. [8] The three other transcripts do not encode proteins.
The POTEB protein is composed of 544 amino acids and, according to bioinformatic analyses, has a molecular weight of 61.7 kDa. It has an isoelectric point of 5.68. [9] Its most common amino acids are leucine and glutamic acid, which account for 11% and 10.3% of the protein respectively. [9] However, this is normal for human proteins. POTEB is most likely a cytoplasmic protein [10] that is phosphorylated at 17 serines, threonines, and tyrosines located throughout the length of the protein, [5] but concentrated at the C-terminus of the protein. Its secondary structure is mainly five helical ankyrin repeat domains, which contain the TALHL motif. There is also one myristoylation site on the protein, close to the N-terminus. [11]
POTEB is expressed at high levels in the human prostate, ovary, and testes. However, there is also evidence to show that it is expressed at low levels in embryonic stem cells, the nasopharyngeal region, and in breast tissue. [12] [13] In embryonic stem cells, differentiation is likely to turn off the expression of POTEB while in breast cancer, triple negative cells are found to have no POTEB expression suggesting a role in cancer-activated pathways. [13]
Some studies have used POTEB probes to study the expression of POTEB in the human brain. However, the only region with notable POTEB expression is the cerebellar cortex, responsible for motor function and some cognitive functions. [14]
POTEB expression is likely regulated by E-box binding factors and Krueppel-like transcription factors, along with nuclear factor kappa B (NF-κB) transcription factors. [15]
POTEB expression could be regulated by the binding of transcription factors to intron 1 of the pre-mRNA, leading to the production of a truncated mRNA which is not translated. Alternatively, POTEB expression could be downregulated by the formation of stem loops close to the start codon. [16] There are no known ubiquitination sites in POTEB that could aid in regulating POTEB function and stability.
POTEB is most likely involved in mediating protein-protein interaction via its 5 ankyrin domains. [4] POTEB is most probably aids in intracellular signaling, but is not likely to be a secreted or nuclear protein [4] as it is unlikely to contain nuclear localization signals. [6] POTEB's function is likely to be regulated via 17 potential phosphorylation sites [5] which determine how the ankyrin domains interact with other proteins. [17]
There have been no studies published confirming the interaction of POTEB with other human proteins. However, there is unpublished data suggesting an interaction between POTEB and alpha-1-B glycoproteins, APOBEC1 complementation factors, and alpha-2-macroglobulin. [18] This data is based on affinity capture- mass spectrometry.
POTEB expression is low or completely reduced in triple-negative breast cancer cells when compared to other types of breast cancer cells. [13] This suggests POTEB’s involvement in intracellular signaling pathways that suppress cancer, or in pathways that regulate the normal growth and division of cells.
POTEB has 8 predicted paralogs (According to protein sequence) in humans, with most paralogs being located on different human chromosomes. [19] [20] It is speculated that this large number of paralogs arose from multiple duplication events. [10]
Paralog name | Accession number | % Identity | Amino acids | Chromosome # |
---|---|---|---|---|
POTE 15B | AAS58869.1 | 99 | 544 | Chromosome 15 |
POTE C | NP_001131143.1 | 90 | 542 | Chromosome 18 |
POTE H | NP_001129685.1 | 89 | 545 | Chromosome 22 |
POTE J | NP_001264012.1 | 86 | 1038 | Chromosome 2 |
POTE 14A | AAS58868.1 | 82 | 508 | Chromosome 14 |
POTE G | NP_001005356.1 | 82 | 508 | Chromosome 14 |
POTE E | XP_016859648.1 | 80 | 967 | Chromosome 2 |
POTE I | NP_001264335.1 | 80 | 1075 | Chromosome 2 |
POTEB orthologs have been found in mammals, birds, reptiles, amphibians, fish, and even in invertebrates such as sea anemones and marine polychaete worms. [19] These orthologs share a similarity with POTEB largely due to the presence of ankyrin repeats, suggesting that ankyrin domain-containing proteins have been conserved over millions of years. POTEB orthologs have not been found in plants, unicellular eukaryotes, bacteria and archaea. [19]
HIKESHI is a protein important in lung and multicellular organismal development that, in humans, is encoded by the HIKESHI gene. HIKESHI is found on chromosome 11 in humans and chromosome 7 in mice. Similar sequences (orthologs) are found in most animal and fungal species. The mouse homolog, lethal gene on chromosome 7 Rinchik 6 protein is encoded by the l7Rn6 gene.
Protein FAM214A, also known as protein family with sequence similarity 214, A (FAM214A) is a protein that, in humans, is encoded by the FAM214A gene. FAM214A is a gene with unknown function found at the q21.2-q21.3 locus on Chromosome 15 (human). The protein product of this gene has two conserved domains, one of unknown function (DUF4210) and another one called Chromosome_Seg. Although the function of the FAM214A protein is uncharacterized, both DUF4210 and Chromosome_Seg have been predicted to play a role in chromosome segregation during meiosis.
ZC3H12B, also known as CXorf32 or MCPIP2, is a protein encoded by gene ZC3H12B located on chromosome Xq12 in humans.
Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein CCDC47. The gene has several aliases including GK001 and MSTP041. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.
Family with Sequence Similarity 78-Member B (FAM78B) is a protein of unknown function in humans that is encoded by the FAM78B gene (1q24.1). It has orthologous genes and predicted proteins in vertebrates and several invertebrates, but not in arthropods. It has a nuclear localization signal in the protein sequence and a miRNA target region of the mRNA sequence.
EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.
C9orf135 is a gene that encodes a 229 amino acid protein. It is located on Chromosome 9 of the Homo sapiens genome at 9q12.21. The protein has a transmembrane domain from amino acids 124-140 and a glycosylation site at amino acid 75. C9orf135 is part of the GRCh37 gene on Chromosome 9 and is contained within the domain of unknown function superfamily 4572. Also, c9orf135 is known by the name of LOC138255 which is a description of the gene location on Chromosome 9.1.
Vexin is a protein encoded by VXN gene. VXN is found to be highly expressed in regions of the brain and spinal cord.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.
FAM163A, also known as cebelin and neuroblastoma-derived secretory protein (NDSP) is a protein that in humans is encoded by the FAM163A gene. This protein has been implicated in promoting proliferation and anchorage-independent growth of neuroblastoma cancer cells. In addition, this protein has been found to be up-regulated in the lung tissue of chronic smokers. FAM163A is found on human chromosome 1q25.2; its protein product is 167 amino acids long. FAM163A contains a very highly conserved signal peptide sequence, coded for by the first ~37 amino acids in its sequence; albeit only conserved in eukaryotes, the most distant of which being the Japanese Rice Fish.
Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
C21orf62 is a protein that, in humans, is encoded by the C21orf62 gene. C21orf62 is found on human chromosome 21, and it is thought to be expressed in tissues of the brain and reproductive organs. Additionally, C21orf62 is highly expressed in ovarian surface epithelial cells during normal regulation, but is not expressed in cancerous ovarian surface epithelial cells.
BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.
KIAA1211L is a protein that in humans is encoded by the KIAA1211L gene. It is highly expressed in the brain. Furthermore, it is localized to the microtubules and the centrosomes and is subcellularly located in the nucleus. Finally, KIAA1211L is associated with certain mental disorders and various cancers.
Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.
CAP-Gly Domain Containing Linker Protein Family Member 4 is a protein that in humans is encoded by the CLIP4 gene. In terms of conserved domains, the CLIP4 gene contains primarily ankyrin repeats and the eponymous CAP-Gly domains. The structure of the CLIP4 protein is largely made up of coil, with alpha helices dominating the rest of the protein. CLIP4 mRNA expression occurs largely in the adrenal cortex and atrioventricular node. The literature encompassing CLIP4's conserved domains and paralogs points toward microtubule regulation as a possible function of CLIP4.