THAP3

Last updated
THAP3
Identifiers
Aliases THAP3 , THAP domain containing 3
External IDs OMIM: 612532 MGI: 1917126 HomoloGene: 18413 GeneCards: THAP3
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001145929
NM_175152

RefSeq (protein)

NP_001182681
NP_001182682
NP_612359

NP_001139401
NP_780361

Location (UCSC) Chr 1: 6.62 – 6.64 Mb Chr 4: 152.07 – 152.07 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse
Predicted Tertiary Structure of Homo sapiens THAP3 protein. Homo sapiens THAP3 Tertiary Structure.png
Predicted Tertiary Structure of Homo sapiens THAP3 protein.

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. [7] The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain [8] and a host-cell factor 1C binding motif. [9] These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. [10] THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys. [7]

Contents

Gene

The H. sapiens THAP3 gene is a protein-encoding gene that is located on the plus strand of chromosome 1 [7] at cytogenetic location 1p36.31. [11] It is 10,727 base pairs long, spanning from genomic coordinates 6,624,868-6,635,595. [11] It contains 6 exons. [12]

Gene neighborhood of Homo sapiens (human) THAP3 gene on chromosome 1. Gene Neighborhood of Homo sapiens THAP3.png
Gene neighborhood of Homo sapiens (human) THAP3 gene on chromosome 1.

Expression

In H. sapiens , THAP3 gene is expressed ubiquitously throughout different tissues, and expression is greatest in the kidneys. [13] It has also been determined that expression of THAP3 tends to be slightly higher in organs located in the abdomen and male and female sexual organs, such as the ovaries, testes, prostate, adrenal gland, spleen, liver, and colon, though expression in the kidneys is 1.4-1.5x higher than those organs. [13] THAP3 mRNA is 1.3x. more abundant in H. sapiens fetal brain tissue than in H. sapiens adult kidney tissue. [14]

mRNA

Transcription of the THAP3 gene can result in 11 different mRNA variants, of which 8 are alternatively spliced and 3 are unspliced. [7] Variant 1 is the predominant variant and encodes THAP3 protein isoform 1. [7]

Alternatively spliced Homo sapiens THAP3 mRNA transcript variants [15]
VariantSequence length (nucleotides)Accession number [7]
11358NM_001195752.2 [16]
22071NM_138350.4 [17]
31361NM_001195753.2 [18]
41262NM_001394496.1 [19]
52050NM_001394497.1 [20]
62047NM_001394498.1 [21]
71123NM_001394499.1 [22]
81120NM_001394500.1 [23]

Protein

Conceptual translation of Homo sapiens THAP3 aligned mRNA and amino acid sequences. Annotated with start and stop sites of translations, protein domains, and predicted post-translational modification sites. Conceptual Translation of Homo sapiens THAP3.png
Conceptual translation of Homo sapiens THAP3 aligned mRNA and amino acid sequences. Annotated with start and stop sites of translations, protein domains, and predicted post-translational modification sites.

The H. sapiens THAP3 protein is predicted to have a molecular weight of 26.9 kilodaltons [24] and a pI of 10.26. [25] The amino acid sequence is isoleucine and tyrosine rich and arginine poor. [24] Characteristics domains of H. sapiens are the THAP domain (THAP) and the hell-cell factor 1C binding motif (HCM). [7]

Isoforms

Due to having 8 alternatively spliced variants, there are 8 THAP3 isoforms. [7]

Isoforms of Homo sapiens THAP3 [7]
IsoformSequence length (amino acids)Accession numberEncoded by
1238NP_001182681.1 [26] Variant 1
2175NP_612359.2 [27] Variant 2
3239NP_001182682.1 [28] Variant 3
4236NP_001381425.1 [29] Variant 4
5168NP_001381426.1 [30] Variant 5
6167NP_001381427.1 [31] Variant 6
7148NP_001381428.1 [32] Variant 7
8147NP_001381429.1 [33] Variant 8

Structure

Schematic of Homo sapiens THAP3 protein sequence with annotated domains, predicted phosphorylation, glycosylation, and Yin-Yang sites. THAP represents the location of the THAP domain, and HBM represents the HCF1C binding motif. Yellow represents glycosylation sites (with scores over 0.5), green represents phosphorylation sites (with scores over 0.75), and diamond shapes represent Yin-Yang sites. Cartoon Schematic of Homo sapiens THAP3.png
Schematic of Homo sapiens THAP3 protein sequence with annotated domains, predicted phosphorylation, glycosylation, and Yin-Yang sites. THAP represents the location of the THAP domain, and HBM represents the HCF1C binding motif. Yellow represents glycosylation sites (with scores over 0.5), green represents phosphorylation sites (with scores over 0.75), and diamond shapes represent Yin-Yang sites.

The predicted H. sapiens THAP3 tertiary structure contains a globular region and an alpha helix. [5] [6] The globular region is located near the N-terminus of the sequence and is the structure of the THAP domain. It spans amino acids 4-82. [37] The alpha helix is located from amino acids 186-230 and contains the host-cell factor 1C binding motif. [37]

Regulation

Localization

THAP3 can be localized in the nucleus or mitochondria of H. sapiens cells. [38]

Post-translation modifications

The H. sapiens the THAP3 protein has 30 predicted phosphorylation sites, 28 predicted O-β-glycosylation sites, and 11 predicted Yin-Yang sites. [35] [36] Many proteins involved in transcription regulation are influenced by phosphorylation and glycosylation sites, which corroborates THAP3's function. [39]

Homology and evolution

Paralogs

The H. sapiens THAP3 protein, along with several other proteins, is part of the THAP family of proteins. [40] All of these proteins contain the THAP domain and are, thus, paralogs of H. sapiens THAP3. [15]

Paralogs of Homo sapiens THAP3 protein [7]
Protein NameE-Value! [15] Percent Identity to THAP3 [15]
THAP1 [41] 8×10−2348.00
THAP2 [42] 6×10−1745.24
THAP5 [43] 4×10−1331.96
THAP6 [44] 6×10−634.44
THAP7 [45] 1×10−733.33
THAP8 [46] 8×10−1131.96
THAP9 [47] 2×10−832.99

Orthologs

Multiple sequence alignment of THAP domain in Homo sapiens THAP3 (HSa THAP3; accession number NP 001182681.1 ) with distant orthologs. Boxing represents the location of the THAP domain in H. sapiens. Bolding and asterisks below groups of sequence indicate that an amino acid is highly conserved at that position. Full sequences include that of Sumatra barb (PTe THAP3), Electric eel (EEl THAP3), Lake whitefish (CCL THAP3), Baby whale (BBr THAP3), Whale shark (ARa THAP3), White-spotted bamboo shark (RTy THAP3), and Thorny skate (CPl THAP3). Accession numbers as in ortholog table. MSA of THAP3.png
Multiple sequence alignment of THAP domain in Homosapiens THAP3 (HSa THAP3; accession number NP 001182681.1 ) with distant orthologs. Boxing represents the location of the THAP domain in H. sapiens. Bolding and asterisks below groups of sequence indicate that an amino acid is highly conserved at that position. Full sequences include that of Sumatra barb (PTe THAP3), Electric eel (EEl THAP3), Lake whitefish (CCL THAP3), Baby whale (BBr THAP3), Whale shark (ARa THAP3), White-spotted bamboo shark (RTy THAP3), and Thorny skate (CPl THAP3). Accession numbers as in ortholog table.

There are approximately 206 orthlologs of H. sapiens THAP3. [7] Orthologs can be found in a variety of taxomonic classes, including mammals, reptiles, amphibians, bony fishes, and cartilaginous fishes. [15] However, there are no orthologs in bacteria, fungi, protists, archaea, plants, invertebrates, or birds. [15] Additionally, not all orders are represented with in a class. For example, in reptiles, orthologs to H. sapiens THAP3 are found in testudines (turtles or tortoises) and not found in crocodilia (crocodiles and alligators) or squamata (lizards and snakes). [15] Similarly, there are only orthologs in apoda within amphibians. [15] There are no orthologs in anura (frogs) or urodela (salamanders). [15]

In closely related organisms, those diverged 0-160 million years ago (MYA), percent similarity of orthologs ranges from 36-82.9%. THAP3 sequences in rodents are the least conserved compared to H. sapiens. Sequences that diverged 319-353 MYA, those moderately related, have 47.2-68.9% similarity to H. sapiens THAP3, and 41.3-54.1% similarity in organisms that are distantly related, diverged 431-464 MYA.

Orthologs of Homo sapiens THAP3 [49]
Taxonomic Class Scientific Name Common Name Taxonomic Order Date of Divergence [50] Accession Number [15] Percent Identity to THAP3Percent Similarity to THAP3 [51]
Mammals Marmota flaviventris Yellow-bellied marmot Rodentia 87XP_027803226.1 [52] 29.936
Lontra canadensis North American river otter Carnivora 94XP_032719186.1 [53] 59.565.8
Eptesicus fuscus Big brown bat Chiroptera 94XP_028016747.1 [54] 65.069.6
Balaenoptera musculus Blue whale Cetacea 94XP_036686252.1 [55] 77.582.9
Dromiciops gliroides Colocolo opossum Microbiotheria 160XP_043850206.1 [56] 64.674.5
Phascolarctos cinereus Koala Diprotodontia 160XP_020830574.1 [57] 65.776.4
Reptiles Caretta caretta Loggerhead turtle Testudines 319XP_048680971.1 [58] 36.947.2
Gopherus evgoodei Goode's thornscrub tortoise Testudines 319XP_030393185.1 [59] 48.958.1
Chelonoidis abingdonii Abingdon Island giant tortoise Testudines 319XP_032619750.1 [60] 48.961.4
Mauremys mutica Yellow pond turtle Testudines 319XP_044852367.1 [61] 49.060.9
Amphibians Microcaecilia unicolor Microcaecilia unicolor Gymnophiona 353XP_030041702.1 [62] 41.256.8
Geotrypetes seraphini Gaboon caecilian Gymnophiona 353XP_033777236.1 [63] 44.257.8
Bony Fishes Electrophorus electricus Electric eel Gymnotiformes 431XP_026873261.2 [64] 31.941.3
Coregonus clupeaformis Lake whitefish Salmoniformes 431XP_041712304.2 [65] 32.947.7
Brienomyrus brachyistius Baby whale Osteoglossiformes 431XP_048872538.1 [66] 33.546.3
Puntigrus tetrazona Sumatra barb Cypriniformes 431XP_043081346.1 [67] 34.048.1
Cartilaginous Fishes Rhincodon typus Whale shark Orectolobiformes 464XP_020386430.1 [68] 39.053.4
Chiloscyllium plagiosum White-spotted bamboo shark Orectolobiformes 464XP_043531920.1 [69] 39.053.0
Amblyraja radiata Thorny skate Rajiformes 464XP_032904038.1 [70] 40.254.1

Evolution

H. sapiens THAP3 has evolved at a rate similar to H. sapiens fibrinogen alpha, which is involved in the immune system. [15]

Protein interactions

H. sapiens THAP3 interacts with proteins involved in various cellular processes, like transcription regulation and neuronal development. [10] It is also interacts with molecular chaperones during its translation.

Homo sapiens THAP3 Protein Interactions [71]
ProcessProtein NameIdentified By! [72] Interaction Type
Transcription Regulation CHAT two hybrid assay Functional
FGFR3 two hybrid assay Functional
HCF1C [73] affinity capture - mass spectrometry Functional
OGT [73] affinity capture - mass spectrometry Functional
PKN1 two hybrid assay Functional
POLR2A two hybrid assay Functional
TARDBP two hybrid assay Functional
Neuronal Development LSAMP two hybrid assay Functional
DNAJB6 two hybrid assay Functional
Protein Folding BAG6 two hybrid assay Developmental

Clinical significance

THAP3 contributes to the presentation of X-linked Dystonia-Parkinsonism, also known as Lubag Syndrome. [74] This disease is a neurodegenerative movement disorder that predominantly affects males of Filipino descent. [75] Symptoms include tremors, bradykinesia, rigidity, postural instability, shuffling gait and dystonia, which typically develops later in life. [75]

Related Research Articles

Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.

<span class="mw-page-title-main">TMEM176B</span> Protein-coding gene in the species Homo sapiens

Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.

BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">FAM120AOS</span> Protein-coding gene in the species Homo sapiens

FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">C12orf50</span> Protein-coding gene in humans

Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.

<span class="mw-page-title-main">TBC1D30</span> Protein-coding gene in the species Homo sapiens

TBC1D30 is a gene in the human genome that encodes the protein of the same name. This protein has two domains, one of which is involved in the processing of the Rab protein. Much of the function of this gene is not yet known, but it is expressed mostly in the brain and adrenal cortex.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

<span class="mw-page-title-main">TEKTIP1</span> Gene

TEKTIP1, also known as tektin-bundle interacting protein 1, is a protein that in humans is encoded by the TEKTIP1 gene.

<span class="mw-page-title-main">C13orf42</span> C13orf42 gene page

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.

<span class="mw-page-title-main">NOXRED1</span> Human gene

NADP-dependent oxidoreductase domain-containing protein 1 is a protein that in humans is encoded by the NOXRED1 gene. An alias of this gene is Chromosome 14 Open Reading Frame 148 (c14orf148). This gene is located on chromosome 14, at 14q24.3. NOXRED1 is predicted to be involved in pyrroline-5-carboxylate reductase activity as part of the L-proline biosynthetic pathway. It is expressed in a wide variety of tissues at a relatively low level, including the testes, thyroid, skin, small intestine, brain, kidney, colon, and more.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">TMEM248</span> Transmembrane protein 248/TMEM248 gene

Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

<span class="mw-page-title-main">SCRN3</span> Protein-coding gene in the species Homo sapiens

Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.

<span class="mw-page-title-main">LRRC74A</span> Protein-coding gene

Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

<span class="mw-page-title-main">TMEM19</span> Protein encoded by the TMEM19 gene

Transmembrane protein 19 is a protein that in humans is encoded by the TMEM19 gene.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000041988 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000039759 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. (August 2021). "Highly accurate protein structure prediction with AlphaFold". Nature. 596 (7873): 583–589. Bibcode:2021Natur.596..583J. doi:10.1038/s41586-021-03819-2. PMC   8371605 . PMID   34265844.
  6. 1 2 Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, et al. (January 2022). "AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models". Nucleic Acids Research. 50 (D1): D439–D444. doi:10.1093/nar/gkab1061. PMC   8728224 . PMID   34791371.
  7. 1 2 3 4 5 6 7 8 9 10 11 12 "THAP3 THAP domain containing 3 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-08.
  8. Roussigne M, Kossida S, Lavigne AC, Clouaire T, Ecochard V, Glories A, et al. (February 2003). "The THAP domain: a novel protein motif with similarity to the DNA-binding domain of P element transposase". Trends in Biochemical Sciences. 28 (2): 66–69. doi:10.1016/S0968-0004(02)00013-0. PMID   12575992.
  9. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 1, mRNA". 2022-04-22.
  10. 1 2 Sabogal A, Lyubimov AY, Corn JE, Berger JM, Rio DC (January 2010). "THAP proteins target specific DNA sites through bipartite recognition of adjacent major and minor grooves". Nature Structural & Molecular Biology. 17 (1): 117–123. doi:10.1038/nsmb.1742. PMC   2933787 . PMID   20010837.
  11. 1 2 "Entry - *612532 - THAP Doman-Containing Protein 3; THAP3 - OMIM". www.omim.org. Retrieved 2022-12-15.
  12. 1 2 "THAP domain-containing protein 3 isoform 1 [Homo sapiens] - Protein - NCBI". National Center of Biotechnology Information. Retrieved 2022-12-16.
  13. 1 2 Fagerberg L, Hallström BM, Oksvold P, Kampf C, Djureinovic D, Odeberg J, et al. (February 2014). "Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics". Molecular & Cellular Proteomics. 13 (2): 397–406. doi: 10.1074/mcp.M113.035600 . PMC   3916642 . PMID   24309898.
  14. Duff MO, Olson S, Wei X, Garrett SC, Osman A, Bolisetty M, et al. (May 2015). "Genome-wide identification of zero nucleotide recursive splicing in Drosophila". Nature. 521 (7552): 376–379. Bibcode:2015Natur.521..376D. doi:10.1038/nature14475. PMC   4529404 . PMID   25970244.
  15. 1 2 3 4 5 6 7 8 9 10 11 "Protein BLAST: search protein databases using a protein query". National Center of Biotechnology Information. Retrieved 2022-12-08.
  16. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 1, mRNA". April 22, 2022 via NCBI Nucleotide.
  17. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 2, m - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. 22 April 2022.
  18. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 3, m - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. 10 June 2022.
  19. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 4, mRNA". April 22, 2022 via NCBI Nucleotide.
  20. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 5, mRNA". April 22, 2022 via NCBI Nucleotide.
  21. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 6, mRNA". April 22, 2022 via NCBI Nucleotide.
  22. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 7, m - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. 22 April 2022.
  23. "Homo sapiens THAP domain containing 3 (THAP3), transcript variant 8, mRNA". April 22, 2022 via NCBI Nucleotide.
  24. 1 2 Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–2006. Bibcode:1992PNAS...89.2002B. doi: 10.1073/pnas.89.6.2002 . PMC   48584 . PMID   1549558.
  25. Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., Bairoch A.; Protein Identification and Analysis Tools on the Expasy Server; (In) John M. Walker (ed): The Proteomics Protocols Handbook, Humana Press (2005).
  26. "THAP domain-containing protein 3 isoform 1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  27. "THAP domain-containing protein 3 isoform 2 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  28. "THAP domain-containing protein 3 isoform 3 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  29. "THAP domain-containing protein 3 isoform 4 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  30. "THAP domain-containing protein 3 isoform 5 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  31. "THAP domain-containing protein 3 isoform 6 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  32. "THAP domain-containing protein 3 isoform 7 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  33. "THAP domain-containing protein 3 isoform 8 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  34. Liu W, Xie Y, Ma J, Luo X, Nie P, Zuo Z, et al. (October 2015). "IBS: an illustrator for the presentation and visualization of biological sequences". Bioinformatics. 31 (20): 3359–3361. doi:10.1093/bioinformatics/btv362. PMC   4595897 . PMID   26069263.
  35. 1 2 3 Gupta, R. (2001). Prediction of glycosylation sites in proteomes: from post-translational modifications to protein function. Technical University of Denmark.
  36. 1 2 Blom N, Gammeltoft S, Brunak S (December 1999). "Sequence and structure-based prediction of eukaryotic protein phosphorylation sites". Journal of Molecular Biology. 294 (5): 1351–1362. doi:10.1006/jmbi.1999.3310. PMID   10600390.
  37. 1 2 Wang J, Youkharibache P, Marchler-Bauer A, Lanczycki C, Zhang D, Lu S, et al. (2022). "iCn3D: From Web-Based 3D Viewer to Structural Analysis Tool in Batch Mode". Frontiers in Molecular Biosciences. 9: 831740. doi: 10.3389/fmolb.2022.831740 . PMC   8892267 . PMID   35252351.
  38. "PSORT II Prediction". psort.hgc.jp. Retrieved 2022-12-16.
  39. Filtz TM, Vogel WK, Leid M (February 2014). "Regulation of transcription factor activity by interconnected post-translational modifications". Trends in Pharmacological Sciences. 35 (2): 76–85. doi:10.1016/j.tips.2013.11.005. PMC   3954851 . PMID   24388790.
  40. Sanghavi HM, Mallajosyula SS, Majumdar S (March 2019). "Classification of the human THAP protein family identifies an evolutionarily conserved coiled coil region". BMC Structural Biology. 19 (1): 4. doi: 10.1186/s12900-019-0102-2 . PMC   6402169 . PMID   30836974.
  41. "THAP1 THAP domain containing 1 [Homo sapiens (human)] - Gene - NCBI". National Center of Biotechnology Information.
  42. "THAP2 THAP domain containing 2 [Homo sapiens (human)] - Gene - NCBI". National Center of Biotechnology Information.
  43. "THAP5 THAP domain containing 5 [Homo sapiens (human)] - Gene - NCBI". National Center of Biotechnology Information.
  44. "THAP6 THAP domain containing 6 [Homo sapiens (human)] - Gene - NCBI". National Center of Biotechnology Information.
  45. "THAP7 THAP domain containing 7 [Homo sapiens (human)] - Gene - NCBI". National Center of Biotechnology Information.
  46. "THAP8 THAP domain containing 8 [Homo sapiens (human)] - Gene - NCBI". National Center of Biotechnology Information.
  47. "THAP9 THAP domain containing 9 [Homo sapiens (human)] - Gene - NCBI". National Center of Biotechnology Information.
  48. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, et al. (October 2011). "Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega". Molecular Systems Biology. 7 (1): 539. doi:10.1038/msb.2011.75. PMC   3261699 . PMID   21988835.
  49. "THAP domain-containing protein 3 isoform 1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-08.
  50. Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, et al. (August 2022). "TimeTree 5: An Expanded Resource for Species Divergence Times". Molecular Biology and Evolution. 39 (8): msac174. doi:10.1093/molbev/msac174. PMC   9400175 . PMID   35932227.
  51. "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-08.
  52. "THAP domain-containing protein 3 isoform X1 [Marmota flaviventris] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  53. "THAP domain-containing protein 3 isoform X1 [Lontra canadensis] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  54. "THAP domain-containing protein 3 isoform X1 [Eptesicus fuscus] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  55. "THAP domain-containing protein 3 isoform X1 [Balaenoptera musculus] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  56. "THAP domain-containing protein 3 isoform X1 [Dromiciops gliroides] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  57. "THAP domain-containing protein 3 isoform X1 [Phascolarctos cinereus] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  58. "THAP domain-containing protein 3 isoform X1 [Caretta caretta] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  59. "THAP domain-containing protein 3 isoform X1 [Gopherus evgoodei] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  60. "THAP domain-containing protein 3 isoform X1 [Chelonoidis abingdonii] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  61. "THAP domain-containing protein 3 isoform X1 [Mauremys mutica] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  62. "THAP domain-containing protein 3 isoform X1 [Microcaecilia unicolor] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  63. "THAP domain-containing protein 3 isoform X1 [Geotrypetes seraphini] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  64. "THAP domain-containing protein 3 isoform X1 [Electrophorus electricus] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  65. "THAP domain-containing protein 3 isoform X1 [Coregonus clupeaformis] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  66. "THAP domain-containing protein 3 isoform X1 [Brienomyrus brachyistius] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  67. "THAP domain-containing protein 3 isoform X1 [Puntigrus tetrazona] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  68. "THAP domain-containing protein 3 isoform X1 [Rhincodon typus] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  69. "THAP domain-containing protein 3 isoform X1 [Chiloscyllium plagiosum] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  70. "THAP domain-containing protein 3 isoform X1 [Amblyraja radiata] - Protein - NCBI". www.ncbi.nlm.nih.gov.
  71. "UniProt". www.uniprot.org. Retrieved 2022-12-16.
  72. "IntAct Portal". www.ebi.ac.uk. Retrieved 2022-12-16.
  73. 1 2 "THAP3 Result Summary | BioGRID". thebiogrid.org. Retrieved 2022-12-16.
  74. "THAP3 Gene - GeneCards | THAP3 Protein | THAP3 Antibody". www.genecards.org. Retrieved 2022-12-08.
  75. 1 2 Rosales RL (October 2010). "X-linked dystonia parkinsonism: clinical phenotype, genetics and therapeutics". Journal of Movement Disorders. 3 (2): 32–38. doi:10.14802/jmd.10009. PMC   4027667 . PMID   24868378.