GATA zinc finger

Last updated
GATA zinc finger
PDB 1y0j EBI.jpg
zinc fingers as protein recognition motifs: structural basis for the gata-1/friend of gata interaction
Identifiers
SymbolGATA
Pfam PF00320
Pfam clan CL0167
InterPro IPR000679
PROSITE PDOC00300
SCOP2 1gat / SCOPe / SUPFAM
CDD cd00202
Available protein structures:
Pfam   structures / ECOD  
PDB RCSB PDB; PDBe; PDBj
PDBsum structure summary

In molecular biology, GATA zinc fingers are zinc-containing domains found in a number of transcription factors (including erythroid-specific transcription factor and nitrogen regulatory proteins). Some members of this class of zinc fingers specifically bind the DNA sequence (A/T)GATA(A/G) in the regulatory regions of genes., [1] giving rise to the name of the domain. In these domains, a single zinc ion is coordinated by 4 cysteine residues. [2] [3] NMR studies have shown the core of the Znf to comprise 2 irregular anti-parallel beta-sheets and an alpha-helix, followed by a long loop to the C-terminal end of the finger. The N-terminal part, which includes the helix, is similar in structure, but not sequence, to the N-terminal zinc module of the glucocorticoid receptor DNA-binding domain. The helix and the loop connecting the 2 beta-sheets interact with the major groove of the DNA, while the C-terminal tail wraps around into the minor groove. Interactions between the Znf and DNA are mainly hydrophobic, explaining the preponderance of thymines in the binding site; a large number of interactions with the phosphate backbone have also been observed. [3] Two GATA zinc fingers are found in GATA-family transcription factors. However, there are several proteins that only contain a single copy of the domain. It is also worth noting that many GATA-type Znfs (such as those found in the proteins GATAD2B and MTA1) have not been experimentally demonstrated to be DNA-binding domains. Furthermore, several GATA-type Znfs have been demonstrated to act as protein-recognition domains. For example, the N-terminal Znf of GATA1 binds specifically to a zinc finger from the transcriptional coregulator FOG1 (ZFPM1). [4]

Related Research Articles

<span class="mw-page-title-main">Zinc finger</span> Small structural protein motif found mostly in transcriptional proteins

A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions (Zn2+) in order to stabilize the fold. It was originally coined to describe the finger-like appearance of a hypothesized structure from the African clawed frog (Xenopus laevis) transcription factor IIIA. However, it has been found to encompass a wide variety of differing protein structures in eukaryotic cells. Xenopus laevis TFIIIA was originally demonstrated to contain zinc and require the metal for function in 1983, the first such reported zinc requirement for a gene regulatory protein followed soon thereafter by the Krüppel factor in Drosophila. It often appears as a metal-binding domain in multi-domain proteins.

<span class="mw-page-title-main">DNA-binding protein</span> Proteins that bind with DNA, such as transcription factors, polymerases, nucleases and histones

DNA-binding proteins are proteins that have DNA-binding domains and thus have a specific or general affinity for single- or double-stranded DNA. Sequence-specific DNA-binding proteins generally interact with the major groove of B-DNA, because it exposes more functional groups that identify a base pair. However, there are some known minor groove DNA-binding ligands such as netropsin, distamycin, Hoechst 33258, pentamidine, DAPI and others.

<span class="mw-page-title-main">Helix-turn-helix</span> Structural motif capable of binding DNA

Helix-turn-helix is a DNA-binding protein (DBP). The helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove of DNA. The HTH motif occurs in many proteins that regulate gene expression. It should not be confused with the helix–loop–helix motif.

<span class="mw-page-title-main">EGR1</span> Protein-coding gene in the species Homo sapiens

EGR-1 also known as ZNF268 or NGFI-A is a protein that in humans is encoded by the EGR1 gene.

A DNA-binding domain (DBD) is an independently folded protein domain that contains at least one structural motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence or have a general affinity to DNA. Some DNA-binding domains may also include nucleic acids in their folded structure.

<span class="mw-page-title-main">GATA1</span> Protein-coding gene in the species Homo sapiens

GATA-binding factor 1 or GATA-1 is the founding member of the GATA family of transcription factors. This protein is widely expressed throughout vertebrate species. In humans and mice, it is encoded by the GATA1 and Gata1 genes, respectively. These genes are located on the X chromosome in both species.

In molecular genetics, the Krüppel-like family of transcription factors (KLFs) are a set of eukaryotic C2H2 zinc finger DNA-binding proteins that regulate gene expression. This family has been expanded to also include the Sp transcription factor and related proteins, forming the Sp/KLF family.

Therapeutic gene modulation refers to the practice of altering the expression of a gene at one of various stages, with a view to alleviate some form of ailment. It differs from gene therapy in that gene modulation seeks to alter the expression of an endogenous gene whereas gene therapy concerns the introduction of a gene whose product aids the recipient directly.

In a zinc finger protein, certain sequences of amino acid residues are able to recognise and bind to an extended target-site of four or even five nucleotides When this occurs in a ZFP in which the three-nucleotide subsites are contiguous, one zinc finger interferes with the target-site of the zinc finger adjacent to it, a situation known as target-site overlap. For example, a zinc finger containing arginine at position -1 and aspartic acid at position 2 along its alpha-helix will recognise an extended sequence of four nucleotides of the sequence 5'-NNG(G/T)-3'. The hydrogen bond between Asp2 and the N4 of either a cytosine or adenine base paired to the guanine or thymine, respectively defines these two nucleotides at the 3' position, defining a sequence that overlaps into the subsite of any zinc finger that may be attached N-terminally.

<span class="mw-page-title-main">CTCF</span> Transcription factor

Transcriptional repressor CTCF also known as 11-zinc finger protein or CCCTC-binding factor is a transcription factor that in humans is encoded by the CTCF gene. CTCF is involved in many cellular processes, including transcriptional regulation, insulator activity, V(D)J recombination and regulation of chromatin architecture.

<span class="mw-page-title-main">TAL1</span> Protein-coding gene in the species Homo sapiens

T-cell acute lymphocytic leukemia protein 1 is a protein that in humans is encoded by the TAL1 gene.

<span class="mw-page-title-main">GATA2</span> Protein-coding gene in the species Homo sapiens

GATA2 or GATA-binding factor 2 is a transcription factor, i.e. a nuclear protein which regulates the expression of genes. It regulates many genes that are critical for the embryonic development, self-renewal, maintenance, and functionality of blood-forming, lympathic system-forming, and other tissue-forming stem cells. GATA2 is encoded by the GATA2 gene, a gene which often suffers germline and somatic mutations which lead to a wide range of familial and sporadic diseases, respectively. The gene and its product are targets for the treatment of these diseases.

<span class="mw-page-title-main">LMO2</span> Protein-coding gene in the species Homo sapiens

LIM domain only 2, also known as LMO2, RBTNL1, RBTN2, RHOM2, LIM Domain Only Protein 2, TTG2, and T-Cell Translocation Protein 2, is a protein which in humans is encoded by the LMO2 gene.

<span class="mw-page-title-main">CTBP1</span> Protein-coding gene in the species Homo sapiens

C-terminal-binding protein 1 also known as CtBP1 is a protein that in humans is encoded by the CTBP1 gene. CtBP1 is one of two CtBP proteins, the other protein being CtBP2.

<span class="mw-page-title-main">ZNF143</span> Protein-coding gene

Zinc finger protein 143 is a protein that in humans is encoded by the ZNF143 gene.

<span class="mw-page-title-main">Zinc finger protein ZFPM1</span> Protein-coding gene in the species Homo sapiens

Zinc finger protein ZFPM1(FOG-1) also known as friend of GATA protein 1 is a protein that in humans is encoded by the ZFPM1 gene. It is a cofactor of the GATA1 transcription factor.

<span class="mw-page-title-main">ZBTB32</span> Protein-coding gene in the species Homo sapiens

Zinc finger and BTB domain-containing protein 32 is a protein that in humans is encoded by the 1960 bp ZBTB32 gene. The 52 kDa protein is a transcriptional repressor and the gene is expressed in T and B cells upon activation, but also significantly in testis cells. It is a member of the Poxviruses and Zinc-finger (POZ) and Krüppel (POK) family of proteins, and was identified in multiple screens involving either immune cell tumorigenesis or immune cell development.

<span class="mw-page-title-main">Zinc finger protein 804A</span> Protein-coding gene in the species Homo sapiens

Zinc finger protein 804A is a protein that in humans is encoded by the ZNF804A gene. The human gene maps to chromosome 2 q32.1 and consists of 4 exons that code for a protein of 1210 amino acids.

<span class="mw-page-title-main">Archaeal transcription factor B</span> Protein family

Archaeal transcription factor B is a protein family of extrinsic transcription factors that guide the initiation of RNA transcription in organisms that fall under the domain of Archaea. It is homologous to eukaryotic TFIIB and, more distantly, to bacterial sigma factor. Like these proteins, it is involved in forming transcription preinitiation complexes. Its structure includes several conserved motifs which interact with DNA and other transcription factors, notably the single type of RNA polymerase that performs transcription in Archaea.

<span class="mw-page-title-main">Zinc finger protein 226</span> Protein-coding gene in the species Homo sapiens

Zinc finger protein 226 is a protein that in humans is encoded by the ZNF226 gene.

References

  1. Yamamoto M, Ko LJ, Leonard MW, Beug H, Orkin SH, Engel JD (October 1990). "Activity and tissue-specific expression of the transcription factor NF-E1 multigene family". Genes Dev. 4 (10): 1650–62. doi: 10.1101/gad.4.10.1650 . PMID   2249770.
  2. Evans T, Felsenfeld G (September 1989). "The erythroid-specific transcription factor Eryf1: a new finger protein". Cell. 58 (5): 877–85. doi:10.1016/0092-8674(89)90940-9. PMID   2776214. S2CID   46374550.
  3. 1 2 Omichinski JG, Clore GM, Schaad O, Felsenfeld G, Trainor C, Appella E, Stahl SJ, Gronenborn AM (July 1993). "NMR structure of a specific DNA complex of Zn-containing DNA binding domain of GATA-1". Science. 261 (5120): 438–46. Bibcode:1993Sci...261..438O. doi:10.1126/science.8332909. PMID   8332909.
  4. Liew CK, Simpson RJ, Kwan AH, Crofts LA, Loughlin FE, Matthews JM, Crossley M, Mackay JP (March 2005). "Zinc fingers as protein recognition motifs: structural basis for the GATA-1/friend of GATA interaction". PNAS USA. 102 (3): 583–8. Bibcode:2005PNAS..102..583L. doi: 10.1073/pnas.0407511102 . PMC   545545 . PMID   15644435.
This article incorporates text from the public domain Pfam and InterPro: IPR000679