W-box

Last updated

The W box is a deoxyribonucleic acid (DNA) cis-regulatory element sequence, (T)TGAC(C/T), which is recognized by the family of WRKY transcription factors. [1] [2]

Contents

Functionality and conservation of the W-box element across plant species has been shown by gel shift experiments, random binding site selection, yeast one-hybrid screens and co-transfection assays performed with many different WRKY proteins. In silico-based studies together with functional studies of plant promoters have identified clusters of W-boxes in stress-inducible promoters. The binding of WRKY proteins to W-boxes is a feature of both biotic and abiotic stress responses, together with other plant processes such as germination. [3] It has also been shown that multiple W-boxes have a synergistic effect on transcription.

Almost all WRKY transcription factors bind preferentially to W-boxes, and since their discovery, this has raised the question as to how they show specificity for the promoters of their target genes. [2] Ciolkowski et al. (2008) showed that although the W-box core is required, adjacent sequences also play a role in determining binding-site preference. [4] Recent evidence suggests that the TGAC core is more degenerate, composed of a guanine adenine cytosine (GAC) core, and the upstream thymine and downstream pyrimidine flanking sequences help dictate recognition by specific WRKY factors. [5] Basic residues of the WRKY protein domain also are believed to recognize the phosphate backbone of the cis-element. [5]

Recently, Yamasaki et al. have determined the solution structure of the C-terminal WRKY domain of Arabidopsis WRKY4 in complex with the W-box DNA by NMR. [6] They found that a four-stranded β-sheet enters the major groove of DNA in a structure they called the β-wedge, where the sheet is nearly perpendicular to the DNA helical ais. As predicted amino acids in the conserved WRKYGQK signature motif contact the W-box DNA.

Related Research Articles

<span class="mw-page-title-main">Promoter (genetics)</span> Region of DNA encouraging transcription

In genetics, a promoter is a sequence of DNA to which proteins bind to initiate transcription of a single RNA transcript from the DNA downstream of the promoter. The RNA transcript may encode a protein (mRNA), or can have a function in and of itself, such as tRNA or rRNA. Promoters are located near the transcription start sites of genes, upstream on the DNA . Promoters can be about 100–1000 base pairs long, the sequence of which is highly dependent on the gene and product of transcription, type or class of RNA polymerase recruited to the site, and species of organism.

<span class="mw-page-title-main">Transcription factor</span> Protein that regulates the rate of DNA transcription

In molecular biology, a transcription factor (TF) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The function of TFs is to regulate—turn on and off—genes in order to make sure that they are expressed in the desired cells at the right time and in the right amount throughout the life of the cell and the organism. Groups of TFs function in a coordinated fashion to direct cell division, cell growth, and cell death throughout life; cell migration and organization during embryonic development; and intermittently in response to signals from outside the cell, such as a hormone. There are up to 1600 TFs in the human genome. Transcription factors are members of the proteome as well as regulome.

A regulatory sequence is a segment of a nucleic acid molecule which is capable of increasing or decreasing the expression of specific genes within an organism. Regulation of gene expression is an essential feature of all living organisms and viruses.

In molecular biology, the TATA box is a sequence of DNA found in the core promoter region of genes in archaea and eukaryotes. The bacterial homolog of the TATA box is called the Pribnow box which has a shorter consensus sequence.

<span class="mw-page-title-main">DNA-binding protein</span> Proteins that bind with DNA, such as transcription factors, polymerases, nucleases and histones

DNA-binding proteins are proteins that have DNA-binding domains and thus have a specific or general affinity for single- or double-stranded DNA. Sequence-specific DNA-binding proteins generally interact with the major groove of B-DNA, because it exposes more functional groups that identify a base pair. However, there are some known minor groove DNA-binding ligands such as netropsin, distamycin, Hoechst 33258, pentamidine, DAPI and others.

<span class="mw-page-title-main">Repressor</span> Sort of RNA-binding protein in molecular genetics

In molecular genetics, a repressor is a DNA- or RNA-binding protein that inhibits the expression of one or more genes by binding to the operator or associated silencers. A DNA-binding repressor blocks the attachment of RNA polymerase to the promoter, thus preventing transcription of the genes into messenger RNA. An RNA-binding repressor binds to the mRNA and prevents translation of the mRNA into protein. This blocking or reducing of expression is called repression.

In molecular biology, a CCAAT box is a distinct pattern of nucleotides with GGCCAATCT consensus sequence that occur upstream by 60–100 bases to the initial transcription site. The CAAT box signals the binding site for the RNA transcription factor, and is typically accompanied by a conserved consensus sequence. It is an invariant DNA sequence at about minus 70 base pairs from the origin of transcription in many eukaryotic promoters. Genes that have this element seem to require it for the gene to be transcribed in sufficient quantities. It is frequently absent from genes that encode proteins used in virtually all cells. This box along with the GC box is known for binding general transcription factors. Both of these consensus sequences belong to the regulatory promoter. Full gene expression occurs when transcription activator proteins bind to each module within the regulatory promoter. Protein specific binding is required for the CCAAT box activation. These proteins are known as CCAAT box binding proteins/CCAAT box binding factors.

A DNA-binding domain (DBD) is an independently folded protein domain that contains at least one structural motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence or have a general affinity to DNA. Some DNA-binding domains may also include nucleic acids in their folded structure.

<span class="mw-page-title-main">Regulator gene</span>

A regulator gene, regulator, or regulatory gene is a gene involved in controlling the expression of one or more other genes. Regulatory sequences, which encode regulatory genes, are often at the five prime end (5') to the start site of transcription of the gene they regulate. In addition, these sequences can also be found at the three prime end (3') to the transcription start site. In both cases, whether the regulatory sequence occurs before (5') or after (3') the gene it regulates, the sequence is often many kilobases away from the transcription start site. A regulator gene may encode a protein, or it may work at the level of RNA, as in the case of genes encoding microRNAs. An example of a regulator gene is a gene that codes for a repressor protein that inhibits the activity of an operator.

Cis-regulatory elements (CREs) or Cis-regulatory modules (CRMs) are regions of non-coding DNA which regulate the transcription of neighboring genes. CREs are vital components of genetic regulatory networks, which in turn control morphogenesis, the development of anatomy, and other aspects of embryonic development, studied in evolutionary developmental biology.

<span class="mw-page-title-main">Apetala 2</span> Protein in Arabidopsis

Apetala 2(AP2) is a gene and a member of a large family of transcription factors, the AP2/EREBP family. In Arabidopsis thaliana AP2 plays a role in the ABC model of flower development. It was originally thought that this family of proteins was plant-specific; however, recent studies have shown that apicomplexans, including the causative agent of malaria, Plasmodium falciparum encode a related set of transcription factors, called the ApiAP2 family.

<span class="mw-page-title-main">RFX1</span> Protein-coding gene in the species Homo sapiens

MHC class II regulatory factor RFX1 is a protein that, in humans, is encoded by the RFX1 gene located on the short arm of chromosome 19.

<span class="mw-page-title-main">B3 domain</span> DNA binding domain

The B3 DNA binding domain (DBD) is a highly conserved domain found exclusively in transcription factors combined with other domains. It consists of 100-120 residues, includes seven beta strands and two alpha helices that form a DNA-binding pseudobarrel protein fold ; it interacts with the major groove of DNA.

<span class="mw-page-title-main">Ethylene-responsive element binding protein</span> Protein family

Ethylene-responsive element binding protein(EREBP) is a homeobox gene from Arabidopsis thaliana and other plants which encodes a transcription factor. EREBP is responsible in part for mediating the response in plants to the plant hormone ethylene.

The Epstein–Barr virus nuclear antigen 2 (EBNA-2) is one of the six EBV viral nuclear proteins expressed in latently infected B lymphocytes is a transactivator protein. EBNA2 is involved in the regulation of latent viral transcription and contributes to the immortalization of EBV infected cells. EBNA2 acts as an adapter molecule that binds to cellular sequence-specific DNA-binding proteins, JK recombination signal-binding protein (RBP-JK), and PU.1 as well as working with multiple members of the RNA polymerase II transcription complex.

<span class="mw-page-title-main">B recognition element</span>

The B recognition element (BRE) is a DNA sequence found in the promoter region of most genes in eukaryotes and Archaea. The BRE is a cis-regulatory element that is found immediately near TATA box, and consists of 7 nucleotides. There are two sets of BREs: one (BREu) found immediately upstream of the TATA box, with the consensus SSRCGCC; the other (BREd) found around 7 nucleotides downstream, with the consensus RTDKKKK.

<span class="mw-page-title-main">Squamosa promoter binding protein</span>

The SQUAMOSA promoter binding protein-like family of transcription factors are defined by a plant-specific DNA-binding domain. The founding member of the family was identified based on its specific in vitro binding to the promoter of the snapdragon SQUAMOSA gene. SBP proteins are thought to be transcriptional activators.

<span class="mw-page-title-main">WRKY protein domain</span> Protein domain

The WRKY domain is found in the WRKY transcription factor family, a class of transcription factors. The WRKY domain is found almost exclusively in plants although WRKY genes appear present in some diplomonads, social amoebae and other amoebozoa, and fungi incertae sedis. They appear absent in other non-plant species. WRKY transcription factors have been a significant area of plant research for the past 20 years. The WRKY DNA-binding domain recognizes the W-box (T)TGAC(C/T) cis-regulatory element.

WRKY transcription factors are proteins that bind DNA. They are transcription factors that regulate many processes in plants and algae (Viridiplantae), such as the responses to biotic and abiotic stresses, senescence, seed dormancy and seed germination and some developmental processes but also contribute to secondary metabolism.

LUX or Phytoclock1 (PCL1) is a gene that codes for LUX ARRHYTHMO, a protein necessary for circadian rhythms in Arabidopsis thaliana. LUX protein associates with Early Flowering 3 (ELF3) and Early Flowering 4 (ELF4) to form the Evening Complex (EC), a core component of the Arabidopsis repressilator model of the plant circadian clock. The LUX protein functions as a transcription factor that negatively regulates Pseudo-Response Regulator 9 (PRR9), a core gene of the Midday Complex, another component of the Arabidopsis repressilator model. LUX is also associated with circadian control of hypocotyl growth factor genes PHYTOCHROME INTERACTING FACTOR 4 (PIF4) and PHYTOCHROME INTERACTING FACTOR 5 (PIF5).

References

  1. Rushton, Paul; Macdonald, H.; Huttly, A.K.; Lazarus, C.M.; Hooley, R (1995). "Members of a new family of DNA-binding proteins bind to a conserved cis-element in the promoters of alpha-Amy2 genes". Plant Molecular Biology. 29 (4): 29: 691–702. doi:10.1007/bf00041160. PMID   8541496. S2CID   21178824.
  2. 1 2 Rushton PJ, Somssich IE, Ringler P, Shen QJ (May 2010). "WRKY transcription factors". Trends Plant Sci. 15 (5): 247–58. doi:10.1016/j.tplants.2010.02.006. PMID   20304701.
  3. Rushton, Paul. "The Lab of Dr. Paul Rushton". wordpress.com. Retrieved 17 June 2013.
  4. Ciolkowski, I.; Wanke D; Birkenbihl RP; Somssich IE. (2008). "Studies on DNA-binding selectivity of WRKY transcription factors lend structural clues into WRKY-domain function". Plant Mol Biol. 68 (1–2): 81–92. doi:10.1007/s11103-008-9353-1. PMC   2493524 . PMID   18523729.
  5. 1 2 Brand; Fischer; Harter; Kohlbacher; Wanke (2013). "Elucidating the evolutionary conserved DNA-binding specificities of WRKY transcription factors by molecular dynamics and in vitro binding assays". Nucleic Acids Research. 41 (21): 9764–9778. doi:10.1093/nar/gkt732. PMC   3834811 . PMID   23975197.
  6. Yamasaki, K.; Kigawa T; Watanabe S; Inoue M; Yamasaki T; Seki M; Shinozaki K; Yokoyama S. (2012). "Structural basis for sequence-specific DNA recognition by an Arabidopsis WRKY transcription factor". J. Biol. Chem. 287 (10): 7683–91. doi: 10.1074/jbc.M111.279844 . PMC   3293589 . PMID   22219184.

See also