CUT domain

Last updated
CUT

PDB 2csf EBI.jpg

solution structure of the second cut domain of human satb2
Identifiers
Symbol CUT
Pfam PF02376
InterPro IPR003350
SCOP 1wh6
SUPERFAMILY 1wh6

In molecular biology, the CUT domain (also known as ONECUT) is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, which is often found downstream of the CUT domain. Proteins display two modes of DNA binding, which hinge on the homeodomain and on the linker that separates it from the CUT domain, and two modes of transcriptional stimulation, which hinge on the homeodomain. [1] [2]

DNA Molecule that encodes the genetic instructions used in the development and functioning of all known organisms and many viruses

Deoxyribonucleic acid is a molecule composed of two chains that coil around each other to form a double helix carrying the genetic instructions used in the growth, development, functioning, and reproduction of all known organisms and many viruses. DNA and ribonucleic acid (RNA) are nucleic acids; alongside proteins, lipids and complex carbohydrates (polysaccharides), nucleic acids are one of the four major types of macromolecules that are essential for all known forms of life.

Molecular binding is an attractive interaction between two molecules that results in a stable association in which the molecules are in close proximity to each other. It often but not always involves some chemical bonding.

Protein biological molecule consisting of chains of amino acid residues

Proteins are large biomolecules, or macromolecules, consisting of one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, responding to stimuli, providing structure to cells and organisms, and transporting molecules from one location to another. Proteins differ from one another primarily in their sequence of amino acids, which is dictated by the nucleotide sequence of their genes, and which usually results in protein folding into a specific three-dimensional structure that determines its activity.

Related Research Articles

Homeobox

A homeobox is a DNA sequence, around 180 base pairs long, found within genes that are involved in the regulation of patterns of anatomical development (morphogenesis) in animals, fungi and plants. These genes encode homeodomain protein products that are transcription factors sharing a characteristic protein fold structure that binds DNA. The "homeo-" prefix in the words "homeobox" and "homeodomain" stems from the mutational phenotype known as "homeosis", which is frequently observed when these genes are mutated in animals. Homeosis is a term coined by William Bateson to describe the outright replacement of a discrete body part with another body part. Homeobox genes are not only found in animals, but have also been found in fungi, for example the unicellular yeasts, in plants, and numerous single cell eukaryotes.

Helix-turn-helix

In proteins, the helix-turn-helix (HTH) is a major structural motif capable of binding DNA. Each monomer incorporates two α helices, joined by a short strand of amino acids, that bind to the major groove of DNA. The HTH motif occurs in many proteins that regulate gene expression. It should not be confused with the helix-loop-helix motif.

Hepatocyte nuclear factors (HNFs) are a group of phylogenetically unrelated transcription factors that regulate the transcription of a diverse group of genes into proteins. These proteins include blood clotting factors and in addition, enzymes and transporters involved with glucose, cholesterol, and fatty acid transport and metabolism.

The W box is a deoxyribonucleic acid (DNA) cis-regulatory element sequence, (T)TGAC(C/T), which is recognized by the family of WRKY transcription factors.

Apetala 2

Apetala 2(AP2) is a gene and a member of a large family of transcription factors, the AP2/EREBP family. In Arabidopsis thaliana AP2 plays a role in the ABC model of flower development. It was originally thought that this family of proteins was plant-specific; however, recent studies have shown that apicomplexans, including the causative agent of malaria, Plasmodium falciparum encode a related set of transcription factors, called the ApiAP2 family.

TCF3 protein-coding gene in the species Homo sapiens

Transcription factor 3, also known as TCF3, is a protein that in humans is encoded by the TCF3 gene. TCF3 has been shown to directly enhance Hes1 expression.

POU2F1 protein-coding gene in the species Homo sapiens

POU domain, class 2, transcription factor 1 is a protein that in humans is encoded by the POU2F1 gene.

ILF3 protein-coding gene in the species Homo sapiens

Interleukin enhancer-binding factor 3 is a protein that in humans is encoded by the ILF3 gene.

PBX1 protein-coding gene in the species Homo sapiens

Pre-B-cell leukemia transcription factor 1 is a protein that in humans is encoded by the PBX1 gene.

Y box binding protein 1 protein-coding gene in the species Homo sapiens

Y box binding protein 1 also known as Y-box transcription factor or nuclease-sensitive element-binding protein 1 is a protein that in humans is encoded by the YBX1 gene.

POU is a family of proteins that have well-conserved homeodomains.

CUTL1 protein-coding gene in the species Homo sapiens

Cux1 is a homeodomain protein that in humans is encoded by the CUX1 gene.

MEIS1 protein-coding gene in the species Homo sapiens

Homeobox protein Meis1 is a protein that in humans is encoded by the MEIS1 gene.

LDB1 protein-coding gene in the species Homo sapiens

LIM domain-binding protein 1 is a protein that in humans is encoded by the LDB1 gene.

LHX3 protein-coding gene in the species Homo sapiens

LIM/homeobox protein Lhx3 is a protein that in humans is encoded by the LHX3 gene.

ILF2 protein-coding gene in the species Homo sapiens

Interleukin enhancer-binding factor 2 is a protein that in humans is encoded by the ILF2 gene.

B3 domain

The B3 DNA binding domain (DBD) is a highly conserved domain found exclusively in transcription factors, from higher plants combined with other domains. It consists of 100-120 residues, includes seven beta strands and two alpha helices that form a DNA-binding pseudobarrel protein fold ; it interacts with the major groove of DNA.

Ethylene-responsive element binding protein

Ethylene-responsive element binding protein(EREBP) is a homeobox gene from Arabidopsis thaliana and other plants which encodes a transcription factor. EREBP is responsible in part for mediating the response in plants to the plant hormone ethylene.

Squamosa promoter binding protein

The SQUAMOSA promoter binding protein-like family of transcription factors are defined by a plant-specific DNA-binding domain. The founding member of the family was identified based on its specific in vitro binding to the promoter of the snapdragon SQUAMOSA gene. SBP proteins are thought to be transcriptional activators.

WRKY protein domain

The WRKY domain is found in the WRKY transcription factor family, a class of transcription factors. The WRKY domain is found almost exclusively in plants although WRKY genes appear present in some diplomonads, social amoebae and other amoebozoa, and fungi incertae sedis. They appear absent in other non-plant species. WRKY transcription factors have been a significant area of plant research for the past 20 years. The WRKY DNA-binding domain recognizes the W-box (T)TGAC(C/T) cis-regulatory element.

References

  1. Lannoy VJ, Burglin TR, Rousseau GG, Lemaigre FP (May 1998). "Isoforms of hepatocyte nuclear factor-6 differ in DNA-binding properties, contain a bifunctional homeodomain, and define the new ONECUT class of homeodomain proteins". J. Biol. Chem. 273 (22): 13552–62. doi:10.1074/jbc.273.22.13552. PMID   9593691.
  2. Yamasaki, K.; Akiba, T.; Yamasaki, T.; Harata, K. (2007). "Structural basis for recognition of the matrix attachment region of DNA by transcription factor SATB1". Nucleic Acids Research. 35 (15): 5073–5084. doi:10.1093/nar/gkm504. PMC   1976457 Lock-green.svg. PMID   17652321.

This article incorporates text from the public domain Pfam and InterPro IPR003350

The public domain consists of all the creative works to which no exclusive intellectual property rights apply. Those rights may have expired, been forfeited, expressly waived, or may be inapplicable.

Pfam Database of protein families

Pfam is a database of protein families that includes their annotations and multiple sequence alignments generated using hidden Markov models. The most recent version, Pfam 31.0, was released in March 2017 and contains 16,712 families.

InterPro is a database of protein families, domains and functional sites in which identifiable features found in known proteins can be applied to new protein sequences in order to functionally characterise them.