A major contributor to this article appears to have a close connection with its subject.(May 2011) |
The Sleeping Beauty transposon system is a synthetic DNA transposon designed to introduce precisely defined DNA sequences into the chromosomes of vertebrate animals for the purposes of introducing new traits and to discover new genes and their functions. It is a Tc1/mariner-type system, with the transposase resurrected from multiple inactive fish sequences.
The Sleeping Beauty transposon system is composed of a Sleeping Beauty (SB) transposase and a transposon that was designed in 1997 to insert specific sequences of DNA into genomes of vertebrate animals. DNA transposons translocate from one DNA site to another in a simple, cut-and-paste manner (Fig. 1). Transposition is a precise process in which a defined DNA segment is excised from one DNA molecule and moved to another site in the same or different DNA molecule or genome. [1]
As do all other Tc1/mariner-type transposases, SB transposase inserts a transposon into a TA dinucleotide base pair in a recipient DNA sequence. [2] The insertion site can be elsewhere in the same DNA molecule, or in another DNA molecule (or chromosome). In mammalian genomes, including humans, there are approximately 200 million TA sites. The TA insertion site is duplicated in the process of transposon integration. This duplication of the TA sequence is a hallmark of transposition and used to ascertain the mechanism in some experiments. However, a recent study indicated that SB also integrates into non-TA dinucleotides at a low frequency. [3] The transposase can be encoded either within the transposon (e.g., the putative transposon shown in Fig. 2) or the transposase can be supplied by another source, in which case the transposon becomes a non-autonomous element. Non-autonomous transposons (e.g., Fig. 1) are most useful as genetic tools because after insertion they cannot independently continue to excise and re-insert. All of the DNA transposons identified in the human genome and other mammalian genomes are non-autonomous because even though they contain transposase genes, the genes are non-functional and unable to generate a transposase that can mobilize the transposon.
This resurrected transposase gene was named "Sleeping Beauty (SB)" because it was brought back to activity from a long evolutionary sleep. [4] The SB transposon system is synthetic in that the SB transposase was re-constructed from extinct (fossil) transposase sequences belonging to the Tc1/mariner class of transposons [5] [6] found in the genomes of salmonid fish. [7] As in humans, where about 20,000 inactivated Tc1/mariner-type transposons comprise almost 3% of the human genome, [8] [9] the transposase genes found in fish have been inactive for more than 10 million years due to accumulated mutations. The reconstruction of SB transposase was based on the concept that there was a primordial Tc1-like transposon that was the ancestor to the sequences found in fish genomes. Although there were many sequences that looked like Tc-1 transposons in all the fish genomes studied, the transposon sequences were all inactive due to mutations. By assuming that the variations in sequences were due to independent mutations that accumulated in the different transposons, a putative ancestral transposon (Fig. 2) was postulated. [10]
The construction for the transposase began by fusing portions of two inactive transposon sequences from Atlantic salmon (Salmo salar) and one inactive transposon sequence from rainbow trout (Oncorhynchus mykiss) and then repairing small deficits in the functional domains of the transposase enzyme (Fig. 3). Each amino acid in the first completed transposase, called SB10, was determined by a “majority-rule consensus sequence” based on 12 partial genes found in eight fish species. The first steps (1->3 in Fig. 3) were to restore a complete protein by filling in gaps in the sequence and reversing termination codons that would keep the putative 360-amino acid polypeptide from being synthesized. The next step (4 in Fig. 3) was to reverse mutations in the nuclear localization signal (NLS) that is required to import the transposase enzyme from the cytoplasm where it is made to the nucleus where it acts. The amino-terminus of the transposase, which contains the DNA-binding motifs for recognition of the direct repeats (DRs), was restored in steps 5->8. The last two steps restored the catalytic domain, which features conserved aspartic acid (D) and glutamic acid (E) amino acids with specific spacing that are found in integrases and recombinases. [11] The result was SB10, which contains all of the motifs required for function. [4]
SB10 transposase has been improved over the decade since its construction by increasing the consensus with a greater number of extinct Tc1 transposon sequences and testing various combinations of changes. [12] [13] [14] [15] [16] [17] Further work has shown that the DNA-binding domain consists of two paired sequences, which are homologous to sequence motifs found in certain transcription factors. The paired subdomains in SB transposase were designated PAI and RED. The PAI subdomain plays a dominant role in recognition of the DR sequences in the transposon. The RED subdomain overlaps with the nuclear localization signal, but its function remains unclear. [18] The most recent version of SB transposase, SB100X, has about 100 times the activity of SB10 as determined by transposition assays of antibiotic-resistance genes conducted in tissue cultured human HeLa cells. [16] The International Society for Molecular and Cell Biology and Biotechnology Protocols and Research (ISMCBBPR) named SB100X the molecule of the year for 2009 for recognition of the potential it has in future genome engineering. [19]
The transposon recognized by SB transposase was named T because it was isolated from the genome of another salmond fish, Tanichthys albonubes. The transposon consists of a genetic sequence of interest that is flanked by inverted repeats (IRs) that themselves contain short direct repeats (DR) (tandem arrowheads IR-DR in Figs. 1 and 2). T had the closest IR/DR sequence to the consensus sequence for the extinct Tc-1 like transposons in fish. The consensus transposon has IRs of 231 base pairs. The innermost DRs are 29 base pairs long whereas the outermost DRs are 31 base pairs long. The difference in length is critical for maximal transposition rates. [20] The original T transposon component of the SB transposon system has been improved with minor changes to conform to the consensus of many related extinct and active transposons. [20] [21]
Over the past decade, SB transposons have been developed as non-viral vectors for introduction of genes into genomes of vertebrate animals and for gene therapy. The genetic cargo can be an expression cassette—a transgene and associated elements that confer transcriptional regulation for expression at a desired level in specific tissue(s). An alternative use of SB transposons is to discover functions of genes, especially those that cause cancer, [22] [23] by delivering DNA sequences that maximally disrupt expression of genes close to the insertion site. This process is referred to as insertional mutagenesis or transposon mutagenesis. When a gene is inactivated by insertion of a transposon (or other mechanism), that gene is “knocked out”. Knockout mice and knockout rats have been made with the SB system. [24] [25] Figure 4 illustrates these two uses of SB transposons.
For either gene delivery or gene disruption, SB transposons combine the advantages of viruses and naked DNA. Viruses have been evolutionarily selected based on their abilities to infect and replicate in new host cells. Simultaneously, cells have evolved major molecular defense mechanisms to protect themselves against viral infections. For some applications of genome engineering such as some forms of gene therapy, [26] [27] [28] avoiding the use of viruses is also important for social and regulatory reasons. The use of non-viral vectors avoids many, but not all, of the defenses that cells employ against vectors.
Plasmids, the circular DNAs shown in Fig. 1, are generally used for non-viral gene delivery. However, there are two major problems with most methods for delivering DNA to cellular chromosomes using plasmids, the most common form of non-viral gene delivery. First, expression of transgenes from plasmids is brief due to lack of integration and due to cellular responses that turn off expression. Second, uptake of plasmid molecules into cells is difficult and inefficient. The Sleeping Beauty Transposon System was engineered to overcome the first problem. DNA transposons precisely insert defined DNA sequences (Fig. 1) almost randomly into host genomes thereby increasing the longevity of gene expression (even through multiple generations). Moreover, transposition avoids the formation of multiple, tandem integrations, which often results in switching off expression of the transgene. Currently, insertion of transgenes into chromosomes using plasmids is much less efficient than using viruses. However, by using powerful promoters to regulate expression of a transgene, delivery of transposons to a few cells can provide useful levels of secreted gene products for an entire animal. [29] [30]
Arguably the most exciting potential application of Sleeping Beauty transposons will be for human gene therapy. The widespread human application of gene therapy in first-world nations as well as countries with developing economies can be envisioned if the costs of the vector system are affordable. Because the SB system is composed solely of DNA, the costs of production and delivery are considerably reduced compared to viral vectors. The first clinical trials using SB transposons in genetically modified T cells will test the efficacy of this form of gene therapy in patients at risk of death from advanced malignancies. [31]
A transposable element (TE), also transposon, or jumping gene, is a type of mobile genetic element, a nucleic acid sequence in DNA that can change its position within a genome, sometimes creating or reversing mutations and altering the cell's genetic identity and genome size.
A transposase is any of a class of enzymes capable of binding to the end of a transposon and catalysing its movement to another part of a genome, typically by a cut-and-paste mechanism or a replicative mechanism, in a process known as transposition. The word "transposase" was first coined by the individuals who cloned the enzyme required for transposition of the Tn3 transposon. The existence of transposons was postulated in the late 1940s by Barbara McClintock, who was studying the inheritance of maize, but the actual molecular basis for transposition was described by later groups. McClintock discovered that some segments of chromosomes changed their position, jumping between different loci or from one chromosome to another. The repositioning of these transposons allowed other genes for pigment to be expressed. Transposition in maize causes changes in color; however, in other organisms, such as bacteria, it can cause antibiotic resistance. Transposition is also important in creating genetic diversity within species and generating adaptability to changing living conditions.
P elements are transposable elements that were discovered in Drosophila as the causative agents of genetic traits called hybrid dysgenesis. The transposon is responsible for the P trait of the P element and it is found only in wild flies. They are also found in many other eukaryotes.
Insertion element is a short DNA sequence that acts as a simple transposable element. Insertion sequences have two major characteristics: they are small relative to other transposable elements and only code for proteins implicated in the transposition activity. These proteins are usually the transposase which catalyses the enzymatic reaction allowing the IS to move, and also one regulatory protein which either stimulates or inhibits the transposition activity. The coding region in an insertion sequence is usually flanked by inverted repeats. For example, the well-known IS911 is flanked by two 36bp inverted repeat extremities and the coding region has two genes partially overlapping orfA and orfAB, coding the transposase (OrfAB) and a regulatory protein (OrfA). A particular insertion sequence may be named according to the form ISn, where n is a number ; this is not the only naming scheme used, however. Although insertion sequences are usually discussed in the context of prokaryotic genomes, certain eukaryotic DNA sequences belonging to the family of Tc1/mariner transposable elements may be considered to be insertion sequences.
A transgene is a gene that has been transferred naturally, or by any of a number of genetic engineering techniques, from one organism to another. The introduction of a transgene, in a process known as transgenesis, has the potential to change the phenotype of an organism. Transgene describes a segment of DNA containing a gene sequence that has been isolated from one organism and is introduced into a different organism. This non-native segment of DNA may either retain the ability to produce RNA or protein in the transgenic organism or alter the normal function of the transgenic organism's genetic code. In general, the DNA is incorporated into the organism's germ line. For example, in higher vertebrates this can be accomplished by injecting the foreign DNA into the nucleus of a fertilized ovum. This technique is routinely used to introduce human disease genes or other genes of interest into strains of laboratory mice to study the function or pathology involved with that particular gene.
Mobile genetic elements (MGEs), sometimes called selfish genetic elements, are a type of genetic material that can move around within a genome, or that can be transferred from one species or replicon to another. MGEs are found in all organisms. In humans, approximately 50% of the genome are thought to be MGEs. MGEs play a distinct role in evolution. Gene duplication events can also happen through the mechanism of MGEs. MGEs can also cause mutations in protein coding regions, which alters the protein functions. These mechanisms can also rearrange genes in the host genome generating variation. These mechanism can increase fitness by gaining new or additional functions. An example of MGEs in evolutionary context are that virulence factors and antibiotic resistance genes of MGEs can be transported to share genetic code with neighboring bacteria. However, MGEs can also decrease fitness by introducing disease-causing alleles or mutations. The set of MGEs in an organism is called a mobilome, which is composed of a large number of plasmids, transposons and viruses.
In molecular biology, insertional mutagenesis is the creation of mutations in DNA by the addition of one or more base pairs. Such insertional mutations can occur naturally, mediated by viruses or transposons, or can be artificially created for research purposes in the lab.
Gene targeting is a biotechnological tool used to change the DNA sequence of an organism. It is based on the natural DNA-repair mechanism of Homology Directed Repair (HDR), including Homologous Recombination. Gene targeting can be used to make a range of sizes of DNA edits, from larger DNA edits such as inserting entire new genes into an organism, through to much smaller changes to the existing DNA such as a single base-pair change. Gene targeting relies on the presence of a repair template to introduce the user-defined edits to the DNA. The user will design the repair template to contain the desired edit, flanked by DNA sequence corresponding (homologous) to the region of DNA that the user wants to edit; hence the edit is targeted to a particular genomic region. In this way Gene Targeting is distinct from natural homology-directed repair, during which the ‘natural’ DNA repair template of the sister chromatid is used to repair broken DNA. The alteration of DNA sequence in an organism can be useful in both a research context – for example to understand the biological role of a gene – and in biotechnology, for example to alter the traits of an organism.
In the fields of bioinformatics and computational biology, Genome survey sequences (GSS) are nucleotide sequences similar to expressed sequence tags (ESTs) that the only difference is that most of them are genomic in origin, rather than mRNA.
Histone-lysine N-methyltransferase SETMAR is an enzyme that in humans is encoded by the SETMAR gene.
Transposon mutagenesis, or transposition mutagenesis, is a biological process that allows genes to be transferred to a host organism's chromosome, interrupting or modifying the function of an extant gene on the chromosome and causing mutation. Transposon mutagenesis is much more effective than chemical mutagenesis, with a higher mutation frequency and a lower chance of killing the organism. Other advantages include being able to induce single hit mutations, being able to incorporate selectable markers in strain construction, and being able to recover genes after mutagenesis. Disadvantages include the low frequency of transposition in living systems, and the inaccuracy of most transposition systems.
A knockout rat is a genetically engineered rat with a single gene turned off through a targeted mutation used for academic and pharmaceutical research. Knockout rats can mimic human diseases and are important tools for studying gene function and for drug discovery and development. The production of knockout rats was not economically or technically feasible until 2008.
Transposons are semi-parasitic DNA sequences which can replicate and spread through the host's genome. They can be harnessed as a genetic tool for analysis of gene and protein function. The use of transposons is well-developed in Drosophila and in Thale cress and bacteria such as Escherichia coli.
Miniature Inverted-repeat Transposable Elements (MITEs) are a group of non-autonomous Class II transposable elements. Being non-autonomous, MITEs cannot code for their own transposase. They exist within the genomes of animals, plants, fungi, bacteria and even viruses. MITEs are generally short elements with terminal inverted repeats and two flanking target site duplications (TSDs). Like other transposons, MITEs are inserted predominantly in gene-rich regions and this can be a reason that they affect gene expression and play important roles in accelerating eukaryotic evolution. Their high copy number in spite of small sizes has been a topic of interest.
The PiggyBac (PB) transposon system employs a genetically engineered transposase enzyme to insert a gene into a cell's genome. It is built upon the natural PiggyBac (PB) transposable element (transposon), enabling the back and forth movement of genes between chromosomes and genetic vectors such as plasmids through a "cut and paste" mechanism. During transposition, the PB transposase recognizes transposon-specific inverted terminal repeat sequences (ITRs) located on both ends of the transposon vector and efficiently moves the contents from the original sites and integrates them into TTAA chromosomal sites. The powerful activity of the PiggyBac transposon system enables genes of interest between the two ITRs in the PB vector to be easily mobilized into target genomes. The TTAA-specific transposon piggyBac is rapidly becoming a highly useful transposon for genetic engineering of a wide variety of species, particularly insects. They were discovered in 1989 by Malcolm Fraser at the University of Notre Dame.
Self-complementary adeno-associated virus (scAAV) is a viral vector engineered from the naturally occurring adeno-associated virus (AAV) to be used as a tool for gene therapy. Use of recombinant AAV (rAAV) has been successful in clinical trials addressing a variety of diseases. This lab-made progeny of rAAV is termed "self-complementary" because the coding region has been designed to form an intra-molecular double-stranded DNA template. A rate-limiting step for the standard AAV genome involves the second-strand synthesis since the typical AAV genome is a single-stranded DNA template. However, this is not the case for scAAV genomes. Upon infection, rather than waiting for cell mediated synthesis of the second strand, the two complementary halves of scAAV will associate to form one double stranded DNA (dsDNA) unit that is ready for immediate replication and transcription. The caveat of this construct is that instead of the full coding capacity found in rAAV (4.7–6kb) scAAV can only hold about half of that amount (≈2.4kb).
Transposition is the process by which a specific genetic sequence, known as a transposon, is moved from one location of the genome to another. Simple, or conservative transposition, is a non-replicative mode of transposition. That is, in conservative transposition the transposon is completely removed from the genome and reintegrated into a new, non-homologous locus, the same genetic sequence is conserved throughout the entire process. The site in which the transposon is reintegrated into the genome is called the target site. A target site can be in the same chromosome as the transposon or within a different chromosome. Conservative transposition uses the "cut-and-paste" mechanism driven by the catalytic activity of the enzyme transposase. Transposase acts like DNA scissors; it is an enzyme that cuts through double-stranded DNA to remove the transposon, then transfers and pastes it into a target site.
DNA transposons are DNA sequences, sometimes referred to "jumping genes", that can move and integrate to different locations within the genome. They are class II transposable elements (TEs) that move through a DNA intermediate, as opposed to class I TEs, retrotransposons, that move through an RNA intermediate. DNA transposons can move in the DNA of an organism via a single-or double-stranded DNA intermediate. DNA transposons have been found in both prokaryotic and eukaryotic organisms. They can make up a significant portion of an organism's genome, particularly in eukaryotes. In prokaryotes, TE's can facilitate the horizontal transfer of antibiotic resistance or other genes associated with virulence. After replicating and propagating in a host, all transposon copies become inactivated and are lost unless the transposon passes to a genome by starting a new life cycle with horizontal transfer. DNA transposons do not randomly insert themselves into the genome, but rather show preference for specific sites.
Tc1/mariner is a class and superfamily of interspersed repeats DNA transposons. The elements of this class are found in all animals, including humans. They can also be found in protists and bacteria.
CRISPR-associated transposons or CASTs are mobile genetic elements (MGEs) that have evolved to make use of minimal CRISPR systems for RNA-guided transposition of their DNA. Unlike traditional CRISPR systems that contain interference mechanisms to degrade targeted DNA, CASTs lack proteins and/or protein domains responsible for DNA cleavage. Specialized transposon machinery, similar to that of the well characterized Tn7 transposon, complexes with the CRISPR RNA (crRNA) and associated Cas proteins for transposition. CAST systems have been characterized in a wide range of bacteria and make use of variable CRISPR configurations including Type I-F, Type I-B, Type I-C, Type I-D, Type I-E, Type IV, and Type V-K. MGEs remain an important part of genetic exchange by horizontal gene transfer and CASTs have been implicated in the exchange of antibiotic resistance and antiviral defense mechanisms, as well as genes involved in central carbon metabolism. These systems show promise for genetic engineering due to their programmability, PAM flexibility, and ability to insert directly into the host genome without double strand breaks requiring activation of host repair mechanisms. They also lack Cas1 and Cas2 proteins and so rely on other more complete CRISPR systems for spacer acquisition in trans.