Toprim domain | |||||||||
---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||
Symbol | Toprim | ||||||||
Pfam | PF01751 | ||||||||
Pfam clan | Toprim-like | ||||||||
InterPro | IPR006171 | ||||||||
SCOP2 | 2fcj / SCOPe / SUPFAM | ||||||||
|
Toprim catalytic core | |||||||||
---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||
Symbol | Toprim_N | ||||||||
Pfam | PF08275 | ||||||||
InterPro | IPR013264 | ||||||||
SCOP2 | 1dd9 / SCOPe / SUPFAM | ||||||||
|
AEP DNA primase, small subunit | |||||||||
---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||
Symbol | DNA_primase_S | ||||||||
Pfam | PF01896 | ||||||||
Pfam clan | AEP | ||||||||
InterPro | IPR002755 | ||||||||
SCOP2 | 1g71 / SCOPe / SUPFAM | ||||||||
|
AEP DNA primase, large subunit | |||||||||
---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||
Symbol | DNA_primase_lrg | ||||||||
Pfam | PF04104 | ||||||||
Pfam clan | CL0242 | ||||||||
InterPro | IPR007238 | ||||||||
SCOP2 | 1zt2 / SCOPe / SUPFAM | ||||||||
|
DNA primase is an enzyme involved in the replication of DNA and is a type of RNA polymerase. Primase catalyzes the synthesis of a short RNA (or DNA in some living organisms [1] ) segment called a primer complementary to a ssDNA (single-stranded DNA) template. After this elongation, the RNA piece is removed by a 5' to 3' exonuclease and refilled with DNA.
In bacteria, primase binds to the DNA helicase forming a complex called the primosome. Primase is activated by the helicase where it then synthesizes a short RNA primer approximately 11 ±1 nucleotides long, to which new nucleotides can be added by DNA polymerase. Archaeal and eukaryote primases are heterodimeric proteins with one large regulatory and one minuscule catalytic subunit. [2]
The RNA segments are first synthesized by primase and then elongated by DNA polymerase. [3] Then the DNA polymerase forms a protein complex with two primase subunits to form the alpha DNA Polymerase primase complex. Primase is one of the most error prone and slow polymerases. [3] Primases in organisms such as E. coli synthesize around 2000 to 3000 primers at the rate of one primer per second. [4] Primase also acts as a halting mechanism to prevent the leading strand from outpacing the lagging strand by halting the progression of the replication fork. [5] The rate determining step in primase is when the first phosphodiester bond is formed between two molecules of RNA. [3]
The replication mechanisms differ between different bacteria and viruses where the primase covalently link to helicase in viruses such as the T7 bacteriophage. [5] In viruses such as the herpes simplex virus (HSV-1), primase can form complexes with helicase. [6] The primase-helicase complex is used to unwind dsDNA (double-stranded) and synthesizes the lagging strand using RNA primers [6] The majority of primers synthesized by primase are two to three nucleotides long. [6]
There are two main types of primase: DnaG found in most bacteria, and the AEP (Archaeo-Eukaryote Primase) superfamily found in archaean and eukaryotic primases. While bacterial primases (DnaG-type) are composed of a single protein unit (a monomer) and synthesize RNA primers, AEP primases are usually composed of two different primase units (a heterodimer) and synthesize two-part primers with both RNA and DNA components. [7] While functionally similar, the two primase superfamilies evolved independently of each other.
The crystal structure of primase in E. coli with a core containing the DnaG protein was determined in the year 2000. [4] The DnaG and primase complex is cashew shaped and contains three subdomains. [4] The central subdomain forms a toprim fold which is made of a mixture five beta sheets and six alpha helices. [4] [8] The toprim fold is used for binding regulators and metals. The primase uses a phosphotransfer domain for the transfer coordination of metals, which makes it distinct from other polymerases. [4] The side subunits contain a NH2 and COOH-terminal made of alpha helixes and beta sheets. [4] The NH2 terminal interacts with a zinc binding domain and COOH-terminal region which interacts with DnaB-ID. [4]
The Toprim fold is also found in topoisomerase and mitochrondrial Twinkle primase/helicase. [8] Some DnaG-like (bacteria-like; InterPro : IPR020607 ) primases have been found in archaeal genomes. [9]
Eukaryote and archaeal primases tend to be more similar to each other, in terms of structure and mechanism, than they are to bacterial primases. [10] [11] The archaea-eukaryotic primase (AEP) superfamily, which most eukaryal and archaeal primase catalytic subunits belong to, has recently been redefined as a primase-polymerase family in recognition of the many other roles played by enzymes in this family. [12] This classification also emphasizes the broad origins of AEP primases; the superfamily is now recognized as transitioning between RNA and DNA functions. [13]
Archaeal and eukaryote primases are heterodimeric proteins with one large regulatory (human PRIM2, p58) and one small catalytic subunit (human PRIM1, p48/p49). [2] The large subunit contains a N-terminal 4Fe–4S cluster, split out in some archaea as PriX/PriCT. [14] The large subunit is implicated in improving the activity and specificity of the small subunit. For example, removing the part corresponding to the large subunit in a fusion protein PolpTN2 results in a slower enzyme with reverse transcriptase activity. [13]
The AEP family of primase-polymerases has diverse features beyond making only primers. In addition to priming DNA during replication, AEP enzymes may have additional functions in the DNA replication process, such as polymerization of DNA or RNA, terminal transfer, translesion synthesis (TLS), non-homologous end joining (NHEJ), [12] and possibly in restarting stalled replication forks. [15] Primases typically synthesize primers from ribonucleotides (NTPs); however, primases with polymerase capabilities also have an affinity for deoxyribonucleotides (dNTPs). [16] [11] Primases with terminal transferase functionality are capable of adding nucleotides to the 3’ end of a DNA strand independently of a template. Other enzymes involved in DNA replication, such as helicases, may also exhibit primase activity. [17]
Human PrimPol (ccdc111 [16] ) serves both primase and polymerase functions, like many archaeal primases; exhibits terminal transferase activity in the presence of manganese; and plays a significant role in translesion synthesis [18] and in restarting stalled replication forks. PrimPol is actively recruited to damaged sites through its interaction with RPA, an adapter protein that facilitates DNA replication and repair. [15] PrimPol has a zinc finger domain similar to that of some viral primases, which is essential for translesion synthesis and primase activity and may regulate primer length. [18] Unlike most primases, PrimPol is uniquely capable of starting DNA chains with dNTPs. [16]
PriS, the archaeal primase small subunit, has a role in translesion synthesis (TLS) and can bypass common DNA lesions. Most archaea lack the specialized polymerases that perform TLS in eukaryotes and bacteria. [19] PriS alone preferentially synthesizes strings of DNA; but in combination with PriL, the large subunit, RNA polymerase activity is increased. [20]
In Sulfolobus solfataricus, the primase heterodimer PriSL can act as a primase, polymerase, and terminal transferase. PriSL is thought to initiate primer synthesis with NTPs and then switch to dNTPs. The enzyme can polymerize RNA or DNA chains, with DNA products reaching as long as 7000 nucleotides (7 kb). It is suggested that this dual functionality may be a common feature of archaeal primases. [11]
AEP multifutional primases also appear in bacteria and phages that infect them. They can display novel domain organizations with domains that bring even more functions beyond polymerization. [14]
Bacterial LigD ( A0R3R7 ) is primarily involved in the NHEJ pathway. It has an AEP superfamily polymerase/primase domain, a 3'-phosphoesterase domain, and a ligase domain. It is also capable of primase, DNA and RNA polymerase, and terminal transferase activity. DNA polymerization activity can produce chains over 7000 nucleotides (7 kb) in length, while RNA polymerization produces chains up to 1 kb long. [21]
AEP enzymes are widespread, and can be found encoded in mobile genetic elements including virus/phages and plasmids. They either use them as a sole replication protein or in combination with other replication-associated proteins, such as helicases and, less frequently, DNA polymerases. [22] Whereas the presence of AEP in eukaryotic and archaeal viruses is expected in that they mirror their hosts, [22] bacterial viruses and plasmids also as frequently encode AEP-superfamily enzymes as they do DnaG-family primases. [14] A great diversity of AEP families has been uncovered in various bacterial plasmids by comparative genomics surveys. [14] Their evolutionary history is currently unknown, as these found in bacteria and bacteriophages appear too different from their archaeo-eukaryotic homologs for a recent horizontal gene transfer. [22]
MCM-like helicase in Bacillus cereus strain ATCC 14579 (BcMCM; Q81EV1 ) is an SF6 helicase fused with an AEP primase. The enzyme has both primase and polymerase functions in addition to helicase function. The gene coding for it is found in a prophage. [17] It bears homology to ORF904 of plasmid pRN1 from Sulfolobus islandicus, which has an AEP PrimPol domain. [23] Vaccinia virus D5 and HSV Primase are examples of AEP-helicase fusion as well. [12] [6]
PolpTN2 is an Archaeal primase found in the TN2 plasmid. A fusion of domains homologous to PriS and PriL, it exhibits both primase and DNA polymerase activity, as well as terminal transferase function. Unlike most primases, PolpTN2 forms primers composed exclusively of dNTPs. [13] Unexpectedly, when the PriL-like domain was truncated, PolpTN2 could also synthesize DNA on the RNA template, i.e., acted as an RNA-dependent DNA polymerase (reverse transcriptase). [13]
Even DnaG primases can have extra functions, if given the right domains. The T7 phage gp4 is a DnaG primase-helicase fusion, and performs both functions in replication. [5]
In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all living organisms acting as the most essential part of biological inheritance. This is essential for cell division during growth and repair of damaged tissues, while it also ensures that each of the new cells receives its own copy of the DNA. The cell possesses the distinctive property of division, which makes replication of DNA essential.
In biochemistry, a polymerase is an enzyme that synthesizes long chains of polymers or nucleic acids. DNA polymerase and RNA polymerase are used to assemble DNA and RNA molecules, respectively, by copying a DNA template strand using base-pairing interactions or RNA by half ladder replication.
Transcription is the process of copying a segment of DNA into RNA. Some segments of DNA are transcribed into RNA molecules that can encode proteins, called messenger RNA (mRNA). Other segments of DNA are transcribed into RNA molecules called non-coding RNAs (ncRNAs).
In molecular biology, RNA polymerase, or more specifically DNA-directed/dependent RNA polymerase (DdRP), is an enzyme that catalyzes the chemical reactions that synthesize RNA from a DNA template.
A DNA polymerase is a member of a family of enzymes that catalyze the synthesis of DNA molecules from nucleoside triphosphates, the molecular precursors of DNA. These enzymes are essential for DNA replication and usually work in groups to create two identical DNA duplexes from a single original DNA duplex. During this process, DNA polymerase "reads" the existing DNA strands to create two new strands that match the existing ones. These enzymes catalyze the chemical reaction
Helicases are a class of enzymes thought to be vital to all organisms. Their main function is to unpack an organism's genetic material. Helicases are motor proteins that move directionally along a nucleic acid phosphodiester backbone, separating two hybridized nucleic acid strands, using energy from ATP hydrolysis. There are many helicases, representing the great variety of processes in which strand separation must be catalyzed. Approximately 1% of eukaryotic genes code for helicases.
DNA polymerase I is an enzyme that participates in the process of prokaryotic DNA replication. Discovered by Arthur Kornberg in 1956, it was the first known DNA polymerase. It was initially characterized in E. coli and is ubiquitous in prokaryotes. In E. coli and many other bacteria, the gene that encodes Pol I is known as polA. The E. coli Pol I enzyme is composed of 928 amino acids, and is an example of a processive enzyme — it can sequentially catalyze multiple polymerisation steps without releasing the single-stranded template. The physiological function of Pol I is mainly to support repair of damaged DNA, but it also contributes to connecting Okazaki fragments by deleting RNA primers and replacing the ribonucleotides with DNA.
DnaG is a bacterial DNA primase and is encoded by the dnaG gene. The enzyme DnaG, and any other DNA primase, synthesizes short strands of RNA known as oligonucleotides during DNA replication. These oligonucleotides are known as primers because they act as a starting point for DNA synthesis. DnaG catalyzes the synthesis of oligonucleotides that are 10 to 60 nucleotides long, however most of the oligonucleotides synthesized are 11 nucleotides. These RNA oligonucleotides serve as primers, or starting points, for DNA synthesis by bacterial DNA polymerase III. DnaG is important in bacterial DNA replication because DNA polymerase cannot initiate the synthesis of a DNA strand, but can only add nucleotides to a preexisting strand. DnaG synthesizes a single RNA primer at the origin of replication. This primer serves to prime leading strand DNA synthesis. For the other parental strand, the lagging strand, DnaG synthesizes an RNA primer every few kilobases (kb). These primers serve as substrates for the synthesis of Okazaki fragments.
DNA polymerase III holoenzyme is the primary enzyme complex involved in prokaryotic DNA replication. It was discovered by Thomas Kornberg and Malcolm Gefter in 1970. The complex has high processivity and, specifically referring to the replication of the E.coli genome, works in conjunction with four other DNA polymerases. Being the primary holoenzyme involved in replication activity, the DNA Pol III holoenzyme also has proofreading capabilities that corrects replication mistakes by means of exonuclease activity reading 3'→5' and synthesizing 5'→3'. DNA Pol III is a component of the replisome, which is located at the replication fork.
Okazaki fragments are short sequences of DNA nucleotides which are synthesized discontinuously and later linked together by the enzyme DNA ligase to create the lagging strand during DNA replication. They were discovered in the 1960s by the Japanese molecular biologists Reiji and Tsuneko Okazaki, along with the help of some of their colleagues.
The replisome is a complex molecular machine that carries out replication of DNA. The replisome first unwinds double stranded DNA into two single strands. For each of the resulting single strands, a new complementary sequence of DNA is synthesized. The total result is formation of two new double stranded DNA sequences that are exact copies of the original double stranded DNA sequence.
RNA-dependent RNA polymerase (RdRp) or RNA replicase is an enzyme that catalyzes the replication of RNA from an RNA template. Specifically, it catalyzes synthesis of the RNA strand complementary to a given RNA template. This is in contrast to typical DNA-dependent RNA polymerases, which all organisms use to catalyze the transcription of RNA from a DNA template.
Eukaryotic DNA replication is a conserved mechanism that restricts DNA replication to once per cell cycle. Eukaryotic DNA replication of chromosomal DNA is central for the duplication of a cell and is necessary for the maintenance of the eukaryotic genome.
DNA primase large subunit is an enzyme that in humans is encoded by the PRIM2 gene.
T7 DNA polymerase is an enzyme used during the DNA replication of the T7 bacteriophage. During this process, the DNA polymerase “reads” existing DNA strands and creates two new strands that match the existing ones. The T7 DNA polymerase requires a host factor, E. coli thioredoxin, in order to carry out its function. This helps stabilize the binding of the necessary protein to the primer-template to improve processivity by more than 100-fold, which is a feature unique to this enzyme. It is a member of the Family A DNA polymerases, which include E. coli DNA polymerase I and Taq DNA polymerase.
A circular chromosome is a chromosome in bacteria, archaea, mitochondria, and chloroplasts, in the form of a molecule of circular DNA, unlike the linear chromosome of most eukaryotes.
PrimPol is a protein encoded by the PRIMPOL gene in humans. PrimPol is a eukaryotic protein with both DNA polymerase and DNA Primase activities involved in translesion DNA synthesis. It is the first eukaryotic protein to be identified with priming activity using deoxyribonucleotides. It is also the first protein identified in the mitochondria to have translesion DNA synthesis activities.
DNA primase small subunit is an enzyme that in humans is encoded by the PRIM1 gene.
DNA polymerase alpha catalytic subunit is an enzyme that in humans is encoded by the POLA1 gene.
A helicase–primase complex is a complex of enzymes including DNA helicase and DNA primase. A helicase-primase associated factor protein may also be present.