The positive transcription elongation factor, P-TEFb, is a multiprotein complex that plays an essential role in the regulation of transcription by RNA polymerase II (Pol II) in eukaryotes. [1] Immediately following initiation Pol II becomes trapped in promoter proximal paused positions on the majority of human genes (Figure 1). [2] [3] P-TEFb is a cyclin dependent kinase that can phosphorylate the DRB sensitivity inducing factor (DSIF) [4] and negative elongation factor (NELF), [5] as well as the carboxyl terminal domain of the large subunit of Pol II [6] and this causes the transition into productive elongation leading to the synthesis of mRNAs. P-TEFb is regulated in part by a reversible association with the 7SK snRNP. [7] Treatment of cells with the P-TEFb inhibitors DRB or flavopidirol leads to loss of mRNA production and ultimately cell death. [6] [8]
P-TEFb was identified and purified as a factor needed for the generation of long run-off transcripts using an in vitro transcription system derived from Drosophila cells. [9] It is a cyclin dependent kinase containing the catalytic subunit, Cdk9, and a regulatory subunit, cyclin T in Drosophila. [10] In humans there are multiple forms of P-TEFb which contain Cdk9 and one of several cyclin subunits, cyclin T1, T2, and K. [11] [12] P-TEFb associates with other factors including the bromodomain protein BRD4, [13] and is found associated with a large complex of proteins called the super elongation complex. [14] [15] Importantly, for the AIDS virus, HIV, P-TEFb is targeted by the HIV Tat protein [16] which bypasses normal cellular P-TEFb control and directly brings P-TEFb to the promoter proximal paused polymerase in the HIV genome. [17] [18]
The structures of human P-TEFb containing Cdk9 and cyclin T1 and the HIV Tat•P-TEFb complex have been solved using X-ray crystallography. The first structure solved demonstrated that the two subunits were arranged as has been found in other cyclin dependent kinases. [19] Three amino acid substitutions were inadvertently introduced in the subunits used for the original structure and a subsequent structure determination using the correct sequences demonstrated the same overall structure except for a few significant changes around the active site. [20] The structure of HIV Tat bound to P-TEFb demonstrated that the viral protein forms extensive contacts with the cyclin T1 subunit (Figure 2). [20]
Because of its central role in controlling eukaryotic gene expression, P-TEFb is subject to stringent regulation at the level of transcription of the genes encoding the subunits, translation of the subunit mRNAs, turnover of the subunits, and also by an unusual mechanism involving the 7SK snRNP. [7] As shown in Figure 3 P-TEFb is held in the 7SK snRNP by the double stranded RNA binding protein HEXIM (HEXIM1 or HEXIM2 in humans). HEXIM bound to 7SK RNA or any double stranded RNA binds to P-TEFb and inhibits the kinase activity. [21] [22] Two other proteins are always found associated with 7SK RNA. The methyl phosphase capping enzyme MEPCE puts a methyl group on the gamma phosphate of the first nucleotide of the 7SK RNA [23] and the La related protein LARP7 binds to the 3' end of 7SK. [24] [25] When P-TEFb is extracted from the 7SK snRNP, 7SK RNA undergoes a conformation change, HEXIM is ejected and hnRNPs take the place of the factors removed. [7] The re-sequestration of P-TEFb requires another rearrangement of the RNA, binding of HEXIM and then P-TEFb. In rapidly growing cells the 7SK snRNP is the predominant form of P-TEFb. For review. [26]
In molecular biology 7SK is an abundant small nuclear RNA found in metazoans. It plays a role in regulating transcription by controlling the positive transcription elongation factor P-TEFb. 7SK is found in a small nuclear ribonucleoprotein complex (snRNP) with a number of other proteins that regulate the stability and function of the complex.
Cyclin-dependent kinase 9 or CDK9 is a cyclin-dependent kinase associated with P-TEFb.
Cyclin-T1 is a protein that in humans is encoded by the CCNT1 gene.
DNA-directed RNA polymerases I, II, and III subunit RPABC1 is a protein that in humans is encoded by the POLR2E gene.
DNA-directed RNA polymerase II subunit RPB2 is an enzyme that in humans is encoded by the POLR2B gene.
DNA-directed RNA polymerase II subunit RPB7 is an enzyme that in humans is encoded by the POLR2G gene.
DNA-directed RNA polymerases I, II, and III subunit RPABC3 is a protein that in humans is encoded by the POLR2H gene.
DNA-directed RNA polymerases I, II, and III subunit RPABC2 is a protein that in humans is encoded by the POLR2F gene.
DNA-directed RNA polymerase II subunit RPB4 is an enzyme that in humans is encoded by the POLR2D gene.
DNA-directed RNA polymerase II subunit RPB9 is an enzyme that in humans is encoded by the POLR2I gene.
Transcription elongation factor SPT5 is a protein that in humans is encoded by the SUPT5H gene.
General transcription factor IIH subunit 4 is a protein that in humans is encoded by the GTF2H4 gene.
Protein HEXIM1 is a protein that in humans is encoded by the HEXIM1 gene.
Cyclin-T2 is a protein that in humans is encoded by the CCNT2 gene.
CTD small phosphatase-like protein is an enzyme that in humans is encoded by the CTDSPL gene.
Cyclin-K is a protein that in humans is encoded by the CCNK gene.
Protein HEXIM2 is a protein that in humans is encoded by the HEXIM2 gene.
RNA polymerase II holoenzyme is a form of eukaryotic RNA polymerase II that is recruited to the promoters of protein-coding genes in living cells. It consists of RNA polymerase II, a subset of general transcription factors, and regulatory proteins known as SRB proteins.
DSIF is a protein complex that can either negatively or positively affect transcription by RNA polymerase II. It can interact with the negative elongation factor (NELF) to promote the stalling of Pol II at some genes, which is called promoter proximal pausing. The pause occurs soon after initiation, once 20–60 nucleotides have been transcribed. This stalling is relieved by positive transcription elongation factor b (P-TEFb) and Pol II enters productive elongation to resume synthesis till finish. In humans, DSIF is composed of hSPT4 and hSPT5. hSPT5 has a direct role in mRNA capping which occurs while the elongation is paused.
In molecular biology, Tat is a protein that is encoded for by the tat gene in HIV-1. Tat is a regulatory protein that drastically enhances the efficiency of viral transcription. Tat stands for "Trans-Activator of Transcription". The protein consists of between 86 and 101 amino acids depending on the subtype. Tat vastly increases the level of transcription of the HIV dsDNA. Before Tat is present, a small number of RNA transcripts will be made, which allow the Tat protein to be produced. Tat then binds to cellular factors and mediates their phosphorylation, resulting in increased transcription of all HIV genes, providing a positive feedback cycle. This in turn allows HIV to have an explosive response once a threshold amount of Tat is produced, a useful tool for defeating the body's response.