RNA polymerase II

RNA polymerase II (RNAP II and Pol II) is a multiprotein complex that transcribes DNA into precursors of messenger RNA (mRNA) and most small nuclear RNA (snRNA) and microRNA. It is one of the three RNAP enzymes found in the nucleus of eukaryotic cells. A 550 kDa complex of 12 subunits, RNAP II is the most studied type of RNA polymerase. A wide range of transcription factors are required for it to bind to upstream gene promoters and begin transcription.

Discovery
Early studies suggested a minimum of two RNAPs: one which synthesized rRNA in the nucleolus, and one which synthesized other RNA in the nucleoplasm, part of the nucleus but outside the nucleolus. In 1969, biochemists Robert G. Roeder and William Rutter discovered there are total three distinct nuclear RNA polymerases, an additional RNAP that was responsible for transcription of some kind of RNA in the nucleoplasm. The finding was obtained by the use of ion-exchange chromatography via DEAE coated Sephadex beads. The technique separated the enzymes by the order of the corresponding elutions, Ι,ΙΙ,ΙΙΙ, by increasing the concentration of ammonium sulfate. The enzymes were named according to the order of the elutions, RNAP I, RNAP II, RNAP IΙI. This discovery demonstrated that there was an additional enzyme present in the nucleoplasm, which allowed for the differentiation between RNAP II and RNAP III.

RNA polymerase II (RNAP2) undergoes regulated transcriptional pausing during early elongation. Various studies has shown that disruption of transcription elongation is implicated in cancer, neurodegeneration, HIV latency etc.

Subunits
The eukaryotic core RNA polymerase II was first purified using transcription assays. The purified enzyme has typically 10–12 subunits (12 in humans and yeast) and is incapable of specific promoter recognition. Many subunit-subunit interactions are known.


 * DNA-directed RNA polymerase II subunit RPB1 – an enzyme that in humans is encoded by the POLR2A gene and in yeast is encoded by RPO21. RPB1 is the largest subunit of RNA polymerase II. It contains a carboxy terminal domain (CTD) composed of up to 52 heptapeptide repeats (YSPTSPS) that are essential for polymerase activity. The CTD was first discovered in the laboratory of C.J. Ingles at the University of Toronto and by JL Corden at Johns Hopkins University. In combination with several other polymerase subunits, the RPB1 subunit forms the DNA binding domain of the polymerase, a groove in which the DNA template is transcribed into RNA. It strongly interacts with RPB8.
 * RPB2 (POLR2B) – the second-largest subunit that in combination with at least two other polymerase subunits forms a structure within the polymerase that maintains contact in the active site of the enzyme between the DNA template and the newly synthesized RNA.
 * RPB3 (POLR2C) – the third-largest subunit. Exists as a heterodimer with another polymerase subunit, POLR2J forming a core subassembly. RPB3 strongly interacts with RPB1-5, 7, 10–12.
 * RNA polymerase II subunit B4 (RPB4) – encoded by the POLR2D gene is the fourth-largest subunit and may have a stress protective role.
 * RPB5 – In humans is encoded by the POLR2E gene. Two molecules of this subunit are present in each RNA polymerase II. RPB5 strongly interacts with RPB1, RPB3, and RPB6.
 * RPB6 (POLR2F) – forms a structure with at least two other subunits that stabilizes the transcribing polymerase on the DNA template.
 * RPB7 – encoded by POLR2G and may play a role in regulating polymerase function. RPB7 interacts strongly with RPB1 and RPB5.
 * RPB8 (POLR2H) – interacts with subunits RPB1-3, 5, and 7.
 * RPB9 – The groove in which the DNA template is transcribed into RNA is composed of RPB9 (POLR2I) and RPB1.
 * RPB10 – the product of gene POLR2L. It interacts with RPB1-3 and 5, and strongly with RPB3.
 * RPB11 – the RPB11 subunit is itself composed of three subunits in humans: POLR2J (RPB11-a), POLR2J2 (RPB11-b), and POLR2J3 (RPB11-c).
 * RPB12 – Also interacts with RPB3 is RPB12 (POLR2K).

Assembly
RPB3 is involved in RNA polymerase II assembly. A subcomplex of RPB2 and RPB3 appears soon after subunit synthesis. This complex subsequently interacts with RPB1. RPB3, RPB5, and RPB7 interact with themselves to form homodimers, and RPB3 and RPB5 together are able to contact all of the other RPB subunits, except RPB9. Only RPB1 strongly binds to RPB5. The RPB1 subunit also contacts RPB7, RPB10, and more weakly but most efficiently with RPB8. Once RPB1 enters the complex, other subunits such as RPB5 and RPB7 can enter, where RPB5 binds to RPB6 and RPB8 and RPB3 brings in RPB10, RPB 11, and RPB12. RPB4 and RPB9 may enter once most of the complex is assembled. RPB4 forms a complex with RPB7.

Kinetics
Enzymes can catalyze up to several million reactions per second. Enzyme rates depend on solution conditions and substrate concentration. Like other enzymes POLR2 has a saturation curve and a maximum velocity (Vmax). It has a Km (substrate concentration required for one-half Vmax) and a kcat (the number of substrate molecules handled by one active site per second). The specificity constant is given by kcat/Km. The theoretical maximum for the specificity constant is the diffusion limit of about 108 to 109 (M−1s−1), where every collision of the enzyme with its substrate results in catalysis. In yeast, mutation in the Trigger-Loop domain of the largest subunit can change the kinetics of the enzyme.

Bacterial RNA polymerase, a relative of RNA Polymerase II, switches between inactivated and activated states by translocating back and forth along the DNA. Concentrations of [NTP]eq = 10 μM GTP, 10 μM UTP, 5 μM ATP and 2.5 μM CTP, produce a mean elongation rate, turnover number, of ~1 bp (NTP)−1 for bacterial RNAP, a relative of RNA polymerase II.

RNA polymerase II undergoes extensive co-transcriptional pausing during transcription elongation. This pausing is especially pronounced at nucleosomes, and arises in part through the polymerase entering a transcriptionally incompetent backtracked state. The duration of these pauses ranges from seconds to minutes or longer, and exit from long-lived pauses can be promoted by elongation factors such as TFIIS. In turn, the transcription rate influences whether the histones of transcribed nucleosomes are evicted from chromatin, or reinserted behind the transcribing polymerase.

Alpha-Amanitin
RNA polymerase II is inhibited by α-Amanitin and other amatoxins. α-Amanitin is a highly poisonous substance found in many mushrooms. The mushroom poison has different effects on each of the RNA Polymerases: I, II, III. RNAP I is completely unresponsive to the substance and will function normally while RNAP III has a moderate sensitivity. RNAP II, however, is completely inhibited by the toxin. Alpha-Amanitin inhibits RNAP II by strong interactions in the enzyme's "funnel", "cleft", and the key "bridge α-helix" regions of the RPB-1 subunit.

Holoenzyme
RNA polymerase II holoenzyme is a form of eukaryotic RNA polymerase II that is recruited to the promoters of protein-coding genes in living cells. It consists of RNA polymerase II, a subset of general transcription factors, and regulatory proteins known as SRB proteins.

Part of the assembly of the holoenzyme is referred to as the preinitiation complex, because its assembly takes place on the gene promoter before the initiation of transcription. The mediator complex acts as a bridge between RNA polymerase II and the transcription factors.

Control by chromatin structure
This is an outline of an example mechanism of yeast cells by which chromatin structure and histone post-translational modification help regulate and record the transcription of genes by RNA polymerase II.

This pathway gives examples of regulation at these points of transcription:
 * Pre-initiation (promotion by Bre1, histone modification)
 * Initiation (promotion by TFIIH, Pol II modification and promotion by COMPASS, histone modification)
 * Elongation (promotion by Set2, Histone Modification)

This refers to various stages of the process as regulatory steps. It has not been proven that they are used for regulation, but is very likely they are.

RNA Pol II elongation promoters can be summarised in 3 classes.
 * 1) Drug/sequence-dependent arrest-affected factors (Various interfering proteins)
 * 2) Chromatin structure-oriented factors (Histone posttranscriptional modifiers, e.g., Histone Methyltransferases)
 * 3) RNA Pol II catalysis-improving factors (Various interfering proteins and Pol II cofactors; see RNA polymerase II).

Transcription mechanisms
(HMTs (Histone MethylTransferases)): COMPASS§† – (COMplex of Proteins ASsociated with Set1) – Methylates lysine 4 of histone H3: Is responsible of repression/silencing of transcription. A normal part of cell growth and transcription regulation within RNAP II. (interesting irrelevant example: Dot1*‡ – Methylates lysine 79 of histone H3.)
 * Chromatin structure oriented factors:
 * Set2 – Methylates lysine 36 of histone H3: Set2 is involved in regulation transcription elongation through its direct contact with the CTD.
 * Bre1 – Ubiquinates (adds ubiquitin to) lysine 123 of histone H2B. Associated with pre-initiation and allowing RNA Pol II binding.

C-terminal Domain
The C-terminus of RPB1 is appended to form the C-terminal domain (CTD). The carboxy-terminal domain of RNA polymerase II typically consists of up to 52 repeats of the sequence Tyr-Ser-Pro-Thr-Ser-Pro-Ser. The domain stretches from the core of the RNAPII enzyme to the exit channel, this placement is effective due to its inductions of "RNA processing reactions, through direct or indirect interactions with components of the RNA processing machinery". The CTD domain does not exist in RNA Polymerase I or RNA Polymerase III. The RNA Polymerase CTD was discovered first in the laboratory of C. J. Ingles at the University of Toronto and also in the laboratory of J Corden at Johns Hopkins University during the processes of sequencing the DNA encoding the RPB1 subunit of RNA polymerase from yeast and mice respectively. Other proteins often bind the C-terminal domain of RNA polymerase in order to activate polymerase activity. It is the protein domain that is involved in the initiation of transcription, the capping of the RNA transcript, and attachment to the spliceosome for RNA splicing.

Phosphorylation of the CTD
RNA Polymerase II exists in two forms unphosphorylated and phosphorylated, IIA and IIO respectively. The transition between the two forms facilitates different functions for transcription. The phosphorylation of CTD is catalyzed by one of the six general transcription factors, TFIIH. TFIIH serves two purposes: one is to unwind the DNA at the transcription start site and the other is to phosphorylate. The form polymerase IIA joins the preinitiation complex, this is suggested because IIA binds with higher affinity to the TBP (TATA-box binding protein), the subunit of the general transcription factor TFIID, than polymerase IIO form. The form polymerase IIO facilitates the elongation of the RNA chain. The method for the elongation initiation is done by the phosphorylation of serine at position 5 (Ser5), via TFIIH. The newly phosphorylated Ser5 recruits enzymes to cap the 5' end of the newly synthesized RNA and the "3' processing factors to poly(A) sites". Once the second serine is phosphorylated, Ser2, elongation is activated. In order to terminate elongation dephosphorylation must occur. Once the domain is completely dephosphorylated the RNAP II enzyme is "recycled" and catalyzes the same process with another initiation site.

Transcription coupled recombinational repair
Oxidative DNA damage may block RNA polymerase II transcription and cause strand breaks. An RNA templated transcription-associated recombination process has been described that can protect against DNA damage. During the G1/G0 stages of the cell cycle, cells exhibit assembly of homologous recombination factors at double-strand breaks within actively transcribed regions. It appears that transcription is coupled to repair of DNA double-strand breaks by RNA templated homologous recombination. This repair process efficiently and accurately rejoins double-strand breaks in genes being actively transcribed by RNA polymerase II.