Sigma factor

A sigma factor (σ factor or specificity factor) is a protein needed for initiation of transcription in bacteria. It is a bacterial transcription initiation factor that enables specific binding of RNA polymerase (RNAP) to gene promoters. It is homologous to archaeal transcription factor B and to eukaryotic factor TFIIB. The specific sigma factor used to initiate transcription of a given gene will vary, depending on the gene and on the environmental signals needed to initiate transcription of that gene. Selection of promoters by RNA polymerase is dependent on the sigma factor that associates with it. They are also found in plant chloroplasts as a part of the bacteria-like plastid-encoded polymerase (PEP).

The sigma factor, together with RNA polymerase, is known as the RNA polymerase holoenzyme. Every molecule of RNA polymerase holoenzyme contains exactly one sigma factor subunit, which in the model bacterium Escherichia coli is one of those listed below. The number of sigma factors varies between bacterial species. E. coli has seven sigma factors. Sigma factors are distinguished by their characteristic molecular weights. For example, σ70 is the sigma factor with a molecular weight of 70 kDa.

The sigma factor in the RNA polymerase holoenzyme complex is required for the initiation of transcription, although once that stage is finished, it is dissociated from the complex and the RNAP continues elongation on its own.

Specialized sigma factors
Different sigma factors are utilized under different environmental conditions. These specialized sigma factors bind the promoters of genes appropriate to the environmental conditions, increasing the transcription of those genes.

Sigma factors in E. coli:
 * σ70(RpoD) – σA – the "housekeeping" sigma factor or also called as primary sigma factor (Group 1), transcribes most genes in growing cells. Every cell has a "housekeeping" sigma factor that keeps essential genes and pathways operating. In the case of E. coli and other gram-negative rod-shaped bacteria, the "housekeeping" sigma factor is σ70.  Genes recognized by σ70 all contain similar promoter consensus sequences consisting of two parts.  Relative to the DNA base corresponding to the start of the RNA transcript, the consensus promoter sequences are characteristically centered at 10 and 35 nucleotides before the start of transcription (−10 and −35).
 * σ19 (FecI) – the ferric citrate sigma factor, regulates the fec gene for iron transport and metabolism
 * σ24 (RpoE) – extreme heat stress response and the extracellular proteins sigma factor
 * σ28 (RpoF/FliA) – the flagellar synthesis and chemotaxis sigma factor
 * σ32 (RpoH) – the heat shock sigma factor, it is turned on when the bacteria are exposed to heat. Due to the higher expression, the factor will bind with a high probability to the polymerase-core-enzyme. Doing so, other heatshock proteins are expressed, which enable the cell to survive higher temperatures. Some of the enzymes that are expressed upon activation of σ32 are chaperones, proteases and DNA-repair enzymes.
 * σ38 (RpoS) – the starvation/stationary phase sigma factor
 * σ54 (RpoN) – the nitrogen-limitation sigma factor

There are also anti-sigma factors that inhibit the function of sigma factors and anti-anti-sigma factors that restore sigma factor function.

Structure


By sequence similarity, most sigma factors are σ70-like. They have four main regions (domains) that are generally conserved:

N-terminus - C-terminus 1.1   2    3    4

The regions are further subdivided. For example, region 2 includes 1.2 and 2.1 through 2.4.

Domain 1.1 is found only in "primary sigma factors" (RpoD, RpoS in E.coli; "Group 1"). It is involved in ensuring the sigma factor will only bind the promoter when it is complexed with the RNA polymerase. Domains 2-4 each interact with specific promoter elements and with RNAP. Region 2.4 recognizes and binds to the promoter −10 element (called the "Pribnow box"). Region 4.2 recognizes and binds to the promoter −35 element.

Not every sigma factor of the σ70 family contains all the domains. Group 2, which includes RpoS, is very similar to Group 1 but lacks domain 1. Group 3 also lacks domain 1, and includes σ28. Group 4, also known as the Extracytoplasmic Function (ECF) group, lack both σ1.1 and σ3. RpoE is a member.

Other known sigma factors are of the σ54/RpoN type. They are functional sigma factors, but they have significantly different primary amino acid sequences.

Retention during transcription elongation
The core RNA polymerase (consisting of 2 alpha (α), 1 beta (β), 1 beta-prime (β'), and 1 omega (ω) subunits) binds a sigma factor to form a complex called the RNA polymerase holoenzyme. It was previously believed that the RNA polymerase holoenzyme initiates transcription, while the core RNA polymerase alone synthesizes RNA. Thus, the accepted view was that sigma factor must dissociate upon transition from transcription initiation to transcription elongation (this transition is called "promoter escape"). This view was based on analysis of purified complexes of RNA polymerase stalled at initiation and at elongation. Finally, structural models of RNA polymerase complexes predicted that, as the growing RNA product becomes longer than ~15 nucleotides, sigma must be "pushed out" of the holoenzyme, since there is a steric clash between RNA and a sigma domain. However, σ70 can remain attached in complex with the core RNA polymerase in early elongation and sometimes throughout elongation. Indeed, the phenomenon of promoter-proximal pausing indicates that sigma plays roles during early elongation. All studies are consistent with the assumption that promoter escape reduces the lifetime of the sigma-core interaction from very long at initiation (too long to be measured in a typical biochemical experiment) to a shorter, measurable lifetime upon transition to elongation.

Sigma cycle
It had long been thought that the sigma factor obligatorily leaves the core enzyme once it has initiated transcription, allowing it to link to another core enzyme and initiate transcription at another site. Thus, the sigma factor would cycle from one core to another. However, fluorescence resonance energy transfer was used to show that the sigma factor does not obligatorily leave the core. Instead, it changes its binding with the core during initiation and elongation. Therefore, the sigma factor cycles between a strongly bound state during initiation and a weakly bound state during elongation.

Sigma factor competition
The number of RNAPs in bacterial cells (e.g., E. coli) have been shown to be smaller than the number of sigma factors. Consequently, if a certain sigma factor is overexpressed, not only will increase the expression levels of genes whose promoters have preference for that sigma factor, but it will also reduce the probability that genes with promoters with preference for other sigma factors.

Meanwhile, transcription initiation has two major rate limiting steps: the closed and the open complex formation. However, only the dynamics of the first step depends on the concentration of sigma factors. Interestingly, the fastest is the closed complex formation relative to the open complex formation, the less responsive is a promoter to changes in sigma factors’ concentration (see for a model and empirical data of this phenomenon).

Genes with dual sigma factor preference
While most genes of E. coli can be recognized by an RNAP with one and only one type of sigma factor (e.g. sigma 70), a few genes (~ 5%) have what is called a “dual sigma factor preference”, that is, they can respond to two different sigma factors, as reported in RegulonDB. The most common ones are those promoters that can respond to both sigma 70 and to sigma 38 (iIlustrated in the figure). Studies of the dynamics of these genes showed that when the cells enter stationary growth they are almost as induced as those genes that have preference for σ38 alone. This induction level was shown to be predictable from their promoter sequence. A model of their dynamics is shown in the figure. In the future, these promoters may become useful tools in synthetic genetic constructs in E. coli.