Ubiquitin ligase

A ubiquitin ligase (also called an E3 ubiquitin ligase) is a protein that recruits an E2 ubiquitin-conjugating enzyme that has been loaded with ubiquitin, recognizes a protein substrate, and assists or directly catalyzes the transfer of ubiquitin from the E2 to the protein substrate. In simple and more general terms, the ligase enables movement of ubiquitin from a ubiquitin carrier to another thing (the substrate) by some mechanism. The ubiquitin, once it reaches its destination, ends up being attached by an isopeptide bond to a lysine residue, which is part of the target protein. E3 ligases interact with both the target protein and the E2 enzyme, and so impart substrate specificity to the E2. Commonly, E3s polyubiquitinate their substrate with Lys48-linked chains of ubiquitin, targeting the substrate for destruction by the proteasome. However, many other types of linkages are possible and alter a protein's activity, interactions, or localization. Ubiquitination by E3 ligases regulates diverse areas such as cell trafficking, DNA repair, and signaling and is of profound importance in cell biology. E3 ligases are also key players in cell cycle control, mediating the degradation of cyclins, as well as cyclin dependent kinase inhibitor proteins. The human genome encodes over 600 putative E3 ligases, allowing for tremendous diversity in substrates.

Ubiquitination system
The ubiquitin ligase is referred to as an E3, and operates in conjunction with an E1 ubiquitin-activating enzyme and an E2 ubiquitin-conjugating enzyme. There is one major E1 enzyme, shared by all ubiquitin ligases, that uses ATP to activate ubiquitin for conjugation and transfers it to an E2 enzyme. The E2 enzyme interacts with a specific E3 partner and transfers the ubiquitin to the target protein. The E3, which may be a multi-protein complex, is, in general, responsible for targeting ubiquitination to specific substrate proteins.

The ubiquitylation reaction proceeds in three or four steps depending on the mechanism of action of the E3 ubiquitin ligase. In the conserved first step, an E1 cysteine residue attacks the ATP-activated C-terminal glycine on ubiquitin, resulting in a thioester Ub-S-E1 complex. The energy from ATP and diphosphate hydrolysis drives the formation of this reactive thioester, and subsequent steps are thermoneutral. Next, a transthiolation reaction occurs, in which an E2 cysteine residue attacks and replaces the E1. HECT domain type E3 ligases will have one more transthiolation reaction to transfer the ubiquitin molecule onto the E3, whereas the much more common RING finger domain type ligases transfer ubiquitin directly from E2 to the substrate. The final step in the first ubiquitylation event is an attack from the target protein lysine amine group, which will remove the cysteine, and form a stable isopeptide bond. One notable exception to this is p21 protein, which appears to be ubiquitylated using its N-terminal amine, thus forming a peptide bond with ubiquitin.

Ubiquitin ligase families
Humans have an estimated 500-1000 E3 ligases, which impart substrate specificity onto the E1 and E2. The E3 ligases are classified into four families: HECT, RING-finger, U-box, and PHD-finger. The RING-finger E3 ligases are the largest family and contain ligases such as the anaphase-promoting complex (APC) and the SCF complex (Skp1-Cullin-F-box protein complex). SCF complexes consist of four proteins: Rbx1, Cul1, Skp1, which are invariant among SCF complexes, and an F-box protein, which varies. Around 70 human F-box proteins have been identified. F-box proteins contain an F-box, which binds the rest of the SCF complex, and a substrate binding domain, which gives the E3 its substrate specificity.

Mono- and poly-ubiquitylation
Ubiquitin signaling relies on the diversity of ubiquitin tags for the specificity of its message. A protein can be tagged with a single ubiquitin molecule (monoubiquitylation), or variety of different chains of ubiquitin molecules (polyubiquitylation). E3 ubiquitin ligases catalyze polyubiquitination events much in the same way as the single ubiquitylation mechanism, using instead a lysine residue from a ubiquitin molecule currently attached to substrate protein to attack the C-terminus of a new ubiquitin molecule. For example, a common 4-ubiquitin tag, linked through the lysine at position 48 (K48) recruits the tagged protein to the proteasome, and subsequent degradation. However, all seven of the ubiquitin lysine residues (K6, K11, K27, K29, K33, K48, and K63), as well as the N-terminal methionine are used in chains in vivo.

Monoubiquitination has been linked to membrane protein endocytosis pathways. For example, phosphorylation of the Tyrosine at position 1045 in the Epidermal Growth Factor Receptor (EGFR) can recruit the RING type E3 ligase c-Cbl, via an SH2 domain. C-Cbl monoubiquitylates EGFR, signaling for its internalization and trafficking to the lysosome.

Monoubiquitination also can regulate cytosolic protein localization. For example, the E3 ligase MDM2 ubiquitylates p53 either for degradation (K48 polyubiquitin chain), or for nuclear export (monoubiquitylation). These events occur in a concentration dependent fashion, suggesting that modulating E3 ligase concentration is a cellular regulatory strategy for controlling protein homeostasis and localization.

Substrate recognition
Ubiquitin ligases are the final, and potentially the most important determinant of substrate specificity in ubiquitination of proteins. The ligases must simultaneously distinguish their protein substrate from thousands of other proteins in the cell, and from other (ubiquitination-inactive) forms of the same protein. This can be achieved by different mechanisms, most of which involve recognition of degrons: specific short amino acid sequences or chemical motifs on the substrate.

N-degrons
Proteolytic cleavage can lead to exposure of residues at the N-terminus of a protein. According to the N-end rule, different N-terminal amino acids (or N-degrons) are recognized to a different extent by their appropriate ubiquitin ligase (N-recognin), influencing the half-life of the protein. For instance, positively charged (Arg, Lys, His) and bulky hydrophobic amino acids (Phe, Trp, Tyr, Leu, Ile) are recognized preferentially and thus considered destabilizing degrons since they allow faster degradation of their proteins.

Phosphodegrons
A degron can be converted into its active form by a post-translational modification such as phosphorylation of a tyrosine, serine or threonine residue. In this case, the ubiquitin ligase exclusively recognizes the phosphorylated version of the substrate due to stabilization within the binding site. For example, FBW7, the F-box substrate recognition unit of an SCFFBW7ubiquitin ligase, stabilizes a phosphorylated substrate by hydrogen binding its arginine residues to the phosphate, as shown in the figure to the right. In absence of the phosphate, residues of FBW7 repel the substrate.

Oxygen and small molecule dependent degrons
The presence of oxygen or other small molecules can influence degron recognition. The von Hippel-Lindau (VHL) protein (substrate recognition part of a specific E3 ligase), for instance, recognizes the hypoxia-inducible factor alpha (HIF-α) only under normal oxygen conditions, when its proline is hydroxylated. Under hypoxia, on the other hand, HIF-a is not hydroxylated, evades ubiquitination and thus operates in the cell at higher concentrations which can initiate transcriptional response to hypoxia. Another example of small molecule control of protein degradation is phytohormone auxin in plants. Auxin binds to TIR1 (the substrate recognition domain of SCFTIR1ubiquitin ligase) increasing the affinity of TIR1 for its substrates (transcriptional repressors: Aux/IAA), and promoting their degradation.

Misfolded and sugar degrons
In addition to recognizing amino acids, ubiquitin ligases can also detect unusual features on substrates that serve as signals for their destruction. For example, San1 (Sir antagonist 1), a nuclear protein quality control in yeast, has a disordered substrate binding domain, which allows it to bind to hydrophobic domains of misfolded proteins. Misfolded or excess unassembled glycoproteins of the ERAD pathway, on the other hand, are recognized by Fbs1 and Fbs2, mammalian F-box proteins of E3 ligases SCFFbs1and SCFFbs2. These recognition domains have small hydrophobic pockets allowing them to bind high-mannose containing glycans.

Structural motifs
In addition to linear degrons, the E3 ligase can in some cases also recognize structural motifs on the substrate. In this case, the 3D motif can allow the substrate to directly relate its biochemical function to ubiquitination. This relation can be demonstrated with TRF1 protein (regulator of human telomere length), which is recognized by its corresponding E3 ligase (FBXO4) via an intermolecular beta sheet interaction. TRF1 cannot be ubiquinated while telomere bound, likely because the same TRF1 domain that binds to its E3 ligase also binds to telomeres.

Disease relevance
E3 ubiquitin ligases regulate homeostasis, cell cycle, and DNA repair pathways, and as a result, a number of these proteins are involved in a variety of cancers, including famously MDM2, BRCA1, and Von Hippel-Lindau tumor suppressor. For example, a mutation of MDM2 has been found in stomach cancer, renal cell carcinoma, and liver cancer (amongst others) to deregulate MDM2 concentrations by increasing its promoter’s affinity for the Sp1 transcription factor, causing increased transcription of MDM2 mRNA. Several proteomics-based experimental techniques are available for identifying E3 ubiquitin ligase-substrate pairs, such as proximity-dependent biotin identification (BioID), ubiquitin ligase-substrate trapping, and tandem ubiquitin-binding entities (TUBEs).

Examples

 * A RING (Really Interesting New Gene) domain binds the E2 conjugase and might be found to mediate enzymatic activity in the E2-E3 complex
 * An F-box domain (as in the SCF complex) binds the ubiquitinated substrate. (e.g., Cdc 4, which binds the target protein Sic1; Grr1, which binds Cln).
 * A HECT domain, which is involved in the transfer of ubiquitin from the E2 to the substrate.

Individual E3 ubiquitin ligases

 * E3A
 * mdm2
 * Anaphase-promoting complex (APC)
 * UBR5 (EDD1)
 * SOCS/ BC-box/ eloBC/ CUL5/ RING
 * LNXp80
 * CBX4, CBLL1
 * HACE1
 * HECTD1, HECTD2, HECTD3, HECTD4
 * HECW1, HECW2
 * HERC1, HERC2, HERC3, HERC4, HERC5, HERC6
 * HUWE1, ITCH
 * NEDD4, NEDD4L
 * PPIL2
 * PRPF19
 * PIAS1, PIAS2, PIAS3, PIAS4
 * RANBP2
 * RNF4
 * RBX1
 * SMURF1, SMURF2
 * STUB1
 * TOPORS
 * TRIP12
 * UBE3A, UBE3B, UBE3C, UBE3D
 * UBE4A, UBE4B
 * UBOX5
 * UBR5
 * VHL
 * WWP1, WWP2
 * Parkin
 * MKRN1