User:Barokmusiklvr/sandbox

HAT families
HATs are traditionally divided into two different classes based on their subcellular localization. Type A HATs are located in the nucleus and are involved in the regulation of gene expression through acetylation of nucleosomal histones in the context of chromatin. They contain a bromodomain, which helps them recognize and bind to acetylated lysine residues on histone substrates. Gcn5, p300/CBP, and TAFII250 are some examples of type A HATs that cooperate with activators to enhance transcription. Type B HATs are located in the cytoplasm and are responsible for acetylating newly synthesized histones prior to their assembly into nucleosomes. These HATs lack a bromodomain, as their task is to recognize newly synthesized core histones, which are unacetylated. The acetyl groups added by type B HATs to the histones are removed by HDACs once they enter the nucleus and are incorporated into chromatin. Hat1 is one of the few known examples of a type B HAT. Despite this historical classification of HATs, some HAT proteins function in multiple complexes or locations and would thus not easily fit into a particular class.



Gcn5-related N-acetyltransferases (GNATs)
HATs can be grouped into several different families based on sequence homology as well as shared structural features and functional roles. The Gcn5-related N-acetyltransferase (GNAT) family includes Gcn5, PCAF, Hat1, Elp3, Hpa2, Hpa3, ATF-2, and Nut1. These HATs are generally characterized by the presence of a bromodomain, and they are found to acetylate lysine residues on histones H2B, H3, and H4. All members of the GNAT family are characterized by up to four conserved motifs (A-D) found within the catalytic HAT domain. This includes the most highly conserved motif A, which contains an Arg/Gln-X-X-Gly-X-Gly/Ala sequence that is important for acetyl-CoA recognition and binding. The C motif is found in most GNATs, but it is not present in the majority of other known HATs. The yeast Gcn5 (general control nonderepressible-5) HAT is one of the best characterized members of this family. It has four functional domains, including an N-terminal domain, a highly conserved catalytic (HAT) domain, an Ada2 interaction domain, and a C-terminal bromodomain. PCAF (p300/CBP-associated factor) and GCN5 are mammalian GNATs that share a high degree of homology throughout their sequences. These proteins have a 400-residue N-terminal region that is absent in yeast Gcn5, but their HAT functions are evolutionarily conserved with respect to the latter. Hat1 was the first HAT protein to be identified. It is responsible for most of the cytoplasmic HAT activity in yeast, and it binds strongly to histone H4 by virtue of its association with an additional subunit, Hat2. Elp3 is an example of a type A HAT found in yeast. It is part of the RNA polymerase II holoenzyme and plays a role in transcriptional elongation.
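The motif A consensus described above (Arg/Gln-X-X-Gly-X-Gly/Ala) can be written as a simple pattern for scanning protein sequences. The following is a minimal sketch in Python using one-letter amino acid codes. The function name find_motif_a and the example peptide are hypothetical and are not taken from any real protein. Because the pattern is so short, it will also match unrelated sequences by chance, so a hit should be treated only as a candidate.

```python
import re

# Motif A consensus from the text: Arg/Gln-X-X-Gly-X-Gly/Ala,
# in one-letter codes (X = any residue). The lookahead lets
# overlapping candidates be reported as well.
MOTIF_A = re.compile(r"(?=([RQ]..G.[GA]))")

def find_motif_a(sequence):
    """Return (1-based start position, matched segment) for each motif A hit."""
    return [(m.start() + 1, m.group(1))
            for m in MOTIF_A.finditer(sequence.upper())]

# Hypothetical peptide, invented purely for illustration.
example = "MKTLQVRGYGTHLMNA"
print(find_motif_a(example))  # [(5, 'QVRGYG')]
```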

MYST HATs
The MYST family of HATs is named after its four founding members MOZ, Ybf2 (Sas3), Sas2, and Tip60. Other important members include Esa1, MOF, MORF, and HBO1. These HATs are typically characterized by the presence of zinc fingers and chromodomains, and they are found to acetylate lysine residues on histones H2A, H3, and H4. Several MYST family proteins contain zinc fingers as well as the highly conserved motif A found among GNATs that facilitates acetyl-CoA binding. A cysteine-rich region located in the N terminus of the HAT domain of MYST proteins is involved in zinc binding, which is essential for HAT activity. Tip60 (Tat-interactive protein, 60 kDa) was the first human MYST family member to exhibit HAT activity. Sas3 found in yeast is a homolog of MOZ (monocytic leukemia zinc finger protein), which is an oncogene found in humans. Esa1 was the first essential HAT to be found in yeast, and MOF is its homolog in fruit flies. The HAT activity of MOF is required for the twofold increased transcription of the male X chromosome (dosage compensation) in flies. Human HBO1 (HAT bound to ORC1) was the first HAT shown to associate with components of the origin recognition complex (ORC). MORF (MOZ-related factor) exhibits very close homology to MOZ throughout its entire length. It contains an N-terminal repression region that decreases its HAT activity in vitro as well as a C-terminal activation domain that is functional in the absence of the HAT domain.

Others
In addition to those that are members of the GNAT and MYST families, there are several other proteins found typically in higher eukaryotes that exhibit HAT activity. These include p300/CBP, nuclear receptor coactivators (e.g. ACTR/SRC-1), TAFII250, TFIIIC, Rtt109, and CLOCK. p300/CBP are metazoan-specific and contain several zinc finger regions, a bromodomain, a catalytic (HAT) domain, and regions that interact with other transcription factors. Importantly, the HAT domain shows no sequence homology to other known HATs, and it is required for p300/CBP to function in transcriptional activation. In addition, these proteins contain several HAT domain motifs (A, B, and D) that are similar to those of the GNATs. They also possess a novel motif E that is homologous to sequences in the HAT domains of GNATs. TFIIIC is one of the general transcription factors involved in RNA polymerase III-mediated transcription. Three components in the human protein have been shown to possess independent HAT activity (hTFIIIC220, hTFIIIC110, and hTFIIIC90). Rtt109 is a fungal-specific HAT that requires association with histone chaperone proteins for activity. The HAT activities of the human TAFII250 and CLOCK coactivators have not been studied as extensively. TAFII250 is one of the TBP-associated factor subunits of TFIID, and it shares a Gly-X-Gly pattern with Gcn5 that is important for HAT activity. CLOCK is a circadian rhythm master regulator that functions with BMAL1 to carry out its HAT activity.

Nuclear receptor coactivators
Three important nuclear receptor coactivators that display HAT activity are SRC-1, ACTR, and TIF-2. Human SRC-1 (steroid receptor coactivator-1) is known to interact with p300/CBP and PCAF, and its HAT domain is located in its C-terminal region. ACTR (also known as RAC3, AIB1, and TRAM-1 in humans) shares significant sequence homology with SRC-1, particularly in the N-terminal and C-terminal (HAT) regions as well as in the receptor and coactivator interaction domains. ACTR also interacts with p300/CBP and PCAF. The former can prevent ACTR from binding to and activating its receptor by acetylating it in its receptor interaction domain. TIF-2 (transcriptional intermediary factor 2; also known as GRIP1) is another nuclear receptor coactivator with HAT activity, and it also interacts with p300/CBP.

A table summarizing the different families of HATs along with their associated members, parent organisms, multisubunit complexes, histone substrates, and structural features is presented below.

Overall structure


HATs are generally characterized by a structurally conserved core region made up of a three-stranded β-sheet followed by a long α-helix parallel to and spanning one side of it. The core region, which corresponds to motifs A, B, and D of the GNAT proteins, is flanked on opposite sides by N- and C-terminal α/β segments that are structurally unique for a given HAT family. Together, the central core and the flanking segments form a cleft above the core, where histone substrates can bind prior to catalysis. While the central core domain (motif A in GNATs) is involved in acetyl-CoA binding and catalysis, the N- and C-terminal segments assist in binding histone substrates. Unique features related to the sequence and/or structure of the N- and C-terminal regions for different HAT families may help to explain some observed differences among HATs in histone substrate specificity. CoA binding has been observed to widen the histone binding groove in the central core by moving the C-terminal segment of Gcn5 outward. In addition, since contacts between CoA and protein facilitate the formation of favorable histone-protein contacts, it is likely that CoA binding precedes histone binding in vivo.

GNAT and MYST families
HATs in the GNAT family are most notably characterized by an approximately 160-residue HAT domain and a C-terminal bromodomain, which binds to acetylated lysine residues. Those in the MYST family have HAT domains that are about 250 residues in length. Many MYST proteins also contain a cysteine-rich, zinc-binding domain within the HAT region in addition to an N-terminal chromodomain, which binds to methylated lysine residues.

On a broader scale, the structures of the catalytic domains of GNAT proteins (Gcn5, PCAF) exhibit a mixed α/β globular fold with a total of five α-helices and six β-strands. The overall topology resembles a vise, with the central core of the protein at the base and the N- and C-terminal segments on the sides.

p300/CBP family
The p300/CBP HATs have larger HAT domains (about 500 residues) than those present in the GNAT and MYST families. They also contain a bromodomain as well as three cysteine/histidine-rich domains that are thought to mediate interactions with other proteins. The structure of p300/CBP is characterized by an elongated globular domain, which contains a seven-stranded β-sheet in the center that is surrounded by nine α-helices and several loops. The structure of the central core region associated with acetyl-CoA binding is conserved with respect to GNAT and MYST HATs, but there are many structural differences in the regions flanking this central core. Overall, the structural data are consistent with the observation that p300/CBP HATs are more promiscuous than GNAT and MYST HATs with respect to substrate binding.

Rtt109
The structure of Rtt109 is very similar to that of p300 despite there being only 7% sequence identity between the two proteins. Notably, there is a seven-stranded β-sheet that is surrounded by α-helices as well as a loop that is involved in acetyl-CoA substrate binding. Despite the conserved structure, Rtt109 and p300/CBP are functionally distinct. For instance, the substrate binding site of Rtt109 is more similar to that of the GNAT and MYST HATs. In addition, the residues in the active site of each enzyme are distinct, which suggests that they employ different catalytic mechanisms for acetyl group transfer.

Catalytic mechanisms
The basic mechanism catalyzed by HATs involves the transfer of an acetyl group from acetyl-CoA to the ε-amino group of a target lysine side chain within a histone. Different families of HATs employ unique strategies in order to effect such a transformation.



GNAT family
Members of the GNAT family have a conserved glutamate residue that acts as a general base for catalyzing the nucleophilic attack of the lysine amine on the acetyl-CoA thioester bond. These HATs use an ordered sequential bi-bi mechanism wherein both substrates (acetyl-CoA and histone) must bind to form a ternary complex with the enzyme before catalysis can occur. Acetyl-CoA binds first, followed by the histone substrate. The glutamate (Glu173 in yeast Gcn5) activates a water molecule, which removes a proton from the amine group of the target lysine. The deprotonated amine can then directly attack the carbonyl carbon of enzyme-bound acetyl-CoA. After the reaction, the acetylated histone is released first, followed by CoA.

MYST family
Studies of yeast Esa1 from the MYST family of HATs have revealed a ping-pong mechanism involving conserved glutamate and cysteine residues. The first part of the reaction involves the formation of a covalent intermediate in which a cysteine residue becomes acetylated following nucleophilic attack of this residue on the carbonyl carbon of acetyl-CoA. Then, a glutamate residue acts as a general base to facilitate transfer of the acetyl group from the cysteine to the histone substrate in a manner analogous to the mechanism used by GNATs. Interestingly, when Esa1 is assembled in the piccolo NuA4 complex, it loses its dependence on the cysteine residue for catalysis, which suggests that the reaction may proceed via a ternary bi-bi mechanism when the enzyme is part of a physiologically relevant multiprotein complex.
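The kinetic difference between a ternary-complex (sequential) mechanism, as described for the GNATs, and a ping-pong mechanism, as proposed for free Esa1, can be illustrated with the textbook steady-state initial-rate laws for two-substrate enzymes. The sketch below uses those general equations in the absence of products. The function names are hypothetical, and the parameter values are arbitrary numbers chosen for illustration, not measured constants for any HAT.

```python
def sequential_rate(a, b, vmax, ka, kb, kia):
    """Initial rate for an ordered sequential bi-bi (ternary complex) mechanism.

    a: acetyl-CoA concentration; b: histone substrate concentration.
    ka, kb: Michaelis constants for a and b; kia: dissociation constant for a.
    """
    return vmax * a * b / (kia * kb + kb * a + ka * b + a * b)

def ping_pong_rate(a, b, vmax, ka, kb):
    """Initial rate for a ping-pong bi-bi mechanism (covalent acetyl-enzyme intermediate).

    Same form as the sequential law, but without the constant kia * kb term.
    """
    return vmax * a * b / (kb * a + ka * b + a * b)

# Arbitrary illustrative values (not experimental data).
params = dict(vmax=1.0, ka=2.0, kb=5.0)
for b in (1.0, 10.0):
    seq = sequential_rate(4.0, b, kia=3.0, **params)
    pp = ping_pong_rate(4.0, b, **params)
    print(f"[histone]={b}: sequential={seq:.3f}, ping-pong={pp:.3f}")
```

The missing constant term is what distinguishes the two cases experimentally: in double-reciprocal plots of 1/rate against 1/[acetyl-CoA] at several fixed histone concentrations, a ping-pong mechanism gives parallel lines, whereas a sequential mechanism gives intersecting lines.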

p300/CBP family
In human p300, Tyr1467 acts as a general acid and Trp1436 helps orient the target lysine residue of the histone substrate into the active site. These two residues are highly conserved within the p300/CBP HAT family and, unlike enzymes in the GNAT and MYST families, p300 does not employ a general base for catalysis. Rather, it is likely that members of the p300/CBP family use a Theorell-Chance (i.e. “hit-and-run”) acetyl transfer mechanism.

Rtt109
Finally, Rtt109 is likely to employ a mechanism that is different from that of the other HATs. The yeast enzyme has very low catalytic activity in the absence of the histone chaperone proteins Asf1 and Vps75, which may be involved in delivering histone substrates to the enzyme for acetylation. Moreover, a general acid or base has not yet been identified for this HAT.

Substrate binding and specificity
The structures of several HAT domains bound to acetyl-CoA and histone substrate peptides reveal that the latter bind across a groove on the protein that is formed by the central core region at the base and is flanked on opposite sides by the variable N- and C-terminal segments that mediate the majority of the interactions with the substrate peptide. It is likely that these variable regions are at least in part responsible for the observed specificity of different HATs for various histone substrates.

Members of the GNAT and MYST families as well as Rtt109 exhibit greater substrate selectivity than p300/CBP, which is rather promiscuous with regard to substrate binding. Whereas it appears that only three to five residues on either side of the lysine to be acetylated are necessary for effective substrate binding and catalysis by members of the GNAT and p300/CBP families, more distal regions of the substrate may be important for efficient acetylation by MYST family HATs.

Lysine selectivity
Different HATs, usually in the context of multisubunit complexes, have been shown to acetylate specific lysine residues in histones.

GNAT family
Gcn5 cannot acetylate nucleosomal histones in the absence of other protein factors. In the context of complexes like SAGA and ADA, however, Gcn5 is able to acetylate H3K14 among other sites within histones H2B, H3, and H4 (e.g. H3K9, H3K36, H4K8, H4K16). Both Gcn5 and PCAF have the strongest site preference for H3K14, either as a free histone or within a nucleosome. Hat1 acetylates H4K5 and H4K12, and Hpa2 acetylates H3K14 in vitro.
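Site names such as H3K14 pack three pieces of information into one token: the histone (H3), the residue type (K for lysine), and the position in the sequence (14). A minimal Python sketch for parsing this notation is shown below. The function name parse_site and the dictionary layout are illustrative choices, and the site list is simply the Gcn5 example sites named in the paragraph above.

```python
import re

# Core histone name, one-letter residue code, then the residue position.
SITE = re.compile(r"^(H2A|H2B|H3|H4)([A-Z])(\d+)$")

def parse_site(site):
    """Split a site name such as 'H3K14' into histone, residue, and position."""
    m = SITE.match(site)
    if m is None:
        raise ValueError(f"unrecognized site name: {site!r}")
    histone, residue, position = m.groups()
    return {"histone": histone, "residue": residue, "position": int(position)}

# Gcn5 sites mentioned in the text (in the context of SAGA and ADA).
gcn5_sites = ["H3K14", "H3K9", "H3K36", "H4K8", "H4K16"]

# Group the lysine positions by histone.
by_histone = {}
for name in gcn5_sites:
    parsed = parse_site(name)
    by_histone.setdefault(parsed["histone"], []).append(parsed["position"])

print(by_histone)  # {'H3': [14, 9, 36], 'H4': [8, 16]}
```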

MYST family
In flies, acetylation of H4K16 on the male X chromosome by MOF in the context of the MSL complex is correlated with transcriptional upregulation as a mechanism for dosage compensation in these organisms. In humans, the MSL complex carries out the majority of genome-wide H4K16 acetylation. In the context of their cognate complexes, Sas2 (SAS) and Esa1 (NuA4) also carry out acetylation of H4K16, particularly in the telomere regions of chromosomes. Sas2 is also observed to acetylate H3K14 in vitro on free histones. Esa1 can also acetylate H3K14 in vitro on free histones as well as H2AK5, H4K5, H4K8, and H4K12 either in vitro or in vivo on nucleosomal histones. H2AK7 and H2BK16 are also observed to be acetylated by Esa1 in vivo. Notably, neither Sas2 nor Esa1 can acetylate nucleosomal histones in vitro as a free enzyme. This happens to be the case as well for Sas3, which is observed to acetylate H3K9 and H3K14 in vivo as well as lysine residues on H2A and H4. MOZ can also acetylate H3K14.

Others
p300/CBP acetylate all four nucleosomal core histones equally well. In vitro, they have been observed to acetylate H2AK5, H2BK12, H2BK15, H3K14, H3K18, H4K5, and H4K8. SRC-1 acetylates H3K9 and H3K14, TAFII230 (Drosophila homolog of human TAFII250) acetylates H3K14, and Rtt109 acetylates H3K9, H3K23, and H3K56 in the presence of either Asf1 or Vps75.

Non-histone substrates (in vitro)
In addition to the core histones, certain HATs acetylate a number of other cellular proteins including transcriptional activators, basal transcription factors, structural proteins, polyamines, and proteins involved in nuclear import. Acetylation of these proteins can alter their ability to interact with their cognate DNA and/or protein substrates. The idea that acetylation can affect protein function in this manner has led to inquiry regarding the role of acetyltransferases in signal transduction pathways and whether an appropriate analogy to kinases and phosphorylation events can be made in this respect.

PCAF
PCAF and p300/CBP are the two main HATs that have been observed to acetylate a number of non-histone proteins. For PCAF, these include the non-histone chromatin (high mobility group (HMG)) proteins HMG-N2/HMG17 and HMG-I(Y), the transcriptional activators p53, MyoD, E2F(1-3), and HIV Tat, and the general transcription factors TFIIE and TFIIF. Other proteins include CIITA, Brm (chromatin remodeler), NF-κB (p65), TAL1/SCL, Beta2/NeuroD, C/EBPβ, IRF2, IRF7, YY1, KLF13, EVI1, AME, ER81, and the androgen receptor (AR). PCAF has also been observed to acetylate c-MYC, GATA-2, retinoblastoma (Rb), Ku70, and E1A adenovirus protein. It can also autoacetylate, which facilitates intramolecular interactions with its bromodomain that may be involved in the regulation of its HAT activity.

p300/CBP
p300/CBP has many non-histone substrates, including the non-histone chromatin proteins HMG1, HMG-N1/HMG14, and HMG-I(Y), the transcriptional activators p53, c-Myb, GATA-1, EKLF, TCF, and HIV Tat, the nuclear receptor coactivators ACTR, SRC-1, and TIF-2, and the general transcription factors TFIIE and TFIIF. Other substrates include the transcription factors Sp1, KLF5, FOXO1, MEF2C, SRY, GATA-4, and HNF-6, HMG-B2, STAT3, the androgen and estrogen (α) receptors, GATA-2, GATA-3, MyoD, E2F(1-3), p73α, retinoblastoma (Rb), NF-κB (p50, p65), Smad7, importin-α, Ku70, E1A adenovirus protein, and S-HDAg (hepatitis delta virus small delta antigen). p300/CBP has also been observed to acetylate β-catenin, RIP140, PCNA, the DNA metabolic enzymes flap endonuclease-1, thymine DNA glycosylase, and Werner syndrome DNA helicase, STAT6, Runx1 (AML1), UBF, Beta2/NeuroD, CREB, c-Jun, C/EBPβ, NF-E2, SREBP, IRF2, Sp3, YY1, KLF13, EVI1, BCL6, HNF-4, ER81, and FOXO4 (AFX).

Multisubunit HAT complexes
The formation of multisubunit complexes has been observed to modulate the substrate specificity of HATs. In general, while recombinant HATs are able to acetylate free histones, HATs can only acetylate nucleosomal histones when they are in their respective in vivo HAT complexes. Some of the proteins that associate with HATs in these complexes function by targeting the HAT complex to nucleosomes at specific regions in the genome. For instance, it has been observed that HAT complexes (e.g. SAGA, NuA3) often use methylated histones as docking sites so that the catalytic HAT subunit can carry out histone acetylation more effectively. In addition, the formation of multisubunit HAT complexes influences the lysine specificity of HATs. The specific lysine residues that a given HAT acetylates may become either broader or more restricted in scope upon association with its respective complex. For example, the lysine specificity of MYST family HATs toward their histone substrates becomes more restricted when they associate with their complexes. In contrast, Gcn5 acquires the ability to acetylate multiple sites in both histones H2B and H3 when it joins other subunits to form the SAGA and ADA complexes. Moreover, the acetylation site specificity of Rtt109 is dictated by its association with either Vps75 or Asf1. When in complex with the former, Rtt109 acetylates H3K9 and H3K27, but when in complex with the latter, it preferentially acetylates H3K56.

Regulation of HAT activity
The catalytic activity of HATs is regulated by two types of mechanisms: (1) interaction with regulatory protein subunits and (2) autoacetylation. A given HAT may be regulated in multiple ways, and the same effector may lead to different outcomes under different conditions. Although it is clear that the association of HATs with multiprotein complexes provides a mechanism for the regulation of both HAT activity and substrate specificity in vivo, the molecular basis for how this occurs is still largely unknown. However, data suggest that associated subunits may contribute to catalysis at least in part by facilitating productive binding of the HAT complex to its native histone substrates.

The MYST family of HATs, p300/CBP, and Rtt109 have all been shown to be regulated by autoacetylation. Human MOF as well as yeast Esa1 and Sas2 are autoacetylated at a conserved active site lysine residue, and this modification is required for their function in vivo. Human p300 contains a highly basic loop embedded in the middle of its HAT domain that is hyperacetylated in the active form of the enzyme. It has been proposed that, upon autoacetylation, this loop is released from the electronegative substrate binding site where it sits in the inactive HAT. Acetylation of yeast Rtt109 at Lys290 is also required for it to exhibit full catalytic activity. Some HATs are also inhibited by acetylation. For example, the HAT activity of the nuclear receptor coactivator ACTR is inhibited upon acetylation by p300/CBP.