DNA demethylation



For molecular biology in mammals, DNA demethylation causes replacement of 5-methylcytosine (5mC) in a DNA sequence by cytosine (C) (see figure of 5mC and C). DNA demethylation can occur by an active process at the site of a 5mC in a DNA sequence or, in replicating cells, by preventing addition of methyl groups to DNA so that the replicated DNA will largely have cytosine in the DNA sequence (5mC will be diluted out).

Methylated cytosine is frequently present in the linear DNA sequence where a cytosine is followed by a guanine in a 5' → 3' direction (a CpG site). In mammals, DNA methyltransferases (which add methyl groups to DNA bases) exhibit a strong sequence preference for cytosines at CpG sites. There appear to be more than 20 million CpG dinucleotides in the human genome (see genomic distribution). In mammals, on average, 70% to 80% of CpG cytosines are methylated, though the level of methylation varies with different tissues. Methylated cytosines often occur in groups or CpG islands within the promoter regions of genes, where such methylation may reduce or silence gene expression (see gene expression). Methylated cytosines in the gene body, however, are positively correlated with expression.

Almost 100% DNA demethylation occurs by a combination of passive dilution and active enzymatic removal during the reprogramming that occurs in early embryogenesis and in gametogenesis. Another large demethylation, of about 3% of all genes, can occur by active demethylation in neurons during formation of a strong memory. After surgery, demethylations are found in peripheral blood mononuclear cells at sites annotated to immune system genes. Demethylations also occur during the formation of cancers. During global DNA hypomethylation of tumor genomes, there is a minor to moderate reduction of the number of methylated cytosines (5mC) amounting to a loss of about 5% to 20% on average of the 5mC bases.

Early embryonic development
The mouse sperm genome is 80–90% methylated at its CpG sites in DNA, amounting to about 20 million methylated sites. After fertilization, the paternal chromosome is almost completely demethylated in six hours by an active process, before DNA replication (blue line in Figure).

Demethylation of the maternal genome occurs by a different process. In the mature oocyte, about 40% of its CpG sites in DNA are methylated. While somatic cells of mammals have three main DNA methyltransferases (which add methyl groups to cytosines at CpG sites), DNMT1, DNMT3A, and DNMT3B, in the pre-implantation embryo up to the blastocyst stage (see Figure), the only methyltransferase present is an isoform of DNMT1 designated DNMT1o. DNMT1o has an alternative oocyte-specific promoter and first exon (exon 1o) located 5' of the somatic and spermatocyte promoters. As reviewed by Howell et al., DNMT1o is sequestered in the cytoplasm of mature oocytes and in 2-cell and 4-cell embryos, but at the 8-cell stage is only present in the nucleus. At the 16 cell stage (the morula) DNMT1o is again found only in the cytoplasm. It appears that demethylation of the maternal chromosomes largely takes place by blockage of the methylating enzyme DNMT1o from entering the nucleus except briefly at the 8 cell stage. The maternal-origin DNA thus undergoes passive demethylation by dilution of the methylated maternal DNA during replication (red line in Figure). The morula (at the 16 cell stage), has only a small amount of DNA methylation (black line in Figure).

DNMT3b begins to be expressed in the blastocyst. Methylation begins to increase at 3.5 days after fertilization in the blastocyst, and a large wave of methylation then occurs on days 4.5 to 5.5 in the epiblast, going from 12% to 62% methylation, and reaching maximum level after implantation in the uterus. By day seven after fertilization, the newly formed primordial germ cells (PGC) in the implanted embryo segregate from the remaining somatic cells. At this point the PGCs have about the same level of methylation as the somatic cells.

Gametogenesis
The newly formed primordial germ cells (PGC) in the implanted embryo devolve from the somatic cells. At this point the PGCs have high levels of methylation. These cells migrate from the epiblast toward the gonadal ridge. As reviewed by Messerschmidt et al., the majority of PGCs are arrested in the G2 phase of the cell cycle, while they migrate toward the hindgut during embryo days 7.5 to 8.5. Then demethylation of the PGCs takes place in two waves. At day 9.5 the primordial germ cells begin to rapidly replicate going from about 200 PGCs at embryo day 9.5 to about 10,000 PGCs at day 12.5. During days 9.5 to 12.5 DNMT3a and DNMT3b are repressed and DNMT1 is present in the nucleus at a high level. But DNMT1 is unable to methylate cytosines during days 9.5 to 12.5 because the UHRF1 gene (also known as NP95) is repressed and UHRF1 is an essential protein needed to recruit DNMT1 to replication foci where maintenance DNA methylation takes place. This is a passive, dilution form of demethylation.

In addition, from embryo day 9.5 to 13.5 there is an active form of demethylation. As indicated below in "Molecular stages of active reprogramming," two enzymes are central to active demethylation. These are a ten-eleven translocation methylcytosine dioxygenase (TET) and thymine-DNA glycosylase (TDG). One particular TET enzyme, TET1, and TDG are present at high levels from embryo day 9.5 to 13.5, and are employed in active demethylation during gametogenesis. PGC genomes display the lowest levels of DNA methylation of any cells in the entire life cycle of the mouse at embryonic day 13.5.

Learning and Memory


Learning and memory have levels of permanence, differing from other mental processes such as thought, language, and consciousness, which are temporary in nature. Learning and memory can be either accumulated slowly (multiplication tables) or rapidly (touching a hot stove), but once attained, can be recalled into conscious use for a long time. Rats subjected to one instance of contextual fear conditioning create an especially strong long-term memory. At 24 hours after training, 9.17% of the genes in the genomes of rat hippocampus neurons were found to be differentially methylated. This included more than 2,000 differentially methylated genes at 24 hours after training, with over 500 genes being demethylated. Similar results to that in the rat hippocampus were also obtained in mice with contextual fear conditioning.

The hippocampus region of the brain is where contextual fear memories are first stored (see figure of the brain, this section), but this storage is transient and does not remain in the hippocampus. In rats contextual fear conditioning is abolished when the hippocampus is subjected to hippocampectomy just one day after conditioning, but rats retain a considerable amount of contextual fear when hippocampectomy is delayed by four weeks. In mice, examined at 4 weeks after conditioning, the hippocampus methylations and demethylations were reversed (the hippocampus is needed to form memories but memories are not stored there) while substantial differential CpG methylation and demethylation occurred in cortical neurons during memory maintenance. There were 1,223 differentially methylated genes in the anterior cingulate cortex of mice four weeks after contextual fear conditioning. Thus, while there were many methylations in the hippocampus shortly after memory was formed, all these hippocampus methylations were demethylated as soon as four weeks later.

Demethylation in Cancer
The human genome contains about 28 million CpG sites, and roughly 60% of the CpG sites are methylated at the 5 position of the cytosine. During formation of a cancer there is an average reduction of the number of methylated cytosines of about 5% to 20%, or about 840,00 to 3.4 million demethylations of CpG sites.

DNMT1 methylates CpGs on hemi-methylated DNA during DNA replication. Thus, when a DNA strand has a methylated CpG, and the newly replicated strand during semi-conservative replication lacks a methyl group on the complementary CpG, DNMT1 is normally recruited to the hemimethylated site and adds a methyl group to cytosine in the newly synthesized CpG. However, recruitment of DNMT1 to hemimethylated CpG sites during DNA replication depends on the UHRF1 protein. If UHRF1 does not bind to a hemimethylated CpG site, then DNMT1 is not recruited and cannot methylate the newly synthesized CpG site. The arginine methyltransferase PRMT6 regulates DNA methylation by methylating the arginine at position 2 of histone 3 (H3R2me2a). (See Protein methylation.) In the presence of H3R2me2a UHRF1 can not bind to a hemimethylated CpG site, and then DNMT1 is not recruited to the site, and the site remains hemimethylated. Upon further rounds of replication the methylated CpG is passively diluted out. PRMT6 is frequently overexpressed in many types of cancer cells. The overexpression of PRMT6 may be a source of DNA demethylation in cancer.

Molecular stages of active reprogramming
Three molecular stages are required for actively, enzymatically reprogramming the DNA methylome. Stage 1: Recruitment. The enzymes needed for reprogramming are recruited to genome sites that require demethylation or methylation. Stage 2: Implementation. The initial enzymatic reactions take place. In the case of methylation, this is a short step that results in the methylation of cytosine to 5-methylcytosine. Stage 3: Base excision DNA repair. The intermediate products of demethylation are catalysed by specific enzymes of the base excision DNA repair pathway that finally restore cystosine in the DNA sequence.

Stage 2 of active demethylation


Demethylation of 5-methylcytosine to generate 5-hydroxymethylcytosine (5hmC) very often initially involves oxidation of 5mC (see Figure in this section) by ten-eleven translocation methylcytosine dioxygenases (TET enzymes). The molecular steps of this initial demethylation are shown in detail in TET enzymes. In successive steps (see Figure) TET enzymes further hydroxylate 5hmC to generate 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). Thymine-DNA glycosylase (TDG) recognizes the intermediate bases 5fC and 5caC and excises the glycosidic bond resulting in an apyrimidinic site (AP site). This is followed by base excision repair (stage 3). In an alternative oxidative deamination pathway, 5hmC can be oxidatively deaminated by APOBEC (AID/APOBEC) deaminases to form 5-hydroxymethyluracil (5hmU). Also, 5mC can be converted to thymine (Thy). 5hmU can be cleaved by TDG, MBD4, NEIL1 or SMUG1. AP sites and T:G mismatches are then repaired by base excision repair (BER) enzymes to yield cytosine (Cyt). The TET family of dioxygenases are employed in the most frequent type of demethylation reactions.

TET family
TET dioxygenase isoforms include at least two isoforms of TET1, one of TET2 and three isoforms of TET3. The full-length canonical TET1 isoform appears virtually restricted to early embryos, embryonic stem cells and primordial germ cells (PGCs). The dominant TET1 isoform in most somatic tissues, at least in the mouse, arises from alternative promoter usage which gives rise to a short transcript and a truncated protein designated TET1s. The isoforms of TET3 are the full length form TET3FL, a short form splice variant TET3s, and a form that occurs in oocytes and neurons designated TET3o. TET3o is created by alternative promoter use and contains an additional first N-terminal exon coding for 11 amino acids. TET3o only occurs in oocytes and neurons and is not expressed in embryonic stem cells or in any other cell type or adult mouse tissue tested. Whereas TET1 expression can barely be detected in oocytes and zygotes, and TET2 is only moderately expressed, the TET3 variant TET3o shows extremely high levels of expression in oocytes and zygotes, but is nearly absent at the 2-cell stage. It is possible that TET3o, high in neurons, oocytes and zygotes at the one cell stage, is the major TET enzyme utilized when very large scale rapid demethylations occur in these cells.

Stage 1 of demethylation - recruitment of TET to DNA
The TET enzymes do not specifically bind to 5-methylcytosine except when recruited. Without recruitment or targeting, TET1 predominantly binds to high CG promoters and CpG islands (CGIs) genome-wide by its CXXC domain that can recognize un-methylated CGIs. TET2 does not have an affinity for 5-methylcytosine in DNA. The CXXC domain of the full-length TET3, which is the predominant form expressed in neurons, binds most strongly to CpGs where the C was converted to 5-carboxycytosine (5caC). However, it also binds to un-methylated CpGs.



For a TET enzyme to initiate demethylation it must first be recruited to a methylated CpG site in DNA. Two of the proteins shown to recruit a TET enzyme to a methylated cytosine in DNA are OGG1 (see figure Initiation of DNA demethylation at a CpG site) and EGR1.

OGG1
Oxoguanine glycosylase (OGG1) catalyses the first step in base excision repair of the oxidatively damaged base 8-OHdG. OGG1 finds 8-OHdG by sliding along the linear DNA at 1,000 base pairs of DNA in 0.1 seconds. OGG1 very rapidly finds 8-OHdG. OGG1 proteins bind to oxidatively damaged DNA with a half maximum time of about 6 seconds. When OGG1 finds 8-OHdG it changes conformation and complexes with 8-OHdG in its binding pocket. OGG1 does not immediately act to remove the 8-OHdG. Half maximum removal of 8-OHdG takes about 30 minutes in HeLa cells in vitro, or about 11 minutes in the livers of irradiated mice. DNA oxidation by reactive oxygen species preferentially occurs at a guanine in a methylated CpG site, because of a lowered ionization potential of guanine bases adjacent to 5-methylcytosine. TET1 binds (is recruited to) the OGG1 bound to 8-OHdG (see figure). This likely allows TET1 to demethylate an adjacent methylated cytosine. When human mammary epithelial cells (MCF-10A) were treated with H2O2, 8-OHdG increased in DNA by 3.5-fold and this caused about 80% demethylation of the 5-methylcytosines in the MCF-10A genome.

EGR1
The gene early growth response protein 1 (EGR1) is an immediate early gene (IEG). EGR1 can rapidly be induced by neuronal activity. The defining characteristic of IEGs is the rapid and transient up-regulation—within minutes—of their mRNA levels independent of protein synthesis. In adulthood, EGR1 is expressed widely throughout the brain, maintaining baseline expression levels in several key areas of the brain including the medial prefrontal cortex, striatum, hippocampus and amygdala. This expression is linked to control of cognition, emotional response, social behavior and sensitivity to reward. EGR1 binds to DNA at sites with the motifs 5′-GCGTGGGCG-3′ and 5'-GCGGGGGCGG-3′ and these motifs occur primarily in promoter regions of genes. The short isoform TET1s is expressed in the brain. EGR1 and TET1s form a complex mediated by the C-terminal regions of both proteins, independently of association with DNA. EGR1 recruits TET1s to genomic regions flanking EGR1 binding sites. In the presence of EGR1, TET1s is capable of locus-specific demethylation and activation of the expression of downstream genes regulated by EGR1.

DNA demethylation intermediate 5hmC
As indicated in the Figure above, captioned "Demethylation of 5-methylcytosine," the first step in active demethylation is a TET oxidation of 5-methylcytosine (5mC) to 5-hydroxymethylcytosine (5hmC). The demethylation process, in some tissues and at some genome locations, may stop at that point. As reviewed by Uribe-Lewis et al., in addition to being an intermediate in active DNA demethylation, 5hmC is often a stable DNA modification. Within the genome, 5hmC is located at transcriptionally active genes, regulatory elements and chromatin associated complexes. In particular, 5hmC is dynamically changed and positively correlated with active gene transcription during cell lineage specification, and high levels of 5hmC are found in embryonic stem cells and in the central nervous system. In humans, defective 5-hydroxymethylating activity is associated with a phenotype of lymphoproliferation, immunodeficiency and autoimmunity.

Stage 3 base excision repair


The third stage of DNA demethylation is removal of the intermediate products of demethylation generated by a TET enzyme by base excision repair. As indicated above in Stage 2, after 5mC is first oxidized by a TET to form 5hmC, further oxidation of 5hmC by TET yields 5fC and oxidation of 5fC by TET yields 5caC. Both 5fC and 5caC are recognized by a DNA glycosylase, TDG, a base excision repair enzyme, as an abnormal base. As shown in the Figure in this section, TDG removes the abnormal base (e.g. 5fC) while leaving the sugar-phosphate backbone intact, creating an apurinic/apyrimidinic site, commonly referred to as an AP site. In this Figure, the 8-OHdG is left in the DNA, since it may have been present when OGG1 attracted TET1 to the CpG site with a methylated cytosine. After an AP site is formed, AP endonuclease creates a nick in the phosphodiester backbone of the AP site that was formed when the TDG DNA glycosylase removed the 5fC or 5caC. The human AP endonuclease incises DNA 5′ to the AP site by a hydrolytic mechanism, leaving a 3′-hydroxyl and a 5′-deoxyribose phosphate (5' dRP) residue. This is followed by either short patch or long patch repair. In short patch repair, 5′ dRP lyase trims the 5′ dRP end to form a phosphorylated 5′ end. This is followed by DNA polymerase β (pol β) adding a single cytosine to pair with the pre-existing guanine in the complementary strand and then DNA ligase to seal the cut strand. In long patch repair, DNA synthesis is thought to be mediated by polymerase δ and polymerase ε performing displacement synthesis to form a flap. Pol β can also perform long-patch displacement synthesis. Long-patch synthesis typically inserts 2–10 new nucleotides. Then flap endonuclease removes the flap, and this is followed by DNA ligase to seal the strand. At this point there has been a complete replacement of the 5-methylcytosine by cytosine (demethylation) in the DNA sequence.

Demethylation after exercise
Physical exercise has well established beneficial effects on learning and memory (see Neurobiological effects of physical exercise). BDNF is a particularly important regulator of learning and memory. As reviewed by Fernandes et al., in rats, exercise enhances the hippocampus expression of the gene Bdnf, which has an essential role in memory formation. Enhanced expression of Bdnf occurs through demethylation of its CpG island promoter at exon IV and this demethylation depends on steps illustrated in the two figures.

Demethylation after exposure to traffic related air pollution
In a panel of healthy adults, negative associations were found between total DNA methylation and exposure to traffic related air pollution. DNA methylation levels were associated both with recent and chronic exposure to Black Carbon as well as benzene.

Peripheral sensory neuron regeneration
After injury, neurons in the adult peripheral nervous system can switch from a dormant state with little axonal growth to robust axon regeneration. DNA demethylation in mature mammalian neurons removes barriers to axonal regeneration. This demethylation, in regenerating mouse peripheral neurons, depends upon TET3 to generate 5-hydroxymethylcytosine (5hmC) in DNA. 5hmC was altered in a large set of regeneration-associated genes (RAGs), including well-known RAGs such as Atf3, Bdnf, and Smad1, that regulate the axon growth potential of neurons.