User:Dcbennett2/sandbox/editing

Genome editing, or genome editing with engineered nucleases (GEEN) is a type of genetic engineering in which DNA is inserted, replaced, or removed from a genome using artificially engineered nucleases, or "molecular scissors." The nucleases create specific double-strand breaks (DSBs) at desired locations in the genome and harness the cell’s endogenous mechanisms to repair the induced break by natural processes of homologous recombination (HR) and nonhomologous end-joining (NHEJ). There are currently four families of engineered nucleases being used: Zinc finger nucleases (ZFNs), Transcription Activator-Like Effector-based Nucleases (TALEN®), the CRISPR/Cas system, and engineered meganuclease re-engineered homing endonucleases.

In order to understand the function of a gene or a protein, one interferes with it in a sequence-specific way and monitors its effects on the organism. However, in some organisms, it is difficult or impossible to perform site-specific mutagenesis, and therefore more indirect methods must be used, such as silencing the gene of interest by short RNA interference (siRNA). However, gene disruption by siRNA can be variable or incomplete. Nucleases such as ZFNs or TALEN constructs can in principle cut any targeted position in the genome and introduce a modification of the endogenous sequences for genes that are impossible to specifically target using conventional RNAi.

Genome editing was selected by Nature Methods as the 2011 Method of the Year.

Concept
A common approach in the modern biological research is to manipulate the genetic sequence (genotype) of an organism (or a single cell) and observe the impact of this change on the organism (phenotype). Such approach is called reverse genetics and its significance for the modern biology lies in its relative simplicity. This method contrasts with that of forward genetics, where a new phenotype is first observed and then its genetic basis is studied. This course is more complex because phenotypic changes are often a result of multiple genetic interactions.

Among the key requirements of reverse genetic analysis is the ability to modify the genetic code. This can be achieved by:
 * site-directed mutagenesis employing either phage- or polymerase chain reaction (PCR)-mediated methods and oligonucleotides containing the desired mutation . These methods are most useful in organisms with straightforward methods for the introduction and selection of genes of interest, such as bacteria and yeast.
 * recombination based methods that utilize the natural ability of cells to exchange DNA between its own genetic information and an exogenous DNA. These methods have been made possible in yeast and mice.

These approaches have several drawbacks: The use of other mutagenic techniques (such as P-element transgenesis in Drosophila) also has limitations, the major one being the randomness of incorporation and the possibility of affecting other genes and expression patterns.
 * PCR- and phage-mediated approaches are less successful in more complex organisms such as mammals, where delivery becomes more difficult.
 * They also require stringent selection steps and thus the addition of selection-specific sequences, along with those incorporated into the DNA.
 * Recombination-based methods can be quite inefficient - e.g. in mouse embryonic stem cells treated with donor DNA, in only 1 in a million DNA molecules was incorporated at the desired position.

Hence, genome editing with engineered nucleases, is a promising new approach. This rapidly evolving technology overcomes these shortcomings and uses relatively simple concepts.

Double stranded breaks and their repair
First and foremost in understanding the use of nucleases in genome editing is the understanding of DNA double stranded break (DSB) repair mechanisms. Two of the known DSB repair pathways that are essentially functional in all organisms are the non-homologous end joining (NHEJ) and homology directed repair (HDR).

NHEJ uses a variety of enzymes to directly join the DNA ends in a double-strand break. In contrast, in HDR, a homologous sequence is utilized as a template for regeneration of missing DNA sequence at the break point. The natural properties of these pathways form the very basis of nucleases based genome editing.

NHEJ is error prone such that it was shown to cause mutations at the repair site in approximately 50% of DSB in mycobacteria and also its low fidelity has been linked to mutational accumulation in leukemias. Thus if one is able to create a DSB at a desired gene in multiple samples, it is very likely that mutations will be generated at that site in some of the treatments because of errors created by the NHEJ infidelity.

On the other hand, the dependency of HDR on a homologous sequence to repair DSBs can be exploited by inserting a desired sequence within a sequence that is homologous to the flanking sequences of a DSB which, when used as a template by HDR system, would lead to the creation of the desired change within the genomic region of interest.

Despite the distinct mechanisms, the concept of the HDR based gene editing is in a way similar to that of homologous recombination based gene targeting. However, the rate of recombination is increased by at least three orders of magnitude when DSBs are created and HDR is at work thus making the HDR based recombination much more efficient and eliminating the need for stringent positive and negative selection steps. So based on these principles if one is able to create a DSB at a specific location within the genome, then the cell’s own repair systems will help in creating the desired mutations.

Site-specific double stranded breaks
Creation of a DSB in DNA should not be a challenging task as the commonly used restriction enzymes are capable of doing so. However, if genomic DNA is treated with a particular restriction endonuclease many DSBs will be created. This is a result of the fact that most restriction enzymes recognize a few base pairs on the DNA as their target and very likely that particular base pair combination will be found in many locations across the genome. To overcome this challenge and create site-specific DSB, three distinct classes of nucleases have been discovered and bioengineered to date. These are the Zinc finger nucleases (ZFNs), transcription-activator like effector nucleases (TALEN) and meganucleases. Below is a brief overview and comparison of these enzymes and the concept behind their development.

Current engineered nucleases
Meganucleases, found commonly in microbial species, have the unique property of having very long recognition sequences (>14bp) thus making them naturally very specific. This can be exploited to make site-specific DSB in genome editing; however, the challenge is that not enough meganucleases are known, or may ever be known, to cover all possible target sequences. To overcome this challenge, mutagenesis and high throughput screening methods have been used to create meganuclease variants that recognize unique sequences. Others have been able to fuse various meganucleases and create hybrid enzymes that recognize a new sequence. Yet others have attempted to alter the DNA interacting aminoacids of the meganuclease to design sequence specific meganucelases in a method named rationally designed meganuclease (US Patent 8,021,867 B2).

Meganucleases have the benefit of causing less toxicity in cells than methods such as ZFNs, likely because of more stringent DNA sequence recognition; however, the construction of sequence-specific enzymes for all possible sequences is costly and time consuming, as one is not benefiting from combinatorial possibilities that methods such as ZFNs and TALEN-based fusions utilize.

As opposed to meganucleases, the concept behind ZFNs and TALEN technology is based on a non-specific DNA cutting enzyme, which can then be linked to specific DNA sequence recognizing peptides such as zinc fingers and transcription activator-like effectors (TALEs). The key to this was to find an endonuclease whose DNA recognition site and cleaving site were separate from each other, a situation that is not common among restriction enzymes. Once this enzyme was found, its cleaving portion could be separated which would be very non-specific as it would have no recognition ability. This portion could then be linked to sequence recognizing peptides that could lead to very high specificity. A restriction enzyme with such properties is FokI. Additionally FokI has the advantage of requiring dimerization to have nuclease activity and this means the specificity increases dramatically as each nuclease partner would recognize a unique DNA sequence. To enhance this effect, FokI nucleases have been engineered that can only function as heterodimers and have increased catalytic activity. The heterodimer functioning nucleases would avoid the possibility of unwanted homodimer activity and thus increase specificity of the DSB. Although the nuclease portions of both ZFNs and TALEN constructs have similar properties, the difference between these engineered nucleases is in their DNA recognition peptide. ZFNs rely on Cys2-His2 zinc fingers and TALEN constructs on TALEs. Both of these DNA recognizing peptide domains have the characteristic that they are naturally found in combinations in their proteins. Cys2-His2 Zinc fingers typically happen in repeats that are 3 bp apart and are found in diverse combinations in a variety of nucleic acid interacting proteins such as transcription factors. TALEs on the other hand are found in repeats with a one-to-one recognition ratio between the amino acids and the recognized nucleotide pairs. Because both zinc fingers and TALEs happen in repeated patterns, different combinations can be tried to create a wide variety of sequence specificities. Zinc fingers have been more established in these terms and approaches such as modular assembly (where Zinc fingers correlated with a triplet sequence are attached in a row to cover the required sequence), OPEN (low-stringency selection of peptide domains vs. triplet nucleotides followed by high-stringency selections of peptide combination vs. the final target in bacterial systems), and bacterial one-hybrid screening of zinc finger libraries among other methods have been used to make site specific nucleases.

One recent improvement combines the DNA binding specificity of TALEs with the nuclease specificity of meganucleases; these "megaTALs" are compatible with all current technologies and may represent improvements on existing methods.

Precision of Engineered Nucleases
Because "off-target" activity of an active nuclease would have potentially dangerous consequences at the genetic and organismal levels, the precision of meganucleases, ZFNs, CRISPR, and TALEN-based fusions has been an active area of research. While variable figures have been reported, ZFNs tend to have more cytotoxicity than TALEN methods or RNA-guided nucleases, while TALEN and RNA-guided approaches tend to have the greatest efficiency and fewer off-target effects. Based on the maximum theoretical distance between DNA binding and nuclease activity, TALEN approaches result in the greatest precision.

Applications
Over the past decade, efficient genome editing has been developed for a wide range of experimental systems ranging from plants to animals, often beyond clinical interest, and the method holds a promising future in becoming a standard experimental strategy in research labs. The recent generation of rat, zebrafish, maize and tobacco ZFN-mediated mutants and the improvements in TALEN-based approaches testify to the significance of the methods, and the list is expanding rapidly. Genome editing with engineered nucleases will likely contribute to many fields of life sciences from studying gene functions in plants and animals to gene therapy in humans. For instance, the field of synthetic biology which aims to engineer cells and organisms to perform novel functions, is likely to benefit from the ability of engineered nuclease to add or remove genomic elements and therefore create complex systems. In addition, gene functions can be studied using stem cells with engineered nucleases.

Listed below are some specific tasks this method can carry out:


 * Targeted gene mutation
 * Creating chromosome rearrangement
 * Study gene function with stem cells
 * Transgenic animals
 * Endogenous gene labeling
 * Targeted transgene addition



Targeted gene modification in plants
Genome editing using meganucleases, ZFNs, and TALEN provides a new strategy for genetic manipulation in plants and are likely to assist in the engineering of desired plant traits by modifying endogenous genes. For instance, site-specific gene addition in major crop species can be used for 'trait stacking' whereby several desired traits are physically linked to ensure their co-segregation during the breeding processes. Progress in such cases have been recently reported in Arabidopsis thaliana  and Zea mays. In Arabidopsis thaliana, using ZFN-assisted gene targeting, two herbicide-resistant genes (tobacco acetolactate synthase SuRA and SuRB) were introduced to SuR loci with as high as 2% transformed cells with mutations. In Zea mays, disruption of the target locus was achieved by ZFN-induced DSBs and the resulting NHEJ. ZFN was also used to drive herbicide-tolerance gene expression cassette (PAT) into the targeted endogenous locus IPK1 in this case. Such genome modification observed in the regenerated plants has been shown to be inheritable and was transmitted to the next generation.

In addition, TALEN-based genome engineering has been extensively tested and optimized for use in plants. TALEN fusions have also been used to improve the quality of soybean oil products and to increase the storage potential of potatoes

Several optimizations need to be made in order to improve editing plant genomes using ZFN-mediated targeting. These include the reliable design and subsequent test of the nucleases, the absence of toxicity of the nucleases, the appropriate choice of the plant tissue for targeting, the routes of introduction or induction of enzyme activity, the lack of off-target mutagenesis, and a reliable detection of mutated cases.

Gene therapy
The ideal gene therapy practice is that which replaces the defective gene with a normal allele at its natural location. This is advantageous over a virally delivered gene as there is no need to include the full coding sequences and regulatory sequences when only a small proportions of the gene needs to be altered as is often the case. The expression of the partially replaced genes is also more consistent with normal cell biology than full genes that are carried by viral vectors.

Gene targeting through ZFNs or TALEN-based approaches can also be used to modify defective genes at their endogenous chromosomal locations. Examples include the treatment of X-linked severe combined immunodeficiency (X-SCID) by ex vivo gene correction with DNA carrying the interleukin-2 receptor common gamma chain (IL-2Rγ) and the correction of Xeroderma pigmentosum mutations in vitro using TALEN. Insertional mutagenesis by the retroviral vector genome induced leukemia in some patients, a problem that is predicted to be avoided by these technologies. However, ZFNs may also cause off-target mutations, in a different way from viral transductions. Currently many measures are taken to improve off-target detection and ensure safety before treatment.

Recently, Sangamo BioSciences (SGMO) introduced the Delta 32 mutation (a suppressor of CCR5 gene which is a co-receptor for HIV-1 entry into T cells therefore enabling HIV infection) using Zinc Finger Nuclease (ZFN). Their results were presented at the 51st Interscience Conference on Antimicrobial Agents and Chemotherapy (ICAAC) held in Chicago from September 17–20, 2011. Researchers at SGMO mutated CCR5 in CD4+ T cells and subsequently produced an HIV-resistant T-cell population.

One avenue of exciting research involves the generation of custom immune cells with modifications of interest. For example, recent report indicated that T cells could be modified to inactivate the glucocorticoid receptor; the resulting immune cells are fully functional but resistant to the effects of commonly used corticosteroids. Similarly, scientists at Cellectis recently generated custom T-cells expressing chimeric antigen receptors using TALEN® technology. These T-cells can be engineered to be resistant to anti-cancer drugs and to invoke immune responses against targets of interest.

Physicians at the Great Ormond Street Hospital recently announced the first clinical use of TALEN-based genome editing. An 11-month old baby suffering from CD19+ acute lymphoblastic leukemia was treated with modified donor T cells that had been engineered to attack leukemia cells, to be resistant to Alemtuzumab, and to evade detection by the host immune system after introduction. A few weeks after therapy, the patient's condition improved; though physicians are cautious, the patient has been in remission for several months following treatment.

Other uses
Selective genome modification via TALEN® technology has been used with high efficiency in diatoms to enable the production of biofuels.

Prospect and limitations
In the future, research into genome editing with engineered nucleases needs to focus on improving safety and specificity of the nucleases. For example, improving the ability to detect off-target events can improve our ability to learn about ways of preventing them. In addition, zinc-fingers used in ZFNs are seldom completely specific, and some may cause toxicity. However, the toxicity has been reported to be reduced by modifications done on the cleavage domain of the ZFN.

In addition, the study by Dana Carroll looking at modifying the genome with engineered nucleases shows the requirement of better understanding of the basic recombination and repair machinery of DNA. In the future, a possible method to identify secondary targets would be to capture broken ends from cells expressing the ZFNs and to sequence the flanking DNA using high-throughput sequencing.

Genome editing occurs also as a natural process without artificial genetic engineering. The agents that are competent to edit genetic codes are viruses or subviral RNA-agents.

Lastly, although GEEN has higher efficiency than many other methods in reverse genetics, it is still not highly efficient; in many cases less than half of the treated populations obtain the desired changes. For example, when one is planning to use the cell's NHEJ to create a mutation, the cell's HDR systems will also be at work correcting the DSB with lower mutational rates.