Gene conversion

Gene conversion is the process by which one DNA sequence replaces a homologous sequence such that the sequences become identical after the conversion event. Gene conversion can be either allelic, meaning that one allele of the same gene replaces another allele, or ectopic, meaning that one paralogous DNA sequence converts another.

Allelic gene conversion
Allelic gene conversion occurs during meiosis when homologous recombination between heterozygous sites results in a mismatch in base pairing. This mismatch is then recognized and corrected by the cellular machinery, causing one of the alleles to be converted to the other. This can cause non-Mendelian segregation of alleles in germ cells.

Nonallelic/ectopic gene conversion
Recombination occurs not only during meiosis, but also as a mechanism for repair of double-strand breaks (DSBs) caused by DNA damage. These DSBs are usually repaired using the sister chromatid of the broken duplex and not the homologous chromosome, so they would not result in allelic conversion. Recombination also occurs between homologous sequences present at different genomic loci (paralogous sequences) which have resulted from previous gene duplications. Gene conversion occurring between paralogous sequences (ectopic gene conversion) is conjectured to be responsible for concerted evolution of gene families.

Mechanism
Conversion of one allele to the other is often due to base mismatch repair during homologous recombination: if one of the four chromatids during meiosis pairs up with another chromatid, as can occur because of sequence homology, DNA strand transfer can occur followed by mismatch repair. This can alter the sequence of one of the chromosomes, so that it is identical to the other.

Meiotic recombination is initiated through formation of a double-strand break (DSB). The 5’ ends of the break are then degraded, leaving long 3’ overhangs of several hundred nucleotides. One of these 3’ single-stranded DNA segments then invades a homologous sequence on the homologous chromosome, forming an intermediate which can be repaired through different pathways resulting either in crossovers (CO) or noncrossovers (NCO). At various steps of the recombination process, heteroduplex DNA (double-stranded DNA consisting of single strands from each of the two homologous chromosomes, which may or may not be perfectly complementary) is formed. When mismatches occur in heteroduplex DNA, the sequence of one strand is repaired so that it pairs with the other strand with perfect complementarity, leading to the conversion of one sequence to another. This repair process can follow either of two alternative pathways as illustrated in the Figure. By one pathway, a structure called a double Holliday junction (DHJ) is formed, leading to the exchange of DNA strands. By the other pathway, referred to as synthesis-dependent strand annealing (SDSA), there is information exchange but not physical exchange. Gene conversion will occur during SDSA if the two DNA molecules are heterozygous at the site of the recombinational repair. Gene conversion may also occur during recombinational repair involving a DHJ, and this gene conversion may be associated with physical recombination of the DNA duplexes on the two sides of the DHJ.
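The way mismatch repair in heteroduplex DNA converts one sequence into the other can be sketched in Python. This is an illustration only: the strand representation (two position-aligned uppercase strings, the second written 3' to 5'), the function names, and the `template` argument are assumptions made for this sketch, and the enzymatic repair machinery itself is not modeled.

```python
# Watson-Crick pairing rules for uppercase DNA bases (assumed input alphabet).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def mismatch_positions(top: str, bottom: str) -> list[int]:
    """Return indices where the two strands of a heteroduplex do not pair.

    `top` is read 5' to 3'; `bottom` is the paired strand written 3' to 5',
    so position i of `top` faces position i of `bottom`.
    """
    if len(top) != len(bottom):
        raise ValueError("strands must have the same length")
    return [i for i, (t, b) in enumerate(zip(top, bottom)) if COMPLEMENT[t] != b]


def repair_heteroduplex(top: str, bottom: str, template: str = "top") -> tuple[str, str]:
    """Resolve every mismatch by copying from the chosen template strand.

    The strand that is NOT the template is rewritten, so its sequence is
    converted to match the template's allele.
    """
    if template not in ("top", "bottom"):
        raise ValueError("template must be 'top' or 'bottom'")
    top_bases, bottom_bases = list(top), list(bottom)
    for i in mismatch_positions(top, bottom):
        if template == "top":
            bottom_bases[i] = COMPLEMENT[top_bases[i]]
        else:
            top_bases[i] = COMPLEMENT[bottom_bases[i]]
    return "".join(top_bases), "".join(bottom_bases)
```

With a heteroduplex whose last position is a T:G mismatch (`top = "ATGT"`, `bottom = "TACG"`), using the top strand as template yields a T:A pair at that position, while using the bottom strand yields a C:G pair. Which strand serves as template determines which allele is the donor.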

Biased vs. unbiased gene conversion
Biased gene conversion (BGC) occurs when one allele has a higher probability of being the donor than the other in a gene conversion event. For example, when a T:G mismatch occurs, repair may favor either the C:G pair or the T:A pair rather than producing the two with equal frequency. The favored allele then has a higher probability of transmission to the next generation. Unbiased gene conversion means that both outcomes occur with equal probability.
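The effect of donor bias on transmission can be made concrete with a simple expected-value calculation. The sketch below rests on a hypothetical model that is not taken from the literature discussed here: an Aa heterozygote undergoes conversion at the site with probability `conversion_rate` per meiosis, and, given a conversion, allele A is the donor with probability `donor_bias` (producing a 3A:1a tetrad) and the acceptor otherwise (producing 1A:3a). Both parameter names are assumptions introduced for illustration.

```python
def transmission_probability(conversion_rate: float, donor_bias: float) -> float:
    """Expected fraction of gametes carrying allele A from an Aa heterozygote.

    conversion_rate: probability that a meiosis includes a conversion at the site.
    donor_bias: probability that A is the donor, given that a conversion occurs.
    """
    if not (0.0 <= conversion_rate <= 1.0 and 0.0 <= donor_bias <= 1.0):
        raise ValueError("both arguments must lie in [0, 1]")
    no_conversion = (1.0 - conversion_rate) * 0.5              # 2A:2a tetrads
    a_is_donor = conversion_rate * donor_bias * 0.75           # 3A:1a tetrads
    a_is_acceptor = conversion_rate * (1.0 - donor_bias) * 0.25  # 1A:3a tetrads
    return no_conversion + a_is_donor + a_is_acceptor
```

Under this model the result simplifies to 0.5 + conversion_rate × (2 × donor_bias − 1) / 4. Unbiased conversion (`donor_bias = 0.5`) therefore leaves the Mendelian expectation of 0.5 unchanged regardless of how often conversion occurs, while any bias shifts transmission in favor of the preferred donor.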

GC-biased gene conversion
GC-biased gene conversion (gBGC) is the process by which the GC content of DNA increases due to gene conversion during recombination. Evidence for gBGC exists in yeasts and humans, and the theory has more recently been tested in other eukaryotic lineages. In analyzed human DNA sequences, crossover rate has been found to correlate positively with GC content. The pseudoautosomal regions (PARs) of the X and Y chromosomes in humans, which are known to have high recombination rates, also have high GC content. Certain mammalian genes undergoing concerted evolution (for example, ribosomal operons, tRNAs, and histone genes) are very GC-rich. It has been shown that GC content is higher in paralogous human and mouse histone genes that are members of large subfamilies (presumably undergoing concerted evolution) than in paralogous histone genes with relatively unique sequences. There is also evidence for GC bias in the mismatch repair process. It is thought that this may be an adaptation to the high rate of methyl-cytosine deamination, which can lead to C→T transitions.

BGC of the Fxy gene in Mus musculus
The Fxy or Mid1 gene in some mammals closely related to house mice (humans, rats, and other Mus species) is located in the sex-linked region of the X chromosome. However, in Mus musculus, it has recently translocated such that the 3’ end of the gene overlaps with the PAR of the X chromosome, which is known to be a recombination hotspot. This portion of the gene has experienced a dramatic increase in GC content and in substitution rate at the third codon position as well as in introns, but the 5’ region of the gene, which is X-linked, has not. Because this effect is present only in the region of the gene experiencing an increased recombination rate, it is attributed to biased gene conversion rather than selective pressure.

Impact of GC-biased gene conversion on human genomic patterns
GC content varies widely across the human genome (40–80%), and there seem to be large sections of the genome where GC content is, on average, higher or lower than in other regions. These regions, although not always showing clear boundaries, are known as isochores. One possible explanation for the presence of GC-rich isochores is that they evolved due to GC-biased gene conversion in regions with high levels of recombination.
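Regional variation in GC content of this kind is commonly examined by computing GC content in windows along a sequence. The following Python sketch shows one simple way to do so; the function names, the non-overlapping or overlapping window scheme controlled by `window` and `step`, and the choice to ignore ambiguous bases such as N are assumptions for illustration rather than a standard isochore-detection method.

```python
from typing import Optional


def gc_content(seq: str) -> Optional[float]:
    """Fraction of G and C among the unambiguous bases (A, C, G, T) of `seq`.

    Returns None when the sequence contains no unambiguous bases.
    """
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        return None
    return (seq.count("G") + seq.count("C")) / counted


def windowed_gc(seq: str, window: int, step: int) -> list[tuple[int, Optional[float]]]:
    """GC content of each full-length window, as (start index, GC fraction) pairs."""
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    return [
        (start, gc_content(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]
```

For example, `windowed_gc("AAAAGGGG", window=4, step=4)` reports a GC fraction of 0.0 for the first window and 1.0 for the second. On genomic data the window would typically span many kilobases, and a trailing partial window is dropped by this sketch.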

Adaptive function of recombination
Studies of gene conversion have contributed to our understanding of the adaptive function of meiotic recombination. The ordinary segregation pattern of an allele pair (Aa) among the 4 products of meiosis is 2A:2a. Detection of infrequent gene conversion events (e.g. 3:1 or 1:3 segregation patterns during individual meioses) provides insight into the alternate pathways of recombination leading either to crossover or non-crossover chromosomes. Gene conversion events are thought to arise where the “A” and “a” alleles happen to be near the exact location of a molecular recombination event. Thus, it is possible to measure the frequency with which gene conversion events are associated with crossover or non-crossover of chromosomal regions adjacent to, but outside, the immediate conversion event. Numerous studies of gene conversion in various fungi (which are especially suited for such studies) have been carried out, and the findings of these studies have been reviewed by Whitehouse. It is clear from this review that most gene conversion events are not associated with outside marker exchange. Thus, most gene conversion events in the several different fungi studied are associated with non-crossover of outside markers. Non-crossover gene conversion events are mainly produced by Synthesis Dependent Strand Annealing (SDSA). This process involves limited informational exchange, but not physical exchange of DNA, between the two participating homologous chromosomes at the site of the conversion event, and little genetic variation is produced. Thus, explanations for the adaptive function of meiotic recombination that focus exclusively on the adaptive benefit of producing new genetic variation or physical exchange seem inadequate to explain the majority of recombination events during meiosis. However, the majority of meiotic recombination events can be explained by the proposal that they are an adaptation for repair of damage in the DNA that is to be passed on to gametes.

Of particular interest, from the point of view that recombination is an adaptation for DNA repair, are the studies in yeast showing that gene conversion in mitotic cells is increased by UV and ionizing radiation.

Genetic diseases of humans
In discussions of human genetic disease, pseudogene-mediated gene conversion that introduces pathogenic mutations into functional genes is a well-known mechanism of mutation. In contrast, pseudogenes could also serve as templates for potentially advantageous changes: over the course of evolution, a functional gene may acquire sequence from pseudogene copies that were originally derived from that same source gene. Such pseudogene-templated changes might eventually become fixed, provided they have no deleterious effects. Pseudogenes can thus act as sources of sequence variants that are transferred to functional genes in novel combinations and then acted upon by selection. Sialic acid-binding Ig-like lectin 11 (SIGLEC11), a human immunoglobulin-like lectin that binds sialic acid, can be considered an example of such a gene conversion event that has played a significant role in evolution. Comparison of human SIGLEC11 and its pseudogene with their homologs in the chimpanzee, gorilla and orangutan indicates that approximately 2 kbp of sequence, covering the 5’ upstream region and the exons that encode the sialic acid recognition domain, was converted from the closely flanking hSIGLECP16 pseudogene (Hayakawa et al., 2005). Three lines of evidence together suggest that this was an adaptive change of evolutionary importance in the genus Homo: the gene conversion occurred only in the human lineage; SIGLEC11 acquired substantial expression in the brain cortex specifically in the human lineage; and the human protein shows a change in substrate binding compared with its chimpanzee counterpart. How often pseudogene-mediated gene conversion contributes to functional and adaptive change in human evolution is still unknown and has scarcely been explored.
Nevertheless, the SIGLEC11 example suggests that the introduction of positively selected genetic changes by this mechanism merits consideration. In some cases, insertion of transposable elements into some members of a gene family creates variation among them; the resulting loss of sequence similarity may eventually halt gene conversion between those members, leading to divergent evolution.

Genomic analysis
From various genome analyses, it was concluded that double-strand breaks (DSBs) can be repaired via homologous recombination by at least two different but related pathways. The major pathway uses homologous sequences on both sides of the DSB and appears analogous to the conservative DSB repair model originally proposed for meiotic recombination in yeast, whereas the minor pathway is restricted to one side of the DSB, as postulated by the nonconservative one-sided invasion model. In both cases, however, the sequence of the recombination partners is fully conserved. Because of their high degree of homology, the new gene copies that arise from gene duplication are naturally prone to either unequal crossover or unidirectional gene conversion events. In the latter process there are acceptor and donor sequences: the acceptor sequence is replaced by a sequence copied from the donor, while the sequence of the donor remains unchanged.

Successful gene conversion depends on effective homology between the interacting sequences. Additionally, the frequency of gene conversion is inversely proportional to the distance between the interacting sequences in cis, and the rate of gene conversion is usually directly proportional to the length of the uninterrupted sequence tract in the assumed converted region. Conversion tracts accompanying crossover appear to be longer (mean length = ~460 bp) than conversion tracts without crossover (mean length = 55–290 bp). Studies of human globin genes have long supported the view that gene conversion or branch migration events can be either promoted or inhibited by specific motifs in the vicinity of the DNA sequence (Papadakis and Patrinos, 1999). Gene conversion events can also be classified as interlocus (also called nonallelic) or interallelic. Interlocus gene conversion, in cis or in trans, occurs between nonallelic gene copies residing on sister chromatids or homologous chromosomes, whereas interallelic gene conversion takes place between alleles residing on homologous chromosomes (adapted from Chen et al., 2007). Interlocus gene conversion events frequently exhibit biased directionality. Sometimes, as in the case of human globin genes (Papadakis and Patrinos, 1999), the direction of gene conversion correlates with the relative expression levels of the participating genes, with the gene expressed at the higher level, called the ‘master’ gene, converting the gene with lower expression, called the ‘slave’ gene. Originally formulated in an evolutionary context, the ‘master/slave gene’ rule should be interpreted with caution: increased transcription of a gene raises the likelihood that it is used not only as a donor but also as an acceptor.

Effect
An organism that has inherited different copies of a gene from each of its parents is called heterozygous. This is generically represented as genotype Aa (i.e. one copy of allele 'A' and one copy of allele 'a'). When a heterozygote creates gametes by meiosis, the alleles normally duplicate and end up in a 2:2 ratio in the four cells that are the direct products of meiosis. In gene conversion, however, a ratio other than the expected 2A:2a is observed, such as 3A:1a or 1A:3a. In other words, there can be three times as many A alleles as a alleles present in the daughter cells, as is the case in 3A:1a.
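These segregation ratios can be illustrated with a short Python sketch that treats the four products of a single meiosis as a list of allele labels. The list representation and the function names `segregation_ratio` and `convert_one` are assumptions made for illustration; the sketch only counts alleles and does not model the underlying recombination.

```python
from collections import Counter


def segregation_ratio(tetrad: list[str]) -> str:
    """Summarize the four meiotic products as a string such as '2A:2a'."""
    if len(tetrad) != 4 or any(allele not in ("A", "a") for allele in tetrad):
        raise ValueError("a tetrad must contain exactly four 'A'/'a' alleles")
    counts = Counter(tetrad)
    return f"{counts['A']}A:{counts['a']}a"


def convert_one(tetrad: list[str], donor: str) -> list[str]:
    """Return a copy of the tetrad in which one product carrying the other
    allele has been converted to the `donor` allele."""
    if donor not in ("A", "a"):
        raise ValueError("donor must be 'A' or 'a'")
    acceptor = "a" if donor == "A" else "A"
    products = list(tetrad)
    # list.index raises ValueError if no product carries the acceptor allele.
    products[products.index(acceptor)] = donor
    return products
```

Starting from the Mendelian tetrad `["A", "A", "a", "a"]`, which `segregation_ratio` reports as `2A:2a`, a single conversion with A as donor gives `3A:1a`, and one with a as donor gives `1A:3a`.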

Medical relevance
Gene conversion resulting in mutation of the CYP21A2 gene is a common underlying genetic cause of congenital adrenal hyperplasia. Somatic gene conversion is one of the mechanisms that can result in familial retinoblastoma, a congenital cancer of the retina, and it is theorized that gene conversion may play a role in the development of Huntington's disease.