User:Cblanken28/sandbox

Epitranscriptomics describes an aspect of molecular genetics, or a study thereof, that depends on biochemical modifications of RNA.[1][2] By analogy to the term epigenetics, described as "functionally relevant changes to the genome that do not involve a change in the nucleotide sequence", epitranscriptomics can be defined as a functionally relevant changes to the transcriptome that do not involve a change in the ribonucleotide sequence. The epitranscriptome, therefore, is defined as the ensemble of such functionally relevant changes.

There are several types of RNA modifications that impact gene expression. These modifications happen to all types of cellular RNA including, but not limited to, ribosomal RNA(rRNA), transfer RNA (tRNA), messenger RNA (mRNA), and small nuclear RNA (snRNA)[3]. There are more than one hundred documented RNA modifications. A database maintained by the University of Albany details each modification[4]. The most common and well-understood mRNA modification at present is the N6-Methyladenosine (m6A), which has been observed to occur an average of three times in every mRNA molecule[5].

The relative youth of this field means there is still much progress to be made in characterizing all modifications to the transcriptome and elucidating their mechanisms of action. Once these questions are answered and biologists have a better sense of the amount of variation in RNA modification, the focus will turn to each modification’s biological function[6]. This has already been investigated in a select few proteins such as adenosine deaminase, which acts on RNA (ADAR). ADAR has been shown to affect antibody production and the innate immune system as well as transcripts encoding important receptors for the central nervous system. This plurality in function has caused some scientists to speculate that the epitransciptome may be even more expansive than the better defined epigenome[7].

N6-Methyladenosine (m6A )
m6A describes the methylation of the nitrogen at position 6 of the adenosine base within mRNA. Discovered in 1974, m6A is the most abundant eukaryotic mRNA modification. The term "epitranscriptome" was coined following transcriptome-wide mappings of m6A sites, but does not necessarily exclude other post-transcriptional mRNA modifications.

m6A methylation regulates nuclear export of mature mRNA and mRNA stability. How, and in response to what stimulus, the cell endogeneously regulates the level of m6A methylation remains unclear at present.

N6-Methyladenosine (m6A) on Alternative Splicing
The terms “eraser” and “reader” have been associated with RNA modification. “Eraser” is a general term to describe an enzyme that de-methylates m6A. Changes that mutate the gene encoding the “eraser” enzyme lead to obesity and cancer. “Reader” proteins are involved in gene expression where there are abundant m6A; “reader” proteins have a higher propensity to bind with greater affinity, while the de-methylated form has been reported to have a decreased binding affinity.

mRNA are subject to layers of regulatory gene expression. One known mechanism involves the formation of RNA stem-loops. Stem-loops occur when complementary bases within a single-stranded RNA molecule form Watson-Crick base pairs on the stem while forming an unpaired end or loop. Stem-loops do not have one definite function, but a plethora of functions. In the case of the m6A regulatory mechanism, it is involved in alternative splicing. These stem and loop structures are subject to alterations regarding changes in pH, temperature, ion concentrations, binding nature of proteins and also nucleic acids.

m6A has been observed to be located within the loops opposite of the HNRNPC binding site. HNRNPC is single-stranded RNA binding protein where it participates in post-transcriptional regulation, specifically alternative splicing. HNRNPC protein binds to its site (uridine rich region on the stem loop) when methylated adenosine is present.The HNRNPC binding site on the mRNA consists of an abundance of uridine nucleotides. Studies have concluded that methylated adenosine residues destabilize the hairpin structure, elongating the uridine nucleotide stretch, causing the binding site to be more accessible for efficient HNRNPC protein binding.

Evidence supporting this claim identified that decreased m6A levels in the transcriptome lead to significantly reduced HNRNPC binding, so alternative splicing is co-regulated by methylation and HNRNPC binding activity. However, the m6A modification does not directly cause protein binding. Rather, it alters the loop structure to regulate gene expression by acting as a switch that exposes the HNRNPC region. It is essentially a two-step, co-regulatory mechanism prevalent in biochemistry in controlling alternative splicing. Note that demethylase enzyme can indeed “erase” the methyl group, thus inhibiting alternative splicing: it is the reverse of the two-step regulation mechanism.

mRNA capping by NAD+, NADH & dpCoA
mRNA capping in eukaryotes have shown to exist and have been studied extensively regarding the 5’ 7-methylguanylate cap and its poly-A tail. It provides RNA stability, processing, localization and translational efficiency. Recently, researchers have proven a similar mechanism in RNA pre-processing in prokaryotes. In prokaryotic (E.coli) mRNA, the 5’ is capped with nicotinamide adenine dinucleotide (NAD+ or NADH) or 3’-desphospho-coenzyme A (dpCoA). It was previously thought that the NAD+,NADH, dpCoA modification occurs after transcription analogously with the 7-methylguanylate cap; however, recently it has been shown that modifications are incorporated during transcription initiation by acting as a non-canonical initiating nucleotides (NCIN) for de novo transcription by cellular RNA polymerase. RppH and NudC pyrophosphohydrolases cleaves specific phosphate bonds to eliminate the cap modification. ATP was initially known to cap an RNA product in prokaryotes while NAD+, NADH and dpCoA was still being studied. RppH specifically cleaves 5’-triphosphate and 5’-diphosphate RNAs to 5’-monophosphate RNA products which only targets the ATP capped RNAs. On the other hand, NudC cleaves 5’-(NADH,NAD+,dpCoA) capped RNAs to 5’-monophosphate RNAs but did not cleave ATP capped RNA. This discovery of two specific hydrolases that targets specific products lead to the discovery of NCIN-mediated transcription. Both prokaryotes and eukaryotes transcripts have been observed with x-ray crystallography to have a NCIN capped RNA product in addition to being observed under in vivo condition. The efficiency of NCIN capping is largely influenced by promoter sequence especially the -35 box and -10 box upstream of the transcription start site. Studies so far have defined the mechanism and structural basis of NCIN mediated capping but the function of these caps are still to be discovered. Recent consensus is that NCIN-mediated ab initio capping occurs in all organisms.

Pseudo-seq and the regulation of pseudouridyltaion in yeast and human cells
Pseudouridine is a modified nucleoside found within non-coding RNAs (ncRNA). It increases the function of tRNA and rRNA by stabilizing the structure. Though mRNAs are not known for containing pseudouridine, the artificial process of pseudouridylation has an affect on the function of mRNA: it changes the genetic code by making non-canonical base pairing possible in the ribosome decoding center.

A certain paper looks at pseudouridylation in yeast and human RNAs using pseudo-seq, a process that utilizes a single-nucleotide-resolution method for pseudouridine identification. It identifies the known modification sites as well as other sites in ncRNAs in addition to the many pseudouridylated sites in mRNA.

There are more than 100 classes of RNA modifications that have been found, a majority of which are in tRNA and rRNA while only three modified nucleotides have been discovered inside a coding sequence of mRNA: m6A, 5-methylcytosine (m5C), and inosine. Research has shown, however, that pseudouridines are quite scarce in yeast. Nonetheless, much of the regulation in regards to pseudouridylation is regulated through the environment. In yeast this may be nutrient deprivation and in humans it is the serum starvation. When looking at yeast, research has utilized perturbing pseudouridine synthases deletion strains grown to high density and identified mRNA targets for each PUS protein. Results came back showing that most mRNA targets showed increase modification during post-diauxic growth. The pseudo-seq method identified 96 pseudouridines in 89 mRNAs, similar to yeast the growth of pseudouridine was regulated by cellular growth state. This approach provides an analysis of RNA pseudouridylation with single-nucleotide resolution and shows endogenous mRNAs are specifically pseudouridylated in a highly regulated manner in yeast and human cells. mRNA pseudouridyltaion could also bring a change in translation initiation efficiency, RNA localization, and other processes all cause pseuduridine stabilizes RNA structure.

Epitrancriptome and modulating sections of RNA
The concept of epitranscriptomics has been seen to have an effect not only on RNA but also on protein synthesis. RNA methylase NSun2 methylates mRNAs. This methylation has an effects the components of the postsynaptic neurons. The RNA modification sites are seen to occur at adenosine to create methyl-6-adenosine found around stop codons. Not much is known about the purpose of this methylation but it was found that human patients lacking NSun2 are characterized by intellectual disability and neural.

Engineered RNA Modification Techniques
Modifying RNA is possible using specific enzymes. AlkB is a demethylating enzyme found in E. Coli that removes methyl groups from methylated Cytosine and Adenine nucleotides. This enzyme can be modified to demethylate methyl-Guanosine as well. Modifying the RNA in this way allows for more accurate sequencing, which has medical applications. Given the more-than-100 different modified nucleotides found in nature, AlkB can be used to almost undo the complications introduced by modifying nucleotides in the epitranscriptome, thus allowing the sequencing of heavily modified mRNA strands.