Aminoacyl tRNA synthetase

An aminoacyl-tRNA synthetase (aaRS or ARS), also called tRNA-ligase, is an enzyme that attaches the appropriate amino acid onto its corresponding tRNA. It does so by catalyzing the transesterification of a specific cognate amino acid or its precursor to one of all its compatible cognate tRNAs to form an aminoacyl-tRNA. In humans, the 20 different types of aa-tRNA are made by the 20 different aminoacyl-tRNA synthetases, one for each amino acid of the genetic code.

This is sometimes called "charging" or "loading" the tRNA with an amino acid. Once the tRNA is charged, a ribosome can transfer the amino acid from the tRNA onto a growing peptide, according to the genetic code. Aminoacyl tRNA therefore plays an important role in RNA translation, the expression of genes to create proteins.

Mechanism
The synthetase first binds ATP and the corresponding amino acid (or its precursor) to form an aminoacyl-adenylate, releasing inorganic pyrophosphate (PPi). The adenylate-aaRS complex then binds the appropriate tRNA molecule's D arm, and the amino acid is transferred from the aa-AMP to either the 2'- or the 3'-OH of the last tRNA nucleotide (A76) at the 3'-end.

The mechanism can be summarized in the following reaction series:
 * 1) Amino Acid + ATP → Aminoacyl-AMP + PPi
 * 2) Aminoacyl-AMP + tRNA → Aminoacyl-tRNA + AMP

Summing the reactions, the highly exergonic overall reaction is as follows:


 * Amino Acid + tRNA + ATP → Aminoacyl-tRNA + AMP + PPi

Some synthetases also mediate an editing reaction to ensure high fidelity of tRNA charging. If the incorrect tRNA is added (aka. the tRNA is found to be improperly charged), the aminoacyl-tRNA bond is hydrolyzed. This can happen when two amino acids have different properties even if they have similar shapes—as is the case with valine and threonine.

The accuracy of aminoacyl-tRNA synthetase is so high that it is often paired with the word "superspecificity” when it is compared to other enzymes that are involved in metabolism. Although not all synthetases have a domain with the sole purpose of editing, they make up for it by having specific binding and activation of their affiliated amino acids. Another contribution to the accuracy of these synthetases is the ratio of concentrations of aminoacyl-tRNA synthetase and its cognate tRNA. Since tRNA synthetase improperly acylates the tRNA when the synthetase is overproduced, a limit must exist on the levels of aaRSs and tRNAs in vivo.

Classes
There are two classes of aminoacyl tRNA synthetase, each composed of ten enzymes:


 * Class I has two highly conserved sequence motifs. It aminoacylates at the 2'-OH of a terminal adenosine nucleotide on tRNA, and it is usually monomeric or dimeric (one or two subunits, respectively).
 * Class II has three highly conserved sequence motifs. It aminoacylates at the 3'-OH of a terminal adenosine on tRNA, and is usually dimeric or tetrameric (two or four subunits, respectively). Although phenylalanine-tRNA synthetase is class II, it aminoacylates at the 2'-OH.

The amino acids are attached to the hydroxyl (-OH) group of the adenosine via the carboxyl (-COOH) group.

Regardless of where the aminoacyl is initially attached to the nucleotide, the 2'-O-aminoacyl-tRNA will ultimately migrate to the 3' position via transesterification.

Bacterial aminoacyl-tRNA synthetases can be grouped as follows: Amino acids which use class II aaRS seem to be evolutionarily older.



Structures
Both classes of aminoacyl-tRNA synthetases are multidomain proteins. In a typical scenario, an aaRS consists of a catalytic domain (where both the above reactions take place) and an anticodon binding domain (which interacts mostly with the anticodon region of the tRNA). Transfer-RNAs for different amino acids differ not only in their anticodon but also at other points, giving them slightly different overall configurations. The aminoacyl-tRNA synthetases recognize the correct tRNAs primarily through their overall configuration, not just through their anticodon. In addition, some aaRSs have additional RNA binding domains and editing domains that cleave incorrectly paired aminoacyl-tRNA molecules.

The catalytic domains of all the aaRSs of a given class are found to be homologous to one another, whereas class I and class II aaRSs are unrelated to one another. The class I aaRSs feature a cytidylyltransferase-like Rossmann fold seen in proteins like glycerol-3-phosphate cytidylyltransferase, nicotinamide nucleotide adenylyltransferase and archaeal FAD synthase, whereas the class II aaRSs have a unique fold related to biotin and lipoate ligases.

The alpha helical anticodon binding domain of arginyl-, glycyl- and cysteinyl-tRNA synthetases is known as the DALR domain after characteristic conserved amino acids.

Aminoacyl-tRNA synthetases have been kinetically studied, showing that Mg2+ ions play an active catalytic role and therefore aaRs have a degree of magnesium dependence. Increasing the Mg2+ concentration leads to an increase in the equilibrium constants for the aminoacyl-tRNA synthetases’ reactions. Although this trend was seen in both class I and class II synthetases, the magnesium dependence for the two classes are very distinct. Class II synthetases have two or (more frequently) three Mg2+ ions, while class I only requires one Mg2+ ion.

Beside their lack of overall sequence and structure similarity, class I and class II synthetases feature different ATP recognition mechanisms. While class I binds via interactions mediated by backbone hydrogen bonds, class II uses a pair of arginine residues to establish salt bridges to its ATP ligand. This oppositional implementation is manifested in two structural motifs, the Backbone Brackets and Arginine Tweezers, which are observable in all class I and class II structures, respectively. The high structural conservation of these motifs suggest that they must have been present since ancient times.

Evolution
Most of the aaRSs of a given specificity are evolutionarily closer to one another than to aaRSs of another specificity. However, AsnRS and GlnRS group within AspRS and GluRS, respectively. Most of the aaRSs of a given specificity also belong to a single class. However, there are two distinct versions of the LysRS - one belonging to the class I family and the other belonging to the class II family.

The molecular phylogenies of aaRSs are often not consistent with accepted organismal phylogenies. That is, they violate the so-called canonical phylogenetic pattern shown by most other enzymes for the three domains of life - Archaea, Bacteria, and Eukarya. Furthermore, the phylogenies inferred for aaRSs of different amino acids often do not agree with one another. In addition, aaRS paralogs within the same species show a high degree of divergence between them. These are clear indications that horizontal transfer has occurred several times during the evolutionary history of aaRSs.

A widespread belief in the evolutionary stability of this superfamily, meaning that every organism has all the aaRSs for their corresponding amino acids, is misconceived. A large-scale genomic analysis on ~2500 prokaryotic genomes showed that many of them miss one or more aaRS genes whereas many genomes have 1 or more paralogs. AlaRS, GlyRS, LeuRS, IleRS and ValRS are the most evolutionarily stable members of the family. GluRS, LysRS and CysRS often have paralogs, whereas AsnRS, GlnRS, PylRS and SepRS are often absent from many genomes.

With the exception of AlaRS, it has been discovered that 19 out of the 20 human aaRSs have added at least one new domain or motif. These new domains and motifs vary in function and are observed in various forms of life. A common novel function within human aaRSs is providing additional regulation of biological processes. There exists a theory that the increasing number of aaRSs that add domains is due to the continuous evolution of higher organisms with more complex and efficient building blocks and biological mechanisms. One key piece of evidence to this theory is that after a new domain is added to an aaRS, the domain becomes fully integrated. This new domain's functionality is conserved from that point on.

As genetic efficiency evolved in higher organisms, 13 new domains with no obvious association with the catalytic activity of aaRSs genes have been added.

Application in biotechnology
In some of the aminoacyl tRNA synthetases, the cavity that holds the amino acid can be mutated and modified to carry unnatural amino acids synthesized in the lab, and to attach them to specific tRNAs. This expands the genetic code, beyond the twenty canonical amino acids found in nature, to include an unnatural amino acid as well. The unnatural amino acid is coded by a nonsense (TAG, TGA, TAA) triplet, a quadruplet codon, or in some cases a redundant rare codon. The organism that expresses the mutant synthetase can then be genetically programmed to incorporate the unnatural amino acid into any desired position in any protein of interest, allowing biochemists or structural biologists to probe or change the protein's function. For instance, one can start with the gene for a protein that binds a certain sequence of DNA, and, by directing an unnatural amino acid with a reactive side-chain into the binding site, create a new protein that cuts the DNA at the target-sequence, rather than binding it.

By mutating aminoacyl tRNA synthetases, chemists have expanded the genetic codes of various organisms to include lab-synthesized amino acids with all kinds of useful properties: photoreactive, metal-chelating, xenon-chelating, crosslinking, spin-resonant, fluorescent, biotinylated, and redox-active amino acids. Another use is introducing amino acids bearing reactive functional groups for chemically modifying the target protein.

Certain diseases’ causation (such as neuronal pathologies, cancer, disturbed metabolic conditions, and autoimmune disorders) have been correlated to specific mutations of aminoacyl-tRNA synthetases. Charcot-Marie-Tooth (CMT) is the most frequent heritable disorder of the peripheral nervous system (a neuronal disease) and is caused by a heritable mutation in glycol-tRNA and tyrosyl-tRNA. Diabetes, a metabolic disease, induces oxidative stress, which triggers a build up of mitochondrial tRNA mutations. It has also been discovered that tRNA synthetases may be partially involved in the etiology of cancer. A high level of expression or modification of aaRSs has been observed within a range of cancers. A common outcome from mutations of aaRSs is a disturbance of dimer shape/formation which has a direct relationship with its function. These correlations between aaRSs and certain diseases have opened up a new door to synthesizing therapeutics.

Noncatalytic domains
The novel domain additions to aaRS genes are accretive and progressive up the Tree of Life. The strong evolutionary pressure for these small non-catalytic protein domains suggested their importance. Findings beginning in 1999 and later revealed a previously unrecognized layer of biology: these proteins control gene expression within the cell of origin, and when released exert homeostatic and developmental control in specific human cell types, tissues and organs during adult or fetal development or both, including pathways associated with angiogenesis, inflammation, the immune response, the mechanistic target of rapamycin (mTOR) signalling, apoptosis, tumorigenesis, and interferon gamma (IFN-) and p53 signalling.

Substrate Depletion
In 2022, it was discovered that aminoacyl-trna synthetases may incorporate alternative amino acids during shortages of their precursors. In particular, tryptophanyl-tRNA synthetase (WARS1) will incorporate phenylalanine during tryptophan depletion, essentially inducing a W>F codon reassignment. Depletion of the other substrate of aminoacyl-tRNA synthetases, the cognate tRNA, may be relevant to certain diseases, e.g. Charcot–Marie–Tooth disease. It was shown that CMT-mutant glycyl-tRNA synthetase variants are still able to bind tRNA-gly but fail to release it, leading to depletion of the cellular pool of glycyl-tRNA-gly, what in turn results in stalling of the ribosome on glycine codons during mRNA translation.

Clinical
Mutations in the mitochondrial enzyme have been associated with a number of genetic disorders including Leigh syndrome, West syndrome and CAGSSS (cataracts, growth hormone deficiency, sensory neuropathy, sensorineural hearing loss and skeletal dysplasia syndrome).

Prediction servers

 * ICAARS: B. Pawar, and GPS Raghava (2010) Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains. BMC Genomics 2010, 11:507
 * MARSpred:
 * Prokaryotic AARS database: