User:Ckemet/Reverse pharmacology

Reverse pharmacology, or target-based pharmacology, is a process of drug development in which the identity of a molecular target (receptor, enzyme, protein, etc.) drives compound screening. Classical pharmacology, by contrast, involves determining the functional activity of a compound through in vitro and in vivo models. Once the activity of a compound is found, its ligands are identified, purified, and synthesized, and then go through biological screening assays. The most selective and potent drug is then further screened for toxicity and efficacy. Classical pharmacology can be time-consuming and expensive. Reverse pharmacology was first established in the 1970s by Sir Ram Nath Chopra and Gananath Sen. It starts from potential drug compounds that are designed specifically for targets (receptors, enzymes, proteins, etc.) involved in disorders or diseases. Binding assays are used to identify compounds that bind the molecular target. The compound then undergoes functional studies in animals to show the desired effect. Compounds identified through reverse pharmacology are thought to have greater efficacy. The goal of reverse pharmacology is to use disease pathology to identify specific, targetable elements on which novel compounds can be modeled.

Reverse Vaccinology
A subcategory of reverse pharmacology, reverse vaccinology is a computational approach to vaccine discovery that starts from the genome. Traditionally, vaccines have been developed through the isolation, inactivation, and re-injection of pathogens. Conventional vaccinology is both time-consuming and limited to antigens that can be purified for testing.

Rino Rappuoli and the J. Craig Venter Institute used reverse vaccinology to develop a vaccine against serogroup B meningococcus. Vaccines developed through reverse vaccinology tend to have better selectivity, which reduces side effects. These vaccines can increase immunity against multiple strains by incorporating multiple proteins.

The reverse vaccinology approach uses the genome sequence of the pathogen itself, so researchers are able to determine all of the protein antigens that a pathogen can express. Reverse vaccinology begins with the genomic sequence of the pathogen and computer prediction of candidates for vaccines. Scientists use computational analysis of the pathogen's genome to determine which proteins are secreted during infection. These proteins can then be identified and purified, allowing further research that consists of immunizing laboratory animals. The elicited immune response is studied and used to identify a vaccine. Conventional vaccinology differs from reverse vaccinology in that proteins purified from a cultured pathogen are used as candidates for a vaccine.

Applications of Reverse Vaccinology
The pathogens that cause diseases such as malaria, tuberculosis, and syphilis have been fully sequenced, and lists of all of their possible genes can be accessed.

Group B Meningococcus
Group B meningococcus was the first application of reverse vaccinology. The polysaccharide that was used to develop early vaccines was poorly immunogenic and caused autoimmunity. A vaccine therefore needed to be made against surface-exposed proteins that were able to fold within the outer membrane. Rino Rappuoli and the J. Craig Venter Institute screened DNA fragments for genes that coded for surface-exposed and exported proteins. These proteins were purified and used to immunize mice. Of 85 surface proteins, 25 were shown to elicit antibodies. These proteins were the basis of the vaccine against group B meningococcus.

Limits to Reverse Vaccinology
The goal of a vaccine is protective immunity against a pathogen. Reverse vaccinology relies on the availability of databases that can predict whether candidates will provide protective immunity against the pathogen. Because knowledge of vaccine immunology and of the effects of later mutations is incomplete, protective immunity is hard to predict. Another limitation of reverse vaccinology is that, because it starts from the genome, it cannot readily identify antigens that are not proteins.

Reverse Vaccinology Tools and Applications
More than 4000 viral genomes have been identified. Reverse vaccinology relies heavily on bioinformatics to obtain and analyze these large collections of viral genomes.

NERVE
New Enhanced Reverse Vaccinology Environment (NERVE) is reverse vaccinology software that imports pathogen protein sequences and predicts their biological properties. The software predicts the subcellular localization, adhesion probability, topology, human sequence similarity, and conservation of these proteins. NERVE uses four criteria to predict potential vaccine candidates: proteins that do not lie in the cytoplasm, proteins with two or fewer transmembrane helices, proteins with an adhesion probability greater than 0.46, and proteins that have low similarity to human proteins.
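
The four criteria above amount to a sequential filter over per-protein predictions. The following is a minimal sketch of that idea, not NERVE's actual code: the `ProteinPrediction` record, its field names, and the example proteins are hypothetical, and only the two numeric thresholds come from the text.

```python
from dataclasses import dataclass

# Thresholds taken from the description above. Everything else in this
# sketch (class name, field names, example data) is hypothetical.
MAX_TRANSMEMBRANE_HELICES = 2
MIN_ADHESION_PROBABILITY = 0.46


@dataclass
class ProteinPrediction:
    name: str
    localization: str            # e.g. "cytoplasm", "outer membrane", "secreted"
    transmembrane_helices: int
    adhesion_probability: float
    similar_to_human: bool       # True if flagged as similar to a human protein


def is_candidate(p: ProteinPrediction) -> bool:
    """Return True only if the protein passes all four criteria."""
    return (
        p.localization != "cytoplasm"
        and p.transmembrane_helices <= MAX_TRANSMEMBRANE_HELICES
        and p.adhesion_probability > MIN_ADHESION_PROBABILITY
        and not p.similar_to_human
    )


proteins = [
    ProteinPrediction("protA", "outer membrane", 1, 0.71, False),
    ProteinPrediction("protB", "cytoplasm", 0, 0.80, False),       # cytoplasmic
    ProteinPrediction("protC", "secreted", 4, 0.55, False),        # too many helices
    ProteinPrediction("protD", "secreted", 0, 0.30, False),        # low adhesion
    ProteinPrediction("protE", "outer membrane", 2, 0.60, True),   # human-like
]

candidates = [p.name for p in proteins if is_candidate(p)]
print(candidates)
```

In this toy data set only `protA` passes every criterion; each of the other proteins fails exactly one of them.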

Vaxign
Vaxign was the first vaccine design program for reverse vaccinology and vaccine development. It uses both external and internal tools and programs to predict vaccine targets. Users input amino acid sequences from proteins or genomes, and Vaxign predicts subcellular localization, transmembrane domains, adhesion probability, protein conservation among genomes, exclusion of nonpathogenic strains, comparison of pathogen and host proteins, binding to MHC class I and II, and protein function.

Vaxign has two broad methods of vaccine design: "General" and "Specific" methods. Within "General Methods", users can choose to search under Vaxign Query or Dynamic Vaxign Analysis. Vaxign Query allows users to search precomputed results for around 300 genomes. Users can choose genomes for vaccine targets based on desired parameters or protein sequences. In Dynamic Vaxign Analysis, users input protein sequences and set up parameters. This analytical tool uses the automatic Vaxign pipeline, which includes predictions of subcellular localization, adhesion, epitope binding to MHC class I and class II, and similarity to the host genome sequences.

Under "Specific Methods", users have the option of Vaxitop and Vaxign-ML. Vaxitop predicts vaccine epitopes based on reverse vaccinology; specifically, it predicts binding to MHC class I and II. Vaxitop allows users to perform a genome-wide query for different MHC host species. Vaxign-ML uses machine learning to predict vaccine candidates.

EpiVax and iVAX Toolkit
EpiVax is a private company, based in Providence, RI, that uses in silico, in vitro, and in vivo applications to design new vaccines. EpiVax created the iVAX Toolkit, an in silico platform that allows users to identify and predict epitopes for vaccine development.

iVAX is a computational vaccine design program that encompasses epitope mapping, antigen selection, and immunogen design. The toolkit uses immunoinformatics algorithms to identify candidate antigens and select conserved T cell epitopes while eliminating regulatory T cell epitopes. iVAX includes a collection of tools such as Conservatrix, EpiMatrix, ClustiMer, and EpiAssembler. Vaccine design begins with a search for MHC class I and II ligands. EpiMatrix performs this initial search by parsing each input sequence and evaluating it for binding. The program removes low-quality binders to curate personalized predictions. The resulting epitopes can be further analyzed for clusters using ClustiMer, and users can find cross-strain, conserved epitopes using Conservatrix. The toolkit integrates in silico and ex vivo/in vitro technology to allow vaccine developers to assess the toxicity, efficacy, and performance of vaccines.



Reverse Pharmacognosy
Pharmacognosy is a multidisciplinary science that studies the applications of natural compounds. The word is derived from the Greek pharmakon, meaning drug or recipe, and gnosis, meaning knowledge. Pharmacognosy is not limited to the application of natural compounds in therapeutics; it also covers cosmetics, agriculture, and dyes. Conventional pharmacognosy uses traditional knowledge of living organisms to find new bioactive molecules. It begins with the use of ethnopharmacological data to select plants. Once the plants are selected, extracts are made and tested in biological assays. If an extract is biologically active, it is fractionated and retested multiple times to identify the molecules responsible for the activity.

Reverse pharmacognosy attempts to use the knowledge generated from pharmacognosy to find new therapeutic activities of natural products. Molecules are first selected based on criteria such as structure, chemical family, or activity. Next, the selected compounds are used to identify potential targets. Compounds can have a variety of different targets in metabolic pathways, and this information gives insight into potential off-target effects and synergistic applications. "Inverse screening" involves identifying new properties for the selected compounds. Predicted interaction partners can be validated using in vitro binding assays or virtual screening.

Summary of Reverse Pharmacognosy

Selection of Molecules
The first step in reverse pharmacognosy is the selection of natural compounds. Different criteria can be applied depending on what the compound is proposed for: structural criteria, molecules from the same chemical family, compounds with drug-like properties, etc. Natural product databases can also be helpful for compound selection.

Target Identification and Discovery of Activities
The second step in reverse pharmacognosy is identifying the targets that will bind the selected compounds. A ligand can interact with many targets, and these interactions can elicit either negative or favorable effects, so it is important to identify all possible interactions. Researchers at this step commonly use "inverse screening", in which they screen proteins that could potentially bind their molecules. This allows predictions about selectivity and synergy that cannot be achieved through classical docking.

Biological Assays and Organism Associated Activities
While virtual screening is fairly accurate at predicting the biological activities of compounds and their protein partners, these interactions can only be confirmed through in vitro biological assays. In vivo models are then needed to confirm that the biological properties observed in the in vitro experiments are retained in the organism.

Activity Optimization
Derivatives of natural products may be more potent, less toxic, or more accessible than the compounds that were originally probed. Databases of active extracts and metabolites can assist with this optimization.

Reverse Pharmacognosy Tools and Applications
Chemoinformatics

Inverse Screening Tools and Target Databases

Greenpharma's Reverse Pharmacognosy Platform
Greenpharma is a French R&D company, created in 2000, that supplies tailored products and services in the life sciences. It focuses on natural substances, and its platform consists of five components: analytical chemistry, lab-scale extractions, chemoinformatics, organic/bio synthesis, and cosmetic formulations. Greenpharma offers three compound libraries for reverse pharmacognosy needs: the Greenpharma Natural Compound Library (GPNCL), the Greenpharma Ligand Library (LIGENDO), and the Greenpharma Plant Extract Library (GPEL).

The GPNCL is a collection of 150,000 natural compound structures for lead discovery. The library also has access to 30 million compounds from Ambinter. It does not include amino acids, peptides, nucleic acids, or long fatty acid chains. Greenpharma maintains a continuous stock of compounds at >90% purity and provides the physico-chemical properties and phytochemistry of each compound for researchers.

LIGENDO is a library source of natural, pure compounds for chemogenomics and biological pathway hopping. This library is composed of 400 human endogenous ligands. Compounds are given in microplates of 80 and data is supplied in the database with compound name, structure, implied metabolic pathway, physico-chemical properties, and protein partners.

GPEL is a plant extract library that combines botany, pharmacology, and pharmacognosy to present a wide range of possible extracts. The library is suitable for high-throughput screening, with 80 extracts from 20 plants on each microplate. Greenpharma provides four solvent fractions of different polarity. Information on the plant family, genus, species, and organ is also provided.

Selnergy
Selnergy is a virtual high-throughput screening platform that allows users to explore chemogenomic interactions. It contains a database of 10,000 protein structures, sectioned by their biological properties. The platform allows in silico profiling of ligands and ligand-protein interactions, and lets the user predict the selectivity and synergy between candidate compounds and their protein targets.

Potential Use in Traditional Systems of Medicine
Current drug discovery entails identifying drug targets in disease pathology, screening large numbers of chemical compounds to discover drug candidates, and performing biological assays to test for toxicity, potency, and efficacy. This conventional approach is often considered costly and time-consuming. Much of the world relies on traditional systems of medicine (TSM) such as Ayurveda and traditional Chinese medicine (TCM). While these therapies are popular in non-Western countries, the evidence for their therapeutic benefits is seen as incomplete in Western societies. Reverse pharmacology began as a way to study Ayurvedic plants chemically and clinically. The issue with studying Ayurvedic plants was that there was no defined approach for quantifying their benefits. These herbal therapies can be investigated using reverse pharmacology.

Target Identification and Characterization
Proteins that are thought to be critical in pathogenesis are identified. Their structures and predicted functions can be analyzed through bioinformatics, which enables the identification of candidate ligands. Receptor/ligand interactions are often identified through high-throughput screening.

High Throughput Screening
High-throughput screening is a method in drug discovery that allows scientists to conduct large numbers of pharmacological tests rapidly. In reverse pharmacology, this process is used to identify compounds involved in a pathophysiological pathway.



Molecular Docking
Molecular docking is an in silico method used in drug discovery to identify novel compounds of interest. It predicts the binding conformation of small-molecule ligands in their binding site, which allows researchers to predict interactions between a target and potential ligands. Docking was first introduced in the 1970s.

Molecular docking is currently used for the prediction of targets for compounds, prediction of adverse drug reactions, polypharmacology, virtual screening, and drug repositioning. Virtual screening with molecular docking uses large collections of synthesized and designed molecules to find those that fit macromolecular binding sites. The curated molecules are scored based on their binding energies and other parameters.
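
The scoring step described above can be illustrated with a small ranking routine. This is only a sketch under stated assumptions: the ligand names and energies are invented, a single score per ligand stands in for the "binding energies and other parameters", and lower (more negative) energies are assumed to be more favorable.

```python
# Hypothetical docking scores in kcal/mol; more negative = more favorable.
docking_scores = {
    "ligand_A": -7.4,
    "ligand_B": -9.1,
    "ligand_C": -5.2,
    "ligand_D": -8.3,
}


def rank_by_binding_energy(scores, top_n):
    """Sort ligands from most to least favorable energy and keep the top_n."""
    ranked = sorted(scores.items(), key=lambda item: item[1])
    return ranked[:top_n]


hits = rank_by_binding_energy(docking_scores, top_n=2)
print(hits)
```

Real docking programs combine several terms into the score; the sort-and-truncate step is the part this sketch is meant to show.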

Ligand-based approaches are used to identify suitable protein conformations for docking screens. They can also be used to confirm docking predictions: researchers can compare the predicted binding conformation with the experimental conformation observed when the ligand is crystallized with the protein. Molecular dynamics (MD) and binding free energy estimations are both structure-based approaches and are often used in combination. Molecular dynamics can be used to evaluate residue flexibility and conformational changes, and to determine the stability of different protein conformations. The use of artificial intelligence and statistical methods is new in the molecular docking pipeline. These methods can draw on publicly available structural, chemical, and activity information on compounds to make better predictions.

DOCK
DOCK was the first docking software, created in the 1980s. DOCK uses an algorithm that searches for potential binding modes. It can superimpose the ligand onto the binding pocket and find the lowest-energy binding conformation. In newer versions, the algorithm can perform both rigid and flexible docking.

MORDOR
MOlecular Recognition with a Driven dynamics OptimizeR (MORDOR) is docking software designed for accurate docking predictions. In experimental observations, a ligand can affect the conformation of its receptor. MORDOR takes this into consideration and allows for induced fit: the receptor moves with the ligand during the simulation. The ligand is also able to explore the different possible binding pockets. DOCK is first used to perform rigid docking and reduce the size of the library, and MORDOR is then run on the smaller, curated library. MORDOR moves a "dummy sphere" along the receptor to identify different binding pockets. The result is an energy map with all known binding pockets.

AMBER
Assisted Model Building with Energy Refinement (AMBER) was introduced in the early 1980s by Peter Kollman's group at the University of California, San Francisco. AMBER refers both to a family of "force fields" for simulating biomolecules and to the software package that implements these force fields for molecular dynamics. "Force fields" are the parameters for the bonds, angles, dihedrals, and atom types within the system. The suite allows users to run full molecular dynamics simulations with explicit water or generalized Born solvent models.

Reverse Docking
Reverse docking, or inverse docking, is an in silico method for finding protein targets of a specific ligand. Much like regular docking methods, ligand-target conformations are scored and ranked based on preset parameters. Large, properly constructed databases of target structures need to be created, and these databases also need to define the binding sites for each protein. There are multiple reverse docking tools that use a variety of databases and can be used to identify targets and potential off-target interactions. The effectiveness of reverse docking is limited by high computational time and the lack of databases of target structures.

INVDOCK
INVDOCK was the first reverse docking tool, created in 2001. This software aligns ligands to the binding site and binding conformations are analyzed using energy minimization. INVDOCK can be used to identify unknown and secondary targets of drugs, leads, etc. It can also predict the ADME of targets. One of the limitations of INVDOCK is the lack of optimization; users input a threshold binding energy and once the ligand is positioned successfully, within the binding energy threshold, the program moves on. There is no optimization for multiple low energy conformations.
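
The limitation described above can be made concrete by comparing a "first pose under the threshold" search with an exhaustive search for the lowest-energy pose. This sketch does not reproduce INVDOCK's implementation; the function names, the pose energies, and the threshold are hypothetical, and energies are assumed to be in kcal/mol with lower values more favorable.

```python
from typing import Optional, Sequence, Tuple


def first_pose_under_threshold(
    pose_energies: Sequence[float], threshold: float
) -> Optional[Tuple[int, float]]:
    """Stop at the first pose whose energy is at or below the threshold."""
    for index, energy in enumerate(pose_energies):
        if energy <= threshold:
            return index, energy
    return None  # no pose satisfied the threshold


def lowest_energy_pose(pose_energies: Sequence[float]) -> Tuple[int, float]:
    """Examine every pose and return the one with the lowest energy."""
    index = min(range(len(pose_energies)), key=lambda i: pose_energies[i])
    return index, pose_energies[index]


# Hypothetical energies for four poses of one ligand in one binding site.
energies = [-3.1, -6.2, -9.8, -7.0]

accepted = first_pose_under_threshold(energies, threshold=-6.0)
optimal = lowest_energy_pose(energies)
print(accepted, optimal)
```

With these numbers the threshold search accepts the second pose (-6.2) and never evaluates the third pose (-9.8), which is the lower-energy conformation.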

TarFisDock
TarFisDock, first developed in 2006, is a tool that ranks targets using an in-house database, the Potential Drug Target Database (PDTD). The tool calculates the binding energy between targets and their ligands. Users input the small molecule to be tested, and TarFisDock searches for proteins using docking techniques. Viable targets are usually contained in the top 2, 5, or 10% of its rankings.
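
Selecting the top 2, 5, or 10% of a ranked target list can be sketched as follows. The target names and energies are invented for illustration, the helper functions are hypothetical rather than part of TarFisDock, and rounding up to at least one target is a design choice made here, not something the text specifies.

```python
import math

# Hypothetical binding energies for 20 targets (kcal/mol, lower = better).
binding_energies = {f"target_{i:02d}": -0.5 * i for i in range(1, 21)}


def rank_targets(energies):
    """Return target names ordered from most to least favorable energy."""
    return sorted(energies, key=energies.get)


def top_percent(ranked, percent):
    """Keep the top `percent` of a ranked list, rounding up to at least one."""
    count = max(1, math.ceil(len(ranked) * percent / 100))
    return ranked[:count]


ranked_targets = rank_targets(binding_energies)
for percent in (2, 5, 10):
    print(percent, top_percent(ranked_targets, percent))
```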

idTarget
idTarget is a web-based docking tool that allows multiple binding sites of a protein to be identified. The tool uses all of the protein structures within the Protein Data Bank (PDB). It is also able to determine the off-targets of compounds.

Challenges with Molecular Docking
Molecular docking is a powerful tool for visualizing ligand-target interactions. However, it does not always predict binding modes correctly, because its algorithms rely on estimations. The flexibility of binding sites and molecules can produce false binding sites that do not exist in experimental observation. Accurate prediction of binding is limited by the size of both the ligand and the receptor and by how much is known about the receptor. There are also problems with the scoring functions of virtual docking platforms, which are commonly based on estimates of binding affinity. Entropy, atom randomization, and binding flexibility are the most challenging issues in predicting binding affinity.

Orphan Receptors
Orphan receptors are protein receptors whose activating ligands are unknown. Though the ligands for these receptors are unknown, their structures are often similar to those of already identified receptors. Orphan receptors generally belong to one of two distinct receptor families: nuclear receptors and G protein-coupled receptors (GPCRs). Nuclear receptors are cytosolic proteins that act as transcription factors once activated. GPCRs are seven-transmembrane receptors that activate G proteins, which transduce signals to downstream effector proteins. There are more than 700 genes that code for GPCRs. It is thought that, of these 700 genes, around half code for sensory receptors and the remainder may be viable as potential drug targets. Currently, ligands have been identified for more than 200 of these GPCRs, leaving around 150 receptors orphaned. It is important to develop a suitable assay to successfully identify ligands.
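
The estimate of around 150 orphan GPCRs follows from the figures quoted above. The short calculation below simply restates that arithmetic; the inputs are the rounded values from the text ("more than 700", "around half", "more than 200"), so the result is approximate.

```python
# Rounded figures taken from the paragraph above.
total_gpcr_genes = 700
sensory_receptors = total_gpcr_genes // 2                 # "around half"
potential_drug_targets = total_gpcr_genes - sensory_receptors
receptors_with_known_ligands = 200

orphan_receptors = potential_drug_targets - receptors_with_known_ligands
print(potential_drug_targets, orphan_receptors)
```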

Currently, receptor sequences can be cloned and their structural information determined. These structures can be compared to those of GPCRs with known ligands; receptor activity can be predicted if an orphan receptor has significant homology to a receptor with known ligands. Further identification is done using functional assays. Commonly, the cDNA of the GPCR of interest is expressed in relevant cell lines and used as bait to determine endogenous ligands.
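
One simple way to quantify the similarity between an orphan receptor and receptors with known ligands is percent sequence identity. The sketch below assumes the sequences have already been aligned to the same length; the sequence fragments and receptor names are invented, and no cutoff for "significant" homology is implied, since the text does not give one.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions with identical residues (gaps never match)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    matches = sum(a == b and a != "-" for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Hypothetical pre-aligned fragments; not real receptor sequences.
orphan = "MNGTEGPNFY"
known_receptors = {
    "receptor_X": "MNGTEGLNFY",
    "receptor_Y": "MDVLSPGQGN",
}

identities = {
    name: percent_identity(orphan, seq) for name, seq in known_receptors.items()
}
closest = max(identities, key=identities.get)
print(identities, closest)
```

In practice homology is assessed with proper alignment tools and substitution matrices; this example only shows the comparison step.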

The first GPCRs to be deorphanized were the serotonin 1A receptor (5-HT1A) and the D2 dopamine receptor. It is thought that there are more GPCRs than known potential ligands, so some of these receptors may bind ligands that have already been characterized.

Orphan GPCRs are categorized into three classes: Class A, Class B (adhesion GPCRs), and Class C. Class A orphans have preliminary published evidence for an endogenous ligand and a link to a disease. Class B, or adhesion, GPCRs are identified based on their extracellular region; the N terminus shares homology with adhesive domains. Class C orphans do not have a known endogenous ligand.

Bioluminescence Resonance Energy Transfer (BRET)
Developing assays is important for "deorphanizing" these receptors. β-arrestin has been used to deorphanize GPCRs, because measuring the recruitment of β-arrestin can provide insight into the internalization of GPCRs. Once the GPCR becomes activated and phosphorylated, β-arrestin is recruited to the cell surface. Researchers have created a chimeric β-arrestin attached to green fluorescent protein (GFP) so that it can be tracked throughout the cell. Activation of a GPCR is indicated by compartmentalization of the chimeric β-arrestin. This technique has been used for the deorphanization of Drosophila neuropeptide receptors.

Adaptation of Pheromone Receptor Pathway
This assay is useful for identifying the specificity of G protein coupling to the orphan receptor. Chimeric G proteins are used to link the receptor, through the Saccharomyces cerevisiae G protein Gpa1, to signaling through the MAP kinase cascade. The assay gives a low background signal because of the lack of endogenous GPCR activity.

Bead-Based Screening with Xenopus melanophores
Peptide receptors are receptors that bind multiple peptides or signaling proteins. Many of these receptors have been implicated in diseases. Using combinatorial chemistry, it may be possible to identify GPCRs that respond to peptide ligands. Beads carrying potential ligand peptides are first pooled so that every peptide in a pool has the same first amino acid. "Lawns" of melanophores are transfected with sets of multiple peptide receptors; these sets include both receptors with known ligands and orphan receptors. The beads are spread on the "lawn", and GPCR activation is determined through pigment translocation, which is dependent on G protein signaling. The peptides on the active beads can then be sequenced for further assays.
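
The pooling step, in which all peptides in a pool share the same first amino acid, corresponds to grouping sequences by their first residue. The following sketch illustrates only that grouping; the peptide sequences are invented and the function name is hypothetical.

```python
from collections import defaultdict


def pool_by_first_residue(peptides):
    """Group peptide sequences into pools keyed by their first amino acid."""
    pools = defaultdict(list)
    for peptide in peptides:
        pools[peptide[0]].append(peptide)
    return dict(pools)


# Hypothetical peptide library (one-letter amino acid codes).
library = ["AGKL", "ARTW", "GHLM", "GAAK", "KLMN"]

pools = pool_by_first_residue(library)
print(pools)
```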

Pharmacochaperone Screening Assay
A pharmacochaperone is a drug that acts as a chaperone for a protein. Chaperones assist with the folding and unfolding, and the assembly and disassembly, of macromolecular structures. Pharmacochaperones correct the folding of misfolded and unfolded proteins, allowing them to be correctly routed within the cell. The pharmacochaperone screening assay exploits pharmacochaperones, small molecules that allow receptors to be trafficked to the plasma membrane, to identify ligands. A point mutation is introduced that causes the receptor to be retained in the endoplasmic reticulum. As a result, molecules that act as chaperones will bring the receptor to the cell surface. The assay is coupled with a beta-galactosidase reporter system to identify these molecules.

Identification of Orexin
Orexins are neuropeptides that play an important role in energy homeostasis and regulate sleep and wakefulness. Takeshi Sakurai and colleagues identified orexin-A and orexin-B, two endogenous ligands for two orphan GPCRs. The lab identified orexin peptides that were able to activate the orexin-1 receptor (OX1R), which has high affinity for orexin-A. The second receptor, orexin-2 (OX2R), binds both orexin-A and orexin-B with the same affinity. They used 50 transfectant cell lines expressing orphan GPCR cDNAs. Cells were challenged with high-performance liquid chromatography (HPLC) fractions and monitored for signal transduction.