Process chemistry

Process chemistry is the arm of pharmaceutical chemistry concerned with the development and optimization of a synthetic scheme and pilot plant procedure to manufacture compounds for the drug development phase. Process chemistry is distinguished from medicinal chemistry, which is the arm of pharmaceutical chemistry tasked with designing and synthesizing molecules on small scale in the early drug discovery phase.

Medicinal chemists are largely concerned with synthesizing a large number of compounds as quickly as possible from easily tunable chemical building blocks (usually for SAR studies). In general, the repertoire of reactions used in discovery chemistry is somewhat narrow (for example, the Buchwald-Hartwig amination, Suzuki coupling and reductive amination are commonplace reactions). In contrast, process chemists are tasked with identifying a chemical process that is safe, cost- and labor-efficient, “green,” and reproducible, among other considerations. Often, in searching for the shortest, most efficient synthetic route, process chemists must devise creative synthetic solutions that eliminate costly functional group manipulations and oxidation/reduction steps.

This article focuses exclusively on the chemical and manufacturing processes associated with the production of small molecule drugs. Biological medical products (more commonly called “biologics”) represent a growing proportion of approved therapies, but the manufacturing processes of these products are beyond the scope of this article. Additionally, the many complex factors associated with chemical plant engineering (for example, heat transfer and reactor design) and drug formulation will be treated cursorily.

Considerations in process chemistry
Cost efficiency is of paramount importance in process chemistry and, consequently, is a focus in the consideration of pilot plant synthetic routes. The drug substance that is manufactured, prior to the formulation, is commonly referred to as the active pharmaceutical ingredient (API) and will be referred to as such herein. API production cost can be broken into two components: the “material cost” and the “conversion cost.” The ecological and environmental impact of a synthetic process should also be evaluated by an appropriate metric (e.g. the EcoScale).

An ideal process chemical route will score well in each of these metrics, but inevitably tradeoffs are to be expected. Most large pharmaceutical process chemistry and manufacturing divisions have devised weighted quantitative schemes to measure the overall attractiveness of a given synthetic route over another. As cost is a major driver, material cost and volume-time output are typically weighted heavily.

Material cost
The material cost of a chemical process is the sum of the costs of all raw materials, intermediates, reagents, solvents, and catalysts procured from external vendors. Material costs may influence the selection of one synthetic route over another or the decision to outsource production of an intermediate.

Conversion cost
The conversion cost of a chemical process is a function of that procedure's overall efficiency, both in materials and time, and its reproducibility. The efficiency of a chemical process can be quantified by its atom economy, yield, volume-time output, and environmental factor (E-factor), and its reproducibility can be evaluated by the Quality Service Level (QSL) and Process Excellence Index (PEI) metrics.

Atom economy
The atom economy of a reaction is defined as the fraction of the total molecular weight of the starting materials that is incorporated into the final product. Atom economy can be viewed as an indicator of the “efficiency” of a given synthetic route.


 * $$AE = \frac{\text{MW(product)}}{\sum \text{MW(raw materials)}}\times 100\%$$

The Claisen rearrangement and the Diels-Alder cycloaddition are examples of reactions that are 100 percent atom economical. On the other hand, a prototypical Wittig reaction has an especially poor atom economy (merely 20 percent in one representative example).

Process synthetic routes should be designed such that atom economy is maximized for the entire synthetic scheme. Consequently, “costly” reagents such as protecting groups and high molecular weight leaving groups should be avoided where possible. An atom economy value in the range of 70 to 90 percent for an API synthesis is ideal, but it may be impractical or impossible to access certain complex targets within this range. Nevertheless, atom economy is a good metric to compare two routes to the same molecule.
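The atom economy formula above can be illustrated with a minimal Python sketch. The function name `atom_economy` and the two example reactions are illustrative assumptions, not taken from a specific process; the molecular weights are rounded textbook values, and the Wittig figure depends strongly on the substrates chosen.

```python
def atom_economy(product_mw, reactant_mws):
    """Atom economy in percent: MW(product) / sum of MW(raw materials) x 100."""
    total = sum(reactant_mws)
    if total <= 0:
        raise ValueError("reactant molecular weights must sum to a positive value")
    return product_mw / total * 100.0


# Diels-Alder: 1,3-butadiene (54.09) + ethylene (28.05) -> cyclohexene (82.14).
# Every atom of both reactants ends up in the product.
diels_alder = atom_economy(82.14, [54.09, 28.05])

# Wittig (illustrative substrates): cyclohexanone (98.14) + Ph3P=CH2 (276.31)
# -> methylenecyclohexane (96.17). The triphenylphosphine oxide byproduct
# carries away most of the reactant mass.
wittig = atom_economy(96.17, [98.14, 276.31])

print(f"Diels-Alder atom economy: {diels_alder:.0f}%")
print(f"Wittig atom economy:      {wittig:.0f}%")
```

For these particular substrates the Wittig olefination comes out at roughly a quarter of the reactant mass; a heavier ylide or a lighter product would lower the figure further.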

Yield
Yield is defined as the amount of product obtained in a chemical reaction, usually expressed as a percentage of the theoretical maximum. The yield of practical significance in process chemistry is the isolated yield: the yield of the isolated product after all purification steps. In a final API synthesis, isolated yields of 80 percent or above for each synthetic step are expected. What counts as an acceptable yield depends on the importance of the product and on how efficiently the available technologies can be applied; yields approaching 100% are termed quantitative, and yields above 90% are broadly understood as excellent.

There are several strategies that are employed in the design of a process route to ensure the adequate overall yield of the pharmaceutical product. The first is the concept of convergent synthesis. Assuming a very good to excellent yield in each synthetic step, the overall yield of a multistep reaction can be maximized by combining several key intermediates at a late stage that are prepared independently from each other.
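The benefit of convergence can be seen with a short calculation. The sketch below assumes a hypothetical nine-step synthesis with a uniform 90 percent yield per step, and follows the common convention of quoting the overall yield along the longest linear sequence; the step counts and yields are illustrative only.

```python
from math import prod


def overall_yield(step_yields):
    """Overall fractional yield of a linear sequence of steps."""
    return prod(step_yields)


STEP_YIELD = 0.90  # assumed uniform yield per step (hypothetical)

# Fully linear route: nine consecutive steps.
linear = overall_yield([STEP_YIELD] * 9)

# Convergent route with the same nine steps in total: two fragments made in
# four steps each, then joined in one coupling step. The longest linear
# sequence is therefore 4 + 1 = 5 steps.
convergent = overall_yield([STEP_YIELD] * 5)

print(f"linear route:     {linear:.1%}")
print(f"convergent route: {convergent:.1%}")
```

Under these assumptions the convergent route retains noticeably more material along its longest sequence, even though the total number of reactions is unchanged.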

Another strategy to maximize isolated yield (as well as time efficiency) is the concept of telescoping synthesis (also called one-pot synthesis). This approach describes the process of eliminating workup and purification steps from a reaction sequence, typically by simply adding reagents sequentially to a reactor. In this way, unnecessary losses from these steps can be avoided.

Finally, to minimize overall cost, synthetic steps involving expensive reagents, solvents, or catalysts should be designed into the process route as late stage as possible, to minimize the amount of reagent used.
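The reasoning behind placing expensive steps late can be sketched numerically: material that enters an early step must also cover the losses of every later step. The function name, the six-step route, and the 85 percent step yield below are hypothetical, and the sketch counts only substrate throughput on a molar basis.

```python
from math import prod


def substrate_per_mol_api(step_yields, step_index):
    """Moles of substrate that must enter step `step_index` (0-based)
    to deliver one mole of final API, given fractional yields per step."""
    return 1.0 / prod(step_yields[step_index:])


yields = [0.85] * 6  # hypothetical six-step route, 85% yield per step

early = substrate_per_mol_api(yields, 0)  # costly reagent used in step 1
late = substrate_per_mol_api(yields, 5)   # costly reagent used in step 6

print(f"substrate processed if used in first step: {early:.2f} mol per mol API")
print(f"substrate processed if used in last step:  {late:.2f} mol per mol API")
```

Because a reagent or catalyst is charged in proportion to the substrate it treats, the same costly reagent is consumed in larger quantity when it appears early in the route.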

In a pilot plant or manufacturing plant setting, yield can have a profound effect on the material cost of an API synthesis, so the careful planning of a robust route and the fine-tuning of reaction conditions are crucially important. After a synthetic route has been selected, process chemists will subject each step to exhaustive optimization in order to maximize the overall yield. Low yields are typically indicative of unwanted side product formation, which can raise red flags in the regulatory process as well as pose challenges for reactor cleaning operations.

Volume-time output
The volume-time output (VTO) of a chemical process represents the cost of occupancy of a chemical reactor for a particular process or API synthesis. For example, a high VTO indicates that a particular synthetic step is costly in terms of “reactor hours” used for a given output. Mathematically, the VTO for a particular process is calculated as the total volume of all occupied reactors (in m³) multiplied by the hours per batch and divided by the output of that batch of API or intermediate (in kg).


 * $$\text{VTO}=\frac{\text{total volume of all reactors (m}^{3}\text{)}\times\text{time per batch (h)}}{\text{output per batch (kg)}}$$

The process chemistry group at Boehringer Ingelheim, for example, targets a VTO of less than 1 m³·h/kg for any given synthetic step or chemical process.

Additionally, the raw conversion cost of an API synthesis (in dollars per batch) can be calculated from the VTO, given the operating cost and usable capacity of a particular reactor. Oftentimes, for large-volume APIs, it is economical to build a dedicated production plant rather than to use space in general pilot plants or manufacturing plants.
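One simple way to relate VTO to conversion cost is to multiply it by an hourly occupancy rate per cubic meter of reactor. The sketch below is an assumption-laden illustration: the rate of 50 dollars per m³·h, the batch figures, and the function names are all hypothetical, and real cost models also account for usable capacity, labor, and utilities.

```python
def vto(reactor_volume_m3, hours_per_batch, output_kg):
    """Volume-time output in m^3*h/kg."""
    return reactor_volume_m3 * hours_per_batch / output_kg


def conversion_cost_per_kg(vto_value, rate_per_m3_hour):
    """Occupancy cost per kg of product for an assumed reactor rate."""
    return vto_value * rate_per_m3_hour


# Hypothetical batch: 4 m^3 reactor, 24 h per batch, 120 kg isolated.
batch_vto = vto(4.0, 24.0, 120.0)

# Hypothetical occupancy rate in dollars per m^3 per hour.
RATE = 50.0
cost_per_kg = conversion_cost_per_kg(batch_vto, RATE)
cost_per_batch = cost_per_kg * 120.0

print(f"VTO:            {batch_vto:.2f} m^3*h/kg")
print(f"cost per kg:    ${cost_per_kg:.2f}")
print(f"cost per batch: ${cost_per_batch:.2f}")
```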

Environmental factor (E-factor) and process mass intensity (PMI)
Both of these measures capture the environmental impact of a synthetic route and are intended to reflect the significant and rising cost of waste disposal in the manufacturing process. The E-factor for an entire API process is computed as the ratio of the total mass of waste generated in the synthetic scheme to the mass of product isolated.


 * $$E=\frac{\sum \text{mass of waste}}{\text{mass of isolated product}}=\frac{\sum \text{mass of materials}-\text{mass of isolated product}}{\text{mass of isolated product}}$$

A similar measure, the process mass intensity (PMI), is the ratio of the total mass of materials to the mass of the isolated product.


 * $$\text{PMI}=\frac{\sum \text{mass of materials}}{\text{mass of isolated product}} = E +1$$

For both metrics, all materials used in all synthetic steps, including reaction and workup solvents, reagents, and catalysts, are counted, even if solvents or catalysts are recycled in practice. Inconsistencies in E-factor or PMI computations may arise when choosing to consider the waste associated with the synthesis of outsourced intermediates or common reagents. Additionally, the environmental impact of the generated waste is ignored in this calculation; therefore, the environmental quotient (EQ) metric was devised, which multiplies the E-factor by an “unfriendliness quotient” associated with various waste streams. A reasonable target for the E-factor or PMI of a single synthetic step is any value between 10 and 40.
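The two mass-based metrics, and the relationship PMI = E + 1, can be checked with a small Python sketch. The material masses and the unfriendliness quotient `Q` below are made-up values for a single hypothetical step; only the formulas come from the definitions above.

```python
def pmi(material_masses_kg, product_kg):
    """Process mass intensity: total mass of materials / mass of product."""
    return sum(material_masses_kg.values()) / product_kg


def e_factor(material_masses_kg, product_kg):
    """E-factor: (total mass of materials - mass of product) / mass of product."""
    waste_kg = sum(material_masses_kg.values()) - product_kg
    return waste_kg / product_kg


def environmental_quotient(e_value, unfriendliness_q):
    """EQ: E-factor weighted by an unfriendliness quotient Q."""
    return e_value * unfriendliness_q


# Hypothetical inputs for one synthetic step (all masses in kg).
step_inputs_kg = {
    "starting material": 12.0,
    "reagent": 8.0,
    "catalyst": 0.5,
    "reaction solvent": 150.0,
    "workup solvent": 79.5,
}
product_kg = 10.0

step_pmi = pmi(step_inputs_kg, product_kg)
step_e = e_factor(step_inputs_kg, product_kg)
step_eq = environmental_quotient(step_e, unfriendliness_q=2.0)  # Q is assumed

print(f"PMI = {step_pmi:.1f}, E-factor = {step_e:.1f}, EQ = {step_eq:.1f}")
```

Note that solvents dominate the total in this example, which is typical of how solvent use drives both metrics even when the reaction itself is efficient.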

Quality service level (QSL)
The final two "conversion cost" considerations involve the reproducibility of a given reaction or API synthesis route. The quality service level (QSL) is a measure of the reproducibility of the quality of the isolated intermediate or final API. While the details of computing this value are slightly nuanced and unimportant for the purposes of this article, in essence, the calculation involves the ratio of satisfactory quality batches to the total number of batches. A reasonable QSL target is 98 to 100 percent.

Process excellence index (PEI)
Like the QSL, the process excellence index (PEI) is a measure of process reproducibility. Here, however, the robustness of the procedure is evaluated in terms of yield and cycle time of various operations. The PEI yield is defined as follows:


 * $$\text{PEI yield}=\frac{\text{average yield}\cdot 100\%}{\text{aspiration level yield}}=\frac{\text{average yield}\cdot 100\%}{\frac{\text{median yield}+\text{best yield}}{2}}$$

In practice, if a process is high-yielding and has a narrow distribution of yield outcomes, then the PEI should be very high. Processes that are not easily reproducible may have a higher aspiration level yield and a lower average yield, lowering the PEI yield.

Similarly, a PEI cycle time may be defined as follows:


 * $$\text{PEI cycle time}=\frac{\text{aspiration level cycle time}\cdot 100\%}{\text{average cycle time}}=\frac{\frac{\text{median cycle time}+\text{best cycle time}}{2}\cdot 100\%}{\text{average cycle time}}$$

For this expression, the terms are inverted to reflect the desirability of shorter cycle times (as opposed to higher yields). The reproducibility of cycle times for critical processes such as reaction, centrifugation, or drying may be critical if these operations are rate-limiting in the manufacturing plant setting. For example, if an isolation step is particularly difficult or slow, it could become the bottleneck for API synthesis, in which case the reproducibility and optimization of that operation become critical.

For an API manufacturing process, all PEI metrics (yield and cycle times) should be targeted at 98 to 100 percent.
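Both PEI expressions can be sketched in a few lines of Python. The batch data are hypothetical, and the sketch assumes that the "best" yield is the highest observed and the "best" cycle time is the shortest observed.

```python
from statistics import mean, median


def pei_yield(yields):
    """PEI yield (%) = average yield / aspiration level yield x 100,
    where the aspiration level is the mean of the median and best yield."""
    aspiration = (median(yields) + max(yields)) / 2
    return mean(yields) * 100.0 / aspiration


def pei_cycle_time(cycle_times):
    """PEI cycle time (%) = aspiration level cycle time / average x 100,
    where the best cycle time is taken to be the shortest one."""
    aspiration = (median(cycle_times) + min(cycle_times)) / 2
    return aspiration * 100.0 / mean(cycle_times)


# Hypothetical batch yields (percent) for a robust and an erratic process.
robust_yields = [91, 92, 92, 93, 92]
erratic_yields = [70, 92, 85, 93, 60]

# Hypothetical cycle times (hours) for one unit operation.
drying_hours = [10, 11, 10, 12, 10]

robust_pei = pei_yield(robust_yields)
erratic_pei = pei_yield(erratic_yields)
drying_pei = pei_cycle_time(drying_hours)

print(f"PEI yield, robust process:  {robust_pei:.1f}%")
print(f"PEI yield, erratic process: {erratic_pei:.1f}%")
print(f"PEI cycle time, drying:     {drying_pei:.1f}%")
```

The erratic process scores lower because its occasional good batches raise the aspiration level while poor batches pull the average down.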

EcoScale
In 2006, Van Aken et al. developed a quantitative framework to evaluate the safety and ecological impact of a chemical process, with minor weighting given to practical and economic considerations. Others have modified this EcoScale by adding, subtracting and adjusting the weighting of various metrics. Among other factors, the EcoScale takes into account the toxicity, flammability, and explosiveness of the reagents used, any nonstandard or potentially hazardous reaction conditions (for example, elevated pressure or inert atmosphere), and the reaction temperature. Some EcoScale criteria are redundant with previously considered criteria (e.g. E-factor).

Boehringer Ingelheim HCV protease inhibitor (BI 201302)
Macrocyclization is a recurrent challenge for process chemists, and large pharmaceutical companies have necessarily developed creative strategies to overcome its inherent limitations. An interesting case study in this area involves the development of novel NS3 protease inhibitors to treat hepatitis C patients by scientists at Boehringer Ingelheim. The process chemistry team at BI was tasked with developing a cheaper and more efficient route to the active NS3 inhibitor BI 201302, a close analog of BILN 2061. Two significant shortcomings were immediately identified with the initial scale-up route to BILN 2061, depicted in the scheme below. First, the macrocyclization step posed four challenges inherent to the ring-closing metathesis (RCM) reaction.


 * 1) High dilution is typically necessary to prevent unwanted dimerization and oligomerization of the diene starting material. In a pilot plant setting, however, a high dilution factor translates into lower throughput, higher solvent costs and higher waste costs.
 * 2) High catalyst loading was found to be necessary to drive the RCM reaction to completion. Because of the high licensing costs of the ruthenium catalyst that was used (1st generation Hoveyda catalyst), a high catalyst loading was financially prohibitive. Recycling of the catalyst was explored but proved impractical.
 * 3) Long reaction times were necessary for reaction completion, due to the slow kinetics of the reaction using the selected catalyst. It was hypothesized that this limitation could be overcome using a more active catalyst. However, while the second-generation Hoveyda and Grubbs catalysts were kinetically more active than the first-generation catalyst, reactions using these catalysts formed large amounts of dimeric and oligomeric products.
 * 4) The ring-closing metathesis conditions carry a risk of epimerization. The process chemistry group at Boehringer Ingelheim performed extensive mechanistic studies showing that epimerization most likely occurs through a ruthenacyclopentene intermediate. Furthermore, the Hoveyda catalyst employed in this scheme minimizes epimerization risk compared with the analogous Grubbs catalyst.

Additionally, the final double SN2 sequence to install the quinoline heterocycle was identified as a secondary inefficiency in the synthetic route.



Analysis of the ring-closing metathesis reaction revealed that the conformation of the acyclic precursor had a profound impact on the formation of dimers and oligomers in the reaction mixture. By installing a Boc protecting group at the C-4 amide nitrogen, the Boehringer Ingelheim chemists were able to shift the site of initiation from the vinylcyclopropane moiety to the nonenoic acid moiety, improving the rate of the intramolecular reaction and decreasing the risk of epimerization. Additionally, the catalyst employed was switched from the expensive 1st generation Hoveyda catalyst to the more reactive, less expensive Grela catalyst. These modifications allowed the process chemists to run the reaction at a standard reaction dilution of 0.1-0.2 M, given that the rates of the competing dimerization and oligomerization reactions were so dramatically reduced.

Additionally, the process chemistry team envisioned an SNAr strategy to install the quinoline heterocycle, instead of the SN2 strategy that they had employed for the synthesis of BILN 2061. This modification avoided the need for an inefficient double inversion by proceeding with retention of stereochemistry at the C-4 position of the hydroxyproline moiety.



It is interesting to examine this case study from a VTO perspective. For the unoptimized ring-closing metathesis reaction using the Grela catalyst at 0.01 M diene, the reaction yield was determined to be 82 percent after a reaction and workup time of 48 hours. A 6-cubic meter reactor filled to 80% capacity afforded 35 kg of the desired product. For the unoptimized reaction:


 * $$\text{VTO} = \frac{6\text{ m}^{3} \times 48\text{ h}}{35\text{ kg}} = 8.2\text{ m}^{3} \cdot \text{ h / kg}$$

This VTO value was considered prohibitively high and a steep investment in a dedicated plant would have been necessary even before launching Phase III trials with this API, given its large projected annual demand. But after reaction development and optimization, the process team was able to improve the reaction yield to 93 percent after just 1 hour (plus 12 hours for workup and reactor cleaning time) at a diene concentration of 0.2 M. With these modifications, a 6-cubic meter reactor filled to 80% capacity afforded 799 kg of the desired product. For this optimized reaction:


 * $$\text{VTO} = \frac{6\text{ m}^{3}\times 13\text{ h}}{799\text{ kg}} = 0.1\text{ m}^{3} \cdot \text{ h / kg}$$

Thus, after optimization, this synthetic step became less costly in terms of equipment and time and more practical to perform in a standard manufacturing facility, eliminating the need for costly investment in a new dedicated plant.
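The two VTO values in this case study can be reproduced directly from the figures quoted above (reactor volume, batch time, and batch output). The helper name `vto` is illustrative; the numbers are those given in the text.

```python
def vto(reactor_volume_m3, hours_per_batch, output_kg):
    """Volume-time output in m^3*h/kg."""
    return reactor_volume_m3 * hours_per_batch / output_kg


# Unoptimized step: 6 m^3 reactor, 48 h, 35 kg of product.
unoptimized = vto(6.0, 48.0, 35.0)

# Optimized step: 6 m^3 reactor, 1 h reaction + 12 h workup/cleaning, 799 kg.
optimized = vto(6.0, 1.0 + 12.0, 799.0)

improvement = unoptimized / optimized

print(f"unoptimized VTO: {unoptimized:.1f} m^3*h/kg")
print(f"optimized VTO:   {optimized:.1f} m^3*h/kg")
print(f"reactor occupancy per kg reduced about {improvement:.0f}-fold")
```

Most of the gain comes from the twentyfold increase in concentration and the shorter batch time rather than from the improvement in yield.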

Biocatalysis and enzymatic engineering
Recently, process chemists at large pharmaceutical companies have relied heavily on the development of enzymatic reactions to produce important chiral building blocks for API synthesis. Many varied classes of naturally occurring enzymes have been co-opted and engineered for process pharmaceutical chemistry applications. The widest range of applications comes from ketoreductases and transaminases, but there are isolated examples from hydrolases, aldolases, oxidative enzymes, esterases and dehalogenases, among others.

One of the most prominent uses of biocatalysis in process chemistry today is in the synthesis of Januvia®, a DPP-4 inhibitor developed by Merck for the management of type 2 diabetes. The traditional process synthetic route involved a late-stage enamine formation followed by rhodium-catalyzed asymmetric hydrogenation to afford the API sitagliptin. This process suffered from a number of limitations, including the need to run the reaction under a high-pressure hydrogen environment, the high cost of a transition-metal catalyst, the difficult process of carbon treatment to remove trace amounts of catalyst, and insufficient stereoselectivity, which required a subsequent recrystallization step before final salt formation.



Merck's process chemistry department contracted Codexis, a medium-sized biocatalysis firm, to develop a large-scale biocatalytic reductive amination for the final step of its sitagliptin synthesis. Codexis engineered a transaminase enzyme from the bacterium Arthrobacter through 11 rounds of directed evolution. The engineered transaminase contained 27 individual point mutations and displayed activity four orders of magnitude greater than the parent enzyme. Additionally, the enzyme was engineered to handle high substrate concentrations (100 g/L) and to tolerate the organic solvents, reagents and byproducts of the transamination reaction. This biocatalytic route successfully avoided the limitations of the chemocatalyzed hydrogenation route: the requirements to run the reaction under high pressure, to remove excess catalyst by carbon treatment and to recrystallize the product due to insufficient enantioselectivity were obviated by the use of a biocatalyst. Merck and Codexis were awarded the Presidential Green Chemistry Challenge Award in 2010 for the development of this biocatalytic route toward Januvia®.

Continuous/flow manufacturing
In recent years, much progress has been made in the development and optimization of flow reactors for small-scale chemical synthesis (the Jamison Group at MIT and Ley Group at Cambridge University, among others, have pioneered efforts in this field). The pharmaceutical industry, however, has been slow to adopt this technology for large-scale synthetic operations. For certain reactions, though, continuous processing may possess distinct advantages over batch processing in terms of safety, quality, and throughput.

A case study of particular interest involves the development of a fully continuous process by the process chemistry group at Eli Lilly and Company for an asymmetric hydrogenation that provides a key intermediate in the synthesis of LY500307. LY500307 is a potent ERβ agonist that is entering clinical trials for the treatment of patients with schizophrenia, as an addition to a regimen of standard antipsychotic medications. In this key synthetic step, a chiral rhodium catalyst is used for the enantioselective reduction of a tetrasubstituted olefin. After extensive optimization, it was found that in order to reduce the catalyst loading to a commercially practical level, the reaction required hydrogen pressure up to 70 atm. The pressure limit of a standard chemical reactor is about 10 atm, although high-pressure batch reactors may be acquired at significant capital cost for reactions up to 100 atm. Especially for an API in the early stages of chemical development, such an investment clearly bears a large risk.

An additional concern was that the hydrogenation product has an unfavorable eutectic point, so it was impossible to isolate the crude intermediate in more than 94 percent ee by batch process. Because of this limitation, the process chemistry route toward LY500307 necessarily involved a kinetically controlled crystallization step after the hydrogenation to upgrade the enantiopurity of this penultimate intermediate to >99 percent ee.
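To put the enantiopurity figures in perspective, enantiomeric excess can be converted to the underlying ratio of enantiomers using the standard definition ee = (major − minor) / (major + minor). The function names below are illustrative; only the 94 and 99 percent ee values come from the text.

```python
def enantiomeric_excess(major, minor):
    """Enantiomeric excess in percent from amounts of the two enantiomers."""
    return (major - minor) / (major + minor) * 100.0


def major_fraction_from_ee(ee_percent):
    """Percentage of the mixture that is the major enantiomer at a given ee."""
    return (100.0 + ee_percent) / 2.0


for ee in (94.0, 99.0):
    major = major_fraction_from_ee(ee)
    minor = 100.0 - major
    print(f"{ee:.0f}% ee corresponds to a {major:.1f} : {minor:.1f} mixture")
```

At 94 percent ee the material still contains 3 percent of the unwanted enantiomer, which is why an upgrade by crystallization was required.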



The process chemistry team at Eli Lilly successfully developed a fully continuous process to this penultimate intermediate, including reaction, workup and kinetically controlled crystallization modules (the engineering considerations implicit in these efforts are beyond the scope of this article). An advantage of flow reactors is that high-pressure tubing can be utilized for hydrogenation and other hyperbaric reactions. Because the headspace of a batch reactor is eliminated, many of the safety concerns associated with running high-pressure reactions are also obviated by the use of a continuous process reactor. Additionally, a two-stage mixed suspension-mixed product removal (MSMPR) module was designed for the scalable, continuous, kinetically controlled crystallization of the product, so it was possible to isolate the product in >99 percent ee, eliminating the need for an additional batch crystallization step.

This continuous process afforded 144 kg of the key intermediate in 86 percent yield, comparable with a 90 percent isolated yield using the batch process. The 73-liter pilot-scale flow reactor (occupying less than 0.5 m³ of space) achieved the same weekly throughput as theoretical batch processing in a 400-liter reactor. Therefore, the continuous flow process demonstrates advantages in safety, efficiency (eliminating the need for batch crystallization), and throughput, compared with a theoretical batch process.

Academic research institutes in process chemistry
Institute of Process Research & Development, University of Leeds