User:Dixiang Yuan/sandbox

Protein crystallization (NEW)
Protein crystallization is the process of formation of a protein crystal. In the process, proteins will dissolve in an aqueous environment, and sample solution to reach the supersaturated state. Different methods are used to reach that state. Such as, Vapor diffusion, Microbatch, Microdialysis, Free-interface diffusion, etc. In additional, some parameters including pH, temperature, ionic strength in the crystallization solution and other factors could influence the yield of crystallization as well. These crystals can then be used in structural biology to study the molecular structure of the protein, or for various industrial or biotechnological purposes.

Based on the crystals, the determination of protein structure can be achieved traditionally by utilizing X-Ray Diffraction (XRD). Alternatively, cryo-electron microscopy (cryo-EM) and nuclear magnetic resonance (NMR) could also be used for protein structure determination. The structural of proteins is significant to the structural analysis in biochemistry and translational medicine. Meanwhile, the protein structure is essential for the development of targeted therapy in modern drug advancement.

Development of protein crystallization (NEW)
Crystallization of protein molecules has been known for over 150 years.

In 1840, Friedrich Ludwig Hünefeld accidentally discovered the formation of crystalline material in samples of the earthworm blood held under two glass slides and occasionally observed small plate‐like crystals in desiccated swine or human blood samples. These crystals ware named as ‘haemoglobin’, by Felix Hoppe‐Seyler in 1864. The seminal findings of Hünefeld have inspired lots of scientist in the future.

Thus, in 1851, Funke described the process of producing human haemoglobin crystals by diluting red blood cells with solvents such as pure water, alcohol or ether, followed by slow evaporation of the solvent from the protein solution 23. In 1871, William T. Preyer, Professor at University of Jena, published a book entitled Die Blutkrystalle (The Crystals of Blood), reviewing the features of haemoglobin crystals from around 50 species of mammals, birds, reptiles and fishes.

In 1909 when the physiologist Edward T. Reichert, together with the mineralogist Amos P. Brown, published an treatise on the preparation, physiology and geometrical characterization of haemoglobin crystals from several hundreds animals, including extinct species such as the Tasmanian wolf. Increasing protein crystals were found.

In 1934, John Desmond Bernal and his student Dorothy Hodgkin discovered that protein crystals surrounded by their mother liquor gave better diffraction patterns than dried crystals. Using pepsin, they were the first to discern the diffraction pattern of a wet, globular protein. Prior to Bernal and Hodgkin, protein crystallography had only been performed in dry conditions with inconsistent and unreliable results. This is the first X‐ray diffraction pattern of a protein crystal.

In 1958, the structure of myoglobin (a red protein containing heme), determined by X-ray crystallography, was first reported by John Kendrew. Kendr ew shared the 1962 Nobel Prize in Chemistry with Max Perutz for this discovery.

Now, based on the protein crystals, the structures of them play a significant role in biochemistry and translational medicine.

The theory of protein crystallization
The essential of crystal formation is letting the sample solution to reach the supersaturated state. Supersaturation is defined by McPherson et al 2014 as “a non-equilibrium condition in which some quantity of the macromolecule in excess of the solubility limit, under specific chemical and physical conditions, is nonetheless present in solution.” The formation of solids in solution, such as aggregation and crystals, favors the re-establishment of equilibrium. The system wants to re-establish equilibrium so every component in the energy expression is at a minimum. There are three main factors involved in the energy expression, which are enthalpy (∆H), entropy (∆S) and temperature (T). ∆H in this expression relates to the ∆H of the chemical bonds being formed and broken upon reactions or phase changes. ∆S relates to the degree of freedom or the measurement of uncertainty that molecules can have. The spontaneity of a process, Gibb’s free energy (∆G), is defined as ∆G = ∆H- T∆S. Hence, either the increase of ∆S or decrease of ∆H will contribute to the spontaneity of the overall process, making ∆G more negative, thus reaching a minimum energy condition of the system. When crystals form, protein molecules become more ordered, which leads to a decrease in ∆S and makes ∆G more positive. Therefore, in order for crystallization to be spontaneous, there must be a sufficiently negative ∆H to overcome the loss of entropy from the more ordered system.

A molecular view going from solution to crystal
Crystal formation requires two steps: nucleation and growth. Nucleation is the initiation step for crystallization. At the nucleation phase, protein molecules in solution come together as aggregates to form a stable solid nucleus. As the nucleus forms, the crystal grows bigger and bigger by molecules attaching to this stable nucleus. The nucleation step is critical for crystal formation since it is the first-order phase transition of samples moving from having a high degree of freedom to obtaining an ordered state (aqueous to solid). In order for the nucleation step to succeed, the manipulation of crystallization parameters is essential. The approach behind getting a protein to crystallize is to yield a lower solubility of the targeted protein in solution. Once the solubility limit is exceeded and crystals are present, crystallization is accomplished.

Vapor diffusion
Three methods of preparing crystals, A: Hanging drop. B: Sitting drop. C: Microdialysis Vapor diffusion is the most commonly employed method of protein crystallization. In this method, a droplet containing purified protein, buffer, and precipitant are allowed to equilibrate with a larger reservoir containing similar buffers and precipitants in higher concentrations. Initially, the droplet of protein solution contains comparatively low precipitant and protein concentrations, but as the drop and reservoir equilibrate, the precipitant and protein concentrations increase in the drop. If the appropriate crystallization solutions are used for a given protein, crystal growth will occur in the drop. This method is used because it allows for gentle and gradual changes in concentration of protein and precipitant concentration, which aid in the growth of large and well-ordered crystals.

Vapor diffusion can be performed in either hanging-drop or sitting-drop format. Hanging-drop apparatus involve a drop of protein solution placed on an inverted cover slip, which is then suspended above the reservoir. Sitting-drop crystallization apparatus place the drop on a pedestal that is separated from the reservoir. Both of these methods require sealing of the environment so that equilibration between the drop and reservoir can occur.

Microbatch
A microbatch usually involves immersing a very small volume of protein droplets in oil (as little as 1 µl). The reason that oil is required is because such low volume of protein solution is used and therefore evaporation must be inhibited to carry out the experiment aqueously. Although there are various oils that can be used, the two most common sealing agent are paraffin oils (described by Chayen et al.) and silicon oils (described by D’Arcy). There are also other methods for Microbatching that don't use a liquid sealing agent and instead require a scientist to quickly place a film or some tape on a welled plate after placing the drop in the well.

Besides the very limited amounts of sample needed, this method also has as a further advantage that the samples are protected from airborne contamination, as they are never exposed to the air during the experiment.

Microdialysis
Microdialysis takes advantage of a semi-permeable membrane, across which small molecules and ions can pass, while proteins and large polymers cannot cross. By establishing a gradient of solute concentration across the membrane and allowing the system to progress toward equilibrium, the system can slowly move toward supersaturation, at which point protein crystals may form.

Microdialysis can produce crystals by salting out, employing high concentrations of salt or other small membrane-permeable compounds that decrease the solubility of the protein. Very occasionally, some proteins can be crystallized by dialysis salting in, by dialyzing against pure water, removing solutes, driving self-association and crystallization.

Free-interface diffusion
This technique brings together protein and precipitation solutions without premixing them, but instead, injecting them through either sides of a channel, allowing equilibrium through diffusion. The two solutions come into contact in a reagent chamber, both at their maximum concentrations, initiating spontaneous nucleation. As the system comes into equilibrium, the level of supersaturation decreases, favouring crystal growth.

pH
The basic driving force for protein crystallization is to optimize the number of bonds one can form with another protein through intermolecular interactions. These interactions are known to be dependent on electron densities of molecules and the protein side chains that change as a function of pH. The tertiary and quaternary structure of proteins are determined by intermolecular interactions between the amino acids’ side groups, in which the hydrophilic groups are usually facing outwards to the solution to form a hydration shell to the solvent (water). As the pH changes, the charge on these polar side group will also change with respect to the solution pH and the protein’s pKa. Hence, the choice of pH is essential either to promote the formation of crystals where the bonding between molecules to each other is more favorable than with water molecules. pH is one of the most powerful manipulations that one can assign for the optimal crystallization condition.

Temperature
Temperature is another interesting parameter to discuss since protein solubility is a function of temperature. In protein crystallization, manipulation of temperature to yield successful crystals is one common strategy. Unlike pH, temperature of different components of the crystallography experiments could impact the final results such as temperature of buffer preparation, temperature of the actual crystallization experiment, etc.

Chemical Additives
Chemical additives are small chemical compounds that are added to the crystallization process to increase the yield of crystals. The role of small molecules in protein crystallization had not been well thought of in the early days since they were thought of as contaminants in most case. Smaller molecules crystallize better than macromolecules such as proteins, therefore, the use of chemical additives had been limited prior to the study by McPherson. However, this is a powerful aspect of the experimental parameters for crystallization that is important for biochemists and crystallographers to further investigate and apply.

Specialized protein crystallization techniques
Some proteins present unique challenges for crystallization. Membrane proteins frequently require the addition of a detergent for isolation and crystallization, and tend to form "very small, weakly (x-ray) diffracting, radiation-sensitive crystals". Proteins that form fibres need to be stabilized in a monomeric form. Small proteins can have poor solubility in water and require specialized crystallization techniques.

Technologies that assist with protein crystallization
=== High throughput crystallization screening === High through-put methods exist to help streamline the large number of experiments required to explore the various conditions that are necessary for successful crystal growth. There are numerous commercials kits available for order which apply preassembled ingredients in systems guaranteed to produce successful crystallization. Using such a kit, a scientist avoids the hassle of purifying a protein and determining the appropriate crystallization conditions.

Liquid-handling robots can be used to set up and automate large number of crystallization experiments simultaneously. What would otherwise be slow and potentially error-prone process carried out by a human can be accomplished efficiently and accurately with an automated system. Robotic crystallization systems use the same components described above, but carry out each step of the procedure quickly and with a large number of replicates. Each experiment utilizes tiny amounts of solution, and the advantage of the smaller size is two-fold: the smaller sample sizes not only cut-down on expenditure of purified protein, but smaller amounts of solution lead to quicker crystallizations. Each experiment is monitored by a camera which detects crystal growth.

Protein engineering
Techniques of molecular biology, especially molecular cloning, recombinant protein expression, and site-directed mutagenesis can be employed to engineer and produce proteins with increased propensity to crystallize, or can even direct polymorph selection during protein crystallization. Frequently, problematic cysteine residues can be replaced by alanine to avoid disulfide-mediated aggregation, and residues such as lysine, glutamate, and glutamine can be changed to alanine to reduce intrinsic protein flexibility, which can hinder crystallization.

Technologies that identify the structure of protein crystals (NEW)
For the macromolecule structural solving, Nuclear Magnetic Resonance (NMR), X-Ray Diffraction (XRD) and Cryo-electron microscopy (Cryo-EM) are the three main ways in the field.

Nuclear magnetic resonance (NMR)
Specifically for proteins, NMR covers the smaller sizes of the range. The largest protein that has had its structure successfully solved by NMR was malate synthase G with 723 amino acid residues at 81.4kDa in 2002. This puts a huge limitation on NMR usage in analyzing complex protein structures with molecular weight above that limit.

X-ray powder diffraction (XRD)
Due to having no limit to protein molecular weight, the use of XRD in protein structural determination is more popular compared to NMR. As a reference, XRD had successfully solved and provided high resolution structures (< 1.5Å) for proteins such as human phosphodiesterase 2A at a molecular weight of 161.4kDa (and with a resolution of 1.43Å), which NMR would not have been able to achieve.

Cryogenic electron microscopy (cryo-EM)
Cryo-EM is most notably known for the advantage of completely eliminating the need of crystallizing protein samples. The cryo-EM sample preparation process is relatively more instantaneous and easier than XRD. The protein sample for cryo-EM analysis is usually prepare by fast freezing using liquid ethane. After fast freezing, samples are ready for visualization under EM. This completely avoids the time and effort needed for protein crystallization for XRD analysis. Yet, in comparison with structures being solved by XRD, structures solved by cryo- EM are significantly lower in resolution.

Alternatives
Some proteins do not fold properly outside their native environment, e.g. proteins which are part of the cell membrane like ion channels and G-protein coupled receptors, their structure is altered by interacting proteins or switch between different states. All those conditions prevent crystal growth or give crystal structures which do not represent the natural structure of the protein. In order to determine the 3D structure of proteins which are hard to crystallize researchers may use nuclear magnetic resonance, also known as protein NMR, which is best suited to small proteins, or transmission electron microscopy, which is best suited to large proteins or protein complexes.

Applications of protein crystallization
Protein crystallization is required for structural analysis by X-ray diffraction, neutron diffraction, and some techniques of electron microscopy. These techniques can be used to determine the molecular structure of the protein. For a better part of the 20th century, progress in determining protein structure was slow due to the difficulty inherent in crystallizing proteins. When the Protein Data Bank was founded in 1971, it contained only seven structures. Since then, the pace at which protein structures are being discovered has grown exponentially, with the PDB surpassing 20,000 structures in 2003, and containing over 100,000 as of 2014.

Crystallization of proteins can also be useful in the formulation of proteins for pharmaceutical purposes.