
Abstract

Fragment-based drug discovery (FBDD) has come of age in the last decade with the FDA approval of four fragment-derived drugs. Biophysical methods are at the heart of hit discovery and validation in FBDD campaigns. The three most commonly used methods, thermal shift, surface plasmon resonance, and nuclear magnetic resonance, can be daunting for the novice user. We aim here to provide the nonexpert user of these methods with a summary of problems and challenges that might be faced, but also highlight the potential gains that each method can contribute to an FBDD project. While our view on FBDD is slightly biased toward enabling structure-guided drug discovery, most of the points we address in this review are also valid for non-structure-focused FBDD.

Keywords

FBDD, biophysics, drug discovery, fragment screening

Introduction

Fragment-based drug discovery (FBDD) has established itself as a widely used target-based method in drug discovery.^1,2 The central tenet of the method is the finding that successful small-molecule drugs are often characterized by high ligand efficiencies (LEs).³ The LE of a drug against its target is defined as the free energy of binding to the target divided by the heavy-atom count of the drug. FBDD considers this fact by starting from very low-molecular-weight (MW) initial hits displaying high LE. Such starting points are usually found by screening libraries consisting of several thousand compounds typically with a MW of <300 Da—so-called fragments. Initial hits are characterized by low affinities (K_d > 10 µM) and hence require sensitive biophysical screening methods for detection. The low MW typically translates to a high LE, despite the low affinity. Maintaining the high LE of initial fragment hits is a key goal in the hit-to-lead and lead optimization stages of FBDD. Due to their small size, fragments are more likely to form high-quality interactions with their target,⁴ which is central to the high LE values commonly observed with fragment hits. A corollary of this is the fact that smaller molecules can sample chemical space more efficiently. There are an estimated 10⁶ to 10¹² fragment-sized molecules⁵ (compared with 10⁵⁰ for molecules of <500 Da⁶). Therefore, a library of a couple thousand fragments allows much better coverage of chemical space than a high-throughput screening library of several million compounds. The reduced effort required to establish and maintain a fragment library has made FBDD particularly interesting for smaller companies, and its proven track record has led to major pharmaceutical companies establishing their own FBDD groups.
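The LE definition above can be illustrated with a minimal Python sketch. It assumes the common sign convention LE = −ΔG/heavy-atom count, with ΔG = RT ln(K_d) at 298.15 K; the function name, the example K_d values, and the heavy-atom counts are hypothetical and chosen only to show why a weak fragment can still have a higher LE than a potent, larger molecule.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def ligand_efficiency(kd_molar: float, heavy_atoms: int, temp_k: float = 298.15) -> float:
    """Return LE in kcal/mol per heavy atom, assuming dG = RT ln(Kd)."""
    if kd_molar <= 0 or heavy_atoms <= 0:
        raise ValueError("Kd and heavy-atom count must be positive")
    delta_g = R_KCAL * temp_k * math.log(kd_molar)  # negative when Kd < 1 M
    return -delta_g / heavy_atoms


# Hypothetical fragment: Kd = 100 uM, 12 heavy atoms
fragment_le = ligand_efficiency(100e-6, 12)
# Hypothetical larger compound: Kd = 10 nM, 38 heavy atoms
lead_le = ligand_efficiency(10e-9, 38)
print(f"fragment LE = {fragment_le:.2f}, larger compound LE = {lead_le:.2f}")
```

Under these assumed inputs the fragment has the higher LE despite an affinity that is four orders of magnitude weaker, which is the point made in the paragraph above.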

From its first description in the literature,⁷ FBDD has matured into a successful approach for lead discovery and delivered four FDA-approved drugs (Table 1).

Table 1.

Four FDA-Approved Drugs Have Been Derived from FBDD.

Drug	Primary Hit	Affinity of Primary Hit, µM	Primary Screening Method	Reference
Vemurafenib (PLX4032, Zelboraf)	7-Azaindole	>200 (IC₅₀)	Biochemical assay (AlphaScreen)	Tsai et al.,⁹⁶ Bollag et al.⁹⁷
Venetoclax (ABT-199, Venclexta)	Biphenyl-4-carboxylic acid; Tetrahydronaphthol	300 (K_d); 4300 (K_d)	NMR	Muchmore et al.,⁹⁸ Oltersdorf et al.⁹⁹
Erdafitinib (JNJ42756493, Balversa)	Quinoxaline	~10 (K_d)	X-ray/NMR/Tm shift	Perera et al.¹⁰⁰
Pexidartinib (PLX3397, Turalio)	7-Azaindole	>100 (IC₅₀)	Biochemical assay	Zhang et al.,¹⁰¹ Tap et al.¹⁰²

In our opinion, the strengths of FBDD are most efficiently harnessed in combination with structure-guided chemistry optimization strategies. Astex is extremely structure focused, to a degree that only targets with a structural system amenable to high-throughput crystallography are pursued.⁸ It is, however, possible to employ FBDD methods in the absence of high-resolution structural information. A comprehensive review on this subject has recently been published.⁹ This review will focus on a pipeline where one or several biophysical screening methods are used to feed a high-throughput structure determination method.

Currently, x-ray crystallography is the only structural biology technique that can be considered high-throughput. However, the spectacular recent advances in cryo-EM, in terms of resolution and sample-to-structure times, make this method seem the obvious candidate to join x-ray crystallography as a high-throughput structure determination technique. Nuclear magnetic resonance (NMR) is the third technique able to deliver atomic resolution structures, but it has never been able to compete with x-ray in terms of applicability and throughput. Promising NMR methods for simplified protocols to determine protein–ligand structures have been developed over the years, but we feel these methods are not being routinely used in the drug discovery industry. If not for structural elucidation purposes, NMR does, however, have its merit in the process of FBDD (see below).

As stated, biophysical techniques are extensively utilized in FBDD, from screening of fragment libraries through to hit validation and lead optimization. Although many reviews on the principles of biophysical techniques have been published,^10,11 there is little describing the practical aspects of applying biophysical techniques to the problem of fragment-based screening (FBS), and less again on how to extract the most value from biophysical primary screening data. In this review, we hope to expose the reality of biophysical FBS in a useful fashion, aspiring to help the anxious biophysicist withstand the relentless unforeseen. We discuss three of the most widely utilized biophysical methods employed in FBS: NMR, surface plasmon resonance (SPR), and Tm shift. It is beyond the scope of this review to cover every aspect of each technique and every flavor of instrument and software. We instead focus on the more general and practical considerations of biophysical FBS.

Biophysics in Fragment Screening

The State of My Protein

Before a screening pipette is raised in earnest, one should get to know the enemy. Protein mass, oligomeric state, posttranslational modifications, oxidation state, level of heterogeneity, and the oft-forgotten binding of endogenous ligands should all be known. Liquid chromatography–mass spectrometry (LC-MS) and native mass spectrometry (native MS) can give insight into most of these criteria. Solution-based characterization is also highly important: size exclusion chromatography (SEC), multiangle light scattering (MALS), and analytical ultracentrifugation (AUC) can confirm the oligomeric solution state of proteins; protein NMR and Tm shift can give insight into the folded state of proteins; isothermal titration calorimetry (ITC) can provide invaluable protein quality control (QC) via the binding stoichiometry term (what fraction of my protein is competent for binding?) and the measured dissociation constant (K_d) (does in-house data tally with published values?).

Endogenous ligand binding to a protein target has been observed in-house several times. In each case the presence of the endogenous ligand was unexpected and only revealed by x-ray crystallography, native MS and ITC titrations exhibiting either weaker than expected K_d values, or complex multiphasic titrations. The presence of bound endogenous ligands at sites of interest will be problematic for most biophysical readouts and must be monitored carefully. With any new target, the authors strongly recommend performing an ITC titration if there is a good tool compound available, both to QC the protein and to provide a K_d benchmark for other techniques.

Tm Shift

Background

The thermal shift principle has been around for a long time,¹² but it is continually reemerging under a profusion of pseudonyms, for example, thermofluor,¹³ thermal shift, Tm shift, and differential scanning fluorimetry (DSF).¹⁴ The widespread appeal of Tm shift is due to the following:

Ease of setup—it can be run on most real-time PCR instruments.

Low cost of instruments and consumables.

High throughput—96- or 384-well formatting and ~1 h/plate.

Low protein consumption—typically a few milligrams per screen.

Relative ease of data analysis.

At its heart, Tm shift exploits the coupling of the protein denaturation equilibrium (Native ⇌ Denatured) with the ligand binding equilibrium (Protein + Ligand ⇌ Protein:Ligand). Protein thermal denaturation is often irreversible, and in most cases, protein aggregation ensues after some degree of thermal denaturation. Even so, most protein thermal denaturation curves exhibit a single sigmoidal transition, appearing two-state, despite not being true representations of reversible protein denaturation equilibria. Ideally, bona fide protein ligands that bind the native state of a protein will shift the entire thermal denaturation curve to the right, which is conveniently expressed as a shift in the melting temperature, or “Tm shift” (Fig. 1a,b).

Figure 1.

(a) A typical protein thermal denaturation curve monitored by the binding of Sypro Orange. (b) A target protein will be thermally stabilized in the presence of a bona fide tool compound, which binds specifically to the native state of the protein. (c) Various categories of Tm shift fragment screening data to be familiar with.

Compound screening using Tm shift was first implemented using fluorescent dyes that preferentially bind partially or completely denatured states of proteins,¹³ with Sypro Orange emerging as the most widely used fluorescent dye for Tm shift assays.¹⁵ More recently, intrinsic tryptophan fluorescence has made a return to Tm shift buffer screens,¹⁶ and the thermal shift principle has also been adapted to a cell-based readout and the issue of target engagement.^17–21 The following section discusses Tm shift fragment screening monitored by Sypro Orange binding, but similar principles should apply to any fluorescent method that is used to monitor the thermal denaturation process.

Fragment Screening Using Tm Shift

Tm shift setup for fragment screening relies on first establishing conditions under which a clear denaturation transition can be observed and fitted. Specifically, one hopes to see well-fit replicates displaying low standard deviations in the fitted parameters. It is worth performing thermal denaturation experiments using other complementary methods, for example, far-UV circular dichroism (far-UV CD) or differential scanning calorimetry (DSC), to validate the apparent Tm observed using the fluorescent dye. Far-UV CD and DSC can also provide vital insights into more complex protein thermal denaturation events, for example, for multimeric proteins or intrinsically disordered proteins.

Exact plate layouts for Tm shift screening are at the discretion of the user, but care must be taken regarding cross-plate variation and edge effects. A rule of thumb at Astex is that for eight DMSO protein control replicates on a 96-well plate the standard deviation for the fitted Tm should be <0.2 °C and preferably <0.1 °C. It is also important to consider the overall shape of the thermal denaturation curve. Ideally, the predenaturation baseline should be linear, and the denaturation transition should be a single sigmoid (Fig. 1). A surprisingly large number of proteins exhibit this type of profile. Buffer, additive, and protein construct screens can be useful to identify conditions where the thermal denaturation profile is well fit. It is also worth testing a narrow range of protein and fluorescent dye concentrations. Typically, the higher the protein MW, the lower the protein concentration that will be required. Regarding the Sypro Orange concentration, we have never had to veer too far from the standard 2.5× final concentration of Sypro Orange (from a 5000× commercial stock). The absolute Tm of the protein control is not important; robust fitting of the protein control denaturation curves and obtaining a low standard deviation among the replicates are what matter most.
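The control-replicate rule of thumb above can be checked with a few lines of Python. This is a minimal sketch: the function name and the eight Tm values are hypothetical, and the use of the sample standard deviation (n − 1) is an assumption, since the text does not say which estimator is used.

```python
import statistics


def control_tm_summary(control_tms, max_sd=0.2):
    """Return (mean Tm, sample SD, passes) for DMSO control replicates."""
    mean_tm = statistics.mean(control_tms)
    sd = statistics.stdev(control_tms)  # sample SD (n - 1); an assumption
    return mean_tm, sd, sd < max_sd


# Hypothetical fitted Tm values (deg C) for eight DMSO control wells
dmso_controls = [52.1, 52.0, 52.2, 52.1, 51.9, 52.0, 52.1, 52.2]
mean_tm, sd, passes = control_tm_summary(dmso_controls)
print(f"mean Tm = {mean_tm:.2f} C, SD = {sd:.2f} C, passes <0.2 C rule: {passes}")
```

In this invented example the plate meets the <0.2 °C criterion but not the preferred <0.1 °C one, so one might still look for edge effects or revisit the buffer conditions.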

In terms of buffer additives, it is worth remembering that fluorescent dyes used for Tm shift screening will readily interact and fluoresce in the presence of detergents. Even small amounts of detergent can compromise Tm shift experiments and are best avoided. A Tm shift method has been published for membrane proteins using covalent modification of cysteine residues with a coumarin dye.²²

As mentioned earlier, most proteins begin to aggregate upon thermal denaturation. The fluorescent signal for Sypro Orange-bound protein aggregates typically decreases after denaturation, giving the latter half of any thermogram a “horsetail” appearance (Fig. 1a). This horsetail is often removed before performing any data fitting.

Tool Compound Binding—Caveat Emptor

In an ideal world, every new drug target would arrive on the lab bench with a bona fide tool compound alongside it. Alas, this is not the case, and to make matters worse, the literature is strewn with compound binding artifacts.^23–25 It is necessary, therefore, to maintain some degree of cynicism about literature reports of compound binding to one’s target of interest. It is good practice to keep a list of pan-assay interference compounds (PAINS)^23,26 at hand and to be particularly wary of literature describing the discovery of compounds with neither direct binding nor structural support. One’s critical faculties should be especially heightened regarding the quality of any published biophysically derived K_d values, kinetics, or mechanisms of action. Many compounds seem ingeniously designed to confound the drug discovery process and catch out even the most vigilant scientist.²⁷

Small molecules, peptides, cofactors, substrates, products, and effectors can all be utilized as positive controls to elicit a ligand-dependent Tm shift. Ideally, the positive control replicates will show a significant thermal shift (>2 °C) and exhibit low standard deviations in the fitted parameters.

Fragment Screening Using Thermal Shift

With control data in place, the next concern is the compound screening concentration, which involves consideration of the fragment library complexity, the fragment library solubility, and the perceived ligandability of the target binding site. Gauging ligandability is an inexact science, and it can vary markedly even within a specific target class. In addition, novel chemotypes or mechanisms of action may be what are desired rather than yet another kinase hinge binder. Nevertheless, having some sense of the ligandability of a protein target binding site is important, as it is clear both empirically (>50 screens at Astex) and theoretically (degree of binding site occupancy required) that Tm shift fragment screens perform best on more ligandable target binding sites, that is, in situations where fragment K_d values are at the more potent end of the fragment affinity spectrum (<300 µM). In fact, an early screening stage Tm shift screen can provide a useful steer on how ligandable a new protein target is.^28,29

The magnitude of a Tm shift is a function of the ligand concentration, the ligand K_d, the number of ligand binding sites, and the thermodynamics of both protein denaturation and ligand binding.^29–32 Crucially, eliciting a ligand-dependent Tm shift requires an excess of ligand relative to the K_d, or in other words, the ligand binding site occupancy needs to be high. Thus, at a fragment screening concentration of 1 mM we might expect to achieve ~80% occupancy for fragment K_d values of <250 µM. Higher ligand concentrations are required to detect more weakly binding fragments. However, increasing the fragment library screening concentration will increase the number of fragments that are insoluble. This, in turn, will increase the number of fragments that will interfere with the Tm shift readout. It is therefore worth knowing the solubility of one’s fragment library and factoring this into what fragment concentrations will be used for a Tm screen. At Astex we typically run a Tm screen using fragment concentrations of 1, 2, and 5 mM.
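The occupancy estimate quoted above follows from the simple single-site binding isotherm, occupancy = [L] / ([L] + K_d). The sketch below assumes one binding site and ligand in large excess over protein (so that free ligand is approximately equal to total ligand); the function name and the 1 mM K_d used in the loop are illustrative only.

```python
def fractional_occupancy(ligand_conc_um: float, kd_um: float) -> float:
    """Single-site occupancy, assuming free ligand ~ total ligand."""
    if ligand_conc_um < 0 or kd_um <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return ligand_conc_um / (ligand_conc_um + kd_um)


# Example from the text: 1 mM fragment, Kd = 250 uM
print(f"{fractional_occupancy(1000, 250):.0%}")

# A weaker, hypothetical fragment (Kd = 1 mM) at 1, 2, and 5 mM
for conc_um in (1000, 2000, 5000):
    occ = fractional_occupancy(conc_um, 1000)
    print(f"{conc_um / 1000:.0f} mM: {occ:.0%} occupied")
```

The loop shows why higher screening concentrations are needed for weaker binders: a fragment with a K_d of 1 mM only approaches the ~80% occupancy level at around 5 mM.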

Tm Shift Screening Output

The majority of thermograms from a Tm shift fragment screen should resemble the control data in terms of overall shape and goodness of fit. Clear artifacts can be immediately dismissed from further consideration, but invariably the quiddity of the screening process throws up an assortment of more troublesome thermograms (Fig. 1c).

Fragments that generate a negative Tm shift are perhaps the most insidious Tm screen curiosities. Something about the siren call of these protein destabilizers draws the ever-optimistic screener to see some meaning or value in them. We have never derived anything useful from negative Tm shifts, although there are cases (with the benefit of hindsight and/or orthogonal assays) where bona fide hits have been found that generate negative Tm shifts.³³ The rationale for negative Tm shifts is well described^31,34 and often results from multiple weak binding events to the denatured or partially denatured state of a protein. In theory, certain fragments can bind to both the native and denatured conformations of a target protein, making the net Tm shift difficult to interpret. Screening pragmatism dictates that, at least for a Tm shift primary screen, fragments that generate negative Tm shifts are best dismissed without further consideration.

A second troublesome category comprises thermograms that exhibit a high initial fluorescence that slopes downward before merging with the latter part of the genuine protein denaturation event. This type of thermogram can be fit to give an apparently positive Tm shift. In most cases this is a false positive produced by the foreshortening of the denaturation curve. There is a gray area depending on how closely the initial sloping baseline approaches linearity prior to the thermal denaturation transition. A useful steer on the permissibility of such compounds is whether the peak of the denaturation transition is also right-shifted.

As for any fluorescence technique, there will be issues with compound-dependent fluorescence quenching. A multicolored DMSO screening plate will undoubtedly exhibit varying degrees of fluorescence quenching; however, some bona fide fragment hits can still be identified, particularly if the shape and fit of the thermograms are satisfactory.

Over the course of many in-house Tm shift fragment screens we have seen genuine fragment Tm shift values as large as ~7 °C for some particularly ligandable binding sites. A recent Tm shift screen identified several fragments that generated Tm shift values of ~15 °C. None of these big “shifters” were hits in any other method. Historically, in-house Tm shift values of >2 °C have translated well to x-ray and other biophysical methods. It is worth keeping in mind that protein thermal denaturation often proceeds via multiple intermediate states. The apparent two-state denaturation transition we hope for in Tm shift screening is often a gross simplification of the actual thermal denaturation process.

Grouping Tm Shift Hits

Tm shift fragment screening data are most usefully output as the ΔTm (Tm in the presence of fragment – average Tm of the DMSO controls). Although data fitting and ΔTm derivation are often automated, it is essential to manually inspect every denaturation curve; most data can be inspected in a cursory manner but flawed automated fits must be identified and more painstakingly analyzed. Thermograms that are compromised by compound aggregation or fluorescence interference can be quickly removed from further consideration, although it is worth monitoring how such fragments behave at different concentrations and across multiple target classes. Once the interfering data have been removed, there comes the issue of hit classification.

We typically group Tm shift hits into three classes, the boundaries of which depend on the spread of ΔTm values observed in the screen.

For more ligandable binding sites, the grouping of ΔTm values might be as follows:

>2.5 °C—strong hit.

1–2.5 °C—medium hit.

0.5–1 °C—weak hit.

For less ligandable binding sites, the grouping of ΔTm values might be as follows:

>1 °C—strong hit.

0.5–1 °C—medium hit.

0.2–0.5 °C—weak hit.

There are exceptions, as always, but the veracity of a Tm shift hit is strongly dependent on the magnitude of the ΔTm value. Another rule of thumb is the following:

ΔTm > 2 °C—probably real and worth following up by other methods.

ΔTm = 1–2 °C—moderate chance of being real and might be worth following up.

ΔTm < 1 °C—most often a false positive, although occasionally a weak-affinity hit.
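The ΔTm calculation and the rule of thumb above can be sketched in Python as follows. The function names and example values are hypothetical; the handling of negative shifts follows the earlier recommendation to dismiss them in a primary screen, and assigning a ΔTm of exactly 2 °C to the middle band is an assumption. Such a triage is no substitute for manual inspection of every thermogram.

```python
import statistics


def delta_tm(tm_with_fragment: float, dmso_control_tms) -> float:
    """Tm in the presence of fragment minus the mean Tm of the DMSO controls."""
    return tm_with_fragment - statistics.mean(dmso_control_tms)


def triage(d_tm: float) -> str:
    """Apply the rule-of-thumb bands to a single delta-Tm value (deg C)."""
    if d_tm < 0:
        return "negative shift: dismiss in a primary screen"
    if d_tm > 2.0:
        return "probably real: follow up by other methods"
    if d_tm >= 1.0:
        return "moderate chance of being real"
    return "most often a false positive"


# Hypothetical plate: four DMSO controls and one fragment well
controls = [52.0, 52.1, 52.2, 52.1]
shift = delta_tm(54.6, controls)
print(f"dTm = {shift:.1f} C -> {triage(shift)}")
```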

Rules of thumb aside, when pursuing a difficult or high-value protein target or when protein expression levels are low and no other technique is available, there can still be value in pursuing small ΔTm hits by x-ray crystallography and bioassay.

Tm Shift Data Analysis

Several data fitting methods are standard in the field of Tm shift data analysis.^13,32,35–37 We feel that fitting protein thermal denaturation curves from a fragment screen is best undertaken using a physical two-state protein denaturation model and not the commonly utilized first derivative or some unfounded spline interpolation. Several versions of the protein thermal denaturation equation exist, all of which include terms for the initial/pre-/native fluorescence signal, the final/post-/denatured fluorescence signal, the midpoint of the thermal denaturation curve or Tm, and various forms of the free energy term for protein denaturation. Although the first derivative is a common way to derive the Tm, it tends to mask other problems in raw screening data, so plucking just the Tm from a first derivative peak is a dangerous game. Automated fitting routines can be configured, but as mentioned, the user must manually inspect all raw thermal denaturation curves and often manually refit individual thermal denaturation curves that need to have input ranges altered to improve the fit quality.
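As an illustration of the terms listed above, the following sketch evaluates one common form of the two-state model. It is an assumed textbook form, not necessarily the equation used by the authors: linear native and denatured baselines, a van't Hoff free energy ΔG(T) = ΔH(1 − T/Tm), and no heat capacity term. All parameter values are hypothetical, and the sketch only evaluates the model; in practice it would be passed to a nonlinear least-squares routine to fit Tm and ΔH.

```python
import math

R = 8.314  # gas constant in J/(mol*K)


def fraction_denatured(temp_k: float, tm_k: float, dh_j_mol: float) -> float:
    """Two-state fraction denatured with dG = dH * (1 - T/Tm), dCp neglected."""
    dg = dh_j_mol * (1.0 - temp_k / tm_k)  # free energy of denaturation
    k = math.exp(-dg / (R * temp_k))       # equilibrium constant D/N
    return k / (1.0 + k)


def fluorescence(temp_k, tm_k, dh_j_mol, native=(100.0, 0.5), denatured=(2000.0, -2.0)):
    """Observed signal: baseline-weighted sum of native and denatured states.

    Each baseline is (intercept, slope) in arbitrary fluorescence units.
    """
    fd = fraction_denatured(temp_k, tm_k, dh_j_mol)
    f_native = native[0] + native[1] * temp_k
    f_denatured = denatured[0] + denatured[1] * temp_k
    return (1.0 - fd) * f_native + fd * f_denatured


# Hypothetical protein: Tm = 325 K (about 52 C), dH = 400 kJ/mol
for t in (300.0, 320.0, 325.0, 330.0, 350.0):
    print(f"{t:.0f} K: fraction denatured = {fraction_denatured(t, 325.0, 400e3):.3f}")
```

Unlike a first-derivative peak, the fitted model returns baselines and a transition steepness alongside the Tm, so a poor fit in any of these terms flags a problematic thermogram.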

In summary, Tm shift is a generic method and binding site agnostic. It often requires little protein, little time, and little experience to set up and run—benefits that cannot be overstated and which distinguish Tm shift assays from every other FBS method. Most importantly, Tm shift assays can rapidly deliver fragment hits on target proteins with more ligandable binding sites.

Surface Plasmon Resonance

Modern biophysics instrumentation and software have become increasingly user-friendly, leading to a generation of users who can “do” a technique but who are not completely clear about how the instrument functions or the extent of raw data manipulation that is acceptable or meaningful. SPR is particularly user-friendly and particularly prone to artifact, overinterpretation, and varying degrees of frustration, especially in the area of FBS.

Anyone who has attended an SPR conference will have noticed that the SPR users at the antibody end of the SPR spectrum are a lot more “data happy” than those of us lurking in the long grass of SPR fragment screening. The data “unhappiness” of a typical SPR fragment screener is born out of a specific set of challenges: obtaining an immobilization method that provides a reasonable level of active protein, sorting the wheat from the chaff of a fragment screen, and dealing with linear fragment dose responses. Hope is provided, however, by some excellent SPR FBS screening publications^38–44 and an exemplary guide to the aspiring SPR fragment screener.⁴⁵

Instrumentation

Biacore has dominated the SPR instrumentation field, and the Biacore T200 has been the workhorse instrument for many labs undertaking SPR fragment screens in the last decade. A slightly higher-sensitivity instrument, the Biacore S200, and the higher-throughput Biacore 8K continue to keep Biacore at the forefront of SPR FBS. Other SPR manufacturers, such as ForteBio, Sensiq, and Creoptix, also sell instruments that are aimed at the small-molecule/fragment screening market and are worth investigating. It is essential, however, to factor in the quality of customer technical support, service contracts, and the supply time for consumables when deciding which instrument to purchase. Most of the following SPR review sections are based on our in-house experience with a Biacore S200 and a Sensiq Pioneer, but we hope the guidelines will be relevant to any SPR instrument used for FBS.

Much has been written about the importance of maintenance and care required to get the most out of an SPR instrument.^45–47 This cannot be overemphasized, and the reader is strongly advised to perform all proper maintenance checks and controls, particularly on a communal instrument, prior to running any real samples of interest.

Response Units and Signal

The operating principle of an SPR instrument and the physical basis of the raw SPR signal are well described elsewhere.^48–50 Conventionally, SPR fragment screens have the protein target of interest immobilized onto a chip surface while the fragment library is injected serially over the chip surface. The magnitude of an SPR response is proportional to the MW of the interacting protein and compound; for example, if we have 3000 response units (RU) of an immobilized 45 kDa protein, we would expect a maximum signal of 10 RU for an interacting 150 Da fragment, assuming the binding site is saturated and the protein is 100% active on the chip surface.

$$\frac{150}{45{,}000} \times 3000\ \text{RU} = 10\ \text{RU}$$

Obviously, problems will arise if the protein MW increases and the percentage surface activity drops; for example, a 450 kDa protein that is 10% active will only give a 0.1 RU signal for a 150 Da fragment, which would be difficult to detect.

$$\frac{150}{450{,}000} \times 3000\ \text{RU} \times 10\% = 0.1\ \text{RU}$$
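The two worked examples above can be reproduced with a short helper. This is a minimal sketch: the function and parameter names are hypothetical, and 1:1 binding stoichiometry is assumed, as in the examples.

```python
def expected_max_signal(fragment_mw: float, protein_mw: float,
                        immobilized_ru: float, active_fraction: float = 1.0) -> float:
    """Expected saturating SPR signal (RU) for 1:1 binding."""
    if not 0.0 <= active_fraction <= 1.0:
        raise ValueError("active_fraction must be between 0 and 1")
    return (fragment_mw / protein_mw) * immobilized_ru * active_fraction


# 150 Da fragment, 45 kDa protein, 3000 RU immobilized, fully active surface
print(f"{expected_max_signal(150, 45_000, 3000):.1f} RU")
# 150 Da fragment, 450 kDa protein, 3000 RU immobilized, 10% active surface
print(f"{expected_max_signal(150, 450_000, 3000, active_fraction=0.10):.1f} RU")
```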

Most target proteins are in a more SPR-friendly MW range, and the sensitivity of most modern SPR machines means that, in principle, the low MW of fragments is not limiting their detection.

Immobilization

Protein immobilization is the first step of configuring an SPR fragment screen. Each individual protein target requires a unique set of immobilization conditions, the major variables being protein construct design, choice of immobilization chemistry, and choice of buffer components. Some targets will look good with little effort, while other targets may require many months of optimization or may never work at all. Many other biophysical techniques (NMR, ITC, and Tm shift) do not require so much up-front optimization for every target, which means that the SPR immobilization process can get frustrating, particularly as the process defies much rationalization. Instead, SPR immobilization requires a wholly systematic approach, and crucial to the immobilization effort is the availability of a tool compound; without a tool compound, one is running blind in a fragment screen.

Three major protein immobilization methods are utilized in SPR FBS:

NiNTA capture or NiNTA capture coupling. Histidine tags for SPR immobilization are typically longer than the standard six-histidine tag used for protein purification purposes. Lengthening the histidine tag helps to minimize the extent of protein leaching from the NiNTA chip surface. The joint SPR/purification utility of the longer histidine tags is still convenient, and 6 + 6, 6 + 8, or 12 histidine tags have been shown to increase the avidity of histidine capture by NiNTA.⁵¹ A poly-NiNTA chip available from Xantec has also been claimed to increase the avidity of the histidine tag interaction. Combining histidine capture with standard amine coupling is a popular method for stable protein immobilization. Utilizing the histidine capture step provides a more uniform protein orientation on the chip surface than standard amine coupling and eliminates the need for the low-pH capture step.

Streptavidin capture immobilization requires biotinylation of the target protein, typically via a site-specific protein tag (AviTag), which is a substrate for the biotin ligase enzyme. Endogenous levels of AviTag biotinylation in E. coli can be very low but more than enough for efficient protein capture on a streptavidin chip surface. Co-expression of biotin ligase (BirA) with the target protein can provide higher levels of biotinylation within bacterial, insect cell, or mammalian expression systems, although this will not give effective biotinylation of secreted proteins. The AviTag sequence of a purified protein can also be biotinylated in vitro using a commercial biotin ligase kit or the purified biotin ligase enzyme.

Amine coupling via surface lysine residues or the N-terminal amine requires the target protein to get to the chip surface before the covalent immobilization reaction can take place. This is achieved by having the target protein at a pH below the isoelectric point, which can cause some proteins to aggregate or denature. N-terminal polylysine protein tags can help make amine coupling more uniform in protein orientation, in a similar way to histidine and biotin tags.

In our experience, the search for the most suitable immobilization conditions often necessitates trying all three immobilization methods. Protein tags can be combined in some protein constructs and should be tested at both the N- and C-termini. For screening purposes, it is vital to have a stable level of immobilized protein over the course of each screening run—hence the preference for capture coupling and the exploitation of the biotin–streptavidin affinity. A longer histidine tag may provide a sufficient level of surface stability, in the absence of covalent coupling, if users need the option to regenerate NiNTA chips. Usually, obtaining a reasonable level of immobilized protein (3000–10,000 RU) is straightforward by one or all the methods listed above. The thorny issue is how best to achieve a high level of surface activity, which we assess using a tool compound.

Tool Compound Binding and Surface Activity

It is vital that a well-validated tool compound is used to assess SPR immobilization trials. Well validated means that the compound solubility is known and the purported K_d/K_i/IC₅₀ value has been confirmed in-house by ITC, NMR, microscale thermophoresis (MST), or bioassay. Ideally, the tool compound should also be structurally validated and engage the site of interest for the SPR fragment screen; validating one specific protein target binding site by SPR does not mean that any other protein binding site is SPR validated, although validating one site is clearly better than nothing. Access to bona fide fragment hits or weaker-affinity tool compounds is also useful, in order to get a sense for how well the immobilized protein setup copes with nonspecific binding (NSB) at higher ligand concentrations and how well SPR K_d estimates for fragments compare with other direct binding methods.

Assuming we have access to a validated tool compound, we can next consider surface activity, which is defined as

$$\frac{\text{Experimental } R_{\max}}{\text{Theoretical } R_{\max}} \times 100$$

Having high surface activity seems intuitively beneficial both in terms of limiting the fraction of “dead” protein on the chip surface and in terms of attenuating the NSB problem by permitting the use of a lower level of immobilized protein.

We can backcalculate a value for the minimum immobilization level that is required based on (1) the lowest fragment MW, (2) the protein MW, (3) the sensitivity of the instrument, and (4) the expected level of surface activity. Suppose we start with the assumption that we would like a 5 RU binding signal for a 150 Da fragment binding to a 45 kDa protein.

$$\frac{45{,}000}{150} \times 5\ \text{RU} = 1500\ \text{RU} \quad \text{of protein, if we have 100\% surface activity}$$

In real-life cases, however, it can be difficult to achieve such high levels of surface activity; activity levels <30% are not uncommon. In the above example, if we assume the surface activity is 10%, then, theoretically, we would need to immobilize 15,000 RU (not trivial) of our 45 kDa protein to see a 5 RU signal for a 150 Da fragment. Unfortunately, observing 10% activity at 1500 RU does not mean we will see 10% activity at 15,000 RU; the percent activity will almost certainly be lower at the higher immobilization level.
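The surface activity definition and the back-calculation above can be combined in a small sketch. The function names are hypothetical, 1:1 binding is assumed, and the calculation treats percent activity as independent of immobilization level, which, as just noted, is optimistic at high densities.

```python
def surface_activity_percent(experimental_rmax: float, theoretical_rmax: float) -> float:
    """Surface activity = experimental Rmax / theoretical Rmax x 100."""
    if theoretical_rmax <= 0:
        raise ValueError("theoretical Rmax must be positive")
    return 100.0 * experimental_rmax / theoretical_rmax


def required_immobilization_ru(target_signal_ru: float, fragment_mw: float,
                               protein_mw: float, active_fraction: float = 1.0) -> float:
    """Protein level (RU) needed to see target_signal_ru for a given fragment."""
    if not 0.0 < active_fraction <= 1.0:
        raise ValueError("active_fraction must be in (0, 1]")
    return target_signal_ru * (protein_mw / fragment_mw) / active_fraction


# 5 RU target signal, 150 Da fragment, 45 kDa protein
print(required_immobilization_ru(5, 150, 45_000))                        # fully active surface
print(required_immobilization_ru(5, 150, 45_000, active_fraction=0.10))  # 10% active surface

# Hypothetical tool-compound result: 3 RU observed against a 10 RU theoretical Rmax
print(f"{surface_activity_percent(3.0, 10.0):.0f}% active")
```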

Protein construct design is often crucial for maximizing surface activity levels. In addition to testing multiple immobilization methods, it is worthwhile to assess the following:

Immobilization tags being on the N- and C-termini—consider the relative position of the binding site of interest to the immobilization tag.

The sequence and length of any spacer amino acid sequences between the immobilization tag and the target protein sequence.

Various length constructs, for example, full-length and individual domains.

Utilizing glutathione S-transferase (GST), maltose binding protein (MBP), or other fusion constructs to facilitate antibody-mediated immobilization.

Increasing the level of surface activity can also be achieved through screening buffer components, for example, choice of pH, buffer, reducing agent, detergent, salt, and any relevant additives. Immobilizing a protein in the presence of a potent ligand can also improve surface activity.

It is not standard practice in the SPR field to publish the protein surface activity level for any reported SPR experiments, although it feels like an important piece of information when trying to gauge the credibility of reported affinity and kinetic values; for example, a K_d value reported from a surface that is 90% active is a lot more convincing than a K_d value reported from a surface that is 10% active.

Just as important as surface activity level is surface stability. It is essential to check, using repeated tool compound dose responses, that the surface activity level does not decline significantly over the time required for a screening plate run. A certain degree of surface activity decline can be accommodated in the data analysis of a screening run, but this is best kept to a minimum. Buffer optimization and lower temperatures can help preserve the surface activity levels of sensitive proteins. Ultimately, it is imperative to put in considerable effort in order to achieve a stable and high level of surface protein activity when assessing protein immobilization methods for SPR fragment screening.

Screening Concentration and NSB

Fundamental to the challenge of SPR fragment screening is the battle against NSB. The plague of NSB is a curious artifact of the relatively high compound concentrations required for FBS and for weaker-affinity K_d determination. Something about a protein-immobilized surface is irresistible to many compounds at a concentration above ~100 µM, which can give rise to an NSB signal that is often linear with increasing compound concentration (see “SPR Fragment Dose Responses”). The extent of NSB is dependent on the protein, the compound, the compound concentration, the protein immobilization level, and the immobilization method. Of these variables, we have some control over the protein immobilization level; however, the NSB problem is more readily attenuated by keeping the compound concentration low. A survey of published SPR FBS conditions^42,52–54 reveals that many users are opting to screen at fragment concentrations of <300 µM in order to combat the problem of NSB. Although it might attenuate NSB, low-concentration SPR fragment screening limits the fragment binding affinity range that is detectable.

It is worth considering what fragment binding affinities we would expect to detect from an SPR FBS performed at a fragment concentration of 100 µM. If we assume that the screening raw data signal is only a function of the K_d value, then it becomes apparent that low-concentration SPR fragment screening will only succeed with the most ligandable target binding sites. Achieving 80% fractional occupancy at a screening concentration of 100 µM would require fragment K_d values to be <25 µM. Even at 50% fractional occupancy we are restricted to detecting K_d values of <100 µM. The question arises as to what fractional occupancy is worth following up in a fragment screen, especially when the picture is muddied by the inevitable chaff and artifact of an SPR screen (see “SPR Fragment Dose Responses”).
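
The occupancy figures quoted above follow from the simple 1:1 binding isotherm, occupancy = C / (C + K_d). A minimal sketch, assuming the fragment is in large excess over the immobilized protein so that the free and total fragment concentrations are effectively equal (function names are illustrative only):

```python
def fractional_occupancy(conc_um, kd_um):
    """Fraction of binding sites occupied at a given fragment concentration."""
    return conc_um / (conc_um + kd_um)


def kd_for_occupancy(conc_um, occupancy):
    """K_d (µM) that gives the stated occupancy at conc_um.

    Obtained by rearranging occupancy = C / (C + K_d).
    """
    return conc_um * (1.0 - occupancy) / occupancy


screen_conc_um = 100.0
for target in (0.8, 0.5):
    ceiling = kd_for_occupancy(screen_conc_um, target)
    print(f"{target:.0%} occupancy at {screen_conc_um:.0f} µM needs K_d <= {ceiling:.0f} µM")
```

Any fragment with a K_d above the returned value will fall short of the chosen occupancy at that screening concentration.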

Certain library fragments will bind and dissociate slowly from certain immobilized protein surfaces, which means that the signal from subsequent fragment injections is compromised. This phenomenon is attenuated by performing a “clean screen” in order to identify and remove such fragments from an SPR fragment screening library. Problem fragments will be immobilization surface and protein specific, so it is recommended to run a clean screen for each new target, even if it means removing some hitherto bona fide fragment hits.

Since SPR chips contain multiple channels (typically three or four), it is possible to include a “reference” protein on one channel.³⁹ Selecting an appropriate reference protein is a tricky business and requires the availability of a positive control for the reference protein. Ideally, a reference protein should mimic the NSB liabilities of the target protein while having significantly weaker affinity for any fragments that specifically bind the target protein. Even if such a universal reference protein existed, it is hard to calculate the real value in knowing whether certain library fragments bind to it or not, and how that should inform the user on what is happening with the real protein target. At Astex, we have yet to employ a reference protein during a fragment screen. We have run very different target proteins in parallel on the same chip and can see the value of running variants of certain proteins in parallel, for example, clinically relevant mutations, selectivity targets, or domain deletions from multidomain targets. But caution is clearly required when drawing conclusions from screening data with reference proteins or with multiple proteins run in parallel.

SPR Screening Configuration and Output

The theory, process, and configuration of any SPR FBS are lucidly and generously explained in the excellent review by Giannetti.⁴⁵ This publication should be the first piece of reading for anyone undertaking an SPR fragment screen, and users would do well to follow many of the guidelines.

Data analysis of SPR screening output follows a series of steps: solvent correction, blank subtraction, and normalization based on positive control data. It is essential to check that all the buffer blanks, the positive controls, and the solvent corrections are within the expected ranges. Typically, the final fragment RU signal is then output and compared with a theoretical R_max signal calculated for each fragment to give a percentage or stoichiometry term; for example, a fragment with a theoretical R_max of 10 RU and an observed screening response of 5 RU would be designated as 50% or as having a 0.5 stoichiometry. Next comes the prickly problem of binning data based on sensorgram amplitude and shape ( Fig. 2 ).

Figure 2.

Examples of data classes from analysis of an SPR fragment screen. Taken from Biacore S200 evaluation software. The sensorgrams have the time and RU axes removed for clarity.

It is easy to immediately dismiss any fragment with a signal less than three times the standard deviation (SD) of the negative controls (buffer blanks); these are clear misses. Fragments with a classic rectangular shape and rapid kinetics can then be binned according to percentage, dismissing any fragment with a response greater than ~200%. Within the remaining percentage range (from 3 × SD of the blanks up to ~200%) it is worth thinking carefully about the value of finer binning. The inclination is often to think that higher-percentage hits are better, but it is worth remembering that a bona fide 90%–100% binding signal would equate to a K_d value that was at least ~5- to 10-fold below the screening concentration. What is the likelihood of discovering such high-affinity fragment hits for the target being prosecuted? Some level of skepticism is justified. Common sense, SPR dose responses, and other biophysical direct binding techniques should hopefully dictate the plausibility of such high-scoring SPR screening hits.

In many cases, it is the lower-percentage hits that have the potential to be real fragment hits with tenable K_d values; for example, a 33% hit at 100 µM would correspond to a K_d value of 200 µM. Clearly, the vagaries of SPR screening mean that, even within the subset of well-behaved fragment sensorgrams, there will be many contributing factors to the final RU/percentage score other than the K_d value. Thus, it is best not to bin these percentages too finely and instead designate such fragments as hits for follow-up.
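
The coarse binning described above can be sketched as follows. This is an illustrative sketch only: the function name and labels are hypothetical, the 3 × SD and ~200% cutoffs are taken from the text, and it assumes the sensorgram shape has already been judged to be well behaved (rectangular, rapid kinetics), which the code does not attempt to check.

```python
def bin_screening_response(response_ru, theoretical_rmax_ru, blank_sd_ru,
                           upper_percent=200.0):
    """Assign a coarse bin to a corrected single-concentration response.

    Returns (label, percent_of_theoretical_rmax).
    """
    percent = 100.0 * response_ru / theoretical_rmax_ru
    if response_ru < 3.0 * blank_sd_ru:
        return "miss", percent            # within noise of the buffer blanks
    if percent > upper_percent:
        return "dismiss", percent         # super-stoichiometric response
    return "hit", percent                 # deliberately not binned more finely


# Example from the text: theoretical R_max of 10 RU, observed response of 5 RU.
label, percent = bin_screening_response(5.0, 10.0, blank_sd_ru=0.5)
print(label, percent)
```

Keeping a single "hit" bin reflects the point made above that factors other than K_d contribute to the final percentage score.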

Having taken care of well-behaved hits and clear misses, we are left with assessing what, if any, value can be extracted from “problem” sensorgrams ( Fig. 2 ). Such sensorgrams could be dismissed outright, but they can constitute ~20% of the total screening output and, in addition, there is a danger of throwing the baby out with the bathwater; for example, at Astex we have observed that some high-quality fragments, which bind multiple targets (x-ray and ITC validated), have shown slow association, super-stoichiometry, or slow dissociation in SPR fragment screens. It is worth keeping a note of such fragments and getting a sense of how often they generate unusual SPR screening responses; is it a function of the target protein, the immobilization method, or the fragment screening concentration?

It seems sensible to run an SPR fragment screen at multiple concentrations, being careful to balance the likely onset of NSB with the ligandability of the target binding site. In practice, this means running SPR fragment screens at several concentrations in the 300 µM to 30 µM range. Seeing a clear, sensible dose dependence for fragment hit responses at different screening concentrations gives us a little more faith in their veracity. Ultimately, single-concentration SPR fragment screens are probably viewed, at best, as being a way to quickly identify ~20% of a fragment library that is worthy of follow-up. SPR benefits from low protein consumption and relatively rapid throughput, making it feasible to quickly follow up on many promising initial hits. This leads us naturally on to the subject of SPR fragment dose responses.

SPR Fragment Dose Responses—Reading between the Lines

With the initial SPR fragment screen complete, designated SPR fragment hits are typically progressed for dose response analysis. This is necessary to validate the initial screening result but, more importantly, to sort the wheat from the remaining SPR screening chaff. Without knowledge of the K_d values for these SPR fragment hits, it is worth having the highest concentration of the dilution series in the 500 µM to 2 mM range, such that a 7- or 11-point halving dilution series will cover enough data points to derive fragment K_d values in the middling to higher-affinity range (<300 µM). Typically, SPR dose responses for fragment hits fall into four categories:

Higher-affinity fragment hits (<100 µM) will often show saturating or near-saturating dose–response curves and, more rarely, the comforting sign of some kinetics ( Fig. 3a ).

Many datasets will show some hint of curvature before showing a strong linear dose response at higher fragment concentrations ( Fig. 3b ). Trying to extract a K_d from this type of data is a vertiginous process. The user will often have to delete the higher concentration data points until the fit and R_max look plausible; however, this process almost always results in a dataset that fits better to a straight-line function than a dose–response function. Alternatively, such datasets can produce a K_d estimate close to the top fragment concentration allowed in the fitting; that is, the fit hinges on one data point and, if real, corresponds to ~50% fractional occupancy.

An unambiguous straight-line dose response from beginning to end ( Fig. 3c ).

No signal or no interpretable signal.
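
To illustrate why the second category arises, the sketch below builds a halving dilution series and adds a linear NSB term to a 1:1 binding curve. The additive model and every parameter value (R_max, K_d, NSB slope) are assumptions chosen for illustration, not measured data.

```python
def halving_series(top_um, n_points):
    """Concentrations (µM), highest first, each half the previous one."""
    return [top_um / (2 ** i) for i in range(n_points)]


def specific_response(conc_um, kd_um, rmax_ru):
    """Saturable 1:1 binding response (RU)."""
    return rmax_ru * conc_um / (conc_um + kd_um)


def observed_response(conc_um, kd_um, rmax_ru, nsb_slope_ru_per_um):
    """Specific binding plus an NSB term assumed linear in concentration."""
    return specific_response(conc_um, kd_um, rmax_ru) + nsb_slope_ru_per_um * conc_um


# Hypothetical fragment: K_d 200 µM, R_max 10 RU, NSB slope 0.01 RU per µM.
for conc in halving_series(2000.0, 11):
    spec = specific_response(conc, 200.0, 10.0)
    total = observed_response(conc, 200.0, 10.0, 0.01)
    print(f"{conc:8.1f} µM  specific {spec:5.2f} RU  observed {total:5.2f} RU")
```

With these assumed values the specific term dominates at low concentration, whereas at the top of the series the linear term is the larger contribution, which gives the early curvature followed by a straight line described in the second category.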

It is helpful to take a step back and discuss the range of fragment affinities we might expect for hits from a fragment screen. Obviously, fragment affinities will vary according to binding site ligandability and fragment size. In addition, the fragments for which affinity data are available are most often at the measurable end of the fragment affinity spectrum; that is, SPR, MST, ITC, and bioassay will tend to perform better when estimating K_d/IC₅₀ values in the <300 µM fragment affinity range. Deriving lower-affinity fragment K_d values is often the sole remit of protein-observed (PO) NMR or competitive ligand-observed (LO) NMR methods (see “Affinity Determination by NMR”), although these NMR methods are relatively slow and require milligram levels of protein.

Figure 3.

Examples of fragment dose responses that are difficult to reliably fit and determine K_d estimates from. (a) Saturating, well-fit SPR dose response. The K_d value is indicated by the vertical black line in the lower panel. (b) Dose response with some early hint of curvature prior to an increasing linear phase. (c) Example of linear dose response up to 1 mM fragment concentration.

In-house we have observed fragment K_d values over a broad range of affinities. Fragments with K_d values of <10 µM are rare, although perfectly feasible for certain target binding sites. Fragments with K_d values in the 10–100 µM range are more common and will often be the best hits from screening campaigns against highly ligandable binding sites. Fragments with K_d values in the 100–1000 µM range are the most common, comprising moderate to weak hits from highly ligandable sites and better fragment hits against more challenging target classes. Fragments with K_d values of >1000 µM have been measured in-house for many target classes, mainly using NMR methods. In addition, x-ray fragment screening at Astex has identified fragments that are not detectable by any other method, presumably binding with K_d values of >10 mM.

In theory, therefore, we might expect most biophysical FBS hits to have affinities of >100 µM for most target sites being prosecuted. It is evident, however, that published SPR fragment screens tend to be on more ligandable target proteins with high-affinity fragment hits. Thinking back to how we bin SPR screening hits, it is feasible that even the lowest percentage hit scores could correspond to valuable bona fide hit matter, especially for more difficult protein targets. The problem that arises for SPR follow-up dose responses is our old friend NSB. As mentioned above, many SPR dose responses will appear relatively linear with fragment concentration and defy reliable fitting unless the K_d values are in the <100 µM range. One suggestion to overcome the problems of NSB is to only use the lowest concentration dose response points and then to extrapolate the fit to a theoretical fixed R_max value. We do not recommend this practice. Determining a K_d value requires a fractional occupancy of ~80%, evident by clear curvature in the fragment dose response as saturation is approached. It could be argued that quoting an SPR binding percentage at a certain, pre-NSB concentration could provide enough steer for SAR purposes, although this is debatable, and likely of limited value. In certain cases, an SPR competition format will provide a more effective means to site-specifically validate and rank SPR primary hits.

Competition SPR

If the binding site of interest has a known tool compound, it is possible to configure a competition SPR format in which a tool compound is immobilized onto the chip surface, for example, through a biotinylated linker. Several linker lengths, surface densities, and immobilization strategies may have to be tested in order to achieve a suitable competition configuration. Since the protein target is in solution and not immobilized on the chip surface, we are more likely to avoid the problems of NSB that are caused by the protein–chip surface interface. This is a major advantage and allows the use of much higher competing fragment concentrations than would be possible in a direct binding format.

Titrating the protein over the immobilized tool compound surface should provide a healthy SPR response and a K_d estimate for the interaction that is close to the independently determined K_d value. It is important that the protein dissociates rapidly and completely from the immobilized tool compound surface, which means that the interaction K_d should ideally be in a moderate affinity range, that is, 0.1–10 µM. A titration of unmodified tool compound should effectively compete with the immobilized tool compound for binding to the target protein, giving rise to a dose response that can be fit to generate an IC₅₀ close to the estimated K_d value for the interaction. It is crucial that the binding signal for the protein to the immobilized tool compound is completely knocked down by the titration of free unmodified tool compound. All being well, the competition format can be configured to follow up on SPR primary screening hits; it will allow higher concentrations of fragments to be used, weaker IC₅₀ values to be determined, and an opportunity of delivering SAR through the hit validation phase of FBDD. More detail on the configuration and analysis of SPR competition experiments can be found in the following references.^54–56

Some Thoughts on the Future of SPR Fragment Screening

In terms of FBS, SPR scores highly in being high-throughput and in having very low protein requirements, which are major advantages for many protein target classes. However, the problem of NSB continues to be a major drawback for SPR fragment screening, limiting its utility to highly ligandable target binding sites. The NSB problem appears to be due to the high level of immobilized protein required to detect binding of low MW compounds. Continued improvements in instrument sensitivity should mean that users will be able to lower the level of immobilized protein and perhaps attenuate the extent of the NSB problem. In addition, there is a clear unmet need for a biophysical technique that can routinely, rapidly, and robustly deliver affinity measurements for FBDD programs, particularly in the hit validation and hit-to-lead phases. It is hoped that SPR can meet this need one day.

Nuclear Magnetic Resonance

NMR can be used in FBDD campaigns in numerous ways. In this section, we focus on the impact that NMR can have on SBDD supported by high-throughput crystallography. In addition, we review experiments probing the interaction of target proteins and ligands. We begin by briefly describing the instrumental role that NMR plays in establishing and maintaining a high-quality fragment library.

Fragment Library QC

At the foundation of every successful FBDD campaign lies a well-curated fragment library. We are not going into the details of selecting fragments to get good coverage of chemical space, a topic for which a host of excellent articles exist.^57–61 Instead, we focus here on what is needed for a library to be amenable to biophysical screening. For biophysical and x-ray screening to deliver hits, fragments must be soluble and stable under specific assay conditions. All fragments of the Astex library are quality controlled by NMR for their integrity, aggregation, and stability (including redox activity). Integrity is typically probed by one-dimensional (1D) proton NMR and supplemented by multidimensional and/or heteronuclear experiments where required. Stability and aggregation behavior are monitored by quantitative NMR⁶² and waterLOGSY,⁶³ respectively.

Target QC

NMR can be used to assist construct design for the target protein in early-stage drug discovery. The signal dispersion observed in simple 1D-NMR experiments with unlabeled proteins up to 50 kDa can be used to assess whether a protein sample is folded and stable under the chosen conditions ( Fig. 4a ). The presence of peaks in the upfield (<0.5 ppm) and downfield (>9 ppm) regions indicates a protein with tertiary structure. Furthermore, the oligomeric state of a target protein can be judged by measuring its diffusion coefficient or its rotational correlation time for unlabeled⁶⁴ or labeled⁶⁵ proteins, respectively ( Fig. 4b ). Binding of tool compounds can often be assessed from simple 1D experiments. If isotope-labeled protein is available, tool compounds can be used to map the binding pocket ( Fig. 4c ). Such information can be useful for assigning hits from biophysical screening campaigns to the same site as the tool compound or to alternative sites. Two-dimensional (2D) NMR can also indicate the presence of highly flexible loops/termini or large-scale conformational exchange processes that could be a hindrance to crystallography ( Fig. 4d ). All these data allow for QC of the purified target protein and/or iterative rounds of protein engineering to arrive at a target protein sample of sufficient quality.

Figure 4.

(a) Unfolded proteins are characterized by a narrow chemical shift dispersion. An unstructured 16-mer peptide shows no resonance signals downfield of 9 ppm and no signals upfield of 0.5 ppm (top panel). This contrasts with a stably folded protein, which is characterized by resonance signals in these regions. The peak at 0 ppm is from the trimethylsilyl propionate (TSP) reference. (b) The diffusion coefficient of a protein can be measured to assess the oligomeric state of a sample. The decrease of the aliphatic signals around 0.8 ppm over a range of gradient strengths discriminates the oligomeric form of a kinase (blue) from the monomeric form (blue). (c) The footprint of two overlapping peptides with different lengths can be used for “epitope mapping” via NMR. The peak at 1.5 ppm is only shifted for the “long peptide” and can therefore be mapped to bind specifically to the sequence absent from the “short peptide” but present in the long peptide. (d) 2D-NMR indicates large-scale conformational exchange, evident from the broadening of many of the observed peaks (blue spectrum). A different construct shows much more uniform peak intensities and almost all the expected H–N resonances (red spectrum). Note that the two spectra have been offset relative to each other to allow for better visual comparison.

Fragment Screening by NMR

Despite its relatively low throughput, NMR has evolved over the years to be the most prominent biophysical fragment screening method.^66–68 NMR offers a range of advantages over other popular biophysical screening methodologies such as SPR or thermal shift, for example, the ability to detect binding at ligand concentrations well below the K_d. NMR can in principle be used in two different formats for fragment screening: ligand-observed (LO) and protein-observed (PO) NMR. LO-NMR offers the distinct advantage that it does not require labeled target protein. Furthermore, no MW restrictions apply as is the case for PO-NMR (see below). In fact, LO-NMR experiments tend to work better for bigger target molecules.

The large chemical shift range that a typical fragment library spans usually allows for cocktailing of fragments. Given that the resonances of the individual fragments are known prior to screening, cocktails can be designed to minimize spectral overlap in the cocktail ( Fig. 5a–e ). Typical cocktail sizes of four to eight directly translate to equal time savings for screening a given library. Cocktailing can be taken to its extreme when screening fluorine libraries. The enormous ¹⁹F chemical shift range of >100 ppm means that cocktail size is no longer dictated by spectral overlap. Factors such as the maximally acceptable total organic load, expected hit rate, and NMR spectrometer hardware limitations, such as the maximal chemical shift range that can be covered reliably with radiofrequency pulses,⁶⁹ put an upper limit on the cocktail size. Screening campaigns using up to 36 fragments per cocktail have been reported in the literature.⁷⁰ Together with the advent of cryogenically cooled ¹⁹F detection coils,⁷¹ this allows a library of several thousand fragments to be screened in a matter of days. We are not going to discuss the pros and cons of the ¹⁹F nucleus compared with ¹H. Instead, we refer the reader to a series of excellent papers discussing this in great detail.^70,72,73 As far as this review is concerned, ¹⁹F-NMR will be treated as just another flavor of NMR that can be applied to the same tasks as ¹H-NMR (e.g., for screening and for reporter assay development; see below).

Figure 5.

(a–d) 1D reference spectra of four fragments. (e) 1D reference spectrum of a cocktail of fragments (1–4) displayed individually in panels a–d. Note that the most downfield-shifted peak of fragment 3 around 8.5 ppm is absent from the cocktail, indicating a problem with that fragment, highlighting the QC powers of fragment screening by NMR. The two peaks at 8.25 and 8.30 ppm are from a cofactor added to the buffer. (f) waterLOGSY spectra of the cocktail displayed in panel e in the absence (blue) and in the presence (red) of the target protein. Fragments 2 and 4 are identified as hits from the waterLOGSY experiment. (g) CPMG spectra of the cocktail displayed in panel e in the absence (blue) and in the presence (red) of the target protein. Only fragment 2 is identified as a hit from the CPMG experiment. (h) STD spectrum of the cocktail displayed in panel e in the presence (red) of the target protein. No fragments are identified as hits from the STD experiment. (i) Difference spectrum (magenta) of the two waterLOGSY spectra shown in panel f and 1D reference (black). (j) Same as in panel i, but the black reference spectrum is multiplied by a factor of 0.24 to match the strongest peak in the waterLOGSY difference spectrum, giving a waterLOGSY score of 0.24 for fragment 2.

LO-NMR makes use of the fact that a small-molecule ligand experiences a dramatic increase in its rotational correlation time upon binding to a higher-MW target protein. While small molecules are characterized by small positive cross-relaxation rates, these become smaller and then large and negative for high-MW systems. This change in cross-relaxation behavior is exploited in the saturation transfer difference (STD)⁷⁴ and waterLOGSY^63,75 experiments ( Fig. 5f–h ). In addition to large, negative cross-relaxation rates, large molecules are characterized by high transverse relaxation rates. This is manifest by a faster loss of coherences in the time domain, leading to broader spectral lines/lines with reduced intensity in the frequency domain. This effect lies at the heart of the CPMG^76,77 experiment ( Fig. 5g ), which is the third commonly applied LO-NMR experiment used for fragment screening. In addition to the increased linewidths (and therefore decreased intensities) for protein-bound ligands in comparison with free ligands, the dynamic process of binding and dissociating from the target protein further amplifies the line broadening effect.⁷⁸

We are not going to review any further the technical details of the different LO-NMR experiments that can be used for fragment screening. We refer the interested reader to a large body of literature that covers these techniques in detail.^79–82 Instead, we want to focus on the analysis of LO-NMR data and the conclusions that can be drawn from them.

We commonly conduct a control experiment containing the fragment cocktail in buffer in the absence of protein. This so-called “buffer control” sample serves a twofold purpose. First, it serves as a reference spectrum to form the difference between the LO-NMR screening experiment in the presence and absence of the target protein.^* Second, it is then this difference spectrum that is compared with a conventional 1D-NMR spectrum of the buffer control sample. Each peak of the buffer control reference 1D spectrum is scaled to each peak of the difference spectrum of the LO-NMR screening experiment. Each fragment in the cocktail is then assigned a score, which is the biggest score observed across all peaks of that fragment ( Fig. 5i,j ).
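
The scoring step described above can be sketched as follows. This is a minimal illustration under stated assumptions: peak picking and the matching of each reference peak to its counterpart in the difference spectrum are taken as already done, the per-peak score is taken to be the scale factor (difference intensity divided by reference intensity), and the function name and intensity values are hypothetical.

```python
def fragment_score(reference_intensities, difference_intensities):
    """Largest per-peak scale factor for one fragment.

    Each scale factor is the value by which the 1D reference peak must be
    multiplied to match the corresponding peak in the difference spectrum.
    """
    factors = [
        diff / ref
        for ref, diff in zip(reference_intensities, difference_intensities)
        if ref > 0  # skip peaks with no usable reference intensity
    ]
    return max(factors) if factors else 0.0


# Hypothetical intensities (arbitrary units) for two fragments in one cocktail.
cocktail = {
    "fragment 1": ([120.0, 90.0], [1.0, 0.5]),
    "fragment 2": ([100.0, 80.0], [24.0, 12.0]),
}
scores = {name: fragment_score(ref, diff) for name, (ref, diff) in cocktail.items()}
print(scores)
```

In this made-up example fragment 2 receives the higher score, mirroring the scaling illustrated in Fig. 5j.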

While the affinity of a fragment for a given target does have an influence on the measured score, there are many other factors that contribute to the score, so that we abstain from relating score intensities to affinities—except in reporter screening experiments. In these experiments, a signal of a reporter molecule, known to bind to the site of interest, is observed for perturbations by the addition of a cocktail of fragments or fragment singletons. In this scenario, the same signal is always monitored and the effect that a test fragment exerts on the reporter signal can directly be related to the affinity of the test fragment for the site of interest. This can be a very useful feature, especially when the aim is to only progress the highest-affinity hits into further validation experiments.

Analysis of LO-NMR screening experiments is a relatively laborious process, especially in the case of CPMG and waterLOGSY experiments, where difference spectra need to be generated. In our experience, this is best done manually for each cocktail, as often small lateral shifts in the protein or buffer reference spectra are required to obtain high-quality difference spectra. This is not the case for STD experiments where formation of the difference can be hard-coded into the NMR pulse sequence, a feature that is made possible by the fact that the “negative control” experiment is measured on the same sample. As strictly speaking no buffer control sample is required for measuring STD experiments, the preparation and measurement of such a sample can be omitted, if one accepts the use of a conventional 1D-NMR spectrum containing the target protein for scaling purposes. This allows for significant time savings in sample preparation and data acquisition. Even more time savings are possible for reporter screens. Likewise, for these no reference sample needs to be measured. In addition, as one and the same signal is monitored throughout the screening of a whole fragment library, the automated analysis of reporter screening data is much more straightforward than for the “direct detection” LO-NMR screening experiments.

Hit Validation by NMR

Another big advantage of NMR over other techniques is its inherent QC feature: at the same time as screening for ligands binding to a given target molecule, the experimental data delivers information on the integrity (identity, concentration, aggregation, etc.) of the fragment under investigation. This feature is central to the low false-positive hit rates that are commonly observed when using NMR for fragment screening.

When using PO-NMR, resonance signals of the target protein are observed and interrogated for fragment-induced changes. At the same time, PO-NMR delivers information on the integrity (folding, aggregation, oxidation, etc.; see “Target QC” section above) of the target protein under investigation. Together with the complementary LO-NMR experiment, this allows a stringent assessment of a protein–ligand interaction in the framework of the NMR validation cross.⁷⁹

There are cases where unlabeled protein has been used for PO-NMR screens both in-house and as reported in the literature. However, in general PO-NMR requires isotopically labeled target proteins. This, together with its relatively high demands in terms of protein consumption, is the main limitation of PO-NMR.

In principle, MW considerations come into play with PO-NMR. The advent of transverse-relaxation optimized (TROSY) NMR, together with uniform, high levels of deuteration, has pushed the MW limit for the ¹⁵N-¹H correlation experiments to circa 100 kDa.⁸³ More advanced isotopic labeling methods, especially the selective ¹³C-labeling and protonation of methyl groups in an otherwise highly deuterated background (also referred to as “selective methyl labeling”), have pushed the MW limit another order of magnitude toward systems of 1 MDa.^84–87 Thus, the limiting factor for (fragment) screening using PO-NMR is no longer MW, but rather the production of sufficient amounts of labeled target protein.

For drug discovery projects at Astex, the translation rate into x-ray-validated hits is the critical figure of merit for any biophysical screening technique. While thermal shift and SPR can often reliably detect binding events with K_d values of <100 µM, the detection threshold for NMR is a good order of magnitude higher. X-ray itself is well suited to detect fragment binders as weak as 100 mM. And even if such weakly binding fragments may not be of interest as starting points for hit-to-lead campaigns, they serve an important function in mapping the site of interest in terms of hot and warm spots. This feature has been exploited by adding high concentrations of cosolvents^88,89 and, more recently, with an approach termed MiniFrags, in which highly soluble molecules with heavy-atom counts of <12 are soaked into protein crystals at concentrations as high as 1 M.⁹⁰

A critical factor for all LO-NMR techniques is the availability of a high-affinity tool compound. Such a molecule can be used to specifically compete with binders to the site of interest and, therefore, plays a crucial role in validating a LO-NMR screening method. Translation rates for NMR screening hits into validated x-ray structures vary greatly from target to target. Rates as low as 5% and as high as >80% have been observed in screening campaigns carried out at Astex over the years. Generally, translation rates for fragment hits that have been found to be competed off by a high-affinity tool compound are higher than hits found in noncompetition screens. However, the gain in translation rate may not always justify the additional experimental effort. Carrying out a small-scale prescreen using different LO-NMR screening methods, with and without a competition step, serves an important purpose in finding the optimal screening cascade to ensure the highest possible translation rates into validated x-ray structures.

Affinity Determination by NMR

In addition to serving as a biophysical screening method for obtaining primary hit matter for FBDD campaigns, NMR is well suited to determine affinities of weakly binding molecules. This can again be in either a PO-NMR format or a LO format. The PO format is particularly interesting in the absence of any known binding compounds for a given target, when multiple binding pockets are present in a target, or in general in the very early stages of a project. It requires relatively little investment of resources other than the purification of suitably labeled protein. In the presence of a tool compound or a natural ligand, resonances sensitive to binding events in the pocket of interest can often be mapped (see Fig. 4c) without the need to undertake lengthy resonance assignment work. PO-NMR experiments can then serve to characterize initial hits and/or different binding sites. The general requirements for affinity measurements by PO-NMR are that (1) the exchange process is fast on the NMR chemical shift timescale, (2) the protein concentration in the assay is ~2-fold below the measured/expected K_d values, and (3) the ligand is ideally soluble above its K_d.⁹¹ In such cases, we usually measure titration curves with 5–10 data points. We generally aim for experimental conditions allowing the measurement of one data point per hour. If several peaks show shifts upon ligand binding, we fit affinities in a global fashion, whereby all curves are forced to have the same K_d value.^92,93 We will not go further into the data acquisition and interpretation for deriving ligand affinities from PO-NMR but refer the interested reader to exhaustive reviews on that subject.^91,93 Even though NMR is often considered too costly and time-consuming to be of use as an affinity determination method in drug discovery projects, we find it uniquely powerful and often indispensable in the very early stages of FBDD campaigns. There are, in our opinion, no other techniques that work as reliably for weak-affinity determination (K_d > 100 µM). Of course, the aim will most often be to hand over affinity determination to a less laborious, higher-throughput method.
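The global fit with a shared K_d can be sketched in a few lines of Python. The sketch assumes a simple 1:1 binding model in fast exchange, in which the observed shift of each peak equals its maximal shift multiplied by the fraction of bound protein; the function names, the golden-section search, and the search bounds are illustrative choices, not the authors' fitting procedure. For a fixed K_d the maximal shift of each peak follows from linear least squares, so only K_d itself needs to be searched.

```python
import math


def fraction_bound(p_total, l_total, kd):
    """Fraction of protein bound for 1:1 binding (exact quadratic solution).
    All concentrations and kd share the same unit (here: molar)."""
    b = p_total + l_total + kd
    return (b - math.sqrt(b * b - 4.0 * p_total * l_total)) / (2.0 * p_total)


def global_rss(kd, p_total, ligand_concs, shifts_per_peak):
    """Residual sum of squares over all peaks for one shared kd.
    Returns (rss, list of best-fit maximal shifts, one per peak)."""
    fb = [fraction_bound(p_total, l, kd) for l in ligand_concs]
    den = sum(f * f for f in fb)
    rss, dmax = 0.0, []
    for shifts in shifts_per_peak:
        d = sum(f * s for f, s in zip(fb, shifts)) / den  # linear least squares
        dmax.append(d)
        rss += sum((s - d * f) ** 2 for f, s in zip(fb, shifts))
    return rss, dmax


def fit_global_kd(p_total, ligand_concs, shifts_per_peak,
                  kd_lo=1e-6, kd_hi=1.0, iters=200):
    """Golden-section search on log10(kd); assumes a single minimum in range."""
    gr = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = math.log10(kd_lo), math.log10(kd_hi)

    def f(x):
        return global_rss(10.0 ** x, p_total, ligand_concs, shifts_per_peak)[0]

    c, d = b - gr * (b - a), a + gr * (b - a)
    fc, fd = f(c), f(d)
    for _ in range(iters):
        if fc < fd:
            b, d, fd = d, c, fc
            c = b - gr * (b - a)
            fc = f(c)
        else:
            a, c, fc = c, d, fd
            d = a + gr * (b - a)
            fd = f(d)
    kd = 10.0 ** ((a + b) / 2.0)
    return kd, global_rss(kd, p_total, ligand_concs, shifts_per_peak)[1]
```

Calling `fit_global_kd(p_total, ligand_concs, shifts_per_peak)` with one list of shifts per peak returns the shared K_d and the per-peak maximal shifts. Real data would additionally call for error estimates and a check that the fast-exchange assumption holds.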

The PO-NMR work described above is often carried out with the aim of establishing a LO-NMR affinity assay that is both higher-throughput and does not require labeled target protein. Such LO-NMR reporter assays work in analogy to biological assays, with which readers may be more familiar. Basically, all LO-NMR experiments (e.g., STD, waterLOGSY, and CPMG) can be used in LO-NMR reporter assays. In the following, we exemplify the use of a LO-NMR reporter assay based on a CPMG experiment. In a CPMG experiment an NMR signal (a “resonance”) is quenched (by a combination of increased transverse relaxation when bound to the target and line broadening by the exchange event itself) when the reporter is bound to the target protein. Adding a test molecule that competes for the same site as the reporter frees some of the reporter molecule, leading to the reappearance of its signal ( Fig. 6a ). Ideally, a good reporter molecule should have an affinity within an order of magnitude of the expected affinities of the test compounds and a well-resolved, high-intensity resonance (preferably in the form of a singlet). The latter factors enable the easy measurement of the signal intensity or integral, whereas the former is a requirement dictated by the mathematics required to describe a competitive binding equilibrium. After having found a suitable reporter system, up to 20 test molecules can be evaluated within a day of spectrometer time, making the method well suited for the initial stages of hit validation and hit-to-lead chemistry.
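The competitive equilibrium underlying such a reporter assay can be explored with a small forward model. The following Python sketch assumes that reporter and competitor each bind the same single site with 1:1 stoichiometry; the function names and the bisection solver are illustrative assumptions, not part of the published method. It solves the protein mass balance for the free protein concentration and returns the fraction of reporter that remains bound.

```python
def free_protein(p_total, r_total, c_total, kd_r, kd_c):
    """Free protein concentration [P] from the mass balance
    p_total = [P] + r_total*[P]/(kd_r + [P]) + c_total*[P]/(kd_c + [P]),
    solved by bisection (the left-hand side grows monotonically with [P])."""
    def excess(p):
        return (p + r_total * p / (kd_r + p)
                + c_total * p / (kd_c + p) - p_total)

    lo, hi = 0.0, p_total
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if excess(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def reporter_fraction_bound(p_total, r_total, c_total, kd_r, kd_c):
    """Fraction of the reporter bound to the target in the presence of a
    competitor at total concentration c_total."""
    p = free_protein(p_total, r_total, c_total, kd_r, kd_c)
    return p / (kd_r + p)
```

Evaluating `reporter_fraction_bound` for competitors much weaker or much tighter than the reporter shows the bound fraction sitting close to its upper or lower limit, which is the numerical counterpart of the requirement that reporter and test compound affinities lie within about an order of magnitude of each other.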

Figure 6.

(a) Titration of a reporter molecule with increasing target protein concentrations. All panels show the reporter signal at 3.12 ppm in a 600 ms (black) and 10 ms (gray) CPMG experiment. Each panel is normalized to the 10 ms intensity. The leftmost panel shows the reporter in the absence of protein. The following three panels show the reporter at increasing protein concentrations. The last two panels are at equal protein concentrations, but with a tight binder to the site of interest added in the last panel. Adding the tight binder restores the reporter signal to its intensity in the absence of target protein (leftmost panel), which shows that the reporter binds specifically to the site of interest. (b) The intensity of the reporter signal is converted to a fraction of the bound reporter and fit to an exponential equation. This is possible if the affinity of the reporter for the target and the exact concentrations of the target protein and reporter are known. (c) Determining the protein–reporter complex concentration as a function of an added unknown competitor allows the calculation of the K_d value of the unknown competitor. Here a series of competitors with affinities of 3.7 mM (dark green), 1.4 mM (pale green), 0.3 mM (light blue), and 0.1 mM (dark blue) are shown. Note that the 3.7 mM and 0.1 mM curves are near the top and bottom of the range of the competition assay, respectively.

Below we describe some of the practicalities of setting up a LO-NMR reporter assay. The assay can use a proton or a fluorine signal as reporter. For both methods, the same considerations as outlined below apply. We usually start by identifying potentially interesting reporter molecules from the crystallographically validated hit matter that is generated during a fragment screening campaign. Ideally, one of these crystallographic hits can be used. If that is not possible or desired, we look for suitable commercially available analogs of crystallographic hits. Candidate reporter molecules then undergo affinity determination against the target, often by PO-NMR. We then titrate the reporter with an increasing concentration of target protein (Fig. 6a). If the affinity of the reporter and the concentrations of the target protein and reporter molecule are known, a parametric equation relating the intensity of the reporter signal to the fraction of the bound reporter molecule can be deduced. We typically use an exponential function for this (Fig. 6b). This function merely parameterizes the observed reporter signal change as a function of target protein concentration and does not seek to explain any underlying physical principles. As such, any mathematical function describing the observed behavior adequately in a least squares sense can be used. Addition of a competing test molecule then allows calculation of its binding affinity through measurement of the change in the reporter signal intensity (Fig. 6c). If a high-affinity tool compound is available, its addition should lead to the reporter signal reaching the same intensity as in the absence of the target protein. This essentially proves specific binding of the reporter to the target protein and serves as an important assay validation benchmark. To validate our reporter assays further, we generally use as competitors a handful of fragments whose affinities we have previously determined in PO-NMR experiments. We would like to further comment on some practicalities of establishing robust LO-NMR assays. To minimize experimental errors, it is good practice to include a suitable reference molecule in the assay. The reference molecule must not interact with the target protein, and its signal serves as a normalization factor for the reporter signal; we always measure the ratio of the reporter to the reference signal intensity. It should be noted that spectroscopic techniques have been developed to generate the reference signal artificially by the spectrometer electronics,^94,95 and this method can be used instead of a “real” reference molecule. Because the same signals are always measured in a LO-NMR reporter assay, it lends itself ideally to automated analysis, further increasing its attractiveness for medium-throughput applications such as regular assays during HV or H2L efforts. With such an automated analysis workflow, we find that LO-NMR reporter assays are competitive with more traditional biochemical assays in terms of “active effort” and offer the distinct advantage of not suffering from many of the problems that biochemical assays generally encounter when setting them up for the determination of weak affinities.⁸¹
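The calculation of a competitor affinity from the reporter signal can be written out as a short Python sketch. Two assumptions are made and should be read as such: the calibration is taken to have the hypothetical single-parameter form ratio = exp(-k × fraction bound), where `ratio` is the reporter/reference intensity ratio normalized to its protein-free value and `k` comes from the reporter titration (the text only states that an exponential function is typically used); and reporter and competitor are assumed to compete for one site with 1:1 stoichiometry. The function names are illustrative.

```python
import math


def bound_fraction_from_ratio(ratio, k):
    """Invert the assumed calibration ratio = exp(-k * fraction_bound).
    `ratio` is the normalized reporter/reference intensity ratio."""
    return -math.log(ratio) / k


def competitor_kd(p_total, r_total, c_total, kd_r, reporter_bound_fraction):
    """K_d of a competitor from the measured bound fraction of the reporter,
    using the two mass balances and the known reporter affinity kd_r."""
    pr = reporter_bound_fraction * r_total   # protein-reporter complex
    r_free = r_total - pr
    p_free = kd_r * pr / r_free              # from kd_r = [P][R]/[PR]
    pc = p_total - p_free - pr               # protein-competitor complex
    if pc <= 0.0:
        raise ValueError("no measurable displacement of the reporter")
    c_free = c_total - pc
    if c_free <= 0.0:
        raise ValueError("inconsistent concentrations")
    return p_free * c_free / pc
```

Because the result depends directly on the total protein and reporter concentrations and on the reporter K_d, errors in any of these propagate into the competitor K_d, which is consistent with the emphasis above on knowing these quantities accurately and on normalizing against a reference signal.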

Discussion/Outlook

FBDD has achieved its most important milestones, delivering active molecules both to the clinic and to the market. Thus, it has reached the status of a bona fide lead generation method across the pharmaceutical industry. But of course—as with any scientific field—the method continues to innovate and adapt to emerging technologies. From an SBDD perspective, cryo-EM is undoubtedly the most exciting new opportunity for FBDD, as it continues to bring within reach a host of targets deemed “intractable” to date. It is our belief that, within the next decade, cryo-EM will have established itself firmly in the world of SBDD. Taking on challenging targets such as large, multiprotein complexes or membrane proteins will also pose challenges for the biophysical methods that are used in SBDD, for both screening and affinity determination. Some of the methods mentioned in this review offer advantages for such targets (e.g., the low material requirements of thermal shift assays), whereas others are facing challenges (e.g., fragment screening in the presence of ill-defined amounts of lipids/detergents). Innovation in these fields will likely follow suit with the new problems that are encountered. Other challenges are of a more fundamental nature, such as the expected increased hit rates for large multiprotein complexes.

The multitude of screening possibilities and hit validation and hit elaboration strategies make FBDD ideal for collaborative projects. This is exemplified by the industry–industry partnering and industry–academia collaborations that led to the discovery of two of the four FBDD-derived drugs that are currently on the market. NICR and Monash University played vital roles in the discovery of erdafitinib and pexidartinib, respectively.

The fact that fragment libraries require less establishment and curation effort makes the method particularly interesting to smaller companies and academic institutions. Nowadays, there are a variety of commercially available fragment libraries, catering to every taste of screening method—from fluorinated libraries (https://enamine.net/fragments/fragment-collection/fluorinated-fragments) for fast and efficient ¹⁹F LO-NMR screening to ultrasoluble mini-fragments (Enamine and Astex are commercializing the Astex MiniFrag⁹⁰ library) aimed at the crystallography community. If anything, this is expected to further popularize the FBDD method in the coming years.

Footnotes

Acknowledgements

We would like to thank Marc O’Reilly, Chris Murray, and David Rees for helpful comments during the writing of this manuscript.

Declaration of Conflicting Interests

The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: All authors are employed by Astex Therapeutics Limited and their research and authorship of this article was completed within the scope of their employment with Astex Therapeutics Limited.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Reto Walser

* Except for the STD experiment, where the difference is formed by subtracting two STD experiments that have been irradiated at very different frequencies, and the buffer control is, strictly speaking, not necessary to form the difference spectrum.

References

1. Erlanson, D. A.; Fesik, S. W.; Hubbard, R. E.; et al. Twenty Years On: The Impact of Fragments on Drug Discovery. Nat. Rev. Drug Discov. 2016, 15, 605–619.

2. Murray, C. W.; Verdonk, M. L.; Rees, D. C. Experiences in Fragment-Based Drug Discovery. Trends Pharmacol. Sci. 2012, 33, 224–232.

3. Hopkins, A. L.; Groom, C. R.; Alex. Ligand Efficiency: A Useful Metric for Lead Selection. Drug Discov. Today 2004, 9, 430–431.

4. Hann, M. M.; Leach, A. R.; Harper. Molecular Complexity and Its Impact on the Probability of Finding Leads for Drug Discovery. J. Chem. Inf. Comput. Sci. 2001, 41, 856–864.

5. Ruddigkeit; van Deursen; Blum, L. C.; et al. Enumeration of 166 Billion Organic Small Molecules in the Chemical Universe Database GDB-17. J. Chem. Inf. Model. 2012, 52, 2864–2875.

6. Bohacek, R. S.; McMartin; Guida, W. C. The Art and Practice of Structure-Based Drug Design: A Molecular Modeling Perspective. Med. Res. Rev. 1996, 16, 3–50.

7. Shuker, S. B.; Hajduk, P. J.; Meadows, R. P.; et al. Discovering High-Affinity Ligands for Proteins: SAR by NMR. Science 1996, 274, 1531–1534.

8. Hartshorn, M. J.; Murray, C. W.; Cleasby; et al. Fragment-Based Lead Discovery Using X-Ray Crystallography. J. Med. Chem. 2005, 48, 403–413.

9. Erlanson, D. A.; Davis, B. J.; Jahnke. Fragment-Based Drug Discovery: Advancing Fragments in the Absence of Crystal Structures. Cell Chem. Biol. 2019, 26, 9–15.

10. Holdgate; Geschwindner; Breeze; et al. Biophysical Methods in Drug Discovery from Small Molecule to Pharmaceutical. Methods Mol. Biol. 2013, 1008, 327–355.

11. Renaud, J. P.; Chung, C. W.; Danielson, U. H.; et al. Biophysics in Drug Discovery: Impact, Challenges and Opportunities. Nat. Rev. Drug Discov. 2016, 15, 679–698.

12. Semisotnov, G. V.; Rodionova, N. A.; Razgulyaev, O. I.; et al. Study of the “Molten Globule” Intermediate State in Protein Folding by a Hydrophobic Fluorescent Probe. Biopolymers 1991, 31, 119–128.

13. Pantoliano, M. W.; Petrella, E. C.; Kwasnoski, J. D.; et al. High-Density Miniaturized Thermal Shift Assays as a General Strategy for Drug Discovery. J. Biomol. Screen. 2001, 6, 429–440.

14. Niesen, F. H.; Berglund; Vedadi. The Use of Differential Scanning Fluorimetry to Detect Ligand Interactions That Promote Protein Stability. Nat. Protoc. 2007, 2, 2212–2221.

15. M. C.; Aulabaugh; Jin; et al. Evaluation of Fluorescence-Based Thermal Shift Assays for Hit Identification in Drug Discovery. Anal. Biochem. 2004, 332, 153–159.

16. Kotov; Bartels; Veith; et al. High-Throughput Stability Screening for Detergent-Solubilized Membrane Proteins. Sci. Rep. 2019, 9, 10379.

17. Jafari; Almqvist; Axelsson; et al. The Cellular Thermal Shift Assay for Evaluating Drug Target Interactions in Cells. Nat. Protoc. 2014, 9, 2100–2122.

18. Martinez Molina; Jafari; Ignatushchenko; et al. Monitoring Drug Target Engagement in Cells and Tissues Using the Cellular Thermal Shift Assay. Science 2013, 341, 84–87.

19. Henderson, M. J.; Holbert, M. A.; Simeonov; et al. High-Throughput Cellular Thermal Shift Assays in Research and Drug Discovery. SLAS Discov. 2020, 25, 137–147.

20. Herledan; Andres; Lejeune-Dodge; et al. Drug Target Engagement Using Coupled Cellular Thermal Shift Assay-Acoustic Reverse-Phase Protein Array. SLAS Discov. 2020, 25, 207–214.

21. McNulty, D. E.; Bonnette, W. G.; et al. A High-Throughput Dose-Response Cellular Thermal Shift Assay for Rapid Screening of Drug Target Engagement in Living Cells, Exemplified Using SMYD3 and IDO1. SLAS Discov. 2018, 23, 34–46.

22. Alexandrov, A. I.; Mileni; Chien, E. Y.; et al. Microscale Fluorescent Thermal Stability Assay for Membrane Proteins. Structure 2008, 16, 351–359.

23. Baell, J. B.; Holloway, G. A. New Substructure Filters for Removal of Pan Assay Interference Compounds (PAINS) from Screening Libraries and for Their Exclusion in Bioassays. J. Med. Chem. 2010, 53, 2719–2740.

24. Prinz; Schlange; Asadullah. Believe It or Not: How Much Can We Rely on Published Data on Potential Drug Targets? Nat. Rev. Drug Discov. 2011, 10, 712.

25. Begley, C. G.; Ellis, L. M. Drug Development: Raise Standards for Preclinical Cancer Research. Nature 2012, 483, 531–533.

26. Baell, J. B.; Nissink, J. W. M. Seven Year Itch: Pan-Assay Interference Compounds (PAINS) in 2017-Utility and Limitations. ACS Chem. Biol. 2018, 13, 36–44.

27. Baell; Walters, M. A. Chemistry: Chemical Con Artists Foil Drug Discovery. Nature 2014, 513, 481–483.

28. Carver, T. E.; Bordeau; Cummings, M. D.; et al. Decrypting the Biochemical Function of an Essential Gene from Streptococcus pneumoniae Using ThermoFluor Technology. J. Biol. Chem. 2005, 280, 11704–11712.

29. Chilton; Clennell; Edfeldt; et al. Hot-Spotting with Thermal Scanning: A Ligand- and Structure-Independent Assessment of Target Ligandability. J. Med. Chem. 2017, 60, 4923–4931.

30. Straume; Freire. Two-Dimensional Differential Scanning Calorimetry: Simultaneous Resolution of Intrinsic Protein Structural Energetics and Ligand Binding Interactions by Global Linkage Analysis. Anal. Biochem. 1992, 203, 259–268.

31. Waldron, T. T.; Murphy, K. P. Stabilization of Proteins by Ligand Binding: Application to Drug Screening and Determination of Unfolding Energetics. Biochemistry 2003, 42, 5058–5064.

32. Matulis; Kranz, J. K.; Salemme, F. R.; et al. Thermodynamic Stability of Carbonic Anhydrase: Measurements of Binding Affinity and Stoichiometry Using ThermoFluor. Biochemistry 2005, 44, 5258–5266.

33. Horn, J. R.; Shoichet, B. K. Allosteric Inhibition through Core Disruption. J. Mol. Biol. 2004, 336, 1283–1291.

34. Cimmperman; Baranauskiene; Jachimoviciute; et al. A Quantitative Model of Thermal Stabilization and Destabilization of Proteins by Ligands. Biophys. J. 2008, 95, 3222–3231.

35. Eftink, M. R. The Use of Fluorescence Methods to Monitor Unfolding Transitions in Proteins. Biophys. J. 1994, 66, 482–501.

36. Huynh; Partch, C. L. Analysis of Protein Stability and Ligand Interactions by Thermal Shift Assay. Curr. Protoc. Protein Sci. 2015, 79, 28.9.1–28.9.14.

37. Augustijn; Mahapatra; Streicher; et al. Novel Non-Linear Curve Fitting to Resolve Protein Unfolding Transitions in Intrinsic Fluorescence Differential Scanning Fluorimetry. Eur. J. Pharm. Biopharm. 2019, 142, 506–517.

38. Maynard, J. A.; Lindquist, N. C.; Sutherland, J. N.; et al. Surface Plasmon Resonance for High-Throughput Ligand Screening of Membrane-Bound Proteins. Biotechnol. J. 2009, 4, 1542–1558.

39. Navratilova; Hopkins, A. L. Fragment Screening by Surface Plasmon Resonance. ACS Med. Chem. Lett. 2010, 1, 44–48.

40. Elinder; Geitmann; Gossas; et al. Experimental Validation of a Fragment Library for Lead Discovery Using SPR Biosensor Technology. J. Biomol. Screen. 2011, 16, 15–25.

41. Shepherd, C. A.; Hopkins, A. L.; Navratilova. Fragment Screening by SPR and Advanced Application to GPCRs. Prog. Biophys. Mol. Biol. 2014, 116, 113–123.

42. Navratilova; Aristotelous; Picaud; et al. Discovery of New Bromodomain Scaffolds by Biosensor Fragment Screening. ACS Med. Chem. Lett. 2016, 7, 1213–1218.

43. Chavanieu; Pugniere. Developments in SPR Fragment Screening. Expert Opin. Drug Discov. 2016, 11, 489–499.

44. Bergsdorf; Wright, S. K. A Guide to Run Affinity Screens Using Differential Scanning Fluorimetry and Surface Plasmon Resonance Assays. Methods Enzymol. 2018, 610, 135–165.

45. Giannetti, A. M. From Experimental Design to Validated Hits: A Comprehensive Walk-Through of Fragment Lead Identification Using Surface Plasmon Resonance. Methods Enzymol. 2011, 493, 169–218.

46. Myszka, D. G. Improving Biosensor Analysis. J. Mol. Recognit. 1999, 12, 279–284.

47. Myszka, D. G.; Abdiche, Y. N.; Arisaka; et al. The ABRF-MIRG’02 Study: Assembly State, Thermodynamic, and Kinetic Analysis of an Enzyme/Inhibitor Interaction. J. Biomol. Tech. 2003, 14, 247–269.

48. Liedberg; Nylander; Lundstrom. Biosensing with Surface Plasmon Resonance—How It All Started. Biosens. Bioelectron. 1995, 10, i–ix.

49. Myszka, D. G.; Rich, R. L. Implementing Surface Plasmon Resonance Biosensors in Drug Discovery. Pharm. Sci. Technol. Today 2000, 3, 310–317.

50. Homola. Present and Future of Surface Plasmon Resonance Biosensors. Anal. Bioanal. Chem. 2003, 377, 528–539.

51. Fischer; Leech, A. P.; Hubbard, R. E. Comparative Assessment of Different Histidine-Tags for Immobilization of Protein onto Surface Plasmon Resonance Sensorchips. Anal. Chem. 2011, 83, 1800–1807.

52. Dennis, M. L.; Morrow, B. J.; Dolezal; et al. Fragment Screening for a Protein-Protein Interaction Inhibitor to WDR5. Struct. Dyn. 2019, 6, 064701.

53. Rombouts, F. J. R.; Alexander; Cleiren; et al. Fragment Binding to beta-Secretase 1 without Catalytic Aspartate Interactions Identified via Orthogonal Screening Approaches. ACS Omega 2017, 2, 685–697.

54. Huber; Casagrande; Hug, M. N.; et al. SPR-Based Fragment Screening with Neurotensin Receptor 1 Generates Novel Small Molecule Ligands. PLoS One 2017, 12, e0175842.

55. Perspicace; Banner; Benz; et al. Fragment-Based Screening Using Surface Plasmon Resonance Technology. J. Biomol. Screen. 2009, 14, 337–349.

56. Pollack, S. J.; Beyer, K. S.; Lock; et al. A Comparative Study of Fragment Screening Methods on the p38alpha Kinase: New Methods, New Insights. J. Comput. Aided Mol. Des. 2011, 25, 677–687.

57. Keseru, G. M.; Erlanson, D. A.; Ferenczy, G. G.; et al. Design Principles for Fragment Libraries: Maximizing the Value of Learnings from Pharma Fragment-Based Drug Discovery (FBDD) Programs for Use in Academia. J. Med. Chem. 2016, 59, 8189–8206.

58. Davis, B. J.; Erlanson, D. A. Learning from Our Mistakes: The ‘Unknown Knowns’ in Fragment Screening. Bioorg. Med. Chem. Lett. 2013, 23, 2844–2852.

59. Murray, C. W.; Rees, D. C. Opportunity Knocks: Organic Chemistry for Fragment-Based Drug Discovery (FBDD). Angew. Chem. Int. Ed. Engl. 2016, 55, 488–492.

60. Hall, R. J.; Mortenson, P. N.; Murray, C. W. Efficient Exploration of Chemical Space by Fragment-Based Screening. Prog. Biophys. Mol. Biol. 2014, 116, 82–91.

61. Jacoby; Davies; Blommers, M. J. Design of Small Molecule Libraries for NMR Screening and Other Applications in Drug Discovery. Curr. Topics Med. Chem. 2003, 3, 11–23.

62. Holzgrabe; Deubner; Schollmayer; et al. Quantitative NMR Spectroscopy—Applications in Drug Analysis. J. Pharm. Biomed. Anal. 2005, 38, 806–812.

63. Dalvit; Fogliatto; Stewart; et al. WaterLOGSY as a Method for Primary NMR Screening: Practical Aspects and Range of Applicability. J. Biomol. NMR 2001, 21, 349–359.

64. D. H.; Chen, A. D.; Johnson, C. S. An Improved Diffusion-Ordered Spectroscopy Experiment Incorporating Bipolar-Gradient Pulses. J. Magn. Reson. 1995, 115, 260–264.

65. Lee; Hilty; Wider; et al. Effective Rotational Correlation Times of Proteins from NMR Relaxation Interference. J. Magn. Reson. 2006, 178, 72–76.

66. Jhoti; Cleasby; Verdonk; et al. Fragment-Based Screening Using X-Ray Crystallography and NMR Spectroscopy. Curr. Opin. Chem. Biol. 2007, 11, 485–493.

67. Sugiki; Furuita; Fujiwara; et al. Current NMR Techniques for Structure-Based Drug Discovery. Molecules 2018, 23, 148.

68. Villar, H. O.; Yan; Hansen, M. R. Using NMR for Ligand Discovery and Optimization. Curr. Opin. Chem. Biol. 2004, 8, 387–391.

69. Kobzar; Skinner, T. E.; Khaneja; et al. Exploring the Limits of Broadband Excitation and Inversion Pulses. J. Magn. Reson. 2004, 170, 236–243.

70. Dalvit; Vulpetti. Ligand-Based Fluorine NMR Screening: Principles and Applications in Drug Discovery Projects. J. Med. Chem. 2019, 62, 2218–2244.

71. Dalvit; Mongelli; Papeo; et al. Sensitivity Improvement in 19F NMR-Based Screening Experiments: Theoretical Considerations and Experimental Applications. J. Am. Chem. Soc. 2005, 127, 13380–13385.

72. Dalvit; Piotto. ¹⁹F NMR Transverse and Longitudinal Relaxation Filter Experiments for Screening: A Theoretical and Experimental Analysis. Magn. Reson. Chem. 2017, 55, 106–114.

73. Vulpetti; Dalvit. Fluorine Local Environment: From Screening to Drug Design. Drug Discov. Today 2012, 17, 890–897.

74. Mayer; Meyer. Characterization of Ligand Binding by Saturation Transfer Difference NMR Spectroscopy. Angew. Chem. Int. Ed. Engl. 1999, 38, 1784–1788.

75. Dalvit; Pevarello; Tato; et al. Identification of Compounds with Binding Affinity to Proteins via Magnetization Transfer from Bulk Water. J. Biomol. NMR 2000, 18, 65–68.

76. Carr, H. Y.; Purcell, E. M. Effects of Diffusion on Free Precession in Nuclear Magnetic Resonance Experiments. Phys. Rev. 1954, 94, 630–638.

77. Meiboom; Gill. Modified Spin-Echo Method for Measuring Nuclear Relaxation Times. Rev. Sci. Instrum. 1958, 29, 688–691.

78. Lepre, C. A.; Moore, J. M.; Peng, J. W. Theory and Applications of NMR-Based Screening in Pharmaceutical Research. Chem. Rev. 2004, 104, 3641–3676.

79. Gossert, A. D.; Jahnke. NMR in Drug Discovery: A Practical Guide to Identification and Validation of Ligands Interacting with Biological Macromolecules. Prog. Nucl. Magn. Reson. Spectrosc. 2016, 97, 82–125.

80. Campos-Olivas. NMR Screening and Hit Validation in Fragment Based Drug Discovery. Curr. Topics Med. Chem. 2011, 11, 43–67.

81. Dalvit. NMR Methods in Fragment Screening: Theory and a Comparison with Other Biophysical Techniques. Drug Discov. Today 2009, 14, 1051–1057.

82. Schade; Oschkinat. NMR Fragment Screening: Tackling Protein-Protein Interaction Targets. Curr. Opin. Drug Discov. Dev. 2005, 8, 365–373.

83. Pervushin; Riek; Wider; et al. Attenuated T2 Relaxation by Mutual Cancellation of Dipole-Dipole Coupling and Chemical Shift Anisotropy Indicates an Avenue to NMR Structures of Very Large Biological Macromolecules in Solution. Proc. Natl. Acad. Sci. U.S.A. 1997, 94, 12366–12371.

84. Tugarinov; Hwang, P. M.; Ollerenshaw, J. E.; et al. Cross-Correlated Relaxation Enhanced 1H–13C NMR Spectroscopy of Methyl Groups in Very High Molecular Weight Proteins and Protein Complexes. J. Am. Chem. Soc. 2003, 125, 10420–10428.

85. Tugarinov; Kay, L. E. An Isotope Labeling Strategy for Methyl TROSY Spectroscopy. J. Biomol. NMR 2004, 28, 165–172.

86. Sprangers; Kay, L. E. Quantitative Dynamics and Binding Studies of the 20S Proteasome by NMR. Nature 2007, 445, 618–622.

87. Macek; Kerfah; Boeri Erba; et al. Unraveling Self-Assembly Pathways of the 468-kDa Proteolytic Machine TET2. Sci. Adv. 2017, 3, e1601601.

88. Mattos; Ringe. Locating and Characterizing Binding Sites on Proteins. Nat. Biotechnol. 1996, 14, 595–599.

89. English, A. C.; Done, S. H.; Caves, L. S.; et al. Locating Interaction Sites on Proteins: The Crystal Structure of Thermolysin Soaked in 2% to 100% Isopropanol. Proteins 1999, 37, 628–640.

90. O’Reilly; Cleasby; Davies, T. G.; et al. Crystallographic Screening Using Ultra-Low-Molecular-Weight Ligands to Guide Drug Design. Drug Discov. Today 2019, 24, 1081–1086.

91. NMR Methods for the Determination of Protein-Ligand Dissociation Constants. Prog. Nucl. Magn. Reson. Spectrosc. 2007, 51, 219–242.

92. Arai; Ferreon, J. C.; Wright, P. E. Quantitative Analysis of Multisite Protein-Ligand Interactions by NMR: Binding of Intrinsically Disordered p53 Transactivation Subdomains with the TAZ2 Domain of CBP. J. Am. Chem. Soc. 2012, 134, 3792–3803.

93. Williamson, M. P. Using Chemical Shift Perturbation to Characterise Ligand Binding. Prog. Nucl. Magn. Reson. Spectrosc. 2013, 73, 1–16.

94. Mehr; John; Russell; et al. Electronic Referencing Techniques for Quantitative NMR: Pitfalls and How to Avoid Them Using Amplitude-Corrected Referencing through Signal Injection. Anal. Chem. 2008, 80, 8320–8323.

95. Akoka; Barantin; Trierweiler. Concentration Measurement by Proton NMR Using the ERETIC Method. Anal. Chem. 1999, 71, 2554–2557.

96. Tsai; Lee, J. T.; Wang; et al. Discovery of a Selective Inhibitor of Oncogenic B-Raf Kinase with Potent Antimelanoma Activity. Proc. Natl. Acad. Sci. U.S.A. 2008, 105, 3041–3046.

97. Bollag; Hirth; Tsai; et al. Clinical Efficacy of a RAF Inhibitor Needs Broad Target Blockade in BRAF-Mutant Melanoma. Nature 2010, 467, 596–599.

98. Muchmore, S. W.; Sattler; Liang; et al. X-Ray and NMR Structure of Human Bcl-xL, an Inhibitor of Programmed Cell Death. Nature 1996, 381, 335–341.

99. Oltersdorf; Elmore, S. W.; Shoemaker, A. R.; et al. An Inhibitor of Bcl-2 Family Proteins Induces Regression of Solid Tumours. Nature 2005, 435, 677–681.

100. Perera, T. P. S.; Jovcheva; Mevellec; et al. Discovery and Pharmacological Characterization of JNJ-42756493 (Erdafitinib), a Functionally Selective Small-Molecule FGFR Family Inhibitor. Mol. Cancer Ther. 2017, 16, 1010–1020.

101. Zhang; Ibrahim, P. N.; Zhang; et al. Design and Pharmacology of a Highly Specific Dual FMS and KIT Kinase Inhibitor. Proc. Natl. Acad. Sci. U.S.A. 2013, 110, 5689–5694.

102. Tap, W. D.; Wainberg, Z. A.; Anthony, S. P.; et al. Structure-Guided Blockade of CSF1R Kinase in Tenosynovial Giant-Cell Tumor. N. Engl. J. Med. 2015, 373, 428–437.

Applied Biophysical Methods in Fragment-Based Drug Discovery
