Abstract
Computer-aided drug design has seen constantly increasing application over the past two decades in every area of drug discovery. It can offer significant advantages over conventional approaches, being far less expensive and faster than conventional methods, or offering the possibility to predict molecular behaviours that cannot be elucidated in any other way. Recent developments in software and hardware make it possible to simulate increasingly complex molecular environments, widening the applicability of in silico studies from the interactions of small molecules with key protein residues, to the simulation of the dynamic evolution of complex biological systems with atomic resolution. Antiviral research offers several open challenges, from a biological, biochemical and pharmaceutical point of view. Computational approaches are already providing some answers and will undoubtedly give more in the near future. Here, we present a brief overview of the cutting-edge computational methods that play a major role in present and future antiviral research.
Introduction
Molecular modelling techniques have been playing an increasingly crucial role in the search for new drugs and their optimization, in every area of drug design. Although they were initially confined to the visualization of potential drug interactions in targeted enzyme binding, these techniques are now being applied by many research groups worldwide as a powerful tool to help define the relationship between biological activities, binding geometries and mechanisms of action in physiological or pathological processes. This is clear from the increasing number of literature sources concerned with in silico approaches and the boost in the number of molecular modelling packages, both commercial and non-commercial, available to researchers [1].
The reason for this boost is threefold: the improvement in the mathematical models that describe chemical phenomena, which grants higher-precision results; the increase in the number of proteins with known three-dimensional (3D) structure solved through crystallographic or NMR experiments; and the development of cheaper and more powerful hardware. Moreover, the adoption of more intuitive programme interfaces and ergonomic devices, such as haptic styli, has increased the interactivity of the user with the machines [2]. The large range of applications that computer-aided drug design (CADD) methods have acquired confirms that CADD techniques will play a role of increasing importance in drug discovery in the future. The current software is not only able to predict the binding conformation of a small molecule inside a biological target or estimate its binding affinity, but is also used in predicting ADMET characteristics and physiochemical properties, such as solubility and chemical stability, and can help when studying protein interactions and folding [3,4].
The application of in silico technologies to antiviral research has already led to the design of several compounds approved by the US Food and Drug Administration and now used in clinical therapy [5]. Indeed, one of the earliest successes of these techniques is represented by the discovery of zanamivir [6]. Palese and Schulman [7] postulated that the sialidase enzyme was fundamental for influenza virus pathogenesis in humans, although the first inhibitors of that enzyme failed to demonstrate their efficacy as anti-influenza agents. Only after the determination of the 3D structure of sialidase [7–9], did the opportunity arise to design new selective inhibitors by using structure-based approaches. Despite the initial limited commercial success of zanamivir, the CADD strategies employed in its development were important first steps in the development of further sialidase inhibitors.
At present, many antiviral drug discovery programmes include a molecular modelling component and we here give a brief overview on the main computational methods currently used.
Discussion
CADD approaches are many and diverse, and their thorough classification is beyond the aims of this review. CADD applications can be broadly classified as structure-based or ligand-based. The former approach is used in the presence of 3D information of the target (generally derived from crystallographic or NMR studies). The latter methodology is applied when the 3D structure of the target is not known, and information about the binding mode is derived from a series of related active compounds.
Structure-based approaches
Molecular docking is the most commonly used computational technique when the 3D structure of the desired target is available. This methodology tries to predict the geometry of the complex of two molecular entities (usually a small molecule and its target) starting from their individual structures. This is a fast technique and it allows the screening in silico of large compound libraries in a relatively short time. The use of such virtual screening (VS) approaches is increasing constantly, often associated with pharmacophore filtering and other refinement methodologies. Several examples from the antiviral field on the use of VS are available in the literature and they cover a variety of protocols. A few recent examples include screening of the freely available ZINC library against the H5N1 haemagglutinin [10]; screening NCI libraries against the dengue virus E protein [11]; and ensemble-based VS on avian flu neuraminidase [12].
Another notable example is represented in the work reported by Pérez-Nueno et al. [13] on the HIV entry inhibitors. In this study, different VS approaches were compared and it should be noted that the structure of the cellular receptors targeted by this class of inhibitor has not been solved to date. However, the authors were able to build a model for these receptors in silico, by homology modelling using bovine rhodopsin as structural template.
An interesting evolution of traditional VS techniques is the so called ‘steered’ VS. This is an attempt to take into account information about the protein and the ligands in the simulation. For example, the software ProPose [14] implements a general method for performing simultaneous protein–ligand docking, ligand–ligand alignment and pharmacophore queries with the aim to incorporate all a priori knowledge into screening protocol. With this particular procedure of steered VS it is possible to take into account the flexibility of the receptor, using more than one 3D structure, and the ligand data, using different alignment rules and pharmacophore queries, thus improving the enrichment factors and overcoming the problem of weak-binding or non-binding false-positive ligands. The application of steered VS to herpes simplex virus thymidine kinase with the possibility to consider chemical, biological and structural factors in the early steps of VS, has given the opportunity to identify a much better starting point for lead discovery and optimization [15].
Another exciting field of application of VS might be found in the inhibition of protein–protein interactions [16]. Protein–protein binding involves the interaction of a small number of residues on the exposed surface of both binding partners, called hot-spots, the identification of which can be obtained by alanine scanning or other mutagenesis studies. These kinds of interactions are at the base of most cellular pathways and enzymatic control mechanisms, and are involved in virtually every physiological and pathological process. By interacting with known hot-spots on a protein surface, a small molecule could inhabit the protein–protein interactions by preventing the binding of the two protein partners. Therefore, protein–protein interaction inhibition has a great therapeutic potential. This method is producing significant results in anti-cancer research [17,18], and its application to the antiviral area could achieve the same important results. In this regard, the work of Betzi et al. [19] on the SH3 binding surface of the HIV type-1 Nef protein is interesting. In that paper, a virtual screening on 1,420 compounds filtered by a docking procedure and then by a pharmacophore model built using the hot-spot knowledge is reported [19]. Remarkably, all the screened compounds were also evaluated by a biological assay, and the selected leads from VS and highthroughput screening were the same. The results of the use of the VS method in the identification of the small molecules as protein–protein interaction inhibitors led to the discovery of the first two compounds that bind the HIV type-1 Nef SH3 in the micromolar range.
Protein–protein interactions were also at the core of the work reported by De Luca et al. [20]. However, in this case, the interaction targeted was one between a viral protein (HIV integrase) and a cellular cofactor (LEDGF/p75). Interestingly, the VS simulation was performed using a structure-based pharmacophore.
De novo drug design is another interesting structure-based methodology. This approach can be described as the process of producing novel molecular structures with desired pharmacological properties, using the structural information of the biological target [21]. It is a fascinating field, but its use has been somehow restricted by some inherent limitations of the technique, in particular the inability of computers to accurately assess the chemical feasibility of the proposed structures. For this reason, an extensive user intervention in the design process is often required, as is noted in the discovery of novel HCV helicase inhibitors that we have recently reported [22].
It is now common practice to combine different in silico approaches instead of relying on a single technique to generate more accurate results. Indeed, strong positive results have been obtained where a combination of different experimental and in silico techniques were used in conjunction. The discovery of HIV protease inhibitors is an excellent example of this combined approach: crystallography, molecular docking and de novo simulations allowed a very rapid development of a class of compounds that are now extremely important in the HIV therapy [23–25].
Molecular dynamics (MD) simulations are a further example of structure-based approaches. The aim of MD is the prediction of the dynamic evolution of a molecular system over time. This is of key importance in the study of processes that involve conformational changes or in the evaluation of molecular binding interactions and energies. This is a computationally expensive technique, but constantly improving computer speed gives the possibility to use more time-consuming approaches such as meta- and quantum mechanics molecular dynamics (QM-MD) on ligand–receptor complexes and also on ternary complexes (for example ligand receptor nucleic acids or ligand receptor cofactors) [26]. These MD methods provide the opportunity to analyse conformational modifications over time and the interactions between atoms and the stability of the targeted complex (ΔG and Keq calculation). MD simulations find application in the elucidation of several aspects of drug activity. For instance, MD can be helpful in understanding, from a structural point of view, the reasons why the mutation of a particular residue in the binding site of a targeted receptor gives resistance to an inhibitor. Furthermore, QM-MD could be useful to evaluate the polarization energy of all the residues of the binding site of a targeted inhibitor–enzyme complex. This, in turn, means the possibility to know, with a high rate of accuracy, the contributions of individual binding site residues to the enzyme–inhibitor interactions. A recent paper reports the application of a QM-MD analysis on three high affinity HIV protease inhibitors (nelfinavir, mozenavir and tipranavir) [27]. Hensen et al. [27] showed, by using electrostatic energy analysis on the inhibitors and on the binding site residues, that the two specific amino acids (Asp25 and Ile50) of both polypeptide chains play a crucial role in the inhibitor binding and that the 4-hydroxy-dyhydropyrone of tipranavir gives the strongest polarization effects as a result of a stronger binding force of this moiety. It is easy to imagine the importance of this kind of information in the design process of novel biologically active molecules. Although QM-MD analysis and the computation of the polarization energy can not be applied to a large number of compounds, it could be very useful, (for example, after a VS, when the number of scaffolds is limited) to identify the core moiety with the higher probability to become a lead.
Ligand-based approaches
Quantitative structure–activity relationship (QSAR) predictions are possibly the most significant example of ligand-based drug design. This approach involves establishing a mathematical function that correlates biological data (such as 50% inhibitory concentration values) of a series of compounds with the properties of those molecules (descriptors). More than 50 years ago, Corwin Hansch, created the first QSAR model quantifying a relationship between three physical–chemical properties and the biological activities of a molecular set and, since then, QSAR calculations have been applied to a wide range of projects [28]. The number and identity of the descriptors used in QSAR can be very different and have varied significantly over the years. The understanding of the important role played by the spatial geometry of the compounds and the adoption of 3D descriptors was the key to the development of the more advanced 3D QSAR, which allows the analysis of both ligand binding conformations and directional forces (for example, hydrogen bonds). Although 3D QSAR has scored several notable successes and many exciting discoveries that have been obtained by it exist, the determination of the correct bioactive conformation, the protonation state and the alignment rules are not free of bias [29]. Four-dimensional and five-dimensional QSAR methods are a recent development, and could offer a convincing answer to these limitations: four-dimensional QSAR models include a sampling of the different conformations accessible to a compound, as well as its protonation states, whereas five-dimensional QSAR allows the model to include induced fit effects on the receptor as well. The application of these new QSAR methods can offer a new lead optimization strategy and some interesting results in antiviral research have been reported. Their application to HIV reverse transcriptase [30,31] and protease [32] inhibitors provided information that was not obtainable by other methods, confirming the crucial role of these tools and of CADD methods in general as a powerful way to identify a lead compound in a quicker and more cost-effective way than physically synthesizing and testing it.
Alignment and superposition of small molecules is also at the core of another ligand-based set of methodologies based on the concept of shape recognition [33]. The rationale is that molecules that interact with the same target might have a certain degree of shape similarity. Furthermore, in addition to the simple structure superposition, a series of molecular descriptors, such as cLogP or pKa, can be considered, generating a rich set of information that can be used for drug design. Some examples of applications of these approaches in the antiviral field available in the literature include a work on HIV type-1 non-nucleoside reverse transcriptase inhibitors [34] and a study on HIV protease inhibitors [35]. Recently, a novel non-superposition-based shape recognition methodology, which has been validated also using a set of neuraminidase inhibitors, has been reported [36]. This approach proved to be considerably faster than other shape-based methods and it could be extremely useful in ligand-based VS projects.
Conclusions
Drug discovery is a time-consuming and expensive process, with a high failure rate. For these reasons new drug design approaches and methodologies are constantly developed in order to reduce the time and costs of this process. In particular, in recent years, interest has increasingly been focused on computer-based techniques and molecular modelling to design and virtually evaluate novel potential drugs on a computer before being prepared in a laboratory. Large and free databases such as the ZINC library [37] can now be tested in a VS simulation on a desktop machine in a matter of days. Computer-based drug design has already been used successfully in the antiviral field, as the discovery of Relenza and the development of several HIV protease inhibitors demonstrate, and it is easy to predict that these methodologies will continue to have a significant effect in future discoveries. The fast-paced adaptation that occurs in many viruses, with resistant strains emerging constantly, requires diversity in designing new drugs and in the optimization of the existing lead compounds. In silico methodologies are particularly suited to tackle this issue, because of their flexibility and increasing reliability. Furthermore, the constant validation and identification of novel biological targets and, because of the efforts of consortia such as the VIZIER [38], the rapidly increasing number of 3D structures available (currently >1,500 structures from different viruses are present in the Protein Data Bank [39]) are offering new exciting opportunities for the applications of molecular modelling in the design of the next generation of antivirals.
Footnotes
Acknowledgements
The authors are grateful to the Istituto Pasteur – Fondazione Cenci Bolognetti for supporting AC.
The authors declare no competing interests.
