Advanced in silico Approaches in Antiviral Research

Abstract

Computer-aided drug design has seen constantly increasing application over the past two decades in every area of drug discovery. It can offer significant advantages over conventional approaches, being far less expensive and faster than conventional methods, or offering the possibility to predict molecular behaviours that cannot be elucidated in any other way. Recent developments in software and hardware make it possible to simulate increasingly complex molecular environments, widening the applicability of in silico studies from the interactions of small molecules with key protein residues, to the simulation of the dynamic evolution of complex biological systems with atomic resolution. Antiviral research offers several open challenges, from a biological, biochemical and pharmaceutical point of view. Computational approaches are already providing some answers and will undoubtedly give more in the near future. Here, we present a brief overview of the cutting-edge computational methods that play a major role in present and future antiviral research.

Introduction

Molecular modelling techniques have been playing an increasingly crucial role in the search for new drugs and their optimization, in every area of drug design. Although they were initially confined to the visualization of potential drug interactions in targeted enzyme binding, these techniques are now being applied by many research groups worldwide as a powerful tool to help define the relationship between biological activities, binding geometries and mechanisms of action in physiological or pathological processes. This is clear from the increasing number of literature sources concerned with in silico approaches and the boost in the number of molecular modelling packages, both commercial and non-commercial, available to researchers [1].

The reason for this boost is threefold: the improvement in the mathematical models that describe chemical phenomena, which grants higher-precision results; the increase in the number of proteins with known three-dimensional (3D) structure solved through crystallographic or NMR experiments; and the development of cheaper and more powerful hardware. Moreover, the adoption of more intuitive programme interfaces and ergonomic devices, such as haptic styli, has increased the interactivity of the user with the machines [2]. The large range of applications that computer-aided drug design (CADD) methods have acquired confirms that CADD techniques will play a role of increasing importance in drug discovery in the future. The current software is not only able to predict the binding conformation of a small molecule inside a biological target or estimate its binding affinity, but is also used in predicting ADMET characteristics and physiochemical properties, such as solubility and chemical stability, and can help when studying protein interactions and folding [3,4].

The application of in silico technologies to antiviral research has already led to the design of several compounds approved by the US Food and Drug Administration and now used in clinical therapy [5]. Indeed, one of the earliest successes of these techniques is represented by the discovery of zanamivir [6]. Palese and Schulman [7] postulated that the sialidase enzyme was fundamental for influenza virus pathogenesis in humans, although the first inhibitors of that enzyme failed to demonstrate their efficacy as anti-influenza agents. Only after the determination of the 3D structure of sialidase [7 –9], did the opportunity arise to design new selective inhibitors by using structure-based approaches. Despite the initial limited commercial success of zanamivir, the CADD strategies employed in its development were important first steps in the development of further sialidase inhibitors.

At present, many antiviral drug discovery programmes include a molecular modelling component and we here give a brief overview on the main computational methods currently used.

Discussion

CADD approaches are many and diverse, and their thorough classification is beyond the aims of this review. CADD applications can be broadly classified as structure-based or ligand-based. The former approach is used in the presence of 3D information of the target (generally derived from crystallographic or NMR studies). The latter methodology is applied when the 3D structure of the target is not known, and information about the binding mode is derived from a series of related active compounds.

Structure-based approaches

Molecular docking is the most commonly used computational technique when the 3D structure of the desired target is available. This methodology tries to predict the geometry of the complex of two molecular entities (usually a small molecule and its target) starting from their individual structures. This is a fast technique and it allows the screening in silico of large compound libraries in a relatively short time. The use of such virtual screening (VS) approaches is increasing constantly, often associated with pharmacophore filtering and other refinement methodologies. Several examples from the antiviral field on the use of VS are available in the literature and they cover a variety of protocols. A few recent examples include screening of the freely available ZINC library against the H5N1 haemagglutinin [10]; screening NCI libraries against the dengue virus E protein [11]; and ensemble-based VS on avian flu neuraminidase [12].

Another notable example is represented in the work reported by Pérez-Nueno et al. [13] on the HIV entry inhibitors. In this study, different VS approaches were compared and it should be noted that the structure of the cellular receptors targeted by this class of inhibitor has not been solved to date. However, the authors were able to build a model for these receptors in silico, by homology modelling using bovine rhodopsin as structural template.

An interesting evolution of traditional VS techniques is the so called ‘steered’ VS. This is an attempt to take into account information about the protein and the ligands in the simulation. For example, the software ProPose [14] implements a general method for performing simultaneous protein–ligand docking, ligand–ligand alignment and pharmacophore queries with the aim to incorporate all a priori knowledge into screening protocol. With this particular procedure of steered VS it is possible to take into account the flexibility of the receptor, using more than one 3D structure, and the ligand data, using different alignment rules and pharmacophore queries, thus improving the enrichment factors and overcoming the problem of weak-binding or non-binding false-positive ligands. The application of steered VS to herpes simplex virus thymidine kinase with the possibility to consider chemical, biological and structural factors in the early steps of VS, has given the opportunity to identify a much better starting point for lead discovery and optimization [15].

Another exciting field of application of VS might be found in the inhibition of protein–protein interactions [16]. Protein–protein binding involves the interaction of a small number of residues on the exposed surface of both binding partners, called hot-spots, the identification of which can be obtained by alanine scanning or other mutagenesis studies. These kinds of interactions are at the base of most cellular pathways and enzymatic control mechanisms, and are involved in virtually every physiological and pathological process. By interacting with known hot-spots on a protein surface, a small molecule could inhabit the protein–protein interactions by preventing the binding of the two protein partners. Therefore, protein–protein interaction inhibition has a great therapeutic potential. This method is producing significant results in anti-cancer research [17,18], and its application to the antiviral area could achieve the same important results. In this regard, the work of Betzi et al. [19] on the SH3 binding surface of the HIV type-1 Nef protein is interesting. In that paper, a virtual screening on 1,420 compounds filtered by a docking procedure and then by a pharmacophore model built using the hot-spot knowledge is reported [19]. Remarkably, all the screened compounds were also evaluated by a biological assay, and the selected leads from VS and highthroughput screening were the same. The results of the use of the VS method in the identification of the small molecules as protein–protein interaction inhibitors led to the discovery of the first two compounds that bind the HIV type-1 Nef SH3 in the micromolar range.

Protein–protein interactions were also at the core of the work reported by De Luca et al. [20]. However, in this case, the interaction targeted was one between a viral protein (HIV integrase) and a cellular cofactor (LEDGF/p75). Interestingly, the VS simulation was performed using a structure-based pharmacophore.

De novo drug design is another interesting structure-based methodology. This approach can be described as the process of producing novel molecular structures with desired pharmacological properties, using the structural information of the biological target [21]. It is a fascinating field, but its use has been somehow restricted by some inherent limitations of the technique, in particular the inability of computers to accurately assess the chemical feasibility of the proposed structures. For this reason, an extensive user intervention in the design process is often required, as is noted in the discovery of novel HCV helicase inhibitors that we have recently reported [22].

It is now common practice to combine different in silico approaches instead of relying on a single technique to generate more accurate results. Indeed, strong positive results have been obtained where a combination of different experimental and in silico techniques were used in conjunction. The discovery of HIV protease inhibitors is an excellent example of this combined approach: crystallography, molecular docking and de novo simulations allowed a very rapid development of a class of compounds that are now extremely important in the HIV therapy [23 –25].

Molecular dynamics (MD) simulations are a further example of structure-based approaches. The aim of MD is the prediction of the dynamic evolution of a molecular system over time. This is of key importance in the study of processes that involve conformational changes or in the evaluation of molecular binding interactions and energies. This is a computationally expensive technique, but constantly improving computer speed gives the possibility to use more time-consuming approaches such as meta- and quantum mechanics molecular dynamics (QM-MD) on ligand–receptor complexes and also on ternary complexes (for example ligand receptor nucleic acids or ligand receptor cofactors) [26]. These MD methods provide the opportunity to analyse conformational modifications over time and the interactions between atoms and the stability of the targeted complex (ΔG and K_eq calculation). MD simulations find application in the elucidation of several aspects of drug activity. For instance, MD can be helpful in understanding, from a structural point of view, the reasons why the mutation of a particular residue in the binding site of a targeted receptor gives resistance to an inhibitor. Furthermore, QM-MD could be useful to evaluate the polarization energy of all the residues of the binding site of a targeted inhibitor–enzyme complex. This, in turn, means the possibility to know, with a high rate of accuracy, the contributions of individual binding site residues to the enzyme–inhibitor interactions. A recent paper reports the application of a QM-MD analysis on three high affinity HIV protease inhibitors (nelfinavir, mozenavir and tipranavir) [27]. Hensen et al. [27] showed, by using electrostatic energy analysis on the inhibitors and on the binding site residues, that the two specific amino acids (Asp25 and Ile50) of both polypeptide chains play a crucial role in the inhibitor binding and that the 4-hydroxy-dyhydropyrone of tipranavir gives the strongest polarization effects as a result of a stronger binding force of this moiety. It is easy to imagine the importance of this kind of information in the design process of novel biologically active molecules. Although QM-MD analysis and the computation of the polarization energy can not be applied to a large number of compounds, it could be very useful, (for example, after a VS, when the number of scaffolds is limited) to identify the core moiety with the higher probability to become a lead.

Ligand-based approaches

Quantitative structure–activity relationship (QSAR) predictions are possibly the most significant example of ligand-based drug design. This approach involves establishing a mathematical function that correlates biological data (such as 50% inhibitory concentration values) of a series of compounds with the properties of those molecules (descriptors). More than 50 years ago, Corwin Hansch, created the first QSAR model quantifying a relationship between three physical–chemical properties and the biological activities of a molecular set and, since then, QSAR calculations have been applied to a wide range of projects [28]. The number and identity of the descriptors used in QSAR can be very different and have varied significantly over the years. The understanding of the important role played by the spatial geometry of the compounds and the adoption of 3D descriptors was the key to the development of the more advanced 3D QSAR, which allows the analysis of both ligand binding conformations and directional forces (for example, hydrogen bonds). Although 3D QSAR has scored several notable successes and many exciting discoveries that have been obtained by it exist, the determination of the correct bioactive conformation, the protonation state and the alignment rules are not free of bias [29]. Four-dimensional and five-dimensional QSAR methods are a recent development, and could offer a convincing answer to these limitations: four-dimensional QSAR models include a sampling of the different conformations accessible to a compound, as well as its protonation states, whereas five-dimensional QSAR allows the model to include induced fit effects on the receptor as well. The application of these new QSAR methods can offer a new lead optimization strategy and some interesting results in antiviral research have been reported. Their application to HIV reverse transcriptase [30,31] and protease [32] inhibitors provided information that was not obtainable by other methods, confirming the crucial role of these tools and of CADD methods in general as a powerful way to identify a lead compound in a quicker and more cost-effective way than physically synthesizing and testing it.

Alignment and superposition of small molecules is also at the core of another ligand-based set of methodologies based on the concept of shape recognition [33]. The rationale is that molecules that interact with the same target might have a certain degree of shape similarity. Furthermore, in addition to the simple structure superposition, a series of molecular descriptors, such as cLogP or pKa, can be considered, generating a rich set of information that can be used for drug design. Some examples of applications of these approaches in the antiviral field available in the literature include a work on HIV type-1 non-nucleoside reverse transcriptase inhibitors [34] and a study on HIV protease inhibitors [35]. Recently, a novel non-superposition-based shape recognition methodology, which has been validated also using a set of neuraminidase inhibitors, has been reported [36]. This approach proved to be considerably faster than other shape-based methods and it could be extremely useful in ligand-based VS projects.

Conclusions

Drug discovery is a time-consuming and expensive process, with a high failure rate. For these reasons new drug design approaches and methodologies are constantly developed in order to reduce the time and costs of this process. In particular, in recent years, interest has increasingly been focused on computer-based techniques and molecular modelling to design and virtually evaluate novel potential drugs on a computer before being prepared in a laboratory. Large and free databases such as the ZINC library [37] can now be tested in a VS simulation on a desktop machine in a matter of days. Computer-based drug design has already been used successfully in the antiviral field, as the discovery of Relenza and the development of several HIV protease inhibitors demonstrate, and it is easy to predict that these methodologies will continue to have a significant effect in future discoveries. The fast-paced adaptation that occurs in many viruses, with resistant strains emerging constantly, requires diversity in designing new drugs and in the optimization of the existing lead compounds. In silico methodologies are particularly suited to tackle this issue, because of their flexibility and increasing reliability. Furthermore, the constant validation and identification of novel biological targets and, because of the efforts of consortia such as the VIZIER [38], the rapidly increasing number of 3D structures available (currently >1,500 structures from different viruses are present in the Protein Data Bank [39]) are offering new exciting opportunities for the applications of molecular modelling in the design of the next generation of antivirals.

Footnotes

Acknowledgements

The authors are grateful to the Istituto Pasteur – Fondazione Cenci Bolognetti for supporting AC.

The authors declare no competing interests.

References

Jorgensen

. Efficient drug lead discovery and optimization. Acc Chem Res 2009;42:724–733.

Zonta

Grimstead

Avis

Brancale

. Accessible haptic technology for drug design applications. J Mol Model 2009;15:193–196.

Hutter

. In silico prediction of drug properties. Curr Med Chem 2009;16:189–202.

Zhang

Wang

Protein folding simulations: From coarse-grained model to all-atom model. IUBMB Life 2009;61:627–643.

Congreve

Murray

Blundell

. Structural biology and drug discovery. Drug Discov Today 2005;10:895–907.

von Itzstein

Kok

Rational design of potent sialidase-based inhibitors of influenza virus replication. Nature 1993;363:418–423.

Palese

Schulman

. Inhibitors of viral neuraminidase as potential antiviral drugs. In Oxford

(Editor). Chemoprophylaxix and virus infection of the upper respiratory tract. Vol. 1. Cleveland: CRC Press 1997; pp. 189–205.

Varghese

Colman

. Structure of the influenza virus glycoprotein antigen neuraminidase at 2.9 Å resolution. Nature 1983;303:35–40.

Varghese

Colman

. Structural basis of antigenic variation: Studies of influenza virus neuraminidase. J Mol Biol 1991;221:473–486.

10.

Nandi

. Proposed lead molecules against hemagglutinin of avian influenza virus (H5N1). Bioinformation 2008;2:240–244.

11.

Zhou

Khaliq

Suk

Antiviral compounds discovered by virtual screening of small-molecule libraries against dengue virus E protein. ACS Chem Biol 2008;3:765–775.

12.

Cheng

Amaro

Ensemble-based virtual screening reveals potential novel antiviral compounds for avian influenza neuraminidase. J Med Chem 2008;51:3878–3894.

13.

Pérez-Nueno

Ritchie

Rabal

Comparison of ligand-based and receptor-based virtual screening of HIV entry inhibitors for the CXCR4 and CCR5 receptors using 3D ligand shape matching and ligand-receptor docking. J Chem Inf Model 2008;48:509–533.

14.

Seifert

MHJ

Schmitt

Herz

Kramer

. ProPose: A docking engine based on a fully configurable protein-ligand interaction model. J Mol Model 2004;10:342–357.

15.

Seifert

MHJ

. ProPose: Steered virtual screening by simultaneous protein–ligand docking and ligand–ligand alignment. J Chem Inf Model 2005;45:449–460.

16.

Arkin

Wells

. Small-molecule inhibitors of protein–protein interactions: Progressing towards the dream. Nat Rev Drug Discov 2004;3:301–317.

17.

Trosset

Dalvit

Knapp

Inhibition of protein–protein interactions: The discovery of drug-like catenin inhibitors by combining virtual and biophysical screening. Proteins 2006;64:60–67.

18.

Clackson

Wells

. A hot spot of binding energy in a hormone-receptor interface. Science 1995;267:383–386.

19.

Betzi

Restouin

Opi

Protein–protein interaction inhibition (2P2I) combining high throughput and virtual screening: Application to the HIV-1 nef protein. Proc Natl Acad Sci U S A 2007;104:19256–19261.

20.

De Luca

Barreca

Ferro

. Pharmacophore-based discovery of small-molecule inhibitors of protein–protein interactions between HIV-1 integrase and cellular cofactor LEDGF/p75. ChemMedChem 2009;4:1311–1316.

21.

Schneider

Fechner

. Computer-based de novo design of drug-like molecules. Nat Rev Drug Discov 2005;4:649–663.

22.

Kandil

Biondaro

Vlachakis

Discovery of a novel HCV helicase inhibitor by a de novo drug design approach. Bioorg Med Chem Lett 2009;19:2935–2937.

23.

Wlodawer

Vondrasek

. Inhibitors of HIV-1 protease: A major success of structure-assisted drug design. Annu Rev Biophys Biomol Struct 1998;27:249–284.

24.

Gehlhaar

Moerder

Zichi

Sherman

Ogden

Freer

. De novo design of enzyme inhibitors by Monte Carlo ligand generation. J Med Chem 1995;38:466–472.

25.

Lebon

Ledecq

. Approaches to the design of effective HIV-1 protease inhibitors. Curr Med Chem 2000;7:455–477.

26.

Karplus

McCammon

. Molecular dynamics simulations of biomolecules. Nat Struct Biol 2002;9:646–652.

27.

Hensen

Hermann

Nam

A combined QM/MM approach to protein-ligand interactions: Polarization effects of the HIV-1 protease on selected high affinity inhibitors. J Med Chem 2004;47:6673–6680.

28.

Gedeck

Lewis

. Exploiting QSAR models in lead optimization. Curr Opin Drug Discov Devel 2008;11:569–575.

29.

Vedani

Dobler

. Multidimensional QSAR: Moving from three- to five-dimensional concepts. Quant Struct Act Relat 2002;21:383–391.

30.

Hopfinger

Wang

Tokarski

Construction of 3D-QSAR models using the 4D-QSAR analysis formalism. J Am Chem Soc 1997;119:10509–10524.

31.

Bak

Polanski

. A 4-D QSAR study on anti-HIV HEPT analogues. Bioorg Med Chem 2006;14:273–279.

32.

Santos-Filho

Hopfinger

. The 4D-QSAR paradigm: Application to a novel set of non-peptidic HIV protease inhibitors. Quant Struct Act Relat 2002;21:369–381.

33.

Putta

Beroza

. Shapes of things: Computer modeling of molecular shape in drug discovery. Curr Top Med Chem 2007;7:1514–1524.

34.

Mestres

Rohrer

Maggiora

. A molecular-field-based similarity study of non-nucleoside HIV-1 reverse transcriptase inhibitors. J Comput Aided Mol Des 1999;13:79–93.

35.

Zauhar

Moyna

Tian

Welsh

. Shape signatures: A new approach to computer-aided ligand- and receptor-based drug design. J Med Chem 2003;46:5674–5690.

36.

Ballester

Finn

Richards

. Ultrafast shape recognition: Evaluating a new ligand-based virtual screening technology. J Mol Graph Model 2009;27:836–845.

37.

Irwin

Shoichet

. ZINC – a free database of commercially available compounds for virtual screening. J Chem Inf Model 2005;45:177–182.

38.

Coutard

Gorbalenya

Snijder

The VIZIER project: Preparedness against pathogenic RNA viruses. Antiviral Res 2008;78:37–46.

39.

The Protein Data Bank. (Updated 26 January 2010. Accessed 28 January 2010.) Available from http://www.pdb.org.