Abstract
From 2003 to 2013, Indonesia had the highest number of avian influenza A cases in humans, with 192 cases and 160 fatalities. Avian influenza is caused by influenza virus type A, such as subtype H5N1. This virus has two glycoproteins: hemagglutinin and neuraminidase, which will become the primary target to be neutralized by vaccine. Vaccine is the most effective immunologic intervention. In this study, we use the epitope-based vaccine design from hemagglutinin and neuraminidase of H5N1 Indonesian strain virus by using immunoinformatics approach in order to predict the binding of B-cell and T-cell epitopes (class I and class II human leukocyte antigen [HLA]). BCPREDS was used to predict the B-cell epitope. Propred, Propred I, netMHCpan, and netMHCIIpan were used to predict the T-cell epitope. Two B-cell epitopes of hemagglutinin candidates and one B-cell epitope of neuraminidase candidates were obtained to bind T-cell CD4+ (class II HLA), and also five T-cell epitope hemagglutinin and four T-cell epitope neuraminidase were obtained to bind T-cell CD8+ (class I HLA). The visualization of epitopes was done using MOE 2008.10. It shows that the binding affinity of epitope–HLA was based on minimum binding free energy (ΔGbinding). Based on this result, visualization, and dynamic simulation, four hemagglutinin epitopes (MEKIVLLLA, CPYLGSPSF, KCQTPMGAI, and IGTSTLNQR) and two neuraminidase epitopes (NPNQKIITI and CYPDAGEIT) were computed as having the best binding affinity from HLA ligand. The results mentioned above are from in silico experiments and need to be validated using wet experiment.
Introduction
Avian influenza, also known as bird flu or avian flu, has become a global health issue in several countries and has spread from Asia to Africa, Middle East, and Europe continent, with an estimated 622 cases that occurred within 2003-2013 worldwide.1–3 According to the World Health Organization, in 2012, Indonesia had the highest mortality of avian influenza disease out of 15 countries, since it was first found in Banten in 2005, with 160 deaths out of 192 cases. It has spread out to 11 provinces, including West Java, Bali, North Sumatera, and South Sulawesi.4,5 While this disease has been a serious threat across the country, there are still no effective vaccines or antiviral drugs yet for combating this highly pathogenic disease.6,7 The development of new, efficient vaccines or drugs is urgently needed in order to decrease the mortality rate of avian influenza.
Avian influenza is caused by several subtypes of influenza type A viruses, such as H5N1, H5N2, and H7N9, which belong to the Orthomyxoviridae family.8–10 The influenza A virus genome has eight ssRNA segments that encode 10 proteins with two major surface glycoproteins: hemagglutinin and neuraminidase. 11 Hemagglutinin is the essential antigen that neutralizes human antibody and binds to the host cell receptor, and it mediates the sialic acid binding receptor's surface cell that initiates the virus infection on the host cell.12,13 Neuraminidase is a glycoprotein that is known for its role to catalyze the cleavage of the sialic acid bond from D-galactose or D-galactosamine residues by cutting off the α-keto acid bond between neuraminic acid and glycosyl residue of glycoprotein, glycolipid, or colominic acid. 14 The recent utilization of serological analysis has categorized influenza A viruses into 18 hemagglutinin subtypes (H1-H18) and 9 neuraminidase subtypes (N1-N9), which allow them to form 162 different kinds of influenza subtypes.15–19 Because of their vital role in the pathogenicity of the virus, hemagglutinin and neuraminidase have become primary targets in the vaccine design and development fields for neutralizing the avian influenza virus. 20
Usage of vaccine is one of the most effective immunological interventions in controlling the infection of H5N1 virus. To date, several H5N1 vaccine design and development efforts have been conducted, yet the effective, affordable vaccine has to be found.21,22 Most immunization has not provided sufficient protection from epitope antigens.23,24 An ideal vaccine should have humoral immune and cellular responses that can be used to trigger the B-cells and T-cells selectively. As in the inactivated vaccine, the vaccine provides only the humoral immune response and causes little or no cellular immune response. 23 The latest influenza vaccine has been developed by using the epitope-based approach. The epitope is an amino acid that binds to the antibodies, and it is located on the surface of the antigen. The specific epitope identification of pathogens has significantly improved the epitope-based vaccine design and development. 25
The epitope itself is related to human leukocyte antigen (HLA). Every individual has his or her own specific set of class I and class II HLA. 26 The epitope that has high binding affinity to several HLA alleles tends to be a potential candidate for epitope-based vaccine design. 27 The epitope-based vaccine has already been designed and tested in the scientific community.28,29 Hence, the ideal vaccine design should be able to cope with the challenge of antigenic drift. 30 The purpose of this study is to design a new epitope-based H5N1 vaccine that can bind to the HLA by using in silico approach. Therefore, the epitope-based vaccines of hemagglutinin and neuraminidase from H5N1 of Indonesian strain virus were designed to elicit the immune response of class I and class II HLA.
Research Methodology
The utilized pipeline for this research was modified and extended from the existing ones.31–33 The established pipelines were also taken into account.34,35 Moreover, our research is based on immunoinformatics approach that was already validated in the scientific community.36–39 Thus, immunoinformatics approach has been proven to be useful for examining the response of immune system toward pathogens.40–43 All of the experiment methods were done by using a personal computer with 2.1 GHz Intel Pentium Dual Core, RAM 1 GB, and Windows XP Professional Edition. The experiment was also conducted with the Internet support and several kinds of online and offline softwares.
Sequence of Hemagglutinin and Neuraminidase H5N1 Virus
The sequences of hemagglutinin and neuraminidase of H5N1 Indonesian strain were retrieved from Influenza Virus Resource of National Center Biotechnology Information website (http://www.ncbi.nlm.nih.gov/genomes/FLU/FLU.html/). 44
Prediction B-cell and T-cell Epitope Hemagglutinin and Neuraminidase H5N1 Virus
Following Velkov et al, the mapping of our epitopes has taken into account their tendency of antigenic drifting. The prediction of proteosomal cleavage H5N1 virus was analyzed by using PAPROC-I tools (http://www.paproc.de/), while the transport antigen presentation (TAP) binding was predicted by using TAPPRED server (http://www.imtech.res.in/raghava/tappred/).45,46 Propred and netMHCpan servers were used to identify the bond region on the class I HLA of the antigen.47,48
Visualization and Molecular Dynamics Using MOE 2008.10
The selected epitopes were further analyzed their conformation and stability by performing epitope visualization and molecular dynamics simulation, respectively. All of these steps were completed by using MOE 2008.10 software. 49 The epitope visualization itself was used to determine class I and class II HLA of the selected epitopes. The evaluation of the residues was conducted by counting the free energy minimization for the interactions between HLA and epitope. The 10 ns molecular dynamics simulation was executed in order to observe the interaction and stability between epitope and HLA. 50
Confirmation of the Origin of Hemagglutinin Epitopes
Following Velkov et al, the origin of Indonesian hemagglutinin epitopes was confirmed by using BLASTp tools (http://blast.ncbi.nlm.nih.gov/).
Results and Discussion
Hemagglutinin and Neuraminidase H5N1 Sequence
In this study, the sequences of Indonesian H5N1 virus, including both hemagglutinin and neuraminidase, were collected from National Center Biotechnology Information (GenBank Nos BAL61222.1 for hemagglutinin and BAL61230.1 for neuraminidase). The result of Basic Local Alignment Search Tool indicated that all of the Indonesian H5N1 virus sequences showed a high similarity, from 97% to 100%. Group A viruses have the highest percentage of homology with chicken virus isolated in Indonesia in 2010. 51
B-cell Epitope Prediction
An antibody would bind to a specific epitope on the surface of the virus. When a virus is destroyed by macrophage, it would facilitate epitope binding. 52 In this study, the epitope prediction of H5N1 hemagglutinin and neuraminidase of the B-cells was done through BCPREDS 1.0 at the following website address: http://ailab.cs.iastate.edu/bcpreds/. The peptide sequence is the required input, and the output will provide information about the position of the peptide sequences along with scores based on its accessibility. The analysis was performed every nine amino acids (nonamer) on a series of peptide sequences with a specificity of 80%. Accessibility is very important because if the score is increasing, the epitope peptide will be having a great accessibility to be recognized by B-cells or bind to the antibody. Table 1 is the predicted epitope of H5N1 hemagglutinin that has accessibility scores against B-cell epitope prediction from the server BCPREDS. It predicts 15 hemagglutinin epitope candidates. They will be expressed in the CD4+ T-cells and bind to class II HLA on the IGTSTLNQR and MVSLVKSDQ.
Results of hemagglutinin epitope prediction against B-cells.
Table 2 shows H5N1 neuraminidase epitope that has an accessibility score against B-cells. A total of 17 candidates of neuraminidase epitope derived from the B-cell epitope prediction. Epitopes will be expressed with the introduction of the CD4+ T-cells and bind to class II HLA on the YNGIITDTI.
Results of neuraminidase epitope prediction against B-cells.
Humoral immune systems have an important role in relieving the infection of H5N1 avian influenza virus. During the process of infection, antibodies are produced for all the major glycoproteins (hemagglutinin and neuraminidase) on the surface of the H5N1 virus. Specific antibodies to hemagglutinin are very important for virus neutralization at the mucosal surface by blocking the entry of the virus into the cell. Specific antibodies for neuraminidase are effective in reducing the release of virus from infected cells. Antibodies bind to a particular epitope on the surface of the virus and act as opsonins that facilitate the binding and virus destruction by macrophages.
B-cell epitope candidates of hemagglutinin H5N1 were recognized by the receptor of CD4+ T-cell and bound with class II HLA, which were IGTSTLNQR and MVSLVKSDQ. Furthermore, YNGIITDTI was an epitope candidate of neuraminidase for B-cell.
T-cell Epitope-bound Class I HLA Prediction
Proteasome is a key factor in the degradation of cytosolic proteins in which the C-terminal ligand of class I HLA is selected by proteome. 45 In defining the epitope of hemagglutinin and neuraminidase H5N1 that can binds to class I HLA, the protein sequences were selected by using PaProC server to see the sequences that can be interrupted by proteome. In general, the effective epitope antigen should have a sequence that do not split the proteome and can be transported by TAP. 27 In this study, we use TAPpred server for predicting the sequences of the effective epitope antigen. Data analysis results of the epitope of hemagglutinin and neuraminidase with TAP are shown in Tables 3 and 4, respectively.
The analyzed results of TAP (transport antigen presentation) on H5N1 virus hemagglutinin by using TAPpred server.
The anaiyzed results of TAP (transport antigen presentation) on H5N1 virus neuraminidase by using TAPpred server.
The immune response is induced by T-cell epitope, which has an important role in vaccine designing. T-cell epitopes are presented on the surface of an antigen-presenting cell, which are bound to class I and II HLA. 53 After selecting a sequence of hemagglutinin and neuraminidase, we proceeded with the determination of epitope potential vaccine candidates. In this study, the determination of hemagglutinin and neuraminidase H5N1 epitopes on class I HLA has been done and analyzed by using two servers’ prediction. The first prediction is netMHCpan server version 2.4 with 4% threshold that can be accessed at the website address of http://www.cbs.dtu.dk/services/NetMHCpan/. This server predicts the potential epitope binding to class I HLA of supertypes A and B, so it can be recognized by the receptor CD8+ T-cells. This method consists of 326 HLA, divided into 43 different alleles at the HLA-A and HLA-B. 54 In addition, class I HLA-bound epitopes were also predicted using the ProPred 1 server on the threshold of 4%, which can be accessed online at the website address of http://www.imtech.res.in/raghava/propred1/. ProPred 1 server is derived from TEPITOPE, namely a matrix program position. The server was developed for 47 class I HLA with an algorithm based on the linear coefficient matrix. 48
Table 5 shows that there are five hemagglutinin epitope candidates bound by CD8+ T-cell, such as MEKIVLLA, EKIVLLAM, CPYLGSPSF, KCQTPMGAI, and KAVDGVTNK. Otherwise, there are four neuraminidase epitope candidates bound to class I HLA: NPNQKIITI, CYPDAGEIT, IRPCFWVEL, and RPCFWVELI (Table 6).
The prediction results of hemagglutinin epitopes toward class I HLA.
The prediction result of neuraminidase epitopes toward class I HLA.
T-cell Epitope-bound Class II HLA Prediction
Any peptide molecule that can bind to class II HLA will be presented on CD4+ T-cells, which is very important in the regulation of the B-cell responses and CD8+ T-cells. 26 If the response of CD4+ does not exist, then the quality of the cytotoxic response to the antigen administration gradually declined and failed to respond effectively. 55 The determination of hemagglutinin and neuraminidase epitopes on class II HLA has been done by using two prediction servers. The first one is netMHCIIpan version 2.1 with 4% threshold that can be accessed at the website address of http://www.cbs.dtu.dk/services/NetMHCIIpan/. This server predicts epitope binding to class II HLA for two supertypes of DRB (antigen D Related, Beta chain), so it can be recognized by receptors CD4+ T-cells. The selection of these alleles are most often found in Asian populations, especially in the parts of Malay distribution. 56 NetMHCIIpan method was used to evaluate the epitope binding to class II HLA with a measuring accuracy of prediction on the area under the receiver operating characteristic curve (AROC), which is the curve for evaluating the accuracy of diagnostic markers. AROC values that are close or equal to one indicate that the marker has a high diagnostic accuracy. 57 netMHCpan method has been identified as the best predictive value with AROC >0.9. Epitope-binding affinity to HLA-DR alleles was analyzed by netMHCIIpan. Identification of potential epitope is classified as a strong binder, weak binder, and no binder to choose the alleles of class II HLA. Selection is based on the binding affinity threshold of [X] ≤ 50 nM, 50 nM < [X] ≤ 500 nM, and [X] . 500 nM. 58
The subsequent prediction was done using the ProPred server with 4% threshold that can be accessed online at the website address of http://www.imtech.res.in/raghava/propred/. The server was developed for 51 HLA-DR alleles by extracting the database matrix. 48 Prediction of peptides that bind to class II HLA by using the ProPred server in the development of vaccines for Leptospira interrogans serovar Lai was already in place. 59 By using ProPred and netMHCpan II, there are two hemagglutinin epitope candidates bound by CD4+ T-cell (class II HLA), MVSLVKSDQ and IGTSTLNQR, and also one neuraminidase epitope candidate, YNGIITDTI (Tables 7 and 8).
The prediction results of hemagglutinin epitopes toward class II HLA
The prediction results of neuraminidase epitopes toward class II HLA.
Visualization and Molecular Dynamics of Hemagglutinin and Neuraminidase Epitope Using MOE 2008.10
The visualization was useful to predict the binding free energy from class I and class II HLA–epitope of hemagglutinin and neuraminidase. Position and affinity binding from class I and class II HLA–epitope of hemagglutinin and neuraminidase are shown in Tables 9 and 10, respectively.
The position and affinity of hemagglutinin epitope H5N1-bound class I and class II HLA.
The position and affinity of neuraminidase epitope H5N1-bound class I and class II HLA.
For hemagglutinin epitope-bound class I HLA, the weak binder is EKIVLLLAM, which has AG of -22.4538 kcal/mol, and KCQTPMGAI is one of the strong binder candidates, which has ΔG of -33.6109 kcal/mol (Fig. 1, the description of legend could be observed in the Supplementary material here: http://staff.ui.ac.id/system/files/users/aditya.parikesit/material/supplementary_material_feimmy.pdf). The interaction between class II HLA and hemagglutinin epitope has the weak binder candidate (MVSLVKSDQ) with ΔG of -11.7756 kcal/mol and the strong binder candidate (IGTSTLNQR) with ΔG of -56.9580 kcal/mol (Fig. 2).

The visualization of KCQTPMGAI with HLA B*0702 PDB: 3VCL (MOE 2008.10). 62

The visualization of IGTSTLNQR with HLA DRB1*0301 PDB: 1A6A (MQE 2008.10). 63
For neuraminidase, there are two epitope candidates, which have strong interactions, such as CYPDAGEIT and NPNQKIITI. The strongest interaction between epitope and class I HLA is NPNQKIITI (ΔG -43.5280 kcal/mol) (Fig. 3). The KCQTPMGAI epitope was visualized because of its role as strong binder candidate for class I HLA, while NPNQKIITI was visualized as having the strongest interaction with class I HLA.

The visualization of NPNQKIITI with HLA B*5101 PDB: 1E27 (MQE 2008.10). 64
Figure 4 shows epitope–HLA interaction in the initialization process of 0-100 ps that fluctuates and stabilizes in more than 100 ps. It tends to be in a linear form graphically. The variation of the curve form is determined for three epitopes of hemagglutinin (MEKIVLLLA, CPYLGSPSF, and IGTSTLNQR) and one epitope of neuraminidase (CYPDAGEIT) with high fluctuation. It is indicated that epitope–HLA structure conformity changes during the time of forming. The curve becomes linear on more than 100 ps, confirming that the epitope–HLA structure conformity has not changed significantly. The IGTSTLNQR epitope was included both in molecular dynamics and in visualization due its tendency as having the most spontaneous ΔGbinding of all epitopes and acting as both T-cell and B-cell epitopes. The CYPDAGEIT epitope was included in the molecular dynamics simulation because of its role as the strongest binding neuraminidase-based epitope. Concerning the molecular dynamics of hemagglutinin epitopes, out from the data range, MEKIVLLA was taken as the mean value, CPYLGSPSF was the minimum value, and IGTSTLNQR was the maximum value.

The molecular dynamics simulation of MEKIVLLLA, CPYLGSPSF, IGTSTLNQR, and CYPDAGEIT epitopes by constructing the plot of RMSD (in nm) vs t (in ps).
In order to confirm the homology of our epitopes with the marketed vaccine, CLUSTAL was applied to seek homology with hemagglutinin and neuraminidase of H5N1 NIBRG-14 strain. 60 Based on the CLUSTAL results, the hemagglutinin epitopes of MEKIVLLLA, CPYLGSPSF, and IGTSTLNQR were confirmed to be highly conserved with hemagglutinin of H5N1 NIBRG-14 strain that commonly used in the wet experimentation of vaccine development. Thus, the neuraminidase epitope of CYPDAGEIT was confirmed to be highly conserved with neuraminidase of H5N1 NIBRG-14 strain also.
Confirmation of the Origin of Hemagglutinin Epitopes
Following Velkov et al and Basic Local Alignment Search Tool computation, the epitope of CPYLGSPSF was derived from hemagglutinin of influenza A virus (A/Indonesia/TLL014/2006(H5N1)), the epitope of KCQTPMGAI was derived from borne transmissible avian influenza H5 hemagglutinin mutant from the influenza A virus (A/Indonesia/5/2005(H5N1)), and the epitope of IGTSTLNQR was derived from hemagglutinin influenza A virus (A/Indonesia/TLL014/2006(H5N1)). Thus, the other hemagglutinin epitopes were not detected as part of H5N1 of Indonesian strain as mentioned earlier. 61 ).
Conclusion
There were four epitopes of hemagglutinin and two epitopes of neuraminidase, based on visualization and dynamics molecular simulation that have high binding affinity to several human HLA. The epitopes were MEKIVLLLA (ΔGbinding -40.0148 kcal/mol), CPYLGSPSF (ΔGbinding -28.0471 kcal/mol), KCQTPMGAI (ΔGbinding -33.6109 kcal/mol), IGTSTLNQR (ΔGbinding -56.9580 kcal/mol), NPNQKIITI (ΔGbinding -23.8779 kcal/mol), and CYPDAGEIT (ΔGbinding -33.0200 kcal/mol). Most of the hemagglutinin epitopes were derived from H5N1 of Indonesian strain. Both hemagglutinin and neuraminidase epitopes were confirmed to have a high degree of homology with the H5N1 NIBRG-14 strain that is commonly used in vaccine development.
The evaluation of epitopes based on dynamics molecular simulation on normal human body temperature confirms that the epitope–HLA structure conformity has not changed significantly. Therefore, it is concluded that the epitopes can be improved as H5N1 vaccine candidate. We also suggest testing this potential epitopes through in vitro and in vivo studies to validate the result of this study.
Author Contributions
Supervised this research: USFT, AAP. Worked on the technical details: FRPS. Suggested the improvement of our pipelines: DK. All authors reviewed and approved the final article.
Footnotes
Acknowledgments
The authors thank Abdul Hakim from Statistics Study Program, Faculty of Mathematics and Science, University of Indonesia, and Mochammad Arfin Fardiansyah Nasution from Bioinformatics Research Group, Faculty of Mathematics and Science, University of Indonesia, for the excellent proofreading of this article.
