Abstract
Shigellosis remains a major global health concern, particularly in regions with poor sanitation and limited access to clean water. This study used immunoinformatics and reverse vaccinology to design a potential mRNA vaccine targeting
Introduction
Bacillary dysentery, caused by the gram-negative bacterium
Given the pressing need for an effective and broadly protective
Materials and Methods
Retrieval of protein sequence
We obtained the amino acid sequences of

A methodological flow diagram of in silico vaccine design against common
Prediction of exoproteome, secretome, and their essentiality
Using the entire proteome of
Prediction of virulent proteins and transmembrane helices
VirulenPred 43 (http://crdd.osdd.net/raghava/virulenpred/) was used to evaluate the exoproteome and secretome for virulence. We were careful to choose just potentially harmful proteins. In addition, we only chose proteins that included at least one predicted transmembrane helix using TMHMM 46 (http://www.cbs.dtu.dk/services/TMHMM/).
Determination of human homologs and molecular weight estimation
To see whether any of the virulence proteins have human homologs, we ran a BLASTp 44 search (https://blast.ncbi.nlm.nih.gov/Blast.cgi) against the human proteome. Any protein similar to a human protein (35%) was ruled out. Using the Expasy tool (https://web.expasy.org/compute_pi/), 45 we assessed the molecular weight of the remaining proteins and chose only those with molecular weights of less than 110 kDa.
Protein antigenicity, maturation, and conservation analysis with various Shigella pathotypes
We assessed the antigenicity of proteins using Vaxijen V 2.049
47
(http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html), categorizing those with antigenicity values higher than 0.4. Eleven
Screening for B- and T-cell epitopes
Epitopes for both cytotoxic T lymphocytes (CTLs) and helper T lymphocytes (HTLs) were predicted using the immune epitope database (IEDB) website 63 (http://www.iedb.org/). We predicted epitopes recognized by CTLs that were shared by the HLA allele reference set that covered more than 97% of the population. Antigenicity, allergenicity, toxicity, conservancy, immunogenicity, and population coverage were determined by profiling the top 100 peptides. We employed the IEDB server’s conservancy tool (http://tools.iedb.org/conservancy), immunogenicity tool (http://tools.iedb.org/immunogenicity), and population coverage tool (http://tools.iedb.org/population). Antigenicity, toxicity, and allergenicity predictions were made using the VaxiJen (http://www.ddgpharmfac.net/vaxijen/VaxiJen/VaxiJen.html), ToxinPred2, and AllerTOP v.2.0 (https://www.ddg-pharmfac.net/AllerTOP/index.html) software packages. Based on the lowest percentile rank, maximum antigenicity, immunogenicity, and binding affinity, we picked the top 10 peptides that fulfilled all criteria. HTL epitope predictions were made using an HLA allele reference collection with above 99% population coverage. Antigenicity, allergenicity, toxicity, conservancy, induction of interferon-gamma (IFN-γ), interleukin-4 (IL-4), and interleukin-10 (IL-10) as well as population coverage criteria were used to identify the top 100 peptides for profiling. Online resources such as IFNepitope (https://webs.iiitd.edu.in/raghava/ifnepitope/predict.php), IL4pred (https://webs.iiitd.edu.in/raghava/il4pred/index.php), and IL-10Pred (https://webs.iiitd.edu.in/raghava/il10pred/predict) were used for cytokine prediction. The peptides with the greatest antigenicity binding affinity and the lowest percentile rank were chosen as the top 10. We also used docking research using Autodock Vina 64 (https://vina.scripps.edu/) to evaluate the binding affinity of the potential CTLs and HTLs with their corresponding HLA alleles. To do this, the PEP FOLD 3 webserver 65 was used to predict the 3D structure of each peptide. Concurrently, the 3-dimensional structures of HLA-A02:01 (PDB ID 4u6y) and HLA-DRB104:01 (PDB ID 5lax) were obtained from the protein data bank to serve as receptors for MHC-I and MHC-II epitopes, respectively. Linear B-cell epitopes were predicted using the IEDB server’s (http://www.iedb.org/) Bepipred Linear Epitope Prediction 2.0, Emini Surface Accessibility Prediction, and Kolaskar and Tongaonkar Antigenicity tools. Antigenicity ratings greater than 0.4 were used to identify peptides with a length between 8 and 20 amino acids. We tested the selected peptides for toxicity and allergenicity using the same servers as previously for single epitope estimation.
Homology modeling and epitope mapping
The 3D structures of the 4 target proteins were modeled using SWISS-MODEL. The highest-quality templates were selected based on sequence identity (>80%), and GMQE (>.50). Structural validation was performed using PROCHECK (Ramachandran plot analysis). After validation, epitope regions were mapped onto the surface of the modeled proteins. PyMOL and Discovery Studio were used to visualize the predicted epitopes, allowing assessment of their spatial distribution, surface accessibility, and potential immunogenic properties. Selected epitopes (CTL, HTL, and B-cell) were color-coded to visualize their locations on each protein.
Design and construction of multi-epitope vaccine candidate
We used GGGS, GPGPG, and KK amino acid linkers to join the top 24 CTL, HTL, and B-cell epitope candidates from the epitope prediction stage to create a possible multitope vaccination. To complete the constructed vaccine, we included the PADRE sequence and defensin adjuvant in addition to the epitopes. Using the same servers as before for single epitope estimate, we analyzed the vaccine candidate’s antigenicity score, 47 allergenicity, 66 and toxicity probability. 67 We analyzed the physicochemical characteristics of the proposed vaccination using the ProtParam program, which can be found on the Expasy server 45 (https://web.expasy.org/protparam/).
Secondary and tertiary structure of the SS-Ss046
We used the SOLpro server
51
(http://scratch.proteomics.ics.uci.edu/) and the PSIPRED 4.0 webserver
68
(http://bioinf.cs.ucl.ac.uk/psipred/) to make predictions about the likelihood of overexpression in
Prediction of conformational B-cell epitopes
After performing 3D structure prediction and refinement, we made predictions about the conformational B-cell epitopes that would be used in the multitope design. For this prediction, we used ElliPro Server 54 (http://tools.iedb.org/ellipro/), a trustworthy program for identifying B-cell epitopes against a given antigen. A minimum score threshold of 0.5 and maximum distance of 6 angstrom was applied to ensure epitope reliability. ElliPro approximates protein shape using ellipsoids, with higher PI values indicating greater solvent accessibility.
Disulfide engineering of the designed vaccine
We included disulfide links to strengthen the 3D structure of the planned construct before beginning docking research for the created potential vaccine. Disulfide linkages are thought to increase protein stability by altering the protein’s geometric shape. For this, our team turned to Disulfide by Design 2.0 55 (http://cptweb.cpt.wayne.edu/dbd2/).
Docking of designed vaccine with hTLR-4
For our epitope-based synthetic vaccine design, we selected hTLR-4 (PDB ID: 4G8A) as the target receptor. Docking analysis between the ligand and receptor was performed using the ClusPro 2.0 server 57 (https://cluspro.org/). By running billions of conformations, aggregating the 1000 lowest energy structures created, and filtering out steric conflicts, this service can accurately predict the optimum docking models. The ligand and receptor PDB files were uploaded to the ClusPro service, and docking was conducted using the default settings.
Dihedral coordinate-based normal-mode analyses
To learn more about how the built epitope vaccine moves with respect to the bound hTLR-4 protein target, we used the iMODS server 58 (http://www.imods.chaconlab.org/). This server’s speed and effectiveness are definite pluses. A high eigenvalue, for example, indicates a more severe deformation as predicted by this service.
Immune simulation of the designed vaccine
Using a computational method, we predicted the stimulated immune response to the proposed vaccination using the C-ImmSim server 73 (http://www.iac.rm.cnr.it/filippo/C-IMMSIM/). For this study, we injected the planned vaccine 3 times, each time waiting 4 weeks between doses, following the prime-booster-boost strategy. This strategy was used to induce a persistent immunological response.
Molecular dynamic simulation
A 100 ns Molecular dynamics (MD) simulation was carried out using the GROningen Machine for Chemical Simulations aka GROMACS. 56 The CHARMM36m force field was used for the simulation. Using the TIP3P water model, a water box was constructed with its borders 1 nm from the protein surface. The appropriate ions were used to neutralize the systems. The simulation was performed using periodic boundary conditions and a temporal integration step of 2 fs after energy minimization, isothermal-isochoric (NVT), and Isobaric (NPT) equilibration of the system. For this trajectory analysis, a 100 ps snapshot interval was used. Root mean square deviation (RMSD), root mean square fluctuation (RMSF), radius of gyration (Rg), and solvent accessible surface area (SASA) were calculated after the simulation was finished using the rms, rmsf, gyrate, and sasa modules included into the GROMACS program. As for the plots, they were all made using the ggplot2 tool in RStudio. All MD simulations were performed on high-performance simulation equipment at the Bioinformatics Division, National Institute of Biotechnology, using the Ubuntu 20.04.4 LTS operating system.
Codon-optimization and analysis of the vaccine mRNA
The Java Codon Adaptation Tool (JCat) server (http://www.prodoric.de/JCat) was used to measure the multi-epitope vaccine expression level in
The secondary structure of mRNA was then predicted using the RNAfold web server (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi). This service forecasts the minimal free energy (MFE) and partition function created thermodynamically by the query mRNA structures. To analyze mRNA folding and vaccine secondary structure, the optimized DNA sequence was transformed to a possible RNA sequence using DNA <-> RNA-> Protein (http://biomodel.uah.es/en/lab/cybertory/analysis/trans.htm).
Result
Proteome analysis for selection of vaccine candidates
Among
Prediction of T-cell and B-cell epitopes from the selected vaccine candidates
We used IEBD to generate peptides from the selected vaccine candidates followed by a comprehensive evaluation of their immunological properties. The top 100 generated MHC-1 peptides from IEBD per each protein candidate were estimated for their antigenicity score, allergenicity, toxicity probabilities, conservancy, immunogenicity, binding affinity, and population coverage. Out of the 400 candidates screened, peptides that passed the selection process were shown to be antigenic, with top scores ranging from 1.3031 to 0.4210. Notably, all peptides were predicted to be probable nonallergens and nontoxic, indicating their safety profile for potential vaccine development. Furthermore, these peptides exhibited the lowest percentile rank, ranging from 0.01 to 0.41, suggesting their uniqueness and potential efficacy in eliciting an immune response. Among the selected peptides, some demonstrated exceptionally high immunogenicity scores. The highest immunogenicity score recorded was 0.37078, with binding affinities ranging from −9.7 to −8.6 kcal/mol. In addition, all peptides exhibited complete conservancy, with a 100% conservancy score. This implies that the selected epitopes are highly conserved across different variants or strains of the corresponding proteins. These top-ranked epitopes (MHC-1 peptides) of each protein are listed in Table 1.
Characteristics of top-ranked linear MHC-1 epitopes on protein surface, predicted through ElliPro in IEDB-analysis resource from 4 selected vaccine candidates.
Here, PR = Percentile Rank, AS = Antigenicity Score, Conserv. = conservancy, Im = Immunogenicity.
The binding affinity of the selected CTL candidates was evaluated through a docking study, with the resulting docked complexes illustrated in Supplemental Figure S2 and the corresponding binding scores presented in Supplemental Table S3. Subsequently, the top 100 generated MHC-2 peptides from IEBD per each protein candidate were thoroughly evaluated for their antigenicity score, allergenicity, toxicity probabilities, binding affinity, and population coverage as well as the ability to induce IFN-γ, IL-4, and IL-10. From a pool of 400 promising candidates, we identified peptides that surpassed our stringent selection process. These peptides demonstrated high antigenicity scores ranging from 1.3408 to 0.4656 and were predicted to be non-allergenic and non-toxic, reinforcing their suitability for vaccine development. We found that the top-ranked peptides had the lowest percentile ranks, ranging from 0.15 to 5.1. In addition, they displayed an exceptional binding affinity ranging from −9.1 to −7 (kcal/mol), signifying strong interactions with the target MHC-2 molecules. Moreover, we assessed the ability of the epitopes to induce specific cytokine responses. Cytokines, such as IL-4, IFN-γ, and IL-10, play crucial roles in modulating the immune response. Epitopes capable of eliciting positive cytokine responses were considered favorable for their potential to induce robust and targeted immune reactions. For a detailed overview, we have listed the top-ranked epitopes (MHC-2 peptides) of each protein in Table 2.
Characteristics of top-ranked linear MHC -2 epitopes on protein surface, predicted through ElliPro in IEDB-analysis resource from 4 selected vaccine candidates.
Here, NON = Absent, PR = Percentile Rank, AS = Antigenicity Score, Tox = Toxicity Negative (−) = Absent, Positive (+) = Present.
The binding affinity of the selected HTL candidates was also analyzed, with the docked complexes displayed in Supplemental Figure S3 and their binding scores listed in Supplemental Table S3.
Docking validation for epitopes and MHC molecules
Two control peptides are known to bind with high affinity to the HLA-A*02:01 and DRB-1*04:01 alleles, respectively, and were used to verify the docking simulations. The peptide TSKGLFRAAVPSGAS (alpha-enolase peptide 26-40) 74 and NLVPMVATV (derived from cytomegalovirus protein pp65) 75 served as a positive control in our docking investigation for MHC-I and MHC-II binding, respectively. The docking scores for the MHC-I control peptide were recorded at −8.9 kcal/mol, while the MHC-II control peptide yielded a score of −7.5 kcal/mol.
In comparison, the selected vaccine candidate epitopes exhibited binding energy scores ranging from −8.6 to −9.7 kcal/mol for MHC-I interactions and from −7.0 to −9.1 kcal/mol for MHC-II interactions (Supplemental Table S3), indicating stronger or comparable binding affinities relative to the control peptides. By doing numerous independent dockings with various random seeds, we confirmed that the predicted binding affinities are not skewed by the random source used in the prediction process.
Population coverage of the predicted epitopes
We calculated the population coverage of the predicted epitopes using the IEDB tool. The combined CTL epitopes covered 98.55% of the world population, while the combined HTL epitopes were predicted to cover 99.88% of the global population. The multitope vaccine composed of all the selected epitopes covered 100% of the world population (Figure 2). Further regional-specific population coverage details are elaborated in the Supplemental Table S4, while the Global HLA Allele Reactivity Profiles for Selected Peptides including Coverage and Compatibility Analysis are detailed in the Supplemental Table S5.

World population coverage of the multitope vaccine composed of all the selected epitopes, showing 100% coverage of the world population. The x-axis indicates the Number of epitope hits/HLA combinations recognized and the y-axis (on the left) shows the Percentage of individuals that can recognize vaccine epitopes. The y-axis on the right shows how many pairs can be recognized by the world population. The blue line displays the total number of pairs, whereas the green line displays the percentage of pairs that are recognized. The ideal situation is depicted by the red line, in which everyone can use at least one vaccination epitope to combat pathogens.
Screening for B-cell epitopes
Bepipred Linear Epitope Prediction 2.0, Emini Surface Accessibility Prediction, and Kolaskar and Tongaonkar Antigenicity prediction tools were used for B-cell epitope identification. We identified 13 B-cell epitopes for Q3YZL0_outer membrane protein, 88 for Q3YZM5_PapC-like porin protein, 14 for Q3Z3I2_fimbrial-like protein, and 74 for Q3Z5V5_LPTD. Among the generated peptides for each protein, epitopes with a length between 8:20 amino acids were analyzed for their antigenicity, and peptides with antigenicity score >0.4 were tested for their allergenicity and toxicity. A list of predicted top B-cell epitopes of each protein and their characteristics are given in Table 3.
Properties of highly ranked linear B-cell epitopes on protein surface predicted using ElliPro in IEDB-analysis resource for 4 chosen vaccine candidates.
Homology modeling and epitope mapping
High-confidence 3D models were successfully generated for all target proteins, with validation metrics confirming their structural reliability. The Ramachandran plot analysis revealed that 100% of residues in the putative outer membrane protein (Q3YZL0) model, 99.5% in the PapC-like porin protein (Q3YZM5) model, 98.2% in the putative fimbrial-like protein (Q3Z3I2) model, and 99.7% in the LPS-assembly protein LptD (Q3Z5V5) model were in the favored regions. GMQE scores for the models ranged from 0.58 to 0.88, further supporting their structural accuracy and confidence for downstream analysis.
Molecular visualization of the mapped epitopes revealed their accessibility and immunological relevance. In the putative outer membrane protein (Q3YZL0), epitopes were found to be surface-exposed and readily accessible for immune recognition (Figure 3A). The PapC-like porin protein (Q3YZM5) exhibited epitope regions strategically positioned near porin channel openings, enhancing their potential for immune interaction (Figure 3B). The putative fimbrial-like protein (Q3Z3I2) displayed epitopes distributed across functionally significant domains (Figure 3C), while the LPS-assembly protein LptD (Q3Z5V5) demonstrated epitopes localized to extracellular and solvent-accessible regions (Figure 3D).

Structural visualization of the modeled target proteins with mapped epitope regions highlighted in yellow: (A) Putative outer membrane protein (Q3YZL0) showing surface-exposed and accessible epitope regions. (B) PapC-like porin protein (Q3YZM5) with epitope regions located near porin channel openings. (C) Putative fimbrial-like protein (Q3Z3I2) displaying epitopes distributed across functional domains. (D) LPS-assembly protein LptD (Q3Z5V5) with epitopes positioned in extracellular and solvent-accessible regions.
Multitope vaccine construction
We chose 8 epitopes per table of 1, 2, and 3 (2 epitopes from each protein candidate) and the sum of which is 24 candidates of CTL, HTL, and B-cell epitopes joined using GGGS, GPGPG, and KK as linkers. β-defensin and PADRE peptide were also incorporated using EAAK linker to finalize a potential vaccine sequence of 490 amino acids in length (Figure 4) and its sequence was as follows:
“EAAAKGIINTLQKYYCRVRGGRCAVLSCLPKEEQIGKCSTRGRKCCRRKKEAAAKAKFVAAWTLKAAAEAAAKAKFVAAWTLKAAAGGGSSGTDSSQVGYGGGSNSFRVSYSKGGGSVTKNATFTFGGGSTVTFKVDYIGGGSSVMAGPSVRVGGGSSEHQSTLSAGGGSFYLPYYWNIGGGSSSRRWLFYWGPGPGSGQKAMAVLRLQDGSGPGPGREEQTNYNIMLSHYFGPGPGSGVKTDVPIALEGCDGPGPGQATTDAATNVALQMYGPGPGRWFSVMAGPSVRVNEGPGPGVRNRWFSVMAGPSVRGPGPGKYTTTNYFEFYLPYYGPGPGDPSYFNDFDNKYGSSKKEGNGAAVYTNMKKTGNDKEMYTATYNQNFKKNVSATKLQTNGAVSGVKKKSGTADGVQPTAFANQATTDAKKSVMAGPSVRVNKKAGDKNRQLTRYSDTRWHKKPSYFNDFDNKYGSSTDGYKKHQKEAPGQGGGS.”

The multitope vaccine is built using different sequences connected by specific linkers. The adjuvant (red) and PADRE (yellow) sequences are joined at the N- and C-terminal respectively. Eight CTL epitopes are connected with GGGS linkers (Green), while 8 HTL epitopes are connected with GPGPG linkers (orange). Finally, 8 BCL epitopes are connected with KK linkers (Blue).
This construct was predicted to be nonallergenic, nontoxic, and had an antigenicity score of 1.2568 (VaxiJen).
Physicochemical features, protein solubility assessment, and secondary structure prediction
The designed vaccine construct consists of 490 amino acids, with a molecular weight of 51 523.53 Da. The theoretical isoelectric point (pI) of the vaccine is 9.79, indicating a positively charged nature under physiological conditions, which is supported by the presence of 64 positively charged residues (Arg + Lys) compared to 31 negatively charged residues (Asp + Glu). Moreover, the protein exhibits moderate stability, as indicated by its estimated half-life of approximately 1 hour in mammalian reticulocytes in vitro and more than 10 hours in Escherichia coli in vivo. The vaccine’s structural stability is further supported by a low instability index of 38.77. In addition, the protein demonstrates hydrophilic characteristics, with a negative GRAVY value of −0.629. The physicochemical properties of the predicted vaccine construct are given in the Supplemental Table S6. In addition, the vaccine is predicted to be soluble with a probability score of 0.852386 (SOLpro). The vaccine secondary structure predicted by the PESIPRED server is composed of 14.08% helix, 32.24% strand, and 46.33% coil structure. The structure is given in Supplemental Figure S4.
Tertiary structure prediction, refinement, and validation
Phyle2 and 3Dpro were used to make 3D structure predictions and then Procheck and Prosa were used to double-check those predictions. Validation showed that the model predicted by the 3Dpro server was superior to those projected by phyle2. Figure 5 shows the results of using the GalaxyRefine server, a computer software, to refine the projected 3D structure (3Dpro) of the prospective vaccine and structural validation. We used the Ramachandran plot (PROCHECK) (Figure 5B) and the ProSA web service (Figure 5C) to compare the initial and final structures. Around 92.4% of residues in the optimized structure are in the primary allowed zone, 6.0% in the supplementary permitted region, 1.0% in the generously allowed region, and 0.5% in the unfavorable region. The Z-score went up from −2.54 to −3.41 as well. The main structure included 0.8% of disallowed residues, 1.6% of residues in the generously permitted zone, 1.6% in the extra allowed region, and 84.8% of residues in the most preferred region.

The 3-dimensional structure of vaccine obtained after molecular refinements and validation by Ramachandran plot analysis and ProSA. (A) The constructed vaccine model, with adjuvants highlighted in red, CTL epitopes in green, HTL epitopes in yellow, and B-cell epitopes in blue. (B) The red, yellow, gray, and white color sections represent residues in the most favored, additional permitted, generously allowed, and disallowed regions, respectively. (C) A Z-plot of the revised vaccination model derived from the Pro-SA webserver. The Y-axis depicts the Z-score acquired by NMR or X-ray crystallography for natural proteins, while the X-axis represents the number of residues. The black dot on the Z-plot represents our vaccine’s achieved Z-score.
Conformational B-cell epitope prediction
The tertiary structure and folding of the designed vaccine may generate new conformational B-cell epitopes and for this purpose, we used ElliPro server conformational. In the current assessment, the server predicts 11 new epitopes, and their scores were between 0.532 and 0.877 which are given in the Supplemental Table S7. The predicted 3D models of the generated epitopes are shown in Supplemental Figure S5.
Vaccine disulfide engineering
Disulfide by Design 2.0 server suggested that 35 pairs of amino acids are eligible to make disulfide bonds. Mutation was created for pairs having energy values less than 2. Both LEU228-TYR231 and VAL434-ASN437 pairs were engineered to have lower energy scores and chi-3 values of −102.89 and 104.79, respectively.
Molecular docking of the vaccine with TLR-4
Molecular docking analysis using ClusPro 2.0 generated 30 docking models, with the top-ranked model (model 0.00) exhibiting the lowest binding energy of −1378.5 kcal/mol, indicating a highly stable and strong interaction between the designed multitope vaccine and human TLR-4 (Figure 6). The PDBsum server provided detailed insights into the vaccine-TLR-4 interface, highlighting significant interactions across multiple receptor chains. Among these, chain B demonstrated the most extensive binding with the vaccine, involving 30 and 32 interface residues across an interaction area of 1505 Ų and 1519 Ų, respectively. This interaction was stabilized by the formation of 2 salt bridges, 17 hydrogen bonds, and 193 nonbonded contacts, underscoring a robust molecular association (Figure 7). Chain A exhibited moderate interaction, with 7 and 6 interface residues spanning areas of 274 Ų and 292 Ų, contributing 1 salt bridge, 7 hydrogen bonds, and 60 nonbonded contacts. Chain D also displayed notable interactions, involving 9 and 15 interface residues over areas of 621 Ų and 554 Ų, with 1 salt bridge, 6 hydrogen bonds, and 76 nonbonded contacts. These findings collectively suggest that the designed vaccine establishes strong and stable interactions with TLR-4, potentially enhancing immune activation and downstream signaling.

Docked complex of vaccine constructs with human TLR-4 (A, B); vaccine constructs in magenta color and TLR-4 receptor in red and MD2 in green color.

Receptor-vaccine docking analysis. (A) Depicts the docked complex of TLR-4/MD2-vaccine, with the vaccine construct highlighted in red, where the specific interacting residues are highlighted in purple. (B) The interacting residues in the interface of the docked complex, where interacting chains are connected by colored lines, each denoting a different type of interaction as indicated in the key provided.
Dihedral coordinate-based normal-mode analyses
NMA was used on the iMODS simulation server to evaluate the stability and adaptability of the vaccine-TLR-4 complex. We first compared the vaccine-TLR-4 complex to the TLR-4 protein using deformability (Figure 8A) and B-factor (Figure 8B) analysis and found that the vaccine-TLR-4 complex had much less distortion. In comparison to the eigenvalue of the target TLR-4 protein, which was 2.480454e 05, the eigenvalue of the vaccine-TLR-4 complex was 3.0261702e 08 (Figure 8 C), showing that much less energy is required to deform the complex. In addition, the variance analysis was used to represent the stiffness of the complex (Figure 8D), and it was shown to be inversely proportional to the eigenvalue. The iMOS generated a covariance matrix showing the linked residue pairs with either anti-correlated (blue) or correlated (red) movements, or without any correlation (white). Predicted associated residue-pair movements were greater for the protein complex than for TLR-4, while uncorrelated motions were lower for the complex protein (Figure 8E). In addition, an elastic network analysis was carried out, with each spring standing in for a specific atomic pair. Analysis of the residues close to the carboxy terminus showed the dark gray band around the spring and noncontinuous gray bands around the same immobility normal string (Figure 8F).

Dihedral coordinate-based normal-mode analyses of the vaccine-hTLR-4 complex; ligand-receptor interaction was assessed throughout comparative (A) Deformabilities, (B) b-factor, (C) eigenvalues, (D) variance, (E) covariance of residue indices, and (F) elastic network analysis.
Immune simulation of the designed vaccine
Secondary immunological responses, such as IgM + IgG, IgM, IgG1 + IgG2, IgG1, and IgG2, were shown to be highly stimulated by the potential vaccine, and they increased with further doses of the vaccine (Figure 9A). There was an uptick in the production of cytokines such as interferon-gamma (IFN-γ), transforming growth factor beta (TGF-β), and interleukin-10 (IL-10) after vaccination. A significant elevation in IFN-γ levels was seen after the first vaccination, with concentrations reaching over 400 000 ng/mL. This is consistent with the activation of cellular immune responses. After the second dosage, IFN-γ levels remained stable, demonstrating a sustained induction. Interferon-γ levels were high but slightly rose after the third treatment, reaching over 450 000 ng/mL (Figure 9B). Transforming growth factor-β levels followed a distinct pattern, with a first-dose induction of about 150 000 ng/mL. As can be seen in Figure 7B, Transforming growth factor-β concentrations steadily declined throughout many administrations, from a peak of about 75 000 ng/mL after the second dosage to around 10 000 ng/mL after the third dose. After the first 2 vaccination doses, IL-10 levels peaked at approximately 50 000 ng/mL, suggesting an early immunological response. However, IL-10 concentrations dropped dramatically after the third dosage (Figure 9B), indicating that IL-10 production naturally declines with time. It was also discovered that the vaccination doses prompted a rise in the number of activated B and T cells. Figure 9C shows that the third vaccination dosage produced the largest number of B cells, whereas Figure 9D shows that the second vaccine dose produced the highest number of T cells. This suggests that both T and B cell populations expand in response to vaccination, with separate maxima occurring after different doses of the vaccine.

Anticipated immune response following the injection of the designed vaccine. Panel (A) and (B) display the number of antibodies and cytokines, respectively, in response to the vaccine. Panel (C) and (D) depict the population of B and T cells, respectively.
Molecular dynamic simulation
We performed several investigations using MD simulations based on Cα atoms, to determine the stability of the vaccine-TLR-4 complex. Protein conformational changes during ligand binding were analyzed using the RMSD. After a dramatic spike in the first 10 ns, the complex’s RMSD value stabilized at roughly 0.2 nm for the rest of the run. After reaching its peak at 30 ns, it gradually dropped over the next 20 ns. At the end of the molecular dynamic simulations, the complex settled into a rather stable form (Figure 10A). We used the RMSF to assess the protein’s local adaptability. Increases in RMSF values suggest more positional flexibility in certain amino acids. Particularly flexible were residues at positions 10, 250, and 450 (Figure 10B), indicating a possible role in interactions with the receptor. Using the Rg, we could evaluate how compact the protein was. Stable protein folding is represented by a constant Rg value, whereas changes in this value imply unfolding. Our investigation showed that after 40 ns, the Rg rapidly decreased, then increased, and then decreased again, resulting in a compact condition (Figure 10C). Solvent-accessible surface area research was used to evaluate the protein’s hydrophobic core stability. Increases in SASA reflects a higher propensity for protein instability due to solvent accessibility. The SASA values dropped steadily during the simulation and reached rock bottom at the conclusion. Figure 10D shows that this arrangement decreases the likelihood of solvent disruption and implies that the protein is more stable.

Analysis of conformational changes, flexibility, compactness, and hydrophobic core stability of the vaccine-TLR-4 complex across 100 ns explicit molecular dynamics simulation run. (A) RMSD, (B) RMSF, (C) RG, and (D) SASA. Here, RMSD, root mean square deviation; RMSF, root mean square fluctuation; RG, mean radius of gyration, and SASA meaning solvent-accessible surface area.
Codon optimization and mRNA prediction of the vaccine construct
We employed a sophisticated codon optimization strategy for the
Advancing the molecular architecture of the vaccine, we leveraged the RNAfold server to predict the mRNA secondary structure of our vaccine construct (Figure 11). This critical step, informed by our codon-optimized sequence, allowed for a rigorous assessment of the mRNA’s structural integrity. The server’s analysis predicted a highly stable mRNA structure characterized by substantial base pairing and a marked absence of unstable regions, underpinning the vaccine’s potential efficacy. Notably, the minimum free energy (MFE) of the optimal secondary structure was calculated to be −475.90 kcal/mol, with the secondary centroid structure exhibiting a free energy of −369.28 kcal/mol. Furthermore, the ensemble free energy was determined to be −503.05 kcal/mol, underscoring the vaccine mRNA’s robust stabilization across a spectrum of potential structural conformations.

Optimizing codons and predicting mRNA vaccine structure. (A) Optimal secondary structure, (B) Centroid secondary structure of the vaccine mRNA retrieved using RNAfold Webserver. (C) CAI value, (D) a dot plot containing the base pair probabilities, (E) a mountain plot representation of the MFE structure, the thermodynamic ensemble of RNA structures, and the centroid structure along with the positional entropy for each position.
Discussion
Millions of children under the age of 5 are disproportionately affected by shigellosis, a severe diarrhea illness caused by the gram-negative bacteria
The computational approach of reverse vaccinology was used to find possible vaccine candidates.
80
In this research, vaccine candidates were derived from the proteome of
Our in-silico approach integrated multiple computational methods to predict T-cell and B-cell epitopes, ensuring their immunogenicity, stability, and broad population coverage. The successful generation of high-confidence 3D models for all target proteins highlights their structural reliability and suitability for epitope mapping. Validation metrics, including Ramachandran plot analysis and GMQE scores, confirmed the model’s accuracy. Mapping of CTL, HTL, and B-cell epitopes onto these structures revealed their surface-exposed and solvent-accessible locations, critical features for effective immune recognition and antigenicity. 85 The distribution of epitopes across extracellular loops and functional domains, as observed in the putative outer membrane protein (Q3YZL0) and LPS-assembly protein LptD (Q3Z5V5), emphasizes their immunological relevance. Similarly, the positioning of epitopes near porin channel openings in PapC-like porin protein (Q3YZM5) and across functional domains in the putative fimbrial-like protein (Q3Z3I2) underscores their potential for eliciting robust immune responses.
The designed vaccine construct demonstrated favorable physicochemical properties, including solubility, stability, non-allergenicity, and non-toxicity. After 3 vaccinations, the body produced abundant quantities of antibodies (IgM + IgG, IgG1 + IgG2) and cytokines (IFN-γ, TGF-β, IL-10). These predicted responses align with known protective immune mechanisms observed in
Comparing this approach to other in silico-designed peptide vaccines, such as those targeting
A key innovation of this study is the development of an mRNA-based vaccine, which offers significant advantages over conventional DNA-based MEVs that require plasmid construction and cloning.
96
The mRNA vaccine design leveraged the RNAfold server and related algorithms to optimize secondary structure stability and free energy profiles. The codon optimization process tailored the mRNA sequence for optimal expression in
Overall, the results of this work highlight the promise of reverse vaccinology for developing mRNA
In summary, our research offers a potential reverse vaccinology mRNA vaccine based on multiple epitopes that can be used to protect against the most frequent
Conclusion
The goal of this research was to create an mRNA vaccine against the most prevalent
Supplemental Material
sj-docx-1-bbi-10.1177_11779322251328302 – Supplemental material for Development of a Novel mRNA Vaccine Against Shigella Pathotypes Causing Widespread Shigellosis Endemic: An In-Silico Immunoinformatic Approach
Supplemental material, sj-docx-1-bbi-10.1177_11779322251328302 for Development of a Novel mRNA Vaccine Against Shigella Pathotypes Causing Widespread Shigellosis Endemic: An In-Silico Immunoinformatic Approach by Abdur Razzak, Otun Saha, Khandokar Fahmida Sultana, Mohammad Ruhul Amin, Abdullah bin Zahid, Afroza Sultana, Uditi Paul Bristi, Sultana Rajia, Nikkon Sarker, Md Mizanur Rahaman, Newaz Mohammed Bahadur and Foysal Hossen in Bioinformatics and Biology Insights
Footnotes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
