Abstract
This study presents a machine learning (ML) framework for predicting the critical buckling loads of tapered axially functionally graded (AFG) nanobeams resting on elastic foundations, governed by Eringen’s nonlocal elasticity theory. Due to their position-dependent stiffness and complex boundary conditions, closed-form solutions for such nanobeams are computationally expensive and analytically intractable. To address this, four supervised data-driven regression models are developed: linear regression (LR), polynomial regression (PR), support vector machine (SVM), and artificial neural network (ANN). These models are used to predict buckling loads for three boundary conditions: clamped–clamped (C–C), simply supported (S-S), and clamped-simply supported (C–S). A comprehensive dataset is generated using a finely discretized parametric sweep of the beam’s geometric and material properties. Pearson correlation coefficient (PCC) and variance inflation factor (VIF) analyses are conducted to ensure feature relevance and independence. Results show that the ANN model consistently outperform other test models, achieving coefficient of determination (R2) values exceeding 99%, root mean square error (RMSE) values below 1.1 (in normalized units), and mean absolute error (MAE) values below 0.67 (in normalized units) across all boundary conditions. The findings demonstrate that ML and ANN offer a robust and computationally efficient surrogate to traditional methods, with strong generalization capabilities.
Keywords
Introduction
Nanobeams are fundamental load-bearing members in many micro- and nano-electro-mechanical systems and flexible electronics platforms. 1 Because their slender geometry makes elastic instability more critical than material strength, the critical buckling load is the primary design metric. 2 To tune stiffness, weight, and multifunctional response, designers increasingly use axially functionally graded (AFG) materials together with deliberate geometric tapering along the span.3,4 The combined variation of material moduli, density, and cross-section converts the bending rigidity into a spatially varying quantity, which violates the constant-coefficient assumptions behind classical Euler–Bernoulli theory. 5 Recent elastostatic work on tapered functionally graded beams shows that the combined taper and spatially varying material properties lead to position-dependent stiffness coefficients, so specialized polynomial-coefficient finite-element formulations are required instead of simple closed-form solutions. 6
At nanoscale dimensions, non-local stress interactions bring in characteristic length parameters that must be carried through the governing equations. 7 Large-scale molecular-dynamics benchmarks confirm that size-dependent stiffness and critical load can diverge markedly from classical predictions, highlighting the need for enriched continuum models. 8
Multiscale analyses on carbon-nanotube-reinforced laminates show that environmental conditions and microstructural imperfections reduce the critical buckling load. 9 Studies on asymmetric composite sandwich panels with tapered regions likewise find that geometric asymmetry and material heterogeneity cause shear-buckling damage and matrix-splitting cracks, with ultimate load dictated by core-thickness-dependent stiffness. 10 Because these mechanisms must be resolved in detail, high-fidelity finite-element or mesh-free models become costly. A recent butt-joint stiffened thermoplastic composite panel analysis demanded very fine meshing and nonlinear solvers merely to converge, 11 and mapping the influence of fastener layout and shear-connector stiffness on built-up cold-formed columns required 84 separate finite-element runs. 12
Data-driven surrogate modeling offers a practical alternative. An artificial neural network (ANN) surrogate trained on 500 finite-element samples predicts lower-bound buckling loads for cylindrical shells under localized axial compression with 67% of test cases showing errors below 5% and 87% below 10%. 13 Physics-informed neural networks replace repeated nonlinear simulations for axially compressed shells while retaining equilibrium consistency, 14 while also being used to evaluate compression buckling of laser-welded stiffened plates. 15 Further, a hybrid machine-learning framework combines generalized surrogates with interval analysis to perform uncertain buckling assessment. 16 Surrogate-driven optimization of variable-stiffness laminates further demonstrates the scalability of these approaches to larger design spaces. 17
In Table 1, we present a comprehensive summary of 15 recent studies leveraging machine learning (ML) techniques to predict buckling behavior in structural components. These studies span a diverse set of geometries and structural forms, including columns, plates, shells, and beams, with dataset sources ranging from finite element simulations and experimental results to prior literature. ANNs appear most frequently, reflecting their strong predictive capabilities across various contexts. Hybrid and ensemble models such as ANFIS variants, gradient boosting regression (GBR), and CatBoost also demonstrate notable success, particularly in enhancing prediction accuracy and model stability. Key test metrics reported include mean squared error (MSE), root mean square error (RMSE), mean absolute error (MAE), and coefficient of determination (R2), among others. Noteworthy findings include the superior performance of ANN in predicting buckling loads for complex geometries,18–21 the effectiveness of polynomial regression for additively manufactured panels, 22 and the identification of beam geometric parameters as critical features in ensemble-based models. 23 Furthermore, CatBoost and GBR models have emerged as top performers in recent comparative studies,24,25 suggesting the growing potential of boosting-based algorithms in structural stability prediction.
Comprehensive summary of recent studies utilizing ML techniques to estimate buckling load.
Altogether, the widespread application of ML in buckling load estimation is well justified, given its high predictive accuracy and ease of use. Whether coupled with finite element analysis (FEA) or classical stability theories, ML models offer efficient, scalable alternatives to costly numerical analyses, enabling rapid evaluation across large parameter spaces and supporting data-driven design in complex structural systems.
While ML has been widely applied to buckling prediction across a variety of structural forms, direct transfer of these models to tapered AFG nanobeams governed by nonlocal elasticity and supported by elastic foundations is not straightforward. In this regime, stiffness is modulated simultaneously by material gradation, geometric taper, and foundation interaction, with each effect further shaped by the chosen boundary conditions. These factors interact nonlinearly, producing parameter couplings absent in conventional column, plate, or shell datasets. Models trained on those simpler geometries cannot reliably extrapolate here, as nonlocal effects shift stiffness and stability thresholds in a manner that depends on both load and support conditions. Without a tailored predictive framework, designers of micro-electromechanical systems and nanoscale devices face costly simulations and limited ability to explore lightweight, stability-critical designs.
In this work, we construct a high-fidelity, synthetic dataset spanning the coupled space of gradation, taper, foundation stiffness, nonlocal parameter, and boundary condition, and use it as a controlled testbed to evaluate four supervised learning strategies: linear regression (LR), polynomial regression (PR), support vector machine (SVM), and artificial neural network (ANN). This convergence of advanced spectral solution methods for rapid data generation with modern nonlinear ML architectures enables, for the first time, tractable multi-factor buckling prediction in tapered AFG nanobeams. Compared with traditional numerical methods, which require repeatedly solving governing equations for each potential design configuration and boundary condition, the proposed learning strategies offer a highly efficient alternative. Once trained on representative datasets, these models can (i) capture complex nonlinear relationships between the design attributes and the corresponding critical buckling loads across various boundary conditions without relying on explicit differential formulations; (ii) instantly estimate buckling responses for new design configurations within the training domain, making them highly attractive for optimization and design applications while eliminating the need for repeated meshing or re-computation.
This paper is structured as follows. Section 2 details the methodological framework, including dataset construction, statistical analysis, and model implementation. Section 3 presents the results, comparing model performance across BCs using standard evaluation metrics. Section 4 concludes with key findings and recommendations for future work.
Methods
This section outlines the methodology to develop an ML framework for predicting critical buckling loads of tapered AFG nanobeams on elastic foundations. First, a dataset is generated for three boundary conditions (BCs): clamped–clamped (C–C), simply supported (S-S), clamped–simply supported (C–S) by performing a parametric sweep of beam geometry, material gradation and nonlocal parameters. Second, feature relevance and independence are ensured via statistical preprocessing using Pearson correlation coefficient (PCC) and variance inflation factor (VIF). Third, four regression models LR, PR, SVM, and ANN are trained and validated using an 80/20 hold-out strategy repeated over multiple trials. Model performance is evaluated using R2, RMSE, MAE, and computational time on validation and test sets. Finally, predictive accuracy and robustness are assessed through predicted versus true comparisons and residual analysis to confirm superior generalization.
Data description
This subsection outlines the data-generation procedure used to predict critical buckling loads of tapered AFG nanobeams on elastic foundations. Figure 1 illustrates the beam geometry and the nature of the BCs under investigation. Our approach, like prior literature,18,19,21 relies on datasets drawn from earlier works. A comprehensive description of dataset generation and the governing equation is presented here. For additional details, readers are referred to Sari et al. 32 for full derivations and parameter definitions.

Nonlocal AFG tapered Euler–Bernoulli beam with (a) simply supported and (b) clamped BCs resting on elastic foundation and subjected to a constant axial compressive load.
It is believed that Eringen’s nonlocal theory can be exploited for nanostructures with certain BCs as S-S and C–C. For nanobeams with S-S edges, the BCs are the deflection
Furthermore, there is a common belief that the nonlocal elasticity theory has a lot in common with other theories such as the strain gradient elasticity. 33 Besides, it is known that the nonlocal elasticity theory can predict the performance of a wide range of nanostructures by avoiding the most complex equations. 34
Additionally, the nonlocal elasticity theory remains a foundational framework that has inspired numerous refined models, including the strain-driven and stress-driven theories, as well as the nonlocal strain gradient and integral–differential hybrid theories. Therefore, despite its mathematical and physical limitations in some cases, Eringen’s nonlocal elasticity theory retains a central role in nano mechanics as a reference model and as a basis for developing more advanced theories. To ensure the reliability of Erigen’s nonlocal elasticity theory, few references in which this theory has been utilized to investigate the behavior of nanostructures have been added to the list of references, for completeness.34–46
Hence, we adopt Eringen’s nonlocal Euler–Bernoulli beam model to derive the dimensionless governing equation of motion shown in equation (1):
Here,
Where
Applying the Chebyshev collocation method, these conditions are expressed in equation (10):
Where
In the current study, the initial Chebyshev points are moved to fall within the interval of (0, 1) as the x-axis is normalized to this range. Since the elements of the Chebyshev differentiation matrices depend on the distribution of the points, they will differ from those derived by Trefethen. 47 Equation (10) is expressed in matrix vector form as equation (12):
Considering equation (12),
where
Thus, the Chebyshev collocation matrices are adjusted as in equation (15):
where
where
where
The boundary conditions are expressed using the spectral collocation method as in equations (15) and (16), except that each differentiation matrix
The spectral collocation method can be used to discretize these conditions as explained earlier. Once the governing equation and the boundary conditions are discretized, the inputs (hereafter denoted as
The typical ranges of these inputs are (0, 1), (0, 0.8), (0, 0.8), (0, 400), and (0, 10), for
Proposed methodology
This section presents the methodology proposed in this work to estimate the critical buckling loads of three different BCs, as depicted in Figure 2. Specifically, let

The methodology proposed to estimate the critical buckling loads for three different BCs of a tapered nanobeam under study.
The proposed modeling methodology comprises the following four systematic steps:
It is worth mentioning that even though the Chebyshev spectral collocation method (Section 2.1) has been proven effective in providing highly accurate and numerically exact solutions to the governing equations at hand, the established datasets (
Moreover, to detect and mitigate multicollinearity among the input parameters, the VIF 49 is computed. In practice, multicollinearity leads to unstable and unreliable regression models. Typically, as a rule of thumb, VIF > 5 indicates problematic levels of collinearity and design parameters with high VIF values should then be removed or transformed.
In summary, this step has double effects by ensuring that the selected features are both (i) relevant (through the computation of PCC) and (ii) independent (through the computation of VIF), supporting a stable and interpretable ML model development process of Step 3.
• Linear regression (LR): The LR assumes a simple linear relationship between the inputs and each output, individually. The model development stage aims to estimate the
The objective is to minimize the residual sum of squares (RSS) between the individually estimated (

LR schematic.
• Polynomial regression (PR): The PR extends the LR to include second-degree terms and cross-products, allowing the model to capture basic curvature in the training dataset. The objective is then to find the optimal quadratic equation that minimizes the mismatch between the individually estimated (
This allows more flexibility in modeling the relationship without introducing higher model complexity as in the subsequent models investigated. Independent PR models are developed for each boundary condition,
Support vector machines (SVMs): The SVM model uses a nonlinear kernel function (
Artificial neural networks (ANNs): The ANN regression model is adopted to approximate complex nonlinear mathematical relationships between the design input parameters and each critical buckling load response in output. The model learns from a set of historical/representative data (i.e. the training dataset,
To ensure robust and generalizable results, all models are trained and validated using a 20% hold-out validation strategy, while repeating the simulation

PR schematic.

SVM schematic.

ANN schematic.
Model evaluation
The model’s performance is quantified using standard regression evaluation metrics, including:
RMSE (equation (22)). It measures the average magnitude of estimation/prediction mismatch. Small values reflect the effectiveness of the built-model and vice versa.
MAE (equation (23)). It measures the average absolute mismatches between estimated/predicted and actual buckling load values. Small values reflect the effectiveness of the built-model and vice versa.
R 2 (equation (24)). It reflects the proportion of variance in the output explained by each built-model. Higher values reflect the effectiveness of the built-model and vice versa.
where
In addition to the above-mentioned performance metrics, the computational time (Time) required to train each model is also recorded during its development to provide insight into the trade-off between accuracy and efficiency.
The hold-out validation process is repeated
Results and discussion
This section presents the results obtained by the application of the proposed methodology, step by step.
The training (
Figure 7 presents the PCC values obtained for each input design parameter with respect to each buckling load of the three BCs examined, that is,

PCC values obtained for the five design parameters with respect to each critical buckling load.
The training dataset is then fed to the four selected ML models to estimate the associated buckling loads, evaluated individually on both the validation and test data points across
The ANN model clearly outperforms the other models across all performance metrics, achieving RMSE values below 1.1 (in normalized units), MAE values below 0.67 (in normalized units), and R2 values exceeding 0.99 for all three BCs.
The ANN performance is also shown to be consistent across the three BCs, in contrast to the larger variability observed in the performance of the other models. This highlights the ANN’ robustness and its ability to accurately estimate the buckling loads across different structural configurations.
The relatively poor performance of the LR model suggests a highly nonlinear relationship between the input design parameters and the output buckling loads. This is further supported by the improved performance of the PR model. Conversely, the SVM model performed worse than PR and LR in terms of predictive accuracy, implying that either extensive hyperparameter tuning is needed or that alternative modeling techniques such as ANN are more suitable for this application.
The SVM and ANN models exhibit slightly higher variability across the 10 runs compared to the LR and PR models. This is expected, as both SVM and ANN involve hyperparameter initialization and iterative training (i.e. model uncertainty), which introduce additional sources of uncertainty beyond data splitting. Specifically, the standard deviations (
Table 2 reports the computational efforts required by the models for their development across the 10 replicates for each boundary condition. As expected, the LR required the least computational effort, while the ANN model exhibited the highest. Moreover, a noticeable variability was observed across the 10 runs, particularly for the ANN model, reflecting sensitivity to initialization and training dynamics.

Performance metrics obtained by each of the four models on the validation set across the 10 trials, together with the corresponding standard deviation (
Computation time in seconds required by the predictive models on the validation sets across the 10 simulation trials.
Similar insights are observed when the models are applied to the unseen test dataset across the 10 runs, as depicted in Figure 9. The ANN model maintains its superior performance across all three metrics and BCs, demonstrating strong generalizability and robust predictive capability on unseen data. For clarity, the standard deviations (

Performance metrics obtained by each of the four models on the test dataset across the 10 trials, together with the corresponding standard deviation (
To further illustrate the models’ capability in estimating the three buckling loads, Figure 10 shows the predicted versus true/actual load values for each model and BC on the validation set, shown for one arbitrary replicate (i.e. replicate #1). The ANN consistently demonstrates the best alignment with the ideal diagonal line (dashed line) across all conditions, confirming its superior predictive accuracy and generalization. In contrast, the LR and PR models show systematic underestimation, especially at higher load values. The SVM model significantly deviates from the ideal behavior, indicating limited predictive capability. These patterns reinforce the superior performance of the ANN model, as previously highlighted by the quantitative metrics.

Predictive versus true/actual buckling load values for the four models across the three BCs: (a) C–C, (b) S-S, and (c) C–S.
For illustration purposes, Figure 11 presents the true (actual) and predicted load values for one BC, specifically, the C–C case, using the ANN model on the test dataset for the first 50 examples (Figure 11(a)) and the associated prediction residuals (mismatch; Figure 11(b)). The predicted values are shown to consistently follow the trend of the actual loads, indicating the model’s capability to capture the underlying patterns and variations in the data.

(a) True versus predicted buckling load values for the C–C case using the ANN model on the first 50 test data points and (b) the associated prediction residuals (mismatch).
Buckling is a critical factor in structural and mechanical engineering as it can cause abrupt and catastrophic failure with minimal warning, unlike gradual failures such as yielding. It is a stability problem in which a structure loses its load-carrying capacity due to geometric deformation, even though the material itself remains intact. This is especially substantial in slender components like columns, beams, plates, and shells that are widely used in bridges, towers, buildings, aircraft, and vehicles where lightweight and material efficiency are prioritized. These elements are highly susceptible to instability under compressive loads, making accurate prediction of critical buckling loads essential to avoid failure and improve designs. In high-performance applications such as aerospace and automotive industries, the use of thin-walled structures for weight reduction further increases vulnerability to buckling, where even small imperfections or load variations can trigger instability. As a result, buckling analysis is critical for selecting appropriate shapes, materials, and safety features, as well as informing inspection methods, maintenance planning, and design code formulation to guarantee structures are safe, efficient, and long-lasting.
Conclusions and future directions
This paper developed an ML framework to predict the critical buckling loads of tapered AFG nanobeams on elastic foundations using non-local elasticity theory. A synthetic dataset was generated through a comprehensive parametric sweep of design inputs across three boundary conditions (C–C, S-S, C–S). Feature screening was performed using PCC and VIF to ensure input relevance and independence. Four regression models (LR, PR, SVM, ANN) were trained and evaluated using repeated hold-out validation and standard error metrics on both validation and test sets. Key findings include:
PCC analysis revealed that
All design inputs passed the VIF threshold (VIF = 1), confirming no multicollinearity and supporting model stability.
ANN achieved R2 > 0.99, RMSE < 1.1 (in normalized units), and MAE < 0.67 (in normalized units) across all BCs on both validation and test datasets, indicating high predictive accuracy and generalization.
The ANN model showed strong consistency across different BCs, with low performance variance, making it robust to structural configuration changes.
LR failed to capture the nonlinear input-output relationships, with significantly lower R2 and higher error metrics.
PR improved upon LR, but both models still underestimated loads, especially at higher values, and showed limited generalization to unseen data. This contrasts with previous findings, such as Rayhan et al., 22 where the PR model, optimized with respect to the polynomial degree, achieved the highest accuracy. In the present study, a second-degree PR model was adopted to illustrate the transition from a linear model (LR), to nonlinear (PR), and highly nonlinear (SVM and ANN) models, while assessing their predictive capabilities across the considered metrics. Therefore, future studies in this domain could focus on evaluating the influence of higher-degree terms on prediction accuracy, as well as comparing additional diverse models, since performance appears to be case-dependent.
SVM showed poor accuracy and high variance in predictions, likely due to default kernel settings and sensitivity to hyperparameters.
ANN predictions closely tracked true buckling loads across all 50 test examples in the C–C case, with low residuals and minimal bias.
ANN required moderate training time (∼28–32 s) and offered the best trade-off between accuracy, robustness, and efficiency.
SVM incurred the highest computational cost (up to 975.6 s in S-S), with no advantage in prediction quality.
While the ANN model provided excellent predictive performance, several areas remain open for further development. The current study relies on a synthetic dataset based on idealized formulations and extending the approach to experimental or real-world datasets would help validate applicability by capturing potential imperfections inherent in nanoscale structures, including fabrication tolerances, surface defects, and thermal or geometric variations. In addition, while ANN proved reliable, incorporating uncertainty quantification or probabilistic bounds, through techniques such as Monte Carlo simulations, would enhance confidence in safety-critical settings. Future work could also explore transfer learning, physics-informed networks, or hybrid ML-FEA pipelines to support broader generalization and interpretability across different nano structural forms and loading scenarios.
Footnotes
Handling Editor: Aarthy Esakkiappan
Author contributions
Sameer Al-Dahidi: conceptualization, methodology, software, formal analysis, validation, investigation, writing – original draft, writing – review and editing, visualization. Ma’en S. Sari: conceptualization, methodology, data curation, software, writing – original draft, visualization, formal analysis.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability statement
Data will be made available on request.
