Sage Journals: Discover world-class research

Abstract

This study presents a data-driven approach for smart tire management through the application of unsupervised machine learning techniques. Using a real-world dataset comprised with synchronized records collected at 1 Hz from a sensor-equipped fleet vehicle, the research investigates how the use of clustering algorithms – K-Means and BIRCH with Agglomerative Clustering – can be employed to identify distinct operational stages in the usage cycle of tires. A comprehensive descriptive analysis was first conducted to understand the behavior and correlations among pressure, temperature, and speed data. The clustering analysis, applied both globally and by individual tire positions, revealed that the optimal number of clusters can vary depending on the tire’s location. The findings highlight the importance of position-aware tire analytics and support the development of intelligent tire management systems capable of optimizing performance, enhancing safety, and extending tire lifespan.

Keywords

machine learning clusters pattern recognition data descriptive analysis sensorized tires

Introduction

Rubber, a crucial commodity in various daily products, reached a production volume of 29.6 million metric tons by 2022, with a substantial portion allocated to tire manufacturing.¹ The quality of tires on the market can be substantially improved through advanced monitoring, with the automotive industry investing in solutions such as Tire Pressure Monitoring Systems (TPMS), which will soon be mandatory in Europe.²

In this context, the project “Traceable Smart Products and Interoperability in the Automotive Industry Supply Chain with the Application to Sensorized Tires – Tire IoT,” financed by FINEP (Brazil), aims to create a smart tire equipped with Radio-Frequency Identification (RFID) tags, as well as pressure and temperature sensors, for ongoing tire monitoring from production to disposal. This device has been engineered to withstand the demanding conditions of the vulcanization process and daily use; besides, it should also feature real-time communication capabilities, connecting the tire to both the vehicle’s Controller Area Network (CAN) network, other instruments, and to the cloud, ensuring traceability and data connectivity throughout the production chain.

Internet of Things (IoT) tires’ benefits to the market include optimizing manufacturing and maintenance processes, reducing operating expenses, and increasing vehicle safety.³ For consumers, these tires provide greater safety through continuous tire condition monitoring, and sustainability through extended tire life and reduced waste.⁴ However, these benefits can only be achieved through the development of models that allow the analysis of the acquired sensor data. Thus, the development and application of IoT and Artificial Intelligence (AI) technologies in tires represent a significant advance for the automotive industry and for the end-user experience.

It is important to note that the use of machine learning algorithms to identify patterns in sensor-equipped tires has the potential to alert about out-of-specification usage conditions and contribute to the prediction of tire lifespan.⁵ This can lead to an extended tire lifespan, a better maintenance schedule, and consequently a lower Cost per Kilometer (CPK). These capabilities support efficient management of operating conditions and aid in stakeholder decision-making, providing advantages for both consumers and the environment.

However, traditional supervised machine learning techniques are not straightforward in this domain because there is a lack of labeled data.⁶ For example, fleet managers lack the labeled data (i.e. known examples of “good,”“acceptable,” or “disposable” conditions) required to train classification algorithms. This gap necessitates an alternative approach.

Accordingly, this study adopts an exploratory, data-driven approach that combines descriptive analysis and unsupervised machine learning to analyze the tire usage cycle. By applying clustering models to the observable variables – tire pressure, temperature, and vehicle speed – the primary objective is to identify the distinct operational stages a tire naturally undergoes during normal service.

The unlabeled dataset, comprised of 1 Hz time-series data from six sensor-equipped tires on a fleet bus over a 30-day period, is leveraged to achieve this goal. The unlabeled dataset used in this study was provided by Prometeon Tyre Group (publicly available in Harvard Dataverse⁷).

Thus, this work seeks to investigate operational patterns in fleet tires and address the following research question: can an exploratory approach combining descriptive analysis and unsupervised machine learning applied to fleet tire sensor data reveal distinct operational states associated with tire position and usage conditions, thereby supporting the development of intelligent, data-driven tire management strategies?

Determining the optimal number of clusters in a dataset of fleet tire data can be a crucial step toward uncovering underlying patterns in tire usage behavior. Clustering facilitates the identification of intrinsic groupings within the dataset, which may correspond to variations in operational conditions, such as route types, vehicle loads, or driving styles.³ These groupings enable a more nuanced understanding of tire performance, supporting data-driven approaches to predictive maintenance, resource allocation, and lifecycle optimization. Additionally, clustering serves to reduce the dimensional complexity of the data, allowing for clearer interpretation and the development of targeted strategies for each subgroup.⁸ Establishing the most appropriate number of clusters also provides a foundation for subsequent modeling tasks, including anomaly detection and condition-based forecasting. As such, this analysis plays a pivotal role in advancing intelligent fleet management practices grounded in empirical evidence.

This work makes three primary contributions: (i) the development of an integrated framework that combines descriptive statistics and clustering algorithms to extract operational patterns from fleet tire data, enabling a deeper understanding of this domain; (ii) the application of unsupervised clustering techniques (K-Means and BIRCH) to identify tire operational phases without prior labeling; and (iii) the characterization of tire-specific behaviors across different wheel positions, highlighting the importance of position-aware analysis to improve maintenance strategies, safety, and service life. Collectively, these contributions demonstrate the potential of integrating Internet of Things (IoT) technologies with machine learning approaches for intelligent tire management.

This paper is organized as follows: the next section reviews the state-of-the-art in tire wear modeling and the application of machine learning to tire data. Subsequently, unsupervised learning techniques for clustering tasks are introduced. The following section presents the proposed methodology, which encompasses a descriptive analysis of the data and the application of K-Means and BIRCH, followed by Hierarchical Agglomerative Clustering, algorithms to group tires based on similarity. The experimental results are then presented and their respective discussions are provided. Finally, the paper concludes with a summary of the main findings and future perspectives.

Literature review

This section provides a comprehensive overview of existing research on smart tire technology, focusing on two key domains: the development of tire wear models and the diverse applications of machine learning for tire analysis. The aim is to contextualize the present study within the broader academic landscape by examining the state-of-the-art and identifying current research gaps.

Tire wear models

Accurately predicting tire wear is a critical aspect of smart tire technology, with ongoing research dedicated to improving model fidelity under real-world conditions.^9,10 The literature reveals two primary streams of investigation: one centered on physics-informed models and another on data-driven methods for real-time monitoring.

The first stream focuses on developing comprehensive, physics-based, and hybrid models. A notable example is the work of Sakhnevych and Genovese,¹⁰ who presented an integrated model unifying adhesive and hysteretic friction by incorporating rubber viscoelastic properties, road roughness, and thermodynamic states. Their theoretical work was complemented by an experimental methodology using camera-assisted observation of micro-damage, which broadened the understanding of abrasion processes by highlighting the critical role of material relaxation. Similarly, Napolitano Dell’Annunziata et al.¹¹ proposed a hybrid model that combines physical principles with statistical analysis to create a more robust predictive methodology than purely phenomenological approaches. Further blending these techniques, Kim et al.¹² integrated finite element analysis with a one-dimensional convolutional neural network (CNN), demonstrating how physics-based simulations can deepen the understanding of wear dynamics and improve predictive accuracy.

In parallel, a second research stream has prioritized data-driven and sensor-based methods for adaptable, real-time applications. Seeking to dispense with complex physical models, Han et al.¹³ developed a machine learning system that uses acceleration signals from an intelligent tire to train a deep learning model for wear detection. This approach is highly viable for practical, on-the-fly monitoring. Focusing on sensor integration, Chang et al.¹⁴ engineered a solution combining three-axis and Hall sensors to estimate vehicle mileage and tire wear with over 99% accuracy. Its high precision and low power consumption make it a promising method for embedded intelligent tire systems.

While these approaches have advanced the field, a persistent gap remains in integrating heterogeneous fleet data and validating models under the full spectrum of real-world operating conditions. To address this gap, the present study shifts the focus from direct wear prediction to a more foundational analysis of tire operational behavior. Instead of predicting a final wear value, our primary objective is to apply clustering models to identify and characterize the distinct operational stages a tire naturally undergoes during normal service. This approach provides a crucial contextual layer – defining what a tire is doing at any given moment – that is essential for developing the next generation of granular and context-aware wear models.

Applications of machine learning for tires analysis

The increasing availability of tire data, often facilitated by Internet of Things (IoT) technologies, has spurred significant research into the application of machine learning (ML) for tire analysis. ML algorithms provide powerful tools for pattern identification, prediction, and optimization, addressing a range of tire-related challenges.¹⁵ Recent contributions can be broadly grouped by their primary objective: monitoring tire health through indirect means, predicting tire life and manufacturing quality, and estimating real-time dynamic states.

A prominent research trend involves the indirect monitoring of tire health, which leverages existing vehicle signals to infer tire conditions without dedicated sensors. Wei et al.,¹⁶ for instance, developed a low-cost framework that detects pressure loss by extracting features from wheel-speed signals, achieving 96.18% accuracy with a support vector machine (SVM). In a similar vein, Svensson et al.¹⁷ created axle-specific classifiers to detect both incorrect pressure and insufficient tread depth across a vehicle fleet. These sensor-less approaches are complemented by studies like that of Rahman et al.,¹⁸ who demonstrated that supervised algorithms like XGBoost could robustly classify tire condition with over 95% accuracy using simple physical measurements of tread-depth and pressure.

Beyond real-time monitoring, another critical application area is the prediction of tire longevity and the analysis of manufacturing quality. Zhu et al.¹⁹ integrated image processing with ML to estimate tire life by classifying texture features from images of tires at various wear levels. In the context of manufacturing, Biantoro and Hernadewita²⁰ applied the unsupervised K-means clustering algorithm to identify sources of quality issues from Radial Run Out data. Their approach successfully linked defect clusters to specific manufacturing processes, enabling targeted improvements. This study is particularly notable for its use of unsupervised learning for root-cause analysis.

In a more dynamic context, other researchers have focused on the real-time estimation of tire-road forces and slip parameters using intelligent tire systems. Xu et al.²¹ developed a slip ratio estimation model using accelerometer data from the tire’s inner liner, finding that vertical acceleration provided the most robust features. A subsequent study by the same authors extended this to estimate longitudinal, lateral, and vertical forces with high accuracy.²² To overcome the challenge of extensive data collection, Strano et al.²³ proposed a novel framework that uses a physics-based model to generate large virtual datasets for training supervised algorithms, which can then predict tire forces from real-time sensor data.

The reviewed literature demonstrates that supervised machine learning is a well-established and effective tool for predictive tasks in tire analysis, such as estimating pressure loss,¹⁶ predicting wear,¹⁹ and estimating forces.²² However, there has been less focus on applying unsupervised learning for the exploratory analysis of tire behavior from real-world operational data. While Biantoro and Hernadewita²⁰ successfully used clustering to analyze manufacturing data, its application to in-service tire performance remains largely unexplored. Therefore, the present study addresses this gap by leveraging an unsupervised, data-driven methodology not to predict a predefined variable, but to discover and characterize the distinct operational stages of a tire throughout its service life.

Methodological background

This section provides the methodological background for the unsupervised machine learning techniques employed in this study. It introduces the fundamental principles of data clustering and details the specific algorithms applied: K-Means and BIRCH followed by Agglomerative Clustering. Furthermore, it describes the internal validation metrics, such as the silhouette score and the Davies-Bouldin score, used to evaluate the quality and determine the optimal number of clusters in the subsequent analysis.

Unsupervised machine learning for clustering task

Unsupervised machine learning usually handles two main problems: clustering and dimensionality reduction. Especially for data clustering, the idea is to use algorithms to analyze and cluster unlabeled datasets without human intervention. There are several clustering algorithms, for example, K-Means Clustering,^8,24 Hierarchical Clustering,^25,26 and BIRCH.²⁷

The selection of K-Means and BIRCH algorithms for this study was influenced not only by their effectiveness in handling unlabeled data but also by their computational efficiency for potential real-time deployment scenarios.

The computational complexity of the K-means algorithm is $O (ndki)$ , where $n$ represents the number of data points, $d$ the dimensionality of the data, $k$ the number of clusters, and $i$ the number of iterations until convergence.²⁸ It is linear in $n$ for fixed $d$ , $k$ , and $i$ . BIRCH is particularly well-suited for large-scale and streaming data due to its incremental Clustering Feature Tree structure, which allows single-pass data processing with near-linear time and memory complexity while avoiding storage of the entire dataset.²⁷

K-Means is a clustering algorithm that divides a data set into distinct groups, where each data point is assigned to the cluster closest to its centroid, representing the cluster’s center.²⁹ K-Means algorithm’s basic idea can be described in four fundamental steps³⁰: definition of the number of clusters ( $k$ ), initialization, relocation, and update. For initialization, the algorithm randomly selects $k$ initial centroids of each group based on the data points. In the relocation step, each point is associated with the cluster whose centroid is closest, using, for example, the Euclidean distance as a measure of proximity. Moving on to the update step, the centroids of each cluster are recalculated, taking the mean of the points that belong to it. This mean point then becomes the new centroid of the cluster. These steps are repeated until there are no more changes in the centroids or until a limit of iterations is reached. The result is a set of $k$ -clusters represented by their respective centroids.²⁹

The K-Means’ often challenge is determining the ideal number of clusters, represented by $k$ . This value refers to the limit at which adding more clusters does not significantly improve the variation explained by each group. One way to determine this value is through the Elbow Method. The procedure involves running the K-Means algorithm varying the $k$ value, ranging from 1 to a maximum stipulated value. At each iteration, the sum of the quadratic distances (Within Cluster Sum of Square – WCSS) between each data point and the centroid of the cluster to which it was assigned is calculated for each value of $k$ .³¹

Another two approaches to determine the optimal number of clusters $k$ are the silhouette score and the Davies-Bouldin score. Both are internal metrics used to evaluate clustering quality, meaning they assess how well the data have been clustered without needing ground truth labels. However, they differ in formulation and interpretation.

Silhouette score evaluates how similar a data point is to its own cluster (cohesion) compared to other clusters (separation). The score ranges from $- 1$ to $1$ , where a higher value indicates better-defined clusters.³²

For each piece of data $i$ , the silhouette coefficient $s (i)$ is calculated as equation (1), where $a (i)$ is the average distance between $i$ and all other data belonging to the same cluster and $b (i)$ is the minimum average distance from $i$ to all data in any other cluster, of which $i$ is not a member.

s (i) = \frac{b (i) - a (i)}{\max {a (i), b (i)}}

(1)

To evaluate the overall quality of the clustering, the average of the silhouette coefficients of all points is calculated according to equation (2).

Silhouette score = \frac{1}{N} \sum_{i = 1}^{N} s (i)

(2)

Davies-Bouldin score measures the average similarity between each cluster and its most similar one, based on intra-cluster dispersion and inter-cluster separation. It then penalizes high intra-cluster distances (loose clusters) and low inter-cluster distances (clusters too close).³³ Equation (3) presents how this score is calculated, where $s_{i}$ is the average distance of all points in cluster $i$ to the centroid of $i$ ; $s_{j}$ is the average distance of all points in cluster $j$ to the centroid of $j$ ; $d_{ij}$ is the distance between the centroids of clusters $i$ and $j$ ; and $k$ is the number of clusters.

Davies - Bouldin score = \frac{1}{k} \sum_{i = 1}^{k} \max_{j \neq i} (\frac{s_{i} + s_{j}}{d_{ij}})

(3)

Another algorithm used for clustering is the Hierarchical Clustering. This method creates a hierarchy of clusters, where clusters are grouped into more significant subgroups, forming a tree structure or dendrogram. Hierarchical clustering methods can be broadly categorized into divisive and agglomerative. For divisive clustering, the main question is how to select a cluster for the next splitting procedure according to dissimilarity and how to split the selected cluster. For the agglomerative approach, clustering begins with each cluster containing one object. It recursively merges the two most similar clusters in terms of the similarity measure until all objects are included in a single cluster. Although both methods yield a dendrogram representing the data’s hierarchical structure, the clustering results can vary significantly based on the similarity or dissimilarity measure applied.³⁴

One key point of the Agglomerative method³⁵ is the similarity measure used to select the two most similar clusters for the next merge. Many agglomerative clustering algorithms have been proposed in terms of the different ways in which the similarity measure is defined. The merging of clusters is usually done by using linkage algorithms based on distance between clusters. There are four important types of linkage algorithms: ward, single, average, and complete. ^26,36

While it yields good results, the Agglomerative Hierarchical Clustering is not scalable due to its computational and memory complexity. The primary limitation arises from the need to compute and store a pairwise distance matrix between all data points. In practice, this means that as the dataset grows, the number of required computations increases quadratically. For large datasets, this results in prohibitive memory usage and excessively long processing times, making hierarchical clustering impractical without significant approximations or subsampling.

To address this limitation, BIRCH (Balanced Iterative Reducing and Clustering using Hierarchies)²⁶ was introduced as a scalable alternative that combines the strengths of hierarchical clustering with efficient data summarization. BIRCH incrementally builds a compact tree structure known as a Clustering Feature Tree (CF Tree), which stores small groups of similar data points, called subclusters, along with summary statistics such as the number of points, linear sums, and squared sums.

These subclusters can either serve as the final clustering result or be used as input to another clustering algorithm, such as Agglomerative Clustering, for further refinement. The final clustering configuration, including the optimal number of clusters, can be evaluated using internal validation metrics such as the silhouette score.

Proposed methodology and dataset description

This work aims to present methods to manage and analyze tires based on real data. To this end, the Fleet Tire Metrics dataset³⁷ was used. This dataset is comprised over $~ 17, 000$ synchronized records collected at $1 Hz$ and contains features on tire Pressure (PSI) and Temperature (°C), as well as Vehicle travel speed (km/h) and Timestamp of the acquired data. Data were collected during multiple trips, over a 30-day period, along a designated route between the cities of São Paulo and Presidente Prudente, both located in the state of São Paulo, Brazil, using a fleet bus equipped with six monitored tires, model $295 / 80 R 22.5 FR 01$ . Tire pressure and temperature measurements were recorded using Schrader TPMS (Tire Pressure Monitoring System) sensors – model TMS5.

The tires were labeled as FR (Front Right), FL (Front Left), Rear Right Inner (RRI), Rear Left Inner (RLI), Rear Right Outer (RRO), and Rear Left Outer (RLO), which indicates the position of the tires, according to Figure 1. The dataset also includes an attribute labeled Old Data, which indicates whether a record is newly acquired ( $False$ ) or a repetition of the previous one ( $True$ ). For the purposes of both descriptive and clustering analysis in this study, only records marked $False$ , representing new data entries, were considered.

Figure 1.

Bird’s-eye view of the position of tires on the fleet bus during data acquisition. FR stands for Front Right, FL for Front Left, RRI for Rear Right Inner, RLI for Rear Left Inner, RRO for Rear Right Outer, and RLO for Rear Left Outer.

External factors such as weather and road conditions were not included because no corresponding data was available. Moreover, as the vehicle is part of a bus fleet, its load conditions can be assumed to remain practically constant during data acquisition.

The internal temperature of the tire is particularly important, as it determines the aging rate of the tire, with higher temperatures accelerating aging and consequently reducing the product’s useful life.³⁷ Thus, analysis that takes temperature into account can provide fleet managers with the information needed to decide whether to discard or retread, for instance, a tire.

The methodology was divided into two phases. First, a descriptive analysis was carried out to enhance understanding of the data. In the second phase, K-Means and BIRCH algorithms were employed to explore the usage cycles of a typical fleet tire: the objective was to identify the number of operational stages that emerge during normal service. The quantity of clusters that exist for each configuration was assessed with internal validation metrics (silhouette and Davies–Bouldin score), and, exclusive for K-Means, the Elbow Method was also used. External validation metrics were not employed in this work because the dataset does not contain labeled data.³⁸

Descriptive analysis was performed using statistical information, variable distributions, potential correlations, and temporal behavior of the data. Due to different units and scales among the variables, data standardization was applied when appropriate, depending on the specific requirements of each analysis.

The clustering analysis was initially conducted on the entire dataset, and subsequently applied to each tire individually, according to its position on the vehicle. In this work, BIRCH result was always refined by using its subclusters as input for the Agglomerative Clustering (BIRCH followed by Agglomerative Clustering).

Several tools were employed to implement the algorithms. Python was used as the programming language, as well as several libraries, such as Pandas³⁹ for data processing, NumPy⁴⁰ for vectorization support, Matplotlib,⁴¹ Seaborn,⁴² and Plotly⁴³ for graphs, and Scikit-Learn³⁶ for standardization, metrics, and machine learning algorithms.

While the Fleet Tire Metrics dataset comprises time-series data with inherent temporal structure, our methodological approach to the clustering process deliberately treats individual sensor readings as independent observations. This design decision is justified by several theoretical and practical considerations:

Limited temporal window of the dataset: the data acquisition period of approximately 30 days (around 20,000 km) is relatively short to robustly capture temperature and pressure variations arising from changes in the operational state of fleet vehicle tires. Therefore, a 30-day window may be sufficient for preliminary analysis, but longer periods may be necessary to observe significant trends in the tires’ behavior.

Operational state versus temporal transitions: The primary objective of this study is to identify distinct operational states that tires experience during normal service, rather than modeling the temporal transitions between these states. Each sensor reading (pressure, temperature, and speed) represents a snapshot of tire operational conditions that can be meaningfully categorized regardless of its temporal context. This approach aligns with the goal of developing position-aware tire analytics for fleet management applications.

Real-time deployment considerations: For practical implementation in intelligent tire management systems, the ability to classify individual sensor readings into operational categories without requiring historical context enables real-time decision-making. This independence assumption allows immediate anomaly detection and operational state identification without maintaining temporal buffers or complex state transition models.

Computational efficiency: The independent observation approach significantly reduces computational complexity from $O (n^{2})$ for temporal sequence analysis to $O (n)$ for static clustering, making the methodology suitable for embedded automotive systems with limited processing capabilities.

For both K-Means and BIRCH with Agglomerative Clustering, it is necessary to set initial parameters. Therefore, the following parameters were used for the K-Means:

random state = $100$ – to enable reproduction of results.

init = k-means++– ensuring that the initialization method is k-means++ to speed up convergence (default value in scikit-learn).

max_iter = $300$ – stipulating a maximum of $300$ iterations (default value in scikit-learn).

n_cluster = $k$ , where $k$ will be found during the analysis.

For BIRCH with Agglomerative Clustering, the parameters were as follows:

threshold = $t$ , where $t$ will be found during the analysis – it is the radius of the subcluster formed by merging the new sample with its nearest subcluster must remain below the defined threshold.

linkage = ward– to ensure the formation of well-defined and distinct clusters – Agglomerative Clustering parameter.

n_cluster = $k$ , where $k$ will be found during the analysis.

Experiments and results

This section presents the experimental procedures and results of the study. Initially, a descriptive analysis was conducted to explore the statistical properties, distributions, and relationships among the variables, providing a comprehensive understanding of the dataset. Following this, unsupervised machine learning techniques were applied to identify patterns and groupings among the tires based on their usage cycles. The goal was to uncover the number of clusters that exist during a ordinary usage of fleet tire, so it could support data-driven strategies for tire management and maintenance.

Descriptive analysis

The first experiments were conducted to better understand the behavior of the explored relationships between the variables involved. Figure 2 shows the evolution of the variables over time. Due to significant fluctuations in vehicle speed, a moving average of $50$ measures was employed to smooth the data, facilitating the creation of clearer graphs and more accurate analysis. This value was chosen because it allows for analysis with less noise and this window size (50 s) does not introduce significant phase delay for vehicle speed.

Figure 2.

Evolution of the observed variables over time considering all tires’ position. Vehicle speed is the moving average of $50$ measures.

It can be seen that there is a correlation between the measurements; however, physical quantities like temperature (°C), pressure (PSI), and vehicle speed (km/h) have different units and scales, which can make the analysis difficult and tricky. Therefore, a standardization of the data was performed by removing the mean and scaling to unit variance, known as the z-score normalization,⁴⁴ $z = (x - μ) / σ$ , where $x$ is the sample, $μ$ is the mean of the feature, and $σ$ is the standard deviation. Thus, the evolution over time after standardization is shown in Figure 3.

Figure 3.

Evolution of the observed variables over time after standardization considering all tires’ position. Vehicle speed is the moving average of $50$ measures.

The strong correlation between pressure and temperature was expected based on the ideal gas law, often written as $pV = nRT$ . It is also known that vehicle speed contributes to the increase of temperature, because it increases the frequency of the deflection cycle and centrifugal force, which changes the stresses and strains, leading to heat accumulation.⁴⁵ Thereby, Pairwise Pearson Correlation was used to measure the similarity between the data, as shown in Figure 4, where a high Pearson correlation coefficient of $0.98$ can be seen between these variables.

Figure 4.

Pairwise Pearson correlation.

The subsequent analysis was performed without incorporating the temporal dimension, considering the three variables jointly. The results indicate a clear linear relationship between Temperature and Pressure, which becomes evident in the 3D plot shown in Figure 5. This consistent relationship is further explored through the pairwise interactions among the three variables in Figure 6. Additionally, Figures 7 to 9 provides a closer examination of these relationships using raw data (non-standardized). Figure 7 shows that the most frequently observed tire temperature during operation is approximately $50$ °C, while the most common pressure, as illustrated in Figure 8, is around $137 PSI$ . The overall mean temperature is $47.45 \pm 8.41$ °C, while the mean pressure is $135.13 \pm 4.93 PSI$ . For vehicle speed, the mean value is $66.32 \pm 32.58 km / h$ . The corresponding median values are, respectively, $49.00$ °C, $135.61 PSI$ , and $83.00 km / h$ .

Figure 5.

All tires’ positions together: Pressure versus Temperature versus Vehicle speed (standardized data).

Figure 6.

Pairwise relationship between the three variables (standardized data).

Figure 7.

Tire temperature distribution – all tires’ positions together.

Figure 8.

Tire pressure distribution – all tires’s positions together.

Figure 9.

Vehicle speed distribution.

A Q–Q (Quantile–Quantile) plot⁴⁶ was employed (Figure 10) to visually assess whether Temperature and Pressure data follow a normal distribution. Since the data points align closely with the reference diagonal line, it suggests that both variables approximate a normal distribution.

Figure 10.

Q–Q plot for normality check of tire Temperature and Pressure.

After analyzing the overall distributions across all tire positions, the study proceeded to examine the distributions for each specific tire position on the vehicle. Figures 11 and 12 present box plots of Temperature and Pressure, respectively, for each tire position. Box plot graph summarizes the distribution of the dataset, showing the median, quartiles, minimum and maximum values, and potential outliers. An in-depth examination of the plots reveals that the tires in the RLO and RRI positions exhibit slightly lower temperatures – median value around $45$ °C compared to approximately $50$ °C of the other positions – and, consequently, lower pressures, while RRO exhibit slightly higher temperatures and, consequently, higher pressures.

Figure 11.

Temperature by tire position in the vehicle.

Figure 12.

Pressure by tire position in the vehicle.

Variations in the internal air temperature of the tires are primarily driven by differences in the heat dissipation mechanism, which is influenced by material properties, operational conditions, and contact characteristics.⁴⁸ These differences can also be affected by the performance and action of the braking system.^48,49

In this case study, these three positions are the most distinct from the others, a pattern that is supported by the computation of effect sizes for each tire position, when compared to all other positions. Table 1 presents the Cohen’s $d$ effect sizes⁵⁰ for both Temperature and Pressure, computed by comparing each tire position against the distribution formed by all remaining positions (excluding the one being compared). As per Cohen’s conventions, absolute values around $0.2$ , $0.5$ , and $0.8$ are typically interpreted as small, medium, and large effects, respectively.

Table 1.

Cohen’s $d$ effect size comparing each tire position against all others (excluding the group itself).

	All other positions (disregarding the one being compared)
Tire position	d for Temperature	d for Pressure
RLI	0.1816	0.3476
FR	0.0463	0.0400
RLO	−0.7533	−0.8752
RRO	0.4361	0.4216
RRI	−0.4738	−0.3276
FL	0.1480	0.0176

Boldface values indicate the negative and positive extremes

Notably, the RLO position shows a large negative effect for both temperature $(d = - 0.75)$ and pressure $(d = - 0.88)$ , indicating that it consistently exhibits lower values than other positions. Similarly, RRI also shows moderate to large negative effects. In contrast, RRO exhibits moderate positive effects for both variables, suggesting higher values relative to other positions. RLI displays small to moderate effects, while FR and FL show negligible effect sizes. Due to the large size of the dataset, statistical significance between groups reflected in extremely small $p$ -values, even when the actual differences are minimal, therefore only Cohen’s $d$ effect sizes were used.

Unsupervised machine learning for clustering

The experiments conducted in this section were applied to identify patterns and groupings among the tires. The goal is to uncover the optimal quantity of groups that can be formed using the available data and, with this, to understand the number of cycles the fleet tire has during ordinary usage. To this end, the experiments were conducted considering the temperature, pressure of the tires, and vehicle speed. All the data used by the models of this section were first standardized.

As observed in the statistical analysis, tire temperature and pressure tend to vary depending on the tire position. This suggests that a greater number of clusters may emerge when all tire positions are analyzed together, as the overall data becomes more dispersed.

All tires’ positions together

In this experiment, all tire positions were analyzed together for each of the models considered: K-Means and BIRCH with Agglomerative Cluster.

K-Means

It was used with the number of clusters $k$ ranging from 2 to 14. The Elbow Method was applied to determine the optimal value of $k$ . The resulting graph is shown in Figure 13, where a noticeable inflection point suggests that the optimal number of clusters likely falls within the range of $4$ – $6$ . Additionally, the silhouette score was employed as an alternative metric to support the selection of the optimal $k$ . Figure 14 illustrates these results, showing a peak silhouette score at $k = 3$ . Since there was no exact agreement between the best number of clusters between the two analysis, $k = 4$ stands out as a consistent choice, as it corresponds to the third-highest silhouette score and also aligns with a distinct inflection point observed in Figure 13.

Figure 13.

Elbow method for number of cluster analysis.

Figure 14.

Silhouette score for different number of clusters found by K-Means.

Although $k = 4$ , four clusters, appears to be the most suitable choice based on Figures 13 and 14, the optimal number of clusters was not perfectly consistent between the two plots. To support a more conclusive determination of $k$ , a detailed silhouette analysis was performed by calculating the silhouette coefficient for each data point in the dataset. Figure 15 shows the silhouette plot for $k$ ranging from $2$ to $5$ . The red vertical lines represent the silhouette score, which corresponds to the average score across all samples. It is possible to verify that $k = 2$ and $k = 3$ are not good choices as they exhibit significant imbalance in cluster sizes, where group $0$ (zero) shows greater thickness. Thus, the decision here is more likely to be between $4$ or $5$ clusters or groups.

Figure 15.

Silhouette analysis of K-Means clustering considering all tire positions together.

Finally, the analysis was performed using the Davies–Bouldin score (Figure 16), which yielded its lowest value (and therefore the best result) for $k = 4$ . Based on the combined evaluation of all metrics and analyzes, $k = 4$ was identified as the most appropriate number of clusters when K-Means was adopted for all tires’ positions being analyzed collectively.

Figure 16.

Davies-Bouldin score for different number of clusters using K-Means.

The application of the K-Means clustering algorithm with $k = 4$ , followed by segmentation by tire position, resulted in the distribution shown in Figure 17. It is possible to notice that RRO stands out as the most distinct position, being strongly associated to cluster $3$ ( $red$ ). Front tires (FL and FR) share a similar cluster distribution, indicating that they can be treated as a functional group.

Figure 17.

Clusters found by K-Means per tire position for $k = 4$ .

Overall, the clustering results reinforce that tire position has a significant influence on the analyzed variables (e.g. Temperature and Pressure), which supports the decision to conduct a position analysis over the quantity of clusters.

BIRCH with Agglomerative Cluster

The second applied method was the BIRCH with Agglomerative Cluster. It was studied with the number of clusters $k$ ranging from $2$ to $9$ , and the threshold parameter tested over the set ${0.5, 1.0, 1.5, 2.0}$ .

To support the selection of the optimal number of clusters ( $k$ ) and $threshold$ , the silhouette score was employed as an evaluation metric. Figure 18 displays the corresponding results, with the top six scores indicated by the red horizontal line. In the same way the analysis was performed with K-Means, a detailed silhouette analysis was also performed with BIRCH. Figure 19 shows the silhouette plot for the top six scores: $k = 2$ with $threshold = 1.0$ ; $k = 2$ with $threshold = 2.0$ ; $k = 3$ with $threshold = 1.5$ ; $k = 2$ with $threshold = 1.5$ ; $k = 4$ with $threshold = 1.5$ ; and $k = 4$ with $threshold = 1.0$ .

Figure 18.

Silhouette score for different number of clusters and thresholds found by BIRCH with Agglomerative Cluster.

Figure 19.

Silhouette analysis for BIRCH with Agglomerative Cluster.

The configuration with $k = 2$ with $threshold = 1.0$ yielded the highest average silhouette score $(~ 0.55)$ , indicating strong overall cluster cohesion and separation. However, this configuration appears suboptimal in practice, as it suffers from severe cluster imbalance, since nearly all samples are assigned to cluster $0$ ( $Zero$ ).

Alternatively, the configuration with $k = 4$ and the same $threshold = 1.0$ provided a reasonable balance between cluster quality and granularity, with a silhouette score around $0.42$ . The visual inspection of the silhouette plots confirms that the clusters in this configuration are relatively well defined and balanced. Therefore, although $k = 2$ offers the best score, $k = 4$ solution with $threshold = 1.0$ is preferred for capturing the internal variability of the data without a substantial loss in clustering quality.

Then, the analysis was performed using the Davies–Bouldin score (Figure 20) across different values of $k$ and $threshold$ parameters. Among the tested configurations, the lowest Davies–Bouldin scores were achieved at $k = 2$ with $threshold = 1.0$ . However, the silhouette score showed that $2$ clusters present an imbalanced scenario. In contrast, $k = 4$ with $threshold = 1.0$ performs consistently well, achieving a relatively low score of $~ 0.73$ .

Figure 20.

Davies-Bouldin score for different number of clusters ( $k$ ) and $thresholds$ using BIRCH with Agglomerative Cluster.

The application of the BIRCH algorithm with a $threshold$ of $1.0$ , followed by Agglomerative Clustering with $k = 4$ segmented by tire position, resulted in the distribution presented in Figure 21. Cluster $0$ ( $blue$ ) dominates all positions, with the exception of RLO, indicating that a substantial portion of the dataset shares similar characteristics, possibly representing a baseline or stable operational condition. In the same way as with K-Means, the front tires (FL and FR) exhibit very similar distributions. In contrast, the rear tires reveal more variability. The overall result indicates that cluster $0$ ( $blue$ ) may represent normal or stable conditions, while clusters $1 - 3$ likely capture outliers, transitional states, or specific usage patterns. Cluster $2$ ( $green$ ) likely represents conditions with lower Temperature and Pressure compared to the other tire positions. This aligns with the fact that RLO was predominantly associated with Cluster $2$ , as it exhibited the lowest Temperature and Pressure values based on the Cohen’s $d$ analysis shown in Table 1.

Figure 21.

Clusters found by BIRCH with $threshold = 1.0$ and Agglomerative Clustering with $k = 4$ per tire position.

Individual tires’ positions

Although the all-together analysis – where all tire data is submitted to cluster algorithms regardless of the positions – can reveal general behavioral patterns and assist in unsupervised clustering, it may overlook critical specific dynamics due related to tire location, as each position (FR, FL, RRO, RRI, RLI, and RLO) may exhibit distinct dynamics that can be lost in an overall evaluation.

The study of this section was made with the same two algorithms considered in the last section: K-Means, and BIRCH followed by Agglomerative Clustering.

K-Means

The Elbow Method was initially applied, and the resulting plot is presented in Figure 22. To enable a consistent comparison of the inflection points across all tire positions, the WCSS of each curve was normalized using its corresponding value at $k = 1$ as the reference $(wcs s_{norm} = wcss [k] / wcss [1])$ . Similarly to the analysis with all tire positions together, the inflection point suggests that the optimal number of groups is approximately $4$ (red dashed line on graph), regardless of the position of the tire.

Figure 22.

Elbow method per tire position using normalized within-cluster sum of squares (WCSS).

The silhouette score was also used to support the selection of the optimal number of clusters. As shown in Figure 23, for all tire positions, $k = 2$ does not appear to be a good choice, due to the presence of clusters with below-average silhouette scores. At $k = 3$ , an imbalance in cluster sizes is evident across all positions except RLO. Both $k = 4$ and $k = 5$ present more balanced and well-separated clusters, making them strong candidates. To further refine the choice, the Davies–Bouldin score was analyzed, with results shown in Figure 24. This analysis indicates that for the front tires (FR and FL), the optimal number of clusters is $k = 5$ , while for the remaining positions, $k = 4$ yields the lowest Davies–Bouldin values.

Figure 23.

Silhouette analysis across tire positions using K-Means clustering with multiple values of $k$ .

Figure 24.

Davies-Bouldin score per tire position using K-Means.

This may mean that the front tires go through more wear steps in a normal usage cycle, as they can wear quickly than the rear ones. In general, the individual analysis of positions followed what was found in the analysis of all the tire positions together.

BIRCH followed by Agglomerative Clustering

The silhouette score was utilized to support the identification of the optimal number of clusters. The experiment was made with BIRCH $threshold$ equal to $1.0$ . As depicted in Figure 25, the configuration with $k = 2$ is not good across all tire positions, as it yields clusters characterized by below-average silhouette values and a marked imbalance in cluster sizes. At $k = 3$ , it is possible to notice several negative values for FR (in cluster 0 – $gray$ ) and RRO (in cluster 1 – $lightblue$ ), which indicates that data points are potentially assigned to the wrong clusters; furthermore, there are cases of cluster size imbalance, as in FL, RRI, RLI, and RLO. $k = 4$ presents slightly more balanced clusters, except for RRO, which presents many negative values in cluster 0 ( $gray$ ) and RRI with below-average silhouette values in cluster 1 ( $blue$ ). $k = 5$ also presents slightly more balanced clusters, except for FR and FL, which present many negative values in cluster 1 and RRI with below-average silhouette values in cluster 0.

Figure 25.

Silhouette analysis for BIRCH with $threshold = 1.0$ followed by Agglomerative Clustering per tire position with multiple values of $k$ .

The analysis of the Davies-Bouldin score across different tire positions, as shown in Figure 26, reveals notable variations in cluster compactness and separation as a function of the number of clusters ( $k$ ). Rear outer positions (RRO and RLO) exhibit a consistent improvement in clustering quality with increasing $k$ , reaching good performance at $k = 5$ . This suggests a higher degree of heterogeneity in these positions, potentially reflecting more complex wear patterns or usage conditions. According to Table 1, these two positions deviate the most from the average.

Figure 26.

Davies-Bouldin score per tire position using BIRCH with $threshold = 1.0$ followed by Agglomerative Clustering.

In turn, while FR had a distinct number of $4$ clusters, the score obtained for FL was increased from $k = 2$ to $k = 4$ , with a small drop at $k = 5$ . This suggests that no value of $k$ provides particularly good separation when BIRCH was used, but $k = 2$ has the smallest value and was selected.

Conclusion

This study presents a comprehensive and data-driven approach to smart tire management through the application of unsupervised machine learning techniques. Leveraging real-world data acquired from a sensorized fleet vehicle equipped with Tire Pressure Monitoring Systems (TPMS), the research highlights how clustering algorithms, specifically K-Means and BIRCH followed by Agglomerative Clustering, can be employed to identify recurring patterns in tire usage cycles without the need for labeled data.

The first phase of the analysis focused on descriptive statistics, revealing meaningful correlations among temperature, pressure, and vehicle speed. These variables exhibited significant variance across different tire positions, with rear outer tires (RRO and RLO) showing notably more dispersed values compared to other positions (Table 1). This positional dependency reinforces the need for individualized analysis rather than treating the tire system as a homogeneous entity.

In the second phase, the application of unsupervised learning enabled the segmentation of the tire usage data into distinct operational clusters. Internal validation metrics such as the silhouette score and Davies-Bouldin Index were systematically used to guide the selection of the optimal number of clusters. When analyzing all tire positions together, a configuration of four clusters consistently emerged as a suitable choice for both K-Means and BIRCH-based methods. For K-Means, the RRO position stands out as the most distinct (Figure 17), being strongly associated with cluster 3. In the BIRCH-Agglomerative approach, the RLO position was the only one with a majority of examples assigned to class 2 (Figure 21), whereas all other positions had most examples in class 0. This suggests that the RLO position may be considered an outlier relative to the others, and this insight could be extended to other tire cases to detect anomalies, for example.

Further refinement through individual tire analysis using K-Means revealed that front tires (FR and FL) tend to progress through more usage phases – indicated by a higher optimal number of clusters ( $k = 5$ ) – than their rear counterparts. This behavior is likely due to dynamic load shifts and steering demands typically experienced by front tires, particularly in urban driving conditions. In the same individual tire analysis using the BIRCH-Agglomerative method, the rear outer positions (RRO and RLO) exhibited better clustering also at $k = 5$ , suggesting a higher degree of heterogeneity in these positions, which may reflect more complex wear patterns.

The practical implications of these findings are significant. By enabling the identification of distinct operational states in tire behavior, the proposed approach lays the groundwork for real-time predictive maintenance, enhanced safety diagnostics, and more sustainable tire lifecycle management. Fleet managers and manufacturers can benefit from position-aware models that inform replacement schedules, load balancing strategies, and anomaly detection frameworks. Moreover, the methods presented here can be generalized to other vehicular components or systems where sensor-rich environments and unlabeled data dominate.

From a scientific perspective, the combination of clustering methods and internal validation metrics demonstrated here offers a robust methodology for extracting knowledge from complex, unlabeled datasets. The integration of BIRCH with Agglomerative Clustering proved particularly effective for the global analysis, when all tires’ positions were studied together as a single dataset. In turn, K-Means seems to be more valuable during the individual position analysis.

A current limitation of this work is the lack of end-of-life tire data, which would allow correlating the identified clusters with retreading levels and thus improve their practical interpretability.

Another limitation is that the analysis is restricted to a single vehicle type and sensor configuration; broader validation across diverse fleet compositions would strengthen generalizability. From a computational deployment perspective, while both K-Means and BIRCH demonstrate relatively low computational requirements (evidenced by their successful processing of our dataset with minimal resource demands), practical implementation in embedded tire systems requires addressing hardware constraints including processing power limitations, memory restrictions, and energy consumption optimization.

Future work should extend this research by: (i) integrating temporal modeling to capture transitions between operational states; (ii) incorporating derivative and fluctuation-based features, such as temperature rise rate and pressure fluctuation amplitude; (iii) exploring semi-supervised learning when limited labels become available; (iv) incorporating additional sensor modalities such as vibration and acceleration, and (v) developing optimized implementations for resource-constrained embedded systems. Moreover, deploying the proposed models in real-time fleet management systems would enable adaptive decision-making in vehicle safety and maintenance strategies, though practical recommendations for fleet management based on tire cluster migration patterns still require development.

This work underscores the transformative potential of combining IoT data acquisition with advanced machine learning techniques in the domain of automotive engineering. It not only enhances the understanding of tire behavior under real-world conditions but also provides practical tools to improve the efficiency, reliability, and sustainability of modern transportation systems.

Footnotes

Handling Editor: Chenhui Liang

ORCID iDs

Danilo Hernani Perico

Rafael Gomes Alves

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the Financiadora de Estudos e Projetos (FINEP), grant number 01.22.0262.00 (Ref. 1164/21), as part of the Rota 2030 program, and by the Ministério da Ciência, Tecnologia e Inovação (MCTI) and the Ministério da Economia. This publication was also financed by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) – Finance Code 001. All these funding agencies are based in Brazil.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References

Statista Research Department. Rubber industry worldwide, https://www.statista.com/topics/3268/rubber/ (2024).

Akhill

Vishnu

Nirav

, et al. Protecting of automotive vehicle tyre by maintaining the pressure in tyre through pressure monitoring system. Int J Sci Technol Eng 2018; 4: 232–235.

Visconti

Rausa

Del-Valle-Soto

, et al. Innovative driver monitoring systems and on-board-vehicle devices in a smart-road scenario based on the Internet of Vehicle paradigm: a literature and commercial solutions overview. Sensors 2025; 25(2): 562.

Kubba

Jiang

A comprehensive study on technologies of tyre monitoring systems and possible energy solutions. Sensors 2014; 14(6): 10306–10345.

Liu

Chen

, et al. Impact of vehicle type, tyre feature and driving behaviour on tyre wear under real-world driving conditions. Sci Total Environ 2022; 842: 156950.

Tamborski

Rojek

Mikołajewski

Revolutionizing tire quality control: AI’s impact on research, development, and real-life applications. Appl Sci 2023; 13(14): 8406.

Calhabeu

Hubmann

Perico

, et al. Fleet tire metrics, https://doi.org/10.7910/DVN/IWOWUI (2025, accessed 30 June 2025).

MacQueen

. Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley symposium on mathematical statistics and probability, Berkley, CA, 1967.

Braghin

Cheli

Melzi

, et al. Tyre wear model: validation and sensitivity analysis. Meccanica 2006; 41(2): 143–156.

10.

Sakhnevych

Genovese

Tyre wear model: a fusion of rubber viscoelasticity, road roughness, and thermodynamic state. Wear 2024; 542–543: 205291.

11.

Napolitano Dell’Annunziata

Adiletta

Farroni

, et al. Tire wear sensitivity analysis and modeling based on a statistical multidisciplinary approach for high-performance vehicles. Lubricants 2023; 11(7): 269.

12.

Kim

Park

Kim

Comparison of performance of predicting the wear amount of tire tread depending on sensing information. Sensors 2023; 23(1): 459.

13.

Han

Kwon

Lee

, et al. Experimental evaluation of tire tread wear detection using machine learning in real-road driving conditions. IEEE Access 2023; 11: 32996–33004.

14.

Chang

Juang

Huang

, et al. Estimation of tire mileage and wear using measurement data. Electronics 2021; 10(20): 2531.

15.

Jordan

Mitchell

TM.

Machine learning: trends, perspectives, and prospects. Science 2015; 349(6245): 255–260.

16.

Wei

Wang

, et al. A low-cost tire pressure loss detection framework using machine learning. IEEE Trans Ind Electron 2021; 68(12): 12730–12738.

17.

Svensson

Thelin

Byttner

, et al. Indirect tire monitoring system: machine learning approach. IOP Conf Ser Mater Sci Eng 2017; 252: 012018.

18.

Rahman

Kamal

Abdullah

, et al. Tire condition classification based on tread depth using machine learning. In: Proceedings of the 2022 IEEE 20th student conference on research and development (SCOReD), Bangi, Malaysia, 8–9 November 2022, pp.51–55. New York: IEEE.

19.

Zhu

Han

Wang

Automobile tire life prediction based on image processing and machine learning technology. Adv Mech Eng 2021; 13(3): 6878140211002727.

20.

Biantoro

Hernadewita . K-means clustering on quality of radial run out tires. IOP Conf Ser Mater Sci Eng 2021; 1034(1): 012122.

21.

Tang

Askari

, et al. Direct tire slip ratio estimation using intelligent tire system and machine learning algorithms. Mech Syst Signal Process 2022; 175: 109085.

22.

Askari

Huang

, et al. Tire force estimation in intelligent tires using machine learning. IEEE Trans Intell Transp Syst 2022; 23(4): 3565–3574.

23.

Strano

Terzo

Tordela

. A supervised machine learning framework for smart tires. In: Proceedings of the 2021 IEEE 6th international forum on research and technology for society and industry (RTSI), Naples, Italy, 6–9 September 2021, pp.364–369. New York: IEEE.

24.

Lloyd

Least squares quantization in PCM. IEEE Trans Inf Theory 1982; 28(2): 129–137.

25.

Johnson

SC.

Hierarchical clustering schemes. Psychometrika 1967; 32(3): 241–254.

26.

Patel

Sihmar

Jatain

. A study of hierarchical clustering algorithms. In: Proceedings of the 2015 2nd international conference on computing for sustainable global development (INDIACom), New Delhi, India, 11–13 March 2015, pp.537–541. New York: IEEE.

27.

Zhang

Ramakrishnan

Livny

. BIRCH: an efficient data clustering method for very large databases. In: Proceedings of the 1996 ACM SIGMOD international conference on management of data, Montreal, QC, Canada, June 1996, pp.103–114. New York: ACM.

28.

Nazeer

Kumar

Sebastian

. Enhancing the k-means clustering algorithm by using a O(n logn) heuristic method for finding better initial centroids. In: Proceedings of the 2011 second international conference on emerging applications of information technology, Kolkata, India, 19–20 February 2011, pp.261–264. New York: IEEE.

29.

Jin

Han

K-means clustering. In: Sammut

Webb

(eds) Encyclopedia of machine learning. Springer US, 2010, pp.563–564.

30.

Pei

Zhong

, et al. A self-error corrector integrating K-means clustering with Markov model for marine craft maneuvering prediction with experimental verification. Ocean Eng 2023; 285: 115420.

31.

Humaira

Rasyidah

Determining the appropiate cluster number using elbow method for k-means algorithm. EAI, 2018.

32.

Rousseeuw

PJ.

Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 1987; 20: 53–65.

33.

Davies

Bouldin

DW.

A cluster separation measure. IEEE Trans Pattern Anal Mach Intell 1979; PAMI-1(2): 224–227.

34.

Ran

, et al. Comprehensive survey on hierarchical clustering algorithms and the recent developments. Artif Intell Rev 2023; 56(8): 8219–8264.

35.

Zepeda-Mendoza

Resendis-Antonio

Hierarchical agglomerative clustering. In: Dubitzky

Wolkenhauer

Cho

, et al. (eds) Encyclopedia of systems biology. Springer New York, 2013, pp.886–887.

36.

Pedregosa

Varoquaux

Gramfort

, et al. Scikit-learn: machine learning in Python. J Mach Learn Res 2011; 12: 2825–2830.

37.

Altman

Beutler

Kohler

JT.

Erroneous or Arrhenius—potential impact of oven temperature variations on laboratory aging of tires. Tire Sci Technol 2011; 39(2): 79–94.

38.

Gutierrez-Portela

Almenares Mendoza

Calderón-Benavides

Evaluation of the performance of unsupervised learning algorithms for intrusion detection in unbalanced data environments. IEEE Access 2024; 12: 190134–190157.

39.

McKinney

Data structures for statistical computing in Python. In: van der Walt

Millman

(eds) Proceedings of the 9th Python in science conference. 2010, pp.56–61.

40.

Harris

Millman

van der Walt

, et al. Array programming with NumPy. Nature 2020; 585(7825): 357–362.

41.

Hunter

JD.

Matplotlib: a 2D graphics environment. Comput Sci Eng 2007; 9(3): 90–95.

42.

Waskom

ML.

Seaborn: statistical data visualization. J Open Source Softw 2021; 6(60): 3021.

43.

Plotly Technologies Inc. Collaborative data science, https://plot.ly (2015, accessed 30 June 2025).

44.

Han

Kamber

Pei

Data mining: concepts and techniques. 3rd ed. Morgan Kaufmann, 2012.

45.

Gent

Walter

(eds). The pneumatic tire. Report no. DOT HS 810 561, U.S. Department of Transportation, National Highway Traffic Safety Administration, 2006.

46.

Dodge

Q-Q Plot (Quantile to Quantile Plot). In: Dodge

(ed.) The concise encyclopedia of statistics. Springer New York, 2008, pp.437–439.

47.

LaClair

TJ.

Rolling resistance. In: Gent

Walter

(eds) The pneumatic tire. National Highway Traffic Safety Administration, U.S. Department of Transportation, 2005, p.508.

48.

Costa

ALA

Natalini

Inglese

, et al. Tire bead overheating in urban buses and trucks using drum brake systems. Tire Sci Technol 1998; 26(1): 51–62.

49.

Nguyen

Do Cong

Dinh

Rolling tires on the flat road: thermo-investigation with changing conditions through numerical simulation. Appl Sci 2023; 13(8): 4834.

50.

Cohen

Statistical power analysis for the behavioral sciences. 2nd ed. Lawrence Erlbaum Associates, 1988.

Toward smart tire management: A data-driven unsupervised learning approach

Abstract

Keywords

Introduction

Literature review

Tire wear models

Applications of machine learning for tires analysis

Methodological background

Unsupervised machine learning for clustering task

Proposed methodology and dataset description

Experiments and results

Descriptive analysis

Unsupervised machine learning for clustering

All tires’ positions together

K-Means

BIRCH with Agglomerative Cluster

Individual tires’ positions

K-Means

BIRCH followed by Agglomerative Clustering

Conclusion

Footnotes

ORCID iDs

Funding

Declaration of conflicting interests

References