Abstract
The early recognition of wheel wear is an important task to the safe and efficient operation of a railway network. This article presents a new dictionary learning approach for wheel condition monitoring based on an adaptive parametric algorithm of blind source separation and extending K-means and singular value decomposition algorithm. Numerical simulations confirm the effectiveness of the proposed method. An experiment of wheel condition monitoring is conducted using a JD-1 wheel/rail simulation facility. Data calculation and theoretical analysis of wheel–rail contact dynamic show that the proposed method can adaptively learn and accurately identify wheel defects and verify the performance of the proposed method.
Introduction
Condition monitoring techniques of railway wheel wear have been researched globally for many years.1,2 Broadly speaking, there are two types of monitoring system: infrastructure-based monitoring and rolling-stock-based monitoring. Infrastructure-based monitoring equipment, such as acoustic and vision sensors, is mainly set up at specific sections, and measurement speed is generally lower than the normal train speed.3,4 This article examines rolling-stock-based monitoring based on vibration measurements made by accelerometers which detect wheel defect information produced by the impact of the wheel and rail. Vibration measurement is an important approach which is based on the idea that the wheel has specific vibration signatures under standard conditions and the signature changes with the development of the wheel wear, and the possible existing wheel defects can be recognized by signal processing techniques. 5
However, signal analysis of wheel wear requires further improvement. It is a hard work to recognize wheel faults in the beginning stage, although eventually which can be detected by static measurement devices. A significant challenge is that the observation signal of wheel–rail interaction due to wheel wear is the mixture of the wear impact response signal and the regular wheel–rail contact vibration signal which overlap in time domain and frequency domain. 6 Dictionary learning is an attractive tool in decomposing mixed signal components whose locations in time and frequency vary widely. This article presents a new dictionary learning approach for wheel condition monitoring.
During the last decade, dictionary learning has attracted a lot of attention in signal processing, machine learning, and compressive sensing of audio and visual data. 7 The goal of dictionary learning is to decompose an observation signal into a linear expansion of some analytical bases, called atoms, that are selected from a large and, in general, redundant family of functions called a dictionary. 8 Two of the most well-known dictionary learning algorithms are the method of optimal directions (MOD) 9 and the K-means and singular value decomposition algorithm (K-SVD). 10 Some approaches have been introduced to approximate the desired solution, such as greedy pursuit methods which iteratively refine the current estimate for the coefficient vector by modifying one or several coefficients chosen to yield a substantial improvement in approximating the signal. The relaxation methods also approximate the desired solution by replacing the combinatorial problem with a tractable one. 11 Greedy pursuit methods include orthogonal matching pursuit (OMP), 12 compressive sampling matching pursuit, 13 subspace pursuit, 14 and so on. Iterative shrinkage-thresholding, 15 smoothed norm, 16 and interior-point methods 17 are some examples of the relaxation methods.
Effective study of these algorithms in recent years has established when the sought solution is sparse enough on a prespecified set of dictionary atoms: wavelets, Gabor bases, and more. The success of such dictionaries in applications depends on how suitable they are to sparsely describe the signals in question. Sparsity of observation signals is a prerequisite for classical dictionary learning methods. It can be very challenging as the signals are non-sparse in their current domain or transform domain.
Attempting to address the non-sparse problem, this article considers a different route for designing dictionary based on adaptive learning. A new dictionary learning method is proposed based on the adaptive parametric algorithm of blind source separation (ApBSS) and the extending K-SVD to exploit a dictionary by the observation signal itself. The proposed method consists of two steps: atoms extraction and dictionary representation. In the first step, the ApBSS is used to accurately extract sources from the observation signal, and a local dictionary is adaptively learned from sources. In the second step, the extending K-SVD is performed to compute the best decomposition of the signals, and representation patterns are updated to minimize the error according to given supports.
The remainder of the article is organized as follows. Section “Classical dictionary learning method” describes the principle and the limitation of conventional dictionary learning methods. In section “A new method of dictionary learning in blind source basis,” a new dictionary learning approach on blind source basis is proposed. Numerical simulations confirm its effectiveness. In section “Wheel condition monitoring,” an experiment of wheel condition monitoring is conducted using a JD-1 wheel/rail simulation facility. The results of data calculation and theoretical analysis of wheel–rail contact dynamic show that the proposed method can adaptively learn and accurately identify wheel defects and verify the performance of the proposed method. Section “Conclusion” summarizes the entire work.
Classical dictionary learning method
This section describes the procedure of dictionary learning from sparse approximation to signal recovery. Also, the limitation associated with conventional dictionary learning methods is illustrated by numerical simulations.
Using an overcomplete dictionary matrix
where
Exact determination of sparsest representations proves to be an NP-hard problem.18,19 Approximate solutions are considered instead, and some efficient pursuit algorithms have been proposed. Among the existing methods for decomposing a signal in terms of dictionary atoms, OMP
20
is one of the most widely used approaches. It forms a
Initialize the residual
Find the index
Augment the index set and the matrix of chosen atoms
Solve a least squares problem to obtain a new signal estimate
Calculate the new approximation of the data and the new residual
Increment t and return to Step 2 if
The estimate
A group of mutually independent source signals is shown in Figure 1(a). The first signal is a frequency-varying sinusoid. The second is a transient square. The third is an amplitude modulation signal. The fourth is a sinusoid. The fifth and the sixth are exponential signals with abrupt varying changes. The seventh is a Gaussian random signal.

(a) Source signals, (b) frequency spectrums of sources, (c) observation signals, and (d) recovery signals by OMP.
Figure 1(b) shows frequency spectrums of these source signals. For comparison, performing a simple linear instantaneous mixture of the sources produces observation signals as shown in Figure 1(c). The measurement matrix is obtained by the Fourier orthonormal basis decomposition of the source signals. Applying OMP to the observation signals obtains the recovery signals as shown in Figure 1(d). Comparing Figure 1(d) with Figure 1(c), the obvious differences between the observation signals and the recovery signals can be observed. This indicates that the classical dictionary learning algorithm fails to accurately recover the observation signals.
Many signals have sparse representations in some analytical basis (Fourier, wavelets, Gabor, etc.) and can be expressed using a linear combination of only a small set of basis vectors by classical dictionary learning algorithms. Conversely, many signals contain non-stationary components whose locations in time and frequency vary widely. The solutions are not sparse enough on prespecified dictionaries, so they cannot be characterized by specific waveforms or spectral contents in time–frequency domain. In this simulation, the observation signals are not sufficiently sparse because of the overlap distribution patterns of the source signals. Therefore, they cannot be represented accurately by the classical dictionary learning method.
A new method of dictionary learning in blind source basis
Atoms extraction by ApBSS
The first step of the proposed method is to blindly extract the sources from the observation signals. BSS is an attractive tool due to its excellent performance in separating source signals from their mixtures when no detailed knowledge of the sources and the mixing process is assumed. 21 BSS algorithms, such as independent component analysis, 22 non-negative matrix/tensor factorization, 23 latent variable analysis, 24 and sparse component analysis, 25 have been gradually applied in engineering fields.
The first step of the proposed method relies on the BSS method to accurately perform source extraction. In this article, the ApBSS that has been recently explored by the authors 22 is used to accomplish this task. ApBSS was found to be an effective approach for blind separation of many types of non-stationary signals. Its principle is reviewed briefly as follows.
Assuming statistical independence of the source signals, the instantaneous mixing model under consideration is
where
The adaptive average of the separated signal is defined as
where
The parametric function is defined as follows
where
Considering the gradient of the parametric function to zero, the solution
Applying ApBSS as well as two classical BSS methods, FastICA and Tensor Factorization, separate the non-stationary observation signals as shown in Figure 1(c). The separation results are shown in Figure 2(b)–(d), respectively. Comparison between the separation results and the source signals shown in Figure 2(a) illustrates the performance of ApBSS algorithm.

(a) Source signals, (b) separations by FastICA, (c) separations by tensor factorization, (d) separations by ApBSS, and (e) recovery signals by OMP with ApBSS atoms.
Using the separation results in Figure 2(d) as dictionary atoms, applying OMP recovery algorithm creates the recovery signals as shown in Figure 2(e). The recovery signals agree with the observation signals in Figure 1(c). The applicability of the ApBSS algorithm for atoms extraction is found to be superior.
Figure 3 plots the representation coefficients of the observation signals by the extracted atoms. The number of nonzero columns equals the number of observation signals. This kind of sparsest representations illustrates that ApBSS is a powerful algorithm for dictionary building. In addition, the extracted atoms by ApBSS are mutually independent and with zero means, and therefore they would be valuable as orthonormal bases to develop an optimal sparse representation for arbitrary independent signals.

Representation coefficients of the observation signals using adaptive atoms extracted by ApBSS.
Dictionary representation by extending K-SVD
The step of dictionary representation is performed by the extending K-SVD method.
26
The dictionary
where K is the maximum number of atoms allowed,
The representation patterns are updated to minimize the error according to the given supports
As the shift operators
where
Fixing the atoms
Procedure of the proposed method
The steps for the proposed dictionary learning method are the following:
Wheel condition monitoring
The procedure of the proposed method for railway wheel condition monitoring is illustrated in Figure 4.

Procedure of dictionary learning on blind source basis for wheel condition monitoring.
After the data acquisition from the front wheel and the rear wheel, the centrobaric spectrum is calculated as follows
where
Corresponding to different wear conditions, the wheel has different vibration signatures that largely affect the centrobaric spectrum. Therefore, FC is used as a real-time indicator to reflect the wheel condition. When the centrobaric spectrums
The experiment of wheel condition monitoring is conducted using a JD-1 wheel/rail simulation facility, as shown in Figure 5(a).

(a) JD-1 wheel/rail simulation facility, (b) wheel flat spot, and (c) wheel tread indentation.
The apparatus is composed of a small roller that serves as the wheel and a large roller that serves as the rail. Two defects are made on the wheel roller: a flat spot shown in Figure 5(b) and a tread indentation shown in Figure 5(c). They are machined 180° from each other. Motor speed imposed on the rollers can be controlled accurately. Two three-dimensional (3D) accelerometers are used to measure the vibration signals. The sampling frequency is 10 kHz.
The wheel roller rotates at a speed of 56 r/min (200 km/h). The longitudinal vibration signals

Vibration observation signals of the wheel roller with two defects.
By the proposed method, the sparse feature columns containing wheel defect information separated by ApBSS are used to build dictionary. The atoms adaptively learned are shown in Figure 7. Atoms a1, a2, and a3 indicate the vertical, longitudinal, and transverse impact responses of the flat spot, respectively. Atoms b1, b2, and b3 are the corresponding responses of the tread indentation impact.

Wheel defect atoms extracted by the proposed method.
Take the vertical wheel–rail impact response due to the wheel flat to analyze the theoretical correctness of the defect atoms extracted by the proposed method. For descriptions of wheel–rail contact dynamic procedure, given a wheel with a theoretical flat of length l and depth d, as shown in Figure 8.

Rolling wheel with a theoretical flat and wheel–rail contact force due to the flat.
Michaël and Steenbergen
27
qualitatively derived the vertical contact force
where
The flat response signal extracted by ApBSS is shown in the upper graph with blue line in Figure 9, which is excellently correspondent to the theoretical derived function as plotted in Figure 9 with red dotted line, especially when the elasticity of track and wheel is taken into account.

Flat response signal and its Choi–Williams time–frequency distribution.
The time–frequency representation of the wheel flat is analyzed by Choi–Williams time–frequency distribution, as shown in the bottom graph in Figure 9. Choi–Williams time–frequency distribution is a kind of Cohen’s class of distributions defined as follows 28
where
Experimental measurement and theoretical analysis are validated mutually, which indicates that the wheel–rail interaction due to wheel wear extracted by the proposed method accurately reflects the wheel wear feature. The proposed method can adaptively learn and accurately identify the wheel defects.
Conclusion
Online recognition of wheel wear is crucial for effective maintenance and is a challenging area of railway safety operation. The observation signal of wheel–rail interaction due to wheel wear is the mixture of the wear impact response signal and the common wheel–rail contact signal. Because slight wear signals are often concealed by severe wheel–rail contact signals in time domain and their spectrums overlap in frequency domain, the conventional time–frequency analysis and frequency filtering technique often cannot accurately extract the early wheel wear from the wheel–rail interaction observation signals.
A new dictionary learning approach for wheel condition monitoring is proposed. Flexible decompositions are particularly important for representing signal components whose locations in time and frequency vary widely. In order to overcome the deficiency of the existing dictionary learning methods, this article presents the new dictionary learning method based on ApBSS and extending K-SVD and applies it to wheel defect detection. Numerical simulations confirm the effectiveness of the developed method. An experiment of wheel condition monitoring is conducted using a JD-1 wheel/rail simulation facility. By adapting the atoms to the input, the proposed method yields sparse representations for arbitrary independent signals as well as capture detail defect information from the observation signals. Data calculating and theoretical analysis of wheel–rail contact dynamic show that the proposed method can adaptively learn and accurately identify the wheel defects and verify the performance of the proposed method.
Footnotes
Academic Editor: Dong Wang
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by grants from the National Natural Science Foundation of China (no. 51205323).
