Sage Journals: Discover world-class research

Abstract

In the automotive application for Electronic Control Units (ECU), dynamic models with high-precision to simulate mechatronic actuators are required for various analysis and design tasks like Hardware-in-the-Loop (HiL) simulation. It is known that the modeling of systems with significant friction effects is not accessible. The piecewise-affine (PWA) model has recently attracted considerable interest due to its high approximation capability for modeling systems with friction. Because the model quality depends intensely on the test signal design, a model-free optimized excitation signal design for identifying PWA models is presented in this paper. The presented method is demonstrated with an automotive throttle, and a PWA model is identified with a four-step identification approach. The result shows that the optimization method can achieve a better model quality, which is sufficient for the HiL simulation.

Keywords

System identification PWA-model excitation signal design automotive throttles

Introduction

Motivation and literature review

Improved fuel efficiency and reduced emissions are steadily increasing in the development and optimization of modern powertrain. If the dynamic system models are available, model-based analysis and design methods can help achieve the powertrain’s improved performance. Hardware-in-the-Loop (HiL) simulation provides a virtual vehicle by modeling the dynamical behavior. Electronic Control Units (ECU) can be developed and tested invariant or extreme real-time driving conditions without real driving risk. Therefore, models with good prediction quality are steadily required under real-time test conditions.

Usually, the mathematical modeling of dynamic processes can be achieved based on physical principles. However, challenges like lack of pre-knowledge or complex parameter estimation methods and conditions should be faced when using such techniques.^1–4 This problem motivated the development of data-driven modeling methods, in which precise pre-knowledge is unnecessary, and parameterization is relatively simple. As a typical data-driven method, piecewise-affine (PWA) models have recently become attractive due to their high approximation capability for nonlinear systems. PWA-model is defined by mapping regression space into polyhedral convex regions and establishing affine models in each region. PWA-model can efficiently solve several challenging problems in system analysis, prediction and simulation modeling, and controller design.^5–8 Globally, the PWA model can approximate nonlinear systems, and locally, the mapping from the regression to the output space allows the transfer of linear theory directly to nonlinear systems. As an alternative, neural-fuzzy models were also researched in recent years. A typical application is the neural-fuzzy based Hammerstein-Wiener model, which is successfully implemented to approximate complex nonlinear industrial processes.^9,10

The identification quality needs to excite the target system in the possible operating range for the data-driven modeling with the PWA model. The excitation signal plays a crucial role in the data information content, which influences the quality of the model.^11–14 However, the selection of excitation signal is usually a necessary procedure to obtain system response with high quality so that the multi-model can be precisely estimated in the system identification. Because the measurement in the system identification is time-consuming and expensive, the effort for data acquisition should be kept as low as possible.¹⁵ This problem motivates the design of optimal excitation signals for fast identification with optimized experiment costs. Once an optimized data set contains enough target system information, accurate identification can be guaranteed. Often, a suitable input signal for identifying a nonlinear system is a signal that should try to cover the input space and relevant frequencies with little effort for the target application.^13,14 Therefore, the main objective of excitation signal design is to extract as much information as possible from the dynamic system using a “good” input signal, which exposes all relevant areas of the input space and minimizes the variance in the estimated model parameters.^15,16

Typical methods of excitation signal design can be categorized as model-based and model-free approaches.^16–18 In most of these methods, the main task of excitation signal design is to optimize the parameters of a pre-defined excitation signal according to different criteria. Model-based approaches are often called optimal experiment designs (OED), which utilize pre-defined model structures to optimize the variance of model output or model parameters. Generally, the Fisher information matrix (FIM) is used as the optimization criterion, which is achieved in practice by maximizing the track of inverse (A-optimality), determinant (D-optimality), or maximum eigenvalues (E-optimality) of FIM. On the contrary, the model-free design does not need to assume the model structure. The model-free methods design a uniform distribution of the excitation signal in the input domain. Typical methods are the Latin-hypercube design (LHD), block and factorial, and space-filling designs.

In the case of the first category, many methods are based on D-optimal experiment design. Wang et al.¹⁹ identified a nonlinear physical ship maneuvering model using a D-optimal experiment design. Nevertheless, this work is limited due to its high optimization effort. Hartmann et al.¹⁵ proposed an active learning method called HilomotDoE. A pseudo-random Monte Carlo approach was used to homogenously distribute the samples and minimize the model error of local model networks. However, this method is sensitive to the design parameters and the measurement noise. Suzuki and Yamakita²⁰ address OED for PWARX (Piecewise AutoRegressive eXogenous) models for hybrid systems. They push the samples toward the discriminating surfaces to optimize the local model region boundaries. A cost function punishes the distance from the surface, and constraints on the input are considered. As the design requires knowledge of region boundaries, alternating OED and identification are proposed. Both works in Deflorian and Zaglauer¹⁶ and Hametner et al.²¹ optimized test signals using D-optimal design for the identification of MLP (MultiLayer Perceptron) networks. The main drawback of both works is that this method demands prior knowledge of the target system. In Himmelsbach and Kroll¹⁸ and Kroll and Dürrbaum,²² a scalar measure of FIM was used to determine the optimum experimental design to identify TS (Takagi-Sugeno) fuzzy models. These works are limited due to the choice of FIM measure, which impacts the computational cost. Based on FIM, Neilsen et al.²³ dealt with the uncertainty in the quantities of interest for an efficient machine learning algorithm in an ocean environment. Nevertheless, more work is required to research the generalization performance of the method.

As for the model-free methods, LHD has been researched intensely in recent years. Several works focused on the global optimization of the distribution using methods like particle swarm optimization,²⁴ simulated annealing,²⁵ evolutionary algorithm,²⁶ geometrically ordered design,²⁷ or local search algorithms.^28–30 It is noted that LHD is time-consuming due to the searching process for such a design. As alternative model-free approaches, factorial methods were also studied: Farzaneh and Akbarzadeh Tootoonchi³⁰ designed optimal experiments using a three-level factorial method for TS systems and Pontes et al.³¹ trained an MLP network using data developed by a complete factorial design. However, factorial methods require rigorous planning at the design phase based on accurate pre-knowledge. Another class of model-free approaches are the space-filling designs. Skrjanc³² used a supervised clustering algorithm to identify a fuzzy model, in which a pseudo-Monte-Carlo sampling algorithm realized the space-filling design. Gringard and Kroll³³ designed test signals to cover the input space for identifying locally affine dynamic Takagi-Sugeno models. Gringard and Kroll³⁴ considered the output space-filling technique to identify Takagi-Sugeno models. However, the balance between the design choices and the computational burden should be considered.

Scope of this research

This contribution investigates the design of the optimized excitation signal to identify the PWA model. In the literature above, the optimization procedure to generate an optimal input for the problem uses either model-based or model-free methods, often criticized for their dependency on pre-knowledge and high optimization effort. This problem motivates the development of efficient excitation signal design methods.

This work’s main contribution is a model-free optimized excitation signal design method, which uses an iterative optimization method to reach a sufficient excitation of the whole operation range. In contrast to other methods, which are limited due to pre-knowledge or computational costs, the proposed optimized excitation signal design method can generally be applied to other data-driven applications without prior knowledge or a higher computational burden. Furthermore, because clustering-based identification methods are the mainstream for the PWA modeling, a clustering-based four-step identification approach is conducted for the identification of the PWA model, and the proposed method is applied to automotive throttles for the HiL-simulation.

The work is structured as follows: modeling for automotive throttles with the PWA model is discussed in section 2. Section 3 introduces the new design method. Experiment results with discussion from applying the methods to a typical throttle are recorded in section 4. Section 5 introduced the clustering-based identification method briefly. Conclusion and outlook are discussed in section 6.

Modeling and identification of automotive throttles with PWA model

Description of automotive throttles

A typical automotive throttle is shown in Figure 1, mainly composed of a DC servo motor, a gearbox, a return spring, a potentiometric angle sensor, and a throttle plate. The DC servo motor drives the throttle plate through the gearbox. The angle sensor provides the measurement of the throttle plate position. The input signal of the throttle is the PWM (Pulse Width Modulation) signal, and the DC-motor is then controlled by varying the duty cycle.

Figure 1.

A typical throttle and its technology scheme.³⁵

It is noticed that the throttle is a typical system with friction, in which the friction occurs in the gearbox and motor shaft bearings. Usually, in physical models, friction is modeled as a nonlinear function of the angular velocity.³⁵ Unlike physical models, a data-driven identification method of PWA models is discussed in the following section.

Description of PWA model structure

As a typical multi-model structure, the PWA model is used primarily for hybrid systems for simulation and model-based control. This paper will use a standard PWARX model to model automotive throttles. The model is described as follows:

y (x) = f (x (k)) = {\begin{matrix} θ_{1}^{T} x (k) if x (k) \in χ_{1} \\ ⋮ \\ θ_{c}^{T} x (k) if x (k) \in χ_{c} \end{matrix}

(1)

with the regressor:

\begin{array}{l} x (k) = [y (k - 1) \dots y (k - n_{a}) \\ u (k - 1) \dots u (k - n_{b}) 1]^{T} \end{array}

(2)

where $n_{a}$ and $n_{b}$ are model orders,⁶ which are defined as integers. The regression space is split into c polyhedral partitions $χ_{i}$ and each sub-model is valid within its partition. The parameter vector of the ith local model is comprised of the coefficients $a_{i, j}$ of the lagged outputs, $b_{i, j}$ of the lagged inputs, and the affine term $c_{i}$ . The transpose of the parameter vector is:

θ_{i}^{T} = [a_{i, 1} \dots a_{i, n_{a}} b_{i, 1} \dots b_{i, n_{b}} c_{i}]

(3)

With the suggestions in Ferrari-Trecate et al.,⁵ the clustering-based identification of the PWA system is conducted as follows in four steps:

Selection of feature vectors for clustering. Particular attention should be paid to identifying systems with friction using the clustering-based method. The original regressor in (2) consists of past inputs and outputs, which are unsuitable for capturing effects like friction. Therefore, the proper feature for the clustering will be selected in this step.

Partition of regressor vectors using clustering. The regressor vectors will be allocated to clusters using the standard c-means algorithm. Because the c-means algorithm does not automatically select the number of clustering, several validity measures, and the model quality are used to determine the number of clusters.

Estimation of region hyperplanes. Given the labeled regressor vectors, the region hyperplanes will be estimated using linear classification techniques like support vector machine (SVM).

Parameter estimation. The parameter vectors in (3) are then estimated recursively using a parallel evaluation, in which the predicted outputs are fed back as model inputs.

All local models of the PWA model in this work share the same structure (i.e. $n_{a}$ and $n_{b}$ are the same, and all regressors are active) and the selection of the model orders $n_{a}$ and $n_{b}$ is based on prior knowledge, which could be extended to arbitrary model orders through a further regression selection (not yet implemented).

The proposed optimized excitation signal design method will be introduced in the next section. As mentioned before, this method achieves a sufficient excitation of the whole operation range by using an iterative optimization method and generates a uniform distribution in the regressor space, yielding better partitioning for the clustering-based identification of models.

Optimization of the excitation signal

Selection of excitation signal

The selection of excitation signals for system identification is of importance in the case of nonlinear systems. Typically, PRMS (pseudo-random-multilevel-signal), multi-sine signal and random signal are often used to identify nonlinear systems. As mentioned before, an optimal excitation signal should cover the whole operation range of the input space and all relevant frequencies. For the nonlinear dynamic system like mechatronic actuators, an optimal experiment design using the multi-sine signal will be preferred for identification since^4,11–14,36:

Multi-sine signal retains as far as possible the advantage of the sine wave, which reduces the measurement time

The multi-sine signal can optimally excite the nonlinear system through optimization of phases and pre-defined frequencies

Multi-sine signal is a deterministic and periodic signal, which is less sensitive against disturbance and leaves no leakage-error

The signal bandwidth can be pre-defined to cover the exciting system bandwidth

The applied periodic multi-sine signal allows an efficient design and offers some advantages for postprocessing the measurement data. Therefore, the multi-sine signal is used as the excitation signal to identify the PWA model, which can be designed and optimized from measurements on an actuator test bench.

Parameterization of the multi-sine signal

As a deterministic, periodic signal, the multi-sine signal can be specified which frequencies to use (from the available frequency grid due to a period signal). Therefore, a multi-sine signal is used, which is formed by summing d harmonic oscillations:

u (t) = \sum_{k = 1}^{d} a_{k} \cos (ω_{k} \cdot t + φ_{k})

(4)

where $a_{k}$ , $ω_{k}$ , and $φ_{k}$ are the respective amplitude, frequency, and phase of each sine signal. Figure 2 shows the multi-sine signal with its system feedback, which has a finite duration (100 s), and its total energy is finite (364.29).

Figure 2.

Multi-sine signal and the feedback for identification.

The design of the multi-sine excitation signal composes mainly of:

the frequencies $ω_{k}$

the amplitudes of the individual signals $a_{k}$

the phases of the respective signals $φ_{k}$

With the Fourier transform, the measurement on the test bench can be performed precisely in the frequency domain, and the required bandwidth of the excitation signal can then be calibrated. Additionally, the transform permits also filtering of noise in the frequency domain.

To avoid unwanted nonlinear distortions, the frequencies are suggested by Atkinson and Donev¹² as:

f_{k} = Δ f \cdot k with Δ f = \frac{1}{N \cdot T_{S}}

(5)

where $Δ f$ is the frequency resolution, N is the number of data points, and T_S is the sampling time.

For the selection of the amplitudes, it should be noted that, on the one hand, the operation range of the actuator should be sufficiently excited. On the other hand, the restrictions of the test signal due to lower and upper mechanical stops should also be considered. Therefore, the amplitudes will be experimentally determined. It is noticed that all harmonic oscillations now have constant amplitude in this research, which should be optimized in future research.

As mentioned above, the measurement should be distributed uniformly over the achievable range. Therefore, the phases of the multi-sine excitations will be optimized according to the following procedure.

Optimal excitation signal design

Inspired by Witters and Swevers,³⁶ a design process is proposed in this section to guarantee a sufficient excitation of the whole operation range. The proposed optimized excitation signal design method yields a uniform distribution in the regressor space and requires no particular weighting on specific regions. Unlike Witters and Swevers,³⁶ the proposed design method with Schroeder phases as initial values leads to a better-homogenized distribution than signals with constant or random phases, improving optimization convergence. Furthermore, a convergence-oriented criterion is used, which requires no prior knowledge to evaluate the distribution. The method iteratively optimizes the phase of the excitation signal so that the signal can uniformly distribute in the operation space. In this work, an index Coverage is defined, which measures the homogenization of the signal distribution, which could reach a minimal value. As the initial signal, the Schroeder phases³⁷ will be used for the iterative optimization. For the evaluation of Coverage, all data points of the test signal (e.g. duty cycle) are divided into intervals, and Coverage is as follows:

\begin{matrix} Coverage = \frac{1}{N \cdot (1 - \frac{1}{n_{z}})} \cdot \sum_{p = 1}^{n_{Z}} {(n_{A} - n_{I})}^{2} \\ with n_{I} = \frac{N}{n_{Z}} \end{matrix}

(6)

where N is the number of the data points, $n_{Z}$ is the number of intervals, $n_{A}$ and $n_{I}$ are the current and ideal numbers of data points within each interval.

The basic idea of the optimization method is the iterative changing of the phases to linearize the inverse cumulative density function (CDF) until the excitation signal reaches a uniform distribution. Starting from the initial Schroeder phases, the phases of the multi-sine signal will then be iteratively altered to optimize Coverage through the following algorithm:

1. Calculate a permutation matrix T, so that

u_{1} (t) = T \cdot u (t)

(7)

where $u_{1} (t)$ is the ascending sorting of the initial excitation signal with the Schroeder phases

φ_{k} = - k \cdot (k - 1) \cdot \frac{π}{d}

(8)

2. Generate a new sequence $u_{2} (t)$ through

u_{2} (t) = (N - 2 \cdot t + 1) \sqrt{3} \frac{u_{rms}}{N - 1}

(9)

where $u_{rms}$ is the root mean square of $u (t)$ .

3. A new sequence $u_{3} (t)$ can then be obtained by multiplying $u_{2} (t)$ with the inverse of the permutation matrix T

u_{3} (t) = T^{- 1} u_{2} (t)

(10)

4. Calculate the phases $φ_{k}$ of $u_{3} (t)$ and the amplitudes $a_{k}$ of $u (t)$ using the Discrete Fourier Transformation (DFT). Then, the new test signal $u_{new} (t)$ is generated by calculating the inverse DFT of the sequence with the phases $φ_{k}$ of $u_{3} (t)$ and the amplitudes $a_{k}$ of $u (t)$ .

5. Evaluate the distribution of the excitation signal

6. Update $u (t) \leftarrow u_{new} (t)$

7. Repeat 1–6 until the defined convergence condition is fulfilled.

The convergence is based on the difference between two consecutive iteration steps:

Δ Coverage = Coverag e_{Iter} - Coverag e_{Iter - 1}

(11)

Then, the convergence condition is:

Δ Coverage < ε

(12)

where $ε$ is provided by the user. Alternatively, a user-defined maximal number of iterations can be used.

Figure 3 shows the clustering with the Schroeder phases (above) and the optimized phases (below). It can be seen that better Coverage of the regressor space has been achieved through the phase optimization method.

Figure 3.

Example of regressor before optimization (above) and after optimization (below).

As shown in Figure 4, the entire value range of the test signal (duty cycle in %) was divided into 21 intervals. For the initial signal with the Schroeder phases, 70% of data points are distributed between 0% and 25% duty cycle intervals. With the iterative homogenization method, the Coverage reaches its optimal value after 90 iterations and data points are uniformly distributed over the whole operation range.

Figure 4.

(a) Iterative minimization of the evaluation criterion coverage, (b) initial and optimized signals, (c) data distribution of the initial signal, and (d) data distribution of the optimized signal.

Identification procedure

In this section, the clustering-based identification method is briefly introduced.

Step 1: Selection of feature vector for clustering

Evaluating the “similarity” of data is essential for clustering-based identification procedures. Appropriate feature vectors should be selected to achieve a better clustering quality. On the one hand, classes should be well separated, and on the other hand, as few features as possible should be used for clustering due to costs and complexity. For mechatronic applications, friction’s nonlinear effects should depend on features. It is noted that the nonlinear effects cannot be easily captured based on the original feature space (see Figure 5).

Figure 5.

Original regressor space.

Thus, the feature vector should be firstly selected before identification. Literature and insights into the mechatronic throttles have shown that the input and the velocity are the two variables that most influence the throttle response.^4,35 Based on the first-order difference equation,⁴ the output difference $y (k - 1) - y (k - 2)$ approximates the velocity.³⁵ Together with the input $u (k - 1)$ , new feature vectors are displayed in Figure 6. In step 2, the new feature space will be clustered using the c-means algorithm.

Figure 6.

New feature space.

Step 2: Partition of feature vectors

The main difficulty of the PWA identification lies in the classification problem, in which each data point must be associated with one region and the corresponding sub-model.⁵

As discussed above, the next step of the method will cluster the regressor vectors into c disjoint subsets. As an efficient clustering method, the c-means algorithm with Euclidean distance allocates the feature vectors in Figure 7 into c clusters. Each point in the feature space is marked with the index of its cluster (see Figure 7) and will be used to estimate the model partition in the next step. It is noticed that in the following steps, the original regressor $[y (k - 1) y (k - 2) u (k - 1) 1]$ is used and identified with the labeled data.

Figure 7.

Example of the clustered feature vectors.

Nevertheless, the drawback of c-means is that the number of clusters c should be pre-defined before the clustering. Theoretically, the c-means clustering should identify well-separated and compact clusters. With increased value from interval [c_min, c_max], the number of clusters can be iteratively selected according to cluster validity measures, which provide information about the separation and compactness of clusters. Since none of the available measures can work perfectly alone, four cluster validity measures are adopted: Davies-Bouldin,³⁸ Calinski-Harabasz,³⁹ Dunn,⁴⁰ and Hartigan⁴¹ index.

The Davies-Bouldin (DB) index is calculated by:

DB = \frac{1}{c} \sum_{i = 1, i \neq j}^{c} \max (\frac{{\bar{d}}_{i} + {\bar{d}}_{j}}{v_{i} - v_{j}})

(13)

where ${\bar{d}}_{i}$ is the average distance between each point and the center of the ith cluster $v_{i}$ .

The Calinski-Harabasz (CH) index is defined as:

CH = \frac{S_{B}}{S_{W}} \cdot \frac{N - c}{N - 1}

(14)

where $S_{B}$ is the sum of squares between the clusters and $S_{W}$ is the sum of squares within the clusters.

The Dunn (Du) index is given by:

Du = \frac{d_{\min}}{d_{\max}}

(15)

where $d_{\min}$ denotes the smallest distance between two objects from different clusters, and $d_{\max}$ is the maximum distance of two objects from the same cluster.

The Hartigan (Ha) index is defined as:

Ha = \log \frac{S_{B}}{S_{W}}

(16)

Besides cluster validity measures, a model performance criterion can be combined with those four measures to select c.

In short, this step delivers partitioned regressor vectors to the next step, where the boundary hyperplanes between adjacent regions are estimated using the SVM algorithm.

Step 3: Estimation of region hyperplanes

The feature vectors are partitioned in the previous step with the c-means clustering method. With the clustered data, this step aims to estimate the polyhedral regions, which can be viewed as separating c sets of points using linear classifiers (hyperplanes). The separating hyperplanes between two adjacent regions $χ_{i}$ and $χ_{j}$ are defined as:

a_{ij}^{T} x = 0

(17)

where $a_{ij}^{T}$ is the parameter vector of the hyperplane separating two adjacent regions $χ_{i}$ and $χ_{j}$ . And the following separation rules should be fulfilled

x \in χ_{i} \Rightarrow a_{ij}^{T} x < 0

(18)

x \in χ_{j} \Rightarrow a_{ij}^{T} x > 0

(19)

Typically, this problem can be solved in the literature by employing the linear classification method like Support Vector Machine (SVM). The SVM algorithm is a supervised learning process which tries to find an optimal maximal margin separating hyperplane between two classes. There are generally two approaches for the multi-class classification: combining several binary SVMs and the Multicategory M-SVM. Compared to binary SVMs, M-SVM has the advantage that it can efficiently estimate all hyperplanes and does not leave “holes” in the regression space. Then, this step provides regions with estimated boundaries to the next step, in which parameter vectors in each region are estimated.

Step 4: Estimation of parameter vectors

Once the regressor vectors are classified and the boundaries of regions are estimated in the previous steps, the parameter vector $θ_{i}$ of each sub-model will be estimated.

According to Kroll,⁴ the parameter estimation can be achieved using “serial-parallel evaluation” or “parallel evaluation” (see Figure 8). The “serial-parallel evaluation” calculates the output from the measurement containing past input and output and can lead to biased non-optimal parameters because the preconditions for an unbiased estimation are rarely met in practice.⁴ In contrast, a parallel evaluation estimates parameters recursively in which the predicted outputs are fed back as model inputs. The “parallel identification” calculates the output from the past input and predicted “noise-free” output, corresponding to the simulative usage of a model without measured outputs. The “parallel identification” causes the estimation problem to no longer be “linear in the parameters,” requiring elaborate iterative nonlinear optimization but a higher prediction quality.

Figure 8.

Serial-parallel identification (above) and parallel identification (below).⁴

Therefore, a standard optimization method like the downhill simplex optimization approach can be used to optimize the following cost function:

\hat{θ} = \underset{θ}{argmin} \frac{1}{N} \sum_{k = 1}^{N} {[\hat{y} (k, θ) - y (k)]}^{2}

(20)

Here $\hat{y} (k)$ is the estimation for $y (k)$ . The initial values are calculated using the standard least squares method^1–4,42:

{\hat{θ}}_{i} = {(x_{i}^{T} x_{i})}^{- 1} x_{i}^{T} y

(21)

Experiment results

Model quality evaluation

The method mentioned above will be used for HiL simulation. Hence, the model should simulate the throttle position precisely for functional testing of ECU. Two performance criteria are used for the model quality evaluation: the normalized root mean squared error (NRMSE) and the maximum norm of the prediction error $(‖ Δ y_{\infty} ‖)$ :

NRMSE = \sqrt{\frac{\sum_{k = 1}^{N} {(y (k) - \hat{y} (k))}^{2}}{\sum_{k = 1}^{N} {(y (k) - \bar{y})}^{2}}}, \bar{y} = \frac{1}{N} \sum_{k = 1}^{N} y (k)

(22)

‖ Δ y_{\infty} ‖ = \max_{k} (| \hat{y} (k) - y (k) |)

(23)

For the current application, it is required that $‖ Δ y_{\infty} ‖ \leq 5 °$ and NRMSE should be as small as possible.³⁵

The validation of the models shall be carried out with “fresh” measurement data, which are not used in the identification (see Figure 9). An amplitude-modulated multi-step signal (APRTS) will be used, which was applied for validation in industrial applications and introduced as a benchmark set.⁴³ This signal was designed to test the modeling performance of the throttle at various operating regions, including the hard mechanical stops. It consists of a series of random pulses of different widths ranging from the input range of the duty cycle from 0% to 100%. Besides, another multi-sine signal with different phases and amplitudes in industrial applications will also be used for the validation (see Figure 10).

Figure 9.

APRTS and the feedback.

Figure 10.

Multi-sine and the feedback for validation.

Table 1 gives an overview of the identification and validation signal. In order to distinguish both multi-sine signals, the signal used for identification is named “multi-sine 1,” and the one for validation is named “multi-sine 2.”

Table 1.

Overview of identification and validation signal.

	Signal	Duration (s)	T_S (ms)
Identification	Multi-sine 1	100	10
Validation	APRTS	25
	Multi-sine 2	20

Test stand

A test stand was built for data collection and model evaluation (see Figure 11), and National Instruments products are used as the data acquisition equipment.

Figure 11.

Test stand with a throttle.

The data acquisition software was implemented using NI LabVIEW 2017 graphical programming to perform different experiment tasks, including signal generation and measurement recording. The data acquisition hardware is NI USB-6251. After each measurement, two types of data should be collected: the duty cycle as input and the angle position as output. The measurement data can be automatically recorded with a defined sampling time $T_{S} = 10 ms .$ The sampling time is selected based on a priori knowledge of the system. On the one hand, it should not be too small compared to the underlying dynamics; on the other hand, it should not be too large that it may skip the essential high-frequency characteristics of the system. Selecting too small a sampling time can cause serious problems, such as numerical-ill conditioning at the parameter estimation stage and high sensitivity to high-frequency noise and disturbances. Therefore, selecting a sampling time or interval is always a trade-off. Roughly, it was selected as 10% of the settling time of the open-loop step response of the system.⁴

Results

The presented excitation signal design algorithm and the identification method of the PWA model are applied to a mechatronic throttle in the case study.

Based on the discussion before, $n_{a} = 2$ and $n_{b} = 1$ is used based on prior knowledge, which could be extended to arbitrary model orders through a further regression selection (not yet implemented). In order to determine the number of sub-models, the c-means algorithm is ten times initialized for an interval (c_min = 2 and c_max = 10), which was derived from the prior knowledge of the target system. The mean values of the simulation results for four cluster validity measures are shown in Figure 12. The Davies-Bouldin and Dunn indices indicate that the optimal number should be at c = 8. The Hartigan index has a weak knee point at c = 8. The Calinski-Harabasz index is growing continuously with an increased number of sub-models. Besides, the model quality criterion NRMSE shows that c = 9 and 10 do not provide better results than c = 8. Therefore, c = 8 is selected in this application.

Figure 12.

Four cluster validity measures (above) and NRMSE (below) for different numbers of clusters/sub-models.

Finally, the PWA model for the mechatronic throttle is:

y (x) = f (x (k)) = {\begin{matrix} θ_{1}^{T} x (k) if x (k) \in χ_{1} \\ ⋮ \\ θ_{8}^{T} x (k) if x (k) \in χ_{8} \end{matrix}

(24)

with the regressor:

x (k) = [y (k - 1) y (k - 2) u (k - 1) 1]^{T}

(25)

and the parameter vectors:

[\begin{matrix} θ_{1}^{T} \\ θ_{2}^{T} \\ θ_{3}^{T} \\ θ_{4}^{T} \\ θ_{5}^{T} \\ θ_{6}^{T} \\ θ_{7}^{T} \\ θ_{8}^{T} \end{matrix}] = [\begin{matrix} 1.5674 & - 0.5690 & - 0.0527 & 0.4471 \\ 1.5602 & - 0.5630 & - 0.0474 & 1.0162 \\ 1.5559 & - 0.5588 & - 0.0503 & 1.0885 \\ 1.6108 & - 0.6132 & - 0.0469 & 0.5057 \\ 1.2034 & - 0.2039 & - 0.0061 & 0.1076 \\ 1.6308 & - 0.6337 & - 0.0413 & 0.4908 \\ 1.5507 & - 0.5506 & - 0.0539 & 0.3816 \\ 1.5843 & - 0.5566 & - 0.0513 & 1.0982 \end{matrix}]

(26)

The estimated parameter values $a_{ij}^{T}$ of the hyperplane separating two adjacent regions $χ_{i}$ and $χ_{j}$ in (17) are given in the Appendix.

The simulated and the measured angular position of the throttle is shown in Figure 13. The model and the real system are pretty similar, and the PWA model with the optimized excitation signal method can reach a better result than the model before the optimization. To summarize, the presented model captures the throttle’s dynamic and static effects, and the model quality is sufficient for testing motor control functions for HiL simulation.

Figure 13.

Measurement and model prediction before and after optimization: APRTS (top) and Multi-sine 2 (below).

The model prediction results with identification and validation data before and after excitation optimization are quantitatively compared in Table 2 for the throttle. After the optimization, the maximum absolute error with the identification data is 3.85°, and the validation data (APRTS) is 4.33°. The NRMSE for both models are 0.06°, which means that the PWA model provides the best model quality and the target of $Δ y_{\infty} \leq 5^{°}$ can be achieved.

Table 2.

Performance comparison before and after optimization design.

Criterion		Before optimization design	After optimization design
Multi-sine 1	NRMSE	0.05	0.04
	$‖ y_{\infty} ‖$ in °	4.63	3.85
APRTS	NRMSE	0.06	0.06
	$‖ y_{\infty} ‖$ in °	5.32	4.33
Multi-sine 2	NRMSE	0.05	0.04
	$‖ y_{\infty} ‖$ in °	4.19	3.81

It is noted that the results with these different data sets for identification and validation are not comparable in an equivalent condition since they were submitted to a different number of samples.⁴⁴ Besides, the proposed method is compared with other existing methods like the space-filling design,³³ and the performance comparison is shown in Table 3. It is noted that the proposed design method better model quality than identification based upon the space-filling design.

Table 3.

Performance comparison between the proposed method and the space-filling method.

Criterion		Proposed method	Space-filling method
APRTS	NRMSE	0.06	0.07
	$‖ y_{\infty} ‖$ in °	4.33	5.76
Multi-sine 2	NRMSE	0.04	0.05
	$‖ y_{\infty} ‖$ in °	3.81	4.92

Conclusion and outlook

This work presented a model-free optimized excitation signal design for identifying piecewise-affine models. The presented method was applied for automotive throttles. For the excitation, periodic multi-sine signals are used. The amplitude spectra are chosen to approximate the excitation the throttle experiences in a vehicle when driving in a normal situation. The phases of multi-sine signals are optimized to obtain a uniform distribution of the samples over the achievable working range.

As a typical data-driven model method, the PWA model with the clustering-based identification method was briefly reported, in which friction effects were addressed by a specific feature vector design. Finally, a mechatronic throttle was utilized in the case study to compare the dynamic response with the presented models. Results with the proposed model before and after the optimization showed that the presented method could efficiently provide high-quality models, which is sufficient for the functional tests of ECU in HiL simulation. The proposed methods can also be transferred to other systems with similar effects. Future research will improve the signal amplitude design and the model-based controller design.

Footnotes

Appendix

The estimated parameter values of hyperplanes in (17) are:

(A.1)

[\begin{matrix} a_{12}^{T} \\ a_{13}^{T} \\ a_{14}^{T} \\ a_{15}^{T} \\ a_{16}^{T} \\ a_{17}^{T} \\ a_{18}^{T} \end{matrix}] = [\begin{matrix} 0.5405 & - 0.5389 & - 0.1565 & 2.1854 \\ 0.5477 & - 0.5453 & - 0.0933 & 0.7719 \\ - 1.0798 & 1.0787 & 0.0501 & - 1.9663 \\ - 0.5972 & 0.5964 & 0.0290 & - 1.5409 \\ 0.2415 & - 0.2384 & - 0.0446 & - 0.2604 \\ 0.5382 & - 0.5366 & - 0.0541 & 0.0505 \\ - 1.8690 & 1.8688 & 0.0931 & - 2.7520 \end{matrix}]

(A.2)

[\begin{matrix} a_{21}^{T} \\ a_{23}^{T} \\ a_{24}^{T} \\ a_{25}^{T} \\ a_{26}^{T} \\ a_{27}^{T} \\ a_{28}^{T} \end{matrix}] = [\begin{matrix} - 0.5405 & 0.5389 & 0.1565 & 2.1850 \\ 1.6229 & - 1.6234 & - 0.0983 & 0.1078 \\ - 0.6790 & 0.6785 & 0.0478 & - 1.3547 \\ - 0.3761 & 0.3752 & 0.0353 & - 1.2441 \\ 0.5986 & - 0.5979 & - 0.0261 & - 0.7666 \\ 1.1185 & - 1.1182 & - 0.0400 & - 0.6207 \\ - 0.8791 & 0.8785 & 0.0817 & - 1.6340 \end{matrix}]

(A.3)

[\begin{matrix} a_{31}^{T} \\ a_{32}^{T} \\ a_{34}^{T} \\ a_{35}^{T} \\ a_{36}^{T} \\ a_{37}^{T} \\ a_{38}^{T} \end{matrix}] = [\begin{matrix} - 0.5477 & 0.5453 & 0.0933 & - 0.7719 \\ - 1.6229 & 1.6234 & 0.0983 & - 0.1078 \\ - 0.3243 & 0.3225 & 0.0494 & - 0.8507 \\ - 0.1510 & 0.1478 & 0.0425 & - 0.8626 \\ 0.6139 & - 0.6097 & - 0.0521 & - 1.3239 \\ 1.3960 & - 1.3919 & - 0.1000 & - 1.5021 \\ - 0.3872 & 0.3845 & 0.0684 & - 0.8143 \end{matrix}]

(A.4)

[\begin{matrix} a_{41}^{T} \\ a_{42}^{T} \\ a_{43}^{T} \\ a_{45}^{T} \\ a_{46}^{T} \\ a_{47}^{T} \\ a_{48}^{T} \end{matrix}] = [\begin{matrix} 1.0798 & - 1.0787 & - 0.0501 & 1.9663 \\ 0.6790 & - 0.6785 & - 0.0478 & 1.3547 \\ 0.3243 & - 0.3225 & 0.0494 & - 0.8507 \\ - 0.9754 & 0.9726 & 0.0633 & - 3.8304 \\ 0.1604 & - 0.1578 & - 0.0332 & 0.1011 \\ 0.2996 & - 0.2981 & - 0.0362 & 0.4300 \\ 1.4282 & - 1.4244 & - 0.1032 & 4.0416 \end{matrix}]

(A.5)

[\begin{matrix} a_{51}^{T} \\ a_{52}^{T} \\ a_{53}^{T} \\ a_{54}^{T} \\ a_{56}^{T} \\ a_{57}^{T} \\ a_{58}^{T} \end{matrix}] = [\begin{matrix} 0.5972 & - 0.5964 & - 0.0289 & 1.5409 \\ 0.3761 & - 0.3751 & - 0.0353 & 1.2441 \\ 0.1510 & - 0.1478 & - 0.0425 & 0.8626 \\ 0.9754 & - 0.9726 & - 0.0633 & 3.8304 \\ 0.0243 & - 0.0188 & - 0.0370 & 0.2250 \\ 0.1657 & - 0.1634 & - 0.0329 & 0.5656 \\ 0.6492 & - 0.6464 & - 0.0504 & 2.4242 \end{matrix}]

(A.6)

[\begin{matrix} a_{61}^{T} \\ a_{62}^{T} \\ a_{63}^{T} \\ a_{64}^{T} \\ a_{65}^{T} \\ a_{67}^{T} \\ a_{68}^{T} \end{matrix}] = [\begin{matrix} - 0.2415 & 0.2384 & 0.0446 & 0.2604 \\ - 0.5986 & 0.5979 & 0.0261 & 0.7666 \\ - 0.6139 & 0.6097 & 0.0521 & 1.3239 \\ - 0.1604 & 0.1578 & 0.0332 & - 0.1011 \\ - 0.0243 & 0.0188 & 0.0370 & - 0.2250 \\ - 0.9935 & 0.9897 & 0.0579 & 2.3849 \\ - 0.1426 & 0.1385 & 0.0429 & 0.0506 \end{matrix}]

(A.7)

[\begin{matrix} a_{71}^{T} \\ a_{72}^{T} \\ a_{73}^{T} \\ a_{74}^{T} \\ a_{75}^{T} \\ a_{76}^{T} \\ a_{78}^{T} \end{matrix}] = [\begin{matrix} - 0.1657 & 0.1634 & 0.0329 & - 0.5656 \\ - 0.2996 & 0.2981 & 0.0362 & - 0.4300 \\ - 1.3961 & 1.3919 & 0.1000 & 1.5021 \\ - 0.3531 & 0.3512 & 0.0470 & - 0.3003 \\ - 0.5382 & 0.5366 & 0.0541 & - 0.0503 \\ 0.9935 & - 0.9897 & - 0.0579 & - 2.3849 \\ - 1.1185 & 1.1182 & 0.0408 & - 0.6207 \end{matrix}]

(A.8)

[\begin{matrix} a_{81}^{T} \\ a_{82}^{T} \\ a_{83}^{T} \\ a_{84}^{T} \\ a_{85}^{T} \\ a_{86}^{T} \\ a_{87}^{T} \end{matrix}] = [\begin{matrix} - 0.3761 & 0.3752 & 0.0353 & - 1.2441 \\ - 0.6790 & 0.6785 & 0.0478 & - 1.3547 \\ 1.6229 & - 1.6234 & - 0.0983 & 0.1078 \\ - 0.8791 & 0.8785 & 0.0817 & - 1.6340 \\ - 0.5405 & 0.5389 & 0.1565 & - 2.1854 \\ 0.5986 & - 0.5979 & - 0.0261 & - 0.7666 \\ 1.1185 & - 1.1182 & - 0.0408 & - 0.6207 \end{matrix}]

Acknowledgements

The author would like to thank anonymous reviewers for carefully reading the paper.

Declaration of conflicting interests

The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research is supported by the Natural Science Foundation of Shanxi Province, China (Grant No. 20210302123188).

ORCID iD

Zhenxing Ren

References

Ljung

System identification. Theory for the user. 2nd ed. Upper Saddle River, NJ: Prentice-Hall, 1992.

Isermann

Münchhof

Identification of dynamic systems: an introduction with applications. Berlin: Springer-Verlag, 2011.

Nelles

Nonlinear system identification: from classical approaches to neural networks and fuzzy models. Berlin: Springer-Verlag, 2001.

Kroll

Computational intelligence: methods and technical applications. 2nd ed. London: De Gruyter Oldenbourg, 2016.

Ferrari-Trecate

Muselli

Liberati

, et al. A clustering technique for the identification of piecewise affine systems. Automatica 2003; 39(2): 205–217.

Wang

Song

Zhao

, et al. A PWA model identification method for nonlinear systems using hierarchical clustering based on the gap metric. Comput Chem Eng 2020; 138: 106838.

Liu

Qiu

, et al. Online identification of piecewise affine systems using integral concurrent learning. IEEE Trans Circuits Syst I Regul Pap 2021; 68(10): 4324–4336.

Zhang

Jing

Liu

, et al. A novel PWA lateral dynamics modeling method and switched T-S observer design for vehicle sideslip angle estimation. IEEE Trans Ind Electron 2022; 69(2): 1847–1857.

Chen

, et al. Modeling and parameter learning method for the Hammerstein–Wiener model with disturbance. Meas Control 2020; 53(5–6): 971–982.

10.

Yao

, et al. A novel learning algorithm of the neuro-fuzzy based Hammerstein–Wiener model corrupted by process noise. J Franklin Inst 2021; 358(3): 2115–2137.

11.

Fedorov

Theory of optimal experiments. New York, NY: Academic Press, 1972.

12.

Atkinson

Donev

Optimum experimental designs. Oxford: Clarendon Press, 1992.

13.

Godfrey

Perturbation signals for system identification. Hoboken, NJ: Prentice-Hall, 1993.

14.

Pintelon

Schoukens

System identification: a frequency-domain approach. New York, NY: IEEE Press, 2001.

15.

Hartmann

Ebert

Nelles

Model-based design of experiments based on local model networks for nonlinear processes with low noise levels. In: Proceedings of the 2011 American control conference, San Francisco, CA, USA, 29 June–1 July 2011, pp.5306–5311. New York: IEEE.

16.

Deflorian

Zaglauer

Design of experiments for nonlinear dynamic system identification. IFAC Proc Vol 2011; 44(1): 13179–13184.

17.

Karimshoushtari

Novara

Design of experiments for nonlinear system identification: a set membership approach. Automatica 2020; 119: 109036.

18.

Himmelsbach

Kroll

On optimal test signal design and parameter identification schemes for dynamic Takagi-Sugeno fuzzy models using the Fisher information matrix. Int J Fuzzy Syst 2022; 24: 1012–1024.

19.

Wang

Guedes Soares

Zou

Optimal design of excitation signal for identification of nonlinear ship manoeuvring model. Ocean Eng 2020; 196: 106778.

20.

Suzuki

Yamakita

Input design for hybrid system identification for accurate estimation of submodel regions. In: Proceedings of the 2011 American control conference, 29 June–1 July 2011, pp.1236–1241. New York: IEEE.

21.

Hametner

Stadlbauer

Deregnaucourt

, et al. Optimal experiment design based on local model networks and multilayer perceptron networks. Eng Appl Artif Intell 2013; 26: 251–261.

22.

Kroll

Dürrbaum

On joint optimal experiment design for identifying partition and local model parameters of Takagi-Sugeno models. IFAC Papers-Online 2015; 48: 1427–1432.

23.

Neilsen

Transtrum

Van Komen

, et al. Optimal experimental design for machine learning using the Fisher information matrix. J Acoust Soc Am 2018; 144(3): 1730.

24.

Chen

Hsieh

Hung

, et al. Optimizing Latin hypercube designs by particle swarm. Stat Comput 2013; 23(5): 663–676.

25.

Morris

Mitchell

TJ.

Exploratory designs for computational experiments. J Stat Plan Inference 1995; 43(3): 381–402.

26.

Jin

Chen

Sudjianto

An efficient algorithm for constructing optimal design of computer experiments. J Stat Plan Inference 2005; 134(1): 268–287.

27.

Viana

FAC

Venter

Balabanov

An algorithm for fast optimal Latin hypercube design of experiments. Int J Numer Methods Eng 2010; 82(2): 135–156.

28.

Grosso

Jamali

Locatelli

Finding maximin Latin hypercube designs by iterated local search heuristics. Eur J Oper Res 2009; 197(2): 541–547.

29.

Ebert

Fischer

Belz

, et al. Extended deterministic local search algorithm for maximin Latin hypercube designs. In: Proceedings of the IEEE symposium series on computational intelligence, Cape Town, South Africa, 07–10 December 2015, pp.375–382, New York: IEEE.

30.

Farzaneh

Akbarzadeh Tootoonchi

A novel data reduction method for Takagi–sugeno fuzzy system design based on statistical design of experiment. Appl Soft Comput 2009; 9: 1367–1376.

31.

Pontes

Amorim

Balestrassi

, et al. Design of experiments and focused grid search for neural network parameter optimization. Neurocomputing 2016; 186: 22–34.

32.

Skrjanc

Evolving fuzzy-model-based design of experiments with supervised hierarchical clustering. IEEE Trans Fuzzy Syst 2015; 23: 861–871.

33.

Gringard

Kroll

On the systematic analysis of the impact of the parametrization of standard test signals. In: Proceedings of the 26th workshop computational intelligence, 2016.

34.

Gringard

Kroll

On considering the output in space-filling test signal designs for the identification of dynamic Takagi-Sugeno models. IFAC-PapersOnLine 2020; 53(2): 1200–1205.

35.

Ren

Kroll

Sofsky

, et al. On methods for automated modeling of dynamic systems with friction and their application to electro-mechanical throttles. In: Proceedings of the 49th IEEE conference on decision and control, Atlanta, GA, USA, 15–17 December 2010. New York: IEEE.

36.

Witters

Swevers

Black-box model identification for a continuously variable, electro-hydraulic semi-active damper. Mech Syst Signal Process 2010; 24(1): 4–18.

37.

Schroeder

Synthesis of low-peak-factor signals and binary sequences with low autocorrelation (Corresp.). IEEE Trans Inf Theory 1970; 16(1): 85–89.

38.

Davies

Bouldin

DW.

A cluster separation measure. IEEE Trans Pattern Anal Mach Intell 1979; 1(2): 224–227.

39.

Calinski

Harabasz

A dendrite method for cluster analysis. Commun Stat Simul Comput 1974; 3(1): 1–27.

40.

Dunn

JC.

Well-separated clusters and optimal fuzzy partitions. J Cybern 1974; 4(1): 95–104.

41.

Hartigan

Clustering algorithms. New York, NY: Wiley, 1975.

42.

Soderstrom

Stoica

System identification. New York, NY: Prentice-Hall, 1989.

43.

Zaidi

Kroll

Electro-mechanical throttle as a benchmark problem for nonlinear system identification with friction. In: Proceedings of the 24th workshop computational intelligence, Dortmund, 27–28 November 2014.

44.

Wei

HL.

A robust model structure selection method for small sample size and multiple datasets problems. Inf Sci 2018; 451–452: 195–209.

An optimized excitation signal design for identification of PWA model and application to automotive throttles

Abstract

Keywords

Introduction

Motivation and literature review

Scope of this research

Modeling and identification of automotive throttles with PWA model

Description of automotive throttles

Description of PWA model structure

Optimization of the excitation signal

Selection of excitation signal

Parameterization of the multi-sine signal

Optimal excitation signal design

Identification procedure

Step 1: Selection of feature vector for clustering

Step 2: Partition of feature vectors

Step 3: Estimation of region hyperplanes

Step 4: Estimation of parameter vectors

Experiment results

Model quality evaluation

Test stand

Results

Conclusion and outlook

Footnotes

Appendix

Acknowledgements

Declaration of conflicting interests

Funding

ORCID iD

References