Abstract
Instantaneous fuel consumption estimation of fleet vehicles provides essential tools for fleet operation optimization and intelligent fleet management. This study aims to develop practical and accurate models to estimate instantaneous fuel consumption based on on-board diagnostics (OBD) data. Fuel consumption data is measured by a high-precision fuel flow meter. Two machine learning algorithms of Random Forest (RF) and Artificial Neural Networks (ANN) are trained with real-world urban and highway driving data of four fleet vehicles with different types and powertrain systems. In addition, the cold-start period of the vehicle operation is included to cover the fuel consumption penalty in the warm-up period. The validation results show that the RF method is more accurate than the ANN method, and both of the machine learning models have a better accuracy compared to the existing fuel consumption calculation methods based on the engine control unit (ECU) parameters.
Keywords
Introduction
Intelligent fleet management systems aim to improve efficiency and reduce costs of operating fleet vehicles, while complying the fleet operation with regulations and environmental responsibilities. 1 Fuel consumption is one of the major operating expenses in a fleet which has a direct impact on the emission of greenhouse gases and air pollutants from fleet vehicles. Therefore, monitoring the fuel consumption of fleet vehicles is one of the important tasks involved in fleet management.2,3
There exist methods for calculating “average” fuel consumption based on the total amount of fuel consumption and the distance traveled. 4 However, for fleet management systems to minimize fuel consumption, cumulative or average fuel consumption is often insufficient as the average data does not provide enough information to:
detect driving situations that result in high fuel consumption and CO2 tailpipe emissions5,6
determine powertrain efficiency under various operating circumstances
choose the optimal (i.e. fuel-efficient) routes 7
determine how fleet driver behavior affects fuel consumption8,9
develop fuel-specific emission factors in different driving conditions 10
As a result, “instantaneous” fuel consumption monitoring offers additional benefits for intelligent fleet management systems compared to average fuel consumption. 13 Given fleet vehicles are commonly equipped with global positioning system (GPS), instantaneous fuel consumption, along with vehicle location and road grade data, can be utilized to understand the total fuel consumption of individual vehicles and then used to develop intelligent fleet management strategies for route selection, vehicle type selection, and driver training.
Engine operational parameters and fuel consumption are correlated. Many engine performance parameters are accessible through on-board diagnostics (OBD) from the electronic control unit (ECU) of the vehicle. These parameters demonstrate real-time engine operation and have been used extensively in fleet management applications.14,15 Some parameters provided by OBD affect fuel consumption, so these parameters can be used to estimate a vehicle instantaneous fuel consumption.
Instantaneous fuel consumption can be obtained through the three following methods:
ECU torque-based estimation: For some vehicles, an estimation of instantaneous fuel consumption is provided through the OBD data from the engine ECU. This could be based on the estimated engine torque and the steady-state brake specific fuel consumption (BSFC) map. These estimations are inaccurate under engine transient operation which is common for on-road vehicles.16,17
ECU air-fuel-ratio-based estimation: Fuel consumption can be estimated by the intake air mass flow rate (MAF) and the air to fuel ratio commanded by ECU. 18 This method also has limitations and can be inaccurate. For example, the ECU-commanded air-fuel equivalence ratio is not necessarily equal to real air-fuel equivalence ratio. In addition, the MAF data is not available from OBD data in all vehicles. 19
Direct measurement: By installing a fuel flow rate sensor in the engine fuel feed line, instantaneous fuel consumption can be measured. This method requires a dedicated data collection system, an expensive sensor, and costly system maintenance. 20 Besides, a vehicle with a compression ignition (diesel) engine requires two fuel flow meters in the supply and return fuel lines.
An alternative method is to develop data-driven mathematical models based on available engine parameters through OBD data and measured fuel consumption for training the models. In this approach, OBD and measured fuel consumption data from representative fleet vehicles are collected to create models. Once experimentally validated models are developed, the instantaneous fuel consumption of the fleet vehicles is estimated from the OBD data of each vehicle. Machine learning methods are useful to develop data-driven models based on measured data. 21
The approach used in this paper to estimate the instantaneous fuel consumption of fleet vehicles is implemented as a case study on the transportation fleet of the University of Alberta. The fleet of the University of Alberta consists of 173 vehicles, all of which are tracked daily utilizing cellular GPS and telematics OBD readers as part of an intelligent fleet management system. This study aims to develop a reliable and accurate machine-learning-based method for monitoring the instantaneous fuel consumption of university fleet vehicles using OBD data. Four different vehicles are chosen as representatives of the fleet to collect data in two different real-driving scenarios – highway driving and urban driving. This study includes extensive data collection at 2 Hz sampling frequency for four fleet vehicles.
The University of Alberta is located in Edmonton, which is a city with a cold climate and long winters, therefore the cold start period of vehicle engines is long and the fuel consumption penalty during the cold start period is high. 22 In the urban driving scenario, which is shorter, the cold start period is more significant. In this study, the cold start interval is covered so that the fuel consumption penalty in the cold start is also included in the models.
Machine learning algorithms have recently been used to estimate vehicular fuel consumption, both at the vehicle level and at the fleet level. 23 One of the ways to categorize different studies in this field is to divide the models into average and instantaneous fuel consumption models.
Wu and Liu24,25 developed back-propagation and radial basis artificial neural networks (ANN) models to predict the average fuel consumption of vehicles during Environmental Protection Agency (EPA) Federal Test Procedure or FTP-75 driving cycle. The model inputs include vehicle make, vehicle weight, engine style, and transmission type. Radial basis ANN performed better compared to back-propagation ANN models. Zeng et al. 26 used a large-scale dataset to develop machine learning models to predict average fuel consumption based on multiple vehicle and trip variables (e.g. trip distance, average speed, number of intersections per km, engine displacement volume, and the coefficient of variance of speed). Among the algorithms utilized, support vector machines (SVM) model offered the best prediction compared to the multiple linear regression model and ANN. In another study, Gong et al. 27 compared different machine learning algorithms on a large data set of 1153 trips from 34 heavy-duty diesel trucks to predict average fuel consumption. Different categories of influencing factors were considered including vehicle-related factors (e.g. engine and transmission operating state), environment-related factors (e.g. weather condition), driving-related factors (e.g. driver behavior), and road-related factors (e.g. road grade). The random forest (RF) method showed the best prediction performance of all algorithms tested.
Katreddi and Thiruvengadam 28 estimated instantaneous fuel consumption based on very few key parameters including engine load, engine speed, and vehicle speed, using different machine learning algorithms. All data, including fuel combustion rate, was collected through OBD port of ECU. The study shows that ANN performed slightly better than other machine learning techniques such as linear regression and RF. Moradi and Miranda-Moreno 18 used support vector regression (SVR) and ANN machine learning methods to develop vehicle-specific instantaneous fuel consumption models based on GPS and inertial measurement unit (IMU) data, accessible through smartphones. To improve the estimation accuracy, they trained a separate model to estimate engine speed based on the available GPS and IMU data and used it as a feature for the final model of each vehicle. The fuel consumption data used to train the models is an estimated fuel consumption based on OBD data. Another study by Kanarachos et al. 29 used smartphone’s GPS parameters to estimate instantaneous fuel consumption by different ANN models. They also used the OBD-calculated fuel consumption to train the models. They concluded recurrent ANN models are more appropriate compared to long-short-term-memory combined with ANN models to estimate fuel consumption. In a study by Vilaça et al., 30 different machine learning models are compared and a boosted trees algorithm provided the most accurate estimate of fuel consumption based on GPS-derived speed, acceleration, and road grade. Perrotta et al. 31 used three algorithms of SVM, RF, and ANN to estimate instantaneous fuel consumption of a large fleet of trucks based on truck telematics, road geometry, and condition data. RF was the most accurate model for estimation. In a study by Wickramanayake and Bandara, 32 instantaneous and cumulative fuel consumption of a bus over a specific route was modeled using different machine learning models based on GPS data. Fuel consumption data was collected using a capacitive fuel sensor. Based on the analysis, it can be concluded that the RF technique produces a more accurate prediction compared to other utilized techniques, including gradient boosting and neural networks.
The majority18,28,29,33–36 of previous studies to model instantaneous fuel consumption of vehicles used ECU-estimated fuel consumption. However, ECU estimation is inaccurate particularly under transient conditions. 37 In addition, many studies did not consider cold start and non-stoichiometric operating conditions of the vehicles.13,18,24,26,29,33,35,36,38 Furthermore, some of the prior studies need GPS data for accurate instantaneous fuel consumption estimation.13,18,29,32,33,35 This study aims to address the existing gap in the literature by presenting a practical and accurate instantaneous fuel consumption estimation method that: (i) only requires OBD data, (ii) is based on actual fuel consumption measurement data, (iii) includes cold start and warm-up periods of vehicles, and (iv) is experimentally validated for different vehicle types (sedan, SUV, pickup truck), and different powertrain types (conventional internal combustion engine, hybrid electric, plug-in hybrid electric) for engines ranging from 1.5 to 6.2 l. This study applies an ultrasonic fuel flow meter which allows to measure ultra low-volume fuel flow operation with a sampling frequency as high as 5 Hz. Using fuel consumption and OBD data, accurate machine learning fuel consumption models are developed for both urban and highway driving conditions.
Methodology
Driving routes
Two different driving routes were selected for two different driving cycles of highway and urban driving. Both routes are designated to cover broad ranges of driving conditions. The highway route is longer and mostly consists of highways and out of town roads, while the urban route goes through urban streets and avenues. The origin and destination of both routes are the University service station located at the University of Alberta South Campus.
Highway driving route
The highway route, shown in Figure 1(a), starts in residential and urban areas but highway driving is the dominant part of it. The total distance of the route is 100 km with a duration of about 1 h and 30 min. The ratio of highway driving condition to the total distance covered is 75%.

Test routes used in this study in Edmonton and suburbs: (a) the 100 km highway driving route and (b) the 20 km urban driving route.
Urban driving route
The urban route, shown in Figure 1(b), mostly covers residential and urban areas and includes 31 intersections with traffic lights, 13 intersections with stop signs, and six pedestrian crossing lights. It passes through the University of Alberta North Campus, the campus with the highest population and the site of the majority of the university-related activities. The speed limit in the campus part of the driving route ranges from 10 to 40 km/h. The total distance is 20.5 km long and takes roughly 36 min to complete. Urban driving conditions account for 70% of the entire distance traveled.
Test vehicles
The four vehicles selected for this study are listed in Table 1. The vehicles include a hybrid electric sedan, a full-size pickup truck, a conventional compact sports utility vehicle (SUV), and a plug-in hybrid electric (PHEV) SUV. These vehicles are a good representation of the main vehicle types of the University of Alberta fleet. Table 1 shows the selected vehicles specifications. These vehicles have distinct powertrain systems, including 2.5-l hybrid electric, 2.5-l plug-in hybrid electric, 1.5 l turbocharged, and 6.2-l gasoline engines.
Test vehicles.
Fuel flow measurement
An ultrasonic fuel flow meter was installed on each vehicle to measure instantaneous fuel consumption, as shown in Figure 2. The fuel flow meter was Sentronics FlowSonic® Low-Flow Sensor with the technical specifications listed in Table 2. This fuel flow meter was chosen for its ability to measure low-volume fuel flow (as in the idling condition of a small fleet vehicle), its capability to measure various fuels (e.g. gasoline and diesel), its robustness against vibrations and pulsating flows, its high measurement accuracy, and its small size and light weight that make it simple to install on any engine. Using a sampling rate of 5 Hz, the fuel flow data were recorded and sent to the data acquisition system through a controller area network (CAN) connection.

Fuel flow meter installation in the fuel path of the studied vehicles: (a) Ford Escape PHEV and (b) Ford F-350.
Specifications of Sentronics FlowSonic LF ultrasonic fuel flow meter.
CAN data collection
The data collection procedure utilizing a CAN data logger is shown in Figure 3. OBD data was collected over the CAN bus and synchronized with Sentronics sensor fuel measurement data. CAN data, including OBD and fuel flow measurement data, were collected using the CSS Electronics CANedge2 CAN data logger. The CAN data logger recorded OBD data at a sampling rate of 2 Hz and synchronized it with fuel flow CAN data. Table 3 shows the OBD parameters that were collected by the CAN data logger (CANedge2) for each tested vehicle.

Schematics of the data collection process using CAN.
OBD parameters collected real time at 2 Hz sampling frequency (✓ collected, × not collected).
Selected features
Instantaneous fuel consumption models developed in this study are aimed for estimating fuel consumption for fleet vehicles in real-time based on OBD data. The OBD parameters that are used to train these models should be chosen such that they are readily available in all vehicles and directly affect the engine’s instantaneous fuel consumption. The selected parameters are engine load, engine speed, intake manifold absolute pressure (MAP), throttle position, air-fuel equivalence ratio, and engine coolant temperature and are listed in Table 4.
Parameters used to train machine learning models.
The fuel consumption during cold phase – when the engine coolant has not reached to fully warmed-up condition – is more considerable for shorter trips since it accounts for a larger fraction of the overall trip time and distance. Thus it is important to capture the cold start phase in the training data for urban driving fuel consumption models. Coolant temperature from OBD data is used in the training process to represent engine temperature.
Machine learning models
Four machine learning methods were initially implemented for estimating fuel consumption: random forest (RF), artificial neural networks (ANN), support vector machines (SVM), and k-nearest neighbors (KNN). The preliminary investigation revealed that ANN and RF provide the highest accuracy, so they are the focus of the subsequent instantaneous fuel consumption estimation.
Random forest is a machine learning method that combines the results of multiple decision trees to make more accurate predictions. Each decision tree is built using a random subset of the training data, as well as a random subset of the available features. The final result is determined by averaging the predictions of all decision trees. Figure 4 shows the algorithm of the random forest regression method visually. RF training is resistant to over-fitting since each decision tree learns from a slightly different perspective. 39 Additionally, it does not require normalized data and functions well with features that cover a diverse and broad range of feature values. 40 For the RF method, the design parameters are listed in Table 5 and the number of decision trees for each model is optimized during training and validation.

Schematic of the random forest regression algorithm for estimating vehicular instantaneous fuel consumption.
Design parameters of the RF model.
ANN, the other machine learning method used in this study, consists of interconnected nodes, called artificial neurons, organized in layers. These layers typically include an input layer, one or more hidden layers, and an output layer. The connections between neurons have associated weights that are adjusted during the training stage. Each neuron in the input layer represents a feature of the input data, while the output layer typically consists of a single neuron representing the predicted output value. In a hidden layer, each neuron takes inputs from the previous layer and performs a weighted sum of those inputs, followed by an activation function to introduce nonlinearity. The iterative training process continues until the network achieves satisfactory performance.41,42 Here, an ANN method with one hidden layer and Rectified linear unit (ReLU) as its activation function is used with the design parameters listed in Table 6. Figure 5 is a visual representation of the ANN algorithm. The optimal number of hidden layer neurons for each model is determined during training and validation.
Design parameters of the ANN model.

Schematic of the ANN algorithm for estimating vehicular instantaneous fuel consumption.
The hidden layer neurons in Figure 5 are calculated as:
where,
The output neuron is calculated as:
where, FC is fuel consumption,
As part of the data preparation procedure for the ANN model training, the data must be normalized because, in contrast to the RF model, the ANN model is sensitive to variations in the range of feature values. 43 The normalized data for each feature is formed as:
where, x is the actual data and
To develop the models, the data collected with CANedge2 was divided into two parts. The first part, consisting of 70% of the total data, was used in a five-fold cross-validation method. To do this, 70% of data was divided into five parts and four parts were used for the first repetition and the fifth part was used for validation of the model. The remaining 30% of the data was used to test the two trained machine learning models.
ECU-based calculation
For gasoline engines, there is an ECU-based method for approximating instantaneous fuel consumption that is frequently used in literature and also available in some commercial equipment. This approach calculates fuel flow based on MAF and the ECU-commanded air-fuel ratio. If MAF data is available on the vehicle data, fuel consumption can be estimated as:
where
Even though most modern vehicles have a sensor to measure MAF, not all of them show the MAF value through the OBD acquisition system. If MAF data is not available, it must be calculated using other ECU-reported engine parameters (e.g. intake manifold air temperature, MAP). MAF can be calculated using:
where N is engine speed in revolution per minute (rpm), Pi is MAP in kPa, Ti is intake air temperature in K, Vd is engine displacement volume in liter, R is the ambient air gas constant equal to 0.287 J/(g K), and
Since only the Ford Fusion Hybrid MAF data is available among the four selected vehicle types, the outcomes of this method against those from Ford Fusion Hybrid machine learning models are compared. For Ford Escape PHEV and Ford Escape S, MAF measured data is not available via OBD, but is estimated based on other ECU-reported engine parameters. The OBD data of Ford F-350 lack MAF and intake air temperature, so ECU-based fuel consumption calculation would be inaccurate and is not investigated in this study.
Results and discussion
Highway driving models
The training data set needs to include all potential operating points to train an accurate fuel consumption model. The two most significant parameters describing the engine operating points are engine load and engine speed. Figure 6 shows the collected data for the drive cycle shown in Figure 1(a) for two vehicles covering a broad engine load (0–100%) and engine speed (idle – 6000 rpm) operating conditions. Ford Fusion is a hybrid electric car, and in some operating conditions the electric battery drives the vehicle instead of the gasoline engine; hence, the engine operating range in Figure 6(a) is less than that in Figure 6(b), for Ford F-350.

Engine load and speed for the collected vehicle data in the highway driving cycle: (a) Ford Fusion Hybrid and (b) Ford F-350.
The time series of measured vehicle speed and instantaneous fuel consumption during the trip for both vehicles are shown in Figure 7. Speed profiles demonstrate that the driving cycles of the two vehicles were similar with slight differences due to the volume of traffic on the road at the time of the vehicle testing. Comparing Figure 7(b) with Figure 7(a) shows that heavier traffic during the first 1000 s of the test for Ford F-350 caused more frequent stop/starts, which led to a longer test time on a same path. Driving behavior and operational conditions have a substantial effect on fuel consumption, as shown by the considerable rise and peak in instantaneous fuel consumption that occurs as vehicles accelerate. Ford F-350 consumes more fuel than Ford Fusion Hybrid due to its larger engine and vehicle mass. This fuel consumption difference is clear when comparing Figure 7(b) to 7(a).

Vehicle speed and measured instantaneous fuel consumption for highway driving cycle as a function of time: (a) Ford Fusion Hybrid and (b) Ford F-350.
The ANN model cross-validation results are shown in Table 7. The results show that the ANN model can estimate well. The hidden layer size for each model is tuned by minimizing the root mean square error (ERMS) of the estimated fuel consumption. In addition, normalized mean absolute error (En) has been reported for each cross-validation performed on the data set. The reported ERMS and En are the average of validation fold values over five times of training in five-fold cross-validation.
The ANN fuel consumption model cross validation results for highway driving data.
The RF model cross-validation results, listed in Table 8, outperform the ANN model in terms of fuel consumption estimation accuracy for both vehicles. Although the ERMS of Ford F-350 compared to Ford Fusion Hybrid is higher, it is still very low compared to the fuel flow rate of both vehicle types.
The RF fuel consumption model cross validation results for the highway driving data.
Now the models are tested on unseen test data and the RF model has a lower estimation error (ERMS and En) of fuel consumption for both vehicles (Table 9). Plotting measured versus estimated instantaneous fuel consumption is done in Figure 8. A perfect model should line up on the diagonal line. The RF model results (Figure 8(a) and (b)) are closer to the diagonal line than the ANN model results in Figure 8(c) and (d), indicating that the estimated results are closer to the measured values.
Performance of the models on the unseen test data set for highway driving condition.

Performance of the RF and ANN models along with the ECU-based calculation for estimating instantaneous fuel consumption for the highway driving test data: (a) RF model – Ford Fusion Hybrid, (b) RF model – Ford F-350, (c) ANN model – Ford Fusion Hybrid, (d) ANN model – Ford F-350, and (e) ECU-based model – Ford Fusion Hybrid.
The results of the Ford Fusion Hybrid ECU-based estimating approach are shown in Figure 8(e) and are less accurate than the two machine learning models.
Urban driving models
Following the same analysis as the highway driving data, the collected data over the urban driving cycle demonstrated in Figure 1(b) for both Ford Escape PHEV and Ford Escape S is shown in Figure 9. The collected data covers a broad engine load (0–100%) and engine speed (idle – 4000 rpm) operating conditions. The urban driving cycle is much shorter than the highway driving cycle and is intended to cover only urban driving conditions, so a more limited engine operating range compared to the highway driving cycle is observed. In addition, since Ford Escape PHEV was operating in a hybrid electric mode, in some operating conditions the electric battery and electric motors drive the vehicle instead of the gasoline engine, so the engine operating range in Figure 9(a) is less than 9(b).

Engine load and speed for the collected vehicle data in the urban driving cycle: (a) Ford Escape PHEV and (b) Ford Escape S.
The time series of the two vehicles’ instantaneous fuel consumption and speed during the urban driving cycle is shown in Figure 10. Again real road traffic resulted in slight differences in speed profiles. Instantaneous fuel consumption clearly increases when vehicles accelerate so driving behavior and operating conditions have a significant influence on fuel consumption. Ford Escape S has a smaller engine (1.5 l) compared to the Ford Escape PHEV engine (2.5 l), but its 1.5 l engine is turbocharged, so the fuel consumption range between the two vehicles is comparable.

Vehicle speed and measured instantaneous fuel consumption for urban driving cycle as a function of time: (a) Ford Escape PHEV and (b) Ford Escape S.
As shown in Figure 11, more than a third of the urban trip times is in the cold start period. As shown in Figures 10(a) and 11(a), despite the fact that at the beginning of the trip, there are many stops and the vehicle speed is low and the electric motor must supply the power of the hybrid electric vehicle, but the combustion engine is idling for better powertrain performance. The same pattern happens for Ford Escape S in Figures 10(b) and 11(b), where the engine does not stop operating at the beginning of the trip, even though the vehicle is equipped with the stop/start technology. By doing so, the engine will warm up and the coolant temperature will rise.

Engine speed and coolant temperature for urban driving cycle as a function of time: (a) Ford Escape PHEV and (b) Ford Escape S.
Both vehicles have acceptable ANN validation results with En less than 22% for instantaneous fuel consumption, as listed in Table 10. By minimizing the estimated fuel consumption’s ERMS, the hidden layer size for each model is optimized.
The ANN fuel consumption model cross validation for urban driving data.
For both vehicles, the RF model error was lower than the ANN model with En less than 6.5%, as listed in Table 11.
The RF fuel consumption model cross validation for urban drivin
The performance of trained models of the urban driving cycle on the unseen test data is shown in Table 12. Similar to the highway driving models, the RF model provides a more accurate estimation of fuel consumption for both vehicles during the model testing stage. Both ERMS and En show that the RF model produces a precise estimate of fuel consumption (ERMS < 7 g/min, En < 6%). The points of the RF model plots (Figure 12(a) and (b)) are closer to the diagonal line, suggesting that the estimated values are in better agreement with the actual measured data.
Performance of the models on the test data set for urban driving condition.

Performance of the RF and ANN models along with the ECU-based calculation for estimating instantaneous fuel consumption for the urban driving test data: (a) RF model – Ford Escape PHEV, (b) RF model – Ford Escape S, (c) ANN model – Ford Escape PHEV, (d) ANN model – Ford Escape S, (e) ECU-based – Ford Escape PHEV, and (f) ECU-based – Ford Escape S.
The performance of RF and ANN can vary depending on the specific dataset, problem complexity, and parameter settings. In this study, RF consistently demonstrated superior estimation accuracy compared to ANN across various datasets. This can be attributed to RF’s effective handling of noisy data and outliers through random sampling during decision tree construction which reduced the impact of individual noisy samples or outliers. Additionally, RF’s hierarchical structure of decision trees allows it to capture complex nonlinear relationships in the data. It automatically learns and represents nonlinear interactions between input variables, contributing to improved estimation accuracy. 40 In contrast, ANN may require careful tuning of network architecture and activation functions to effectively model nonlinear relationships, particularly when using a simplified ANN structure with only one hidden layer.
As shown in Figure 12(e) and (f) and Table 12, the results of the ECU-based calculation of fuel consumption have the largest error compared to the two machine learning models developed in this study.
As shown in Figure 11, the cold start duration is substantial in the urban driving condition. The coolant temperature data is collected and included among the features to cover the fuel consumption penalty during the warm-up period. Table 12 also exhibits the performance of the models that are trained without the coolant temperature as a feature on the test data set. Comparing these results to the models with coolant temperature as a feature shows that including coolant temperature in the feature set increases the accuracy and reduces the estimation error.
Conclusions
Four different vehicles from the University of Alberta fleet were selected to develop machine learning models to estimate instantaneous fuel consumption based on OBD data. Two different routes, highway and urban driving, were used to collect OBD and fuel consumption data.
For each vehicle, two machine learning models of RF and one-layer ANN were developed to estimate fuel consumption. The OBD parameters used to estimate instantaneous fuel consumption by machine learning models are air/fuel equivalence ratio, throttle position, manifold absolute pressure, engine load, and engine speed. Due to Edmonton’s cold climate, the cold start period of vehicles is long and considerable, particularly in short trips. Therefore, the coolant temperature is added to the features to cover its effect on the fuel consumption penalty in the cold start period. All these OBD parameters are easily accessible through OBD port of most vehicles.
Overall, RF models provided the best estimation accuracy of instantaneous fuel consumption with mean errors of less than 6% in all cases. The one-layer ANN models had a higher error than the RF models. In all the cases, the developed machine learning models had a better estimation performance compared to the instantaneous fuel consumption calculated by measured or estimated MAF data from ECU, which is currently widely used to monitor instantaneous fuel consumption.
Footnotes
Appendix
Acknowledgements
The authors would like to thank Jim Laverty, the manager, and the staff of Transportation Services at the University of Alberta for their support for vehicle instrumentation and testing. The authors also thank Energy Management and Sustainable Operations (EMSO) of the University of Alberta, especially Michael Versteege and Shannon Leblanc for supporting this project.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the University of Alberta, Energy Management and Sustainable Operations (EMSO).
