Abstract
This study represents an advanced approach to road weather information system (RWIS) network planning. Here, a methodological framework is developed to determine optimal RWIS locations by integrating two analysis domains: space and time. Using a case study, the application of the proposed method is demonstrated using three critical RWIS variables: air temperature, road surface temperature, and dew point temperature. With these three variables, a series of geostatistical semivariogram analyses are performed to construct a single spatiotemporal model named joint semivariogram, which is able to preserve both spatial and temporal aspects. The constructed joint semivariogram is then used to find the optimal RWIS locations for a randomly generated study area using a popular heuristic algorithm—spatial simulated annealing. The proposed method enhances the previously developed RWIS location allocation model by considering both spatial and temporal components of multiple variables. The finding from this analysis reveals that optimal RWIS location strongly depends on the spatiotemporal autocorrelation structure of the variable of interest. Consequently, location solutions generated using the three variables are found to be different from each other. The variation among the RWIS location solutions is then further quantified by developing a spatial similarity index that is used to measure the degree of spatial similarities between different variables. Overall, the findings documented in this study will provide RWIS planners with a more complete and conclusive location allocation strategy and can act as a decision support tool for long-term RWIS network planning.
Adverse climatic conditions and an increasing number of weather-related road incidents persist as vital and challenging issues in cold regions. As an important part of modern transportation engineering, intelligent transportation systems (ITS) play an essential role in our everyday life by improving transportation safety and mobility through minimizing weather-related road crashes. Road weather information systems (RWIS)—a critical part of ITS infrastructure—contain advanced sensors that collect, process, and distribute road weather and condition information like air and road surface temperatures, dew point temperature, atmospheric pressure, windspeed, and many others (
Several studies were conducted in the past to establish proper guidelines for RWIS installations (
While the above discussed literature has contributed to advancing RWIS location models, the studies so far suffer from several limitations. First, optimal RWIS locations were generated by investigating the spatial characteristics of one single RWIS variable, road surface temperature (RST). Although RST is an important measurement, RWIS provide many other weather variables that need to be considered alongside. Measurements like AT (air temperature) and DPT (dew point temperature) are just as critical in forecasting road weather conditions (for example, generation of black ice). Second, previous studies dealt solely with spatial domain of a single RWIS variable, which does not account for inherent temporal correlation of weather and road surface conditions. And since the weather parameters vary over space and time, it is essential to investigate both the spatial and the temporal variability of the variables of interest.
Spatiotemporal semivariogram analysis is an advanced geostatistical method that is used to evaluate spatial and temporal dependency of parameters that tend to fluctuate over space and time (e.g., road weather variables). This method works by combining spatial and time-series analysis to preserve the interactive effect of time variation on spatial domain and vice versa, allowing for the visualization of the spatiotemporal variability in the variable of interest and the determination of autocorrelation range over space and time. For this reason, spatiotemporal analysis has been established as more accurate than spatial analysis alone (
Therefore, the main objective of this study is to develop a methodological framework to optimize RWIS location by considering both spatial and temporal variability of multiple weather variables. The problem was formulated on the basic premise that data from RWIS stations within a region should be collectively used to maximize their overall monitoring quality. By considering multiple variables, this study will evaluate the spatiotemporal autocorrelation structure of these variables and examine their effects in generating optimal location solutions and their spatial similarities.
Methodology
Overview of Research Procedures
The first phase of this study involves analyzing spatiotemporal correlation of RWIS measurements, followed by RWIS location optimization using semivariogram parameters. Three different variables of interest are used in this study; namely, Air Temperature (AT), Road Surface Temperature (RST) and Dew Point Temperature (DPT). The raw RWIS measurements are extracted from the RWIS database, and are then checked using the following steps: data completeness test, reasonable range test, and a neighborhood value comparison. Once the data has been checked, the RWIS data is detrended with respect to time using a generalized additive model (GAM), followed by statistical analysis using descriptive statistics and correlation analysis. In the second stage, one of the most comprehensive spatial sampling techniques, called geostatistical analysis, is employed in conjunction with geographically distributed data. This technique maximizes the probability of capturing the spatial and temporal variations of the RWIS variables and minimizes the potential bias associated with input data. More specifically, spatiotemporal analysis is performed by constructing empirical variograms from processed data, which optimizes parameter estimations for unsampled locations and captures the possible autocorrelation associated with the RWIS variables. Joint semivariogram models are then developed by combining spatial and temporal semivariograms. Finally, a model-based approach via kriging is utilized in the final stage to obtain unbiased estimates with the lowest variance (i.e., uncertainty) to determine the optimal RWIS locations. The study is concluded by generating three sets of optimal RWIS locations and performing a similarity analysis. Research procedures for this study are summarized in Figure 1.

An overview of the research procedure.
Data Quality Diagnostics
Downloaded RWIS data is processed to remove missing and erroneous data. All three types of data are cross checked with one another to search for outliers. After that, data completeness is checked by identifying the total amount of missing data for each sensor—any stations with more than 15% of missing data are not used in this study. Following this, reasonable ranges of each variable under investigation are checked based on historical data ranges for the associated region and month. Filtered data is then used for pattern analysis by plotting the day of the month versus average daily temperature for all selected sensors, all weather parameters, and all months. All selected sensors for a specific weather parameter are expected to show a similar pattern throughout the month. If any unusual pattern is noticed for a specific station, the associated data is further investigated. In total, six sets of data (two months for three weather parameters) are analyzed in this study. At the end of data processing, AT, RST, and DPT data is de-trended with respect to time using a GAM to incorporate shorter scale variation in the temporal domain. Here, GAM works as a generalized linear model with linear predictors. The GAM function is formulated as
Spatiotemporal Semivariogram Analysis
Spatiotemporal analysis is generally conducted for variables that vary over space and time. In this study, spatial and temporal continuity analysis for weather variables is performed using geostatistical spatiotemporal semivariogram modeling. The traditional spatial analysis conducted in our previous efforts is incorporated with temporal analysis to consider spatiotemporal effects.
Spatial semivariogram is a graph of semivariance versus lag separation distance between pairs of measured points. Semivariance can be defined as a measure of dissimilarity between two measurements as a function of separation distance. The semivariogram has three basic parameters: nugget, sill, and range. Nugget represents measurement or sampling error; it can be defined as dissimilarity with zero separation distance, which is theoretically zero. Sill is the semivariance value at which the semivariogram levels off. The difference between sill and nugget is called partial sill and this value is often encountered during semivariogram analysis. The distance at which the semivariogram reaches the sill value is defined as the spatial range of autocorrelation, where there is no correlation among the measurements beyond this spatial range (

A typical: (
Spatiotemporal semivariogram modeling, on the other hand, is conducted by integrating both spatial and temporal effects of regionalized random variables (e.g., road weather). Generally, a set of variables in a spatiotemporal field can be defined as
where
After constructing the empirical variogram, a mathematical model is used to smoothen the graph by resolving the irregular pattern. There are several covariance models used for spatiotemporal semivariogram modeling (
Here,
Spatial, temporal, and joint nugget are estimated separately in this model. A spatiotemporal anisotropy parameter (StAni) is used to create the joint semivariogram by combining the spatial and temporal semivariance. The number of space units equivalent to one time unit is defined as StAni. RMSE (root mean square error) is used in this study to measure the goodness-of-fit of the resultant model.
Location Allocation via Spatial Simulated Annealing
An innovative RWIS location modeling framework was developed in our previous efforts where the problem was formulated as an integer programming problem with the objective of minimizing the spatially averaged kriging variance (in other words, maximizing spatial coverage) across the road network (
In this study, a more refined location optimization model is proposed by integrating joint semivariogram parameters generated for different weather variables to represent their distinctive spatiotemporal characteristics. Similar to our previous studies, the objective function is again formulated to minimize the sum of mean ordinary kriging (OK) estimation variance. The equations of the objective function and its related computation process are shown below.
where
where
Based on the above three equations, the objective function of this work can be formulated as
subject to
The RWIS location modeling being tackled here requires mathematical and computational methods to find optimal solutions for an objective function, which is usually performed under some form of constraints. For a larger-sized optimization problem, a heuristic algorithm is an effective method for finding the solutions (

Workflow of spatial simulated annealing.
The optimization follows an iterative process where stations are added incrementally into the study; and locations are selected based on heuristic attempts to minimize the objective function. When adding new stations, the placement area is limited to a square region within the study area. This is done to reduce computational complexity and algorithm runtime. The number of RWIS stations to be located in this square region is arbitrarily limited to 10, which is equal to the existing number of RWIS stations in the square region. For the optimization process, two top criteria are implemented. If the number of iterations exceeds 100,000, the optimization process will stop. And if no improvements are made in the objective function after 200 iterations, the algorithm is set to automatically stop (
Case Study—Iowa, United States
Study Area and RWIS Network
The study area is in the state of Iowa, U.S.A. Iowa is generally a flatland area consisting of rolling plain lands and flat prairies. The altitude of this state is 146 m to 509 m (

Distribution of road weather information system (RWIS) stations for Iowa.
Data Description and Quality Diagnostics
RWIS data for Iowa was downloaded from Iowa State University website (http://mesonet.agron.iastate.edu/RWIS/) as an Excel file. Measurements from a typical RWIS station include, but are not limited to AT, RST, DPT, visibility, wind speed, and road surface conditions. RWIS measurements were collected every 15 to 20 min, totaling 1488 h of data to be used in this study. As discussed previously (Figure 1), these data were processed to remove “noise” via four steps: (a) data completeness test to identify missing data, (b) reasonable range test to find erroneous data, (c) comparison with neighboring observations, and (d) detrending processed data with respect to time using GAM.
Statistical analysis was then performed using descriptive statistics of the processed data and correlation analysis among weather parameters. Descriptive statistics of AT, RST and DPT are presented in Table 1. On closer inspection of Table 1, it was revealed that there was relatively less variation in monthly temperatures in the mid-winter month than in the shoulder month for AT and RST, whereas DPT showed the opposite trend. AT varied from −6°C to 33°C over the month of October 2016, while the RST and DPT varied from −1°C to 41°C and from −8°C to 24°C, respectively. For the month of January 2017, temperature varied from −27°C to 20°C. Figure 5 shows the maximum, minimum, average, and standard deviation of weather data for the study area.
Descriptive Statistics of AT, RST, and DPT for the Study Area

Plot of variation found in AT, RST, and DPT over the shoulder and winter months.
Correlation analysis was conducted among the weather variables to find how strongly two variables were correlated to each other using values between −1 and +1. A correlation coefficient of positive 1 indicates an ideal positive correlation, whereas a correlation coefficient of negative 1 indicates an ideal negative correlation. A correlation coefficient near zero indicates no correlation at all (
Correlation Coefficient for AT, RST, and DPT for Study Periods
According to Table 2, the weather variables were more correlated during the mid-winter month than the shoulder month. Correlation between AT and RST was higher than for any other variable pairs, having a correlation value of over 0.9 in both study months. In contrast, the lowest level of correlation was observed between RST and DPT, thereby further attesting to the need for taking into consideration the distinctive road weather characteristics later in the location optimization phase. Figure 6 presents a plot comparing the correlation coefficient among AT, RST, and DPT of Iowa, and clearly shows that the three variables considered have different degrees of similarities over different months.

Correlation coefficients comparison of multiple weather variables.
Spatiotemporal Semivariogram Modeling
As evidenced in the previous section, correlation coefficients varied from one variable to another, and over different months. Spatiotemporal semivariogram modeling was subsequently conducted to gain a deeper understanding of their spatial and temporal variability, and to use as input to the location optimization model. For this purpose, RWIS data for those two select months were further processed and aggregated using 1 h interval for the time domain. A space–time matrix was then formulated as an input for the spatiotemporal analysis. Separate analysis was conducted for each month and each weather parameter, that is, AT, RST, and DPT. According to previous studies, 30 or more sampling points are needed to construct a reliable semivariogram model (
Spatiotemporal Semivariogram Analysis Result
According to Table 3, higher spatial ranges were obtained for the mid-winter month than for the shoulder month. The spatial range is close to 20 km for all weather variables in January 2017, and a 10–15 km spatial range was obtained for October 2016. The temporal range was found to be 8.5–12 h for the shoulder month, and 14–21.5 h for the mid-winter month. Such findings make intuitive sense since road weather and surface conditions tend to change more abruptly during shoulder months than they do during mid-winter months when the variability of weather is relatively low (
Effect of Road Weather Variables on Optimal RWIS Locations
Optimal RWIS locations were determined by relying on spatiotemporal semivariogram analysis results that were obtained in the previous section. To quantitively appreciate the effect each road weather variable has on the resulting RWIS locations, the optimization was performed separately for each set of weather variables and month of data using the R statistical package—version 3.2.5 (

Plot of objective function with respect to number of iterations for three sets of road weather information system (RWIS) location optimization: (
As a result, the RWIS network optimization output under different weather variables is presented in Figure 8, a and b, for October 2016 and January 2017, respectively. A 10 km buffer was created around the stations arbitrarily to help better recognize the distribution of stations within the rectangle study area. It is evident that the RWIS location solution is substantially different depending on the variable of interest and over the month of analysis. These findings reveal that the spatiotemporal autocorrelations of the three weather variables have strong effect on generating optimal RWIS locations, and that there is a resurgent need to consider these effects collectively in the optimization framework so that locations selected are most representative of their uniquely different spatiotemporal characteristics.

Spatial distribution of optimized road weather information system (RWIS) locations with respect to three variables: (
Spatial Similarity Analysis
It is obvious from the location allocation output (Figure 8) that the optimized location solutions are visually different from one another. To quantitatively evaluate the closeness of the optimal RWIS station distributions, a spatial similarity index was developed. For this, a sensitivity analysis was performed to objectively measure the similarity of optimal locations using ArcGIS (

Similarity among optimal road weather information system (RWIS) station placements.
According to Figure 9, the percentage of intersecting area with respect to different buffer sizes follows an exponential function. It is clear from the figure that buffer size does not affect the results significantly. In most cases, location solutions generated using the mid-winter month (January 2017) data set have been found to be closer than those of the shoulder-month (October 2016) data set. The reason behind this outcome is that the daily fluctuation in weather data is greater during shoulder months than during mid-winter months, which generates relatively higher spatiotemporal autocorrelation of weather data for January than for October. For a specific buffer size, percentage of intersecting area between AT and RST solution sets was found to be the highest, followed by, in decreasing order, AT and DPT solution sets, RST and DPT solution sets, and AT, RST, and DPT solution sets. This phenomenon is quite similar to the correlation analysis results among the weather variables, where the correlation between AT and RST was the highest among all, followed by AT and DPT, and RST and DPT. The findings of this analysis indicate that spatial similarity of location solutions can be examined by creating buffer polygons regardless of the size implemented.
RWIS Location Allocation for the Entire Iowa State
In the previous section, it was confirmed using the hypothetical square region that the generated locations were dependent on weather variables. Based on this finding, the proposed location optimization method was expanded to cover the entire Iowa state. To achieve a comparable result to that for the existing RWIS locations, a constrained optimization was performed with respect to the road network using a shoulder-month data set of weather variables. Although the existing number of RWIS stations is 88, the total number of all new station has been set to 61 for the location optimization. The number 61 was chosen based on our previous study on optimal RWIS density guidelines (

Plot of objective function with respect to number of iterations for location optimization of Iowa.

Existing and optimized RWIS locations of Iowa generated with three variables for October 2016.
Conclusion and Recommendations
This study aimed at developing an advanced location optimization model by using spatiotemporal semivariogram parameters combining both spatial and temporal effects of three key RWIS variables—AT, RST, and DPT. In the proposed method, a joint spatiotemporal semivariogram model was generated using three RWIS measurements; and the parameters of the model were used for determining optimal locations for RWIS stations. The location allocation problem was solved using a popular mathematical programming approach—an SSA algorithm that has been proven effective in and has gained recognition for RWIS location optimization problems. The methodological framework developed here builds on our previous effort by integrating the temporal domain in the location allocation problem. Additionally, RWIS data for multiple weather variables was used to generate three sets of optimal location solutions. The similarity among the generated location solutions was also quantified in this study using a spatial similarity index. The key findings of this research are listed below:
An advanced RWIS location allocation model was developed based on the premise that monitoring capabilities can be increased by minimizing the spatially averaged kriging variance in both space and time. The framework developed represents the first in the existing literature that attempts to determine optimal regional RWIS sampling design by taking into account both spatial and temporal attributes of multiple road weather variables.
This study investigated and confirmed that the location solutions generated using AT, RST, and DPT were significantly different from one another. Their closeness was further quantified and objectively validated using a spatial similarity index.
RWIS measurements of AT, RST, and DPT were processed and analyzed, and their correlations quantified, revealing that variables were more correlated with each other during mid-winter months than during shoulder months. Most notably, similarity between AT and RST was found to be significantly high.
The effective coverage of RWIS measurements was determined by analyzing multiple critical weather variables. Comparatively, a higher spatial and temporal continuity range of autocorrelation was observed for mid-winter months than for shoulder months. The spatiotemporal continuity range (from the joint semivariogram model) was determined to be 17 km for the mid-winter month, and 7.5–10 km for the shoulder month.
Recommendations for further research are given below:
This study covered a flatland region as the study area. Therefore, more case studies including wider regions can be examined for better understanding of the effect of spatiotemporal structure on RWIS location optimization.
The study period of this research was limited to 2 months: October 2016 and January 2017. Lengthening the study period can potentially improve the level of confidence in the output.
Three different weather variables were used in this analysis: AT, RST, and DPT. Thus, other RWIS measured data, that is, subsurface temperature, road surface conditions, and so forth could be included in the analysis to strengthen the output.
Microclimatic factors could be considered by combining RWIS data with other road weather and surface data extracted from entities such as automated surface observing system (ASOS) and airport weather observation system (AWOS). The combined spatially rich data set allows the RWIS location optimization algorithm to consider both the local and regional weather characteristics in the regions of interest.
Finally, a framework for a multi-criteria location optimization model should be established incorporating the joint semivariograms generated from weather variables.
Footnotes
Acknowledgements
The authors would like to thank Iowa DOT for providing the data necessary to complete this research.
Author Contributions
The authors confirm contribution to the paper as follows: study conception and design: S. Biswas, T. J. Kwon; data collection: S. Biswas; analysis and interpretation of results: S. Biswas, T. J. Kwon; draft manuscript preparation: S. Biswas, T. J. Kwon. All authors reviewed the results and approved the manuscript.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research is funded by Natural Sciences and Engineering Research Council (NSERC).
