Abstract
This paper presents the findings on modeling the demand for shared e-scooter services (SES); specifically, spatio-temporal variation of SES demand. A zero-inflated negative binomial (ZINB) model is developed using the count data of trip origins at the dissemination area level from Kelowna, Canada. The motivation for adopting the ZINB model is the presence of excess zeros in the count data. ZINB has two components: the zero-inflated component accounts for excess zeros, and the count component accounts for the over-dispersion characteristics of data resulting from excess zeros. In addition to the ZINB, several other count models including hurdle models are estimated. The goodness-of-fit measures suggest that the ZINB model outperforms other methods. The model results confirm the effects of temporal, weather, transportation infrastructure, land use, and neighborhood characteristics. For example, the count model results reveal that SES demand is more likely to be higher during summer, mid-day on weekends, afternoons of weekdays, and days without rainfall. Furthermore, higher e-scooter index, higher density of cycle tracks, heterogeneous land use, urban centers, lower elevation, and neighborhoods with higher density of hotels and younger population might induce higher demand. The zero component results of the model are consistent with the findings revealed by the count component. The model is validated using a hold-out sample, and the validation results confirm that the prediction performance of the model is reasonably satisfactory. The findings of this study provide important insights into when and where the demand is higher, which will assist in effective policy-making supporting e-scooter use.
Keywords
Micro-mobility has emerged as a viable and sustainable alternative to automobiles for traveling shorter distances (
This study investigates the demand for SES by utilizing e-scooter count data from Kelowna, Canada. One of the key features of this study is to examine the combined spatio-temporal variation of SES demand. In the case of spatial characteristics, built environment attributes such as transportation infrastructure, land use, and neighborhood features are extensively tested. Temporal characteristics include the time of day, week, season, and weather attributes. A zero-inflated negative binomial (ZINB) model is developed using the zonal count of trips originating at the dissemination area level (DA). DA refers to the smallest unit of geographic area in Canada for which all the census data are disseminated. The motivation for adopting the ZINB model is the presence of excess zeros in the count data. The excess zeros are classified as structural (i.e., DAs always with zero counts) and sampling zeros (i.e., DAs with zero counts only at specific time periods). The over-dispersion attribute of data is also tackled within the ZINB framework.
Literature Review
SES is a fairly recent addition to the shared micro-mobility services. It provides several advantages including reducing vehicular emissions and congestion, serving as first- and last-mile mobility option, and station-less service (i.e., flexibility to pick up and return e-scooters), among others (
In the domain of shared micro-mobility, several studies have attempted to analyze the demand. The majority of them have focused on bikeshare services, particularly focusing on the spatial factors (
Few studies, however, have investigated the demand for SES. One of the primary reasons is the new and emerging nature of this technology. Among the few studies, Jiao and Bai (
In the case of modeling methods, the majority of the previous studies analyzing demand for micro-mobility services have adopted regression modeling techniques using count data. For example, Shen et al. (
Contributions of the Current Study
This study contributes by investigating the demand for SES. One of the key features of this study is to explore the spatio-temporal variation of SES demand. Temporal attributes such as time of day, week, season, and weather characteristics are tested. In the case of spatial characteristics, built environment attributes including transportation infrastructure, such as e-scooter index; land use, such as land use index; and neighborhood attributes, such as hotel density, are examined. Methodologically, a ZINB model is developed using the count data at the zonal level of DA in Kelowna, Canada. ZINB model is developed to tackle the over-dispersion characteristics of the count data resulting from excess zeros. The model is formulated as a two-component mixture model. The zero-inflated component is assumed to have a binary distribution. This component of the model accounts for excess zeros. The other component is a count model assuming a NB distribution.
Data
Sources of Data
The primary data source for this study is the shared mobility manager database collected from the City of Kelowna through the Populus platform. This portal includes information on all shared services in Kelowna, Canada. Populus uses two categories of data specifications: (i) General Bikeshare Feed Specification (GBFS) for live map, and (ii) Mobility Data Specification (MDS) for all other maps and historical data (
Secondary data are extracted from various resources. Transportation infrastructure and land use attributes are obtained from Kelowna Open Data. Transportation infrastructure-related data involves bike infrastructure, road networks, and bus stops, among others. Land use attributes include the percentage of areas classed as residential, commercial, park, aquatic, and urban center, among others. The location of activity points is collected from Desktop Mapping Technologies Inc. (DMTI). This information includes the location of food stores, educational services, health services, shopping centers, hotels, and recreational centers, among others. Neighborhood features are extracted from Statistics Canada. This data includes population density, dwelling density, dwelling type, average age, average household size, and employment rate, among others. Weather information is collected from Environment Canada. This data involves the historical temperature, wind speed, relative humidity, and rainfall, among others.
Overview of SES Usage in Kelowna
SES started in Kelowna on July 13, 2019 and most of the operators were not active in the winter season. Therefore, this study utilizes data from July 13 to October 25, 2019. The majority of the SES companies (e.g., OGO, Bunny, and Zip) were fully functional during this time-frame. During this period, a total of 22,720 SES trips originated in Kelowna. The origin of the SES trips in any DA of Kelowna is dominated by the availability of e-scooters in that DA. E-scooters are removed from the streets at night, rebalanced, recharged, and returned within the city-authorized e-scooter network the next morning. The percentage share of total weekday and weekend trips per day was found to be 42% and 58%, respectively. Approximately 70% of trips were counted during the months of July and August, which is in the summer season in Kelowna. Figure 1 illustrates the percentage distribution of total SES demand over time of day and day of week in Kelowna. The analysis reveals that higher demand is observed during mid-day (i.e., around 34% of total trips) and afternoon (i.e., around 36% of total trips). Among these trips, a higher number of trips originated during the weekends. In contrast, the lowest demand was captured during the early morning period. Interestingly, demand was higher during the evening (i.e., 20% of total trips) compared with the morning (i.e., 4% of total trips) and night time (i.e., 5% of total trips).

Percentage distribution of total shared e-scooter service (SES) demand in Kelowna by day of week and time of day.
The spatial unit of analysis for this study is the zonal level of the DA. Figure 2 shows the observed SES demand in Kelowna. It is evident that higher demand is revealed near the Okanagan Lake area adjacent to the downtown area of Kelowna. Furthermore, a secondary peak in demand is observed near the urban city center and the other urban centers such as Midtown, Capri landmark, and South Pandossy. The analysis suggests that around 91% of the total trips originate from DAs in the downtown and urban city center. Figure 2 further suggests that the majority of DAs have zero trip count. The excess zeros in the count data can be classified as either structural or sampling zeros. Structural zeros occur from DAs that always have zero counts. Out of the 167 DAs in Kelowna, 114 DAs are found to have zero count at all times. One of the reasons for structural zeros is the unavailability of e-scooters and bike infrastructure facilities in those DAs. On the other hand, sampling zeros might occur because of sampling variability. Sampling zero occurs because some DAs have zero counts only at specific periods.

Observed shared e-scooter service (SES) demand in Kelowna.
Explanatory Variables
This study extensively tests the effects of spatial and temporal attributes on SES demand. Spatial factors include transportation infrastructure, land use, and neighborhood attributes. All the explanatory variables are generated at the DA level using ArcGIS. Among the transportation infrastructure characteristics, an e-scooter index (EI) is generated using the collected transportation infrastructure attributes, following similar methodology to the bike index (BI) (
Methodology
This study develops a ZINB model to investigate the demand for SES using the trip counts originating at the zonal level of DAs in Kelowna. One of the purposes of developing a ZINB model is that it accounts for excess zeros in count data. However, in the case of count data analysis, Poisson regression models are predominantly used. The Poisson probability distribution can be expressed as:
where
One of the limitations of the Poisson model is the restrictive assumption that conditional variance equals conditional mean. The count data of this study has a variance that is greater than the mean (i.e., over-dispersed). This limitation is relaxed in the NB modeling method (
where
It is assumed that
where
One of the key reasons for over-dispersion is the existence of excess zeros. Excess zeros can be present because of structural and sampling zeros. Structural zeros are the zero counts that will always be zero, whereas sampling zeros refer to the zero counts that can be greater than zero, but appear to be zero because of sampling variability. The NB model is often limited to address the over-dispersion resulting from excess number of zero counts (
The count data of this study include excess sampling and structural zeros. For example, sampling zeros occur when SES counts are not observed in a DA, however, counts are recorded for other time intervals. In contrast, DAs that never recorded SES counts are considered as structural zeros. Therefore, this study utilizes the ZINB modeling technique to analyze the SES count data. The ZINB model adopts a two-component mixture modeling approach. The first component is referred to as the zero-inflated component. Let a binary distribution with probability
The ZINB model considers
where
The parameters are estimated by maximizing the likelihood function. The log-likelihood function can be written as:
where
ZINB model is not nested within the NB model. Vuong statistic (VS) is utilized to differentiate between the NB and ZINB models. The VS can be determined as:
where
The goodness-of-fit of the ZINB model compared with other models is evaluated by using Akaike information criteria (AIC) and Bayesian information criteria (BIC) measures.
Discussion of Model Results
This study develops a ZINB model to estimate the SES demand in Kelowna. Among the 65,534 observations, 55,704 observations—which is approximately 85% of the data—were randomly selected for developing the model and the rest were used for model validation. A summary statistic of the variables retained in the final ZINB model is reported in Table 1.
Descriptive Summary of Independent Variables
Goodness-of-Fit Measures
For comparison purposes, in addition to the ZINB model, this study estimates the following models: Poisson, NB, hurdle Poisson (HP), hurdle negative binomial (HNB), and zero-inflated Poisson (ZIP) models. In the case of the goodness-of-fit measures, the ZINB model is compared with the NB model using the VS measure. A value greater than 1.96 for the VS favors the ZINB model. Results suggest that VS value for testing the ZINB versus NB model is 14.8 (Table 2). Therefore, the ZINB model outperforms the NB model. Further comparative results suggest that the ZINB model shows a lower BIC value compared with the other models. For example, BIC values for the ZINB, ZIP, HNB, HP, and Poisson are 10,756.1, 23,480.4, 11,162.7, 24,221.7, and 27,781.5, respectively. Similarly, the ZINB model shows a lower AIC value than the other models. Since the model with lower BIC and AIC measures fits the data best, the ZINB model is selected as the final model for further discussion. The estimation results from the ZINB model are presented in Table 3. It is found that the majority of the parameters show a statistical significance at least at the 1% significance level.
Comparison of Goodness-of-Fit Measures of the Demand Models
Parameter Estimation Results of the Zero-Inflated Negative Binomial (ZINB) Model
significance at 1% level
significance at 5% level
significance at 10% level.
Parameter Estimation Results of the ZINB Model
Results of Count Component of the ZINB Model
The count component of the ZINB model confirms the effect of temporal, weather, transportation infrastructure, land use, and neighborhood characteristics on SES demand (Table 3). Model results reveal that the dispersion parameter is statistically significant. In the case of temporal characteristics, variable representing the summer season shows a positive sign. This indicates that the demand for SES is more likely to increase during the summer. Similarly, results suggest that there is a higher probability of increased demand for SES during the mid-day of weekends and afternoon of weekdays. This indicates that individuals might use SES for recreational purposes during the mid-day of weekends and for commuting purposes during the afternoon peak hour of weekdays. In the case of weather characteristics, rainfall is found to have a negative impact on SES volume. This implies that demand might be lower when rainfall occurs. A similar negative relationship is observed for wind speed coupled with fall season, suggesting that riding e-scooters in windy weather conditions during the fall season might not be attractive.
Results further reveal that transportation infrastructure attributes are found to have substantial influence on SES demand. For example, the variable representing the density of cycle track shows a positive relationship. This implies that SES demand might remarkably increase in the presence of cycle tracks. Cycle tracks are dedicated and protected bike infrastructure available for e-scooters, which might attract users to ride SES. This is because individuals riding e-scooters on cycle tracks are less vulnerable to collisions and feel safer. The variable representing the EI also positively influences demand. This is anticipated because EI signifies the overall e-scooter friendliness of an area. As a result, higher EI is more likely to increase demand. Similarly, the variable representing the ratio of length of bike lane to traffic lane shows a positive sign.
Among the land use attributes, the LUI shows a positive sign, implying that users might be attracted to ride SES in areas of heterogeneous land use mix. This is expected, as heterogeneous land-use areas allow individuals to participate in diverse activities. The variable representing the urban centers also exhibits a positive relationship. Urban centers are the hub of government, business, and personal services. Therefore, SES demand might increase within the urban centers. Conversely, a negative relationship is observed in the case of the variable representing the mean elevation. This reveals that areas with higher mean elevation might observe lower SES demand. This may be explained by steeper grades being more difficult for e-scooters to negotiate. Riding in areas with higher elevation also puts extra pressure on the battery and motor, eventually reducing the performance and efficiency of e-scooters.
In the case of neighborhood features, density of hotels interacted with weekend shows a positive sign. This implies that there is a higher likelihood of increased demand for SES on weekends in neighborhoods with a higher density of hotels. This can be explained by tourist users staying at hotels who might consider using SES for recreational purposes during the weekends. Similarly, a positive relationship is revealed for the variable representing the percentage of employed people in the neighborhood. This indicates that neighborhoods with higher percentage of employed people are more likely to induce higher demand for SES. Interestingly, the variable representing the average age exhibits a negative relationship. This suggests that neighborhoods with younger population are more likely to show higher SES demand.
Results of the Zero-Inflation Component of the ZINB Model
The zero-inflation component of the ZINB model also confirms the effects of temporal, weather, transportation infrastructure, land use, and neighborhood characteristics on SES demand (Table 3). The results of the zero component are consistent with the findings from the count component of the model. For example, among the temporal and weather characteristics, mid-day on weekends is negatively associated with the probability of zero SES demand. This reveals that there is higher propensity of e-scooter volume during the mid-day on weekends. This is expected as individuals are more likely to use SES for recreational purposes during the weekend. Similarly, the variable representing evening interacted with weekday exhibits a negative sign. This implies that demand is less likely to be zero during the evening of weekday. Individuals might use SES for returning home from their workplace in the evening on weekdays. In the case of weather attributes, the variable representing relative humidity in summer is positively associated with the propensity of zero demand. This is expected, as higher humidity in the summer makes it feel hotter than the actual air temperature, which is not the ideal weather condition for riding e-scooters. A similar positive relationship is observed for wind speed during the fall, implying that higher wind speed in the fall season might increase the likelihood of zero demand.
In the case of transportation infrastructure and land use attributes, EI shows a negative relationship. This indicates that there is a higher probability of SES demand in the areas with a higher EI. However, the percentage of residential area exhibits a positive sign, implying that higher proportion of residential land use increases the probability of being in zero-demand state. Interestingly, number of bus stops coupled with a higher percentage of residential area influences the zero-demand state negatively. This indicates that SES might solve the first- and last-mile problem of urban trips by connecting the users with increased accessibility to public transit in residential areas. Furthermore, the variable representing the percentage of park and aquatic land use reveals a negative relationship. It suggests that areas with parks and lakes might be attractive for SES users.
Among the neighborhood characteristics, density of recreational activity points is negatively associated with the probability of zero demand. This reveals that neighborhoods with higher density of recreational activity points are more likely to induce demand. A similar positive relationship is observed for the variable representing the percentage of high-rise apartments in the neighborhood. Interestingly, the variable representing the average age in the neighborhood shows a positive sign. This suggests that an older population in the neighborhood is less likely to induce demand. Younger individuals might find SES more attractive.
Model Validation Results
The ZINB model results are validated using a hold-out sample to evaluate the predictive performance of the model. The hold-out sample includes 9,830 observations. The validation results suggest that the value of adjusted pseudo R-squared is 0.53 for the ZINB model which is higher than the HNB (i.e., 0.52) and NB (i.e., 0.47) models. Therefore, ZINB model reveals a better fit of the model for the validation sample. The predictive performance of the ZINB model is also compared with the HNB and NB models (Table 4). Validation results indicate a higher prediction precision for the ZINB model than that of other models. For example, the mean absolute deviation (MAD) value for the count component of the ZINB model is 0.303, HNB model is 0.319, and NB model is 0.337. The smaller the value of MAD, the higher the accuracy of prediction. Furthermore, the absolute percent error (APE) for zero counts is smaller for the ZINB model (2.64%) than for the HNB model (2.85%) and NB model (3.44%). Therefore, it can be concluded that the ZINB model offers satisfactory results for predicting the demand for SES for the validation sample. Figure 3 illustrates the predicted demand for SES aggregated in the DAs of Kelowna. The figure reveals some aspects identical to the SES demand observed in Kelowna. For instance, the density of predicted demand is very high near the Okanagan Lake adjacent to the downtown area. Moreover, a secondary peak of demand is predicted within the urban city center. In the case of observed demand, a similar trend is noticed (Figure 2). This further confirms the satisfactory model fit and prediction accuracy of the ZINB model.
Model Validation Results
where

Predicted shared e-scooter service (SES) demand in Kelowna.
Conclusions
This study investigates the demand for SES; specifically, it examines the spatio-temporal variation of SES demand. A ZINB model is developed using the counts of SES trip origins at the DA level for Kelowna, Canada. One of the reasons for adopting the ZINB model is that it addresses the existence of excess zero counts in the data. The excess zeros are categorized as structural (i.e., DAs always with zero counts) and sampling zeros (i.e., DAs with zero counts only at specific time periods). The ZINB model is a two-component model, where the zero-inflated component follows a binary distribution to account for excess zeros. The count model component adopts a NB distribution and accommodates the over-dispersion characteristics of data resulting from excess zeros. One of the key aspects of this research is to investigate the effects of temporal attributes such as time of day, week, season, and weather characteristics, as well as spatial attributes such as transportation infrastructure, land use, and neighborhood features.
In addition to the ZINB model, NB, ZIP, HNB, HP, and Poisson models are also estimated for comparison purposes. The goodness-of-fit measures suggest that the ZINB model outperforms other methods on AIC, BIC, and VS measures. The model results of the count component of the ZINB model suggest that demand is more likely to be higher during the mid-day of weekends, afternoons of weekdays, and summer season. In contrast, rainfall and higher wind speed during the fall season might induce a lower demand. Among the transportation infrastructure attributes, density of cycle track, EI, and a higher ratio of bike lane to traffic lane are positively associated with demand. In the case of land use attributes, demand is likely to be higher in areas with higher LUI and in urban centers. However, demand might be lower in areas with a higher mean elevation. In the case of neighborhood features, demand is more likely to be higher in areas with a higher employment rate, and areas with a higher density of hotels—specifically during the weekends. In contrast, neighborhoods with a larger share of older adults are less likely to induce higher demand.
The results of the zero-inflated component of the model are consistent with the count model. For example, results suggest that demand is less likely to be zero during the mid-day of weekends and evenings of weekdays. Furthermore, higher EI, higher transit accessibility in the residential areas, higher percentage of park and aquatic areas, higher density of recreational activity points, and higher percentage of high-rise apartments might induce demand. On the other hand, higher relative humidity during summer, higher wind speed during fall, higher percentage of residential land use, and neighborhoods with older age population might induce zero volume of SES. Furthermore, the ZINB model is validated using a 15% hold-out sample. The validation indicators such as the MAD and APE measures suggest that the ZINB model shows a higher prediction precision. Model results are further validated by comparing the illustration of the predicted demand for SES with the observed demand. The validation results suggest that the performance of the model is reasonably satisfactory.
This study has certain limitations. For example, the demand model is developed based on a small sample of data from Kelowna, British Columbia. The majority of the data was utilized to build the model; only 15% of the data was used as hold-out sample for model validation. Future research should focus on collecting more data for longer periods of time so that the data can be utilized as hold-out sample for improved model validation. Furthermore, this research does not test the spatial transferability of the developed model to cities other than Kelowna. One of the approaches could be to apply the e-scooter demand model for another city and assess the performance by estimating the predictive accuracy of the transferred model for the application context. In such case, goodness-of-fit measures of the predictive accuracy such as root mean square error (RMSE) (
Footnotes
Acknowledgements
The authors would like to thank the City of Kelowna for providing the data for analysis.
Author Contributions
The authors confirm contribution to the paper as follows: study conception and design: M. Orvin, M. Fatmi; data collection: J. Bachhal, M. Fatmi; analysis and interpretation of results: M. Orvin, J. Bachhal, M. Fatmi; draft manuscript preparation: M. Orvin, J. Bachhal, and M. Fatmi. All authors reviewed the results and approved the final version of the manuscript. The authors do not have any conflicts of interest to declare.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors would like to thank the Natural Sciences and Engineering Research Council (NSERC) - Discovery Grant for their financial support.
