Abstract
Taxi fare is related to the daily life of residents. Reasonable taxi fare not only meets the travel demand of residents but also improves the income of drivers and promotes employment. The spatial variation of taxi fares exists between different regions and cities. Previous studies on taxi fares have been conducted mostly in individual cities, and there has been no study on the spatial differentiation pattern of taxi fares on a national scale. Taking 336 cities across China as the research object, a multiple linear regression model of taxi fares was established by demonstrating the spatial variation pattern of taxi fares, the global differentiation index, spatial autocorrelation analysis, and kernel density analysis, etc. The significance of the study is to explore the law of spatial differentiation of taxi fares in China and to provide a stenographic record of taxi fare adjustment. The results show that: (1) the spatial variation of taxi fares across the country is significant, with the starting taxi fare range being between RMB 4 and RMB 14. (2) The global differentiation index of taxi fares is large, with two low-low clusters and three high-high clusters appearing spatially, and the results of the kernel density analysis surface a dispersion distribution centered on provincial capitals. (3) The divergence pattern of taxis nationwide is influenced by several factors. A multiple linear regression model is selected to establish a multiple linear regression model of urban disposable income per capita, regional GDP, urbanization rate, and urban population density, which shows that urban disposable income per capita has the greatest influence on taxi fares. The model shows that urban disposable income per capita has the greatest influence on taxi fares, and the fare of a 5 km taxi ride is 6.07. Taxi fares have a clear pattern of spatial differentiation in China and are most affected by urban disposable income per capita. Through this study, we can gain a deeper understanding of the variation in taxi fares across the country and provide data and theoretical support for the rationality of taxi fare adjustments.
Introduction
Taxi services make a significant contribution to mobility in medium and large cities in almost every country in the world (Gallo, 2018), and taxis have become an important mode of transport for people going to work, shopping, and other travel activities (C. Hu & Thill, 2019). Taxis have many beneficial and desirable features, such as personalized, door-to-door, and demand response services (Wong et al., 2020). As an indispensable mode of transport in large cities, taxis can fill the gap between other modes of travel in terms of space and time (Huang et al., 2021; M. Li et al., 2017). The number of taxis in China has grown steadily since the 21st century, with positive growth in all years except 2006 and 2018, when there was a small decrease, with an average annual growth rate of 3.26%. In particular, the number of taxis in China grew rapidly in 2021, with a growth rate of 24.99% (Figure 1). Reasonable taxi fares meet the commuting needs of residents while improving driver incomes and increasing employment (Phiboonbanakit & Horanont, 2016). Therefore, it is necessary to conduct a study on taxi fares. Are there differences in taxi fares in different cities, and what factors influence such differences?

Total number of taxis in China from 2000 to 2021.
Existing research on taxi fares has mainly focused on fare forecasting or improvement in a particular city. Studies on taxi fares include how to formulate a reasonable pricing strategy for taxis (Gallo, 2018; Gan et al., 2013; B. Li et al., 2022; T. Liu et al., 2019; H. Yang et al., 2010; Siyal et al., 2021), how to balance the supply and demand of urban taxis through taxi fares (Kamga et al., 2015; J. Wang et al., 2013; H. Yang et al., 2000), policy and regulatory issues on taxi fares (Hoover, 2022; Schaller, 1998; Wyman, 2017), and the relationship between taxi fares and services (Xu et al., 1999; H. Yang et al., 2002; T. Yang et al., 2014). For instance, W. Zhang et al. (2018) built a model to analyze how taxi drivers find customers when they get off the train outside the city center and to determine which internal and external factors influence their preferences (W. Zhang et al., 2018). Guo et al. (2021) found time-dependent pricing could avoid cross-subsidization among travelers in different periods based on a microeconomic model for the design of a time-dependent transit pricing scheme considering elastic and spatiotemporally distributed demand (Guo et al., 2021). A study has applied a mathematical model to the pricing of taxis in the city of Barcelona, and the optimum fare for the taxi sector is 0.68 euros/km and the optimum number of taxis is 19 taxis/km2 (Grau & Estrada, 2019). Silveira-Santos et al. (2023) combined supervised and unsupervised techniques to predict online taxi fares in Madrid in comparison with taxi fares (Silveira-Santos et al., 2023). Wong and Szeto (2022) proposed a surcharge on taxi customers taking taxis during peak hours and/or traveling to congested areas as a way to address the acute shortage of taxi supply during peak hours (Wong & Szeto, 2022). The study found that fare cost is the most important factor driving transport mode preference when there are multiple options available and taxi drivers should focus their business on meeting the needs of these customers (Ulahannan & Birrell, 2022).
Research on taxis and spatial patterns has focused on the micro level within cities. Relevant topics of the study include energy consumption and pollutant emissions from taxis (Luo et al., 2017), the impact of urban-rural spatial structure on mobility networks (Nie et al., 2023), spatial-temporal characteristics and patterns of urban travel (H. Wang et al., 2019), Structure of the travel trajectory network (Y. Yang et al., 2018), and so on. The methodology of the study includes exploratory spatial data analysis (Deng & Ji, 2011), a multiscale geographically weighted regression model (Lyu et al., 2023), a spatial network-based Markov decision process approach (Zhou et al., 2018), and so on. Related application directions include mining the spatio-temporal patterns of urban residents (Mao et al., 2016), modeling the spatial and temporal distribution of the population (Rahimi et al., 2021), projections of urban transportation emissions (J. Liu et al., 2019), and so on. Taking Wuhan as an example, Tang et al. (2017) calculated the performance of taxi drivers based on their GPS trajectories for a week, classified taxi drivers into top drivers and ordinary drivers, and analyzed the spatial and temporal distribution and operational patterns of the top drivers (Tang et al., 2017). Using New York as an example, W. Zhang et al. (2020) explored the impact of the presence of an app-based taxi service on total daily taxi and street taxi ridership, and the impact of dynamic pricing on app-based hourly taxis (W. Zhang et al., 2020). Using Seoul as an example, Choi et al. (2023) measured the taxi pooling effect and its spatial and temporal patterns and proposed a method for identifying high-demand taxi pick-up and drop-off areas that can support a limited shared system (Choi et al., 2023). Z.-T. Li et al. (2023) processed taxi trajectory data from Chengdu and New York City to empirically and systematically analyze these urban travel networks based on network hierarchies, based on complex network theory (Z.-T. Li et al., 2023). Using the density of the gross domestic product (GDP), density of population, rate of urbanization, and access to transportation as evaluation indicators, Jin et al. (2020) rebalanced spatial protection and development of Wuhan city (Jin et al., 2020). In general, most of the existing research on taxis has focused on a particular city and no studies have examined the spatial pattern of taxi fares at a macro level.
Research on the spatial differentiation of taxi fares and their drivers can be based on location theory (North, 1955), supply and demand theory (H. Yang et al., 2002), and pricing theory (X. Wang et al., 2016). Among them, location theory emphasizes that any geospatial phenomenon is related to the location in which it is situated (Chan, 2001). At the municipal scale, taxi fares are closely related to the economic development of the city in which they are located, and taxi fares are generally higher in cities with better economic development (Cetin & Deakin, 2019). Supply and demand theory focuses on the game between supply and demand (Thathsarani et al., 2023). Taking taxi prices as an example, the supply side is the taxi company and the demand side is the passengers, and taxis have the attributes of a bilateral market. Passengers and drivers have complementary demands, and an increase in the number of passengers or a decrease in the number of taxi drivers will make the supply smaller than the demand, thus affecting taxi fares, and vice versa (J. Wang et al., 2013; H. Yang et al., 2002). Pricing theory is to make decisions about the price of a product under certain objectives, and the commonly used pricing methods are cost-oriented pricing method, market-oriented pricing method, and competition-oriented pricing method. The cost-oriented pricing method prices taxis based on their costs (including vehicle depreciation, fuel costs, management fees, etc.) plus a certain amount of profit. Market-oriented pricing is a consumer demand-centered pricing method that prices taxis based on consumer demand, willingness to demand, and cost of living. Competition-oriented pricing method is a pricing method centered on the pricing of competitors, and the main competitors of taxis include internet taxis, buses, and subways (Gómez-Lobo et al., 2022; He et al., 2018). This paper takes the city as the basic research unit. It focuses on the law of divergence and influencing factors of taxi fares among different cities, so it is more suitable to use the location theory as the theoretical framework. Under this theory, the taxi fares in different cities are different, which mainly depends on the different locations in which the cities are located. The more specific manifestations of city location are total GDP, population size, city size, distance from the sea, capital city, provincial capitals, and so on.
The geographic environment is a whole, and the elements that make up the geographic environment influence and constrain each other, together shaping the overall characteristics of the geographic environment. Different geographic environments also have regional differences, and the law of such differences is the law of spatial differentiation of the geographic environment (Jiang, 2015). The study of the law of differentiation between physical and human geography is one of the important directions of geography, as the totality and variation of the geographic environment are its basic characteristics. Human geography studies on the law of differentiation mainly include: the law of spatial differentiation between urban and rural development (Chen et al., 2019; Feng et al., 2023; Hou et al., 2022; Y. Yang et al., 2021), the law of spatial differentiation between tourist attractions and leisure places (Kang et al., 2022; Liao et al., 2022; P. Wang et al., 2022; Xiao et al., 2023), the law of spatial differentiation between transportation development and accessibility (D. Wang et al., 2022; J. Wang et al., 2020; Zhao et al., 2018), and the law of spatial differentiation between economic development and price levels (Y. Liu & Fang, 2022; Wu et al., 2020; Xing & Ye, 2023; Yu et al., 2023; L. Zhang et al., 2022). The obvious pattern of divergence in price levels of different commodities in different parts of the country has become an important topic of research in human geography and regional economics (S. Wang et al., 2016). In general, existing studies on taxi fares focus on the micro level, and take a city as an example to construct a model to calculate a reasonable taxi fare. Fewer studies have analyzed the spatial variation of taxi fares on a large scale. Reasonable taxi fares play an important role in meeting residents’ travel needs, increasing drivers’ income, and promoting employment. Therefore, this study takes Chinese cities as an example and intends to address two main questions. First, what is the spatial pattern of taxi fares in China, which cities have higher and lower taxi fares, and have agglomerations formed? Second, what factors are associated with the formation of this spatial pattern, and can a model of taxi fares be derived from the influencing factors?
Materials and Methods
Study Area and Data Sources
The 333 prefecture-level cities (including prefecture-level cities, regions, autonomous prefectures, and leagues) and four municipalities directly under the Central Government, excluding Sansha City in Hainan Province, have a total of 336 basic research units, excluding Hong Kong, Macao, and Taiwan.
Data on taxi fares were mainly obtained from taxi fare adjustment scheme announcements published on the Internet by the development and reform commission, municipal government, and municipal transport commission of each prefecture-level city. The taxi fare policy varies from place to place and mainly consists of the starting price, base price per kilometer, emptying or idling fee, waiting or low-speed fee, nighttime fare increase, and special compensation fee. Waiting or low-speed charges are mainly caused by passengers requesting to wait, traffic jams, waiting for red lights, etc. They are generally free within 5 min, assuming ideal conditions where passengers do not request to wait there are no long traffic jams, and the total time spent waiting for red lights does not exceed 5 min, that is, there is no waiting charge. This study assumes that taxis all travel during the day, that is, there is no night-time fare increase. In some cities, a special compensation charge is added during the Chinese New Year period, usually at a rate of 5 to 10 RMB on top of the original fare, assuming it is not during the Chinese New Year period, that is, there is no special compensation charge. For the sake of consistency and ease of calculation, the taxi fares calculated in this article include only the starting price, base price per kilometer, and emptying charge.
Other data is obtained from provincial and municipal statistical yearbooks and bulletins.
Methods and Formulas
The research methodology is divided into three main parts, including spatial divergence phenomena, spatial divergence characteristics, and modeling. The first two of these characterize the spatial divergence patterns and characteristics of taxi fares in China and demonstrate and analyze the spatial divergence of prices, followed by the development of a taxi fare model to measure the factors influencing taxi fare divergence and the magnitude of its intensity.
The Phenomenon of Spatial Differentiation
Spatial differentiation refers to the visualization of price differentiation in different regions using maps based on the starting price, base price per kilometer, and return fare of taxis in each prefecture across the country. This includes the display of four basic fare differentiation phenomena: starting price, starting price per kilometer, base price per kilometer, and return empty kilometers. In addition to this, the final fare cost is calculated assuming a specific number of kilometers. Five kilometers is assumed to represent the distance traveled by residents nearby, 10 km represents the distance traveled by residents in medium proximity, 20 km represents the distance traveled by residents in long proximity, and 40 km represents the distance traveled by residents in long proximity generally to the airport.
The Law of Spatial Differentiation
The law of spatial differentiation refers to the characteristics of the phenomenon of spatial dispersion in taxi fares using specific methods. The methods used in this paper include a global differentiation index (GDI) taxi fare dispersion measures (Y. Wang et al., 2013), Global Moran’s I (Bivand et al., 2009), and kernel density analysis (Terrell & Scott, 1992). Taxi fares for prefecture-level cities can be either face data, using urban data to represent taxi fares for the whole city and its subordinate counties, or point data, representing only taxi fares in the urban center of the prefecture-level city.
The GDI of taxi fares provides a comprehensive indication of the degree of variation in the overall variation of taxi fares across prefectures nationwide. The main components are the coefficient of variation (CV), the Thiel index (T), the generalized entropy index (GE), and the Atkinson index (A) (Atkinson, 1970; Brown, 1998; Shorrocks, 1980; Wilson & Lockshin, 2003). The coefficient of variation measures the variability of a series of numbers independently of the unit of measurement used for these numbers (Abdi, 2010). Theil Index Decomposition is widely used in the testify and analysis of the total difference in one region and differences between regions (J. Sun et al., 2023). Generalized Entropy is a widely used index for studying income disparity, and the Atkinson index can be used to measure the degree of inequality in an economy (Biewen & Jenkins, 2006). Finally, the Gini coefficient (G) (Dorfman, 1979) is used to portray the degree of divergence. It is calculated as:
Using information entropy to determine the weights as Wj. The GDI of taxi fares is then:
Where: xi is the taxi fares of the ith prefecture-level city, μ is the average taxi fares, and n is the number of prefecture-level cities. ε is a parameter related to the sensitivity of the variance value and takes a range of values greater than 0. The larger the ε, the greater the weight given to the prefecture-level cities with relatively low taxi fares. ε often takes values of .5 and 2 (Atkinson, 1970), and the value of ε is taken as 2 for the calculation of this paper.
Spatial autocorrelation analysis of taxi fares can indicate spatial correlation and aggregation characteristics. The global Moran’s I (GMI) was used to measure the degree of spatial correlation of taxi fare indices at the prefecture level, calculated as:
Where: n is the number of samples; xi and xj are the taxi fares of spatial units i and j respectively; Wij is the spatial weight matrix of each prefecture, and the distance between prefectures is 1 within a set threshold distance, or 0 if greater than that distance. E(I) is the mathematical expectation; Var(I) is the number of variances. Generally when |Z| > 1.96, it means spatial autocorrelation exists at a 95% confidence level; |Z| > 2.58, it means significant at a 99% confidence level.
Kernel density analysis calculates the accumulation of the whole area using the input points, which allows a more accurate portrayal of the overall distribution characteristics of taxi fares. The calculation formula is:
where: K( ) is the kernel function; n is the number of sample points in the bandwidth range; (xxi) denotes the distance from the estimated point to sample xi; and he is the bandwidth and is a positive number related to n.
Taxi Fares Model
There are many factors affecting taxi fares, including transport costs, supply and demand, national policies, local development levels, etc. Among them, transport costs include vehicle acquisition costs, management service fees, vehicle maintenance fees, fuel costs, etc. Supply and demand mainly include the city population, the number of taxis in the city, the level of development of other public transport in the city, etc. National policies mainly include government-led pricing power, tax incentives, etc. The local development level mainly includes the city’s economic development level, per capita GDP, disposable income of residents, etc. Some of the above factors are difficult to quantify, and some of them vary from city to city. The GDP growth rate is an external factor influencing taxi travel. At the same time, the trend in the use of taxis as a means of transportation is closely related to the economic situation of individuals and the number of inhabitants (An et al., 2021; Kim et al., 2018). With the development of urbanization, the taxi market in large cities is rapidly expanding and developing (Qian & Ukkusuri, 2015). Therefore, a total of four factors, namely GDP of regional GDP, disposable income of urban residents, population density in urban areas, and urbanization level, were selected to establish a multiple linear regression model for taxi fares. The criterion for judging whether they are the main influencing factors is the significance level of each factor in this model. The expressions are:
Where: x1, x2…, xn are the factors affecting taxi fares; a1, a2,…, an are the regression coefficients of the factors affecting taxi fares; y is the taxi fare; and a0 is the intercept.
Results
The Phenomenon of Spatial Differentiation in Taxi Fares
Taxi fares are composed of a starting price, a base price, and a return empty price.
Taxi fares in China range from RMB 4 to 14, with short-distance trips being more sensitive to the starting price. Low starting prices are mainly distributed in the central and western regions, while high starting prices are mainly found in the eastern coastal regions, especially in the Beijing-Tianjin, Yangtze River Delta, and Pearl River Delta metropolitan areas. In addition, some provincial capitals in central and western provinces have higher starting prices, such as Lanzhou, Urumqi, Lhasa, Xi’an, and Guiyang, which are significantly higher than other cities in the province. The number of kilometers included in the starting price varies, and the starting price per kilometer reflects whether the taxi is starting at a reasonable price. The starting price per kilometer for taxis in China ranges from RMB 1.67 to RMB 6. The starting price per kilometer is lower in the northeast and northwest regions, and higher in provinces such as Sichuan, Guangxi, Guangdong, and Zhejiang. The base price per kilometer is the price per kilometer of taxi travel beyond the starting kilometer, with medium and long-distance trips being more sensitive to the starting price. The distribution of the base price shows two clear areas of concentrated high values, in the Yangtze River Delta and Pearl River Delta regions, in addition to higher base prices in Beijing and Tianjin, and in Yunnan, a major tourist province, where some cities also have higher base prices. The divergence in base prices is mainly north-south, with the southern region being significantly higher than the northern region. Long-distance taxi rides beyond a certain distance are generally subject to an additional return fare or emptying charge. Distance regulations vary from province to province and municipality to municipality, and charges are generally increased by 50% on the base price, with a minimum increase of 20% and a maximum increase of 100%, with some municipalities also increasing the price in segments depending on the distance. Generally speaking, the further the return distance, the better the discount for the consumer. There is no clear pattern to the national taxi regulations on return air distance, with the closest being 3 km, with return air charges applied beyond the starting kilometer, and the furthest up to 25 km, mainly in Chongqing and some cities in Guangdong (Figure 2).

The pattern of differential taxi fares.
The cost of a 5, 10, 20, and 40 km journey is calculated based on the starting price, base price, and return fare respectively. Five kilometers is the most frequent distance traveled by residents, and this distance is more related to the starting price, so the pattern of variation is roughly the same as the starting price. The average cost of a 5 km taxi journey in China ranges from RMB 7.4 to RMB 18.8, with an average cost of RMB 11.53, with high values in the Beijing-Tianjin region, the Yangtze River Delta and the Pearl River Delta, and a general trend of high in the southeast and low in the northwest, with some western provincial capitals spending more for 5 km. The cost of a 10, 20, and 40 km taxi ride is roughly the same, with the overall trend being higher in the south and lower in the north, with an average cost of RMB22.04, RMB47.54, and RMB99.80 respectively. The high-value areas are concentrated in Jiangsu, Zhejiang, and most of Guangdong, with some areas in Yunnan, Sichuan, and Hunan also higher, while in the north, apart from Beijing and Tianjin and individual cities in Shandong, Liaoning, and Jilin, most costs are lower than the national average (Figure 3).

Differences of taxi fares in different distances.
Spatial Differentiation of Taxi Fares
Global Differentiation Index of Taxi Fares
Using the four elements of the GDI, the degree of divergence was calculated for taxis traveling 5, 10, 20, and 40 km nationwide. The results are shown in Table 1. Overall, the greater the distance traveled, the greater the overall measure of variation index and the greater the Gini coefficient. This indicates that for taxi fares across the country, the greater the distance traveled, the greater the variation in prices between cities.
GDI and Gini Coefficient.
Spatial Autocorrelation Analysis
Taking 5 km as an example, the spatial correlation and clustering characteristics of taxi fares across the country were analyzed.
According to the calculated results, the Moran I result is 0.151521 with a z-score of 20.54, which means that the probability of randomly generating this clustering pattern is less than 1%, indicating that Chinese taxi fares in general show significant spatial clustering characteristics. Based on global spatial autocorrelation, local spatial autocorrelation can also be analyzed. Moran I has the problem of ignoring some of the potential spatial instability when assessing spatial autocorrelation. Local spatial autocorrelation analysis is required when further examining the local spatial clustering of high or low values of observations and which regional units contribute more to global autocorrelation. According to the results of local spatial autocorrelation, the H-H region of Chinese taxi fares is concentrated in the Beijing-Tianjin, Yangtze River Delta, and Pearl River Delta regions, while the L-L region is concentrated in the Northwest and Northeast regions, mainly including Gansu, Ningxia, Shaanxi, and parts of Inner Mongolia in the Northwest.
In addition, this paper calculates the local spatial correlation index Getis-OrdGi* and visualizes it spatially in ArcGIS software, using the natural fracture method to divide the national taxi fares into hot spot areas and cold spot areas. The hotspot areas of taxi fares in China are mainly located in the southeastern coastal areas, while the coldspot areas are mainly located in the northeast, north China, and northwest except Xinjiang, and the distribution of coldspot areas has an obvious divergence pattern from southeast to northwest (Figure 4).

Local spatial autocorrelation aggregation and Getis-OrdGi*.
Kernel Density
Kernel density analysis was carried out of 5 and 40 km, and the results showed that: Firstly, the high kernel density areas are found in the Yangtze River Delta and the Pearl River Delta, but not in the Beijing-Tianjin region, indicating that taxi fares are lower around the Beijing-Tianjin region, while taxi fares are higher in the Yangtze River Delta and the Pearl River Delta as a whole. Secondly, the distribution of the kernel density of taxi fares shows a decreasing trend from the southeast coast to the northwest inland. Finally, the distribution of kernel density tends to be spread out with the provincial capitals as the high-value centers (Figure 5).

Kernel density of 5 and 40 km.
Multiple Linear Regression Model for Taxi Fares
Taxi fares are determined by a combination of factors, with the standard of living of the local population being the most important factor in determining the price of a taxi in a given area, with urban disposable income per capita being the most representative of the standard of living. In addition to this, regional GDP, urbanization levels, and urban population density are also included as factors affecting taxi fares. Accordingly, a scatter diagram (Figure 6) was generated for the relationship between taxi fares and urban disposable income per capita, regional GDP, urbanization level, and urban population density. The graph shows that the above four factors are all positively correlated with taxi fares and that the individual factors are more representative of the differences in taxi fares, thus verifying the reasonableness of the selection of these four influencing factors. Among them, the R2 of urban disposable income per capita is the largest, which proves that this factor has the greatest influence on taxi fares.

Scatter diagram of taxi fares and influencing factors.
The different factors interact with each other and jointly influence taxi fares. Based on the above four factors, a multivariate linear regression model was established to determine the degree of influence of different factors on taxi fares and to provide model support for the adjustment of taxi fares in different regions under different factors.
The model regression yielded a goodness of fit R of .73 and an adjusted R2 of .528. The F value was 96.354, with a significance level of .000, at a confidence level of α = .01, F is much greater than the corresponding F distribution value, indicating that the regression model is highly significant. It is not difficult to find from the table that the collinearity diagnosis results (VIF values) of each explanatory variable are less than 5, indicating that there is no collinearity relationship among the variables. The sig. value of each variable was less than .05, indicating that each variable had a significant influence on the explanatory model (X. Zhang et al., 2023) Also, the absolute values of the explanatory model T-values were greater than 1.96, further indicating that each explanatory variable had a significant impact on the explanatory model. Finally, the sig. values of the explanatory models were all greater than .05, indicating that the explanatory models were statistically significant (Table 2).
Multiple Linear Regression Analysis of Taxi Fares.
A robustness test is necessary. (1) Excluding some city samples. The cities in this paper’s sample include both ordinary prefecture-level cities, as well as municipalities and provincial capitals. Since municipalities and provincial capitals have significantly higher population mobility than ordinary prefecture-level cities, the resulting high demand for rides may affect taxi fares. In this paper, all the samples of municipalities and provincial capitals are deleted, and only the samples of ordinary prefecture-level cities are retained for regression. The results show that the significance levels of the factors remain high, and thus the results are robust. (2) Variable substitution. The dependent variable in this paper is the taxi fare, which was chosen to be the cost of traveling 5 km when the regression was conducted. To further test the robustness of the results, the dependent variable is replaced with the cost required for a taxi to travel 10, 20, and 40 km. After re-regression based on the dependent variable, it is found that the significance of the explanatory variables remains unchanged, proving the robustness of the model. (3) Test method replacement. This paper defines a dummy variable for whether taxi fares are higher than average and uses it as an explanatory variable in a Logit regression, and the main conclusions remain the same.
Endogeneity issue and instrumental variable. There may be an endogeneity problem in this paper. The main reason is that there is no control for any alternative mode of transportation that may lead to missing variables. Taxi fare, as income of services, is contributing to local GDP and resident income. There is a potential reverse causality between the explanatory variables and the explained variables. To mitigate estimation bias due to endogeneity, instrumental variable regressions are conducted in this paper. Referring to related studies, the water quality of the city is selected as an instrumental variable for city GDP (Russ et al., 2022), and the air quality of the city is selected as an instrumental variable for the disposable income of urban residents (L. Wang et al., 2018). The regression results in this paper remain robust after accounting for model endogeneity issues.
A taxi fares model of China was developed as follows:
Where y is the taxi fare, x1 is the regional GDP, x2 is the level of urbanization, x3 is the per capita disposable income of urban residents, and x4 is the population density of urban areas. For every 10 billion yuan increase in regional GDP, the taxi fare increases by RMB 0.007, for every 1% increase in urbanization level, the taxi fare increases by RMB 0.032, for a 10,000 yuan increase in urban residents’ per capita disposable income, the taxi fare increases by RMB 0.862, and for a thousand people per square kilometer increase in urban population density, the taxi fare increases by RMB 0.124.
The constant, or intercept, is 6.065, which roughly indicates that in even the least economically developed areas of China, a 5 km taxi ride fare is more than RMB 6. The biggest influence on taxi fares is the per capita disposable income of urban residents, which is the factor most closely related to residents’ lives. In areas where the per capita disposable income is higher, taxi fares are already higher and residents are less sensitive to the strength of taxi fare increases when they are made.
Discussion
Taxi fares are a matter of people’s lives and have an important impact on social stability. On the one hand, residents want cheaper taxi fares so that they can reduce their travel costs. On the other hand, taxi drivers want higher taxi fares so that they can increase their income. From the above analysis, it can be found that the overall pattern of taxi fares in China matches the level of economic development. Taxi fares are more expensive in medium and large cities and cheaper in smaller cities. Taxi fares are more expensive in the economically developed regions of Beijing, Tianjin, and Hebei, the Yangtze River Delta, and the Pearl River Delta, and cheaper in the northeast and western regions.
The biggest influence on taxi fares is the per capita disposable income of urban residents, the amount of income directly determines the level of consumption of residents. The other three items have a smaller impact on taxi fares compared to the per capita disposable income of urban residents. Relevant studies have shown that regions with greater per capita disposable income and per capita consumption expenditure have more taxi orders and tend to have higher taxi fares (B. Hu et al., 2021; H. Yang et al., 2000). Regional GDP is a reflection of the level of comprehensive economic development of a region. However, there are also resource-based cities such as Tangshan, Daqing, and Jiaozuo where GDP is high but taxi fares are low. In cities with higher GDP, travel costs are higher and taxi fares are more expensive (Salanova et al., 2011). Tourism cities such as Sanya, Haikou, and Lijiang where GDP is high but taxis mainly serve foreign tourists and thus are higher. Rapid economic growth and urbanization have dramatically changed China’s urban public transportation, fuel prices, parking fees, and taxi fares (C. Sun et al., 2014). The level of urbanization has a smaller impact on taxi fares for the same reasons as regional GDP, with some small and medium-sized cities such as Karamay, Wuhai, and Jiayuguan having a higher level of urbanization but lower taxi fares, and less urbanized ones such as tourist cities. The relation of taxi cost versus bus cost grows with the density of the city, due to the economies of scale of the Mass Public Transport (Salanova et al., 2011). Urban population density reflects the concentration of population, the greater the population density in urban areas the higher the demand for taxis, if there is a long-term supply shortage, the phenomenon of raising taxi fares may occur, Henan, Shandong, and other places in some urban areas with high population density, but the labor market is sufficient, taxi fares are also lower, tourist cities with a small resident population, low population density, but will taxi fares on the high side of the phenomenon.
Based on analyzing the spatial differentiation of taxi fares in China, this study explores what factors affect taxi fares. Generally speaking, the spatial transmission mechanism of urban taxi fares is more significant at the micro level; however, a macro-level study helps to grasp the pattern of spatial differentiation in general. The main contribution of the study is to make up for the lack of analyzing the spatial differentiation of taxi fares as a research object from the macro level. In the future, with the adjustment of taxi fares in different cities, related studies need to be followed up and the results of the macro and micro studies can be compared. In conclusion, taxi fares in a city are determined by a variety of factors, and the model built in this paper only takes into account some of these factors, and the taxi price model thus built is only a correlation rather than a causal relationship. Based on the four factors selected in this paper, a judgment can be made on taxi fares, and the data model can be used to support the rationality of taxi fare adjustments. The divergent pattern of taxi fares and the setting of price increases in a particular city are influenced by multiple factors, of which only four are discussed in this paper. In addition, macro policy regulation, vehicle transportation costs, and supply and demand games can all affect taxi fares, and further exploration is needed in the future where this paper has not focused.
Conclusions
Based on the prefecture-level city scale, the distribution pattern of taxi fares in various cities across the country was analyzed and some indicators were selected to explore the factors affecting taxi fares, with the following conclusions:
The starting price of taxis across the country ranges from RMB 4 to RMB 14, and the price per kilometer ranges from RMB 1 to RMB 3. In addition to this, there are also return emptying fees, night-time driving fees, and waiting fees. The distribution pattern mainly includes taxi fares being higher in provincial capitals than in non-capital cities, and higher in southern cities than in northern cities, with high values concentrated in the Beijing-Tianjin metropolitan area, the Yangtze River Delta metropolitan area, and the Pearl River Delta metropolitan area.
The greater the distance traveled by taxi fares nationwide, the greater the overall divergence measure index and the greater the Gini coefficient. National taxi fares in general show significant spatial agglomeration characteristics and form two low-low agglomeration and three high-high agglomeration centers. The high nuclear density areas are found in the Yangtze River Delta and the Pearl River Delta, with the provincial capitals as the high-value centers of dispersion.
The pattern of taxi price divergence across the country is related to several factors, the most influential of which is the per capita disposable income of urban residents, in addition to regional GDP, urbanization rate, and urban population density, which also have a greater impact on taxi fares.
In conclusion, there is a clear spatial differentiation of taxi fares in China, and this differentiation becomes larger as the distance traveled by taxis increases. Taxi fares in China are most affected by disposable income per urban resident.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest concerning the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The research was funded by the National Natural Science Foundation of China, Grant no. 42371264.
Data Availability Statement
The data that support the findings of this study are openly available in provincial and municipal statistical yearbooks and bulletins.
