Abstract
Whenever someone posts an online review of a restaurant, museum, or barbershop, they also leave a trace of where they traveled. The author visualizes travel patterns in 11 North American metropolitan areas using geolocated review data. The data are based on approximately 7 million online Yelp.com reviews posted by 2 million reviewers between 2005 and 2020. First, the author demonstrates how individual travel patterns can be mapped using the review data and discusses the potential applications of such individual-level data. The author then turns to aggregate-level maps, creating establishment covisit networks in which two establishments are linked if multiple reviewers visit both. Maps of establishment covisits reveal various intriguing patterns related to consumption and geography, such as the connections between neighborhoods and the centralization and segregation within a metropolitan area. Establishment covisit maps can also inform researchers about the diffusion of ideas and practices, trends in crime, and gentrification.
Keywords
The study of human mobility across urban neighborhoods is essential for understanding the dynamics of urban life and its broader societal impacts, such as gentrification, the diffusion of ideas, and the spread of crime. Historically, this field has seen diverse methodologies, ranging from direct observations and field studies (Jacobs 1961; Whyte 1980) to analog mapping techniques (Gould and White 1974), archival historical data (Braudel 1981), and manual traffic counts (Appleyard, Lynch, and Myer 1964; Levinson and Kumar 1997). The advent of Global Positioning System technology has shifted the focus toward electronic “breadcrumbs” left by individuals, with analyses of cell phone data and social media check-ins that shed light on various urban phenomena (Batty 2013; Buckee et al. 2020; Eagle, Macy, and Claxton 2010; Hui, Fader, and Bradlow 2009; Lu, Bengtsson, and Holme 2012).
In this visualization, I propose an alternative mapping approach using online review data, which may offer valuable insights into travel patterns in urban settings. Online reviews, as a data source, can augment existing sources such as cell phone data in studying movement, offering distinct strengths and weaknesses compared with cell phone data. The primary limitation is that online reviews represent a selected dataset: only a small proportion of people leave online reviews, and even active reviewers do not always review every establishment they visit. However, a significant advantage of these data is their public availability and inherent combination with consumption data.
This study analyzes approximately 7 million Yelp.com reviews posted across 11 North American metropolitan areas, revealing patterns in establishment covisits and their implications for urban and consumption sociology.
Data and Setting
I investigate establishment covisit patterns by examining publicly accessible online review data from Yelp.com, as provided in the 2022 Yelp Open Dataset, available at https://www.yelp.com/dataset. This dataset comprises 6,990,280 reviews from 1,987,897 users, covering businesses across different metropolitan areas, including Tucson, Arizona; Santa Barbara, California; Reno, Nevada; Boise, Idaho; Edmonton, Alberta, Canada; St. Louis, Missouri; Nashville, Tennessee; Indianapolis, Indiana; Philadelphia, Pennsylvania; Tampa, Florida; and New Orleans, Louisiana). 1 The dataset encompasses reviews of various establishments, such as post offices, bars, barbershops, and gyms; however, more than 80 percent of the reviews are related to restaurants. Each review includes several details, such as the date it was written, the identification number of the establishment being evaluated, and an anonymized ID for the person who wrote the review. In this visualization, I also use geolocation data for the establishments, provided in latitude-longitude format.
The dataset spans from November 2005 to January 2022. As the coronavirus disease 2019 pandemic lockdowns significantly altered dining patterns, which had not fully recovered by early 2022 (resulting in insufficient postpandemic observations in the dataset), I limited the observation window from November 2005 to February 2020.
I developed a Python script to calculate both individual and aggregate travel patterns, as well as their visualization; see Appendix B for the full code.
Visualizing Individual-Level Patterns
First, I visualize establishment covisit patterns on an individual level. Understanding individual travel patterns is informative for multiple domains in sociology, such as assessing the consumption habits of individuals, the breadth of their social networks, or their openness to different experiences. Individual-level maps can assist researchers in gaining a clearer understanding of reviewers’ travel and consumption patterns. These data could be used to address various questions, such as whether a person tends to stay local or covers a wide geographical area and whether they visit limited neighborhood types, such as highly affluent areas, or a more diverse range, including downtowns, suburbs, and neighborhoods of varying affluence. Additionally, these patterns could aid researchers in computing measures such as cosmopolitanism (Kovács and Carroll, 2023).
To visualize establishment covisit patterns, I combine the geolocation data of establishments visited by a reviewer and represent it as a directed network, where location A is connected to location B if a person visits location B following location A. Figure 1 illustrates a few such maps. As in this article I cannot include a separate map for each of the 2 million reviewers in the dataset, I picked two reviewers randomly. As the figure shows, the Indianapolis-based reviewer mostly visits establishments in the northern suburbs of Indianapolis and reviews a downtown destination only once. Contrast this with the reviewer from New Orleans, who typically frequents downtown locations and visits only three locations outside the downtown area. Information such as this can help characterize reviewer habits, especially when merged with neighborhood characteristics such as average income, neighborhood racial composition, or average education level.

Mobility patterns of Yelp reviewers in Indianapolis and New Orleans: (a) individual-level Indianapolis, (b) individual-level New Orleans, (c) aggregate-level Indianapolis, and (d) aggregate-level New Orleans.
Visualizing Aggregate-Level Patterns
Although individual-level maps are illustrative, creating a map for each of the approximately 2 million reviewers in the dataset is not feasible. Instead, one can create aggregate-level maps to visualize typical travel patterns in a given metropolitan area. To achieve this, I create what are known as establishment covisit maps. Specifically, for each cosmopolitan area, I calculate the number of reviewers who visited both establishments in each pairwise combination within that area. (These networks are undirected and do not consider timing.) I then create a network visualization overlayed on the map in which two establishments are connected with a line if there are at least 10 reviewers who visited both establishments. Figures 1c and 1d show these maps for Indianapolis and New Orleans. Appendix A includes similar maps for the other nine metropolitan areas in the database. Appendix A also shows, for the case of the Philadelphia metropolitan area, how the choice of the threshold influences the visualization. 2 (This threshold can be easily modified in the Python script in Appendix B. The maps are also zoomable once the script has run.) These maps illustrate the traffic volume between establishments and neighborhoods. For example, the map of Indianapolis shows a hub-and-spoke travel structure with strong connections between downtown and the northern suburbs. The covisit patterns in New Orleans, on the other hand, show a more integrated travel pattern for most neighborhoods (less of a hub-and-spoke pattern); yet there are major neighborhoods in the city that are less frequented by Yelp reviewers (perhaps not surprisingly, these are the areas that were most heavily affected by Hurricane Katrina (https://www.nytimes.com/interactive/2015/08/25/us/mapping-katrina-and-aftermath.html).
Although the systematic analysis of these covisit travel patterns is beyond the scope of this article, research opportunities abound. For example, one could use such maps to document and possibly predict gentrification. Future research could also analyze whether ethnically diverse communities are more connected to the core or stay in the periphery (Eagle et al. 2010). One could explore if there are correlations between income levels, education levels, or demographics and people’s movements between businesses. Urban designers can use such maps to examine how urban design influences people’s movements. For example, do pedestrian-friendly areas or the availability of public transport encourage more movement between neighborhoods? For a more practical purpose, one could identify different customer segments on the basis of their movement patterns. For example, some customers may be authenticity seeking (Kovács, Carroll, and Lehman 2014) and thus connect highly authentic establishments and neighborhoods. Information such as this could provide insights on where to place advertising or how to optimize business hours.
Such maps could also inform studies of tourism. In related work, Hawelka et al. (2014) combined TripAdvisor reviews with transport data in Barcelona and London to model intracity movement flows of different traveler types. Business travelers moved between commerce districts, while other tourists congregated at attractions. Meanwhile, McKercher et al. (2012) inferred tourist mobility archetypes in Singapore from TripAdvisor trails, categorizing patterns such as “museum hoppers” and “resort lovers.” This mobility segmentation has applications for tourism planning.
Finally, I have shown maps for covisits only within metropolitan areas, but the same approach (and the script in Appendix B) could also be applied to study inter–metropolitan area covisits.
Conclusion, Limitations, and Avenues for Future Research
This article has demonstrated the possibility of using online review data to map spatial movement and consumption patterns in metropolitan areas. I provide methods for mapping spatial consumption patterns both at the individual and aggregate levels and suggest research questions for which such maps can be helpful in answering.
The use of online reviews as a data source is intended not to replace but to complement traditional sources such as cell phone data in the analysis of movement patterns. Mapping on the basis of online reviews has its own unique advantages and disadvantages compared with using cell phone data. Its main drawback is that it constitutes a selective dataset: only a limited number of individuals post online reviews, and those who frequently review do not necessarily cover every place they visit. On the other hand, a notable benefit of these data is that they are publicly accessible and intrinsically linked with information on consumer behavior.
Footnotes
Appendix: Additional Maps
Appendix B: Source Code
