Abstract
Individuals are fond of belonging to a social environment with a similar social background, which can impact the individual’s decision to visit specific venues for leisure activities. Using data from Zurich, we have measured the preference for a social environment in four categories of leisure venues: restaurants, cafes, bars, and nightclubs; the estimation was performed using a mixed multinomial logit model to see how homophily for socioeconomic characteristics can impact the decisions of choosing a leisure venue. The models included three homophilic preferences: age, income, and cultural origin as variables of interest. The results show a positive impact of the three variables in different degrees: age is the most relevant in the two venue categories, income only impacts when individuals choose restaurants or cafes, and cultural background is more relevant for nightlife venues. These results show that the sociodemographic characteristics of the social environment are relevant for the choice of leisure destinations. These findings can contribute to the formulation of policies to create more diverse leisure environments and socially cohesive communities.
Keywords
Introduction
Leisure travel is an essential part of urban mobility, accounting for 43% of trips in Switzerland (Bundesamt für Statistik, 2023b), with increasing importance over the last decades. One difference between leisure travel and commuting is its heterogeneity in motivations, schedules, and routines; leisure travel has less rigid spatial and temporal constraints, giving more freedom to the individual to perform it at different times and places (Schlich et al., 2004). Regardless of this heterogeneity of motivations, one common characteristic is its social motivation, since most of these activities are carried out in pairs or groups to maintain social connections or expand one’s social network (Axhausen, 2005, 2008). The two main types of social motivations studied to understand leisure travel have been the ego-alter and the ego-network. Ego-alter analysis is the study of the relationship between an individual (ego) and a friend, family member, or other acquaintance (alter) of the ego, focusing on the sociodemographic characteristics of the individuals, their relationship status, frequency of communication and emotional closeness, and is used to understand how specific individuals belonging to the ego’s social network impact the ego’s travel behavior. Ego-network analysis focuses on the overall characteristics of the ego social network, such as the number of friends and family members, their distribution in space, and the resources available from the network; this analysis helps to understand the macro-level influences that networks have on the egos (Carrasco and Miller, 2009). In this paper, we propose a third type of social motivation for social-leisure travel: the social environment, which is the sociodemographic characteristics of a group of unknown individuals that share a common space, in this case an urban leisure venue. We base our hypothesis on the interest of individuals in belonging to a social context founded on shared experiences, beliefs, or personal characteristics (Mahar et al., 2013), which can be a primary motivation for leisure. However, preferences for venues with specific social environments can lead to self-selection into those venues, contributing to an overall increase in urban segregation.
Since the mid-1970s, segregation has been on the rise in many countries, and understanding its causes and effects has become an important topic of interest in urban studies (Musterd et al., 2017). Segregation is a complex spatial phenomenon, as it responds to multiple factors, including social, economic, and political, and cannot be reduced to purely residential clusters, as residential segregation does not necessarily imply social segregation (Vaughan and Arbaci, 2011). The use of public and semi-public spaces for leisure can have a large impact on interactions between different social groups in daily life, as leisure spaces have replaced firms and workplaces as organizing units in society (Florida, 2003), becoming an essential contribution to the social sustainability of cities and a source of face-to-face interaction between unknown people, generating a sense of community (Jacobs, 1961; Nasar and Julian, 1995) and improving the general quality of life of the population (Bramley and Power, 2009). Therefore, understanding the importance of self-selection for the social environment in leisure activities can help create more socially diverse venues to help heterogeneous social interactions thrive.
To test the importance of the social environment in the leisure destination choice process, we have conducted a survey in Zurich, Switzerland, asking about regularly visited leisure venues. Later, a mixed multinomial logit was estimated; this model measures the impact that homophily, as our variable of interest, has on the decision to go to a venue. Homophilic preferences are multifactorial, with age, income, education, race, and religion being the most important sociodemographic characteristics of tie formations (McPherson et al., 2001). In this paper, we focus on three characteristics of the social environment: age, income, and cultural origin. These preferences are studied in four types of leisure venues: restaurants, cafes, bars, and nightclubs. The models show that the three homophily variables studied positively and significantly impact the decision to choose these venues for leisure activities and can be relevant to understanding how segregation dynamics are generated in daily activities.
Literature review
Activity-based segregation has been a topic of growing interest in recent years. A new concept that has been applied to understand the phenomena is the multi-contextual segregation, defined as the uneven spatio-temporal distribution of daily life and the uses of the city of different social groups; this definition integrates time and space components into a conceptual framework, improving the understanding of segregation based only on space (Park and Kwan, 2018). Differences in time-space uses of the city can be generated by differences in income, activity characteristics, and the built environment, as shown by Wang et al. (2022). The most affected social group is lower-income individuals as they are highly dependent on the areas around their homes to carry out leisure and shopping activities; these spatial boundaries are not due to geographical constraints but to economic affordability (Zhang et al., 2019). For example, in Brazil, high-income white people have the highest access to leisure activities due to the concentration of venues around the neighborhoods where this group lives and the higher access to motorized private transport; while low-income black populations live further away from leisure venues and have less access to mobility tools (Bogado Tomasiello and Giannotti, 2023). Wu et al. (2023) have studied the gathering capacity of different areas of Shenzhen, showing that high-tech areas of the city have a high gathering capacity for heterogeneous populations, while in terms of specific venues, food and beverage services vary in their capacity to gather individuals from different socioeconomic backgrounds, as consumption habits vary significantly between individuals. The results align with studies on the ethnolinguistic co-presence of individuals, showing that the city center can be an essential space to interact with culturally different individuals (Toomet et al., 2015). These studies have focused on how the built environment and accessibility can impact the segregation of leisure activities. However, to date, there is still a need to understand the behavioral mechanisms that can explain activity-space segregation. A first endeavor into this topic was made by Chen et al. (2024), who studied how individual choices affect activity-space segregation, showing that people with medium and high education levels prefer destinations with mixed social environments. Still, more research is still needed to dive deeper into the behavioral motivations that can generate activity-based segregation. For this reason, our interest is to analyze the behavioral origins of segregation with a focus on leisure activities.
To study leisure behavior, the literature has focused on the social context and the need for interaction with other individuals. In order to understand travel decisions for social activities, the main focus has been on ego-alter and ego-network characteristics (Kim et al., 2018). In terms of ego-alter characteristics, the models of social activity participation have estimated different parameters that impact the generation of face-to-face interaction, such as the distance between individuals (Van Den Berg et al., 2010) and the type of relationship (Carrasco and Miller, 2009). Han et al. (2023) has focused on the impact of cliques on restaurant choice, showing that cliques tend to choose venues based on the mean distance to the venue of individuals who belong to that clique. The social influence of travel affects the overall individual’s mobility patterns; individuals with a more dispersed social network also generate a more dispersed leisure activity space (Gramsch-Calvo and Axhausen, 2024a). Furthermore, individuals with more extensive social networks tend to have greater heterogeneity in the type of sites visited and to perform more socially motivated travel (Baburajan, 2019).
In social networks, a recurring topic is homophily; individuals tend to interact and create connections with other similar individuals, creating personal networks that are homogeneous in terms of sociodemographic, behavioral, and intrapersonal characteristics, generating clusters in social space (McPherson et al., 2001). Homophily has been studied for different aspects of social life, such as housing markets (Galster et al., 2021; Pinchak et al., 2021), friendship (Currarini et al., 2009), and migrant communities (Vacca et al., 2022), and can have positive and negative effects on social systems; on one hand, it promotes cooperation between individuals and diffusion of knowledge (Korkmaz et al., 2020; Melamed et al., 2020) but on the other hand, it can generate social segregation (Schelling, 1971).
The decision to conduct leisure in a specific location can be influenced by a wide variety of temporal, spatial, personal, and social factors, making it almost impossible to estimate choice models that can be generalizable to all choice situations, generating a variety of approaches, methodologies, and theoretical structures adjusted for the context of the models (Barnard, 1987). One of the most common methodologies is the multinomial logit model (McFadden, 1973), which can individually disaggregate the alternatives in the choice set. Despite this benefit, there are still challenges associated with destination choice models that are not directly related to Random Utility Theory, but to the data used to estimate them. Most research using this technique still uses zonal-level characteristics, such as the number of venues, population, and OD impedance variables of interest (Wang and Miller, 2014), which can be misleading for low-density areas (Molloy and Moeckel, 2017). To improve the information about the choice process, researchers have started using social networks to collect data on variables of interest, allowing them to increase the available information (Rashidi et al., 2017); however, these models still rely primarily on zonal characteristics. In this paper, we combine data collected using a survey with venue data collected from a Social Network Service (Google Places) to understand the choice process at the venue level, improving the level of detail of the models by using information that was previously expensive to collect, such as prices and ratings of the venues studied.
Urban leisure segregation has been studied mainly as the co-presence of dissimilar individuals in the same areas of a city. However, this is not sufficient to ensure the mixing of social groups, as leisure activities can be performed in adjacent venues with different social environments. For this reason, it is essential to integrate leisure behavior to understand how social travel can be influenced by the preference to be part of a specific homophilic social environment at the venue level, separating the segregation generated by individual behavior from the segregation generated by the built environment.
Data description
To collect data for this study, a three-stage survey was designed and implemented in the Zurich Metropolitan Area, during November and December 2022. It contacted 8000 randomly selected addresses obtained from a private address provider. The invitation was sent by mail with a letter specifying the research, a link and QR code with the first stage of the survey, and a six-letter code for personal identification. After completion of the first stage, an invitation for the second stage was sent 3 days later by email, if provided by the participant, the third stage was sent 3 days after the second. There was a 15 CHF incentive (16.2 USD at the time of the survey) for respondents who completed the three stages of the survey. The analysis of this manuscript only uses the data collected in the second stage; therefore, the data description and survey design will focus on that stage.
The second stage of the survey was separated into two parts, the first part is called the place generator (Gramsch-Calvo and Axhausen, 2024b) which is a methodology adapted from the name generator, widely used in social network analysis; in this part, the respondent had to name the venues they visit on a regular basis in seven categories: restaurants and cafes; bars and nightclubs; parks, forests and other outdoor locations; cultural venues; social or religious centers; gyms and sport-centers; and others. Each category had up to three spaces that could be filled in with the names of the venues. The second part of the stage was the place interpreter, which consisted of a group of questions that were repeated for each of the venues mentioned previously; the questions were related to the visit schedules (i.e., regular visit time, days of the week, and frequency of visit), reasons to go, and characteristics of the place. The main question of interest in this section was to describe the sociodemographic characteristics that the respondent believes they have in common with the other visitors to the venue; this question was later used to reproduce the social environment of the venues included in the study.
Sociodemographic characteristics of surveyed individuals.
Categories used for the estimation.
aThere are 1653 different venues, but the sum of this column is 1799. This is because some venues are part of two categories.
bThe categories were combined for the estimation, but the choice sets were generated only with the same original category (i.e., nightclubs are considered under nightlife but their choice set is composed only other nightclubs).
To obtain the geographic locations, categories, prices, and ratings of the venues, we geocoded the responses with the Google Places API (Cambon et al., 2021). The first information collected by geocoding is the spatial location of the venues. Figure 1 shows the distribution of the venues. Plot A is the spatial distribution of food venues in Zurich; plot B is for the nightlife venues. Plot C is a sample of the network formed by leisure activities, representing how unknown individuals (squares) are connected through leisure venues (circles). By participating in leisure activities, individuals generate the social environment of the venue visited, generating a complex network of interactions and communities. Sample of distribution of: (A) food venues, (B) nightlife venues, and (C) example social networks of a leisure location.
The second set of information obtained from the geocoding process used is the rating and price of the venue. This information, combined with the information provided by the respondents, creates a dataset of venues to analyze. The first analysis compares the rating with the price level of the venues. Figure 2 shows the relationship between these two variables. The rating variable ranges from 1 to 4.9, with most venues in the 4 to 5 range. Regarding the price levels, the variable ranges from 1 to 4. The price level is also a proxy for the type of service offered; restaurants price level 1 are primarily fast food and budget-focus restaurants, while restaurants price level 4 are fine dining; but due to the lack of available locations with price level 4 (10 venues), we have excluded them from the analysis. The correlation between rating and price is negative, but low (−0.08). As the rating is user-based, one could expect that individuals have rate venues depending on the price level and service expected. The distance was calculated as the Euclidean distance from the home location of the respondent to the venue; individuals live in a median distance of 2.9 km from their regular leisure venues. Relation between price and rating of the venues visited.
To recreate the social environment of each venue, we have included a question about the type of visitors, in relation to the respondent, who frequent the venues mentioned. The answers to this question are presented in figure Figure 3. The figure shows the percentage of places where the individual has stated that there is homophily in the venue’s social environment for each characteristic. Nightclubs and bars are the category that has the highest homophily, especially in terms of age, restaurants on the other side have the lowest similarity, with income having the lowest share. Cafés show a similar distribution of the social environment to restaurants. Finally, the I don’t know answer has a very small percentage of responses (the highest is 8.7% for bars), showing that people are aware of the social environment of the venues they visit and are able to describe it in simple terms. Homophily described by individuals.
Assuming that the answers given by the individuals are correct, we have used the information presented above to build the social environment of all venues included in the study. In case of two or more individuals mentioning different social environments for the same venue, we have used both answers as valid to allow for heterogeneity of crowds in time, as venues could attract different groups of individuals on different days. The specific procedure to generate the social environment is the following: • Age: If the respondent answers that the venue social environment’s age is similar to theirs, we assume the social environment of the place is in a range of ± 5 years to the respondent’s age. For example, if an individual aged 34 mentions that the social environment of restaurant X is similar to theirs, the age of the social environment is between 29 and 39. • Income: To create the income social environment of the venues, we used a subjective income variable; the question asks about the perceived level of household income compared to the country’s average, with five Likert options from far below average to far above average. We consider this variable instead of the objective income because the perceived socioeconomic position results from a process of acquiring self-perception and wealth prospects, being related not only to income but also to education and individual and family experiences (Ferreira et al., 2018), therefore individuals with similar household incomes can belong to different socioeconomic groups due to temporal or circumstantial changes in remunerations (i.e., unemployment), different household maintenance costs, or social contexts. If an individual mentions that a venue has a similar socioeconomic social environment, we use its answer to the subjective income question as the income level of the venue’s social environment. • Cultural origin: This variable was created using the parents’ origin as the variable of interest. The first question asked the respondents if the parents were foreign-born; if the answer is “no,” the person is considered Swiss. If the answer is “yes,” the follow-up question asked what macro-region the parents were originally from. Then, if the person answers that the social environment has a similar cultural origin, the venue has a social environment of the culture specified before. Although the definition of culture has changed with globalization and geographic borders do not necessarily explain cultural similarities anymore (Sycara et al., 2013), the origin country of first- and second-generation immigrants remains a relevant factor in social connections (Galster et al., 2021).
Venue-level variables used in the choice model.
Methodology
To estimate the leisure destination choice model, we base our model on the methodology first introduced by Mansky (1977), with a choice set probabilistically integrated into the choice model. The original model proposed by Mansky is as follows:
In which, P ni is the probability that individual n chooses option i, being C n the evoke set; P n (i|C n ) is the probability that individual n chooses option i given C n ; and π n (C n |i, z) is the probability that the choice set of individual n is C n given the observed choice i and a vector of variables z.
Choice set selection
As the choice set is unknown, we have to differentiate between the universal set (all alternatives available) and the evoke set (the alternatives that meet certain criteria and are considered by the individual) (Pagliara and Timmermans, 2009). To estimate π
n
(C
n
|i, z) we have generated a choice set with thresholds on the location characteristics (Cantillo and Ortúzar, 2006). The evoke set C
n
is:
where X nj is an attribute of the alternative j for the individual n, H n is the highest threshold value that the individual n considers for the attribute X. The defined threshold is 1.5 times the distance from the site that the individual visits for leisure. The value was chosen to reduce the possibility that an unchosen alternative is too far from the individual’s activity space, generating a model that estimates more discerning individuals regarding this specific attribute, avoiding implausible behaviors (Kimya, 2018), such as always choosing the closest venue. The average choice set size is 398.2 with a maximum of 1182. We tested other choice set generation strategies such as importance and strategic samplings, with this spatial restriction strategy being the one with the most reliable results in terms of rho2, AIC, and BIC.
Model description
To estimate the probability of choosing an alternative i, following the formulation of McFadden (1973):
In which, V
ni
is the systematic component of the utility of person n given by the alternative i. As π
n
(C
n
|i, z) includes all alternatives below the threshold H
n
, and contains i, it satisfies the positive conditioning property (if i, j ∈ C
n
and P(i ∈ C
n
) > 0, then P(j ∈ C
n
) > 0) and the uniform conditioning property (π
i
(C
n
|i, z) = π
i
(C
n
|j, z)), then the probability of the choice set is canceled out. Then, the log-likelihood function is:
where β is the taste coefficient of each variable and x is the vector of attributes for individual n at venue i, when applicable. Price has a subindex c as it has three levels from 1 to 3, while homophily subindex h represents the three homophilic preferences studied. The scale parameter has a mean value
Model results
Multinomial logit
MNL model of homophily in destination choice.
In terms of the preference for homophily, the nightlife venues model, on average, has higher estimates. In both models, age homophily is the most important variable, with estimates of 1.15 and 1.33, respectively. The second most important homophily variable for food venues is income, with an estimate of 0.69, followed by cultural origin with an estimate of 0.60. For nightlife venues, homophily in cultural origin is more important than income homophily with estimates of 1.13 and 0.36, respectively.
Mixed multinomial logit
Mixed MNL model with heterogeneity in preferences for homophily in destination choice.
Utilities of the socioeconomic groups for the price level of the venues.
aIncome has been excluded of the nightlife table as it shows no statistical significance in any of the interactions with price.
Willingness to travel
Homophily-distance willingness to travel, measured as increase in percentual distance.
aAs the WTT depends on distance, the willingness to travel is estimated by multiplying these values by
A visual representation of the heterogeneity of the WTT estimates is shown in Figure 4, where the conditional (bars) and unconditional (density) distributions of the three preferences for homophily are presented by model. Regarding food services, cultural origin and income have high peaks, showing greater homogeneity in preferences compared to age homophily. In the case of the nightlife venues model, age and cultural origin homophily show a similar distribution of the conditionals, with both distributions having a high heterogeneity, while the preference for income homophily is very close to zero for most of the individuals, which explains the low mean preference shown above. Conditional (bars) and unconditional (density) distributions of willingness to travel. (A, B, and C) Food venues and (D, E, and F) nightlife venues. Note: y-axis of graph E uses a different scale.
Discussion and conclusion
We have analyzed the importance of the social environment in destination choice, measured as the preference for homophily of three characteristics of the social environment: age, income, and cultural origin. The two models, food venues and nightlife venues, have shown a positive and significant impact of the three socioeconomic homophilic preferences when choosing a venue to perform leisure, age homophily being the most relevant variable in both models. In contrast, income homophily is relevant, but with a low WTT, when choosing a restaurant or cafe, whereas its importance is negligible when choosing nightlife venues, which coincides with a statistical insignificance of the income interaction with price in those venues; homophily of cultural origin shows higher relevance when choosing bars or nightclubs than when choosing food venues. The difference in homophilic preferences shows that individuals have different preferences for the social environment depending on the context of the social activity. The higher preferences for homophily in nightlife venues can be related to the social expectations of the activity: bars and nightclubs tend to have more interaction between unknown individuals, and they are better venues to potentially meet new people, while restaurants and cafés depend more on clique interaction than interactions “between tables.” These distinct preferences are consistent with previous research on social networks showing that homophilic preferences have baseline patterns with differences depending on the type of relationship formed between individuals (McPherson et al., 2001). The distribution of the preferences for homophily also depends on the type of venues; preference for income homophily has a higher variance in food venues than in nightlife venues, while the opposite happens with age and cultural origin, which shows that the distribution of preferences in food services tends to be more homogeneous than in nightlife venues. The low WTT of income homophily in both types of services could be related to inequality not being a recurrent problem in Switzerland, which has had stable inequality indicators since 1930 (Institut für Schweizer Wirtschaftspolitik, 2024), while it has increased significantly since 1980 in most countries (Chancel et al., 2022). On the other hand, there has been a growing problem of racial profiling in recent years, which could be related to the preferences for homophily in cultural origin. 1
The study makes two contributions to the literature. First, the measurement of homophilic preferences in leisure activities; homophily is an essential aspect of social networks and can be observed in many types of social interaction, but as far as the authors are aware, this is the first time it has been measured as a characteristic that affects the choice of the specific venue to perform leisure activities. The results can help understand how individuals choose where to share common spaces with unknown individuals, these interactions are an essential aspect of urban social life as they generate a sense of community and belonging, which could result in socialization and the formation of friendship ties; in practice, with these results new public policy can be implemented in order to create more welcoming urban leisure spaces, that integrate different groups of society on a daily basis. Previous studies have found that some areas of the city tend to gather individuals from different social backgrounds, but visiting the same areas does not necessarily mean that they perform leisure activities in the same venues. Therefore, more research is needed to understand the origins and measure leisure segregation and define whether segregation can still occur in physical proximity; this is a first approach to this task.
The second contribution is to show the potential of using venue-level information to estimate models of leisure destination choice. In this paper, we have mixed survey data with social network services (SNS) data to include three types of characteristics: individual level (age and income), venue level (category, price, area attractiveness, and rating), and individual venue level (distance and homophily). The use of venue-level characteristics for leisure destination choice is a methodological advancement in comparison to previous models that analyze meso-destination characteristics, such as demographics and agglomeration effects, because using venue characteristics such as price or rating can increase the number of options available for individuals as well as make comparisons between spatially contiguous venues possible, increasing the level of details the model can provide.
The literature on homophily has shown that preferences lead to network segregation, even when preferences are minor (Goodreau et al., 2009; Hatna and Benenson, 2015; Schelling, 1971); for this reason, it is relevant to understand the preferences of individuals for the social environment of their leisure venues. However, a second potential venue characteristic in leisure activities can lead to age and income segregation: the price category. The preference for price category varies depending on age and income, with the sign of the parameters varying from negative to positive as individuals get older and wealthier; this could generate higher levels of segregation in leisure activities than if only homophilic preferences were to be considered.
We want to point out two main limitations of this study. The first is how the social environment is defined, as collecting data on the social environment for more than 1500 venues is resource intensive; we had to rely on the information provided by the respondents, which can be prone to measurement errors. The second limitation is the number of socioeconomic characteristics used for homophilic preferences; homophily is a complex phenomenon that depends not only on the socioeconomic characteristics and interests of the individual but on shared values, beliefs, or a multifactorial combination of all these variables (Block and Grund, 2014). Additionally, the data has been collected in Zurich, and therefore the preferences estimated are not generalizable to other locations. However, this study has shown the importance of the three homophilic preferences studied in the individual’s process of leisure destination choice, and it is a first step in understanding how individual preferences can generate segregation in daily leisure activities.
Footnotes
Acknowledgments
We are thankful for the contributions of the DiffUrb team supporting with the data collection Prof. Dr. Vincent Kaufmann, Prof. Dr. Adrienne Grêt-Regamey, Dr. Florian Masse, and Heidi Baumann. We are also thankful to all the respondents who finished the survey.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The project was funded by the SNF grant “Difference-Oriented Urban Planning (DiffUrb)” (Grant No. 198565).
Data availability statement
The data set is available on demand; please contact the corresponding author.
