Evaluation Framework for Multi-Modal Public Transport Systems Based on Connectivity and Transfers at Stop Level

Abstract

Multi-modal public transport (PT) networks within metropolitan areas are often characterized by complexity resulting mainly from their infrastructure, design, operations, and demand. This complexity leads to a significant amount of effort on behalf of the transit agencies to properly evaluate their performance at certain locations and proceed with improvements. This study proposes a methodology based on clustering techniques that facilitates the evaluation of PT networks. The evaluation framework refers to the comparison between the levels of supply and demand at a certain stop. Service supply is quantified through an existing connectivity index, whereas demand is considered through the number of transfers that are performed at each stop. Transfers are critical within multi-modal mobility and often serve as a hindrance for choosing PT. The case study here is the Helsinki PT network in Finland. General Transit Feed Specification (GTFS) data are used for quantifying connectivity and a dataset deriving from smartphone ticketing application for quantifying transfers. Results include the evaluation for each PT mode and for the overall multi-modal PT network. Focusing on the evaluation of the overall multi-modal PT network, connectivity and transfers levels for 75.60% of stops are found to be well aligned. Therefore, these stops could be eliminated from the list of candidate stops for performing improvements. Of the remaining stops, 19.73% belongs to the case of higher connectivity than transfers and 4.67% to the case of lower connectivity than transfers. Stops included in these two cases require further attention and prioritization during planning processes.

Keywords

public transportation management and performance performance measures transformative trends in transit data GTFS

There is a vital need for efficient public transport (PT) networks to ensure well-functioning and sustainable cities. Decision-making associated with planning PT systems and services can be very complex, especially in large multi-modal networks where both the set of candidate actions and locations for improving the offered services can be very long. Therefore, the development of tools and methods to evaluate existing PT systems and proceed with planning decisions to improve them accordingly is of high importance, considering also the high costs of constructing new PT systems ( 1 ) and the efforts required for planning infrastructure investments ( 2 ). According to literature, most studies evaluate PT based on supply or demand, but it is important to account for both ( 3 ). In addition, many of the studies in the field focus on one travel mode, as for example buses (3 –6), which highlights the need for more multi-modal approaches.

Multi-modal environments are considered in Carroll and Yamamoto ( 7 ), where the authors highlight the need for balance between system performance and user perspective. The importance of multi-modal approaches in transport evaluation is discussed in Litman ( 8 ). In addition to supporting the role of accessibility in the list of conventional factors commonly used for transport evaluation, the authors also emphasize on the need for recognizing the diversity of travel demands and modes to achieve more comprehensive evaluations. Both supply- and demand-related factors as part of multi-modal evaluations are also considered in Rodríguez González et al. ( 9 ). Supply is considered through generating space–time characterization of the PT system’s operation. Demand is studied through origin–destination matrices that demonstrate mobility patterns. An evaluation of multi-modal trips was performed in Kumar et al. ( 10 ), by considering supply-related performance measures (e.g., level of service) and quantifying user experience (e.g., access/egress time) through surveys.

Considering supply, there are several indicators used for evaluating PT performance, among which is the usage of network connectivity measures (11 –13). Park and Kang ( 14 ) identified the need for a connectivity index that accounts for the PT network’s operational characteristics and thus introduced the transit connectivity index. This index has been further extended and utilized in various studies ( 15 – 17 ). Mishra et al. ( 15 ) emphasized on the role of such an index in prioritizing PT stops for funding and other planning activities in multi-modal networks, among other applications.

Planning activities should account for how people use complex PT networks, as for example, with reference to demand, accessibility, and trip planning ( 18 ). Therefore, understanding the network usage should be part of the methods and tools that are implemented for evaluation purposes. One of the most critical parts of a PT network’s usage refers to where travelers perform transfers. Transfers are in the core of multi-modality and one of the greatest challenges to achieve seamless PT mobility ( 19 , 20 ). According to De Witte et al. ( 21 ), transfers are often associated with great disutility when it comes to mode choice and thus might be a hindrance for choosing PT instead of competing alternatives (e.g., private cars). Therefore, stations serving as transfer points play a major role in PT networks with various studies emphasizing the need for more attention to be given to them ( 22 ).

The quantification of a PT network’s performance as regards supply and demand can be achieved through a variety of available data sources in the field of PT, such as automatic vehicle location (AVL), automatic passenger counts (APC), and smart card data. A widely available data source concerning PT operational characteristics is the General Transit Feed Specification or GTFS data ( 18 , 23 ). As a source for schedule information, it is usually released by PT authorities for users to plan their routes accordingly. Tracking PT users’ trajectories within large multi-modal networks is a complex process and requires the combination of different traditional data sources under several assumptions. Moreover, such sources offer information that is not detailed enough to properly detect transfer activities ( 24 ). Emerging data sources in the field refer to wireless communication technologies ( 25 ) and Bluetooth beacons which have been applied in PT systems in recent years ( 26 ). The potential role of mobile phones in improving mobility has been investigated in literature ( 27 ). Rinne et al. ( 28 ) proposed an automatic method to recognize PT trajectories based on activity information and sensor measurements sensed by mobile phone systems, which provides a new opportunity to sense PT data.

One of the challenges associated with large datasets refers to the need for advanced processing to derive useful results for practice. Machine learning techniques are widely used for analyzing large datasets and producing results and insights of practical interest. Clustering techniques are common machine learning approaches that are used for a variety of PT applications (29 –31). Among them, the hierarchical clustering technique consists of defining clusters of observations progressively and is often used for classification purposes in PT studies (e.g., in He et al. [ 32 ]).

Acknowledging the above, this study develops a methodology to evaluate multi-modal PT systems at the stop level. This methodology aims at assisting planning processes for PT improvement. Clustering techniques are implemented for ordering stops based on their supply and demand levels. Supply and demand clusters are compared according to the proposed methodology to identify the stops that require further attention, thus reducing the number of stops that planners should investigate during their efforts for increasing PT performance. Supply is quantified through the indicator of PT connectivity ( 14 – 16 ) and GTFS data are utilized to obtain the required information. Demand is studied with a focus on the critical case of transfer activities, and data derived from mobile phone applications are used ( 33 ). The case study refers to the PT network of Helsinki (Finland), operated by the Helsinki Regional Transport Authority (“Helsingin Seudun Liikenne” or “HSL”), which also owns the utilized demand dataset called “TravelSense”.

Methodology

This study aims at utilizing traditional and emerging data sources for the quantification of supply and demand to evaluate the performance of a multi-modal PT network that accounts for both. Supply is quantified here through an established metric of transit connectivity. Demand is studied here concerning performed transfers per stop. Clustering techniques are used for evaluating the PT performance. The steps of the proposed methodology are described in detail as follows.

Quantification of Connectivity

The connectivity index considered here was first presented in Park and Kang ( 14 ) and later extended and utilized in other studies ( 15 – 17 ). The goal was to identify a connectivity index that reflects not only the nodes and links within a PT network, but also allows the reflection of its operational characteristics. This connectivity index focuses on a stop $n$ $(θ_{n})$ and quantifies the sum of connecting powers of all routes $r \in R (P_{r, n}^{t})$ at stop $n$ . It can be described by a general formula as follows:

θ_{n} = \sum_{r \in R} P_{r, n}^{t} μ_{r, n}

(1)

where $μ_{r, n}$ takes the value 1 if route $r$ contributes to the connectivity at node n, and 0 otherwise. As regards $P_{r, n}^{t}$ , it represents the connecting power of route $r$ at node $n$ and it is the average of inbound $(P_{r, n}^{i})$ and outbound $(P_{r, n}^{o})$ connecting powers of route $r$ at stop $n$ , described as follows:

P_{r, n}^{t} = \frac{P_{r, n}^{i} + P_{r, n}^{o}}{2}

(2)

The inbound $(P_{r, n}^{i})$ and outbound $(P_{r, n}^{o})$ connecting powers of a route $r$ at stop $n$ are:

P_{r, n}^{i} = α C_{r} \times β V_{r} \times γ D_{r, n}^{i}

(3)

P_{r, n}^{o} = α C_{r} \times β V_{r} \times γ D_{r, n}^{o}

(4)

where $C_{r}$ is the passenger capacity of route $r$ (pax), $V_{r}$ is the speed of route $r$ (km/h), $D_{r, n}^{i}$ is the route distance of stop $n$ from the route origin (km), and $D_{r, n}^{o}$ is the route distance of stop $n$ from the route destination (km). The passenger capacity of a route depends on the passenger capacity of a vehicle and the number of trips that are performed in the time period that is studied. The parameters $α$ , $β$ , and $γ$ are the scaling factor coefficients for capacity, speed, and distance and are the reciprocals of the average capacity of the system, the reciprocal of the average speed on each route, and the reciprocal of the average network route distance, respectively.

Quantification of Demand

Demand in this study refers to the number of transfers that are performed at a certain PT stop. Transfers are a critical part of multi-modal PT trips that are usually challenging to quantify through conventional data sources. As transfers, in this study we consider the number of travelers within the network that boarded a PT vehicle at a certain stop shortly after they alighted another PT vehicle, either at the same or a different PT stop. If two stops are involved in the transfer activity, then both are attributed the transfer activity when counting the number of transfers per stop. The time interval between alighting a PT vehicle and boarding a new PT vehicle that determines whether this activity is a transfer or not, is set according to network and data-specific conditions. These conditions are described in the following section in which the case study and utilized data are described.

Clustering Method

The clustering approach implemented in this study is the hierarchical agglomerative clustering (HAC) technique, which is used in several PT applications (e.g., in Cats et al. [29]). The analysis starts by considering that each point within the dataset is an individual cluster (i.e., it is a bottom-up approach) and a similarity (or distance) matrix is calculated. An iterative process of clustering the two closest data points and updating the similarity matrix is implemented until there is only one cluster left. The similarity between two clusters can be compared with different linkage methods. In ( 34 ) a comparison of different methods is performed and Ward’s method ( 35 ) seems to be the one performing best in most situations examined. Therefore, this method is adopted in this study. There are several ways of determining the optimal number of clusters. The method used here is to consider the silhouette score (SS) presented in Rousseeuw ( 36 ). It is a commonly used validation metric of the consistency of data within a cluster. The score values range from −1 to 1, with score 1 indicating high similarity of points within a cluster and high difference with points outside the cluster.

The PT stops per mode are first clustered according to their connectivity index considering a meaningful maximum number of clusters that will support the explainability of results (e.g., from two to ten). For each clustering, the SS is recorded. Similarly, the PT stops are then clustered according to the transfers that are performed at them, and the respective SSs per clustering are recorded. The number of clusters, $k$ , that has the greatest combination of SSs for the connectivity and the transfers clustering approaches is the one that is considered for the rest of the proposed method.

Evaluation Framework

For each mode, $m \in M$ , the comparison between supply and demand clusters is achieved through constructing a type of matching matrix with dimensions $k$ × $k$ and considering the connectivity and transfer clusters in which the stops belong. The first cluster is the one associated with the lowest and the last cluster (i.e., cluster $k$ ) with the greatest values of supply or demand. In such a matrix, the sum of all cell values should equal the total number of stops per mode, $N_{m}$ . A sample of matching matrix with $N_{m} = \sum_{j = 1}^{k} \sum_{i = 1}^{k} N_{m}^{i, j}$ stops for mode $m$ is given in Figure 1. The values in diagonal cells ( $N_{m}^{i, j}$ with $m \in [1, M]$ , $i \in [1, k]$ , $j \in [1, k]$ and $i = j$ ) present the number of stops of mode $m$ that belong to the same cluster for both supply and demand. Therefore, they could be eliminated during decision-making concerning where to implement improvements in the supply performance of the network. The percentage of stops that belong to the case of equal connectivity and transfers for a mode, $EC T_{m}$ , and for the overall multi-modal network, $EC T_{o}$ , can be derived as follows:

EC T_{m} = \frac{\sum_{i = 1}^{k} N_{m}^{i, i}}{N_{m}}

(5)

EC T_{o} = \frac{\sum_{m = 1}^{M} \sum_{i = 1}^{k} N_{m}^{i, i}}{\sum_{m = 1}^{M} N_{m}}

(6)

Figure 1.

Sample of matching matrix (a) with number of stops per cell and (b) with evaluation of areas. (Color online only).

The cells in Figure 1 that are above the diagonal (also illustrated in red shade color) include the numbers of stops for which the connectivity is in lower cluster than the transfers. These stops require further attention from PT authorities, to ensure that the PT user experience is efficient. The percentage of stops that belong to the case of lower connectivity/higher transfers for a mode, $LCH T_{m}$ , and for the overall multi-modal network, $LCH T_{o}$ , can be derived as follows:

LCH T_{m} = \frac{\sum_{i = 1}^{k} \sum_{j = 1 | j > i}^{k} N_{m}^{i, j}}{N_{m}}

(7)

LCH T_{o} = \frac{\sum_{m = 1}^{k} \sum_{i = 1}^{k} \sum_{j = 1 | j > i}^{k} N_{m}^{i, j}}{\sum_{m = 1}^{M} N_{m}}

(8)

In general, the closer the cells to the diagonal, the better the performance of the included stops relating to alignment between connectivity and transfers. Among the cells that are above the diagonal, the cell with $N_{m}^{1, k}$ stops (darker red color in Figure 1) is the extreme case in which stops that are clustered in the lowest connectivity cluster are also clustered within the highest transfers cluster. These stops require further attention and a proper investigation of whether improvement actions are needed on behalf of the operators. The percentage of stops that belong to this extreme case of the lowest connectivity and the highest transfers for a mode, $LCHT e_{m}$ , and for the overall multi-modal network, $LCHT e_{o}$ , can be derived as follows:

LCHT e_{m} = \frac{N_{m}^{1, k}}{N_{m}}

(9)

LCHT e_{o} = \frac{\sum_{m = 1}^{M} N_{m}^{1, k}}{\sum_{m = 1}^{M} N_{m}}

(10)

In addition to the stops above the diagonal, the stops included in cells below the diagonal (illustrated with green shade color in Figure 1) also indicate cases in which the connectivity and the transfers are not perfectly aligned. In this case, the stops are included in a higher cluster of connectivity compared with the cluster of transfers. Therefore, PT operators should investigate whether they should decrease their efforts to decrease their operational costs. For example, the cost savings could be re-allocated to the stops above the diagonal. In any case, the operators should also take into account the marginal effects of such decisions on the PT users experience. Every action within the network might affect the overall structure of the matching matrix and iterative process of re-structuring it after every intervention might be needed. It is noted that the ideal scenario that results from this method is a matching matrix in which all stops belong to the diagonal. The percentage of stops that belong to the case of higher connectivity/lower transfers for a mode, $HCL T_{m}$ , and for the overall multi-modal network, $HCL T_{o}$ , can be derived as follows:

HCL T_{m} = \frac{\sum_{i = 1}^{k} \sum_{j = 1 | j < i}^{k} N_{m}^{i, j}}{N_{m}}

(11)

HCL T_{o} = \frac{\sum_{m = 1}^{M} \sum_{i = 1}^{k} \sum_{j = 1 | j < i}^{k} N_{m}^{i, j}}{\sum_{m = 1}^{M} N_{m}}

(12)

The percentage of stops that belong to the extreme cell of the highest cluster of connectivity and the lowest cluster of transfers (highlighted with darker green color in Figure 1) for a mode, $HCLT e_{m}$ , and for the overall multi-modal network, $HCLT e_{o}$ , can be derived as follows:

HCLT e_{m} = \frac{N_{m}^{k, 1}}{N_{m}}

(13)

HCLT e_{o} = \frac{\sum_{m = 1}^{M} N_{m}^{k, 1}}{\sum_{m = 1}^{M} N_{m}}

(14)

A summary of notations used in this section is given in Table 1, with notations listed in the order of appearance in the paper. The overall proposed methodology for each mode’s evaluation is summarized in Figure 2. As presented above, the current study focuses on evaluating each mode separately but the proposed method also allows deriving conclusions for the overall multi-modal network performance. The evaluation consists of determining percentages of stops that belong to each one of the cases presented above (i.e., diagonal, above diagonal, below diagonal, and extreme). Such metrics can offer insights to guide decision-makers through implementing changes in the network. The proposed method also allows us to identify the specific stops that require further attention and therefore reduce the efforts of decision-makers when it comes to choosing where (either stop or mode level) to implement improvements.

Table 1.

Summary of Notations

Notation	Description
$θ_{n}$	Transit connectivity index of stop $n$ (unitless)
$P_{r, n}^{t}$	Connecting power of route $r$ at stop $n$ (unitless)
$μ_{r, n}$	Binary variable (1: route $r$ contributes to the connectivity at node $n$ ; 0: otherwise)
$P_{r, n}^{i}$	Inbound connecting power of route $r$ at stop $n$ (unitless)
$P_{r, n}^{o}$	Outbound connecting power of route $r$ at stop $n$ (unitless)
$C_{r}$	Passenger capacity of route $r$ (pax)
$V_{r}$	Speed of route $r$ (km/h)
$D_{r, n}^{i}$	Route distance of stop $n$ from the route origin (km)
$D_{r, n}^{o}$	Route distance of stop $n$ from the route destination (km)
$α$	Scaling factor coefficient for capacity (1/pax)
$β$	Scaling factor coefficient for speed (h/km)
$γ$	Scaling factor coefficient for distance (1/km)
$N_{m}$	Number of stops of mode $m$
$N_{m}^{i, j}$	Number of stops of mode $m$ in connectivity cluster $i$ and transfers cluster $j$
$EC T_{m}$	Percentage of stops of mode $m$ that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i = j$
$EC T_{o}$	Percentage of stops of the overall network that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i = j$
$LCH T_{m}$	Percentage of stops of mode $m$ that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i < j$
$LCH T_{o}$	Percentage of stops of the overall network that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i < j$
$LCHT e_{m}$	Percentage of stops of mode $m$ that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i = 1$ and $j = k$ (where $k$ is the number of clusters)
$LCHT e_{o}$	Percentage of stops of the overall network that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i = 1$ and $j = k$ (where $k$ is the number of clusters)
$HCL T_{m}$	Percentage of stops of mode $m$ that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i > j$
$HCL T_{o}$	Percentage of stops of the overall network that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i > j$
$HCLT e_{m}$	Percentage of stops of mode $m$ that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i = k$ and $j = 1$ (where $k$ is the number of clusters)
$HCLT e_{o}$	Percentage of stops of the overall network that belong to connectivity cluster $i$ and transfers cluster $j$ , with $i = k$ and $j = 1$ (where $k$ is the number of clusters)

Figure 2.

Flow chart of proposed methodology.

Study Area and Data

Helsinki is a metropolitan area that covers $770 k m^{2}$ with a population of approximately 1.3 million inhabitants. It is characterized by a very diverse mobility ecosystem, that includes fixed PT (i.e., metro, tram, train, bus, and ferry), micro-mobility (e.g., shared e-scooters and shared bicycles), and ride-hailing services (e.g., UBER). Flexible services have operated in the past, as for example the Kutsuplus service that was a novel flexible micro-transit service that was operated by the local PT operator from 2012 to 2015. A map of the study area including the current fixed-route PT network is presented in Figure 3. The operational characteristics of the PT network can be quantified through data available in the GTFS dataset. The demand characteristics associated with the PT network can be quantified through an app-based dataset (“TravelSense”), which is owned by the local PT operator.

Figure 3.

Map of the Helsinki public transport (PT) network.

GTFS Data

GTFS is a data specification that allows PT operators to publish data that could be further used for different applications. The GTFS dataset is divided into scheduled component (e.g., schedule and fare information) and real-time component (e.g., arrival predictions and vehicle locations). The focus of this study is on static PT information, with the respective files including information about stops, routes, trips, and fares, among others. The goal of this study is to utilize GTFS data to derive at a stop level:

number of stops: readily available

location of stops: readily available

routes per stop: readily available.

At a route level, the required information obtained from GTFS data refers to:

number of routes: readily available

number of daily trips per route: readily available

shape distances (km) between stops per route: readily available

duration (h) per route: calculated as the difference between the timestamps of vehicle’s dispatch at origin stop and vehicle’s arrival at the terminal stop

length (km) per route: calculated as the sum of shape distances (km) between stop locations within the route

speed (km/h) per route: calculated as the route length (km) over route duration (h)

type of mode per route: readily available.

An additional piece of information needed in this study is the vehicle capacity (pax/veh) per mode, which can be easily identified though the operators’ websites, among other sources (e.g., reports). The required processing for obtaining the above information from GTFS data is fairly fast and straightforward, since most of it is already recorded in the dataset while the rest can be easily calculated.

TravelSense Data

HSL provides PT users with a mobile application which allows them to buy tickets (i.e., single ticket, day ticket and season ticket), as well as to find the best route for their trip and receive information about the PT operation (e.g., timetables, delays). HSL has incorporated in this application the option to record trip trajectories for users who consent and thus detect whether the user is still, walking, cycling, or on board a vehicle, either PT or private. Exact coordinates of locations outside the PT network are not recorded, but are resolved up to grid cells of dimension $250 m \times 250 m$ using GPS data. The required physical sources for the data collection within PT network includes stationary Bluetooth beacons at PT stops, moving Bluetooth beacons in PT vehicles and portable devices (i.e., users’ mobile phones). Each user’s mobile phone is assigned a random ID which is updated every day that the user shares data. Anonymity is preserved to avoid identification of individuals.

The information included in the TravelSense dataset is structured based on “legs” and “trip chains.” A “leg” is a discrete stage within a journey recognized by the data collection system and the pre-processing. The reasons for such recognition could be a pause in the movement or a change in recognized activity. A “trip chain” is a series of legs that have been recognized by the pre-processing as being part of a single journey. A trip chain is ended when the system detects prolonged periods in the same location and no changes in activity. The raw information included in the TravelSense dataset for PT journeys includes:

start and end timestamps of legs

start and end PT stop IDs and coordinates for each leg

PT mode used at each leg

PT route used including direction for each leg.

For journeys outside the PT network, the raw information includes:

start and end timestamps of legs rounded to nearest quarter-hour for privacy purposes

grid cells associated to each leg

type of movement including walking, cycling, or vehicle.

Unlike commonly used data sources for identifying mobility patterns (e.g., smartcard data), the TravelSense dataset offers enough details to illustrate the full trajectory of a door-to-door journey (Figure 4). A full trajectory allows us to detect and quantify transfers at a PT stop level. For example, in Figure 4 the PT user boards the bus (blue vehicle) and after alighting walks to access the respective PT stop to board the metro (orange vehicle). In this study, the analysis considers both parts of a transfer activity. More specifically, the term “transfers alighting” at a PT stop refers to the number of PT users that alight at a certain PT stop to board another PT vehicle, either at the same or at another PT stop of the same or different mode. The term “transfers boarding” refers to the number of PT users that board a PT vehicle after alighting another PT vehicle either at the same or at another PT stop of the same or different mode. The detection of transfers is constrained within an amount of time equal to 80 min, which is the validity time for a ticket. Therefore, if a PT user alights a vehicle and does not board another one within 80 min, then it is assumed that there is no transfer activity. For example, it could be shopping time.

Figure 4.

Example of a trajectory derived from TravelSense data and comparison with other data sources. (Color online only).

The TravelSense dataset depends on a complex system of data collection and requires a careful pre-processing for deriving the required outputs directly from the raw data. Details about the infrastructure required for obtaining the data included in TravelSense, the necessary assumptions required for cleaning the raw data and the process for deriving the needed transfer related results are described in Huang et al. ( 33 ).

Results

GTFS and TravelSense data are obtained, processed, and analyzed to evaluate each PT mode that operates within the multi-modal PT network of Helsinki, following the methodology proposed in this study. The results from each step are presented as follows.

Data Analysis

GTFS Data Analysis

GTFS data were analyzed to derive the components of stop connectivity index: passenger capacity (pax), speed per route (km/h), and length of route (km). The analysis here considers the operation of regular weekdays of April 2022 and identifies the stops and routes that operated during regular weekdays as well as the details of their operation. The analysis here considers the 7,852 stops that belong to the HSL area (i.e., zones A, B, C, and D) and are common among all weekdays. The identified number of routes is 513 for a regular Friday and 509 for the other weekdays.

The four main PT modes that operate in Helsinki are bus, tram, metro, and rail services. There are also four ferry stops, which are not considered here. Bus stops are dominating the modal split with 94.56% of stops being bus stops, 3.27% tram stops, 1.53% rail stops, and 0.64% metro stops. Considering vehicle capacity (pax/veh), the average values used in this study are 97, 180, 600, 700, for bus, tram, rail, metro, respectively, utilizing values from HSL’s official website ( 37 ). Bus mode includes different types of services, such as express, regular, regional, and so forth. With reference to stops per zone, 12.29% of stops belong to zone A, 38.20% to zone B, 26.62% to zone C, and 22.80% to zone D.

The connectivity index was calculated for each stop and each day, and then an average daily connectivity per stop was derived to perform a comparison between the average connectivity per stop and the transfers that were performed at these stops in overall during the 22 weekdays of April 2022, as explained in the following section. For the purposes of this study, the parameters $α$ , $β$ , and $γ$ of Equations 3 and 4 can be calculated either per mode or for the entire multi-modal PT network without affecting the clustering and therefore the desired results. The results of connectivity index per stop are summarized in Figure 5, considering the parameters $α = 0.00028 (1 / pax)$ , $β = 0.031 (h / km)$ , and $γ = 0.054 (1 / km)$ , respectively, that are calculated based on the entire PT network and are found to be the same for every day studied here. As shown in Figure 5, highly connected stops are mostly met in the center of Helsinki (also shown separately in zoom-in box within the figure). For the days studied here, the average connectivity index for bus stops is 1.05, for tram stops is 1.06, for rail stops is 32.95, and metro stops is 46.27. As expected, rail and metro stops are associated with considerably higher connectivity index compared with bus and tram. Figure 6 summarizes the histograms of stop connectivity index per mode.

Figure 5.

Map of under study area with locations of public transport (PT) stops and colorbar indicating their connectivity index.

Figure 6.

Histogram of connectivity index per stop for (a) bus, (b) tram, (c) rail, and (d) metro.

TravelSense Data Analysis

Demand data are analyzed for the 22 regular weekdays of April 2022. For each day, the number of transfers per stop is quantified and then the total number of transfers per stop is derived for the study period. Transfers are aggregated because the number of PT users who are also mobile ticketing app users who have accepted the tracking of their trajectories was low and did not allow a proper analysis at a more disaggregated level. The issue with these low numbers of TravelSense records is discussed in Huang et al. ( 33 ), in which the authors showed that the TravelSense data can be considered representative. It is noted that the aggregation might lead to double counting some daily repeated travel patterns. However, owing to the anonymity of data, it is not possible to directly know which trips might correspond to the same PT user among the studied days and thus it is not possible to know with certainty which trips are repeated.

The spatial distribution of demand and transfers is presented on the map of the study area in Figure 7. Stops near the center of Helsinki are the ones with the greatest demand and transfers, but there are also stops in the suburban areas that are equally highly used by PT users. In overall, for the study period the TravelSense dataset recorded 2.3 million boardings and alightings with 8.17% of them corresponding to transfer activities. The histograms of transfer activities per stop for each one of the four modes studied here is shown in Figure 8.

Figure 7.

Map of under study area with locations of public transport (PT) stops and colorbar indicating the number of PT users (a) accessing a PT stop, (b) egressing a PT stop, (c) alighting at a stop during transfer, and (d) boarding a PT stop during a transfer. (Color online only).

Figure 8.

Histogram with number of transfer activities per stop for (a) bus, (b) tram, (c) rail, and (d) metro.

Clustering Analysis

After connectivity and transfers per stop are quantified, HAC is used for clustering stops of each mode according to their connectivity and transfers. To identify the greatest combination of SSs for clustering based on connectivity and transfers, we used a maximum number of clusters equal to 20. Eventually, metro stops are clustered in two clusters, bus and rail stops in three clusters, and tram stops in four clusters. The results of the clustering procedure for connectivity index per stop are shown in Figure 9. The results of the clustering procedure for transfers per stop are shown in Figure 10. In these figures, the stops per mode are ordered from low to high value according to their connectivity or transfers, with different colors indicating the cluster in which they belong. The resulting mean values per cluster and the SS per cluster are also included in the figures. In this study, “cluster 1” refers to low connectivity and the greater the number of cluster the greater the connectivity. The same holds for transfers clusters.

Figure 9.

Clustering results for connectivity per stop of (a) bus, (b) tram, (c) rail, and (d) metro.

Figure 10.

Clustering results for transfers per stop of (a) bus, (b) tram, (c) rail, and (d) metro.

Evaluation

The matching matrices for each one of the four modes of the Helsinki network are shown in Figure 11. This figure also presents the SS for both the clustering based on connectivity and the clustering based on transfers. As shown, it is always positive and close to one, for all clustering processes implemented here, indicating a high accuracy of results. It is noticeable that in all four modes there are more stops in the area of high connectivity and low transfers (i.e., green color shaded area in Figure 11) compared with the area of low connectivity and high transfers (i.e., red color shaded area in Figure 11). In addition, the extreme case of low connectivity and high transfers includes zero number of stops for tram, rail, and metro. The eight bus stops that belong to the extreme case of low connectivity and high transfers (Figure 11a) require further investigation from the operator to ensure that the user experience is efficient at these stops.

Figure 11.

Matching matrix for (a) bus, (b) tram, (c) rail, and (d) metro mode. (Color online only).

Table 2 summarizes the results of Figure 11 with reference to percentage of stops that belong to the cases of equal connectivity and transfers (ECT), low connectivity-high transfers (LCHT), and high connectivity/low transfers (HCLT). The percentages of stops per mode belonging to the extreme cases of low connectivity/high transfers (LCHTe) and high connectivity/low transfers (HCLTe) are also included in the table. As shown, there is a high percentage of stops that belong to the equivalent cluster of supply and demand for all modes (i.e., more than 50% for all modes). Metro mode is shown in this table to have a high percentage of stops belonging to the case of high connectivity and low transfers, compared with the respective percentages of other modes. It is noted that for metro stops the LCHT and LCHTe values are equal because the stops are clustered in two clusters. The same holds for HCLT and HCLTe.

Table 2.

Evaluation of Stops’ Performance Per Mode

Mode	ECT (%)	LCHT (%)	LCHTe (%)	HCLT (%)	HCLTe (%)
Bus	76.27	4.36	0.11	19.37	0.40
Tram	69.65	11.67	0.00	18.68	0.00
Rail	52.50	10.83	0.00	36.67	2.50
Metro	62.00	0.00	0.00	38.00	38.00
Overall	75.60	4.67	0.10	19.73	0.66

Note: ECT = equal connectivity and transfers; LCHT = low connectivity-high transfers; HCLT = high connectivity-low transfers; LCHTe = low connectivity-high transfers (extreme); HCLTe = high connectivity-low transfers (extreme).

Table 2 also includes the evaluation of the overall multi-modal network based on each mode’s evaluation. Considering the dominating role of bus stops within Helsinki’s multi-modal network (i.e., 94.56% of stops are bus stops), it is noted that the overall multi-modal network’s evaluation is similar to that of the bus mode. Therefore, it has a high percentage of stops within ECT (i.e., more than 75% of Helsinki stops present equivalent connectivity and transfers), and more stops belonging to the area of high connectivity and low transfers compared with the case of low connectivity and high transfers.

Discussion and Conclusions

Summary of Findings and Discussion

This study focuses on the evaluation of multi-modal PT networks’ performance and proposes a clustering-based methodology to compare supply and demand using traditional and emerging data sources. The implementation of the proposed methodology led to the Helsinki’s PT stops being clustered according to their connectivity and transfers. The constructed matching matrices showed that the majority of stops presents a good alignment between supply and transfers (i.e., more than 50% of stops for each mode and for the overall multi-modal network). These stops can be thus eliminated from the investigation of planners as regards where to perform improvements, reducing significantly their planning efforts. Considering stops in which supply and demand are not equivalent, it is shown that they mostly refer to cases of stops belonging to higher clusters of connectivity when compared with the transfers clusters in which they belong. This observation ensures the good quality of offered services, but it is up to the operator to decide whether they would like to reduce their efforts in some stops to properly allocate funding to other stops that require more attention (i.e., to the stops of low connectivity and high transfers). The high performance shown by this analysis is well aligned with a recent survey that has ranked Helsinki’s PT services as the second best among European urban regions with a percentage of user satisfaction equal to 76% ( 38 ).

Implications for Practice

This study required the availability of sufficient information to quantify supply and demand. Quantifying supply was achieved through utilizing an existing connectivity index suitable for PT networks. Acknowledging the importance of proposing a methodology that can be easily replicated, the authors utilized openly available GTFS data, as suggested also by existing literature ( 39 ). However, quantifying transfers is a more challenging task which can be achieved either through fusing traditional data sources or through using emerging sources in this field. The former case is more complex computationally, while the latter requires the availability of such data source. Mobile phone-based demand data sources, like the one utilized here, are promising for quantifying PT demand ( 28 ). Therefore, it is expected that they will become more common in advanced PT networks in the near future, allowing the replication of the proposed method in more case studies. Despite the uncertainties of what percentage of demand can be captured by such a dataset, the TravelSense data were shown to be efficient in revealing the relationship between supply and demand within the Helsinki network as indicated by the high ECT of most modes, thus enhancing the findings of Huang ( 33 ). In that study, the authors found that this dataset’s magnitudes of demand and transfers per stop are much lower than the ones deriving from alternative sources; however, their relative magnitude is considered representative. In a study like the one performed here, the importance is on identifying a dataset that can reveal the relationship among all stops of a network as regards transfers (and not necessarily the actual magnitudes per stop), thus allowing them to be clustered properly.

As regards the evaluation of the proposed methodology’s results, the case of high connectivity and low transfers can be evaluated in different ways. One approach is that the demand data and their filtering processes did not allow the proper representation of transfers at these stops. Another approach is that the effects of COVID-19 on demand were still present in the case study during April 2022. If data are assumed to be fully representative, then this case means that the operators have allocated more efforts than they should on these stops (e.g., as regards budget allocation). Therefore, the operations at these stops should be re-planned to account for the actual levels of transfers that take place at them. For example, that could happen through re-allocating funding from these stops to others that require support. Reducing the supply at a stop, however, is a decision that should be carefully taken, while also accounting for the marginal effects of the respective actions on the entire PT network. A simple example here is that reducing the efforts at a stop might lead PT users to use other stops, therefore increasing their demand and therefore creating new problematic stops in the network. Such scenarios can be investigated by using iteratively the proposed methodology, aiming at the highest possible percentage of stops that belong to equivalent clusters of supply and demand in tandem with the least possible number of problematic stops.

Considering the supply indicator for PT evaluation, an existing transit connectivity index was utilized here, including components of passenger capacity, speed, and length of routes. Therefore, the actions of the operator for adjusting supply to demand levels at a stop should be related to these components, either directly or indirectly. If a network is associated with different needs that cannot be reflected through this indicator, then the proper one should be incorporated in the proposed methodology. The set of actions that a PT operator can take with reference to planning PT services according to demand can vary from less to more intrusive approaches associated with different levels of labor and budget requirements. This study aims at identifying points of interest that require further attention by the PT operators to reduce the set of candidate points that they have to investigate while planning improvements. The following step includes personal judgment on behalf of the operators and/or additional methods for the identification of specific actions. This step lies beyond the scope of this study.

Future Directions

There are several ways in which this study could be extended in the future. A future study refers to identifying the effect of specific changes in the PT network. It is noted that the Helsinki PT network is constantly going through changes aiming at improving the user experience. In recent years, the Helsinki metro was extended, including the transformation of a direct bus network into a metro system with feeder buses ( 40 ). The improvements of metro services continue up to date, with an additional line extension performed during 2023. It is noteworthy that the Helsinki tram network is currently under improvements that started already during 2021 and will continue until 2035 ( 41 ). The evaluation framework proposed in this study could be used for evaluating the effect of a change in the PT network, considering the performance of the network before and after a certain change.

A different supply indicator could be selected (e.g., another connectivity related index), depending on the goals for which the PT operators perform the evaluation. The analysis performed here is static, referring to the PT operations on a daily basis and focusing only on regular weekdays. Future studies could account for the dynamic changes of operational characteristics at smaller time periods within a day. Special days and weekends could also be part of the analysis. Considering data, this study considered the TravelSense demand data which were collected during a time period that could be affected by COVID-19 pandemic and during the early stages of introducing the trajectory tracking option in the ticketing app. An interesting future direction refers to comparing the evaluation of PT performance after the impact of COVID-19 has faded and the TravelSense data have achieved a better penetration among PT users. Finally, this study focused on the evaluation of a multi-modal network as regards alignment between stop connectivity and transfers. Future studies could focus on proposing specific planning actions for improving the performance of these stops and ensuring high quality of services and high user satisfaction.

Footnotes

Acknowledgements

The authors thank HSL for access to the TravelSense dataset, and for their time and discussions about this study. Calculations were performed using computer resources within the Aalto University School of Science “Science-IT’’ project.

Author Contributions

The authors confirm contribution to the paper as follows: study conception and design: C. Sipetas, Z. Huang, A. Espinosa Mireles de Villafranca; data collection: C. Sipetas, A. Espinosa Mireles de Villafranca; analysis and interpretation of results: C. Sipetas, Z. Huang; draft manuscript preparation: C. Sipetas. All authors reviewed the results and approved the final version of the manuscript.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The work of C. Sipetas was supported by the FinEst Twins Center of Excellence (H2020 Grant 856602). Z. Huang is supported by the NetResilience consortium funded by the Strategic Research Council at the Academy of Finland (grant numbers 345188 and 345183) and Guangdong Science and Technology Strategic Innovation Fund (the Guangdong-Hong Kong-Macau Joint Laboratory Program), Project No.: 2020B1212030009. The work of A. Espinosa Mireles de Villafranca was supported by the Academy of Finland.

ORCID iDs

Charalampos Sipetas

Zhiren Huang

Alonso Espinosa Mireles de Villafranca

Data Accessibility Statement

Data not available.

References

Shafahi

Khani

A Practical Model for Transfer Optimization in a Transit Network: Model Formulations and Solutions. Transportation Research Part A: Policy and Practice, Vol. 44, No. 6, 2010, pp. 377–389.

Asgarpour

Hartmann

Gkiotsalitis

Infrastructure Investment Planning Through Scenario-Based System-of-Systems Modelling. Transportation Planning and Technology, Vol. 46, No. 5, 2023, pp. 527–572.

Eboli

Mazzulla

A Methodology for Evaluating Transit Service Quality Based on Subjective and Objective Measures From the Passenger’s Point of View. Transport Policy, Vol. 18, No. 1, 2011, pp. 172–181.

Mavi

R. K.

Zarbakhshnia

Khazraei

Bus Rapid Transit (BRT): A Simulation and Multi Criteria Decision Making (MCDM) Approach. Transport Policy, Vol. 72, 2018, pp. 187–197.

Walteros

J. L.

Medaglia

A. L.

Riaño

. Hybrid Algorithm for Route Design on Bus Rapid Transit Systems. Transportation Science, Vol. 49, No. 1, 2015, pp. 66–84.

Godavarthi

G. R.

Chalumuri

R. S.

Velmurugun

Measuring the Performance of Bus Rapid-Transit Corridors Based on Volume by Capacity Ratio. Journal of Transportation Engineering, Vol. 140, No. 10, 2014, p. 04014049.

Carroll

M. A.

Yamamoto

E. C.

Level of Service Concepts in Multimodal Environments. In (A. Pande, and B. Wolshon, eds.), Traffic Engineering Handbook: Institute of Transportation Engineers 7th ed., John Wiley & Sons, Hoboken, NJ, 2015, pp. 149–176.

Litman

Toward More Comprehensive and Multi-Modal Transport Evaluation. Victoria Transport Policy Institute, British Columbia, Canada, 2013.

Rodríguez González

A. B.

Vinagre Díaz

J. J.

Wilby

M. R.

Fernández Pozo.

Data-Driven Performance Evaluation Framework for Multi-Modal Public Transport Systems. Sensors, Vol. 22, No. 1, 2022, p. 17.

10.

Kumar

P. P.

Parida

Swami

Performance Evaluation of Multimodal Transportation Systems. Procedia-Social and Behavioral Sciences, Vol. 104, 2013, pp. 795–804.

11.

Freeman

L. C.

A Set of Measures of Centrality Based on Betweenness. Sociometry, Vol. 40, No. 1, 1977, pp. 35–41.

12.

Freeman

L. C.

Centrality in Social Networks Conceptual Clarification. Social Networks, Vol. 1, No. 3, 1978, pp. 215–239.

13.

Bonacich

Lloyd

Eigenvector-Like Measures of Centrality for Asymmetric Relations. Social Networks, Vol. 23, No. 3, 2001, pp. 191–201.

14.

Park

Kang

A Model for Evaluating the Connectivity of Multimodal Transit Networks. Presented at 90th Annual Meeting of the Transportation Research Board, Washington, D.C., 2011.

15.

Mishra

Welch

T. F.

Jha

M. K.

Performance Indicators for Public Transit Connectivity in Multi-Modal Transportation Networks. Transportation Research Part A: Policy and Practice, Vol. 46, No. 7, 2012, pp. 1066–1085.

16.

Mishra

Welch

T. F.

Torrens

P. M.

Zhu

Knaap

A Tool for Measuring and Visualizing Connectivity of Transit Stop, Route and Transfer Center in a Multimodal Transportation Network. Public Transport, Vol. 7, No. 1, 2015, pp. 77–99.

17.

Welch

T. F.

Mishra

A Measure of Equity for Public Transit Connectivity. Journal of Transport Geography, Vol. 33, 2013, pp. 29–41.

18.

Kujala

Weckström

Mladenović

M. N.

Saramäki

Travel Times and Transfers in Public Transport: Comprehensive Accessibility Analysis Based on Pareto-Optimal Journeys. Computers, Environment and Urban Systems, Vol. 67, 2018, pp. 41–54.

19.

Liu

Cats

Gkiotsalitis

A Review of Public Transport Transfer Coordination at the Tactical Planning Phase. Transportation Research Part C: Emerging Technologies, Vol. 133, 2021, p. 103450.

20.

Gkiotsalitis

Cats

Liu

A Review of Public Transport Transfer Synchronisation at the Real-Time Control Phase. Transport Reviews, Vol. 43, No. 1, 2023, pp. 88–107.

21.

De Witte

Macharis

Mairesse

How Persuasive is ‘Free’ Public Transport?: A Survey Among Commuters in the Brussels Capital Region. Transport Policy, Vol. 15, No. 4, 2008, pp. 216–224.

22.

Zhou

Wang

Zhao

Passenger Flow Forecasting in Metro Transfer Station Based on the Combination of Singular Spectrum Analysis and AdaBoost-Weighted Extreme Learning Machine. Sensors, Vol. 20, No. 12, 2020, p. 3555.

23.

Weckström

Mladenović

M. N.

Kujala

Saramäki.

Navigability Assessment of Large-Scale Redesigns in Nine Public Transport Networks: Open Timetable Data Approach. Transportation Research Part A: Policy and Practice, Vol. 147, 2021, pp. 212–229.

24.

Nassir

Hickman

Z.-L.

Activity Detection and Transfer Identification for Public Transit Fare Card Data. Transportation, Vol. 42, No. 4, 2015, pp. 683–705.

25.

Wang

Zhang

ViFi-MobiScanner: Observe Human Mobility Via Vehicular Internet Service. IEEE Transactions on Intelligent Transportation Systems, Vol. 22, No. 1, 2019, pp. 280–292.

26.

Dunlap

Henrickson

Wang

Estimation of Origin and Destination Information From Bluetooth and Wi-Fi Sensing for Transit. Transportation Research Record: Journal of the Transportation Research Board, 2016. 2595: 11–17.

27.

Divall

Kureya

Bishop

Barber

Green

Clark

The Potential Role of Mobile Phone Technology in Rural Motorcycle and Three-Wheeler Taxi Services in Africa. Transportation Planning and Technology, Vol. 44, No. 1, 2021, pp. 30–44.

28.

Rinne

Bagheri

Tolvanen

Hollmén.

Automatic Recognition of Public Transport Trips From Mobile Device Sensor Data and Transport Infrastructure Information. In (R.Guidotti, A.Monreale, D.Pedreschi, S.Abiteboul, eds), International Workshop on Personal Analytics and Privacy, Springer, Cham, Switzerland, 2017, pp. 76–97.

29.

Cats

Wang

Zhao

Identification and Classification of Public Transport Activity Centres in Stockholm Using Passenger Flows Data. Journal of Transport Geography, Vol. 48, 2015, pp. 10–22.

30.

Luo

Cats

van Lint

Constructing Transit Origin-Destination Matrices With Spatial Clustering. Transportation Research Record: Journal of the Transportation Research Board, 2017. 2652: 39–49.

31.

Yap

Luo

Cats

van Oort

Hoogendoorn

Where Shall We Sync? Clustering Passenger Flows to Identify Urban Public Transport Hubs and Their Key Synchronization Priorities. Transportation Research Part C: Emerging Technologies, Vol. 98, 2019, pp. 433–448.

32.

Agard

Trépanier.

A Classification of Public Transit Users With Smart Card Data Based on Time Series Distance Metrics and a Hierarchical Clustering Method. Transportmetrica A: Transport Science, Vol. 16, No. 1, 2020, pp. 56–75.

33.

Huang

de Villafranca

A. E. M.

Sipetas

Sensing Multi-Modal Mobility Patterns: A Case Study of Helsinki Using Bluetooth Beacons and a Mobile Application. 2022 IEEE International Conference on Big Data (Big Data), IEEE, Osaka, Japan, 2022, pp. 2007–2016.

34.

Ferreira

Hitchcock

D. B.

A Comparison of Hierarchical Methods for Clustering Functional Data. Communications in Statistics-Simulation and Computation, Vol. 38, No. 9, 2009, pp. 1925–1949.

35.

Ward

J. H.

Jr.

Hierarchical Grouping to Optimize an Objective Function. Journal of the American Statistical Association, Vol. 58, No. 301, 1963, pp. 236–244.

36.

Rousseeuw

P. J.

Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. Journal of Computational and Applied Mathematics, Vol. 20, 1987, pp. 53–65.

37.

HSL Website. Traficom Decides to Limit Public Transport Passenger Numbers – Effects on HSL Transport Services. 2021. https://www.hsl.fi/en/hsl/news/news/2021/03/traficom-decides-to-limit-public-transport-passenger-numbers-in–effects-on-hsl-transport-services. Accessed October 31, 2022.

38.

HSL Website. According to the 2019 BEST Survey, satisfaction with public transport in the HSL area remained high. 2020. https://www.hsl.fi/en/hsl/news/news/2020/03/according-to-the-2019-best-survey-satisfaction-with-public-transport-in-the-hsl-area-remained-high. Accessed December 12, 2022.

39.

Hadas

Assessing Public Transport Systems Connectivity Based on Google Transit Data. Journal of Transport Geography, Vol. 33, 2013, pp. 105–116.

40.

Weckström

Kujala

Mladenović

M. N.

Saramäki.

Assessment of Large-Scale Transitions in Public Transport Networks Using Open Timetable Data: Case of Helsinki Metro Extension. Journal of Transport Geography, Vol. 79, 2019, p. 102470.

41.

HSL Website. Transport Service Plan 2022–2023 open for comments until 12 December. 2022. https://www.hsl.fi/en/hsl/participate-and-have-your-say/transport-service-plan-2022-2023-open-for-comments-until-12-december. Accessed December 9, 2022.