MetRoBERTa: Leveraging Traditional Customer Relationship Management Data to Develop a Transit-Topic-Aware Language Model

Abstract

Transit riders’ feedback, provided through ridership surveys, customer relationship management (CRM) channels, and, more recently, social media, is key for transit agencies to gauge the efficacy of their services and initiatives. Getting a holistic understanding of riders’ experience from the feedback shared through those instruments is often challenging, mostly because of the open-ended, unstructured nature of text feedback. In this paper, we propose leveraging traditional transit CRM feedback to develop and deploy a transit-topic-aware large language model (LLM) capable of classifying open-ended text feedback into relevant transit-specific topics. First, we utilize semi-supervised learning to engineer a training dataset of 11 broad transit topics detected in a corpus of 6 years of customer feedback provided to the Washington Metropolitan Area Transit Authority. We then use this dataset to train and thoroughly evaluate a language model based on the RoBERTa architecture. We compare our LLM, MetRoBERTa, with classical machine learning approaches utilizing keyword-based and lexicon representations. Our model outperforms those methods across all evaluation metrics, providing an average topic classification accuracy of 90%. Finally, we illustrate the value of this work by demonstrating how the language model, alongside additional text processing tools, can be applied to add structure to open-ended text sources of feedback such as Twitter. The framework and results we present provide a pathway for an automated, generalizable approach for ingesting, visualizing, and reporting transit riders’ feedback at scale, enabling agencies to better understand and improve customer experience.
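To make the comparison concrete, the following is a minimal sketch of what a keyword-based topic baseline of the kind mentioned above might look like, together with the accuracy computation used to score it. It is not the paper's implementation: the topic names, the keyword sets, and the function names `tokenize`, `classify`, and `accuracy` are all hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical topic lexicon: these topic names and keywords are illustrative
# assumptions, not the 11 topics or the lexicon used in the paper.
TOPIC_KEYWORDS = {
    "delays": {"late", "delay", "delayed", "waiting"},
    "cleanliness": {"dirty", "trash", "smell", "clean"},
    "fares": {"fare", "card", "refund", "charged"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def classify(text, default="other"):
    """Return the topic whose keywords occur most often in the text."""
    counts = Counter()
    for token in tokenize(text):
        for topic, keywords in TOPIC_KEYWORDS.items():
            if token in keywords:
                counts[topic] += 1
    if not counts:
        # No keyword matched, so fall back to a catch-all label.
        return default
    # Pick the highest count; break ties deterministically by topic name.
    return min(counts, key=lambda topic: (-counts[topic], topic))


def accuracy(texts, labels):
    """Fraction of texts whose predicted topic matches the given label."""
    correct = sum(classify(text) == label for text, label in zip(texts, labels))
    return correct / len(labels)
```

A baseline like this only matches surface keywords, so feedback that describes a topic without using any listed word falls into the catch-all label. That limitation is the kind of gap a fine-tuned language model is intended to close.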

Keywords

public transportation, big data, large language models
