Abstract
This test paper develops and tests 13 direct ridership models (DRMs) for transit sketch planning the Dallas–Fort Worth region. We explore both, machine learning modeling approaches (e.g., ridge regression and random forest) and traditional statistical models (e.g., linear regression and multiplicative regression). This effort provides a detailed description of modeling workflows and of the preprocessing of input data including general transit feed specification (GTFS), employment, socio-demographic, and ridership data. We also describe metrics to compare model performance; in our experiments the ridge regression framework using a Yeo-Johnson power transformation led to the most accurate predictions with an
Keywords
Get full access to this article
View all access options for this article.
