Abstract
Topic modelling is the algorithm of future to expand the horizons of domain knowledge in marketing due to two reasons: (1) its ability to derive marketing insights from burgeoning wave of text data, and (2) its thoroughness in conducting literature reviews to extract latent meaning out of extant research in marketing. This study carries out a comparative assessment of two cutting-edge unsupervised topic modelling algorithms: BERTopic based on bidirectional encoder representations from transformers (BERT), and latent Dirichlet allocation (LDA). The sample for this study includes text data generated from 200 research papers published in Web of Science (WoS) indexed journals during the 5-year period of 2019 to 2023. This labelled curated sample comprises of 50 published papers belonging to each one of the elements of marketing mix – product, price, place and promotion. The topic modelling outputs are evaluated based on a comparison of topical solutions obtained using BERTopic and LDA. The study presents categorical evidence pertaining to superiority of BERTopic over LDA as an unsupervised topic modelling algorithm.
Keywords
Get full access to this article
View all access options for this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
