Sage Journals: Discover world-class research

Abstract

Online retail platforms are increasingly challenged by the proliferation of low-quality products, which may damage their reputation and sales. To address this problem, we propose a system architecture to proactively identify products that are likely to go “out of favor.” Our approach uses historical data to extract useful information from customer ratings and textual reviews. Available data are fed into a state-of-the-art deep learning sequence model to forecast future ratings. We then analyze rating trends, extracting hyperparameters that a binary classifier uses to label products as “out-of-favor” or not. We tested this system on an Amazon dataset comprising nearly 800,000 observations across 2826 electronics products. Our results show that the Long Short-Term Memory (LSTM) model excels in forecasting future product ratings compared to other benchmarks. Ablation analysis shows sentiment-related features significantly improve rating forecasts by up to 40%, with review topics adding 10% and other review characteristics, 4%. Counterintuitively, topic extraction from reviews does not provide substantial benefits, despite the heavy computational resources it requires. Finally, the two-stage classification process, which leverages time-series data and rating trends, offers a more stable and robust performance than conventional single-stage methods. We provide considerations for system architecture development through robustness checks ensuring its resilience to stressors. Our experiments indicate that rating trends can change in subtle ways over time, leading a promising “star” product to turn into a liability (“dog”). E-commerce platforms can use the proposed system architecture proactively to identify and remove potentially dubious products instead of waiting to take reactive action.

Keywords

Product Perceived Quality Management Informational Value User-generated Content Trend Analysis System Architecture Development Sequence Model Forecasting

Get full access to this article

View all access options for this article.

References

Aaker

(1991) Managing Brand Equity: Capitalizing on the Value of a Brand Name. New York: The Free Press.

Agarwal

Chen

Zhang

(2016) The information value of credit rating action reports: A textual analysis. Management Science 62(8): 2218–2240.

Ahmed

Ghabayen

(2020) Review rating prediction framework using deep learning. Journal of Ambient Intelligence and Humanized Computing 13(7): 3423–3432.

Armstrong

Genc

Verbeek

(2017) Going for gold: An analysis of Morningstar analyst ratings. Management Science 65(5): 2310–2327.

Asghar

(2016) Yelp dataset challenge: Review rating prediction. arXiv preprint arXiv:1605.05362.

Bagwell

Riordan

(1991) High and declining prices signal product quality. The American Economic Review 81(1): 224–239.

Blei

Lafferty

(2006) Dynamic topic models. In: Proceedings of the 23rd International Conference on Machine Learning. Pittsburgh, Pennsylvania: Carnegie Mellon University, pp.113–120.

Bryhn

Dimberg

(2011) An operational definition of a statistically meaningful trend. PLoS One 6(4): e19241.

Castillo

(2022) Predicting product ratings with no-code machine learning. Available at: https://www.odury.com/blog/predicting-product-ratings (accessed 23 November 2023).

10.

Chambua

Niu

(2021) Review text based rating prediction approaches: Preference knowledge learning, representation and utilization. Artificial Intelligence Review 54: 1171–1200.

11.

Chen

Hitt

Hong

, et al. (2021) Measuring product type and purchase uncertainty with online product ratings: A theoretical model and empirical application. Information Systems Research 32(4): 1470–1489.

12.

Cheng

Zhang

Yan

(2020) Understanding the impact of individual users’ rating characteristics on the predictive accuracy of recommender systems. INFORMS Journal on Computing 32(2): 303–320.

13.

Chevalier

Mayzlin

(2006) The effect of word of mouth on sales: Online book reviews. Journal of Marketing Research 43(3): 345–354.

14.

Chicco

Jurman

(2020) The advantages of the Matthews Correlation Coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics 21(1): 1–13.

15.

Cho

Sosa

Hasija

(2021) Reading between the stars: Understanding the effects of online customer reviews on product demand. Manufacturing Service Operations Management 24(4): 1977–1996.

16.

Choi

Cho

Yim

, et al. (2019) When seeing helps believing: The interactive effects of previews and reviews on e-book purchases. Information Systems Research 30(4): 1164–1183.

17.

Cobb-Walgren

Ruble

Donthy

(1995) Brand equity, brand preference, and purchase intent. Journal of Advertising 24(3): 25–40.

18.

Cui

, et al. (2020) Stacked bidirectional and unidirectional LSTM recurrent neural network for forecasting network-wide traffic state with missing values. Transportation Research Part C: Emerging Technologies 118: 102674.

19.

Dellarocas

Zhang

Awad

(2007) Exploring the value of online product reviews in forecasting sales: The case of motion pictures. Journal of Interactive Marketing 21(4): 23–45.

20.

Deng

Zheng

Khern-am-nuai

, et al. (2021) More than the quantity: The value of editorial reviews for a user-generated content platform. Management Science 68(9): 6865–6888.

21.

Dimitrov

Zamal

Piper

, et al. (2015) Goodreads versus Amazon: The effect of decoupling book reviewing and book selling. In Ninth International AAAI Conference on Web and Social Media.

22.

Duan

Jiang

Jain

(2022) Combining review-based collaborative filtering and matrix factorization: A solution to rating’s sparsity problem. Decision Support Systems 156: 113748.

23.

Flanagin

Metzger

Pure

, et al. (2014) Mitigating risk in ecommerce transactions: Perceptions of information credibility and the role of user-generated ratings in product quality and purchase intention. Electronic Commerce Research 14(1): 1–23.

24.

Forcht

(1994) Computer Security Management. Boston, MA: Course Technology Press.

25.

Gao

Greenwood

Agarwal

, et al. (2015) Vocal minority and silent majority. MIS Quarterly 39(3): 565–590.

26.

Gibbons

Bhaumik

Aryal

(2009) Statistical Methods for Groundwater Monitoring. Hoboken, NJ: John Wiley & Sons.

27.

Graves

Liwicki

Fernández

, et al. (2009) A novel connectionist system for unconstrained handwriting recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(5): 855–868.

28.

Griffiths

Steyvers

(2004) Finding scientific topics. Proceedings of the National Academy of Sciences 101(suppl 1): 5228–5235.

29.

Gunarathne

Rui

Seidmann

(2021) Racial bias in customer service: Evidence from Twitter. Information Systems Research 33(1): 43–54.

30.

Hollenbeck

Proserpio

(2022) The market for fake reviews. Marketing Science 41(5): 896–921.

31.

Hochreiter

Schmidhuber

(1997) Long short-term memory. Neural Computation 9(8): 1735–1780.

32.

Hutto

Gilbert

(2014) VADER: A parsimonious rule-based model for sentiment analysis of social media text. Proceedings of the International AAAI Conference on Web and Social Media 8: 216–225.

33.

Ivanov

Sharman

(2018) Impact of user-generated internet content on hospital reputational dynamics. Journal of Management Information Systems 35(4): 1277–1300.

34.

Kavanagh

(2021) The impact of customer reviews on purchase decisions. Bizrate Insights. Available at: https://bizrateinsights.com/resources/shopper-survey-report-the-impact-reviews-have-on-consumers-purchase-decisions/ (accessed 15 March 2024).

35.

Khan

Niu

(2021) CNN With depthwise separable convolutions and combined kernels for rating prediction. Expert Systems with Applications 170: 114528.

36.

Khan

Niu

Sandiwarno

, et al. (2021) Deep learning techniques for rating prediction: A survey of the state-of-the-art. Artificial Intelligence Review 54(1): 95–135.

37.

Kim

(2021) When does online review matter to consumers? The effect of product quality information cues. Electronic Commerce Research 21(4): 1011–1030.

38.

Kingma

(2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.

39.

Krahnen

Weber

(2001) Generally accepted rating principles: A primer. Journal of Banking & Finance 25(1): 3–23.

40.

Laddha

Mukherjee

(2018) Aspect opinion expression and rating prediction via LDA–CRF hybrid. Natural Language Engineering 24(4): 611–639.

41.

Wen

Shi

(2015) Research on product quality control in Chinese online shopping: Based on the uncertainty mitigating factors of product quality. Total Quality Management & Business Excellence 26(5–6): 602–618.

42.

Zeng

, et al. (2020) Understanding and predicting users’ rating behavior: A cognitive perspective. INFORMS Journal on Computing 32(4): 996–1011.

43.

(2018) Impact of average rating on social media endorsement: The moderating role of rating dispersion and discount threshold. Information Systems Research 29(3): 739–754.

44.

Hitt

(2010) Price effects in online product reviews: An analytical model and empirical analysis. MIS Quarterly 34(4): 809–832.

45.

Linton

Teo

EGS

Bommes

, et al. (2017) Dynamic topic modelling for cryptocurrency community forums. In: Härdle

chen

Overbeck

(eds) Applied Quantitative Finance. Statistics and Computing. Berlin, Heidelberg: Springer, 355–372. https://doi.org/10.1007/978-3-662-54486-0_18

46.

Liu

Feng

Liao

(2017) When online reviews meet sales volume information: Is more or accurate information always better? Information Systems Research 28(4): 723–743.

47.

Huang

, et al. (2013) Promotional marketing or word-of-mouth? Evidence from online restaurant reviews. Information Systems Research 24(3): 596–612.

48.

Meyes

de Puiseau

, et al. (2019) Ablation studies in artificial neural networks. arXiv preprint arXiv:1901.08644.

49.

Mitra

Golder

(2006) How does objective quality affect perceived quality? Short-term effects, long-term effects, and asymmetries. Marketing Science 25(3): 230–247.

50.

Naumzik

Feuerriegel

Weinmann

(2022) I will survive: Predicting business failures from customer ratings. Marketing Science 41(1): 188–207.

51.

Newman

Lau

Grieser

, et al. (2010) Automatic evaluation of topic coherence. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, California, USA. pp. 100–108.

52.

McAuley

(2019) Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Hong Kong, China, pp.188–197.

53.

Noble

P-JM

Appleton

Radford

, et al. (2021) Using topic modelling for unsupervised annotation of electronic health records to identify an outbreak of disease in UK dogs. PLoS One 16(12): e0260402.

54.

Osterbrink

Alpar

Seher

(2020) Influence of images in online reviews for search goods on helpfulness. Review of Marketing Science 18(1): 43–73.

55.

Palmer

(2021) Amazon gives consumers easier way to file complaints for faulty goods from third-party sellers. CNBC. Available at: https://www.cnbc.com/2021/08/10/amazon-lets-consumers-file-complaints-for-faulty-goods-from-3p-sellers.html (accessed 20 April 2023).

56.

Rindova

Williamson

Petkova

, et al. (2005) Being good or being known: An empirical examination of the dimensions, antecedents, and consequences of organizational reputation. Academy of Management Journal 48(6): 1033–1049.

57.

Sahoo

Dellarocas

Srinivasan

(2018) The impact of online product reviews on product returns. Information Systems Research 29(3): 723–738.

58.

Sahoo

Krishnan

Duncan

, et al. (2012) Research note—the halo effect in multicomponent ratings and its implications for recommender systems: The case of Yahoo! movies. Information Systems Research 23(1): 231–246.

59.

Sari

DAT

Giantari

(2020) Role of consumer satisfaction in mediating effect of product quality on repurchase intention. International Research Journal of Management, IT and Social Sciences 7(1): 217–226.

60.

Shah

Kothari

Thakkar

, et al. (2020) Information and Communication Technology for Sustainable Development. In: Tuba

Akashe

Joshi

(eds) Advances in Intelligent Systems and Computing. vol. 933. Singapore: Springer, 279–288.

61.

Sun

(2012) How does the variance of product ratings matter? Management Science 58(4): 696–707.

62.

Wan

Pan

(2018) Opinion evolution of online consumer reviews in the e-commerce environment. Electronic Commerce Research 18(2): 291–311.

63.

Wang

L-C

Lai

, et al. (2019) Tree-structured regional CNN-LSTM model for dimensional sentiment analysis. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28: 581–591.

64.

Wang

Goes

Wei

, et al. (2019) Production of online word-of-mouth: Peer effects and the moderation of user characteristics. Production and Operations Management 28(7): 1621–1640.

65.

Wells

Valacich

Hess

(2011) What signal are you sending? How website quality influences perceptions of product quality and purchase intentions. MIS Quarterly 35(2): 373–396.

66.

Winer

(2007) Marketing management. Pearson Prentice Hall.

67.

Huang

Long

, et al. (2007) On the trend, detrending, and variability of nonlinear and nonstationary time series. Proceedings of the National Academy of Sciences 104(38): 14889–14894.

68.

Xiao

Benbasat

(2011) Product-related deception in e-commerce: A theoretical perspective. MIS Quarterly 35(1): 169–195.

69.

Gao

Viswanathan

(2014) Strategic behavior in online reputation systems. MIS Quarterly 38(4): 1033–1056.

70.

Zeithaml

(1988) Consumer perceptions of price, quality, and value: A means-end model and synthesis of evidence. Journal of Marketing 52(3): 2–22.

71.

Zhang

(2021) Analyzing cultural expatriates’ attitudes toward “Englishnization” using dynamic topic modeling. Journal of Computer-Assisted Linguistic Research 5: 1–26.

72.

Zheng

Law

, et al. (2021) Identifying unreliable online hospitality reviews with biased user-given ratings: A deep learning forecasting approach. International Journal of Hospitality Management 92: 102658.

73.

Zhou

Duan

(2016) Do professional reviews affect online user choices through user reviews? An empirical study. Journal of Management Information Systems 33(1): 202–228.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.75 MB

From Stars to Dogs: A Data Analytic Approach to Identifying “Out-Of-Favor” Products on E-Commerce Platforms

Abstract

Keywords

Get full access to this article

References

Supplementary Material