Abstract
This research was conducted to aid hotels in identifying and enhancing the hotel selection attributes that customers deem important for maintaining revenue and competitive advantage. Departing from conventional survey methods, we utilized BERTopic on the extensive review data from TripAdvisor to extract crucial hotel selection attributes. Through RIPA and PRCA analyses across various hotel star ratings, we sought insights into customer perceptions. The findings unveiled eight hotel selection attributes deemed significant by customers, revealing no differences in implicitly derived importance, satisfaction performance and overall customer satisfaction for staff, trip type, and resort fees across hotel star ratings. However, variations were evident in other hotel selection attributes. Thus, in contrast to prior studies, this research employed big data techniques like BERTopic for topic extraction and utilized PRCA additionally to compensate for the limitations of RIPA, identifying differences among attributes included in the same quadrant.
Keywords
Introduction
Advancements in information and communication technologies such as the Internet of Things (IoT), artificial intelligence (AI), and mobile technologies have led to a significant surge in users engaging with online services. This trend has also extended to the hospitality and tourism industries, where various online services are offered (Aldebert et al., 2011; Law et al., 2020; Navío-Marco et al., 2018; Yang et al., 2021). Consequently, many customers now use hotel e-commerce platforms to make reservations and pay for their preferred accommodation. One prominent hotel e-commerce platform is TripAdvisor, the world’s largest travel platform, which caters to an average of 490 million users monthly (TripAdvisor, 2019). On these hotel e-commerce platforms, not only do users provide satisfaction ratings for their hotel experiences but they also have the capability to craft detailed reviews of their stays. According to Ady and Quadri-Felitti (2015), as many as 95% of customers make decisions about booking hotels based on customers ratings and reviews. Furthermore, it has been demonstrated that the more positive the evaluations of a hotel are, the higher its revenue tends to be in a proportional manner (Anderson & Lawrence, 2014; Gao et al., 2020). Accordingly, to enhance revenue and secure a competitive edge, hotels should identify and improve the attributes that customers consider important through ratings and reviews (Kim & Park, 2017). Therefore, many previous studies have predominantly utilized traditional IPA (Importance-Performance Analysis) techniques to analyze hotel selection attributes that customers consider important, and their corresponding satisfaction levels (Cvelbar & Dwyer, 2013; Wilkins, 2010). However, traditional IPA has limitations such as multicollinearity issues and the tendency for most hotel selection attribute factors to be skewed toward the first and third quadrants (W. Deng, 2007). To address these limitations, researchers are increasingly utilizing the Revised Importance-Performance Analysis (RIPA) technique to analyze the relative importance of and satisfaction with hotel selection attributes (S. Huang, 2010; S. Huang et al., 2015; Liu, 2010). Most studies employing the RIPA technique have used data collected through surveys. However, previous studies based on survey data collection have limitations in that they can only gather information on artificially defined hotel selection attributes, and it can be challenging to collect large amounts of data. Therefore, in this study, to address these limitations, we collect a vast amount of hotel star ratings and review data from TripAdvisor. These reviews intricately encompass details of customers’’ travel experiences and preferences (Ding et al., 2020). Utilizing the BERTopic, a sophisticated topic modeling technique, we apply it to the gathered review data to extract the hotel selection attributes deemed significant by the customers. These extracted attributes are then analyzed using the RIPA method, similar to prior research (Bi et al., 2019; Joung & Kim, 2021).
However, there exists a limitation in the RIPA results where it becomes challenging to discern differences among attributes within the same quadrant. Even if attributes are within the same quadrant, their impact on overall satisfaction can vary significantly (G. C. Ku & Shang, 2020). Recent studies in the hotel and tourism domain have indicated the presence of asymmetric relationships between service attributes and overall satisfaction (Davras & Caber, 2019; C. Zhang et al., 2021; W. T. Zhang et al., 2022). To solve these problems, PRCA (Penalty-Reward Contrast Analysis) is used to be a key analytical technique in these studies (C. Zhang et al., 2021; W. T. Zhang et al., 2022). Therefore, following the RIPA analysis, this study aims to employ the PRCA method to understand how attributes within each quadrant are influenced by specific factors.
According to previous studies, major differences have been noted in the hotel selection attributes considered significant by customers and overall satisfaction based on hotel star ratings (Soifer et al., 2021). Hotel star ratings serve as the most indicative measure of a hotel’s quality and service scope, shaping customer expectations and biases (Rhee & Yang, 2015). Moreover, varying criteria such as price, provided services, and facilities are associated with different hotel star ratings. Therefore, hotel star ratings can be considered as benchmarks for customers to evaluate service quality (W. J. Huang et al., 2018). Martin-Fuentes (2016) conducted an analysis of overall customer satisfaction based on hotel star ratings by collecting data from over 14,000 hotels in more than 100 cities worldwide from TripAdvisor and Booking. The results indicated a positive correlation between higher hotel star ratings and overall customer satisfaction. Similarly, Mohsin et al. (2019) performed a survey and IPA (Interpretative Phenomenological Analysis) targeting customers of 3, 4, and 5-star hotels in Lisbon, Portugal, analyzing satisfaction regarding hotel selection attributes. Their analysis revealed differences in the importance of hotel selection attributes and satisfaction levels based on hotel star ratings. Therefore, in this study, we aim to classify and compare hotels in New York, USA, based on the rating criteria provided by TripAdvisor. To achieve this, we collect rating and review data from customers who have visited New York hotels on TripAdvisor. Subsequently, we apply BERTopic to the collected review data to extract the hotel selection attributes valued by customers. We plan to utilize RIPA to understand the relative importance and satisfaction of these selection attributes based on hotel star ratings. Finally, by conducting PRCA, we categorize hotel selection attributes into the three-factor theory: Excitement Factor, Performance Factor, and Basic Factor, providing insights accordingly.
In this study, we apply BERTopic to a vast amount of reviews to extract the selection attributes that customers deem important, categorized by hotel star ratings. This approach, based on a substantial volume of data, offers a more extensive analysis of customer selection attributes compared to traditional survey-based research. Consequently, the survey questionnaire developed on this basis can potentially yield more objective survey data than previous methods.
Therefore, this study expands the research on service quality by applying RIPA and additionally utilizes PRCA to understand the impact of hotel selection attributes on overall customer satisfaction. Through this approach, we anticipate providing more specific insights compared to studies that solely rely on RIPA to analyze customer satisfaction regarding hotel selection attributes. Furthermore, based on the results of this study, hotels can enhance their service quality and contribute to overall hotel marketing and management strategies.
Related Work
BERTopic
Topic modeling entails the identification of latent themes embedded within collections of words or documents and grouping akin subjects (Blei, 2012). Latent Dirichlet Allocation (LDA) is one of prominent techniques in this domain (Jelodar et al., 2019). However, basic principle of LDA is deriving topics based on word frequency, which leads to the limitation of not considering the context’s sequence and meaning. Recently, a context-aware topic modeling technique called BERTopic has emerged. Unlike traditional topic modeling methods, BERTopic considers the contextual information of words, making it distinct in its approach (Grootendorst, 2022). Therefore, BERTopic is reported to exhibit superior performance compared to conventional prominent topic modeling techniques, as it is built upon a BERT model trained with consideration of contextual word information (Egger & Yu, 2022; Grootendorst, 2022; Meaney et al., 2022).
The BERTopic algorithm involves a four-step process to derive topics: document embedding extraction, embedding dimension reduction, embedding clustering, and generating topic representations using class-based TF-IDF. First, BERTopic utilizes a pretrained language model, BERT, to perform embedding. Second, Uniform Manifold Approximation and Projection (UMAP) are employed to reduce the dimensions of the embeddings. Third, the Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) generates clusters of semantically similar documents. Finally, the class-based term frequency-inverse document frequency (c-TF-IDF) was used to extract keywords representing the topics. However, in this study, K-means clustering was used instead of HDBSCAN. This is because prior research has shown that using K-means clustering instead of HDBSCAN in short sentences, such as reviews, can yield more accurate results (de Groot et al., 2022). Therefore, to extract the topics of hotel selection attributes that are significant to customers who visited hotels from the collected reviews, this study utilizes BERTopic.
RIPA
Traditional Importance-Performance Analysis (IPA) was initially proposed by Martilla and James (1977) as an analytical method to assess the importance of and satisfaction with products and services. However, traditional IPA has limitations such as multicollinearity issues and the skewing of attributes related to products and services toward the first and third quadrants (W. J. Deng et al., 2008; Kano, 1984; Matzler et al., 2004). To address these limitations, W. Deng (2007) introduced the RIPA method. RIPA initially computes overall satisfaction and attribute-specific satisfaction for each product and service. Then, a partial correlation analysis was conducted between attribute-specific satisfaction after undergoing a natural logarithm transformation and overall satisfaction. The resulting partial correlation coefficients were used as the relative importance values. Through this approach, the limitations of traditional IPA, where attributes tend to be skewed toward the first and third quadrants, can be minimized.
RIPA consists of four quadrants, as illustrated in Figure 1. The X-axis represents satisfaction performance with products and services, whereas the Y-axis represents implicitly derived importance. The four quadrants were formed based on the average values of the X-axis (Satisfaction Performance) Y-axis (Implicitly derived Importance). Among the four quadrants, the first represents the “Keep up the good work” quadrant. Attributes with both high importance and satisfaction are located in this quadrant. Service providers must maintain the attributes in this quadrant to enhance customer satisfaction continuously. The second quadrant, “Concentrate here” quadrant represents a quadrant where attributes have high importance but low satisfaction. In this quadrant, the attributes require focused improvements by company managers to enhance customer satisfaction. The third quadrant represents the “Low priority” quadrant. This quadrant is characterized by attributes that have both low importance and low satisfaction. For the attributes within this quadrant, it is more effective to allocate more resources to improve the attributes located in the first or second quadrant rather than giving them higher priority. Lastly, the fourth quadrant represents the “Possible overkill” quadrant. This quadrant is characterized by attributes with low importance and high satisfaction. This signifies that excessive resources and effort are being devoted to these attributes.

RIPA (W. Deng, 2007).
In the field of the hotel industry, studies analyzing customer satisfaction using the RIPA method proposed by W. Deng (2007) have been actively conducted (Caber et al., 2012; S. Huang, 2010; G. C. M. Ku & Mak, 2017). However, these previous studies mainly collected data through survey methods, which limited information collection to factors subjectively predefined by researchers and posed challenges in acquiring large-scale data. Therefore, to address these limitations, we utilized data collected from TripAdvisor, including a vast number of reviews and ratings, using web-crawling techniques. Furthermore, we applied BERTopic to the collected reviews to extract hotel selection attributes. Subsequently, we intend to perform RIPA using the derived hotel-selection attributes.
PRCA
The PRCA is an applied method based on Kano (1984) study aimed at identifying attributes that can enhance customer satisfaction. This technique is used to analyze the asymmetric impact of product and service attributes (Brandt, 1987). PRCA involves two main steps in determining how each attribute affects overall satisfaction. First, the performance of each attribute for products and services was divided into two groups: High Performance and Low Performance. Dummy variables were generated based on this division. In the second step, the two sets of dummy variables were treated as independent variables and the overall satisfaction rating was set as the dependent variable. Multiple regression analysis was conducted to determine how these variables collectively influenced the overall satisfaction rating. Subsequently, using the obtained regression coefficients of the two sets of dummy variables, the method determines the categorization of each attribute into one of the three factors within the context of the three-factor theory (Füller & Matzler, 2008).
As depicted in Figure 2, the Three-Factor Theory is classified into three factors: Excitement, Performance, and Basic factors (Matzler & Sauerwein, 2002). Excitement factors lead to satisfaction when fulfilled; however, their absence does not necessarily result in dissatisfaction. Performance factors lead to satisfaction when fulfilled and dissatisfaction when not fulfilled. Finally, the basic factors do not necessarily lead to satisfaction when fulfilled, but dissatisfaction arises when they are not fulfilled. Therefore, in this study, we employed the PRCA technique to determine the category of each hotel selection attribute in terms of the three-factor theory. Subsequently, we intend to incorporate these findings into the interpretation of RIPA results, ultimately providing significant insights and recommendations.

Three-factor theory (Kano, 1984).
Methodology
In this study, we aimed to address the limitations of traditional survey methods, which often incur significant time and cost, by collecting rating and review data from customers’ own experiences on TripAdvisor. Consequently, the collected data were analyzed based on hotel star ratings to identify the hotel selection attributes that customers consider important and to explore the differences among them. To achieve this, our study follows a five-step procedure outlined in (Figure 3). In the first stage, data collection involved gathering information about hotels located in New York, along with reviews and ratings provided by hotel guests from the hotel e-commerce platform, TripAdvisor. In the second stage, prior to BERTopic analysis, sentences were segmented and preprocessed. This study utilized NPMI (Normalized Pointwise Mutual Information) values to estimate the number of topics and to extract topics and keywords. In the third stage, Sentiment Analysis is conducted using the Valence Aware Dictionary for Sentiment Reasoning (VADER) sentiment lexicon (Hutto & Gilbert, 2014). In the fourth stage of RIPA, satisfaction was based on the average sentiment scores of sentences within each topic. Furthermore, the previously segmented sentences are restored to their original reviews. In the restored reviews, the sentiment scores for each topic are subjected to a natural logarithm. Subsequently, a partial correlation analysis was conducted between the transformed sentiment scores and overall satisfaction ratings. The resulting partial correlation coefficients were used as indicators of importance. Finally, in the fifth stage, the PRCA analysis was performed. This stage classifies hotel selection attributes into excitement, performance, and basic factors based on the sentiment scores for each topic during the RIPA analysis stage.

Research framework.
Data Collection
As depicted in Figure 4, comprehensive information on hotels provided by TripAdvisor was gathered during the data collection phase. Specifically, the data collection phase involved gathering information from TripAdvisor regarding hotels located in New York, USA. The reason for choosing New York City in this study is that it has the highest number of hotels and reviews on TripAdvisor (Xiang et al., 2018). Additionally, New York City attracts customers with diverse purposes such as business, leisure, and family travel (Xiang et al., 2018). Therefore, we collected information on reviews written by customers who visited hotels in New York, including the overall satisfaction ratings on a 5-point scale, the dates of the reviews and ratings, hotel names, and hotel star ratings. Hotels with fewer than 200 collected reviews were excluded from the analysis. This exclusion was done to prevent potential biases in the results, as hotels with fewer than 200 reviews might lead to skewed outcomes (Arenas-Márquez et al., 2021).

Data collection examples.
BERTopic
Prior to conducting the BERTopic analysis, the collected reviews were divided into individual sentences, as shown in Figure 5. This is because a single review comprises more than one sentence associated with distinct attributes (Albayrak et al., 2021; Shang et al., 2022). The reviews were segmented into sentences using the Spacy package within Python. Following this, preprocessing steps such as conversion to lowercase, tokenization, removal of special characters, and elimination of stopwords are carried out. Subsequently, the number of topics was estimated. In this study, the topic count was estimated using the NPMI value (Bouma, 2009). NPMI serves as a metric to gauge the semantic coherence of topics. Higher values indicate stronger semantic consistency. After determining the number of topics, BERTopic was executed to derive topics and keywords. Subsequently, suitable names were assigned to each topic, based on the extracted topics and keywords.

Spacy sentence segmentation and BERTopic analysis process.
Sentiment Analysis
In this study, the VADER sentiment lexicon was employed to assign sentiment scores to the reviews for each topic. Sentiment scores were derived within the range of −1 to +1 (Hutto & Gilbert, 2014). A sentiment score close to −1 indicates a negative sentiment, while a score near +1 indicates a positive sentiment for the respective sentence.
In line with TripAdvisor’s provision of a 5-point Likert scale for customers to evaluate their overall satisfaction with hotels, this study referred to Albayrak et al. (2021) and Shang et al. (2022). Using VADER, the derived sentiment scores were transformed into a 5-point Likert scale, as shown in Table 1. Specifically, sentiment scores below −0.55 were converted to 1 point, and scores greater than or equal to −0.55 but less than 0 were transformed to 2 points. For sentiment, scores of 0, 3 were assigned; scores greater than 0 and less than or equal to 0.55 correspond to 4 points; and scores above 0.55 are mapped to 5 points. Figure 6 serves as an example, illustrating the sentiment scores derived from the sentiment analysis of four sentences within Review 1, transformed into a 5-point Likert scale.
Convert Composite Score to 5-Point Scale (Albayrak et al., 2021).

Converting to a 5-point scale after sentiment analysis.
RIPA
In the RIPA phase, sentiment scores on a 5-point Likert scale for each topic were arithmetically averaged to produce an index used as the Satisfaction Performance value in the RIPA method. An example of this is shown in Figure 7. Initially, as shown in Step 1 of Figure 7, Review 1 consists of four sentences. Sentence 1 has a sentiment score of 5 for Topic 2, sentence 2 has a sentiment score of 3 for Topic 1, sentence 3 has a sentiment score of 3 for Topic 2, and finally, sentence 4 has a sentiment score of 2 for Topic N. Thus, both sentences 1 and 3 share a common sentiment score for Topic 2. In such cases, following the approach of Shang et al. (2022), the sentiment scores were arithmetically averaged and restored to their original form. Consequently, the final sentiment score for Topic 2 became 4. Therefore, in Step 2, the sentences that were previously divided were reconstructed to their original form within the review, followed by computing the arithmetic mean for each topic to derive the Satisfaction Performance.

Satisfaction performance derivation process.
In Step 3 of Figure 8, the sentiment scores for each topic in every review were natural logarithmic transformations, and a partial correlation analysis was conducted between the overall satisfaction score of the review and the sentiment scores.

Implicitly derived importance derivation process.
Finally, in Step 4 of Figure 9, the partial correlation coefficients obtained from the analysis are utilized as Implicitly Derived Importance values within the context of the RIPA, enabling a comparative analysis of the implicitly derived importance and satisfaction performance of the selected attributes across different hotel star ratings.

RIPA matrix example.
PRCA
In the PRCA step, we first created two dichotomized High Performance and Low Performance dummy variables for each hotel-selection attribute. We then analyzed how the High Performance and Low Performance dummy variables for each hotel selection attribute affect overall satisfaction using a multiple regression analysis.
In Step 1, we created dummy variables for High Performance and Low Performance using the sentiment scores of the reviews for each hotel-selection attribute topic from the sentiment analysis step (Albayrak & Caber, 2013). If the sentiment score is 4 or 5, High Performance is labeled as 1 and Low Performance is labeled as 0. Conversely, if the emotional score was 1 or 2, we marked High Performance as 0 and Low Performance as 1. For a sentiment score of 3, enter 0 for High Performance and 0 for Low Performance.
In step 2, we set the dummy variables of high and low performance for each topic as independent variables and the rating, which refers to overall hotel guest satisfaction, as the dependent variable. We then classified each hotel selection attribute into factors of the three-factor theory using the nonstandardized coefficients of high and low performance calculated by performing multiple regression analysis for each hotel star rating.
Finally, the unstandardized coefficients of High Performance and Low Performance for each hotel selection attribute, calculated from the multiple regression analysis for each hotel star rating were used to classify each hotel selection attribute into excitement, performance, and basic factors. For this purpose, this study classifies hotel selection attributes as excitement factors if the value of the unstandardized coefficient of High Performance divided by the value of the unstandardized coefficient of Low Performance is greater than 1.1, basic factors if it is less than 0.9, and performance factors if it is greater than 0.9 and less than 1.1, according to the criteria proposed by Füller and Matzler (2008).
Experimental Results
Data Basic Statistics
Customer reviews and ratings were collected from the TripAdvisor platform for 3, 4, and 5-star hotels located in New York, USA, from January to December 2019. The basic statistics for this study are presented in Table 2. Ultimately, 26,879 reviews and ratings for 161 3-star hotels, were further divided into 197,978 sentences. For the 4-star hotels, there were 43,885 reviews and ratings for 159 hotels. The reviews were divided into a total of 318,977 sentences. Finally, 77,690 reviews and ratings for the 5-star hotels, spread across 39 hotels were obtained. These reviews were divided into 571,087 sentences.
Data-Driven Statistics.
Topic Identification
Before conducting the BERTopic analysis, the number of topics was determined using the NPMI values, as illustrated in Figure 10. Figure 10 shows the NPMI values for each topic. It is evident that the NPMI value is highest when the number of topics is set to 8. This indicates that when the number of topics was set to 8, the keywords of the topics exhibited a high level of semantic consistency. Therefore, in this study, the number of topics was set to 8.

NPMI result.
The study derived 8 topics using BERTopic, along with their corresponding 10 keywords, as presented in Table 3. Moreover, each topic was named by considering the keywords within the topic as well as relevant keywords from previous research (Ban et al., 2019; Büschken & Allenby, 2016; Srivastava & Kumar, 2021; Xu & Li, 2016; J. Zhang & Piramuthu, 2018). Topic 1 is composed of keywords such as “room,”“clean,” and “bed,” indicating a topic related to the quality of the room. Topic 2 is composed of keywords such as “breakfast,”“bar,”“restaurant,” and “coffee,” indicating a topic related to F&B (food and beverage) services. In Topic 3, keywords like “bathroom,”“shower,” and “towel” appear, signifying attributes related to the quality and condition of the bathroom. The main keywords in Topic 4 are “staff,”“helpful,” and “service,” focusing on the behavior and attitude of the staff members. Topic 5 is a trip type and consists of keywords such as travel, family, and business. The types of customers who visit the hotel are divided into family, couple, and business, which means customized services are provided according to these types of visits. Topic 6 revolves around location attributes with keywords like “park,”“square,” and “center.” In Topic 7, keywords like “desk,”“reception,” and “bag” appear, highlighting the services offered at the front desk. Lastly, Topic 8 pertains to resort fees, with keywords like “fee,”“charge,”“wifi,” and “tax,” indicating charges that are separate from the base room rate. These could include amenity fees, resort fees, or hidden hotel booking fees that guests might have to pay in addition to the room rate. Table 3 lists the keywords related to each topic as a result of the BERTopic analysis.
BERTopic Result.
RIPA and PRCA Result
The RIPA analysis results for hotel star ratings of 3, 4, and 5-star categories are presented in Figures 11 to 13, respectively. In the RIPA Matrix, the X-axis corresponds to Satisfaction Performance, whereas the Y-axis represents Implicitly Derived Importance. The average satisfaction for 3-star hotels was 3.733, with an average importance of 0.176. For 4-star hotels, the average satisfaction was 3.777, accompanied by an average importance of 0.209. Finally, for 5-star hotels, the average satisfaction was 3.847, with an average importance of 0.168.

3-Star hotel RIPA result.

4-Star hotel RIPA result.

5-Star hotel RIPA result.
For 3-star hotels, the RIPA analysis results indicate that attributes such as room, staff, front desk, and F&B contained the “Keep up the good work” quadrant, while resort fees and bathroom attributes are contained in the “Low Priority” quadrant. Meanwhile, location and trip type are positioned in the “Possible Overkill” quadrant.
Upon examining the RIPA analysis results for 4-star hotels, it is observed that attributes like room, staff, and front desk are contained in the “Keep up the good work” quadrant. The bathroom attribute is contained in the “Concentrate Here” quadrant, and the resort fees attribute falls within the “Low Priority” quadrant. Additionally, attributes related to location, trip type, and F&B are in the “Possible Overkill” quadrant.
Moving to the analysis of 5-star hotels, the results reveal that attributes like staff, room, and F&B are encompassed by the “Keep up the good work” quadrant. Attributes of the bathroom and resort fees are contained in the “Low Priority” quadrant, while attributes related to trip type, front desk, and location fall within the “Possible Overkill” quadrant.
The PRCA results for 3-star hotels are presented in Table 4. The PRCA analysis results for 3-star hotels indicate that attributes such as room, F&B, staff, trip type, and front desk are classified as excitement factors, whereas only the bathroom attribute is a performance factor. Furthermore, the location and resort fee attributes were categorized as basic factors.
3-Star Hotel PRCA Result.
Note. IR = high performance/|low performance|.
p < .001, **p < .01; R2: .358; F: 937.28; ****Unstandardized Beta Coefficient. Dependent Variable: Overall customer satisfaction.
The PRCA results for 4-star hotels are listed in Table 5. In the case of 4-star hotels, the PRCA analysis results demonstrate that attributes such as room, F&B, staff, trip type, and front desk are classified as excitement factors, whereas bathroom, location, and resort fee attributes are classified as basic factors.
4-Star Hotel PRCA Result.
Note. IR = high performance/|low performance|.
p < .001, *p < .05; R2: .347; F: 1456.979; ****Unstandardized Beta Coefficient. Dependent Variable: Overall customer satisfaction.
The PRCA outcomes for 5-star hotels are presented in Table 6. The PRCA analysis results for 5-star hotels reveal that F&B, staff, trip type, location, and front desk attributes are identified as excitement factors. The bathroom attribute was classified as a performance factor, and attributes related to room and resort fees were categorized as basic factors.
5-Star Hotel PRCA Result.
Note. IR = high performance/|low performance|.
p < .001, **p < .01, *p < .05; R2: .294; F: 180.092; ****Unstandardized Beta Coefficient. Dependent Variable; Overall customer satisfaction.
Finally, summarizing the RIPA and PRCA analysis results for 3-star, 4-star, and 5-star hotel categories in this study yields Table 7.
3, 4, 5-Star Hotel RIPA and PRCA Result.
Discussion
In this study, we extracted the factors that hotel visitors consider important using BERTopic and analyzed the attributes and satisfaction that customers consider important for 3, 4, and 5-star hotels through RIPA and PRCA. Firstly, the hotel selection attribute regarding the room is contained in the “Keep up the good work” quadrant for all 3, 4, and 5-star hotels. This is consistent with the findings of the previous study conducted by Mohsin et al. (2019), which supports this result. Thus, it can be inferred that room is a significant factor considered by all customers visiting 3, 4, and 5-star hotels and that it is a selection attribute that currently satisfies their level of satisfaction. Therefore, this study suggests that hotels maintain their current approach to providing room services. According to the PRCA results, 3 and 4-star hotels were classified as excitement factors, while 5-star hotels were classified as basic factors. This suggests that customers of 3 and 4-star hotels primarily visit the area primarily for tourism purposes; therefore, they may not spend a significant amount of time in their rooms (Xue & Zhang, 2020). On the other hand, customers of 5-star hotels tend to enjoy leisure and vacations by utilizing the facilities and services within the hotel premises (Gupta & Dixit, 2022). In essence, it can be inferred that because customers of 3-star and 4-star hotels spend more time within the hotel premises if their satisfaction with the room is not met, dissatisfaction is likely to arise. Therefore, it is recommended that 3-star and 4-star hotels focus on the continuous improvement of services provided within the rooms, such as cleanliness, ambiance, and room service. On the other hand, for 5-star hotels, maintaining this approach is advisable.
Secondly, the aspect of F&B as a hotel selection attribute is placed in the “Keep up the good work” quadrant for both 3-star and 5-star hotels. This indicates that both the importance and satisfaction levels of the customers were high in this regard. On the other hand, for 4-star hotels, the F&B aspect falls within the “Possible Overkill” quadrant, suggesting that while customers perceive lower importance, their satisfaction levels remain high. Satisfaction with F&B services leads to a sense of contentment when met; however, dissatisfaction arises when expectations are not fulfilled. Therefore, both 3-star and 5-star hotels must maintain their current service quality for F&B. On the other hand, 4-star hotels can suffice with their basic F&B offerings, as customers still experience satisfaction. Furthermore, it is evident that F&B acts as an excitement factor for hotels for all 3-star ratings: 3-star, 4-star, and 5-star. Thus, it is anticipated that by maintaining the existing quality of F&B services while offering a more diverse range of services and personalized experiences, customer satisfaction and the intention to revisit can be enhanced.
Thirdly, the hotel selection attribute related to bathrooms was found to be located in the “Low Priority” quadrant for both 3-star and 5-star hotels, identified as a Performance factor. In other words, this attribute has relatively lower importance and satisfaction than other hotel selection attributes. Therefore, even though this attribute is not perceived as highly significant compared with other service qualities, improving service quality beyond its current state could lead to increased customer satisfaction. To address this, it is deemed necessary for 3-star and 5-star hotels to adequately consider bathroom quality attributes as a prospective long-term improvement initiative. On the other hand, 4-star hotels are situated in the “Concentrate Here” quadrant, indicating that there is an immediate need for improvement in this attribute. Furthermore, as indicated by the PRCA results, it was categorized as a basic factor. This signifies that if satisfaction with the bathroom is not met, overall dissatisfaction can arise. Upon reviewing the content of the reviews regarding this aspect, it is evident that comments revolve around factors such as a small bathroom size and instances of fixtures or floors showing signs of wear, which contribute to this perception. Therefore, it is imperative for 4-star hotels to prioritize attributes related to bathrooms over other hotel selection criteria. Regular facility inspections and maintenance measures should be undertaken to address any instances of damaged fixtures and facilities and ensure an improved guest experience. Furthermore, it’s worth noting that many customers prefer the provision of packaged bathroom amenities. Thus, hotels should consider investing in quality bathroom supplies and focus on maintaining high levels of bathroom cleanliness to enhance customer satisfaction (Jeong & Kubickova, 2021).
Fourthly, the staff attribute was contained in the “Keep up the good work” quadrant, indicating its significance in all 3, 4, and 5-star hotels. Furthermore, according to the PRCA analysis results, it was categorized as an excitement factor. Therefore, when customers are satisfied with the hotel selection attribute, satisfaction is generated; however, if customers are not satisfied, it does not necessarily result in dissatisfaction. Accordingly, it is advisable for hotel management to maintain the current employee training approach while periodically assessing how staff service performance behaviors influence customers. For instance, hotels could consider implementing programs that allow customers to evaluate aspects of service such as staff professionalism and attitudes. Therefore, hotel management must enhance and refine the quality of staff services.
Fifthly, the trip type for all 3, 4, and 5-star hotels falls within the “Possible Overkill” quadrant. Furthermore, based on the PRCA results, this attribute was classified as an excitement factor. Therefore, customers experience satisfaction when this hotel selection attribute is fulfilled, but do not necessarily feel dissatisfaction when it is not met. Accordingly, service attributes related to trip type are not considered important by all customers and can be regarded as attributes for which the current satisfaction levels are already met. In light of this, it is deemed that the hotel should not invest further in enhancing services specifically tailored for business and leisure customers, as providing an excessive level of service might not be necessary. Therefore, it is recommended that hotels focus more on services such as room and staff attitude rather than investing extensively in services related to business and leisure.
Sixthly, the location attribute is contained in the “Possible Overkill” quadrant for 3-star, 4-star, and 5-star hotels. Thus, it can be deduced that this attribute generally leads to customer satisfaction but is not considered highly important. Moreover, the PRCA results categorized 3 and 4-star hotels as basic factors, while 5-star hotels were classified as excitement factors. This implies that customers of 3 and 4-star hotels often visit for tourism purposes, favoring hotels with convenient public transportation accessibility. Therefore, dissatisfaction can arise among the customers of 3 and 4-star hotels if their expectations regarding location are not met. By contrast, customers of 5-star hotels tend to indulge in leisure and vacation activities while utilizing the facilities and services within the hotel (Alzoubi et al., 2021; Gupta & Dixit, 2022; Xue & Zhang, 2020). Therefore, it can be inferred that they do not prioritize the accessibility of the hotel’s location. It is crucial for hotel management to maintain the current operational approach rather than rapidly enhancing the accessibility of the hotel’s location.
Seventhly, the front desk attribute has been categorized within the quadrant “Keep up the good work” quadrant for both 3 and 4-star hotels, corroborating the findings of prior research studies (Mohsin et al., 2019; Ying et al., 2018). Furthermore, the PRCA results consistently identified this as an excitement factor. Customers of 3 and 4-star hotels consider the front desk attribute to be significant and are currently experiencing a level of satisfaction fulfillment. Therefore, it is advisable for 3 and 4-star hotels to maintain the current quality of front desk services provided to customers while also considering long-term service expansion. For instance, by expanding the scope of front-desk services in areas not typically offered by 3 and 4-star hotels, such as 24-hr customer support, professional telephone assistance, and multilingual support, hotels can continuously enhance customer satisfaction while broadening their service offerings. On the other hand, it is observed that 5-star hotels are situated in the “Possible Overkill” quadrant, and PRCA results also indicate their classification as excitement factors. Therefore, although customers do not significantly prioritize this attribute, their satisfaction appears to be fulfilled. 5-star hotels, the highest tier of accommodation, are known to offer personalized services to their customers (Ao, 2017). Customers are indeed experiencing satisfaction because of the high-quality services provided. Therefore, it is recommended that 5-star hotels should maintain the current state of their front desk services.
Eighthly, resort fees are contained in the “low-priority” quadrant for all 3, 4, and 5-star hotels. The PRCA results classified this as a basic factor. In essence, this attribute has lower significance and satisfaction than other service qualities, and achieving satisfaction with this aspect does not necessarily result in overall contentment. This pertains to services where customers are charged additional fees for amenities such as Wi-Fi usage, gym access, and similar conveniences during their hotel stay. Rather than investing in enhancing these services, hotels aim to prevent customer dissatisfaction by providing coupons or discount vouchers to ensure a seamless experience.
Conclusions
To address the limitations of previous studies, this research collected an extensive amount of rating and review data. The study conducted a comparative analysis of the implicitly derived importance and satisfaction performance regarding selection attributes categorized by hotel star ratings. For this purpose, a vast amount of overall satisfaction ratings and reviews data was collected from TripAdvisor and utilized in the analysis. Specifically, BERTopic was applied to hotel reviews collected from TripAdvisor, resulting in the identification of 8 topics that customers consider crucial: Room, F&B, Bathroom, Staff, Trip type, Location, Front desk, and Resort fee. Moreover, previous studies utilizing RIPA did not address the differences among factors within the same quadrant, indicating a limitation. To overcome this limitation, our study conducted additional PRCA analysis. As analysis results indicated that there was no significant difference in customer importance and satisfaction regarding staff, trip type, and Resort fees among 3, 4, and 5-star rated hotels. However, differences were observed in the customer importance and satisfaction levels for the remaining selection attributes among the different hotel star ratings.
The theoretical implications of this study are as follows: Firstly, this study addressed the limitations of survey methods, which proved challenging, time-consuming, and costly when investigating hotel guests from various countries and diverse accommodations. In order to overcome these challenges, this research collected voluntarily provided data from hotel patrons. Consequently, this approach holds significant merit in exploring the subject matter.
Secondly, this study conducted RIPA analysis by categorizing hotels based on their star ratings, thus examining the differences in customers’ perceptions of the implicitly derived importance and satisfaction performance related to hotel attributes. Specifically, the observation that only 4-star hotels have the Bathroom attribute falling under the “Concentrate Here” quadrant confirms distinctions in customers’ importance and satisfaction across various star ratings. Furthermore, a review analysis of the Bathroom attribute for 4-star hotels revealed a prevalent dissatisfaction with physical aspects such as bathroom size and equipment damage. Therefore, this underscores the need for detailed consideration of physical factors (size, amenities, facilities) pertaining to bathrooms, and this insight is expected to contribute to future studies on customer satisfaction with hotel attributes.
Thirdly, in the previous RIPA studies, the limitations persisted in understanding the differences among factors falling within the same quadrant. Addressing this gap, the current study conducted additional PRCA to discern how each hotel attribute influences customers. In essence, PRCA was employed to unveil distinctions among attributes situated in the same quadrant according to the RIPA analysis results. Therefore, this research, attempting to overcome these limitations, stands as a pioneering effort, poised to contribute significantly to numerous subsequent studies in the future.
The practical significance of this study is as follows. Firstly, in 3-star hotels, guests typically allocate their travel budget more toward tourism-related expenses such as sightseeing, transportation, and dining out, rather than on hotel accommodations (Qu et al., 2000; Xue & Zhang, 2020). Therefore, based on the findings of this study, the hotel selection attribute for front desk staff in 3-star hotels could focus on maintaining current service quality while integrating services that involve providing information about nearby tourist attractions and recommending preferred tourist destinations to guests.
Secondly, for 4-star hotels, it has become evident that improvements in the bathroom facilities are urgently needed. Specifically, customers have expressed dissatisfaction with physical aspects such as the size of the bathroom, damaged fixtures, and the availability of amenities. Swift action is required to address these concerns. Therefore, it is imperative for 4-star hotels to regularly inspect and enhance bathroom facilities, ensuring prompt repairs and replacements to meet guest expectations.
Thirdly, for 5-star hotels, which consistently offer exceptional services, the research findings affirm an overall higher level of customer satisfaction compared to other hotels. Therefore, it is imperative for these establishments to maintain the existing service quality in areas such as rooms, F&B, and staff. Furthermore, enhancing customer satisfaction specifically in the realm of bathroom facilities would significantly contribute to overall customer satisfaction. To achieve this, the incorporation of sophisticated bathroom designs and the provision of luxury-brand amenities should be considered.
The limitations of this study and avenues for future research are as follows. Firstly, hotel star ratings, overall satisfaction ratings, and reviews were collected and analyzed. However, hotel e-commerce platforms offer diverse information, including customer profiles and the usefulness of reviews. Therefore, in future studies, collecting and analyzing such additional information could potentially yield more detailed and nuanced results than the present analysis.
Secondly, this study did not consider the detailed demographic characteristics of hotel visitors, such as age and gender. According to previous research, the analysis of service satisfaction shows variations based on demographic characteristics (Sann et al., 2020). Moreover, as hotels attract customers from diverse cultural backgrounds, it will be necessary to analyze the cultural characteristics and demographic features of customers in the future.
Thirdly, to mitigate the impact of COVID-19, this study collected and analyzed data from TripAdvisor from January to December 2019. Consequently, owing to the limitations of the collected data, there is a constraint in utilizing data from a more extended timeframe. In the future, collecting additional data and performing analyses categorized by different timeframes or distinguishing between pre-COVID and post-COVID periods could provide a more comprehensive understanding of customer perceptions and satisfaction regarding hotel selection attributes. This approach is expected to offer more detailed insights than that of the current study.
Footnotes
Funding
The author(s) declared no financial support for the research, authorship and/or publication of this article.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
