Sage Journals: Discover world-class research

Abstract

The purpose of this study was to consolidate machine learning applications and develop a method to simultaneously analyze unstructured text and images pertaining to travel and tourism. This paper extracted city-related tourist-generated content from social media posts and analyzed this content to elucidate public perception of Taipei and identify the factors that make these posts attractive. Amidst the global COVID-19 pandemic of the early 2020s, this study examines social media discourse on urban topics. Focused on the period from 2019 to 2020, it compares content to discern shifts in societal concerns amidst the pandemic’s progression. The analysis aims to illuminate evolving thematic patterns within city-related discussions against the backdrop of this unprecedented public health crisis. Several techniques and technologies, including content mining, Google Cloud Vision AI, topic modeling, and artificial intelligence machine learning were adopted to analyze the images and interactive characteristics of tourist-generated content relating to the city imagery and tourism transformation of Taipei. The data analyzed in this study was collected from Facebook, and RapidMiner was employed as the mining environment to apply topic modeling to identify the topics in tourist-generated content relating to Taipei before and during the pandemic and elucidate expectations and topic evolutions; and extract meaning images and text from the topics and combine them with interactive data from social media posts to identify the topics inductive to the public at different periods of the pandemic. The main graphic theme before the epidemic was to convey the charm of Taipei, compared to the graphic theme during the epidemic, which shifted to a nature-based image.

Keywords

Taipei COVID-19 pandemic social media topic modeling image and text analysis

Introduction

The greatest challenge for the travel and tourism industry in 2020 was undeniably the outbreak of the COVID-19 pandemic. An analysis of the effects of the pandemic on travel revealed that the imposition of border restrictions by various countries in response to the pandemic between February and March of 2020 was the single most influential factor for international travel. In Taiwan, the successive implementation of lockdown policies worldwide drastically reduced the number of inbound passengers to 2,559. Of that number, only ten passengers arrived in Taiwan for the purpose of tourism, presenting a stark contrast to the almost 500,000 monthly international tourists before the pandemic. The impact of public health measures on Taiwan’s tourism industry is undeniable. Fortunately, the pandemic has gradually been brought under control in in Taiwan, and local tourism is showing signs of recovery. To further promote tourism and differentiate Taipei from other cities, the Taipei City Government introduced its worry-free travel initiative, offering a selection of quarantine hotels and tourism-centered discounts to tourists traveling independently or in groups. The purpose of these incentives is to help deepen tourists’ experiences and encourage them to visit Taipei.

New tourism and travel trends emerged in 2020 amidst the COVID-19 pandemic. A survey conducted by the World Trade and Tourism Council (WTTC) on pandemic travelers revealed that 92% trusted the recommendations of their family and friends. These findings coincided with the growing attention to tourist-generated content (TGC) on social media. Social media users post content containing cultural and city elements using images or text that are meaningful to them to showcase their travel experiences and city imagery. Unfortunately, current tools for measuring the online images of tourism cities (Papathanassis & Knolle, 2011; J. Zhang, 2018) lack academic and empirical support (Xu et al., 2023), and most text-based software is ill-equipped to process graphical data. Therefore, the purpose of this study was to consolidate machine learning applications and develop a method to simultaneously analyze unstructured text and images pertaining to travel and tourism.

Social media has become an indispensable part of everyday life. City governments take advantage of social media to interact with the public and promote their cities. However, few studies on the communicative traits of social media have focused on the thematic preferences of social media users during the pandemic or on the effective exchange of social media content. To fill this knowledge gap, this paper extracted city-related TGC from social media posts and analyzed this content to elucidate public perception of Taipei City and identify the factors that make these posts attractive.

Several techniques and technologies, including content mining, Google Cloud Vision AI, topic modeling, and AI machine learning were adopted to analyze the images and interactive characteristics of TGC relating to the city imagery and tourism transformation of Taipei City. The data analyzed in this study was collected from Facebook, and RapidMiner was employed as the mining environment.

The objectives were to apply topic modeling to identify the topics in TGC relating to Taipei City before and during the pandemic and elucidate expectations and topic evolutions; and extract meaning images and text from the topics and combine them with interactive data from social media posts to identify the topics inductive to the public at different periods of the pandemic.

This paper attempted to capture the value dimensions of cities, elucidate how the public interprets and interacts with city images, and investigate how people’s perception of city images is shaped by textual and graphical cues on social media. The term “city image” discussed in this study refers to the textual and graphical TGC on social media. This paper applied this term to deduce how users subjectively interact with one another. A comprehensive algorithm was developed to analyze the interactivity of city-related social media information and construct different topic models. More specifically, this study identified the representative independent variables of the text and image sets. These sets were adopted inputs for function selection and sorting. The results were combined with the text and image content to estimate the city-related TGC characteristics in each condition class. Finally, the aggregated content variables of the text and images were used to highlight valuable city-related posts.

This study examined pre-pandemic and peri-pandemic social media posts to elucidate how social media satisfies user needs and how users use TGC to satisfy their information needs. It applied the findings of this paper to develop a post-pandemic city promotion strategy for Taipei City that encourages collaboration between city promoters and social media users. This study is structured as follows. Section “Literature” briefly discusses existing literature on cities and travel expectations, past viral outbreaks, city image analysis and TGC, and topic analysis and latent Dirichlet allocation (LDA), city-related cues that frequently appear in pre-pandemic and peri-pandemic social media posts and several hypotheses on the effects of these cues on post interactivity. The hypotheses were applied to assess the impact of topical cues on engagement. Section “Methods” discusses the research process and methodology. Section “Results” presents the data analysis results and presents the results and several recommendations for city-related social media content.

Literature

Expectations of Cities and Pandemic Studies

Smart cities have been developing globally, but because studying them involves multiple disciplines, understanding and describing them holistically is difficult. Lim et al. (2021) conducted a study providing a comprehensive overview of smart city literature (Lim et al., 2021). Their findings demonstrate the latest research developments, provide a common foundation for understanding smart cities from a multidisciplinary perspective, and facilitate further research and development.

Technologies such as artificial intelligence (AI) and the Internet of Things have the potential to transform cities into more sustainable smart cities. Herath et al. (2022) discuss the application of AI in smart cities, including in key areas like healthcare, education, environmental and waste management, mobility and smart transportation, agriculture, risk management, and security (Herath & Mittal, 2022). The integration of AI into smart cities can be beneficial by automating operations, reducing human error, making decisions based on valid data, and improving the environment through different systems. This helps create new business opportunities and allows for more efficient city management through automation (Gonzalez et al., 2021).

Urban agglomerations are becoming centers of social, economic, cultural, and artistic activities, and a United Nations report predicts that 68% of the world’s population will live in cities by 2050. In the study by Lim et al. (2021), they explore a case study applying Twitter text mining in a smart city: Quito, the capital of Ecuador. Local governments can use information and communication technologies (ICT) in such ways to collect data via social networking and social mining, which allows for analysis of residents’ opinions and reactions to government actions and thus helps governments make more informed decisions and policies (Lim et al., 2021).

Expectation theory details how people unintentionally form experiences and perceptions and how they apply these experiences and perceptions to process future knowledge and beliefs. When people lack personal experiences, they tend to form expectations to minimize the risk attached to information sources, such as comments or responses posted by their friends (Hsu & Song, 2014). In the absence of personal information, information from the government or travel and tourism vendors can be used to evaluate travel expectations. Expectation refers to an anticipatory process of future or unfinished events. It can be positive or negative and be both rational (a certain outcome may be achieved) and emotional (hope or fear). Previous studies on travel expectations largely focused on expectation fulfillment and the role of expectation in satisfaction psychology (Nath et al., 2016; C. Wang et al., 2016). Therefore, assessing people’s attitudes can uncover their expectations of the city.

In expectations models, attitude is governed by the probability that the subject demonstrates specific subjective traits, and the subject’s overall attitude reflects the relative strength of these traits and can consequently be measured based on the subject’s overall attitude. Therefore, it can be said that expectation is a determinant of subjective attitude. Travelers generally have limited knowledge of the city they are visiting, particularly those they have not yet visited. Therefore, travelers’ attitudes, regardless of correctness, often mediate their revisit intentions. For example, electronic word-of-mouth (eWOM) reflects subjective norms, perceived influence, and behavioral control and positive reviews can change people’s attitudes and expectations. The theory of planned behavior states that when travelers post pictures of their travels, expectations are formed from specific beliefs, and these expectations reflect the traveler’s ideal city imagery. When people learn about a tourism city, they form a subjective city image and expectations (Crump et al., 2023), including traffic conditions around attractions, services and conveniences (McKercher, 2016), cultures, places they want to visit (H. Kim & Chen, 2019; Kruger et al., 2013), affordable products or services (Xie et al., 2016), restaurants and local cuisines (J. Kim & Fesenmaier, 2017), and travel protection and risks (Tasci, 2016).

Liang et al. (2019) examined Ebola-related Twitter feeds in 2019 and modeled the trajectories of Ebola-related messages (Liang et al., 2019). The researchers concluded that community spread and transmission were the most discussed topics on Twitter. Fung et al. (2014) randomly sampled Tweets relating to the Ebola outbreak and found that most of the Tweets originated from the United States, even though the outbreaks primarily occurred in countries with limited Internet access, such as Guinea, Liberia, and Sierra Leone (Fung et al., 2014). Subsequently, most of the Tweets were negative and angry, leading to high levels of anxiety associated with the Ebola virus. Fu et al. (2016) analyzed Ebola-related Tweets by the Centers for Disease Control and Prevention (CDCUSA), World Health Organization (WHO), and Médecins Sans Frontières (MSF) and concluded that Twitter is a useful platform for engaging in meaningful discussions, suggesting that public health authorities can take advantage of social media to spread correct information and combat misinformation (Fu et al., 2016). The researchers suggested that Twitter can take advantage of its platform to provide accurate information about the disease to reduce fear and anxiety in unrelated regions.

Vijaykumar et al. (2018) analyzed Zika-related Tweets and found that among 12 topics, the spread of the Zika virus was the most discussed topic on Twitter (Vijaykumar et al., 2018). Pruss et al. (2019) built tweet corpora in three languages (Spanish, Portuguese, and English). The corpora were then used to construct a multilingual model for identifying key topics across multiple languages. The researchers found that the outbreak of the Zika virus was discussed differently worldwide and that the topics were distributed differently across the three languages (Pruss et al., 2019). In cognitive linguistics, especially in the study of metaphors, public discourse is often analyzed with different figurative and literal frames. Metaphors are often used to discuss different aspects of diseases, such as their treatments, outbreaks, and symptoms. Metaphors are particularly powerful in framing health-related discourse, and they have been shown to affect the overall health of patients (Entman, 1993).

Government-enforced social distancing has spurred Internet users to use social media to express their concerns, opinions, beliefs, and views of reality. On Twitter, tweets containing the hashtags #coronavirus, #COVID-19, or #COVID have exploded. Recently, researchers are mining tweets to gain a better understanding of the discussions surrounding the Zika virus. Miller et al. (2017) combined natural language processing and machine learning techniques to analyze topic distributions associated with four characteristics of the Zika virus: symptoms, transmission, prevention, and treatment (Miller et al., 2017).

Image Analysis and City-Related TGC

Deng et al. (2019) conducted a two-stage destination image (DI) analysis, which included a qualitative state for providing relevant structures (attributes) and a quantitative stage to measure these structures (Deng et al., 2019). Compared to text, images have a greater impact on human memory. Therefore, photos are a powerful medium for destination professionals. They are visual cues shared between experienced and inexperienced people. They reflect what people experience at a destination. Therefore, they are a product of people’s travel experiences (Hunter, 2016). Content included or omitted from photos has the power to shape people’s perceptions. Early studies that used photos and visual content to analyze DIs mainly focused on one or several destinations. Those that adopted qualitative approaches to analyze unstructured data were extremely time-, resource-, and labor-intensive (Xiao et al., 2022). In recent years, the emergence of online platforms and digital technologies has led to the explosion of digital content. This content, also referred to as big data, can be analyzed using statistical analysis and machine learning technologies. The prevalence of the Internet has led to the study of TGC, which is defined as photos or static visual content of travel and tourism created and shared by people over the Internet.

People share their travel experiences online through social media (Taecharungroj & Mathayomchan, 2019), leading to the rapid accumulation of text, images, and videos online (Mak, 2017). Deng and Li (2018) asserted that TGC had become a more reliable source of video content than that produced by destination management/marketing organizations (DMO) (Deng & Li, 2018). Therefore, TGC-related studies have exploded in popularity in recent years. Social media data provide a new way to understand DI (Z. Zhao et al., 2018). TGC refers to destination-related content posted by Internet users voluntarily that influences other users. Compared to DMO content, TGC, which is typically user-centered, is more reliable, and previous studies have found that TGC, including social media posts (Y. Zhao et al., 2019) and online discussions (M. T. Liu et al., 2021), more accurately represents DIs than DMO content.

Compared with traditional sociological techniques, analyzing social media content is a new and cost-effective way of studying image perceptions. By examining and sorting photo content, researchers can identify the traits, similarities, and differences of different cities. Salesses et al. (2013) examined thousands of geotagged photos of New York, Boston, Linz, and Salzburg to compare the safety and unique characteristics of these cities (Salesses et al., 2013). Liu et al. (2016) applied a deep learning technique to sort photos posted on Flickr. The researchers then conducted a statistical analysis of the images of seven classic cities to determine their city images (L. Liu et al., 2016). Long and Zhou (2017) analyzed the metadata of photos of 24 Chinese cities posted on Flickr to determine their traits and similarities (Long & Zhou, 2017). Flicker is one of the most popular photo storage platforms. Many recent studies have examined the content of Flickr for a variety of reasons, including analyzing people’s emotions in different cities (Ashkezari-Toussi et al., 2019), classifying events in different cities (Clarke & Hassanien, 2020), and examining travel and tourism activities (Nechita et al., 2019).

Zhou et al. (2018) surveyed 10 US cities and found that despite 3 to 8 times as many tourists as residents, residents contribute more photos than tourists on average (Zhou et al., 2018). Yuan and Medel (2016) combined Google Cloud Vision AI and LDA to convert visual information into textual information (tags; (H. Yuan et al., 2018)). LDA results showed that in 12 countries, 85% of DIs taken were by residents (Y. Yuan & Medel, 2016). The survey suggested that Flickr users were a mix of tourist and citizens.Taecharungroj and Mathayomchan (2020) was the first study to combine Vision AI and topic modeling to research cities (Taecharungroj, 2019; Taecharungroj & Mathayomchan, 2020). Previous studies on DIs found that users of Flickr prefer to upload images of popular landmarks or city centers and show increased interest in cultural and entertainment destinations.

Topic Analysis and Latent Dirichlet Allocation

With services supporting digital platforms having more and more access to big data, the importance of text mining for business management is clear. Kumar et al. (2021) analyzed the use of text mining methods such as sentiment analysis, topic modeling, and natural language processing in reputed business management journals (Kumar et al., 2021). They then used text mining and topic association analysis, applying visualization tools to understand major research themes and relevance. The findings highlight that topics including social media analysis, market analysis, and competitive intelligence dominate the research on text mining in business management.

Social Media Analytics (SMA) has become an important tool for organizations to gain insights and improve performance and productivity in various areas. However, the field of SMA is becoming increasingly diverse and thus benefits from a comprehensive understanding of its trends and approaches. Rathore et al. (2017) provide a thorough review of the empirical evidence and future research directions in SMA, focusing on applications across domains, including industry, data mining, use cases, and user applications (Rathore et al., 2017). In the studies reviewed, public administration and non-essential consumer sectors are the main areas of application, with Twitter data being the most commonly used source for analysis and categorization techniques and regression models being the most popular analytic methods used.

Stone et al. (2021) explored the relationship between the gender of leadership and social media communication styles (Stone & Can, 2021). The study examined gender language differences in the Twitter feeds of the 100 most populous cities in the United States, with the goal of assessing whether mayors’ tweeting styles conformed to those recognized gender language differences. The influence of a council’s gender composition on tweeting style was also examined, and an awareness of gender differences helped mayors and their teams produce messages for different audiences.

The application of big data analysis in tourism management research is on the rise (Law et al., 2020). However, most studies focus on applying existing methods to TGC (Chang et al., 2020). Many studies have contributed to academia by introducing unique approaches and models, such as sentiment analysis (Geetha et al., 2017), topic modeling (Guo et al., 2017), and clustering and classification (Morosan & DeFranco, 2019). Despite the immense potential of photos, a major limitation of content analysis is the need for the manual formulation of categories and attributes, which renders content analysis ineffective for analyzing vast amounts of images. In recent years, scholars have applied machine learning to the analysis of big data. For example, J. Zhang (2018) applied topic modeling and LDA to identify destination attributes from travel blogs. LDA was used to extract, identify, and analyze the attributes of hotels (H. Zhang et al., 2018).

A review of existing literature on the analysis of city imagery through online text revealed that most studies focused on the calculation and measurement of city imagery (Chan et al., 2021), testing city imagery theories (Priporas et al., 2020), and the analysis and measurement of city imagery cases (Li et al., 2015). Most of these studies used word processing software to process the city-related text and applied the results to determine the relationships between various word frequencies and city imagery. In recent years, many image-related studies have turned to social media to collect data and examine the value of image connections on social media (Mariani et al., 2016; Molinillo et al., 2018). For example, Munar et al. (2014) collected data from TripAdvisor and Flickr to determine the interactive relationships between temporal structure, scope of communication, social value, and content richness (Munar & Jacobsen, 2014).

In terms of methodology, topic modeling has been combined with a number of natural language processors to analyze user-generated online contributions (Chaudhari & Thakkar, 2020), travel recommendation systems (Nitu et al., 2021), and destination similarities (J. Kim et al., 2017). Rahmani et al. (2018) applied topic modeling to analyze user-generated long-form travel content and determine traveler experiences. The researchers also combined topic modeling and exploratory analysis to test theories related to the phenomenology of tourism experience (Rahmani et al., 2018).

In addition to analyzing cities, LDA has been used to analyze modeled topics. For example, Ilyas et al. (2020) examined Brexit-related tweets and discovered a link between Brexit sentiment and the GBP exchange rate (Ilyas et al., 2020). H. Zhang et al. (2018) used topic modeling to mine tweets and elucidate consumers’ attitudes toward vaccines (H. Zhang et al., 2018). Doogan et al. (2020) applied LDA to explain public perception of nonpharmaceutical interventions (NPIs) for COVID-19. The researchers highlighted keyword problems corresponding to the topics of six countries. The results served as a reference for the formulation of NPI strategies (Doogan et al., 2020). LDA topic modeling has also been used to examine Twitter users, identify product features, and quantify various topics (Jeong et al., 2019). Opinion searches are a mechanism for social media operators to collect user views and feedback on specific issues.

Methods

The focus of this study was to analyze the image elements of Taipei City before and during the pandemic. The data analyzed in this paper was derived from Facebook. Relevant data on city images were collected based on Facebook hashtags, posts, and user information. This study retrieved the TGC using Python API. Big data mining is a technique that can minimize repetitive mining (Sohrabi & Barforoush, 2012), ensure the consistency of mined content (Anwar & Abulaish, 2014), produce technical frameworks for network interaction, and articulate user interaction models (Pachidi et al., 2014). Crowd data analysis, clickstream analysis, and classification analysis are the three most popular forms of data mining. This paper combined topic modeling and multiple linear regression (Figure 1).

Figure 1.

Data analysis flow chart.

Data Search and Collection

The onset of the 21st century witnessed the emergence of a profound global health crisis with the rapid proliferation of COVID-19, transforming it into a pervasive pandemic of unprecedented scale. Originating in late 2019, the contagion swiftly traversed international borders, catalyzing a formidable challenge to public health infrastructure and societal resilience worldwide. Notably, by January 13, 2020, the epidemic had disseminated across contiguous territories, including Thailand, Japan, and Korea, underscoring the relentless transnational transmission dynamics of the pathogen.

This research endeavors to delineate temporal differentials in the thematic composition of urban-centric discourse within the domain of social media against the backdrop of the COVID-19 pandemic. With the delineation of the temporal demarcation at the threshold of 2020, a comparative analysis is conducted on the content of social media posts spanning the temporal expanse of 2019 and 2020. The objective therein is to discern and evaluate the presence of discernible disparities in the thematic constituents of city-related discourse between the aforementioned temporal epochs, thereby elucidating potential shifts in societal preoccupations and concerns engendered by the exigencies of the prevailing public health crisis.

In this study, it analyzed users’ Facebook posts relating to Taipei City during the peak of the COVID-19 pandemic between 2019 and 2020. The first step was to search for relevant posts using hashtags. We used RapidMiner to search for these posts. This paper, therefore, used Python to collect post data associated with #Taipei (Lehmann et al., 2012). This study sorted the collected data based on content, type, time, likes, shares, and comments. A total of 29,594 posts (pre-pandemic 16,924, Peri-pandemic 12,670) associated with the image uploaded between 1 January 2019 and 31 December 2020 were collected. All of the data was saved as .csv files.

Text Preprocessing and Data Cleaning

The raw data collected from Facebook were preprocessed using RapidMiner. The data underwent two levels of preprocessing: preparation and preprocessing. First, the raw data of the Excel file were imported into RapidMiner for extraction, conversion, and loading. The standard operators used in this process included “Select Attributes,”“Text in Name,” and “Process Data in File.” After the data was prepared, they were converted to files for subsequent processing. The subsequent preprocessing procedures were tokenization, case conversion, stopword removal, and stemming.

To ensure the validity of the input data, the next step was cleaning the data, which involves several steps. The first step was removing all non-English text. Next, all duplicated content was omitted. Then, post formatting was adjusted and converted into a bag-of-words (BoW) corpus. Finally, words with no semantic meaning, such as “the,”“is,” and “on,” were omitted to improve data quality. In addition, the content was automatically deleted if it did not contain the words “Taipei” or “Taipei City.”

Image Analysis and Tag Testing

Google Cloud Vision AI was employed to extract feature tags from every image and facilitate image analysis. It has been used in many recent studies for image analysis (Hosseini et al., 2017). Google Cloud Vision API allows developers to encapsulate machine learning in a REST API for data extraction. Therefore, it can be used for image classification, object detection, and word recognition. Google Cloud Vision API is able to analyze uploaded images and image sets stored using Google’s cloud services through Jumptuit, automatically detect figurative elements, such as people and objects, in images or video, and categorize the data in learned databases. Next, this paper carried out tag detection, in which objects and features were extracted from the images. This study set the maximum and minimum tag count per image to 10 and 0. Using these parameters, it collected traditional/classic images from TGC and their element features (Galí et al., 2017).

LDA Modeling

In RapidMiner, topic modeling was achieved using the Operator Toolbox extension. The LDA method was employed to detect hidden topics. Because topic modeling is a form of unsupervised learning, all data were preprocessed before grouping. The topics were then grouped into meaningful topics after topic annotation, in which each image was allocated to the most probable topic. Each topic was composed of tags of varying degrees of relevance, with the most relevant tags for each topic displayed and used for topic naming. LDA is a highly efficient, unsupervised machine learning algorithm (Y. Wang & Taylor, 2019) that can be applied to carry out a number of research objectives (Taecharungroj & Mathayomchan, 2020). LDA is a three-level hierarchical Bayesian probability model, in which images and text are assigned a probability distribution topic, and each topic represents a probability distribution keyword (or tag).

import pandas as pd

from sklearn.feature_extraction.text import CountVectorizer

from sklearn.decomposition import LatentDirichletAllocation

data = pd.read_csv('facebook_posts.csv')

documents = data['message'].tolist()

vectorizer = CountVectorizer()

X = vectorizer.fit_transform(documents)

lda = LatentDirichletAllocation(n_components=n_topics)

lda.fit(X)

feature_names = vectorizer.get_feature_names()

for topic_idx, topic in enumerate(lda.components_):

print("Topic %d:" % (topic_idx + 1))

print(" ".join([feature_names[i] for i in topic.argsort()[:-10 - 1:-1]])) print()

new_post = "This is a new post about technology and social media"

new_post_vectorized = vectorizer.transform([new_post])

new_post_topic = lda.transform(new_post_vectorized)

print("New Post Topic:", new_post_topic.argmax() + 1)

Alpha (α) and beta (β) are corpus-level hyperparameters that are sampled once in the process. A smaller α-value represents fewer topics per image, while a smaller β-value represents fewer tags per topic. The α-value and β-value were set at .1 and .001 (Subeno et al., 2018). Theta (θ) is a document-level (photo-level) variable that refers to the probability of certain topics appearing in an image (where the combined probability is equal to 1). After LDA modeling, KNIME can be applied to assign the most probable topic to each image and sort images into topics that help to depict city compositions. Z and w are word-level (tag-level) variables for each photo, where w is the tag and Z is the topic assigned to it. Every word in the document is assigned to a topic. This assignment is determined using conditional probability estimates. After determining the probability of each word, the words were assigned to different topics. Only words with a probability value equal to or greater than this threshold (min count = 5, threshold = 100) were assigned to a corresponding topic.

Topic Assessment and Selection

This study employed two methods for model evaluation. The first was manual evaluation, in which we analyzed the first Nth number of words in a topic. This method was also used to analyze words without topics. The second was the application of quantitative indices: perplexity and conformity. Perplexity is a measure for comparing probabilistic models. It represents the predictive power of the probabilistic model. Generally, a lower perplexity value (approximating 0) denotes a more favorable model function. By comparison, conformity is a measure of semantic similarity between words in a topic. The conformity value ranges between 0 and 1, where a higher conformity value denotes a more favorable topic model.

Previous studies have demonstrated the feasibility of machine learning and LDA in identifying topics from big data. To close the knowledge gap in DI research and highlight the potential of machine learning, this study introduced a consolidated method for analyzing TGC text and images. The topic models were probabilistic models that can be applied to the Bayesian hieratical analysis of raw text to determine the underlying semantic structures (Shafqat & Byun, 2020). In travel and tourism research (Lin et al., 2021), topic modeling is used to discover the abstract topics embedded in the text. These text and images are then used to determine relevance. The extracted destination text and images can be used in destination analysis or travel personalization/recommendation.

import pandas as pd

from sklearn.feature_extraction.text import CountVectorizer

from sklearn.decomposition import LatentDirichletAllocation

data = pd.read_csv('facebook_posts.csv')

aipei_data = data[data['message'].str.contains('Taipei')]

documents = taipei_data['message'].tolist()

vectorizer = CountVectorizer()

X = vectorizer.fit_transform(documents)

lda = LatentDirichletAllocation(n_components=n_topics)

lda.fit(X)

feature_names = vectorizer.get_feature_names()

for topic_idx, topic in enumerate(lda.components_):

print("Topic %d:" % (topic_idx + 1))

print(" ".join([feature_names[i] for i in topic.argsort()[:-10 - 1:-1]])) print()

new_post = "This is a new post about Taipei city and its attractions"

new_post_vectorized = vectorizer.transform([new_post])

new_post_topic = lda.transform(new_post_vectorized)

print("New Post Topic:", new_post_topic.argmax() + 1)

import pandas as pd

from sklearn.feature_extraction.text import CountVectorizer

from sklearn.decomposition import LatentDirichletAllocation

import pyLDAvis.sklearn

data = pd.read_csv('facebook_posts.csv')

taipei_data = data[data['message'].str.contains('Taipei')]

documents = taipei_data['message'].tolist()

vectorizer = CountVectorizer()

X = vectorizer.fit_transform(documents)

lda = LatentDirichletAllocation(n_components=n_topics)

lda.fit(X)

pyLDAvis.enable_notebook()

vis = pyLDAvis.sklearn.prepare(lda, X, vectorizer)

pyLDAvis.display(vis)

Text Topics and Interactivity

Besides elucidating the co-occurrence of attractions, observing implicit semantic information contained in travel and tourism reviews undoubtedly helps to model correlations through different lenses. Attraction image is a message type. It reflects people’s impressions of certain attractions and can be categorized into several topics, including destination (beaches), environment (weather and public health), and experience (Ankarali & KÜLcÜ, 2020). Attractions have a strong correlation when they have similar texts. These texts can be found in travel and tourism reviews and blog posts. Therefore, a model can be developed to extract attraction text and delineate different semantic dimensions. Large amounts of unstructured TGC are generated on travel and tourism websites and social media platforms each day (Chang et al., 2020). These messages are extremely valuable to various city image stakeholders (Huang et al., 2021). However, there is simply too much TGC online for manual analysis (Filieri, 2016).

Image Topics and Interactivity

Adding images to text facilitates presentation. Taking advantage of the symbolic representations to concretize images fortifies the value of images in city dialogs (Hunter, 2016). Michaelidou et al. (2013) analyzed image data to elucidate people’s perceptual responses to city imagery (Michaelidou et al., 2013). Schroeder et al. (2015) compared online images generated by marketers in a city to verify the importance of image content to city marketing (Schroeder & Pennington-Gray, 2015). Social media sites provide a convenient platform (Lo & McKercher, 2015; Syed-Ahmad et al., 2013) for sharing, viewing, and responding to travel and tourism images (Vu et al., 2015). Today, social media sites provide functions to share images to multiple platforms (Hunter, 2016). In terms of examining image cues, Michaelidou et al. (2013) examined photos posted online to elucidate people’s perceptions of tourism city images. The researchers compared the images of the same city to verify perceptional differences (Michaelidou et al., 2013).

Results

Text Topics and Visual Results of Taipei City

In this study, it carried out topic modeling after data extraction and preparation using the pyLDAvis library. Using this library was the first step in the topic modeling process. It visualized different topics from an image comprising multiple circles. Each circle represented a topic. The distance between the circles represented the correlation between the topics, and the sizes of the circles represented the data size of the topic.

The results of this paper found that the textual topics before the pandemic were mostly related to city-related tourism (Table 1). The topics included “event,”“urban,”“recreation,”“tour,” and “hotel.” In particular, “breakfast,”“mall,”“temple,” and “architecture” were the most common textual elements. By comparison, the peri-pandemic textual topics shifted to nature-related tourism. Topics included “landscape,”“natural,”“plant,”“travel,” and “environment.” Keywords such as “happy,”“mountain,” and “groundcover” were used to attract users.

Table 1.

Pre-Pandemic Text Topic.

Topic	Weights
1	0.036“photography” + 0.027“eyelash” + 0.027“event” + 0.027“urban” + 0.018“card” + 0.018“happy” + 0.018“branch” + 0.018“account” + 0.018“saturday” + 0.018“PASAY”
2	0.159“automotive” + 0.113“wheel” + 0.081“vehicle” + 0.042“motor” + 0.040“design” + 0.037“alloy” + 0.030“hubcap” + 0.030“system” + 0.028“synthetic” + 0.028“rubber”
3	0.027“yellow” + 0.027“tour” + 0.027“hotel” + 0.027“kitchen” + 0.014“design” + 0.014“natural” + 0.014“black” + 0.014“white” + 0.014“building” + 0.014“carmine”
4	0.033“landscape” + 0.017“shop” + 0.017“electric” + 0.017“winter” + 0.017“property” + 0.017“material” + 0.017“fashion” + 0.017“thumb” + 0.017“azure” + 0.017“wrist”
5	0.047“architecture” + 0.037“plant” + 0.036“solar” + 0.028“building” + 0.024“temple” + 0.024“free” + 0.024“taipei” + 0.024“PESOS” + 0.014“water” + 0.013“cloud”
6	0.042“design” + 0.031“electric” + 0.017“fashion” + 0.017“card” + 0.017“mall” + 0.017“advertising” + 0.017“PACKAGE” + 0.017“coloring” + 0.017“publication” + 0.017“magenta”
7	0.048“architecture” + 0.033“temple” + 0.017“building” + 0.017“travel” + 0.017“leisure” + 0.017“event” + 0.017“chinese” + 0.017“japanese” + 0.017“facade” + 0.017“place”
8	0.035“design” + 0.035“advertising” + 0.035“happy” + 0.023“airport” + 0.023“mall” + 0.023“royal” + 0.023“PACKAGE” + 0.023“CITY” + 0.023“natural” + 0.023“http”
9	0.057“plant” + 0.057“forest” + 0.029“design” + 0.015“terrestrial” + 0.015“landscape” + 0.015“natural” + 0.015“flowering” + 0.015“biome” + 0.015“branch” + 0.015“broadleaf”
10	0.074“landscape” + 0.045“plant” + 0.038“mountain” + 0.037“natural” + 0.029“horizon” + 0.019“new” + 0.019“credit” + 0.019“mountainous” + 0.019“manila” + 0.019“trip”
11	0.072“kitchen” + 0.049“appliance” + 0.037“design” + 0.037“property” + 0.025“transfer” + 0.025“city” + 0.025“visit” + 0.025“option” + 0.025“hotel” + 0.025“lunch”
12	0.026“breakfast” + 0.026“office” + 0.026“travel” + 0.014“sleeve” + 0.014“cuisine” + 0.014“dishware” + 0.014“comfort” + 0.014“tableware” + 0.014“produce” + 0.014“recipe”

Pre-pandemic text topics in quadrant 1 are dominated by indoor travel keywords, such as “city,”“hotel,”“shop,”“leisure,”“landscape,” and “automotive,” and quadrant 2 includes topics like “mall” and “event” (Figure 2).

Figure 2.

Pre-pandemic text topic.

Peri-pandemic, on the other hand, text topics in quadrant 1 include “landscape” and “natural,” while quadrant 4 includes outbound travel-related key themes of “travel,”“automobile,” and “recreation” (Figure 3).

Figure 3.

Peri-pandemic text topic.

Image Topics and Visual Results of Taipei City

The results of this study showed that the main graphical topics before the pandemic were “service,”“brand,”“cuisine,”“fashion,” and “cityscape.” In particular, “design,”“luxury,”“landmark,” and “urban” were the most common images used to convey the charm of Taipei City (Table 2). By comparison, the peri-pandemic graphical topics shifted to “landscape,”“leisure,”“natural,”“nature,” and “tourism,” and images transitioned to more nature-based tourism images, such as “leisure,”“urban,”“plant,” and “ocean.” Images associated with independent travel, such as “vehicle,”“photography,” and “comfort,” also increased, suggesting that messaging shifted to destinations away from crowds.

Table 2.

Pre-Pandemic Image Topic.

Topic	Weights
1	0.039“personal” + 0.038“rolling” + 0.038“luxury” + 0.025“design” + 0.022“automotive” + 0.020“lighting” + 0.020“asphalt” + 0.017“wheel” + 0.016“service” + 0.016“collar”
2	0.043“event” + 0.035“design” + 0.035“equipment” + 0.034“recreation” + 0.028“advertising” + 0.018“personal” + 0.018“protective” + 0.018“graphic” + 0.018“brand” + 0.018“stage”
3	0.054“automotive” + 0.052“design” + 0.043“flooring” + 0.039“kitchen” + 0.029“lighting” + 0.025“appliance” + 0.023“event” + 0.021“vehicle” + 0.021“floor” + 0.019“stain”
4	0.031“design” + 0.029“spring” + 0.029“facade” + 0.022“comfort” + 0.022“building” + 0.018“estate” + 0.015“grass” + 0.015“cloud” + 0.015“event” + 0.015“fashion”
5	0.034“design” + 0.031“produce” + 0.031“comfort” + 0.028“group” + 0.027“foods” + 0.020“cuisine” + 0.019“fashion” + 0.019“handwriting” + 0.019“advertising” + 0.018“sweetness”
6	0.060“design” + 0.027“graphic” + 0.026“fashion” + 0.025“event” + 0.019“product” + 0.019“advertising” + 0.018“magenta” + 0.017“electric” + 0.017“happy” + 0.014“illustration”
7	0.154“automotive” + 0.097“vehicle” + 0.077“wheel” + 0.044“design” + 0.041“tread” + 0.039“exterior” + 0.038“alloy” + 0.038“motor” + 0.033“hubcap” + 0.032“system”
8	0.042“fashion” + 0.041“event” + 0.033“leisure” + 0.025“human” + 0.021“travel” + 0.021“electric” + 0.017“magenta” + 0.017“accessory” + 0.017“luggage” + 0.017“sports”
9	0.034“advertising” + 0.028“bottle” + 0.028“brand” + 0.027“poster” + 0.021“design” + 0.021“fluid” + 0.021“illustration” + 0.021“cartoon” + 0.021“animated” + 0.021“character”
10	0.055“space” + 0.036“human” + 0.036“public” + 0.035“temple” + 0.034“morning” + 0.030“happy” + 0.029“expression” + 0.029“cloud” + 0.029“settlement” + 0.024“travel”
11	0.094“plant” + 0.070“landscape” + 0.047“natural” + 0.031“nature” + 0.025“grass” + 0.023“leisure” + 0.023“blossom” + 0.020“travel” + 0.020“flowering” + 0.020“petal”
12	0.080“block” + 0.067“condominium” + 0.067“cityscape” + 0.055”landscape” + 0.048“urban” + 0.044“design” + 0.043“horizon” + 0.039“architecture” + 0.038“landmark” + 0.034“estate”

From the cross-quadrant topic grouping, it is obvious that pre-pandemic text topics are dominated by keywords associated with shopping or food, such as “luxury” and “advertising” in quadrant one and “fashion,”“estate,” and “cuisine” in quadrant two. Meanwhile, quadrant four contains words like “brand” and “cuisine” (Figure 4).

Figure 4.

Pre-pandemic image topic.

Peri-pandemic, text topics in quadrant 1 include “landscape” and “comfort,” while quadrant 4 contains “nature” and “travel,” where key themes of “natural” and “brand” are more obvious (Figure 5).

Figure 5.

Peri-pandemic text topic.

Discussion

LDA has been used in many previous studies on travel and tourism to examine various topics, such as hotels (Sutherland et al., 2020), restaurants (Jia, 2020), theme parks (Luo et al., 2020), national parks (J. Wang et al., 2021), and travel routes (Law et al., 2020). LDA was applied in their study to explore people’s perceptions and expectations of cities. TGC is online content produced through co-creation. It is dynamic, interactive, non-linear, and non-commercial. TGC takes many forms, including people’s comments and reviews (Storbacka et al., 2016), their interactions with others (Harrigan et al., 2017), and their collective experiences (Prebensen & Xie, 2017). TGC is likely to have varying degrees of influence on people’s perceptions of cities (Ferrer-Rosell et al., 2017).

In terms of the presentation of city imagery, image presentations are the most effective way to attract mass attention and elicit image perception and impression. Images freely transform personal experiences and help shape unique perceptions. Therefore, incorporating symbolic functions in images helps elicit perceptional responses from viewers. For potential viewers, social media images can increase familiarity and trust (Krumm et al., 2023). Therefore, carrying out visual analyses of city imagery can help managers determine the constructs of city image, including the textual and graphical dimensions or the abstract and figurative dimensions (H. Kim & Stepchenkova, 2015). Subsequently, managers can enhance message interactions by taking advantage of shares and comments on social media (Xu et al., 2023), content sharing (Stepchenkova & Zhan, 2013), and mass chats. However, city image is affected by people’s subjective perceptions. Al-Ghamdi et al. (2015) highlighted the importance of socioeconomic factors, such as customs, history, and urban functions, in city images (Al-Ghamdi & Al-Harigi, 2015). Fedorova (2016) asserted that city images contain visible/tangible perceptual elements and social and cultural meaning embedded in social activity (Fedorova, 2016).

User-generated city information not only conveys city images but also contains impressions of different individuals and groups and reflects the current social culture. Therefore, TGC is useful in projecting city image. It can also be used by marketers to engage in electronic word-of-mouth (eWOM) (Crump et al., 2023) and TGC marketing (Brown et al., 2019). Subsequently, the content of influencers can be consolidated to increase the exposure of city messages (Dunne & Hanrahan, 2017; Shankman, 2014). By consolidating the common ideas within the complementary framework of text and pictures, past researchers were able to create new city reflections from an originally ambiguous city image. The interactive generation of TGC has unintentionally become a diverse city culture. Therefore, it can be used to analyze the source of city image elements (Hunter, 2016; Mak, 2017).

Text Topics

According to the results of this study, forecast topics include “event,”“urban,”“recreation,”“tour,” and “hotel.” In particular, “breakfast,”“mall,”“temple,” and “architecture” were the most common textual elements. Studies that analyze online content are referred to as studies of city projection. Some studies report the sum of all visual attributes communicated by DMOs and other stakeholders. These projections are important because the visual and verbal messages in promotional materials represent destinations to potential visitors. Many recent studies have adopted content analysis methods to analyze various destinations, including those in Seoul (Hunter, 2016), Eastern Taiwan (Mak, 2017), and Zhangjiajie (Z. Zhao et al., 2018). Content analysis allows researchers to determine DI attributes from specific topics. This approach has been shown to help project topic concepts through visual elements (Deng et al., 2019).

City image is often used as an overarching term for a set of tangible resources and characteristics, including tourism infrastructure, landscape, cultural heritage, and local elements and features (Fernández-Cavia et al., 2017). This study found that peri-pandemic, text topics shifted towards nature-related tourism and included topics like “landscape,”“natural,”“plant,”“travel,” and “environment.” Keywords such as “happy,”“mountain,” and “groundcover” have been used to attract users, and weaving these elements into city identity can directly project an image of the city to the public (Mariné-Roig & Clavé, 2016; Marine-Roig & Ferrer-Rosell, 2018). Once city image is embedded in the minds of the public, the physical characteristics and resources of the city become cognitive information that helps promote the city to potential visitors (Xiao et al., 2022), consequently influencing people’s emotions and needs (Villamediana et al., 2019).

The composition of a city’s image is complex, diverse, and dynamic and cannot be explained by simply examining city constructs. For example, city culture is conveyed through everyday experiences (Allam & Newman, 2018). In this way, culture can be observed in all aspects of life, and experiences are affected by the shared attitudes of individual groups. Cultural identity is then formed through common symbols, text, and messages that contain shared beliefs, customs, and values. City culture can therefore be seen as the combination of art and experiences and represents the significance of culture to individuals and groups. Both functional and symbolic interpretations can create dialogs about culture. These processes are similar to how individuals contribute to the development of culture by participating in cultural events. Evaluating the individual properties of a city’s image thus helps researchers uncover what real factors affect a city’s image.

City image refers to individuals’ beliefs and impressions of a city, is co-created by city organizations and residents, and is formed by consolidating the knowledge, emotions, and perceptions of individuals (Xiao et al., 2022). Visual elements are useful for clearly projecting crucial components of city image; they maintain symbolic meaning and are an example of how the use of information can help present the complexities of a city’s image (Hunter, 2016). Visual elements thus significantly impact people’s travel decisions and satisfaction with a city. City identity and value can also be examined to measure community engagement. Previously, scholars used city identity and the creation of city value to determine city image. Researchers also found that community engagement had a significant impact on city image.

Image Topics

Social media provides a high level of spatial and temporal resolution in many urban centers. Through image analysis, this study determined that the main graphical topics before the pandemic were “service,”“brand,”“cuisine,”“fashion,” and “cityscape.” In particular, “design,”“luxury,”“landmark,” and “urban” were the most common image types used to convey the charm of Taipei City. Similarly, Hunter (2016) conducted a semiotic analysis of the cultural characteristics of Okinawa and Kinmen by classifying image characteristics into indices, icons, and symbols (Hunter, 2016). As photo sharing continues to rise in popularity, user responses will become increasingly useful in examining city image. Galí and Donaire (2015) also examined photos taken by social media users to identify their perceptions of tourism cities (Galí & Donaire, 2015), and Liesch (2011) found that both casual and professional photos can serve as windows into tourism imagery. The results of this study support the idea that visual analysis of city photos can effectively illustrate the dimensions of tourism cities’ online image (Liesch, 2011).

Peri-pandemic, graphical topics shifted to “landscape,”“leisure,”“natural,”“nature,” and “tourism,” and images transitioned to more nature-based tourism images, such as those associated with “leisure,”“urban,”“plant,” and “ocean.” Images connected to independent travel, such as “vehicle,”“photography,” and “comfort,” also became more common, suggesting that messaging shifted to destinations further away from crowds. The results confirm that city-related photos can be easily converted into visuals that attract user attention and promote city imagery. This is similar to how travelers convey their travel experiences through photographic records. In constructing these travel memories, important symbols form that can evoke a sense of urgency (Lim et al., 2021). Users often form a sense of familiarity with destinations when viewing photos on social media, and these feelings inspire a desire to visit. To captivate viewers and enhance sensory pleasure, photos are often accompanied by descriptive text. Lim et al. (2021) assert that people’s awareness of tourism cities and their motivation to visit these cities could be reinforced by presenting their reflections with images. Kar and Dwivedi (2020) suggested that travel agencies take advantage of photos and other visual aids to enhance the attractiveness of destinations and add to users’ pre-visit experiences (Kar & Dwivedi, 2020).

City photos and images help viewers form opinions of the city and affect their travel decisions. Photos are thus considered key to the success of travel and tourism imagery and messaging. Given that travel is a unique visual experience, photos can highlight destination features, convey travel messages, and help promote city image. City marketers have become accustomed to using photos to visually represent their cities and elicit desired responses. Visual imagery not only attracts potential audiences but also facilitates the promotion of tourism cities.

Post-Pandemic Recommendations

Data-driven research based on big data uses large datasets containing structured and unstructured data from different platforms, which presents challenges in the Information Systems (IS) field. Computational methods such as sentiment mining, text mining, web science, and graphical analysis are useful for gaining insights.

The COVID-19 pandemic has had a drastic impact on how people perceive cities globally (Graham-Harrison & Smith, 2020). Although the short- and long-term effects of the pandemic on city images have yet to be determined, Kravchenko (2020) predicted that the images of domestic cities would recover more quickly than those of international cities. TripAdvisor.com and other travel and tourism platforms reflect this assessment. A recent hotspot analysis revealed that traffic in lesser-known or less accessible destinations is increasing and that identifying destination topics can help strengthen destination image and status in a highly competitive market (Kravchenko, 2020).

Gössling (2020) mentioned that countries should not rush recovery. Instead, they should take the opportunity to transform their city images into sustainable ones (Gössling et al., 2020). City managers can focus on promoting the distinct features of lesser-known destinations to disperse traffic to more popular and crowded destinations. In this way, cities can capitalize on rising trends while maintaining social distance requirements. For example, the Tourism Authority of Thailand has been working hard to promote secondary destinations, diverting international and domestic travelers to 55 secondary provinces. These efforts help the Thai government identify and promote typical and desirable destination attributes (e.g., the ocean, structures, and cuisines) and divert traffic to secondary destinations.

The Taipei City Government has always focused on creating an inclusive and refined tourism environment that features friendly and high-quality hospitality professionals to attract local and international tourists. It has also designed itineraries that take advantage of local features and cultures to attract tourists. The launch of the worry-free travel initiative in response to the COVID-19 pandemic has greatly increased tourism activity in Taipei and fueled national domestic tourism. The unique situation created by the pandemic has allowed people to better understand local cultures, embrace local assets, and find value in local cities.

Limitations

Due to the limitations of the social media API, this study was only able to collect data within a specific timeframe. Subsequently, the data may have been dominated or affected by viral topics. To strengthen data integrity, it carried out cross-comparisons over an extended period to verify the universality of the proposed model. In this paper, data were only collected from social media sites, which did not include sites like Twitter, LinkedIn, or YouTube. Therefore, it was unable to carry out cross-platform comparisons. Future studies could explore different approaches to examining DIs, such as reviewing online reviews or other social media posts, to complement the results of this study. Alternatively, researchers can consider investigating the effects of user types (e.g., residency and nationality) on relevant themes. Improving the verification process will undoubtedly enhance the quality of the proposed model and model predictions.

Footnotes

Author Contributions

The author contributed to the design and implementation of the research, to the analysis of the results and to the writing of the manuscript.

Declaration of Conflicting Interests

The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was funded by the Ministry of Science and Technology, Digital Humanities Program (MOST 110-2410-H-032-051).

Ethical Approval

This article does not contain any studies with human participants performed by any of the authors.

ORCID iD

Yulin Chen

Data Availability Statement

All data generated or analyzed during this study are included in this published article.

References

Al-ghamdi

S. A.

Al-Harigi

(2015). Rethinking image of the city in the information age. Procedia Computer Science, 65, 734–743.

Allam

Newman

(2018). Redefining the smart city: Culture, metabolism and governance. Smart Cities, 1(1), 4–25.

Ankarali

KÜLcÜ

. (2020). Topic modeling of twitter data. Bilgi Yönetimi, 3(1), 0–10.

Anwar

Abulaish

(2014). A social graph based text mining framework for chat log investigation. Digital Investigation, 11(4), 349–362. https://doi.org/10.1016/j.diin.2014.10.001

Ashkezari-Toussi

Kamel

Sadoghi-Yazdi

(2019). Emotional maps based on social networks data to analyze cities emotional structure and measure their emotional similarity. Cities, 86, 113–124. https://doi.org/10.1016/j.cities.2018.09.009

Brown

Gude

W. T.

Blakeman

van der Veer

S. N.

Ivers

Francis

J. J.

Lorencatto

Presseau

Peek

Daker-White

(2019). Clinical performance feedback intervention theory (CP-FIT): A new theory for designing, implementing, and evaluating feedback in health care based on a systematic review and meta-synthesis of qualitative research. Implementation Science, 14(1), 1–25. https://doi.org/10.1186/s13012-019-0883-5

Chan

Suryadipura

Kostini

(2021). City image: City branding and city identity strategies. Review of Integrative Business and Economics Research, 10, 330–341.

Chang

Y. C.

C. H.

Chen

C. H.

(2020). Using deep learning and visual analytics to explore hotel reviews and responses. Tourism Management, 80, 104129. https://doi.org/10.1016/j.tourman.2020.104129

Chaudhari

Thakkar

(2020). A comprehensive survey on travel recommender systems. Archives of Computational Methods in Engineering, 27, 1545–1571.

10.

Clarke

Hassanien

(2020). An evaluation of Toronto’s destination image through tourist generated content on twitter. International Journal of Customer Relationship Marketing and Management (IJCRMM), 11(2), 1–16.

11.

Crump

R. K.

Eusepi

Moench

Preston

(2023). The term structure of expectations. In Handbook of economic expectations (pp. 507–540). Academic Press.

12.

Deng

X. R.

(2018). Feeling a destination through the “right” photos: A machine learning model for DMOs’ photo selection. Tourism Management, 65, 267–278. https://doi.org/10.1016/j.tourman.2017.09.010

13.

Deng

Liu

Dai

(2019). Different cultures, different photos: A comparison of Shanghai’s pictorial destination image between East and West. Tourism Management Perspectives, 30, 182–192. https://doi.org/10.1016/j.tmp.2019.02.016

14.

Doogan

Buntine

Linger

Brunt

(2020). Public perceptions and attitudes toward COVID-19 nonpharmaceutical interventions across six countries: A topic modeling analysis of twitter data. Journal of Medical Internet Research, 22(9), e21419. https://doi.org/10.2196/21419

15.

Dunne

Hanrahan

(2017). Netnographic Research on Destination Image and Tourism Content Creators. Tourism and Hospitality Research in Ireland, 179.

16.

Entman

R. M.

(1993). Framing: Toward clarification of a fractured paradigm. Journal of Communication, 43(4), 51–58. https://doi.org/10.1111/j.1460-2466.1993.tb01304.x

17.

Fedorova

O. S.

(2016). The analysis of cultural, architectural and artistic factors of the city image formation. Journal of Siberian Federal University. Humanities & Social Sciences, 8, 1874–1879.

18.

Fernández-Cavia

Marchiori

Haven-Tang

Cantoni

(2017). Online communication in Spanish destination marketing organizations: The view of practitioners. Journal of Vacation Marketing, 23(3), 264–273. https://doi.org/10.1177/1356766716640840

19.

Ferrer-Rosell

Coenders

Marine-Roig

(2017). Is planning through the Internet (un)related to trip satisfaction? Information Technology and Tourism, 17(2), 229–244. https://doi.org/10.1007/s40558-017-0082-7

20.

Filieri

(2016). What makes an online consumer review trustworthy? Annals of Tourism Research, 58, 46–64. https://doi.org/10.1016/j.annals.2015.12.019

21.

K. W.

Liang

Saroha

Tse

Z. T. H.

Fung

I. C. H.

(2016). How people react to Zika virus outbreaks on Twitter? A computational content analysis. American Journal of Infection Control, 44(12), 1700–1702. https://doi.org/10.1016/j.ajic.2016.04.253

22.

Fung

I. C. H.

Tse

Z. T. H.

Cheung

C. N.

Miu

A. S.

K. W.

(2014). Ebola and the social media. The Lancet, 384(9961), 2207. https://doi.org/10.1016/S0140-6736(14)62418-1

23.

Gössling

Scott

Hall

C. M.

(2020). Pandemics, tourism and global change: a rapid assessment of COVID-19. ournal of sustainable tourism, 29(1), 1–20. https://doi.org/10.1080/09669582.2020.1758708

24.

Galí

Camprubí

Donaire

J. A.

(2017). Analysing tourism slogans in top tourism destinations. Journal of Destination Marketing and Management, 6(3), 243–251. https://doi.org/10.1016/j.jdmm.2016.04.004

25.

Galí

Donaire

J. A.

(2015). Tourists taking photographs: The long tail in tourists’ perceived image of Barcelona. Current Issues in Tourism, 18(9), 893–902. https://doi.org/10.1080/13683500.2015.1037255

26.

Geetha

Singha

Sinha

(2017). Relationship between customer sentiment and online customer ratings for hotels - An empirical analysis. Tourism Management, 61, 43–54. https://doi.org/10.1016/j.tourman.2016.12.022

27.

Gonzalez

Viana-Barrero

Acosta-Vargas

(2021, July 16–20). Text mining in smart cities to identify urban events and public service problems [Paper presentation]. Paper presented at the Advances in Artificial Intelligence, Software and Systems Engineering: Proceedings of the AHFE 2020 Virtual Conferences on Software and Systems Engineering, and Artificial Intelligence and Social Computing, USA.

28.

Graham-Harrison

Smith

(2020). What is the future for travel and migration in age of Covid-19? Retrieved May 12, 2020, from https://www.theguardian.com/world/2020/may/12/what-is-the-future-for-travel-and-immigration-in-age-of-covid-19.

29.

Guo

Barnes

S. J.

Jia

(2017). Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent dirichlet allocation. Tourism Management, 59, 467–483. https://doi.org/10.1016/j.tourman.2016.09.009

30.

Harrigan

Evers

Miles

Daly

(2017). Customer engagement with tourism social media brands. Tourism Management, 59, 597–609. https://doi.org/10.1016/j.tourman.2016.09.015

31.

Herath

Mittal

(2022). Adoption of artificial intelligence in smart cities: A comprehensive review. International Journal of Information Management Data Insights, 2(1), 100076.

32.

Hosseini

Xiao

Jaiswal

Poovendran

(2017, December). On the limitation of convolutional neural networks in recognizing negative images. In 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA) (pp. 352–358). IEEE.

33.

Hsu

C. H. C.

Song

(2014). A visual analysis of destinations in travel magazines. Journal of Travel and Tourism Marketing, 31(2), 162–177. https://doi.org/10.1080/10548408.2014.873308

34.

Huang

Zhu

Yao

(2021). Destination image recognition and emotion analysis: Evidence from user-generated content of online travel communities. The Computer Journal, 64(3), 296–304.

35.

Hunter

W. C.

(2016). The social construction of tourism online destination image: A comparative semiotic analysis of the visual representation of Seoul. Tourism Management, 54, 221–229. https://doi.org/10.1016/j.tourman.2015.11.012

36.

Ilyas

S. H. W.

Soomro

Z. T.

Anwar

Shahzad

Yaqub

(2020, June). Analyzing Brexit’s impact using sentiment analysis and topic modeling on Twitter discussion. In The 21st annual international conference on digital government research (pp. 1–6).

37.

Ilyas

S. H. W.

Soomro

Z. T.

Anwar

Shahzad

Yaqub

(2020). Analyzing brexit’s impact using sentiment analysis and topic modeling on twitter discussion [Paper presentation]. Paper presented at the ACM International Conference Proceeding Series.

38.

Jeong

Yoon

Lee

J. M.

(2019). Social media mining for product planning: A product opportunity mining approach based on topic modeling and sentiment analysis. International Journal of Information Management, 48, 280–290. https://doi.org/10.1016/j.ijinfomgt.2017.09.009

39.

Jia

S. S.

(2020). Motivation and satisfaction of Chinese and U.S. tourists in restaurants: A cross-cultural text mining of online reviews. Tourism Management, 78, 104071. https://doi.org/10.1016/j.tourman.2019.104071

40.

Kar

A. K.

Dwivedi

Y. K.

(2020). Theory building with big data-driven research – Moving away from the “What” towards the “Why”. International Journal of Information Management, 54, 102205. https://doi.org/10.1016/j.ijinfomgt.2020.102205

41.

Kim

Chen

J. S.

(2019). The memorable travel experience and its reminiscence functions. Journal of Travel Research, 58(4), 637–649. https://doi.org/10.1177/0047287518772366

42.

Kim

Stepchenkova

(2015). Effect of tourist photographs on attitudes towards destination: Manifest and latent content. Tourism Management, 49, 29–41. https://doi.org/10.1016/j.tourman.2015.02.004

43.

Kim

Fesenmaier

D. R.

(2017). Sharing tourism experiences: The posttrip experience. Journal of Travel Research, 56(1), 28–40. https://doi.org/10.1177/0047287515620491

44.

Kim

Vasardani

Winter

(2017). Similarity matching for integrating spatial information extracted from place descriptions. International Journal of Geographical Information Science, 31(1), 56–80.

45.

Kravchenko

Dakhno

Leshchenko

Tolstokorova

(2020). Machine Learning Algorithms for Predicting the Results of COVID-19 Coronavirus Infection. IT&I Workshops, 371–381.

46.

Kruger

Rootenberg

Ellis

(2013). Examining the influence of the wine festival experience on tourists’ quality of life. Social Indicators Research, 111(2), 435–452. https://doi.org/10.1007/s11205-012-0013-0

47.

Krumm

A. E.

Marcotte

George

B. C.

(2023). Model-based operative performance expectations for quantifying competency in general surgery. JAMA Surgery, 158(5), 515–521.

48.

Kumar

Kar

A. K.

Ilavarasan

P. V.

(2021). Applications of text mining in services management: A systematic literature review. International Journal of Information Management Data Insights, 1(1), 100008. https://doi.org/10.1016/j.jjimei.2021.100008

49.

Law

Leung

Chan

I. C. C.

(2020). Progression and development of information and communication technology research in hospitality and tourism: A state-of-the-art review. International Journal of Contemporary Hospitality Management, 32(2), 511–534. https://doi.org/10.1108/IJCHM-07-2018-0586

50.

Lehmann

Lalmas

Yom-Tov

Dupret

(2012). Models of user engagement. In User Modeling, Adaptation, and Personalization: 20th International Conference, UMAP 2012, Montreal, Canada, July 16–20, 2012. Proceedings 20 (pp. 164–175). Springer Berlin Heidelberg.

51.

Y. R.

Lin

Y. C.

Tsai

P. H.

Wang

Y. Y.

(2015). Traveller-generated contents for destination image formation: Mainland China travellers to Taiwan as a case study. Journal of Travel and Tourism Marketing, 32(5), 518–533. https://doi.org/10.1080/10548408.2014.918924

52.

Liang

Fung

I. C. H.

Tse

Z. T. H.

Yin

Chan

C. H.

Pechta

L. E.

Smith

B. J.

Marquez-Lameda

R. D.

Meltzer

M. I.

Lubell

K. M.

K. W.

(2019). How did Ebola information spread on twitter: Broadcasting or viral spreading? BMC Public Health, 19(1), 1–11. https://doi.org/10.1186/s12889-019-6747-8

53.

Liesch

(2011). Partnerships and photographs: Community conceptions of Keweenaw National Historical Park. Geographical Review, 101(4), 497–517. https://doi.org/10.1111/j.1931-0846.2011.00114.x

54.

Lim

Cho

G.-H.

Kim

(2021). Understanding the linkages of smart-city technologies and applications: Key lessons from a text mining approach and a call for future research. Technological Forecasting and Social Change, 170, 120893.

55.

Lin

M. S.

Liang

Xue

J. X.

Pan

Schroeder

(2021). Destination image through social media analytics and survey method. International Journal of Contemporary Hospitality Management, 33(6), 2219–2238. https://doi.org/10.1108/IJCHM-08-2020-0861

56.

Liu

Zhou

Zhao

Ryan

B. D.

(2016). C-IMAGE: City cognitive mapping through geo-tagged photos. GeoJournal, 81(6), 817–861. https://doi.org/10.1007/s10708-016-9739-6

57.

Liu

M. T.

Liu

K. L.

(2021). Using text mining to track changes in travel destination image: The case of Macau. Asia Pacific Journal of Marketing and Logistics, 33(2), 371–393.

58.

I. S.

McKercher

(2015). Ideal image in process: Online tourist photography and impression management. Annals of Tourism Research, 52, 104–116. https://doi.org/10.1016/j.annals.2015.02.019

59.

Long

Zhou

(2017). Pictorial urbanism: A new approach for human scale urban morphology study. Planners, 33(2), 54–60.

60.

Luo

J. M.

H. Q.

Law

(2020). Topic modelling for theme park online reviews: Analysis of Disneyland. Journal of Travel and Tourism Marketing, 37(2), 272–285. https://doi.org/10.1080/10548408.2020.1740138

61.

Mak

A. H. N.

(2017). Online destination image: Comparing national tourism organisation’s and tourists’ perspectives. Tourism Management, 60, 280–297. https://doi.org/10.1016/j.tourman.2016.12.012

62.

Mariani

M. M.

Di Felice

Mura

(2016). Facebook as a destination marketing tool: Evidence from Italian regional destination management organizations. Tourism Management, 54, 321–343. https://doi.org/10.1016/j.tourman.2015.12.008

63.

Mariné-Roig

Clavé

S. A.

(2016). Destination image gaps between official tourism websites and user-generated content. Information and Communication Technologies in Tourism 2016, 253–265.

64.

Marine-Roig

Ferrer-Rosell

(2018). Measuring the gap between projected and perceived destination images of Catalonia using compositional analysis. Tourism Management, 68, 236–249. https://doi.org/10.1016/j.tourman.2018.03.020

65.

McKercher

(2016). Towards a taxonomy of tourism products. Tourism Management, 54, 196–208. https://doi.org/10.1016/j.tourman.2015.11.008

66.

Michaelidou

Siamagka

N. T.

Moraes

Micevski

(2013). Do marketers use visual representations of destinations that tourists value? Comparing visitors’ image of a destination with marketer-controlled images online. Journal of Travel Research, 52(6), 789–804. https://doi.org/10.1177/0047287513481272

67.

Miller

Banerjee

Muppalla

Romine

Sheth

(2017). What are people tweeting about Zika? An exploratory study concerning its symptoms, treatment, transmission, and prevention. JMIR Public Health and Surveillance, 3(2), e7157. https://doi.org/10.2196/publichealth.7157

68.

Molinillo

Liébana-Cabanillas

Anaya-Sánchez

Buhalis

(2018). DMO online platforms: Image and intention to visit. Tourism Management, 65, 116–130. https://doi.org/10.1016/j.tourman.2017.09.021

69.

Morosan

DeFranco

(2019). Classification and characterization of US consumers based on their perceptions of risk of tablet use in international hotels: A latent profile analysis. Journal of Hospitality and Tourism Technology, 10(3), 264–285. https://doi.org/10.1108/JHTT-07-2018-0049

70.

Munar

A. M.

Jacobsen

J. K. S.

(2014). Motivations for sharing tourism experiences through social media. Tourism Management, 43, 46–54. https://doi.org/10.1016/j.tourman.2014.01.012

71.

Nath

Devlin

Reid

(2016). Expectation formation in case of newer hotels: The role of advertising, price, and culture. Journal of Travel Research, 55(2), 261–275. https://doi.org/10.1177/0047287514541003

72.

Nechita

Demeter

Briciu

V. A.

Varelas

Kavoura

(2019). Projected destination images versus visitor-generated visual content in Brasov, Transylvania. In Strategic Innovative Marketing and Tourism: 7th ICSIMAT, Athenian Riviera, Greece, 2018 (pp. 613–622). Springer International Publishing.

73.

Nitu

Coelho

Madiraju

(2021). Improvising personalized travel recommendation system with recency effects. Big Data Mining and Analytics, 4(3), 139–154.

74.

Pachidi

Spruit

Van De Weerd

(2014). Understanding users’ behavior with software operation data mining. Computers in Human Behavior, 30, 583–594. https://doi.org/10.1016/j.chb.2013.07.049

75.

Papathanassis

Knolle

(2011). Exploring the adoption and processing of online holiday reviews: A grounded theory approach. Tourism Management, 32(2), 215–224. https://doi.org/10.1016/j.tourman.2009.12.005

76.

Prebensen

N. K.

Xie

(2017). Efficacy of co-creation and mastering on perceived value and satisfaction in tourists’ consumption. Tourism Management, 60, 166–176. https://doi.org/10.1016/j.tourman.2016.12.001

77.

Priporas

C.-V.

Stylos

Kamenidou

I. E.

(2020). City image, city brand personality and generation Z residents’ life satisfaction under economic crisis: Predictors of city-related social media engagement. Journal of Business Research, 119, 453–463.

78.

Pruss

Fujinuma

Daughton

A. R.

Paul

M. J.

Arnot

Szafir

D. A.

Boyd-Graber

(2019). Zika discourse in the Americas: A multilingual topic analysis of Twitter. PLoS ONE, 14(5). https://doi.org/10.1371/journal.pone.0216922

79.

Rahmani

Gnoth

Mather

(2018). Tourists’ participation on Web 2.0: A corpus linguistic analysis of experiences. Journal of Travel Research, 57(8), 1108–1120. https://doi.org/10.1177/0047287517732425

80.

Rathore

A. K.

Kar

A. K.

Ilavarasan

P. V.

(2017). Social media analytics: Literature review and directions for future research. Decision Analysis, 14(4), 229–249. https://doi.org/10.1287/deca.2017.0355

81.

Salesses

Schechtner

Hidalgo

C. A.

(2013). The collaborative image of the city: Mapping the inequality of urban perception. PLoS ONE, 8(7), e68400. https://doi.org/10.1371/journal.pone.0068400

82.

Schroeder

Pennington-Gray

(2015). The role of social media in international tourist’s decision making. Journal of Travel Research, 54(5), 584–595. https://doi.org/10.1177/0047287514528284

83.

Shafqat

Byun

Y. C.

(2020). A recommendation mechanism for under-emphasized tourist spots using topic modeling and sentiment analysis. Sustainability (Switzerland), 12(1), 320. https://doi.org/10.3390/SU12010320

84.

Shankman

(2014). The 10 Most Photographed Global Cities On Instagram in 2013. Ski.

85.

Sohrabi

M. K.

Barforoush

A. A.

(2012). Efficient colossal pattern mining in high dimensional datasets. Knowledge-Based Systems, 33, 41–52. https://doi.org/10.1016/j.knosys.2012.03.003

86.

Stepchenkova

Zhan

(2013). Visual destination images of Peru: Comparative content analysis of DMO and user-generated photography. Tourism Management, 36, 590–601. https://doi.org/10.1016/j.tourman.2012.08.006

87.

Stone

J. A.

Can

S. H.

(2021). Gendered language differences in public communication? The case of municipal tweets. International Journal of Information Management Data Insights, 1(2), 100034.

88.

Storbacka

Brodie

R. J.

Böhmann

Maglio

P. P.

Nenonen

(2016). Actor engagement as a microfoundation for value co-creation. Journal of Business Research, 69(8), 3008–3017. https://doi.org/10.1016/j.jbusres.2016.02.034

89.

Subeno

Kusumaningrum

Farikhin . (2018). Optimisation towards latent dirichlet allocation: Its topic number and collapsed Gibbs sampling inference process. International Journal of Electrical and Computer Engineering, 8(5), 3204–3213. https://doi.org/10.11591/ijece.v8i5.pp.3204-3213

90.

Sutherland

Sim

Lee

S. K.

Byun

Kiatkawsin

(2020). Topic modeling of online accommodation reviews via latent dirichlet allocation. Sustainability (Switzerland), 12(5), 1–15. https://doi.org/10.3390/su12051821

91.

Syed-Ahmad

S. F.

Musa

Klobas

J. E.

Murphy

(2013). Audience response to travel photos and Arab destination image. Journal of Travel and Tourism Marketing, 30(1–2), 161–164. https://doi.org/10.1080/10548408.2013.751279

92.

Taecharungroj

(2019). User-generated place brand identity: Harnessing the power of content on social media platforms. Journal of Place Management and Development, 12(1), 39–70. https://doi.org/10.1108/JPMD-11-2017-0117

93.

Taecharungroj

Mathayomchan

(2019). Analysing TripAdvisor reviews of tourist attractions in Phuket, Thailand. Tourism Management, 75, 550–568. https://doi.org/10.1016/j.tourman.2019.06.020

94.

Taecharungroj

Mathayomchan

(2020). The big picture of cities: Analysing Flickr photos of 222 cities worldwide. Cities, 102, 102741. https://doi.org/10.1016/j.cities.2020.102741

95.

Tasci

A. D. A.

(2016). A critical review of consumer value and its complex relationships in the consumer-based brand equity network. Journal of Destination Marketing and Management, 5(3), 171–191. https://doi.org/10.1016/j.jdmm.2015.12.010

96.

Vijaykumar

Nowak

Himelboim

Jin

(2018). Virtual Zika transmission after the first U.S. case: Who said what and how it spread on Twitter. American Journal of Infection Control, 46(5), 549–557. https://doi.org/10.1016/j.ajic.2017.10.015

97.

Villamediana

Küster

Vila

(2019). Destination engagement on Facebook: Time and seasonality. Annals of Tourism Research, 79, 102747. https://doi.org/10.1016/j.annals.2019.102747

98.

H. Q.

Law

B. H.

(2015). Exploring the travel behaviors of inbound tourists to Hong Kong using geotagged photos. Tourism Management, 46, 222–232. https://doi.org/10.1016/j.tourman.2014.07.003

99.

Wang

Hsu

M. K.

(2016). Toward an integrated model of tourist expectation formation and gender difference. Tourism Management, 54, 58–71. https://doi.org/10.1016/j.tourman.2015.10.009

100.

Wang

(2021). Tourism destination image based on tourism user generated content on internet. Tourism Review, 76(1), 125–137.

101.

Wang

Taylor

J. E.

(2019). DUET: Data-driven approach based on latent dirichlet allocation topic modeling. Journal of Computing in Civil Engineering, 33(3), 04019023. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000819

102.

Xiao

Fang

Lin

Chen

(2022). A framework for quantitative analysis and differentiated marketing of tourism destination image based on visual content of photos. Tourism Management, 93, 104585.

103.

Xie

Xiao

(2016). Value co-creation between firms and customers: The role of big data-based cooperative assets. Information and Management, 53(8), 1034–1048. https://doi.org/10.1016/j.im.2016.06.003

104.

Cheung

L. T.

Lovett

Duan

Pei

Liang

(2023). Understanding the influence of user-generated content on tourist loyalty behavior in a cultural World Heritage Site. Tourism Recreation Research, 48(2), 173–187.

105.

Yuan

Lau

(2018). Topic sentiment mining for sales performance prediction in e-commerce. Annals of Operations Research, 270(1–2), 553–576. https://doi.org/10.1007/s10479-017-2421-7

106.

Yuan

Medel

(2016). Characterizing international travel behavior from geotagged photos: A case study of Flickr. PLoS ONE, 11(5), e0154885. https://doi.org/10.1371/journal.pone.0154885

107.

Zhang

Buhalis

(2018). A model of perceived image, memorable tourism experiences and revisit intention. Journal of Destination Marketing and Management, 8, 326–336. https://doi.org/10.1016/j.jdmm.2017.06.004

108.

Zhang

(2018). Big data and tourism geographies–an emerging paradigm for future study? Tourism Geographies, 20(5), 899–904. https://doi.org/10.1080/14616688.2018.1519719

109.

Zhao

Wang

(2019). Predicting overall customer satisfaction: Big data evidence from hotel online textual reviews. International Journal of Hospitality Management, 76, 111–121. https://doi.org/10.1016/j.ijhm.2018.03.017

110.

Zhao

Zhu

Hao

(2018). Share the Gaze: Representation of destination image on the Chinese social platform WeChat moments. Journal of Travel and Tourism Marketing, 35(6), 726–739. https://doi.org/10.1080/10548408.2018.1432449

111.

Zhou

Q. B.

Zhang

X. R.

(2018). Is all authenticity accepted by tourists and residents? The concept, dimensions and formation mechanism of negative authenticity. Tourism Management, 67, 59–70. https://doi.org/10.1016/j.tourman.2017.12.024

Topic	Weights
1	0.036“photography” + 0.027“eyelash” + 0.027“event” + 0.027“urban” + 0.018“card” + 0.018“happy” + 0.018“branch” + 0.018“account” + 0.018“saturday” + 0.018“PASAY”
2	0.159“automotive” + 0.113“wheel” + 0.081“vehicle” + 0.042“motor” + 0.040“design” + 0.037“alloy” + 0.030“hubcap” + 0.030“system” + 0.028“synthetic” + 0.028“rubber”
3	0.027“yellow” + 0.027“tour” + 0.027“hotel” + 0.027“kitchen” + 0.014“design” + 0.014“natural” + 0.014“black” + 0.014“white” + 0.014“building” + 0.014“carmine”
4	0.033“landscape” + 0.017“shop” + 0.017“electric” + 0.017“winter” + 0.017“property” + 0.017“material” + 0.017“fashion” + 0.017“thumb” + 0.017“azure” + 0.017“wrist”
5	0.047“architecture” + 0.037“plant” + 0.036“solar” + 0.028“building” + 0.024“temple” + 0.024“free” + 0.024“taipei” + 0.024“PESOS” + 0.014“water” + 0.013“cloud”
6	0.042“design” + 0.031“electric” + 0.017“fashion” + 0.017“card” + 0.017“mall” + 0.017“advertising” + 0.017“PACKAGE” + 0.017“coloring” + 0.017“publication” + 0.017“magenta”
7	0.048“architecture” + 0.033“temple” + 0.017“building” + 0.017“travel” + 0.017“leisure” + 0.017“event” + 0.017“chinese” + 0.017“japanese” + 0.017“facade” + 0.017“place”
8	0.035“design” + 0.035“advertising” + 0.035“happy” + 0.023“airport” + 0.023“mall” + 0.023“royal” + 0.023“PACKAGE” + 0.023“CITY” + 0.023“natural” + 0.023“http”
9	0.057“plant” + 0.057“forest” + 0.029“design” + 0.015“terrestrial” + 0.015“landscape” + 0.015“natural” + 0.015“flowering” + 0.015“biome” + 0.015“branch” + 0.015“broadleaf”
10	0.074“landscape” + 0.045“plant” + 0.038“mountain” + 0.037“natural” + 0.029“horizon” + 0.019“new” + 0.019“credit” + 0.019“mountainous” + 0.019“manila” + 0.019“trip”
11	0.072“kitchen” + 0.049“appliance” + 0.037“design” + 0.037“property” + 0.025“transfer” + 0.025“city” + 0.025“visit” + 0.025“option” + 0.025“hotel” + 0.025“lunch”
12	0.026“breakfast” + 0.026“office” + 0.026“travel” + 0.014“sleeve” + 0.014“cuisine” + 0.014“dishware” + 0.014“comfort” + 0.014“tableware” + 0.014“produce” + 0.014“recipe”

Topic	Weights
1	0.039“personal” + 0.038“rolling” + 0.038“luxury” + 0.025“design” + 0.022“automotive” + 0.020“lighting” + 0.020“asphalt” + 0.017“wheel” + 0.016“service” + 0.016“collar”
2	0.043“event” + 0.035“design” + 0.035“equipment” + 0.034“recreation” + 0.028“advertising” + 0.018“personal” + 0.018“protective” + 0.018“graphic” + 0.018“brand” + 0.018“stage”
3	0.054“automotive” + 0.052“design” + 0.043“flooring” + 0.039“kitchen” + 0.029“lighting” + 0.025“appliance” + 0.023“event” + 0.021“vehicle” + 0.021“floor” + 0.019“stain”
4	0.031“design” + 0.029“spring” + 0.029“facade” + 0.022“comfort” + 0.022“building” + 0.018“estate” + 0.015“grass” + 0.015“cloud” + 0.015“event” + 0.015“fashion”
5	0.034“design” + 0.031“produce” + 0.031“comfort” + 0.028“group” + 0.027“foods” + 0.020“cuisine” + 0.019“fashion” + 0.019“handwriting” + 0.019“advertising” + 0.018“sweetness”
6	0.060“design” + 0.027“graphic” + 0.026“fashion” + 0.025“event” + 0.019“product” + 0.019“advertising” + 0.018“magenta” + 0.017“electric” + 0.017“happy” + 0.014“illustration”
7	0.154“automotive” + 0.097“vehicle” + 0.077“wheel” + 0.044“design” + 0.041“tread” + 0.039“exterior” + 0.038“alloy” + 0.038“motor” + 0.033“hubcap” + 0.032“system”
8	0.042“fashion” + 0.041“event” + 0.033“leisure” + 0.025“human” + 0.021“travel” + 0.021“electric” + 0.017“magenta” + 0.017“accessory” + 0.017“luggage” + 0.017“sports”
9	0.034“advertising” + 0.028“bottle” + 0.028“brand” + 0.027“poster” + 0.021“design” + 0.021“fluid” + 0.021“illustration” + 0.021“cartoon” + 0.021“animated” + 0.021“character”
10	0.055“space” + 0.036“human” + 0.036“public” + 0.035“temple” + 0.034“morning” + 0.030“happy” + 0.029“expression” + 0.029“cloud” + 0.029“settlement” + 0.024“travel”
11	0.094“plant” + 0.070“landscape” + 0.047“natural” + 0.031“nature” + 0.025“grass” + 0.023“leisure” + 0.023“blossom” + 0.020“travel” + 0.020“flowering” + 0.020“petal”
12	0.080“block” + 0.067“condominium” + 0.067“cityscape” + 0.055”landscape” + 0.048“urban” + 0.044“design” + 0.043“horizon” + 0.039“architecture” + 0.038“landmark” + 0.034“estate”

Is There a Difference in the Perception of City in Pre-Pandemic and Peri-Pandemic on Social Media? Case Study from Taiwan

Abstract

Keywords

Introduction

Literature

Expectations of Cities and Pandemic Studies

Image Analysis and City-Related TGC

Topic Analysis and Latent Dirichlet Allocation

Methods

Data Search and Collection

Text Preprocessing and Data Cleaning

Image Analysis and Tag Testing

LDA Modeling

Topic Assessment and Selection

Text Topics and Interactivity

Image Topics and Interactivity

Results

Text Topics and Visual Results of Taipei City

Image Topics and Visual Results of Taipei City

Discussion

Text Topics

Image Topics

Post-Pandemic Recommendations

Limitations

Footnotes

Author Contributions

Declaration of Conflicting Interests

Funding

Ethical Approval

ORCID iD

Data Availability Statement

References