Sage Journals: Discover world-class research

Abstract

Social Media acts as a primary source of information, opinion, and news source for millions of individuals every day for over a decade. This has never been as apparent as the global pandemic of COVID-19, wherein a span of more than just a year, the evolution of emotions amongst the users of social media has never been so swift and fickle. This study uses Reddit post-extraction and classifies 60,370 posts between the time frame of February 11th, 2020 to January 26th, 2021 from the two major subreddits of r/COVID and r/COVID19. With the help of the Lexicon Approach, the posts are classified into positive, negative, and neutral sentiment polarities, and then distributive frequencies and valence scores are measured for measuring emotional contagion. The findings reveal that there is a high presence of negative sentiments in the posts, and the increase in sentimental extremities occurred in three time-frames, the initial pandemic stage; the implementation of massive lockdowns stage; and the approval and administration of vaccines stage. It also shows that there is a linear relationship between the valence of exposed stimuli and their response. Emotional contagion is present in both positive as well as negative posts. The important implications can be drawn for the emotional wellbeing, perspective, and contagion of the users of Reddit.

Keywords

COVID-19 sentiment analysis Reddit emotional contagion social media

Introduction

The outbreak of Novel Coronavirus (2021) has had a tremendous impact on the physiological as well as mental health of numerous individuals. The physical symptoms are, by and large, visible and tangibly diagnosable; but the mental symptoms are majorly missing (WHO, 2021a, 2021b). Individuals have suffered heavy losses in family and fortunes, mobility, incomes, social engagement along with stress, anxiety, isolation, depressive symptoms, uncertainty with emotions, livelihoods as well as mental strength (Low et al., 2020). There have been a few support groups on the platform known as Reddit, which have allowed users to share, discuss and help those who are in need of help dealing with this pandemic’s experience. This study uses the Natural Language Processing Tools to analyze the invoked emotion in the discourse of COVID-19 on an online platform by evaluating their sentiments and identifying the presence of emotional contagion.

Early findings showed that this fear in the public’s mind was reflected in their online discourse, which necessitated the importance of studying the patterns in the emotional content of these discussions (Shankar & Tewari, 2021a). There has been, therefore, an increase in the literature studying the sentiments of the online discourse related to COVID-19 (Alamoodi et al., 2021; Crocamo et al., 2021; Valenzano et al., 2020). These studies also found that there has been a pattern of intense emotional discussions when it comes to the norms and outcomes of COVID-19, which could result in the development of emotional contagion. Continuous bombardments of online notifications lead to a long-term effect on an individual’s emotional sentiments as well as contagion, leading to people experiencing emotions that are intense, without the apparent awareness (Ferrara & Yang, 2015; Rubin & Wessely, 2020; Shankar & Breithaupt, 2019).

Severe acute respiratory syndrome (SARS) and the Middle East respiratory syndrome (MERS) and the common cold are among the primary illnesses triggered by a family of viruses that are known as Coronaviruses. SARS-CoV-2, or severe acute respiratory syndrome coronavirus-2, has been originated in China in 2019, which was then called COVID-19 by the World Health Organization (WHO, 2020a). It was then declared a pandemic by WHO in March, 2020 (WHO, 2020a). As of 26th June, 2021, there have been over 181 million positive COVID-19 cases reported, with over 165 million of those cases being recovered and nearly 4 million deaths.

This all has its own baggage of fear in its social stigma for people, places, and things that are associated with COVID-19. This pandemic has also led to an “infodemic” (WHO, 2020a) due to an overwhelming response, opinions, emotions as well as conspiracies surrounding the COVID-19 discourse. With the involvement of uncertain emotions, what we get is an emotionally charged crowd that is easily swayed to unreliable dogma. Human interactions are built upon recognizing, acknowledging, and embracing emotions and sentiments (Cowen et al., 2019), but if they are not properly expressed or addressed, they seem to result in dire mental health issues (Wells, 2006). Fear mongering has been a weaponized tool for political, sociological, and psychological attraction and warfare since the days of yore (Aslam et al., 2020). The unfortunate misuse of media for sensationalizations of half-truths or fake news has been done for hundreds of years to propagate false pieces of information (Friedman et al., 1999).

The other coronavirus outbreaks were reported first in Guangdong province, along with Hong Kong, Toronto, Singapore, and Hanoi by many researchers (Hsu et al., 2003; Lee et al., 2003; Tsang et al., 2003) among others, resulting in 8,439 infections and 812 deaths (Liang et al., 2004). That outbreak was clinically named SARS-CoV-1, which bears 70% similarity with SARS-CoV-2, or COVID-19 (Rat et al., 2020). Another outbreak was reported in 2012 as Middle East respiratory syndrome coronavirus [MERS-CoV] first in Saudi Arabia, resulting in 2,400 infections and 850 deaths (Killerby et al., 2020; Wang et al., 2020; Zaki et al., 2012).

The coverage of this new strain of the virus generated fear amongst the public, which, in turn, sparked a scared discourse on all the social media platforms. To this day, COVID-19 remains the top 5 discussed, covered, or conversed topics on all the biggest platforms across the globe, including platforms like Google, Twitter, Reddit, Facebook, etc. (Volkmer, 2021). Even in the early days of the pandemic, and lockdowns and quarantines, a study (Barkur &Vibha, 2020) found that the overall sentiment of the social media users has been overly positive, but then later studies (Aslam et al., 2020; S. Das & Dutta, 2021) found that there is a shift into the negative sentiment towards the social media discourse of COVID-19, many of which is spearheaded by the negative coverage of the virus in the mainstream media. Natural Language Processing (NLP) has a unique and prominent method of classifying sentiments in the texts that are being analyzed, known as Sentiment Analysis (B. Liu, 2012), and there has been an increased use of sentiment analysis in understanding the medical emergencies and research (Sokolova & Bobicev, 2013; Zeng-Treitler et al., 2008). Sentiment analysis uses a technique known as Lexicons which matches the text units (words, sentences, etc.) with emotions and sentiments (Mohammad & Turney, 2013; Taboada et al., 2011), which then segments these sentiments into positive and negative sentiments by assessing the overall polarities of these texts (Pang & Lee, 2008; Strapparava & Mihalcea, 2008; Strapparava et al., 2006; Turney & Littman, 2003; Wilson et al., 2005). The positive sentiment explains the polarity which has favorable sentiment, and the negative sentiment explains the polarity of dissimilar sentiments (Bermingham & Smeaton, 2010). Even though there are obvious challenges to sentiment analysis of big data in real-time (Turney, 2002), organized use of classifiers to continually evolve the classification Lexicon-based approach is important (Pang et al., 2002).

In this study, we have tried to use social media to decipher the communication and discourse amongst users talking about COVID-19. The focus of their discussions would be through the Reddit platform, concentrating on the topics, posts, and news related to COVID-19. The sentiment analysis is done to evaluate the emotional valence and emotional contagion of the outbreak. There have been intensive recent studies that have studied the content that is shared on social media by social media users, such as Facebook (Kim et al., 2021; Kramer et al., 2014; Sturm Wilkerson et al., 2021), Twitter (Arora et al., 2021; Ferrara & Yang, 2015; Park et al., 2021; Shankar & Tewari, 2021b), as well as traditional News Headlines (Aslam et al., 2020; Srivastava & Deepak, 2021); but the studies dealing with the conversations and discourse on Reddit—especially when it pertains to emotional regulation and contagion—in the context of COVID-19, are substantially scarce, even though there are two big subreddits dedicated to COVID-19. The studies done on the other social media and traditional media platforms have used Sentiment Analysis and Deep Learning Techniques to identify and evaluate their users’ emotional and behavioral valence. But the limited literature on Reddit discourse has primarily focused on understanding medical uncertainties or relatedness in responses (Bunting et al., 2021; Thompson et al., 2022), and were linguistically (Low et al., 2020) or geographically (Zhang et al., 2020) constrained. We have tried to justify the consequences that Reddit discourse has on emotional wellbeing, and urges for interventions in the front. The questions that are being answered here are: (1) What is the overall sentiment polarity of the Reddit posts of COVID-19? (2) What are the top conversation topics while discussing positive and negative sentiments of the COVID-19? And (3) Is there an effect of emotional contagion on the users of Reddit discussing COVID-19? Based on the textual analysis, the study proposes intervention strategies that can be employed to identify and convert negative sentiments into positive ones. This can help in identifying the most credible sources of the user pools that can act as information disseminators for the discourse of COVID-19 information.

Review of Literature

NLP techniques have been used in the studies of social media analytics when analyzing varied domains of research. Amongst the most popular techniques of NLP is Sentiment Analysis. Sentiment analysis has been used in multiple ways for emotional mining (Cambria et al., 2013; Feldman, 2013; B. Liu, 2012; Medhat et al., 2014; Pang & Lee, 2008). Since there has been a wide range of domains that have implemented Sentiment Analysis, this section would discuss the use of natural language processing in the domain of pandemic-related social media analytics.

With the advent of COVID-19 and subsequent online interactions, there was an immediate need to address the concerns regarding assessing the emotions of the users interacting on these platforms. Thus, numerous researchers started looking for the sentiments and sentiment analysis of these social media interactions. Several studies include platforms like Facebook and Twitter (Domalewska, 2021; Rianto & Pratama, 2021), YouTube, Instagram, etc. (de Las Heras-Pedrosa et al., 2020; Shukla, 2021), Twitter, and other online platforms (Chakraborty et al., 2020; Dubey, 2020). A study performed a systematic literature review to suggest that sentiment analysis played a substantial role in figuring out the emotional intelligence as well as emotional contagion of online users (Alamoodi et al., 2021).

This review study also provided a stark understanding which formed the initial premise of this study. It found that the impact of social media discourse and its sentiment analysis was prominent in figuring out the presence of emotional contagion, especially when the contents shared or discussed were tangential to the mainstream discourse, or worse, were misinformation.

Sharma et al. (2017) analyzed the impact social media has as an information source that might help in decreasing the pandemic information spread. This study used the Zika virus pandemic as the source to discuss the misleading posts that gain traction and popularity in comparison to accurate posts. Similarly, B. F. Liu and Kim (2011) evaluated the organizational dissemination and subsequent social media responses to the 2009 H1N1 flu virus, showing that legitimate organizations did not use this opportunity to properly discuss the epidemic crisis. Similarly, the 2015 India H1N1 flu epidemic and its related issues were discussed showing Twitter acted as a better platform for effective information transmission, as compared to similar traditional media and social media platforms (Jain & Kumar, 2015). Similar studies have discussed the misinformation regarding the Ebola virus (Apuke & Omar, 2020). There has been an epidemic of false information spread in the field of health (Apuke & Omar, 2021; Pulido et al., 2020).

There is a similar use of social media for COVID-19 and social media interactions. Right from investigating how China and its organizations used social media during this time (Q. Chen et al., 2020), discussing citizen engagement and its impact on misinformation, to India and its organizations’ use of social media (S. Das & Dutta, 2021) talking about government’s handling of the events during the pandemic on Twitter. Some studies have rightly found that there is an obvious connection between social media and misinformation in the pandemic era (Hou et al., 2020; Huynh, 2020; Pennycook et al., 2020). People are looking at social media for seeking information (Huynh, 2020), but the spread of fake news and fake posts has been especially prominent on social media (Frenkel et al., 2020; Pano & Kashef, 2020; Russonello, 2020). Fabricated information involving the origins of the virus, preventive home cures, propagating racial division and hate along with denials of scientific methods and mental health, added to undermine the efforts of front-line workers, scientists, and governments (Lampos et al., 2021).

Similar studies discussed the impact that social media and misinformation have on emotional risk, such as in Vietnam (Huynh, 2020), Taiwan (Frenkel et al., 2020), Nigeria (Alpert, 2020), India (S. Das & Dutta, 2021), Africans (Lampos et al., 2021), United States (Pennycook et al., 2020), among others. These papers found a consistent finding that social media has been inefficient in combating medically unproven “cures,” along with incompetent predictions of multiple millions of deaths in each country due to this pandemic, resulting in unnecessary fear and uncertainty (Aslam et al., 2020; Hassan, 2021; Sahu et al., 2020).

There has also been empirical studies and literature survey for analyzing COVID-19-related opinion, commentaries, discussions, posts, etc. These studies have focused on understanding COVID-19 (Sohrabi et al., 2020), tackling COVID-19 (Lampos et al., 2021), documenting comprehensive reports of COVID-19 (Sahu et al., 2020), studying mental health related to COVID-19 (Rajkumar, 2020), and compiling media reports of COVID-19 (Zhou et al., 2020) along with accessing emotions related to the news headlines from major news outlets regarding COVID-19 (Aslam et al., 2020).

These studies showed that this use of sentiment analysis with social media is a continually evolving topic (L. Das & Dutta, 2020), and this study tries to build upon this topic. With the help of methodologies such as Sentiment Analysis and VADER, we would be able to explain the emotional contagion and emotional valence of the user having an online discourse.

If the COVID-19 related opinions expressed are unpopular or tangential, and if they’re opined during the online discourse, lead to a prompt as well as unnecessary social and cultural discrimination (Akroyd et al., 2020), and could fortify up and divulge into civil and societal unrests (Bloem & Salemi, 2021), as was seen during the peak isolation periods. The increase in mental health and its related issues—including stress, anxiety, motivation, and emotional intelligence (Shankar & Tewari, 2021b), etc. lead to further deterioration of society. This has real, severe, and long-lasting implications for the weaker sections of the society (Shankar & Tewari, 2021b), who could be easily extorted into receiving improper medical attention, leading to counterfeit emotions as well as high emotional contagion (Rincón-Aznar et al., 2020).

Interplay Between Online Discourse and Emotional Contagion

Modern English as well as Medical dictionaries define Contagion as the “transference of disease by contact” (R. P. Das, 2017; Merriam-Webster, 2022). Speaking in explicitly medical terms, the presence of contagion is seen when the transference of an infectious disease happens through the mediums or carriers of the pathogenic microorganisms, usually through the air, water, or other contaminable sources (Valenzano et al., 2020). In our study, we are centrally focused on the theory of emotional contagion (Hatfield et al., 1993).

Emotional Contagion, as defined by Hatfield et al. (1992) provides an underlying understanding of the theoretical foundations of collectivist behavior, afferent behavioral mimicry, and behavioral transmission, along with human cognition, behavior, and emotion as well as other neurophysiological and psychological outcomes. Multiple pieces of research, including a study of a 20-year-long longitudinal study, found that intense emotions have a way of finding a path into a person’s psyche through social media and other online interactions (Fowler & Christakis, 2008). Some other studies found that emotional contagion occurred even when the interactions were non-verbal, online, and were manipulatively controlled (Kramer et al., 2014; Sasaki et al., 2021; Steinert, 2021). These studies suggest that getting emotional cues from social media platforms can have long-term negative effects (Shankar & Tewari, 2021a).

The possibility of information manipulation that users could see or denied seeing is not only well-suited for these platforms, but they are actively involved in it (L. Chen et al., 2022). The ethical concerns are pretty obvious (Ferrara & Yang, 2015), but what is usually glossed over by the individuals are the long-lasting consequences of these manipulative behaviors.

As fear, uncertainty, anxiety, and stress kicked in with the advent of COVID-19, social media awarded us with emotionally charged messaging, and subsequent radical emotional responses. This leads to a toxic emotional environment, which are the breeding grounds for emotional contagion. Users were less worried about the tangible consequences of COVID-19 and its policies and were more concerned with perceptual alignment with the narratives (Altamura et al., 2019).

Although many studies have studied social media interactions regarding COVID-19 and possible psychological and emotional triggers and stress, there is not a very clear understanding of why some triggers affected users more than others. Also, there is uncertainty in figuring out why some were affected more than the others. In light of this premise, this study has tried to identify the presence of emotional contagion in Reddit interactions regarding COVID-19, and how this contagion has impacted the sentiments and responses of these users.

Method

Data

This study collected 60,370 posts from two subreddits, namely r/covid and r/covid19 that are dedicated to discussing about the coronavirus pandemic with nearly over 500k spectators combined in the subreddit communities. Reddit is a social media platform that covers social news and media aggregation as well as content aggregation through posts, comments, replies, images, videos, links, etc. The popularity of a post is measured by the engagements of other users through reply threads, and upvotes/downvotes providing scores to the post.

Unlike other social media platforms, Reddit allows for the extraction of contents through metadata on its website. This study was done using Python to extract all the posts from February 11th, 2020 to January 26th, 2021 for analysis. The summary of the dataset is given in Table 1.

Table 1.

Summary of the Dataset.

Dates of data collection	February 11th, 2020 to January 26th, 2021
Number of posts collected	73,770
Number of posts after pre-processing	60.370
Day on which maximum posts were made	Monday
Organizations’ most discussed words	CDC

The threads and posts were discussing about every aspect of their lives that has been affected by this COVID-19 pandemic. Sentiment analysis was conducted from the posts extracted through these two subreddits.

Data Pre-Processing

The extracted contents of the users usually contain textual as well as non-textual information that warrants cleaning before any kind of NLP analysis. Text analytics (Angiani et al., 2016; Kharde & Sonawane, 2016) is used to look preliminarily into the raw extracted data. The following steps were taken for data processing and data cleaning:

Conversion of posts into a text file.

The texts were converted into a corpus on which the analysis was to be done.

Conversion of all texts of the entire document into lower case.

Removal of punctuation marks, such as commas, hyphens, periods, and other line and page breaks.

Removal of stopwords, such as “like,”“and,”“or,”“in,”“is,”“there,”“were,”“for,”“isn’t,”“couldn’t” etc.

Removal of URLs, mentions, emoticons, and other non-ASCII characters.

Removal of numbers, digits, and numerals.

Removal of unnecessary white spaces, tabs, and other spaces.

Stemming—Removal of words with common occurrence ending with “es,”“ed,” and “s.”

Lemmatizing—Assessing different forms of inflected words to get a better understanding of the text.

Sentiment Analysis

Sentiment analysis is usually defined as a classification task where each classifying type and category is characterized by a sentiment (Prabowo & Thelwall, 2009). This technique is usually defined in two ways: First, is the Lexicon-Based Approach, and the second is the Machine Learning Approach (Kharde & Sonawane, 2016; Piryani et al., 2017; Taboada et al., 2011). The more popular of the two approaches is the Lexicon-Based Approach, as it relies on a set of predefined “lexicons,” or a list of predefined wordset that help in identifying the polarities of the texts to be analyzed. These lexicons have an inbuilt repository-list of wordset, along with their own sentiment polarities (Mohammad, 2015).

NRC Emotion Lexicon is the most popular lexicon used in these researches (S. Das & Dutta, 2021) which uses the rule-based approach to associate the words from its eight basic emotions (Ekman, 1992; Plutchik, 1994). This corpus-level mining approach allows for a simplified understanding of the complex linguistic structure of the language, especially around emotionally-charged topics like COVID-19.

Valence Aware Dictionary and Sentiment Reasoner (VADER)

VADER is amongst the most commonly used Lexicon-Based Approach and Rule-Based Sentiment Analysis model that is predominantly used to analyze the words, texts, and emoticons from the social media platforms (Hutto & Gilbert, 2021). Since it has a predefined repository from which it does its analyses, it is usually found to be much more effective and efficient in terms of speed, time, and accuracy in comparison to its machine learning approach counterparts (Hutto, 2014; Hutto & Gilbert, 2014; Shankar et al., 2021).

All of the values of the textual conversions are first converted into vectors, that provide scores to the sentiments of the texts, which divides the vectors into positive, negative, neutral, and compound polarities, which then normalizes the polarities of negative, positive, and neutral from 0 to 1; and the compound polarities are normalized from −1 to +1 [negative to positive] (Mäntylä et al., 2018, Pano & Kashef, 2020). These scores are identified as VADER scores and they are used to measure the emotional trend of the COVID-19 texts.

VADER has been consistently performing better on human as well as Twitter Data (Hutto & Gilbert, 2014); and has performed better per capita against the other seven popular lexicons (Elbagir & Yang, 2020). There have been several linguistic adaptations of VADER too (Amin et al., 2019; Las Johansen, 2018; Oyewusi et al., 2020; Tymann et al., 2019), that further reinforce the understanding that VADER is the most widely used for collaborative and ensemble understanding of the emotions of the textual data (Bonta & Janardhan, 2019; Borg & Boldt, 2020).

Results

Sentiment Analysis

Sentiment Analysis was done using Python’s package VADER (Hutto, 2014). The sentiment polarities were divided into two polarities, that is, Negative and Positive, and the emotions were classified into eight emotions (Mohammad, 2015; Plutchik, 1994). In order to analyze the word distribution visually, a word cloud was created. Figure 1 shows the word cloud of the words being used in the posts based on their occurrence frequency. The bigger is the word, the higher is the frequency of occurrence of the said word.

Figure 1.

Word cloud of the sentiment polarities based out of COVID-19 post.

It can be clearly seen from the word cloud that in the positive word corpus, the most commonly occurring words were thank, fun, help, please, hope, happy, good, well, etc., whereas in the negative word corpus, the most commonly occurring words were covid, death, people, struggling, feeling, throat, pain, infected, loneliness, etc. For further understanding, a group of sample Reddit posts by the Redditors explaining the positive and negative sentiment polarities is presented in Table 2.

Table 2.

Sample Redditors’ Posts with Emotional Classification and Sentiment Polarity.

Redditors’ negative posts	I’m not quite sure if this is the right group to post this but I’m panicking. Like; literally fighting a panic attack as I write this. English is not my first language, so I’m sorry for any mistakes or typos. I’m in one of the smallest cities in Mexico. There’s a total of 458 cases in the whole state, and 147 in the city where I live. I’ve been taking precautions and all, but I suffer from diagnosed anxiety disorder and depression both increasing during this whole pandemic so it’s been a hard time trying to convince myself that I’m taking precautions and there’s nothing else I can do, literally.
	Well my husband’s Coworkers are dropping like flies. 6 people in the past 2 weeks positive and 1 in the hospital… we just took our test a couple days ago just awaiting results. Hoping it’s negative and so far he doesn’t feel that bad. The kids and I feel fine. I wonder if we are just building it up in our system, doing my best to not freak out.
	As a 20 year old who has been in lockdown or little contact with others much in the past year, Im starting to lose my mind… anyone have any suggestions on helping time pass. I am currently In university but I find it worse with no time to go out and blow off steam. I just feel as if I am losing the best years of my life. I miss my dating life the most being recently single about a month before covid began, its terrible.
Redditors’ positive posts	I recovered from covid at the beginning of the month. Thankfully have my sense of taste back but still can not smell. My memory had gone to shit though. I will go to the gas station and forget what I went there for, I will forget tv episodes I had just watched, I will forget what we had for dinner the previous night, etc.. Has anyone experienced this? How long did it last?
	Hi, everyone. Ill start by saying Ive been happy to comply with mask mandates from the start. I have no qualms about it. If its going to protect others and if the masks of others will protect me, Im game.
	2020 will be remembered not only for a virus that changed our lives, some of our perspectives, our working conditions, health care but also for the fear, the panic, the doubts, the questions and as the year ends, maybe some hope The hope seems to be in the form of a vaccine and there seems to be an unquestionable faith in it. The world seems to be happy with an untested vaccine as long getting vaccinated will ensure that they can travel, party and do all the things they could before 2020.

The posts show that when a user is discussing the illness and its ramifications, it is usually in a negative sentiment polarity, where they are clearly talking about the struggles they have with the disease and how they have tried to deal with it. And when talking about vaccines and uplifting news and stories, they are being presented in the positive sentiment polarity. It is also evident that positive posts have an air of caution in them, and negative posts have a trickle of hope.

The choice of also measuring the intensity of the sentiment polarities when dealing with longer texts is discussed in previous studies (Ferrara & Yang, 2015; Hutto & Gilbert, 2014; Thelwall et al., 2010), with the limitations to intensity of sentiment analysis being presented when the text is limited by the character or word count. With such limitations being absent in Reddit, the efficiency of sentiment intensity increases. Figure 2 presents the sentiment distribution of the top 10 most discussed topics or organizations.

Figure 2.

Average sentiment intensity in the top 10 most discussed topics or organizations.

It can be clearly seen that when the posts are pertaining to organizations (business, medical, or otherwise), the sentiment is majorly positive, exhibiting positive emotions; but when the posts are about the disease or the effects that they have on their lives or livelihoods, then the sentiments are similar in the positive and negative polarities, suggesting a presence of static or neutral emotional presence in these topics.

Effect of Emotional Contagion

There is a long-standing idea that emotions can be transferred through interactions on the online media, especially social media, even when there is an objective absence of non-verbal communications, interactions, and cues, which are considered the core ingredient for any emotional interactions (Fowler et al., 2008; Hatfield et al., 1992).

To achieve this, the posts were vectorized and the instances of occurrence of “covid” were counted through frequency distribution across the time-frame of data collection. This allowed us to figure out the time when there was a significant rise in the discussions related to COVID-19. The results are shown in Figure 3.

Figure 3.

Average daily occurrence of the word “covid.”

This shows that from early March to early April, there was significant use of the word “covid” in the posts, which goes down and returns during late April, as this was the time when COVID-19 was first declared a pandemic and the nomenclature was derived (WHO, 2020b). This spike was again resumed around early June when most of the nations, including the UK, India, etc. went into severe lockdowns, and there was a serious spike in death toll too (Venkata-Subramani & Roman, 2020). Again, the spike in occurrence of the word returned around early December and mid-January, when the first of the vaccines were finalized (BBC, 2020) and approved (WHO, 2020b) by the governing health authorities across the globe, including WHO and the vaccines were beginning to be administered to the public (WHO, 2020b).

The posts were then divided into their sentiment polarities based on their VADER scores. Then for each polarity, a distribution was created through observation of the average daily change. The results are shown Figures 4 and 5. Figure 4 shows the distribution of the sentiment polarities after posting the texts on the subreddits. The sentiments included are positive and negative sentiments, on a daily average. Figure 5 shows the distribution of the standard deviation of the sentiment polarities after posting on subreddits.

Figure 4.

Sentiment average change with time—mean change.

Figure 5.

Sentiment deviation with time—standard deviation.

The figures show that there are overall more positive sentiment posts being posted on the two subreddits, but there were higher average negative sentiments on those posts. Similarly, there was a higher deviation of positive sentiment posts, but there was a consistent presence of negative sentiment posts throughout the time frame. The possible spike in the negative sentiments during the period of June 2020 could be attributed to the severe lockdown restrictions implemented across the globe. A similar spike in the standard deviation of positive sentiment can be attributed to the release and administration of vaccines and the related hope with it. These spikes in negative and positive sentiments have been attributed to immediate mental health issues, that lead to a sustained feeling of negativity or positivity in the continued threads of the post, which has also been reported in similar studies (Low et al., 2020).

These results show the presence of emotional contagion on both the negative and positive sentiments, which is also found in previous studies (Ferrara & Yang, 2015). In order to further solidify the results, measures of anomalies and valence were also done. For anomaly detection, the texts were vectorized according to the VADER score, and then the anomalies were clustered using the DBSCAN package of Python. Figure 6 presents the distribution of sentiments for the anomalies.

Figure 6.

Distribution of sentiments across observed anomalies.

The figure clearly shows that the distribution follows a bimodal distribution, which is pretty visible in all the categories used. It also shows that when the sentiment strength is low to medium, the anomalies in positive and negative sentiments are in higher presence, but when the strength of the sentiments increases, the anomalies of the neutral sentiment topics increase, and positive and negative sentiments almost become negligible. This shows that there are multiple sub-groups that are present in the entire anomaly distribution (Aslam et al., 2020).

The valence method measures the sentiment ranges, where the lower score show is greater disproportion towards negative sentiment, and a higher score shows a greater disproportion towards positive sentiment. The texts are divided into bins of the same size (here: size as 1) which contain a set of posts made by the users with their corresponding sentiments, which are then used to find the valence in the input bins. The results of the method are shown in Figure 7.

Figure 7.

Valence relationship in the posts of the subreddits.

The findings show a very strong and positive linear relationship between stimulus and response valence. The result is also statistically significant (p < .000, SE = 0.026). This shows the presence of strong positive and negative emotional contagion in the contents, with a clear indication that a strong negative stimulus triggers a strong negative response, and vice-versa, including the neutral stimulus-response.

Discussion

The study aimed to evaluate the emotional perspective, emotional wellbeing, and emotional contagion of the users of Reddit, especially on the popular subreddits of COVID-19. Different from the studies carried out on Facebook (Kim et al., 2021; Kramer et al., 2014; Sturm Wilkerson et al., 2021), Twitter (Arora et al., 2021; Ferrara & Yang, 2015; Park et al., 2021), News Headlines (Aslam et al., 2020; Srivastava & Deepak, 2021), etc. where the control variables included only the content shared by users on their social media, this study tried to evaluate the sentiment and emotional content of the posts shared on Reddit.

The findings of the sentiment analysis showed that there is a high connection between posts on Reddit and emotional and sentiment polarity. They especially showed a high emotional score to negative posts shared on Reddit. The outbreak of the pandemic disease and the ineptitude of the governments to prepare for the treatments are not lost on the Redditors, and that has created a sense of fear, uncertainty, and anxiety that is not helpful for the mental and emotional wellbeing of the netizens.

The obviousness of the edge of negative words over positive words presented itself when the cluster of those words was done according to their sentiments. The initial word cloud and the emotional intensity graph showed that there was a slight edge towards the negativity in the sentiments on the discourse, which was accentuated when talking about the medical authority and the allied institutions, and when the discussion was about the disease itself. This finding is in line with some recent studies that have looked into other platforms (Domalewska, 2021; Gulati et al., 2022) who have also found similar edges to negative sentiments over positive sentiments in social network interactions.

The important finding was that there was always an undertone of negative sentiment, in almost all of the posts on Reddit. It also showed that the rising death numbers and loss of loved ones have been leading to chronic mental disorders, which is echoed in the findings of S. Das and Dutta (2021), which showed that mass quarantines and lockdowns have led to overall online community anxiety, and isolation is not helping at all.

It is also important to understand that misleading information, misinformation, or conspiracies surrounding the disease are bound to escalate the fear and anxiety levels amongst the public. The previous literatures studying the SARS/MERS virus outbreaks have clearly shown that the emotional wellbeing of the people can be increased by keeping them well-tested, medicated and informed, stopping the incidents to rise to the situation of mass hysteria, which might make the people who are negative, or are feeling low to mild symptoms, or symptoms of similar viral infections, to feel high levels of uncertainty, anxiety, and concern, thereby further increasing their stress levels, and also of the frontline workers serving and treating them.

This kind of mass hysteria can act as the prompt to unnecessary social and cultural discrimination, which was seen across the globe with the rise in civil unrests during the peak isolation periods. This was visible when the fluctuations in the presence of sentiments were measured, which showed clear spikes around three important time frames of COVID-19, the first being the stage when the WHO declared COVID-19 as an official global pandemic, the second was when the massive lockdowns were implemented around the globe, and the third was the COVID-19 vaccines were first approved and administered. This is important to understand because if medical and administrative leaders want to be fair in figuring out solutions to catastrophic problems, they need to learn about the sentiment intensity of their constituents, especially if they are from a marginalized group. The weaker sections of the society could be extorted into not receiving proper medical care, leading to further breakdown of the societies and economic developments. The implications are quite severe and explicitly harmful (Akroyd et al., 2020), and could cause counterfeit emotions, or emotional contagion (Rincón-Aznar et al., 2020).

The interesting outcome of these findings was that the sentiment spikes were similar in both the emotions around the same time, which is inconsistent with previous findings of sentiment analysis (Crocamo et al., 2021; Srivastava & Deepak, 2021). The difference in these findings can be understood with the findings of Eghtesadi and Florea (2020), who argued that Reddit, as compared to its counterparts like Facebook, Twitter, and Instagram, is comparatively less polarized; and as Allgaier (2016) notes that Reddit is more of an online forum, but others are social network sites. This might be a reason why the spikes were consistent in their time frames and the sentiments themselves.

The findings of emotional contagion showed an irrefutable presence of contagion in all the posts shared on Reddit, irrespective of the sentiment polarities of the text. It was observed that, on average, there is an overall over-exposure of negative or positive posts that generates a similar response. There was a weak presence of anomalies of negative or positive posts, but a very strong presence of anomalies of neutral posts.

As there are now increasingly substantial understandings that non-verbal cues and the absence of interactions can also lead to emotional contagion (Ferrara & Yang, 2015), it was important to see whether even the mentions of “covid” triggered valence. It was found affirmative that the mentions of this disease had an intense emotional reaction, which attributes to the idea that mentions of “covid” are an emotionally charged and behaviorally replicable phenomenon.

There was also a strong positive relationship between the valence stimuli and response, suggesting a strong presence of emotional contagion in the data, which is also present in the previous studies (S. Das & Dutta, 2021; Pano & Kashef, 2020). The relationship was also linear, which essentially amounts to similar contagion regulating both negative as well as positive emotions. We can also determine through the stimuli-response valence that the susceptibility of high-end-high-interaction users (Valenzano et al., 2020) are more prone to feeling the emotion that is highly present in a Reddit post, despite its informative value.

The study was not immune to shortcomings. One clear shortcoming was the data itself. The data covered the head posts shared by the users, but failed to collect all the comments and response threads of these posts. Despite the time frame of the posts collected (almost a year), the absence of a comment thread posed a loss of nuance in the interactions between the users. This would allow measuring the additional contexts to users’ experience, and the pushback to the misinformation on incorrect posts/news. Future studies might try and study the posts and the associated comments in their full context to extract homophily in the users and figure out the community structure. It must be said that these limitations are present in all the studies using big data, NLP, and sentiment analysis as tools of analysis, and do not affect the overall validity and soundness of this study.

Conclusion

In order to understand the impact that online discourse has on the emotional wellbeing, emotional perspective, and emotional contagion of the users, it is imperative to recognize that the average person has made social media their primary source of information, be it for news, community, advice, or any other. There have been numerous studies in recent times that have tried to evaluate its impact on people’s mental health. With this study, we have tried to make the following additions to the literature; first, we have tried to find the sentiment trend of the Redditors when it comes to COVID-19 discourse on their subreddits, by using the largest extraction of post data for almost a year. Second, it has tried to figure out whether the emotional contagion plays a role in the posts and their corresponding responses.

By evaluating the sentiments through the word clouds and variations, this study has tried to figure out the evolution of the attitude of the people connecting on these subreddits, which adds to the growing body of the use of Natural Language Processing while studying about COVID-19 and its impacts on public lives. By evaluating the anomalies and valence of the posts, the study argues the presence of emotional contagion in all the occurrence of emotional stimuli, which triggers a similar strength emotional response. This finding can also be corroborated in the previous literature.

The study was not without its limitations. The major limitation was the evaluation of just the head posts and not the corresponding replies or comments from other users. This robs these analyses of the necessary contexts, especially when the posts are partially misleading, or otherwise. Another one was the language, as all the posts were in the English language, and other language texts were removed in data pre-processing. Future research can focus on translating these other language posts to find a more comprehensive meaning to the posts shared, along with combining the posts with their responses and replies to assess the contexts of the content associated with the head posts. This would also help us understand how the community understanding has been helping people dealing with COVID-19 or navigating life in the aftermath of COVID-19. This would allow us to create an optimal processing strategy, as the opinions and emotions of the pandemic and infodemic changes and evolves.

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Shardul Shankar

Data Availability Statement

The datasets generated during and/or analyzed during the current study are available in the GITHUB repository,

References

Akroyd

Harrington

Nastase

(2020). Rapid literature review: Governance and state capability. Oxford Policy Management.

Alamoodi

A. H.

Zaidan

B. B.

Zaidan

A. A.

Albahri

O. S.

Mohammed

K. I.

Malik

R. Q.

Almahdi

E. M.

Chyad

M. A.

Tareq

Albahri

A. S.

Hameed

Alaa

(2021). Sentiment analysis and its applications in fighting COVID-19 and infectious diseases: A systematic review. Expert Systems with Applications, 167, 114155. https://doi.org/10.1016/j.eswa.2020.114155

Allgaier

(2016). Science and South Park, Reddit and Facebook, Leonardo da Vinci and the Vitruvian Man, and modern fairy tales about emerging technologies: Science communication and popular culture. Journal of Science Communication, 15(2), C01.

Alpert

L. I.

(2020). Coronavirus misinformation spreads on Facebook, watchdog says. The World Street Journal. https://www.wsj.com/articles/coronavirus-misinformation-spreadson-facebook-watchdog-says-11587436159

Altamura

Iuso

D’Andrea

D’Urso

Piccininni

Angelini

Francesco

Margaglione

Padulo

Fairfield

Petito

Bellomo

(2019). Maladaptive coping strategies and neuroticism mediate the relationship between 5HTT-LPR polymorphisms and symptoms of anxiety in elite athletes. Clinical Neuropsychiatry, 16(1), 62. https://doi.org/10.1101/493320

Amin

Hossain

Akther

Alam

K. M.

(2019, February). Bengali VADER: A sentiment analysis approach using modified VADER [Conference session]. 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE) (pp. 1–6). IEEE.

Angiani

Ferrari

Fontanini

Fornacciari

Iotti

Magliani

Manicardi

(2016). A comparison between preprocessing techniques for sentiment analysis in Twitter [Paper presentation]. Proceedings of Conference on KDWeb, September 2016.

Apuke

O. D.

Omar

(2020). How do Nigerian newspapers report COVID-19 pandemic? The implication for awareness and prevention. Health Education Research, 35(5), 471–480.

Apuke

O. D.

Omar

(2021). Fake news and COVID-19: Modelling the predictors of fake news sharing among social media users. Telematics and Informatics, 56, 101475.

10.

Arora

Chakraborty

Bhatia

M. P. S.

Mittal

(2021). Role of emotion in excessive use of Twitter during COVID-19 imposed lockdown in India. Journal of Technology in Behavioral Science, 6(2), 370–377.

11.

Aslam

Awan

T. M.

Syed

J. H.

Kashif

Parveen

(2020). Sentiments and emotions evoked by news headlines of coronavirus disease (COVID-19) outbreak. Humanities and Social Sciences Communications, 7(1), 1–9.

12.

Barkur

Vibha

G. B. K.

(2020). Sentiment analysis of nationwide lockdown due to COVID 19 outbreak: Evidence from India. Asian Journal of Psychiatry, 51, 102089.

13.

BBC. (2020, December 8). Covid-19 vaccine: First person receives Pfizer Jab in UK. BBC News. Retrieved April 5, 2022, from https://www.bbc.com/news/uk-55227325

14.

Bermingham

Smeaton

A. F.

(2010). Crowdsourced real-world sensing: Sentiment analysis and the real-time web [Paper presentation]. AICS 2010—Sentiment Analysis Workshop at Artificial Intelligence and Cognitive Science, Galway, Ireland (pp. 1–8).

15.

Bloem

J. R.

Salemi

(2021). COVID-19 and conflict. World Development, 140, 105294.

16.

Bonta

Janardhan

N. K. N.

(2019). A comprehensive study on lexicon based approaches for sentiment analysis. Asian Journal of Computer Science and Technology, 8(S2), 1–6.

17.

Borg

Boldt

(2020). Using VADER sentiment and SVM for predicting customer response sentiment. Expert Systems with Applications, 162, 113746.

18.

Bunting

A. M.

Frank

Arshonsky

Bragg

M. A.

Friedman

S. R.

Krawczyk

(2021). Socially-supportive norms and mutual aid of people who use opioids: An analysis of Reddit during the initial COVID-19 pandemic. Drug and Alcohol Dependence, 222, 108672.

19.

Cambria

Schuller

Xia

Havasi

(2013). New avenues in opinion mining and sentiment analysis. IEEE Intelligent Systems, 28(2), 15–21.

20.

Chakraborty

Bhatia

Bhattacharyya

Platos

Bag

Hassanien

A. E.

(2020). Sentiment analysis of COVID-19 tweets by deep learning classifiers—A study to show how popularity is affecting accuracy in social media. Applied Soft Computing, 97, 106754.

21.

Chen

Xia

(2022). Social network behavior and public opinion manipulation. Journal of Information Security and Applications, 64, 103060.

22.

Chen

Min

Zhang

Wang

Evans

(2020). Unpacking the black box: How to promote citizen engagement through government social media during the COVID-19 crisis. Computers in Human Behavior, 110, 106380.

23.

Cowen

A. S.

Laukka

Elfenbein

H. A.

Liu

Keltner

(2019). The primacy of categories in the recognition of 12 emotions in speech prosody across two cultures. Nature Human Behaviour, 3(4), 369–382.

24.

Crocamo

Viviani

Famiglini

Bartoli

Pasi

Carrà

(2021). Surveilling COVID-19 emotional contagion on Twitter by sentiment analysis. European Psychiatry, 64(1).

25.

Das

Dutta

(2020). SGLT2 inhibition and COVID-19: The road not taken. European Journal of Clinical Investigation, 50(12), e13339.

26.

Das

R. P.

(2017). Notions of “contagion” in classical Indian medical texts. In Conrad

L. I.

Wujastyk

(Eds.), Contagion: Perspectives from pre-modern societies (pp. 55–78). Routledge.

27.

Das

Dutta

(2021). Characterizing public emotions and sentiments in COVID-19 environment: A case study of India. Journal of Human Behavior in the Social Environment, 31(1–4), 154–167.

28.

de Las Heras-Pedrosa

Sánchez-Núñez

Peláez

J. I

. (2020). Sentiment analysis and emotion understanding during the COVID-19 pandemic in Spain and its impact on digital ecosystems. International Journal of Environmental Research and Public Health, 17(15), 5542.

29.

Domalewska

(2021). An analysis of COVID-19 economic measures and attitudes: Evidence from social media mining. Journal of Big Data, 8(1), 1–14.

30.

Dubey

A. D.

(2020). Twitter sentiment analysis during COVID-19 outbreak. Available at SSRN 3572023.

31.

Eghtesadi

Florea

(2020). Facebook, Instagram, Reddit and TikTok: A proposal for health authorities to integrate popular social media platforms in contingency planning amid a global pandemic outbreak. Canadian Journal of Public Health, 111(3), 389–391.

32.

Ekman

(1992). An argument for basic emotions. Cognition & Emotion, 6(3–4), 169–200.

33.

Elbagir

Yang

(2020). Sentiment analysis on Twitter with Python’s natural language toolkit and VADER sentiment analyzer [Conference session]. IAENG Transactions on Engineering Sciences: Special Issue for the International Association of Engineers Conferences 2019 (pp. 63–80).

34.

Feldman

(2013). Techniques and applications for sentiment analysis. Communications of the ACM, 56(4), 82–89.

35.

Ferrara

Yang

(2015). Measuring emotional contagion in social media. PLoS One, 10(11), e0142390.

36.

Fowler

J. H.

Christakis

N. A.

(2008). Dynamic spread of happiness in a large social network: Longitudinal analysis over 20 years in the Framingham Heart Study. British Medical Journal, 337(dec04 2), 2338. https://doi.org/10.1136/bmj.a2338

37.

Frenkel

Alba

Zhong

(2020). Surge of virus misinformation stumps Facebook and Twitter. The New York Times, 8.

38.

Friedman

S. M.

Dunwoody

Rogers

C. L.

(Eds.). (1999). Communicating uncertainty: Media coverage of new and controversial science. Routledge.

39.

Gulati

Kumar

S. S.

Boddu

R. S. K.

Sarvakar

Sharma

D. K.

Nomani

M. Z. M.

(2022). Comparative analysis of machine learning-based classification models using sentiment classification of tweets related to COVID-19 pandemic. Materials Today: Proceedings, 51, 38–41.

40.

Hassan

(2021). Retrieved June 21, 2021, from https://headtopics.com/ng/covid-19-the-dual-threat-of-a-virusand-a-fake-news-epidemic-by-idayat-hassan-premium-times-opin-12109663

41.

Hatfield

Cacioppo

J. T.

Rapson

R. L.

(1992). Primitive emotional contagion. In Clark

M. S.

(Ed.), Emotion and social behavior: Review of personality and social psychology (pp. 151–177). Sage.

42.

Hatfield

Cacioppo

J. T.

Rapson

R. L.

(1993). Emotional contagion. Current Directions in Psychological Science, 2(3), 96–100.

43.

Hou

Chen

Zhou

Hua

Yuan

Guo

Zhang

Jia

Zhao

Zhang

(2020). The effectiveness of quarantine of Wuhan city against the Corona Virus Disease 2019 (COVID-19): A well-mixed SEIR model analysis. Journal of Medical Virology, 92(7), 841–848.

44.

Hsu

L.-Y.

Lee

C.-C.

Green

J. A.

Ang

Paton

N. I.

Lee

Villacian

J. S.

Lim

P.-L.

Earnest

Leo

Y.-S.

(2003). Severe acute respiratory syndrome (SARS) in Singapore: Clinical features of index patient and initial contacts. Emerging Infectious Diseases, 9(6), 713.

45.

Hutto

C. J.

(2014). VADER-sentiment-analysis, GitHub. Retrieved June 21, 2021, from https://github.com/cjhutto/vaderSentiment

46.

Hutto

C. J.

Gilbert

(2014, May). VADER: A parsimonious rule-based model for sentiment analysis of social media text [Conference session]. Proceedings of the International AAAI Conference on Web and Social Media (Vol. 8, No. 1, pp. 216–225).

47.

Hutto

C. J.

Gilbert

(2021). VADER: A parsimonious rule-based model for sentiment analysis of social media. Aaai.org. Retrieved June 21, 2021, from https://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8109

48.

Huynh

T. L. D.

(2020). Does culture matter social distancing under the COVID-19 pandemic? Safety Science, 130, 104872.

49.

Jain

V. K.

Kumar

(2015). An effective approach to track levels of influenza-A (H1N1) pandemic in India using twitter. Procedia Computer Science, 70, 801–807.

50.

Kharde

Sonawane

(2016). Sentiment analysis of Twitter data: A survey of techniques. International Journal of Computer Applications, 139(11), 5–15.

51.

Killerby

M. E.

Biggs

H. M.

Midgley

C. M.

Gerber

S. I.

Watson

J. T.

(2020). Middle East respiratory syndrome coronavirus transmission. Emerging Infectious Diseases, 26(2), 191.

52.

Kim

Uddin

Z. A.

Lee

Nasri

Gill

Subramanieapillai

Lee

Udovica

Phan

Lui

Iacobucci

Mansur

R. B.

Rosenblat

J. D.

McIntyre

R. S.

(2021). A systematic review of the validity of screening depression through Facebook, Twitter, Instagram, and Snapchat. Journal of Affective Disorders, 286, 360–369. https://doi.org/10.1016/j.jad.2020.08.091

53.

Kramer

A. D.

Guillory

J. E.

Hancock

J. T.

(2014). Experimental evidence of massive-scale emotional contagion through social networks. Proceedings of the National Academy of Sciences, 111(24), 8788–8790.

54.

Lampos

Majumder

M. S.

Yom-Tov

Edelstein

Moura

Hamada

Rangaka

M. X.

McKendry

R. A.

Cox

I. J.

(2021). Tracking COVID-19 using online search. NPJ Digital Medicine, 4(1), 1–11.

55.

Las Johansen

B. C

. (2018). Deciphering west Philippine Sea: A Plutchik and VADER algorithm sentiment analysis. Indian Journal of Science and Technology, 11, 47.

56.

Lee

Hui

Chan

Cameron

Joynt

G. M.

Ahuja

Yung

M. Y.

Leung

C. B.

K. F.

Lui

S. F.

(2003). A major outbreak of severe acute respiratory syndrome in Hong Kong. New England Journal of Medicine, 348(20), 1986–1994.

57.

Liang

Zhu

Guo

Liu

Zhou

Chin

D. P.

Schuchat

& Beijing Joint SARS Expert Group. (2004). Severe acute respiratory syndrome, Beijing, 2003. Emerging Infectious Diseases, 10(1), 25.

58.

Liu

(2012). Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies, 5(1), 1–167.

59.

Liu

B. F.

Kim

(2011). How organizations framed the 2009 H1N1 pandemic via social and traditional media: Implications for US health communicators. Public Relations Review, 37(3), 233–244.

60.

Low

D. M.

Rumker

Talkar

Torous

Cecchi

Ghosh

S. S.

(2020). Natural language processing reveals vulnerable mental health support groups and heightened health anxiety on Reddit during covid-19: Observational study. Journal of Medical Internet Research, 22(10), e22635.

61.

Mäntylä

M. V.

Graziotin

Kuutila

(2018). The evolution of sentiment analysis—A review of research topics, venues, and top cited papers. Computer Science Review, 27, 16–32.

62.

Medhat

Hassan

Korashy

(2014). Sentiment analysis algorithms and applications: A survey. Ain Shams Engineering Journal, 5(4), 1093–1113.

63.

Merriam-Webster. (2022). Contagion definition & meaning. Merriam-Webster. Retrieved April 3, 2022, from https://www.merriam-webster.com/dictionary/contagion

64.

Mohammad

S. M.

(2015). NRC emotion lexicon. Saifmohammad.com. Retrieved June 21, 2021, from https://saifmohammad.com/WebPages/NRC-Emotion-Lexicon.htm

65.

Mohammad

S. M.

Turney

P. D.

(2013). Crowdsourcing a word–emotion association lexicon. Computational Intelligence, 29(3), 436–465.

66.

Novel Coronavirus—China. (2021). Retrieved June 21, 2021, from https://www.who.int/csr/don/12-january-2020-novel-coronavirus-china/en/

67.

Oyewusi

W. F.

Adekanmbi

Akinsande

(2020). Semantic enrichment of Nigerian Pidgin English for contextual sentiment classification. arXiv preprint arXiv:2003.12450.

68.

Pang

Lee

(2008). Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval, 2(1–2), 1–135.

69.

Pang

Lee

Vaithyanathan

(2002, July). Thumbs up? Sentiment classification using machine learning techniques [Conference session]. Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP 2002) (pp. 79–86).

70.

Pano

Kashef

(2020). A complete VADER-based sentiment analysis of bitcoin (BTC) tweets during the era of COVID-19. Big Data and Cognitive Computing, 4(4), 33.

71.

Park

Strover

Choi

Schnell

(2021). Mind games: A temporal sentiment analysis of the political messages of the internet research agency on Facebook and Twitter. New Media & Society, 25(3), 463–484.

72.

Pennycook

McPhetres

Zhang

J. G.

Rand

D. G.

(2020). Fighting COVID-19 misinformation on social media: Experimental evidence for a scalable accuracy-nudge intervention. Psychological Science, 31(7), 770–780.

73.

Piryani

Madhavi

Singh

V. K.

(2017). Analytical mapping of opinion mining and sentiment analysis research during 2000–2015. Information Processing & Management, 53(1), 122–150.

74.

Plutchik

(1994). The psychology and biology of emotion. HarperCollins.

75.

Prabowo

Thelwall

(2009). Sentiment analysis: A combined approach. Journal of Informetrics, 3(2), 143–157.

76.

Pulido

C. M.

Villarejo-Carballido

Redondo-Sama

Gómez

(2020). COVID-19 infodemic: More retweets for science-based information on coronavirus than for false information. International Sociology, 35(4), 377–392.

77.

Rajkumar

R. P.

(2020). COVID-19 and mental health: A review of the existing literature. Asian Journal of Psychiatry, 52, 102066.

78.

Rat

Olivier

Dutot

(2020). SARS-CoV-2 vs. SARS-CoV-1 management: Antibiotics and inflammasome modulators potential. European Review for Medical and Pharmacological Sciences, 24(14), 7880–7885.

79.

Rianto

Pratama

A. R.

(2021). Sentiment analysis of COVID-19 vaccination posts on Facebook in Indonesia with crowdtangle. Jurnal Riset Informatika, 3(4), 353–362.

80.

Rincón-Aznar

Mao

Tong

(2020). Global value chains and economic dislocations: Introduction. National Institute Economic Review, 252, R1–R3.

81.

Rubin

G. J.

Wessely

(2020, February 7). Coronavirus: The psychological effects of quarantining a city. The BMJ. Retrieved April 3, 2022, from https://blogs.bmj.com/bmj/2020/01/24/coronavirus-the-psychological-effects-of-quarantining-a-city/

82.

Russonello

(2020, March 13). Afraid of coronavirus? That might say something about your politics. The New York Times.

83.

Sahu

K. K.

Mishra

A. K.

Lal

(2020). Comprehensive update on current outbreak of novel coronavirus infection (2019-nCoV). Annals of Translational Medicine, 8(6), 393.

84.

Sasaki

Nishiyama

Okoshi

Nakazawa

(2021). Investigating the occurrence of selfie-based emotional contagion over social network. Social Network Analysis and Mining, 11(1), 1–14.

85.

Shankar

Breithaupt

(2019). The dark sides of empathy ( Hamilton

A. B.

, trans.; 233 pp, $19.64 [USD]£ 14.53 [GBP]). Cornell University Press. https://doi.org/10.7591/9781501735608

86.

Shankar

Tewari

(2021a). Impact of collective intelligence and collective emotional intelligence on the psychological safety of the organizations. Vision, 27(4), 458–473.

87.

Shankar

Tewari

(2021b). Understanding the emotional intelligence discourse on social media: Insights from the analysis of twitter. Journal of Intelligence, 9(4), 56.

88.

Shankar

Vyas

Tewari

(2021). Applying machine learning algorithms to determine and predict the reasons and models for employee turnover. International Journal of Information Technology and Management, 23(1), 48–63.

89.

Sharma

Yadav

Ferdinand

K. C.

(2017). Zika virus pandemic—Analysis of Facebook as a social media health information platform. American Journal of Infection Control, 45(3), 301–302.

90.

Shukla

(2021). COVID-19 pandemic: An analysis of popular YouTube videos as an alternative health information platform. Health Informatics Journal, 27(2), 1–27.

91.

Sohrabi

Alsafi

O’neill

Khan

Kerwan

Al-Jabir

Iosifidis

Agha

(2020). World Health Organization declares global emergency: A review of the 2019 novel coronavirus (COVID-19). International Journal of Surgery, 76, 71–76.

92.

Sokolova

Bobicev

(2013, September). What sentiments can be found in medical forums? [Conference session] Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013 (pp. 633–639).

93.

Srivastava

R. A.

Deepak

(2021). PIREN: Prediction of intermediary readers’ emotion from news-articles. In Shukla

Unal

Kureethara

J. V.

Mishra

D. K.

Han

D. S.

(Eds.), Data science and security (Lecture Notes in Networks and Systems; Vol. 290, pp. 122–130). Springer.

94.

Steinert

(2021). Corona and value change. The role of social media and emotional contagion. Ethics and Information Technology, 23(1), 59–68.

95.

Strapparava

Mihalcea

(2008, March). Learning to identify emotions in text [Symposium]. Proceedings of the 2008 ACM Symposium on Applied Computing (pp. 1556–1560).

96.

Strapparava

Valitutti

Stock

(2006, May). The affective weight of lexicon [Conference session]. Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy (pp. 423–426).

97.

Sturm Wilkerson

Riedl

M. J.

Whipple

K. N

. (2021). Affective affordances: Exploring Facebook reactions as emotional responses to hyperpartisan political news. Digital Journalism, 9(8), 1040–1061.

98.

Taboada

Brooke

Tofiloski

Voll

Stede

(2011). Lexicon-based methods for sentiment analysis. Computational Linguistics, 37(2), 267–307.

99.

Thelwall

Buckley

Paltoglou

Cai

Kappas

(2010). Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technology, 61(12), 2544–2558.

100.

Thompson

C. M.

Rhidenour

K. B.

Blackburn

K. G.

Barrett

A. K.

Babu

(2022). Using crowdsourced medicine to manage uncertainty on Reddit: The case of COVID-19 long-haulers. Patient Education and Counseling, 105(2), 322–330.

101.

Tsang

K. W.

P. L.

Ooi

G. C.

Yee

W. K.

Wang

Chan-Yeung

Lam

W. K.

Seto

W. H.

Yam

L. Y.

Cheung

T. M.

Wong

P. C.

(2003). A cluster of cases of severe acute respiratory syndrome in Hong Kong. New England Journal of Medicine, 348(20), 1977–1985.

102.

Turney

P. D.

(2002, July). Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews [Paper presentation]. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (pp. 417–424).

103.

Turney

P. D.

Littman

M. L.

(2003). Measuring praise and criticism: Inference of semantic orientation from association. ACM Transactions on Information Systems (TOIS), 21(4), 315–346.

104.

Tymann

Lutz

Palsbröker

Gips

(2019, September). GerVADER—A German adaptation of the VADER sentiment analysis tool for social media texts [Conference session]. Proceedings of the Conference on “Lernen, Wissen, Daten, Analysen”—LWDA 2019 (pp. 178–189).

105.

Valenzano

Scarinci

Monda

Sessa

Messina

Monda

Precenzano

Mollica

M. P.

Carotenuto

Messina

Cibelli

(2020). The social brain and emotional contagion: Covid-19 effects. Medicina, 56(12), 640. https://doi.org/10.3390/medicina56120640

106.

Venkata-Subramani

Roman

(2020). The coronavirus response in India—World’s largest lockdown. The American Journal of the Medical Sciences, 360(6), 742–748.

107.

Volkmer

I. (Rep.)

. (2021). Social media and COVID-19: A global study of digital crisis interaction among Gen Z and millennials (pp. 1–67). University of Melbourne.

108.

Wang

Horby

P. W.

Hayden

F. G.

Gao

G. F.

(2020). A novel coronavirus outbreak of global health concern. The Lancet, 395(10223), 470–473.

109.

Wells

(2006). The metacognitive model of worry and generalised anxiety disorder. In Davey

G. C. L.

Wells

(Eds.), Worry and its psychological disorders: Theory, assessment and treatment (pp. 179–199). Wiley.

110.

Wilson

Hoffmann

Somasundaran

Kessler

Wiebe

Choi

Cardie

Riloff

Patwardhan

(2005, October). OpinionFinder: A system for subjectivity analysis [Conference session]. Proceedings of HLT/EMNLP 2005 Interactive Demonstrations (pp. 34–35).

111.

World Health Organization (WHO). (2020a). WHO issues its first emergency use validation for a COVID-19 vaccine and emphasizes need for equitable global access. World Health Organization. Retrieved April 5, 2022, from https://www.who.int/news/item/31-12-2020-who-issues-its-first-emergency-use-validation-for-a-covid-19-vaccine-and-emphasizes-need-for-equitable-global-access

112.

World Health Organization (WHO). (2020b, February 11). Naming the coronavirus disease (COVID-19) and the virus that causes it. World Health Organization. Retrieved April 5, 2022, from https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease-(covid-2019)-and-the-virus-that-causes-it

113.

World Health Organization (WHO). (2021a). Managing the COVID-19 infodemic: Promoting healthy behaviours and mitigating the harm from misinformation and disinformation. World Health Organization. Retrieved June 21, 2021, from https://www.who.int/news/item/23-09-2020-managing-the-covid-19-infodemic-promoting-healthy-behaviours-and-mitigating-the-harm-from-misinformation-and-disinformation

114.

World Health Organization (WHO). (2021b). Naming the coronavirus disease (COVID-19) and the virus that causes it. World Health Organization. Retrieved June 21, 2021, from https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease-(covid-2019)-and-the-virus-that-causes-it

115.

Zaki

A. M.

Van Boheemen

Bestebroer

T. M.

Osterhaus

A. D.

Fouchier

R. A.

(2012). Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. New England Journal of Medicine, 367(19), 1814–1820.

116.

Zeng-Treitler

Goryachev

Tse

Keselman

Boxwala

(2008). Estimating consumer familiarity with health terminology: A context-based approach. Journal of the American Medical Informatics Association, 15(3), 349–356.

117.

Zhang

J. S.

Keegan

B. C.

Tan

(2020). A tale of two communities: Characterizing Reddit response to COVID-19 through/r/China_Flu and/r/Coronavirus. arXiv Preprint ID:PPR268927.

118.

Zhou

W. K.

Wang

A. L.

Xia

Xiao

Y. N.

Tang

S. Y.

(2020). Effects of media reporting on mitigating spread of COVID-19 in the early phase of the outbreak. Mathematical Biosciences and Engineering: MBE, 17(3), 2693–2707.

Measuring Emotional Wellbeing and Emotional Contagion Through Sentiments and Emotions Evoked by Social Media for COVID-19

Abstract

Keywords

Introduction

Review of Literature

Interplay Between Online Discourse and Emotional Contagion

Method

Data

Data Pre-Processing

Sentiment Analysis

Valence Aware Dictionary and Sentiment Reasoner (VADER)

Results

Sentiment Analysis

Effect of Emotional Contagion

Discussion

Conclusion

Footnotes

Declaration of Conflicting Interests

Funding

ORCID iD

Data Availability Statement

References