Abstract
Online incivility and harassment in political communication have become an important topic of concern among politicians, journalists, and academics. This study provides a descriptive account of uncivil interactions between citizens and politicians on Twitter. We develop a conceptual framework for understanding the dynamics of incivility at three distinct levels: macro (temporal), meso (contextual), and micro (individual). Using longitudinal data from the Twitter communication mentioning Members of Congress in the United States across a time span of over a year and relying on supervised machine learning methods and topic models, we offer new insights about the prevalence and dynamics of incivility toward legislators. We find that uncivil tweets represent consistently around 18% of all tweets mentioning legislators, but with spikes that correspond to controversial policy debates and political events. Although we find evidence of coordinated attacks, our analysis reveals that the use of uncivil language is common to a large number of users.
. . . And then I had no idea what was about to happen next. My Twitter feed basically exploded. [. . .] I began to see images, for example, of my youngest daughter, who we adopted from Ethiopia many years ago, who at the time was 7 years old—images of her in a gas chamber with a—Donald Trump in an SS uniform about to push the button to kill her.
To see the attack of a pack on here check out my mentions 600 odd notifications talking about my rape in one night. I think twitter is dead
Introduction
Politics has always been an arena of heated argumentation. Witty, caustic, ironic, and oftentimes vitriolic verbal exchanges are part of a discourse that defines political competition and delineates power relations between political actors. At the same time, such exchanges often provide signals about what is permissible in public discourse. A politician’s choice of words, therefore, can as much restrain and reconciliate, as it can spread division and elevate the status of offensiveness from unacceptable to routine.
Operating in an arena where strongly caustic language and argumentation are common, one would expect that politicians would be unsurprised with (and resilient against) the outcome of the public’s newly acquired capacity to address them directly via new media channels, such as through the microblogging platform Twitter. However as high as a politician’s public approval may be, it is no secret that citizens are cynical toward politicians (Hay, 2007). After all, parties are the democratic institution that tends to be the least trusted of all others, in Europe as well as in the United States (Capella & Jamieson, 1997; Torcal & Montero, 2006). Thus, an anonymous platform for direct and publicly visible communication with one’s representatives can be a natural channel for citizens to communicate some of this frustration in the form of heated remarks. But what if this communication entails more than just a bit of justified frustration expressed in cynical language? What if this interaction is uncivil and entails prejudiced, hurtful, discouraging, and damaging language that is unpreventable and constant?
Online incivility and harassment in political communication have become an important topic of concern among politicians, journalists and academics. Although the phenomenon of rising incivility within the political arena (Mutz, 2015; Uslaner, 1993) and in relation to it (Funk, 2001; Sigelman & Bullock, 1991) has been widely discussed (Mutz & Reeves, 2005), social media communication has reinvigorated this debate by highlighting new aspects to consider due to platforms’ varied affordances. Empowered by the interactive capacities of platforms like Twitter or Facebook, individual users (but also bots) can now directly and publicly address comments to their representatives under conditions of anonymity. The capacity of social media to strengthen the relationship between representatives and their constituents has long been seen as a potentially major advance toward a more inclusive public sphere and toward strengthening public deliberation (Coleman, 2005). Yet, the danger of incivility—and the further concern of its normalization—in such anonymous environments looms large (Coe et al., 2014; Sobieraj & Berry, 2011) and can have important consequences for democracy.
Previous research has found that incivility has strong negative effects on attitudes and behaviors at many different levels (Anderson et al., 2013; Gervais, 2015; Massaro & Stryker, 2012). First, exposure to incivility
In this study, we focus our attention on uncivil interactions between citizens and politicians on Twitter, one of the most popular social media platforms. Twitter constitutes an important new arena of engagement between representatives and their constituents, with major consequences for democracy as, for the first time, the two sides are able to engage in—and both benefit from—direct communication. At the same time, this is also an arena about which, aside from the media coverage of famous cases, we still know little regarding the phenomenon of incivility, its dynamics, and its mechanisms. There are, however, clear indications that it is becoming a worrying trend that needs attention (BBC, 2017; The Guardian, 2016; Halliday, 2012; Hayes, 2008), to the extent that Twitter has been forced to experiment with new measures to crack down on it (Pham, 2017).
Our goal is to offer a descriptive account of the extent to which uncivil behavior toward politicians is prevalent on Twitter and to offer an exploratory look at what explains variation over time and across politicians. We use longitudinal data from the Twitter communication mentioning members of the U.S. Congress across a time span of over a year. This allows us to observe uncivil behavior toward politicians as it unfolds, not only during heated events such as electoral campaigns but also during
We develop and test a set of expectations to explain incivility at three distinct levels—macro, meso, and micro: on the
Our results show that the level of incivility addressed to Members of Congress on Twitter is relatively stable over time, around 18% of all tweets. This figure corroborates findings from other studies concerning the amount of incivility aimed at U.S. Senators (Rheault et al., 2019). However, we also observe spikes that correspond to moments of controversy about specific policy issues, such as health care, or political events, such as the White nationalist rally in Charlottesville, VA. Regarding who is responsible for the uncivil tweets, we find that up to 36% of all unique users in our data set sent at least one uncivil tweet, although again we find evidence of coordinated attacks at specific moments in time. Finally, regarding who is the target of incivility, we actually do not find large differences based on the gender of the legislator; in contrast, party and ideology are more important determinants of who is victim of harassment on Twitter.
Four Dynamic Elements of Incivility on Social Media
Extant literature shows that politicians, especially in the United States and Europe, have adopted social media widely, as tools that can strengthen communication with their constituents (Barberá & Zeitzoff, 2017; Bode & Dalrymple, 2016; Gulati & Williams, 2010, 2013; Heiss et al., 2018; Jungherr, 2016; Nulty et al., 2016; Popa et al., 2019; Williams & Gulati, 2010). Twitter, in particular, allows them to reach voters directly, bypassing middlemen and offering the opportunity to marketize themselves and move closer to voters by presenting a more personalized version of themselves (Enli & Skogerbø, 2013; Jungherr, 2016; Karlsen & Skogerbø, 2015). Although some politicians make interactive use of Twitter, just as with other social media platforms, in their majority they tend to use the platform mostly during electoral campaigns and, in principle, as broadcasting rather than as an interactive tool (Graham et al., 2013, 2014; Jackson & Lilleker, 2011; Larsson, 2015; Williamson, 2010). This has been seen as an unfortunate state of affairs, as interactive use not only leads to more positive evaluations of politicians by citizens (Lee & Shin, 2012; Lyons & Veenstra, 2016) but can also be an opportunity for citizens to learn more about party platforms (Fazekas et al., 2018; Munger et al., 2016).
The one-directional use of social media by politicians has generally been theorized using supply-side explanations, including, for example, that politicians do not have sufficient time to engage, are not technologically up to speed, or are concerned about losing control of the message (Stromer-Galley, 2000; Ward & Lusoli, 2005). Yet the unique dynamics that emerge through citizen–elite interactions on social media—especially Twitter—open the door for other explanations focused on the newly acquired capacities of the demand side to directly address the supply side. In one of the few studies focusing on demand-side explanations, Tromble (2018) suggests that political elites may be failing to directly engage with citizens to avoid meeting hostility with hostility and because the risks of negative reciprocity are high. The important role of harassment and abuse by citizens as major preventive factors for politicians to engage with the public should not come as a surprise. Politicians have been the targets of abuse, threats, and violence long before the arrival of social media. As McLoughlin and Ward (2017) explain in their work on abuse of MPs in the British Parliament, politicians have traditionally attracted abuse due to their elevated public profile and the public’s image of them as people with power, with explanations for these tendencies lying less with psychological factors (James et al., 2016) and more with factors such as disenchantment, boredom, or elements of a broader “trolling” subculture (Every-Palmer et al., 2015; Marwick & Lewis, 2017).
While the phenomenon is not new, social media and their (mis)affordances have amplified and, as public commentators have argued (Kamps, 2016), perhaps more dangerously from a democratic point of view, normalized it (Pelled et al., 2018). As misaffordances, here we consider (a) the combined
Defining (in)civility is a challenging endeavor due to the complexity of the concept (Herbst, 2010, p. 12). Most scholars understand incivility as a multidimensional concept. For example, Sapiro suggests three dimensions, one mapping on good character and virtue, another one on manners, and a final one to its communicative nature (Sapiro, 1999); Sydnor emphasizes tone and perceives it as a continuum “that ranges from the polite to insults to racial slurs and obscenities” (Sydnor, 2019, p. 9); Muddiman (2017) distinguishes between personal- and public-level incivility, whereby the first focuses on rudeness and emotions and the second on aspects such as failing to recognize the legitimacy of opposing views. Distinctions in terms of tone and morality have also been drawn by Papacharissi (2004), who differentiated between impoliteness and incivility early, with the second linking closely to intolerant behavior and thus with the idea of disrespect toward collective democratic traditions. In more recent contributions, it is this last aspect that makes incivility a threatening element to democratic pluralism and distinguishes it from incivility as a rhetorical act—a device that does not necessarily have harmful effects (Rossini, 2019).
As defining incivility is beyond the scope of this study, we follow Herbst (2010, p. 12) and adopt a definition that makes sense of the level and nature of our empirical work. Accordingly, we perceive incivility more broadly as a “disrespectful discourse that silences or derogates alternative views” (Jamieson et al., 2017, p. 206). Building on previous studies acknowledging the importance of context, as well as that of public versus private levels of incivility (Muddiman, 2017), we argue that to understand the occurrence of incivility in Twitter conversations between citizens and politicians—as well as its consequences for the second—it is essential to develop designs that allow us to observe such behavior from a bird’s-eye view. Such a view can reveal the temporal and contextual levels at which such behavior occurs, as well as better capture uncivil conversational dynamics (such as virality) that can become mostly evident on social media platforms such as Twitter due to their distinct architecture. For example, incivility on Twitter may not be limited to an infrequent type of behavior exhibited sporadically by a few frustrated citizens wishing to speak their mind in a one-off, emotional impolite remark. Indeed, as the many cases of harassment on Twitter have shown—such as the hate, racist, and sexist campaigns toward Black actress Leslie Jones (Oluo, 2016), United Kingdom’s Labour MP Tulip Siddiq (Saner, 2016), or Canada’s Progressive Conservative politician Sandra Jensen (BBC, 2016), it may well be a coordinated and continuous effort from a group of people to cause distress, smear, and undermine someone’s public profile by continuously harassing her or by causing a cascade effect in which an impolite or uncivil remark may encourage others to join in.
Similarly, the many media reports about specific groups becoming more often than others targets of incivility (Saner, 2016) may be an indication that prejudice toward a group with specific characteristics may be a default trigger for incivility. These aspects of incivility can be found within the same communicative framework and can be systematically thought of as operating, and being spatially interlinked, on three levels (see Figure 1). We distinguish between the macro level, which allows us to observe how incivility unfolds across time; the meso level, which allows us to study the role of specific events or issues within the political agenda; and the more granular, micro level, which enables us to better understand who gets targeted and by whom. The different elements of incivility we analyze are represented schematically in Figure 1 and discussed separately in the following sections. While our study cannot speak directly to the effects of incivility as a result of these dynamics (e.g., demobilization), our conceptual framework makes the first step toward exploring the occurrence of incivility under different temporal and contextual conditions—as well as logics. As such, this framework can offer several points of departure for future research dealing both with the causes and the effects of the dynamics uncovered here.

Conceptual framework: The three levels of incivility.
Macro Level: Temporality
Temporality can be thought of as the highest level of our analysis. Scholarly work on Twitter use by European Union (EU) parliamentarians has shown that their Twitter communication is not characterized by permanence (Larsson, 2015). At the same time, studies focusing on online incivility (Coe et al., 2014) have shown that incivility tends to occur frequently and in response to interactive use on the part of politicians (Theocharis et al., 2016). Yet, previous work on political incivility on Twitter has been limited by the relatively short time periods examined, usually spanning no more than a month and on specific events. Studying abusive tweets aimed at U.K. parliamentarians for 2½ months, for example, McLoughlin and Ward (2017, p. 11) found evidence of significant groupings of abuse on particular days, concluding that abuse may not be a day-to-day occurrence but rather something fueled by outside factors. Indeed, while some of the existing approaches provide valuable information about the extent of uncivil interactions in specific cases, such as electoral campaigns, they focus on time periods in which political interest and discussion are heightened. As during such periods politicians tend to use Twitter more, patterns of incivility might differ from “quieter” periods in between elections. Our data set involves interactions between citizens and politicians for approximately a year, enabling us to study these nuances across lengthier time spans than any previous study, which allows us to examine also politically quiet and tense periods. We expect that, in general, incivility is not a constant state of affairs in the Twitter communication of politicians, but a phenomenon that peaks around highly mediated events which may not be limited to elections but may include scandals, crises, or terrorist attacks.
Meso Level: Context
The hypothesis stated above opens up one further important question. Regardless of whether incivility peaks at certain times or whether it is a constant state of affairs, are there specific debates or events that lead to larger outbursts of vitriolic reactions? The interactive communication structure of Twitter, in combination with the status and individual characteristics of the political actor tweeting, may imply a similar dynamic when it comes to how—and which—issues drive incivility. This means that incivility toward politicians may be conditioned by contextual factors not limited to electoral campaigns, but including topics such as corruption scandals, economic crises, terrorist attacks, speeches, vote on a specific legislation, and televised addresses or debates.
Previous research on uncivil comments in online newspaper articles by Coe et al. (2014) found that incivility is associated with contextual factors, such as the topic of the article. Specifically, “hard news” issues such as the economy, law, politics, taxes, and foreign affairs tend to receive far more uncivil comments than issues such as health, lifestyle, and journalism (Coe et al., 2014, p. 669). Similarly, research on Twitter communication shows that conversations about “hard” political issues are more polarized discussions (Barberá et al., 2015) which may also lead to, overall, more heated discussions.
The role of the context within which incivility takes place is located at the meso level of our conceptual framework, as it represents distinct, short- or long-lived events within a given broader temporal frame. Thus, building previous work by Coe et al. (2014) and Barberá et al. (2015), our second hypothesis is as follows:
Examining contextual factors driving incivility is important from two points of view. First, it can allow insights into which topics may be alerting politicians about possible forthcoming waves of incivility and how they modify their behavior accordingly (i.e., not engaging, or retreating after being harassed) to avoid a communication crisis. Second, it enables us to study whether high levels of incivility when engaging with users about a certain topic lead to discouragement from engaging with that topic in the future. Aside from the dangers embedded in the coarsening of democratic discourse and the normalization of incivility, such a development would imply the impoverishment of the debate through the avoidance of “unsafe” topics—a well-documented pitfall of choosing not to engage in conflict in democratic deliberation literature (Papacharissi, 2004). A direct result is that less information will be communicated to citizens via this avenue, and there will be more hesitance from the politicians to address potentially important issues.
Micro Level: Coordination and Prejudice
Although some have argued that minorities and other vulnerable populations are often subjected to consistent online harassment, aside from cases that have attracted media attention, there is little research on whether incivility toward a politician has a limited time span or whether it tends to persist across a longer period of time. Furthermore, if incivility toward politicians on Twitter is event-driven, then one might be led to believe that users who engage in this behavior are frustrated citizens who, motivated by the particular event, decide to channel their frustration on Twitter in the form of a one-off aspersion. If this is the case, then incivility will be a sporadic type of behavior, carried out occasionally by different users.
This would be consistent with some of the previous work on online forums by Coe et al. (2014), which examined whether incivility is the purview of frequent or occasional commentators and found the second to be more uncivil. Similarly, McLoughlin and Ward (2017) found that the users who had sent abusive tweets to British MPs in their data set were not “serial transgressors” (or groups of such) but rather that abuse was distributed across a large number of users (p. 16). Twitter’s communication structure and affordances make it an ideal place for engaging in sporadically uncivil behavior. The possibility of remaining anonymous, along with the low-cost of targeting someone, makes it an appealing platform for a spontaneous, hit-and-run type of behavior encouraged possibly by the feeling of it being easier to hide in the crowd (Oz et al., 2018). Yet research on trolling subcultures has shown that organized trolling is a very real—and not new—phenomenon (Phillips, 2016), while much less attention has been paid to the more recent development of nonhuman (i.e., bot) abuse. Moving to the micro level with the aim to better understand the logic and longevity of incivility on the basis of who and how is engaging in this type of behavior, we hypothesize the following:
Although the previous hypothesis reveals some of the mechanics of user-initiated incivility on Twitter, they convey little information about potential motives behind uncivil behavior toward politicians using the platform. Although it is unlikely that any approach can offer firm evidence as to what one’s motives are for targeting a politician, studying the characteristics of the victims can provide information with regard to whether prejudice toward a subgroup with common characteristics might be one of the driving forces of incivility. Here, we focus on gender-based incivility, although this is only one of the many subgroups that incivility has been found to be directed toward, with ethnic, ideology-based, and racial harassment also being common (Munger, 2016).
Research on gendered coverage has found that electoral candidates are often portrayed in terms of long-standing gender stereotypes, and that gender differences have important consequences for voters’ perceptions, especially of female contenders (Kittilson & Fridkin, 2008). In light of the bulk of previous work showing that media coverage further raises the barriers faced by female contenders seeking office (for an overview, see Lühiste & Banducci, 2016), one would think that social media could be an empowering communication tool giving women a more independent and bias-free avenue of communication. Yet, a 2016 study commissioned by
Investigating whether prejudice is the basis of a uncivil behavior is particularly critical from a democratic point of view, as harassment on the basis of individual characteristics (e.g., gender, race) not only implies a stark disrespect toward collective democratic traditions (Papacharissi, 2004, p. 260), but it is also a type of behavior that is very difficult to deal with. At the same time, as other studies have shown, it is also a type of behavior that tends to have poisonous effects on the victims, including making them more anxious for their safety and demobilizing them (Hinduja & Patchin, 2007; Munger, 2016). Jess Philips, the Labour MP quoted at the beginning of this article, is just one of the many examples who, after being harassed on the basis of her gender, felt that she would be better off if she quit Twitter or if she reduced her general use of the platform. Based on these considerations, we hypothesize the following:
Data Collection and Case Selection
We study incivility from the multilevel perspective derived from our conceptual framework by analyzing a large-scale longitudinal Twitter data collection that contains public responses to messages shared by Members of Congress in the United States.
To initiate our data collection, we relied on the list of Twitter accounts of Members of Congress elected to the 115th Congress (2017–2018) available in the
Our full data set contains a total of 16,002,098 tweets mentioning a Member of Congress from October 17, 2016, until December 13, 2017. These tweets can be divided into two categories: 11,222,146
This data set covers a long time span, which is ideal for our temporal investigation, while at the same time including a series of highly politicized events for our event-driven hypotheses, such as the presidential election in November 2016 or the beginning of the Russia investigation in May 2017. As a result, we believe it presents an accurate reflection of periods of both stability and heated political competition and allows us to conduct various investigations on the temporal dynamics of incivility from the macro to micro level.
Method
We test our four hypotheses using a variety of metrics computed using automated text analysis methods. As we explain in greater detail below, we detect incivility in tweets using supervised learning methods, which extrapolate human coding on a random sample of tweets to the entire data set by identifying words that tend to be associated with uncivil responses. To classify tweets by politicians into different types of topics, we instead use a topic model, which gives us greater flexibility in contexts where multiple topics may be relevant. We measure coordinated incivility campaigns by computing the degree to which uncivil tweets are concentrated on a small subset of users using the Gini index. Finally, our examination of who is the target of incivility relies on covariates at the legislator level.
Automatic Detection of Incivility in Social Media Posts
In our analysis, we apply supervised machine learning to detect incivility in users’ tweets. This method relies on a training sample of 4,000 tweets, randomly sampled from our full data set, which were annotated manually by humans along our dimension of interest. For this labeling task, the coders were selected through CrowdFlower (now called Figure Eight), an online platform for crowd-coding that has been found to yield accurate results with high intercoder reliability (Benoit et al., 2016).
For each of the tweets, we asked the coders to label it using a coding scheme inspired by the codebook developed by Theocharis et al. (2016). We consider the following two categories:
Following standard best practices for crowd-coding (Benoit et al., 2016), we used CrowdFlower’s
Note: we understand that civility can be subjective. Here we are looking not for your opinion, but rather what you think most people would respond in this situation. For example, some people may not find the word “weirdo” offensive, but generally it is considered impolite when it is used as an insult.
In our training set, 26% of tweets were labeled as uncivil.
Using our training data set, we then built supervised machine learning classifiers to predict the probability that each individual tweet is uncivil (as opposed to civil). As our initial attempts led to low performance, we increased the size of our training set by using synthetic labels for an additional set of 16,000 tweets using Google’s Perspective API. 2 We trained a logistic regression with L1 regularization, also known as lasso (Friedman et al., 2001), using stemmed unigrams as features. We used fivefold cross-validation to identify the penalty parameter that maximizes in-sample performance.
By leaving out of the estimation a random sample of tweets that corresponds to 20% of our data set, and then assessing how well our classifiers perform on this “test set,” we are able to evaluate the performance of our classifiers. We find that the overall accuracy is 90%, with precision and recall on the “civil” category being 92% and 95%, and precision and recall in the “uncivil” category being 73% and 61%. 3 Although the somewhat low recall on the uncivil category indicates that we may be underestimating the prevalence of incivility, note that this level of performance is typical in past studies that use machine learning to detect incivility (Davidson et al., 2017), given the inherent vagueness of this term and the heterogeneity of the language that can be used to express. However, overall these metrics clearly indicate that our method approximates the quality of human coding, which, as noted above, we estimated to be 89% based on the average intercoder agreement among our coders.
As an additional validation step for our measure of incivility, we also identified the top predictive n-grams for each category (see Table 1). As we expected, the classifier will predict as
Top Predictive Unigrams for Each Category.
Topic Modeling of Legislators’ Tweets
We used a topic model to estimate the policy issue or political event that characterizes the content of each of the 155,540 tweets by Members of Congress published during our period of analysis. In particular, following Barberá et al. (2019), we fit a Latent Dirichlet Allocation model (Blei et al., 2003) with documents that aggregate all tweets by legislators for each party, day, and chamber. To identify the most appropriate number of topics (
As we show in Figure 2, a value of

Choosing the right number of topics.
Metrics of Coordinated Activity
Our metric of
Results
Figure 3 offers empirical evidence regarding the temporality of incivility on social media (Hypothesis 1). Here, we show that the level of vitriol addressed to Members of Congress on Twitter is relatively stable over time: Our machine learning model predicts that between 15 and 20% of tweets every day meet our definition of incivility.

Incivility in tweets mentioning U.S. legislators over time.
However, the figure also reveals clear spikes of vitriol that are correlated over time, which suggests that specific events or policy debates are likely to spark cycles of higher incivility. We see one of those spikes in the month leading to the 2016 election. Based on an examination of a random sample of tweets during this period, it seems the increase is due to responses to anti-Clinton messages by Members of Congress such as Joe Walsh and Jason Chaffetz. Two subsequent spikes, in March and May 2017, can be clearly linked to the debate about the repeal of the Affordable Care Act and the new health care bill, which eventually failed. Another important spike, in August 2017, appears to correspond to reactions to the White supremacist rally in Charlottesville and the subsequent reactions to it by legislators. We observe one last spike in October 2017, which overlaps with the revelation by Florida congresswoman Frederica S. Wilson that President Trump disrespected the family of a U.S. soldier killed in an attack in Niger.
In conclusion, we find mixed evidence in support of Hypothesis 1: Although incivility is always prevalent on Twitter, we also find highly communicative events that lead to significant increases in vitriol and attack targeting politicians.
Our second hypothesis (Hypothesis 2) explores variation at the meso level, in response to the content of the messages that politicians send. Figure 4 summarizes the analysis we conducted to test this hypothesis. For each of the 45 political topics we identified in legislators’ tweets, we fit a linear regression model where the dependent variable is the proportion of uncivil responses that legislators received to each of their tweets, and the independent variable corresponds to the probability that a given tweet falls into the topic in consideration. In our models, we also control for the chamber and party of the politician as well as for the total number of replies that each tweet received. Figure 4 reports only the coefficient for the topic effect from each of the 45 separate regressions. In other words, the coefficients here can be interpreted as the predicted increase in the proportion of replies to a given tweet that are uncivil if that tweet is completely about a particular topic.

Incivility in replies to legislators’ tweets, depending on topic of root tweet.
This analysis reveals large differences in incivility across topics. Contrary to our expectations, when we manually divide the topics into policy issues (health care, immigration, gun control, taxes, etc.) and political events and scandals (Trump’s inauguration, the Clinton email scandal, Supreme Court nominations, hurricanes, etc.), we find that political topics lead to a somewhat larger increase in incivility than policy topics—on average, the coefficient for the former is .16 but .06 for the latter. These results are also consistent with our earlier finding regarding spikes on incivility: Here, we also find that tweets about the Charlottesville White nationalist rally sparked the most uncivil responses.
Turning to the micro level, we were also interested in examining whether uncivil tweeting is carried out by a small minority of users (Hypothesis 3). As it is usually the case on social media, we do find that this holds here too: The 1% (10%) most active users are responsible for 20% (56%) of all uncivil tweets. The Gini coefficient for the entire distribution is .67. However, this level of coordination is actually smaller than if we look at the full data set of tweets mentioning politicians, where the top 1% (10%) most active users produce 29% (69%) of all tweets and the Gini coefficient is .77.
Figure 5 visualizes the full distribution of users and tweets they produce using a Lorenz curve. Overall, this suggests that even if it is indeed the case that a small number of users are responsible for most of the content, this level of concentration is not higher than for other types of tweets. In fact, up to 38% of all the unique users who tweeted at least once retweeting or mentioning the name of a politician (648,690 out of 1,686,540 users) sent at least one tweet that our classifier predicted was uncivil. In other words, consistent with previous findings by McLoughlin and Ward (2017), we do not find that organized trolls or “serial transgressors” are responsible for most of the abuse; on the contrary, uncivil behavior seems to be common among many frustrated citizens who take to Twitter to angrily criticize and insult politicians.

Lorenz curves: Inequality in production of uncivil tweets versus all tweets.
However, that does not mean that trolling campaigns do not exist. When we measure coordination at the day level using the Gini coefficient for the distribution of users and tweets sent each day, shown in Figure 6, we do find clear variation over time. The correlation between this time series and the proportion of uncivil tweets at the day level (in Figure 3) is .26, which suggests that part of the spikes we described earlier could be due to an increase in the level of coordination among users. And indeed, some of the days with the highest values on our coordination metric coincide with the same set of external events we identified as motivating spikes in incivility: for example, the attempted repeal of Obamacare in early March and early May 2017 or the reactions to the Charlottesville White supremacist rally in August 2017. This means that even if incivility is a constant presence on political tweets, we do find that its prevalence appears to be due to coordinated campaigns taking place at specific periods of time.

Coordination of incivility in tweets mentioning U.S. legislators over time.
To further explore the motives for these coordinated campaigns, we now turn our unit of analysis from the individual tweet to the target of the tweet and, in particular, the legislator that is mentioned on it. We will try to identify which characteristics predict who is most likely to be receiving uncivil tweets. Table 2 reports the results of this analysis. Here, we show results of multivariate regression models where the dependent variable is the percentage of uncivil tweets addressed to each of the Members of U.S. Congress in our sample (Models 1 and 2) or the level of coordination in those tweets, measured as the Gini coefficient (multiplied by 100 to facilitate interpretation) for the distribution of users and uncivil tweets sent (Models 3 and 4). The main independent variables measure different characteristics of each politician: gender, type (senator or representative), party affiliation, and extremism, which we measure as the absolute value of DW-NOMINATE (a metric of ideology estimated using roll-call votes in Congress; see Lewis et al., 2018). We also control for the (logged) number of tweets mentioning each of these politicians in our sample and, in our second set of models, for the average incivility in the tweets addressed to each politician (the dependent variable in the first two models).
OLS Regressions of Incivility and Coordination on Legislators’ Characteristics.
Significant at
We do not find clear evidence supporting our hypothesis stating that female politicians were more likely than men to receive uncivil comments (Hypothesis 4). The sign of the coefficient for gender is in the expected direction in Models 1 and 3, but it is not statistically significant in any of the models.
In contrast, chamber and ideological position appear to have large effects on the overall level of incivility received. Members of the House of Representatives who adopt a more extreme ideological position are much more likely to receive uncivil comments. To illustrate the substantive magnitude of this effect, note that the predicted difference between Senators with an extremism score of 0 and Representatives with an extremism score of 1 is 7.3 percentage points (12.3 vs. 4.8 percentage points). This is a substantively large effect that represents approximately one third of the average proportion of incivility at the day level. When it comes to our regression models for coordination, we find that only party identification is a significant predictor: Republican legislators appear to be more likely to be the subject of coordinated attacks on Twitter.
Discussion
Online incivility and harassment in political communication are a growing concern among politicians, journalists, and academics. This article provides a descriptive account of how incivility evolves over time, who is the target of uncivil comments on Twitter, and whether being a target is correlated with individual-level characteristics. We relied on a combination of supervised and unsupervised machine learning methods to examine these questions.
Our empirical findings illuminate past unresolved debates in the literature and also challenge the conventional wisdom within this field. We find that the prevalence of incivility in social media conversations that involve politicians is relatively constant over time, although we also find spikes that correspond to external events. These spikes tend to overlap with the emergence of controversial issues or events that spark outrage and lead ordinary citizens to address vitriolic messages to their representatives. This is in line with previous studies stressing the role of political expression on social media in the influence of temporal dynamics (Jungherr, 2014). Contrary to the conventional wisdom in media accounts and the academic literature, our analysis suggests that uncivil tweets can be sent by any user and not only “professional trolls” who engage in coordinated action (although we also find evidence of such attacks in our analysis). This finding has important implications when it comes to addressing the issue of coordinated attacks and bot inferences in political debates, as it makes it unclear whether measures toward reducing such threats can do much in addressing the issue of toxic language. Finally, we do find variation regarding which politicians are more likely to be victims of attacks, although gender does not have a statistically significant effect.
Although our analysis offers what we believe is the most systematic descriptive evidence of incivility on Twitter, we also caution readers that it may not be without limitations. First, our finding regarding the lack of differences across gender groups might be partially due to measurement error—probably because we have more male politicians than female politicians in our training set, the performance of our classifier of uncivil tweets is somewhat lower when the tweet is addressed to female politicians (90% vs. 88% accuracy; 62% vs. 54% recall on the uncivil category). Performance may also be lower if the insults used to attack female politicians are qualitatively different than those used to attack male politicians, which is likely to be the case. We address this limitation in a follow-up project where we code an equal number of tweets targeting each gender group (Theocharis et al., 2018).
And second, we caution against generalizing our findings to other cases or periods. For example, it is possible that the differences across parties are due to the current political context in the United States, with a Republican president and a Republican majority in both House and Senate. We would need to replicate our analysis with past or future data to determine whether Democrats in Congress may be subject to higher levels of incivility if they held power. Similarly, it is hard to tell how these findings may be different in other countries. Theocharis et al. (2016) found significant differences in the level of incivility on Twitter across four different European countries. It is likely that some of the patterns identified here, such as the variation in the level of coordination or regarding who is the target of harassment, are also different in other contexts.
Despite these caveats, we claim that our analysis offers an important account of the dynamics of incivility toward politicians on social media. As citizens increasingly rely on these platforms to consume political news and to engage in conversations about politics, we believe that understanding the determinants of the high levels of toxicity in these sites is a necessary first step toward design changes that may turn social media platforms into public spaces for deliberation where everyone feels free to intervene and share their opinion.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
