Abstract
In online text-based discussions, people behave less diplomatically because they are more outspoken and less responsive. This can feed
Concerns about polarization within societies are mounting worldwide (Settle, 2018). Online discussions are often blamed for this because they contain rude, demeaning, and less-than-democratic comments (Coe et al., 2014; Hlavach and Freivogel, 2011), and because they might descend into conflict and polarization more easily than discussions held face-to-face (FtF) (Anderson et al., 2014). Consequently, there has been a lot of speculation about, and research into, the differences between online and FtF discussions that might help explain tendencies for conflict and polarization. Several prominent explanations focus on the characteristics of the online medium and the psychological changes this gives rise to. For example, anonymity has been argued to lead to reduced social presence (Cortese and Seo, 2012; Short et al., 1976), a state of deindividuation (Lowry et al., 2016; Sproull and Kiesler, 1986), or a reduced accountability (Santana, 2014; Suler, 2004). Alternatively, anonymity could also lead to increased group identification (Bae, 2016; Postmes et al., 1998) or more self-awareness (Nielsen, 2017). All of these have been associated with changes in the adherence to social norms (Huang and Li, 2016; Lea et al., 1992).
Recent research took a different approach by investigating what happens within the
However, disentangling the role of the above-mentioned factors in polarization is inherently problematic because online and FtF media differ in many ways. The first aim of this article is therefore to develop a new methodology for directly comparing discussions via different media that removes the medium as explanatory variable. By reproducing text-based online discussions in an FtF format, and reproducing FtF conversations in a text-based chat format, we aim to gain insight into the differences between online and FtF discussions. We study this qualitatively, by analyzing the challenges encountered in the process of translating discussions from one medium to the other, and quantitively, by examining how these reproductions are perceived by outside observers. The second aim of this article is to use this methodology to empirically test the role of the diplomatic behaviors uncovered in earlier research in explaining differences in perceived polarization between media.
Online polarization
Political scientists find that politics are more and more central to people’s identity (Mason, 2018). As a result of intergroup social identity dynamics, political outgroups are increasingly hostile (Iyengar et al., 2012; Turner, 1991) and affective polarization occurs (Iyengar and Westwood, 2014). According to Settle (2018), social media make political and affective polarization personal: social media such as Facebook reinforce motivated identity processes, for example, by highlighting the visibility of political conflict (intergroup threat) and by rewarding extreme utterances (reputation management). Because this occurs on social media where posts are seen by “friends”, political position and polarization make their way into social networks.
In the present article we zoom in on microdynamics: what happens
Online disinhibition
Online conversation is in many respects very different from natural FtF interaction. Most research has focused on the following distinguishing features: it is mostly a- or semi-synchronous, poor in subtle social cues, and often anonymous. These characteristics have been connected to the proneness of online interactions to polarization (Lin, 2009; Stromer-Galley and Wichowski, 2011; Suler, 2004). Most prominently, it is assumed that anonymity makes people feel more distant from others (i.e. decreased social presence, Short et al., 1976), which reduces their concerns for being positively evaluated, and motivates them to act with less inhibitions (i.e. online disinhibition, Sproull and Kiesler, 1986; Suler, 2004). In the context of contentious discussions, this disinhibition can take a toxic form, resulting in hostile and aggressive online behavior, like rude language and threats (often referred to as flaming, Lapidot-Lefler and Barak, 2012).
The evidence supporting this theory is mixed, however (Clark-Gordon et al., 2019; Lea et al., 1992; Papacharissi, 2004). While the online disinhibition thesis still receives wide support (Asker and Dinas, 2019; Lapidot-Lefler and Barak, 2012; Lowry et al., 2016), some research points to the contrary: by redirecting people’s attention from their individual identity to their social group membership, anonymity makes people
Diplomatic behaviors FtF
This research is based on a growing literature showing that FtF interaction partners often rely, consciously or not, on everyday diplomatic skills to communicate and infer the status of their relationship (e.g. Koudenburg et al., 2017; Reis and Clark, 2013). A first important skill is
The second diplomatic skill of interest is
Diplomatic behaviors online
The characteristics of text-based online media could make the enactment of these diplomatic skills challenging. Roos et al. (2020) argued that the reduced synchronicity will make online conversation less responsive (i.e. comments appearing with delays and sometimes out of order), and the relative lack of subtle social cues will make online expression less ambiguous (more clear and outspoken). This could make it harder to prevent disagreements from escalating online. Indeed, comparing the way in which interaction partners handled disagreements online and FtF, the reduced responsiveness and ambiguity in text-based online discussions explained reductions in experienced shared cognition and solidarity (Roos et al., 2020). Interestingly, these studies did not find any more or less expressed disagreement or disinhibited behavior in the online chats. This suggests that, although
Varying responsiveness and ambiguity within media
To summarize, these studies suggest that the use of everyday diplomatic skills (responsiveness and ambiguity) reduces the risk of the escalation of disagreement, and that online interactions can be more prone to polarization where these diplomatic behaviors are limited. However, this research has one intrinsic limitation that means that alternative explanations cannot be ruled out. Because the studies compare two very different communication media, other media characteristics that have also been associated with polarization may play a role. There could have been residual differences in anonymity, social presence, and so on. Although the prior research attempted to control for such confounding variables, the only way to rule out alternative explanations conclusively is to replicate the effects within the same communication medium.
Therefore, in an attempt to replicate and extend this prior research (Bonett, 2012), the present studies try to vary the conversational use of everyday diplomatic skills
In Study 1a, we asked a group of actors to (re-)enact text-based online and FtF controversial small group discussions. The actors were interviewed about their (re-)enactment experiences. In Study 1b, we tested the hypotheses (see below) by asking individual participants to rate the videoclips of these (re-)enactments on perceived responsiveness and clarity (vs ambiguity). In order to assess the social attributions following from observing these diplomatic behaviors, we also asked participants to rate the degree of shared cognition (the absence of which suggests there is perceived disagreement and lack of mutual understanding; perceived polarization) and solidarity (the absence of which suggests there is a perceived lack of good social relationships) among interaction partners. In Study 2, we took the complementary approach by asking another sample to evaluate the
Hypotheses
In line with Roos et al. (2020), we expect that, irrespective of the communication medium format (i.e. FtF or online) in which it is perceived, participants will evaluate the original chats as less responsive (Hypothesis 1a) and more clear (Hypothesis 1b) compared to the original FtF discussions. We further hypothesize that, irrespective of the communication medium format, participants will think interaction partners in the original chats experience less shared cognition (i.e. more polarization, Hypothesis 2a) and less solidarity (Hypothesis 2b). We further hypothesize a causal relationship between these factors, such that a reduced responsiveness and increased clarity will lead participants to conclude that interaction partners experience less shared cognition and less solidarity (Hypothesis 3a and 3b).
Alternative predictions can also be made. It seems reasonable to expect that the fit between diplomatic behaviors and medium format might play a role, in line with Expectancy Violations Theory (EVT, Burgoon and Hale, 1988; Burgoon and Walther, 1990). EVT proposes that people enter interactions with certain expectations concerning behavior, and that these expectations are shaped, among others, by communication format. People are very sensitive to violations of these expectations, and attempt to interpret and evaluate these violations as soon as they occur. Accordingly, based on prior experiences, observers might expect to see a lack of responsiveness and an abundance of clarity and outspokenness in online chats, and consider this normal and appropriate. The reverse might apply to the FtF format: here, observers would expect more responsiveness and more ambiguity. But when their expectations concerning diplomatic behaviors are violated, that is, when online conversation is responsive and ambiguous or when FtF discussion is not responsive and clear, observers might conclude that interaction partners are more polarized.
General method
As the methods of the two studies overlap considerably, we will first describe the procedures that apply to both studies. The methodological specifics of each study will then be presented together with the results. Both studies had a within-subject design with two conditions representing different communication media sources: text-based online chat and FtF discussion. In Study 1, each participant saw one online chat and one FtF conversation in a FtF format (in randomized order: 57 first saw a FtF and 45 first saw an online discussion), and in Study 2, each participant read one online chat and one FtF conversation in an online chat format (in randomized order: 51 first saw a FtF and 52 first saw an online discussion).
Power and participant recruitment 1
A repeated measures within factors power calculation, with an expected effect size of
Stimulus materials and procedures
The stimulus materials came from a previous study in which groups of four unacquainted Dutch students discussed controversial statements: one via a text-based online chat and one FtF (Roos et al., 2020). Unknown to participants, the fourth group member was a confederate who expressed controversial views. Specifically, she spoke or wrote two scripted sentences that represented a right-wing position. To allow for a natural conversation, the remainder of the discussion was unscripted, but the confederate was instructed to stick to her position. The text of the online discussions was saved and the FtF discussions were voice-recorded. From the resulting dataset, we selected the online chat and FtF conversations of two groups: four discussions in total. 2 Similar, and equivalently controversial, statements were covered in both conditions. 3
In Study 1, the selected discussions were (re-)enacted by a group of actors in front of a camera. In Study 2, the same discussions were transcribed into an online chat format. The specific procedures of (re-)enacting and transcribing will be outlined below. The resulting videoclips and texts were embedded in an online questionnaire so that each videoclip or text was followed by the same block of questions.
Participants were unaware that the videos and texts originated from online chat and FtF interactions: participants were simply told that they would see or read two conversations about controversial topics between unacquainted students. However, in Study 1 both discussions were presented in an FtF format (i.e. in a video-recording), and in Study 2 both discussions were presented in a typical text-based chat layout.
Dependent measures 4
To measure perceived responsiveness, we adapted
5
the scale of Roos et al. (2020). Participants indicated how often (1 =
We assessed perceived shared cognition with a three-item scale adapted from Koudenburg et al., (2013a), for example, “During this conversation, I felt that the group members understood each other” (1 =
Statistical analyses
The intraclass correlations ranged between .00 and .37 for the repeated measures of the dependent variables, suggesting clustering of scores within participants (Bliese, 2000). To account for this dependence of observations, the data were analyzed in multilevel models, where condition (Level 1) was nested in participants (Level 2). We analyzed the data with the lmer function in R (lme4, version 1.1-9, Bates et al., 2019). To test the effects of condition on perceived diplomatic behavior and perceived shared cognition and solidarity (Hypotheses 1 and 2), we compared the fit of the multilevel models that included only the random effect of participants with the equivalent models that added condition as fixed effect predictor. 9 We used the emmeans package (version 1.4., Lenth et al., 2019) to estimate the condition means and confidence intervals. We tested the causal relations (Hypotheses 3) in two multilevel models with either shared cognition or solidarity as outcome variable, and both responsiveness and clarity as fixed effect predictors, nested in the random factor of participants.
Study 1a: performing the (re-)enactments
Study 1 consisted of two parts. Study 1a, the qualitative and more exploratory part, covered actors’ experiences through interviews before, during, and after the (re-)enactment session. Study 1b, the quantitative and hypotheses-testing part, covered observers’ perceptions of these (re-)enactments as measured by the questionnaire.
Method study 1a
Procedure and stimuli
As input material for the (re-)enactments, we retained the text-based online chats in their original state and transcribed the FtF voice-recordings in detail. We invited four young female actors from a Dutch theater group whose gender and approximate age matched the original student groups that produced the scripts (as well as the majority of the intended participant population). The (re-)enactments took place in two sessions.
In the first session, the actors were presented with the four transcripts, plus the voice-recordings of the FtF discussions, and asked how they would convert these into theatrical performances. We gave the actors a lot of freedom because we expected this would provide us with unique information. Actors are used to translating text-based scripts into FtF performances; a process in which many decisions have to be made. The peculiarities of each form of conversation might become apparent through these decisions and the difficulties encountered in their execution.
The actors wanted to learn by heart the entire chat scripts and the parts of the (relatively long) FtF conversations around the two standardized sentences of the confederate. They rotated the role of confederate among themselves. The actors decided to emphasize the differences they observed between the FtF and the online chat interactions as well as between the (in their opinion) outspoken confederate and tentative naive participants. The actors further decided to improvise the FtF filler words (e.g. “hmm,” “yes”) instead of keeping strictly to the scripted ones. For the online chats, the actors decided to speak out abbreviations completely, and to verbalize the typos and emoji.
The second session took place three weeks later. The actors performed the (re-)enactments seated on adjacent sides of a table in front of a camera. We edited the resulting videos to be of equal length (approximately five minutes). We selected the parts of the (re-)enactments that included the two standardized sentences of the confederate. 10
Qualitative measures
During both sessions, we conducted semi-structured interviews in which we asked the actors about their (re-)enactment experiences and the differences they noticed between the online chat and FtF conversations. Moreover, after watching the resulting videoclips, the actors were asked about their perceptions and experiences via email. We performed an inductive thematic analysis on the actors’ interview responses (see Miles and Huberman, 1994). This means that the content of their answers was analyzed to extract recurrent themes. We only extracted themes that all actors agreed on.
Results study 1a
The thematic analysis revealed three themes, which will be described below with illustrative quotes.
(Re-)Enactment challenges
Interestingly, the actors commented not just on the difficulties of enacting the chats. They also talked about the challenges they encountered while re-enacting the FtF conversations. The quotes below illustrate the complexity for the actors. 11
Q1: “It [enacting face-to-face and chat discussions] was just very different from acting a regular scene from a play.” (Interview 2, Actor 1)
Q2: “A face-to-face conversation feels very uncomfortable and does not always run smoothly either, just like the chat, but only few punches are thrown.” (Interview 2, Actor 1)
The actors experienced both the chat and the FtF (re-)enactments as very different from playing a regular theater script (Q1). In fact, much of the interviews revolved around the dissection of exactly what made the (re-)enactments so challenging. In both online and FtF discussions, the actors noticed discomfort evidenced by a lack of smooth flow, but the way in which they attributed this was medium dependent (Q2): disagreement leads to unpleasant conversations, no matter in which medium it is encountered, but only the interaction partners in the chat discussions confronted this aggressively. In FtF discussions, people refrained from seeking confrontations (“only few punches are thrown”), but clearly showed their discomfort while doing so. The latter might fulfill a social function by communicating tact and good intentions (cf. politeness, Brown and Levinson, 1987). It thus appears that the factors making (re-)enactments challenging differ between media. The two other themes uncover these different factors.
Empty phrases
The main challenge encountered in the FtF re-enactments were the many “empty” or “content-less” phrases and utterances, as the actors called them, such as “uhm” and “or so,” that were absent in the online chats. The following quotes illustrate how the actors saw this difference between media and its consequences.
Q3: “All the statements in the chat conversations are clear, the sentences are clear, while the face-to-face conversations are often more nuanced through the use of words like ‘say’, ‘or something’, ‘I think’.” (Interview 3, Actor 1)
Q4: “The word choice and the doubtful replies in the face-to-face conversations show that people are probing what they can and cannot say in this group. Judging by their replies, the chatters seem to take that less into account.” (Interview 3, Actor 1)
Q5: “I think that the people in the face-to-face conversation might have felt quite tense. I do not think they knew each other. There is always a social aspect involved: you want to be liked or maintain a good atmosphere. That results in very tentative expressions of opinions. The chat is safer, because you do not see each other. It is easier to simply say what you think. I think that those people felt more at ease.” (Interview 3, Actor 2)
Q6: “Speaking in the chat is a lot less nuanced and takes on the form of rounded ‘statements.’ These do not connect as well as expressions in a face-to-face conversation. However, it also struck me that those without an extreme opinion in the face-to-face discussion had some difficulty with formulating their thoughts. A lot of words were spoken, but no clear point was made. That made the discussion woolly and more difficult to enact.” (Interview 3, Actor 1)
To the actors, the chat conversations seemed much more clear, efficient, and to-the-point than the FtF conversations, in which people expressed themselves rather ambiguously and used many more words than necessary. Quotes 4 to 6 show that these differences shaped the actors’ impressions of interaction partners’ intentions. The actors thought the interaction partners in the FtF discussions were tentatively exploring what was deemed acceptable conduct in order “to be liked or maintain a good atmosphere” (Q4 and 5). Thus, FtF people seemed to be more concerned with each other’s opinion than with their own. On the contrary, actors thought that the interaction partners in the online chats did not care much about each other’s opinion, and, consequently, were less afraid for social rejection and less reluctant to share their opinion. Another reason the actors mentioned for the apparent ease with which online interaction partners expressed themselves was being convinced and holding a clear opinion. Conversely, the empty phrases in the FtF conversations were interpreted as a process of formulating thoughts and therefore indicative of a lack of an “extreme opinion” (Q6). Interestingly, the undecided “wooliness” of the FtF discussions made these harder to re-enact. The actors found it easier to remember their lines when their characters make a clear point. Indeed, the actors tried to define their character based on his or her statements, which were rather ambiguous in the FtF discussions.
Action–reaction
In enacting the online chats, the actors especially noticed the difficulty they had in suppressing their natural tendency to respond to each other’s comments, a challenge that was absent in the FtF re-enactments. The quotes below illustrate how the actors experienced this difference between media and its consequences.
Q7: “In acting, and in normal conversations, you want to be able to react to each other. Action-reaction. That was possible in the face-to-face conversations. In the chats that was difficult.” (Interview 3, Actor 2)
Q8: “If you concentrate on what is actually being said, you notice that the chat is like a weird robot interaction, as if people are not really listening to each other. The face-to-face conversation comes across as being a lot more natural! That kind of surprises me, it was difficult for us to incorporate so many ‘loose’ words but it worked and that is nice to see.” (Interview 3, Actor 1)
Q9: “People tried to maintain a good atmosphere in the face-to-face conversations. They also tried to listen to each other and understand each other, what happened less in the chats. There [in the chats], people sometimes completely talked past each other because everyone was mainly concerned with sharing his/her own opinion. The atmosphere was virtually unimportant in the chats.” (Interview 3, Actor 2)
The actors thought that the increased to-the-point-ness in the online discussions could explain why interaction partners reacted to each other less (Q6). Actors repeatedly characterized the online chat as a “weird robot interaction” (Q8). Just like robots, interaction partners appeared unaware of each other’s presence: rather than listening to each other, they seemed to be more concerned with venting their own opinion. This resulted in a disjointed conversation where interaction partners appeared to be unconcerned with, and even dislike, each other (Q9). Conversely, the actors thought the many empty phrases in the FtF discussions promoted natural action–reaction sequences (Q8). Actors saw this responsiveness as a sign of interaction partners being concerned with each other and motivated to maintain good relations (Q9). Note the overlap with the impressions resulting from the (lack of) empty phrases.
Connecting empathy phrases with action–reaction
To conclude, both the online chats and the FtF discussions were challenging to (re-)enact, but for different reasons: the FtF discussions contained many “empty” phrases and the online chats were lacking smooth action–reaction sequences. Interestingly, these characteristics appear tightly interconnected: the clear to-the-point chat sentences do not connect well, and the empty FtF phrases promote smooth action–reaction sequences. Apparently, even though both felt awkward to the actors, the latter combination resulted in a more natural conversation. Furthermore, both characteristics contributed to the actors’ perceptions of the way interaction partners related to each other. The online combination of clear and disjointed sentences communicates that interaction partners care more about their own opinion than about each other more about their own opinion than about each others’s feelings. Conversely, the FtF combination of empty phrases and coherent action–reaction signals that interaction partners prioritize their relationship.
Study 1b: observing the (re-)enactments
Method study 1b
Participants and design
A sample of 102 participants (
Results Study 1b 13
We found medium to large condition effects on the four dependent variables supporting Hypotheses 1 and 2 (Table 1). Specifically, participants felt interaction partners were less responsive (
For each dependent variable measured in Study 1b, the test results of the difference between conditions (chi-square test and effect size), and the means with 95% confidence intervals per condition.
Note. ***
Cohen’s
The results of the regression analyses are shown in Figure 1. In line with Hypothesis 3, responsiveness positively predicted perceived shared cognition and solidarity (both

Repeated measures regressions where perceived responsiveness and clarity predict perceived shared cognition and solidarity in Study 1b (unstandardized betas are reported). The repeated measures correlations between responsiveness and clarity, and between shared cognition and solidarity are also shown.
Discussion Study 1
In this study, discussions that were originally held via text-based chat or FtF were all (re-)enacted in an FtF setting. Observers (and actors) noticed the differences between conditions as predicted. The original online discussions were considered less responsive and clearer than the original FtF discussions. Furthermore, to observers (and actors), online interaction partners seemed to be more concerned with clearly venting their own opinion than with listening to each other and therefore seemed to disagree more and experience less solidarity. The results are in line with the hypothesis that regardless of the medium, responsiveness and ambiguity in controversial discussion are seen as indicators of agreement and good social relations. However, these findings might instead be explained by a naturally better fit between these diplomatic behaviors and the FtF medium. To test whether the observed effects are truly independent of the medium, we ran a second study in which we assessed how observers perceive these diplomatic behaviors in a text-based online chat.
Study 2: observing the transcripts
Method Study 2
Participants and design
In Study 2, we presented another sample of participants with the textual versions of the chat and FtF discussions used in Study 1. We transcribed the parts of the discussions that actors (re-)enacted, which made the transcripts of both conditions approximately equal in length. We edited the discussions as little as possible. All edits were aimed at improving the readability and/or comparability of the transcripts. When in doubt, we used a conservative principle, that is, differences between media were minimized rather than maximized. The chat transcripts were kept almost unedited. Only the characteristics that could not be present FtF, and might therefore confound the results, were edited: we removed spelling errors (language errors were maintained), replaced the one emoji by “haha,” excluded parentheses, and wrote out abbreviations. The FtF conversations were transcribed by removing unfinished sentences (as these might unjustly raise the impression that speakers were interrupted), writing out overlapping speech as if comments appeared after each other (but not when this would unjustly raise the impression of ignoring), adding commas and some periods to indicate speaking pauses, adding ellipses (. . .) when a speaking turn was ended prematurely, including citation marks (“”) where people referred to a specific term or cited someone else, and replaced laughing by “haha” (not for chuckling as this might unjustly communicate sarcasm). The FtF transcripts were structured like the original online chats where four differently colored blocks with a letter (
A sample of 103 participants (
Results Study 2
Condition effects were partially different from those observed in Study 1b (see Table 2). In line with Hypothesis 1b (and Study 1b), participants considered the online chat transcripts clearer than the FtF transcripts (
For each dependent variable measured in Study 2, the test results of the difference between conditions (chi-square test and effect size), and the means with 95% confidence intervals per condition.
Cohen’s
Whereas we did not observe significant differences on responsiveness and perceived shared cognition between conditions, it is still interesting to test whether (within medium) variations in perceived responsiveness and clarity can explain variations in perceived shared cognition and solidarity. In line with Hypothesis 3a (and Study 1b), responsiveness positively predicted perceived shared cognition and solidarity (

Repeated-measures regressions where perceived responsiveness and clarity predict perceived shared cognition and solidarity in Study 2 (unstandardized betas are reported). The repeated measures correlations between responsiveness and clarity, and between shared cognition and solidarity are also shown.

First section of one of the two face-to-face conversations presented to participants in Study 2. The discussed statement is “It is natural that there are few women in the top of business and government.” The two standardized comments by the confederate are included (person M). Translated to English by the first author.

First section of one of the two chat conversations presented to participants in Study 2. The discussed statement is “Dutch businesses that support child labor should be punished heavily.” The two standardized comments by the confederate are included (person P). Translated into English by the first author. The text between brackets was not included in the stimulus materials but added to this figure for ease of reading.
Discussion Study 2
The results of Study 2 partially support the hypotheses
When examining the results of the regressions, we see a similar pattern of support for some hypotheses and not for others. With respect to responsiveness, the more there was of it, the more agreement and solidarity observers inferred. But clarity was
General discussion
It is often said that online discussions are more prone to polarization than FtF discussions. Recent research suggests that the subtle skills of everyday diplomacy play a central role (Roos et al., 2020). Even when there is no real polarization in terms of disagreement, online interaction partners can still perceive polarization because contention is dealt with less responsively and disagreement is conveyed more clearly. However, as these conclusions were drawn from between media comparisons, other media characteristics, such as anonymity, may have confounded these effects. Therefore, the current article developed a new methodology to test the generalizability of these findings from between media to within media. By keeping constant the medium in which discussions were perceived and assessing the perceptions of observers that were not aware of the medium in which the discussions were originally held, we isolated the consequences of diplomatic behaviors.
Translating chat to FtF
First of all, the process of translating discussions from one communication medium to the other proved very informative about the differences between media. In a thematic analysis of the actors’ reflections on the (re-)enactments, we inductively identified three (recurring) themes: (1) the challenging experience of (re-)enacting both chat and FtF discussions, (2) the many empty phrases in FtF discussions, and (3) the lack of action-reaction sequences in online chats.
The (re-)enactments of both the chat and the FtF discussions were challenging, but both in their unique ways. The re-enactment of FtF discussions felt unnatural to actors mostly because of the amount of content-less speech. This took the form of short interjected expressions like “hmm” and “or so,” but also of long speaking turns in which it did not become clear what interaction partners actually meant or wanted to say. While reading and enacting, actors felt these “empty phrases” made the interaction feel uncomfortable and weird, but when looking at the resulting videoclips, actors saw that these fulfilled an important social function: interaction partners seemed to be holding back to first explore what was acceptable to say before sharing their thoughts. In the online chats, actors noticed that interaction partners were much more to-the-point, which disrupted the smooth transition between speakers and gave these conversations an unnatural quality too. Interaction partners seemed to be dedicated to clearly communicating their own opinion, but to ignore each other’s comments in the process. Interestingly, these observations support the hypotheses that a reduced responsiveness and increased clarity lead to perceptions of disrupted social relationships.
Observers’ perceptions
The hypotheses were tested by having participants rate original online chat and FtF discussions in an FtF (Study 1b) and online chat format (Study 2). Together, the findings partially support the hypotheses (based on Roos et al., 2020) and partially support an alternative Medium-Fit argument (based on Expectancy Violation Theory, Burgoon and Hale, 1988; Burgoon and Walther, 1990).
FtF and text-based responsiveness
In line with Hypothesis 3a, in both FtF and text-based chat formats, observers considered responsiveness indicative of agreement and solidarity. However, what is considered responsive behavior appears to be medium-specific. Echoing the disrupted action–reaction sequences noticed by the actors, observers thought the original online chats were less responsive, and therefore more divided, when seeing these enacted FtF. But when presented with these same online discussions in a chat format, observers did not consider these any less, nor more, responsive than the original FtF discussions. This may point to a Medium-Fit effect: the behaviors that make FtF discussions more responsive might be experienced as disrupting the smooth coordination of online chats, thereby reducing responsiveness. Considering the importance of responsiveness in fostering a sense of common understanding and agreement (e.g. Davis and Perkowitz, 1979; Reis and Clark, 2013), this absence of a condition effect on responsiveness could also explain the absence of an effect on perceived shared cognition in the transcripts.
FtF and text-based clarity
In line with Hypothesis 1b, independently of the medium format in which it was presented, observers thought interaction partners in the original online discussions expressed themselves relatively clearly. However, in line with a Medium-Fit argument, the consequences of clarity depended on the medium format. In the enactments, observers (and actors) felt clarity signaled disrupted social relations. But in the transcripts, clarity was associated with slightly
This can be connected to the observation that the behaviors that make FtF discussions responsive might not work online. Indeed, ambiguity in the enactments was considered conductive to responsiveness by actors (empty phrases promoting smooth action–reaction sequences) and observers alike (negative correlation between clarity and responsiveness). However, this relationship was absent in the transcripts. This suggests that ambiguity serves a responsiveness function FtF, but that ambiguating online might be experienced as avoiding the issue and therefore not responsive at all.
In sum, the findings of Roos et al. (2020) were replicated in the context of FtF discussions but not in the text-based chat format. This suggests that what counts as diplomacy is not the same for each medium. Responsiveness seems to be a diplomatic skill that helps preserve good social relations in both online and FtF discussions. But, in line with a Medium-Fit argument, observers appear to (unwittingly) prefer a match between the degree of outspokenness and the communication medium. That is, whereas ambiguity can be used to good effect to prevent escalation in FtF discussions, ambiguity in online chats may be interpreted as evasiveness and thus backfire.
Limitations and future directions
In the process of translating discussions held in one medium into another medium, many, often rather subjective, decisions had to be made. For example, the actors added some nonverbal cues (such as eye contact) to the purely textual chat discussions, and we did not include all interruptions when transcribing the FtF discussions. These adaptations were made to prevent the conversations from becoming completely incomprehensible or clearly unnatural, which would introduce extra noise. These decisions underscore just how different these media are and how difficult it is to directly compare them. However, we believe that there is still value in attempting to do so, precisely because we want to learn more about the differences between media. Moreover, the adaptations we made, in effect making chat and FtF more similar, would undermine the true differences between conditions rather than artificially increasing them. In fact, our conservative approach could be an alternative explanation for the absence of a main effect on responsiveness in Study 2. Further research could explore the consequences of making different choices in the translation processes.
Although the used method allows us to keep constant many characteristics that differentiate media, the conditions arguably still differed on other characteristics besides responsiveness and outspokenness, most importantly expressed disagreement. We can assume these differences are negligible, however, because the discussions used as stimulus materials came from a previous study in which no medium effect on expressed disagreement was found.
Relatedly, as the stimulus materials were controversial discussions held in small groups of strangers in a lab environment, the generalizability of the findings might be limited to this specific type of discussions. It could be that the observed processes work out differently in large-scale discussions, in non-contentious conversations, or in discussions among non-strangers. However, by pointing away from anonymity as a key driver of online polarization effects, our findings could also help explain why polarization may occur in contexts such as social media platforms, in which people are not anonymous to each other, but are limited in the diplomatic skills they can enact. We therefore encourage future research to assess replicability in different online contexts.
The generalizability of the results is further limited by our sample: young, Western, and highly educated participants. Studying young adults, who are relatively accustomed to participating in online conversations, can be considered a conservative test of our hypotheses: more senior people’s expectations about diplomatic behaviors in online discussions might more closely match their norms for FtF discussions. As a result, more senior people might consider the online lack of responsiveness and ambiguity more indicative of polarization. Furthermore, different cultures are expected to have differing norms concerning diplomatic behavior (Culpeper et al., 2010). This might also be the case for people from different educational levels and socioeconomic backgrounds. These are all interesting directions for future research.
Conclusion
The present research shows that polarization can occur merely by the way in which text-based online media lead people to express themselves and by how audiences subsequently interpret their messages. Importantly, none of these effects were caused by selective exposure due to the algorithms, filters, or bubbles, which according to other scholars are key drivers of polarization (e.g. Settle, 2018). Rather, polarization can be perceived simply due to the misattribution of unintentional behavior. Arguably, the piling up of these misunderstandings on a micro-level might lead to the formation of camps that feel ever more divided, resulting in a polarized society on the macro-level. That is, the seeds for societal polarization might be sown on this micro-level, unknowingly, unintentionally.
Accordingly, we argue that to understand polarization we should look beyond the dynamics of “us” and “them” and people’s perceptions of society at large, and move beyond the use of big data and large-scale surveys. We believe that to understand how polarization occurs in the midst of communities both online and offline, the dynamics that lead to escalation and de-escalation need to be studied more closely. By presenting a new method for comparing these dynamics within different media, we hope to inspire research that aims to uncover, and ultimately promote, dynamics that contribute to an undivided and harmonious society.
Footnotes
Acknowledgements
The authors thank the four actors of theater group Vals Alarm for performing the (re-)enactments and sharing their experiences with us: Margot Vlot, Vera van Burgsteden, Willemijn Bouma, and Yteke Hoekstra. The authors additionally thank Justin Richardson for helping with the translation of the actors’ quotes to English.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
