Abstract
In an increasingly interconnected world, many people handle large parts of their communication online, often via social networking sites (SNS). In contrast to face-to-face communication, messages on SNS are accessible by potentially unknown and large audiences. However, it is an open question what users actually perceive as a
From an early age on people extend their mode of communication to digital devices and thus to the online realm, in particular to social media such as social networking sites (SNS; MPFS, 2020). Three quarters of adult U.S. internet users have an account for at least one SNS (Perrin, 2015), which they mostly use on a daily basis. These services change crucial parameters of communication as people know it from offline contexts. As opposed to offline contexts, it is inherently easy to make a message publicly available; in contrast, however, it takes an effort to keep content private when communicating online (boyd, 2010). This is partly caused by the fact that the content is stored on the service provider’s servers from where it is widely accessible by potentially unknown audiences at any point of time in the future (Bernstein et al., 2013; boyd, 2008). While audiences on SNS may on one hand provide social support (Ellison et al., 2007), they may on the other hand cause a user to experience negative consequences such as context collapse (Marwick & boyd, 2010) and adverse events such as gossip, hurtful comments, or social embarrassment. The extent to which users are vulnerable to these risks thereby depends on the
When SNS users decide to disclose information to an online audience, their weighing of risks and benefits (so-called privacy calculus, Trepte et al., 2017) is likely to be biased. For instance, people have limited information about their privacy settings (Moll et al., 2014) and they perceive their own internet privacy risks to be smaller compared with others’ risks (Third Person Effect, Chen & Atkin, 2021). SNS users—as all people—have a preference for immediate gratification and may therefore consider the benefits of self-disclosure rather than the distal risks of that same behavior (Acquisti, 2004; Hallam & Zanella, 2017). In addition, users may have distorted implicit beliefs about the—potentially large—audience’s behavior and seem to think that the actual audience will be small despite the public nature of their shared information (Moll et al., 2017).
Interestingly however, the perception of an audience to be large (or not) may depend on the collectively understood attributes of the particular context, namely the mode of communication. Thus, the fact that online and offline communication differ in their defaults regarding information access and distribution is likely to have shaped people’s understanding and expectation of what it means, when there are “many” or “few” people in an audience. In the following, we will refer to the psycholinguistic approach of so-called
Quantifiers and Context
Vague quantifiers or Natural Language Quantifiers (NLQs) such as “few,” “some,” “lots of,” or “many” are a fundamental part of our language. Whenever we are not able to speak of specific numbers, we use vague quantifiers to give an approximating impression of our perception. Vague quantifiers are believed to “map onto a second number system that appears to be common to pre-linguistic infants, adults and many animals” (Coventry et al., 2010, p. 222). Since NLQs are vague by definition and purpose, their interpretation inherently depends on the context in which they occur (Newstead & Coventry, 2000). To grasp the meaning of a proposition that includes a vague quantifier, conversation partners need to have some form of shared conceptual understanding of the context (Clark & Brennan, 1991).
For example, to say that “there are
Context, however, is not only implied by the attributes of an object but also by the situation in which the object occurs. In that way, set size, functionality, and even unrelated objects in the vicinity can influence the interpretation of vague quantifiers (Coventry et al., 2010; Newstead & Coventry, 2000). Thus, stating that “there are many grains of sand
For instance, we assume that people’s context knowledge entails the (principally valid) assumption that in comparison to public-by-default online communication, offline communication is generally private-by-default and that publicness is only achieved through effort. If this was true, they should associate lower quantities with the expression “many people” for offline than for online contexts. Moreover, if this was also true for situations where information access is restricted as in private groups on SNS, for example, this scalar shift would also imply a tolerance for larger audiences in situations that are considered to be private.
In a typical paradigm in which the meaning of vague quantifiers is empirically investigated, short vignettes are presented to the participants in which a quantifier describes an object within a specific context. Usually, the quantifier describing the object and/or the context in which it occurs is varied experimentally. Participants are asked to indicate, for example, what specific number the quantifier denotes in the situation described in the vignette (Moxey & Sanford, 1993; Newstead & Collis, 1987 see Experiment 1). The changing numerical meaning of one and the same quantifier allows for an inference about different expectations people have across different contexts. With regard to online communication, a most interesting question concerns the interaction between different contextual parameters. For example, would a private context entail different numerical expectations when it takes place within an online as opposed to an offline interaction?
Other experimental designs turn this idea around: Participants are presented with specific numbers or proportions of an object
In the present study, we applied both study designs to investigate the participants’ expectations in different contexts. In the following, we will elaborate and report the rationale, method, results, as well a brief discussion separately for Experiment 1 and then Experiment 2, followed by a general discussion for both studies in reference to the theoretical idea and potential implications.
Experiment 1
Rationale and Hypotheses
The first aim of the present study was to explore people’s expectations regarding the quantity of people that audiences encompass in different situations (i.e., audience expectations). Therefore, we asked participants to map numbers to quantifiers for audiences in different contexts. We chose the quantifiers
We hypothesized that numbers assigned to the quantifier
Second, we hypothesized that people’s experiential understanding of online contexts leads them to expect larger numbers of people in online audiences—and thus assign greater numbers to both quantifiers—compared with audiences in offline contexts (Hypothesis 2).
At the same time, audience size is influenced by the extent to which there are physical or technological restrictions to the extent to which a message can be accessed by others, that is, the extent of privacy. Overall, numbers assigned to quantifiers in public contexts should be larger than the ones in private contexts (Hypothesis 3). However, it is an open question, what numerical expectations are triggered when both the mode of communication and the extent of privacy describe the context in which an audience occurs. We therefore explored the question of how both contextual details
Methods
Participants
We distributed the link to the online experiment via snowball sampling which started on the SNS Facebook. Since the study was conducted in German, the sample was restricted to German-speaking participants. Our final sample consists of
Design and procedure
The experiment was implemented as computerized survey via Unipark (QuestBack). As an introduction, participants were informed that we were interested in the numerical meaning of vague words like
Materials
We were particularly interested in expectations regarding audiences in different situations, namely, the number of people who (1)
Medians, Lower (Many), and Upper (Few) Quartiles for Items in Experiment 1.
Results
The descriptive results indicated large variability between the participants’ estimates. For example, mean estimates for the item
Apart from the descriptive results per item (Table 1), we were interested if the numbers assigned to the specific quantifiers differed as a function of our experimental variations of context. To investigate this, outliers with
Results show significant main effects for
First, we found a two-way interaction between
A two-way interaction between

Bars indicate the medians of estimated numbers that participants assigned to the quantifiers in the described context conditions (x-axis): privacy (private, public) and mode of communication.
Finally, a two-way interaction between
Discussion
The results found in Experiment 1 overall support Hypotheses 1 to 3. However, we also found several interesting interactions between the experimental factors.
Quantifier × privacy
First, our results showed that
Quantifier × mode of communication
Similarly, while participants generally expected larger audiences in online than in offline modes of communication (confirming Hypothesis 2), this was particularly true for the quantifier “many”: “Few people” denoted similar numbers in offline as in online contexts. We can conclude that the meaning of “few” is less context dependent than the meaning of “many”: “Many” does not imply a restriction at the upper end of a numerical scale which seems to be different for the meaning of “few.” Thus, when speaking of “many people” in an audience, people’s everyday understanding of online versus offline contexts seems to acknowledge the different core characteristics of these contexts.
Privacy × mode of communication
A similarly interesting pattern occurred for the factor privacy. While participants on average expected roughly twice as much people in public than private situations (confirming Hypothesis 3), the difference needs to be interpreted in the light of a significant interaction with mode of communication: For
Experiment 2
Rationale and Hypotheses
Overall, the descriptive results from Experiment 1 showed a large between-subject variance in the estimations for one and the same item, indicating that people may have very individual expectations toward the meaning of vague quantifiers in certain situations. This can in part be explained by different experiences that people make in one and the same context—experiences that shape their contextual understanding. Importantly, however, regardless of their individual experiences, people should at least roughly agree in their contextual understanding when acting in or talking about an audience, because otherwise they would not properly understand what someone else means by describing
In Experiment 2, we used the same vignettes as in Experiment 1, but this time they included
Experiment 2 aimed to validate the results and interpretations from Experiment 1 in two major ways. First, if people mainly agreed with the appropriateness of the median in a particular situation, this would support the validity of the descriptive results from Experiment 1 (also see Hypothesis 4). Second, we aimed to investigate in an explorative manner whether contextual parameters would influence peoples’ acceptance of deviations from an expectancy value. This would imply that context determines in how far people’s numerical expectations are variable.
Methods
Participants
Participants were school students from Germany who were recruited during their visit of the local university in Muenster. Ninety-seven participants started the experiment; one person was excluded because of systematic response behavior, seven participants were excluded because they did not explicitly agree that their data could be used for further analysis. Our final convenience sample thus consisted of
Design and procedure
The experiment was implemented as computerized survey via Unipark (QuestBack). We used a 2 × 2 × 2 × 3 design.
Materials
While the context specified within the items remained the same (Table 1), participants in Experiment 2 had to repeatedly judge how appropriately the quantifier described a specific number that we included in each item’s depiction. For example, one item said “
Each of the 22 items from Experiment 1 was presented in three different versions within each quantifier condition; thus, each participant rated 66 out of a total of 132 items. The three item versions differed in their numeric
The
The
Results
The randomized distribution of participants to one of the two quantifier conditions resulted in a slightly uneven cell distribution with 41 participants in the
As our materials’ design intended, appropriateness ratings for targets lay significantly above the scale’s mean (4),
Means and Standard Deviations of Appropriateness Ratings From Experiment 2.
We found a significant two-way interaction between
The factor
The significant main effect of the factor
Discussion
While Experiment 1 showed that people can have a different understanding of vague quantifiers in a specific context, results from Experiment 2 suggest that there is some kind of agreement about scale properties of audiences in different contexts, that is, about the meaning of vague quantifiers. Thus, participants rated the medians extracted from the items’ distributions in Experiment 1 to be appropriately described by the two quantifiers. More specifically, their ratings were not only higher than the scale’s mean, but also higher than their ratings for the two distracters (Hypothesis 4).
Value × quantifier
For the quantifier
Privacy × quantifier
Moreover, the two quantifiers seem to be conceptually tied to different contexts. Thus, participants in the
Value × mode × privacy
The results showed a main effect for the factor mode of communication, showing that overall numbers in offline contexts were rated to be more appropriately described by the quantifiers than in online contexts. For public contexts, this was true for all quantities and all value conditions. For private contexts, however, only the target condition produced higher appropriateness ratings offline than online. On one hand, this pattern shows that regardless of the mode of communication, the conceptual understanding of “private” remains narrow. Thus, even smaller deviations from the expectancy value carry a lot of weight as to the perceived appropriateness of the number that is described by the quantifier. On the other hand, since appropriateness ratings in the public conditions were also higher when combined with the ambiguous or the contrast value, this interaction suggests that people do not necessarily have fixed quantitative expectations once communication is online and public.
General Discussion
We argued that the conditions of online communication—in particular on SNS—have changed people’s expectations and numerical understanding of audiences in online contexts and presented two scenario-based experiments to explore this matter. In both experiments, we varied whether the described interaction took place online or offline, and whether access was restricted (private) or unrestricted (public). To the best of our knowledge, this is the first study systematically assessing the question of the numerical meaning of vague quantifiers in the context of online and offline audiences. When using this theoretical and methodological approach as a framework to research people’s expectations of audiences, several interesting patterns occur:
First, both results from Experiments 1 and 2 suggest that regardless of communication mode there seems to be an upper limit restricting the number of people that can be present when a situation is considered as private. Corroborating this finding, there is a conceptual connection between the quantifier “few” and the context attribute of privacy. In that way, the public-by-default nature of online communication seems to
Second, while our findings do not support a literal change in the concept of privacy, they do support a change of the expectancy value, or else, the scalar representation: In online communication, more people are expected to be in the audience as opposed to offline communication,
Third, we need to consider the finding from Experiment 2, showing that appropriateness ratings were systematically higher for offline than for online situations. Since this was true for all value conditions (including ambiguous and contrast values), it seems that numeric expectations for offline contexts are more gradable than online ones, especially when they are public. This result seems counterintuitive, because audiences in online situations are typically much more difficult to predict and the audience size of public online information is potentially limitless. This should actually
However, our results require a different explanation. The higher flexibility in public offline as opposed to public online contexts may derive from the fact that in online situations people usually have some sort of—inaccurate—metric indicating the size of the audience, for instance, the number of comments or Likes beneath a post. This metric, however, is completely absent in offline contexts. Thus, people will rarely have any evidence relating to the number of people who may have seen a poster on an advertising pillar. Therefore, their assignments of quantifiers to numbers is based purely on probability instead of experience. This would explain higher appropriateness ratings even for ambiguous and contrast values and thus higher gradability. Supporting this argument, the exceptions to this pattern were private situations. Here, ambiguous and contrast values were not rated as systematically more appropriate in offline than in online contexts. This makes perfect sense, because for private offline situations, people do have an experiential metric for the number of people in the audience. After all, in private situations the audience is usually fully controlled by the sender of the message (for instance, when someone sends invitations to a party via the postal service). In that way, although there are less boundaries and thus less actual certainty in online communication, the scalar concept of audiences is less gradable in online as compared with offline communication.
Limitations
The interpretation of our results has several limitations. First, our results are based on rather small convenience samples. As such, we cannot rule out the possibility of beta errors; moreover, generalizations to other populations may be limited. At the same time, participants in Experiment 2 found the medians created by participants from Experiment 1 highly appropriate although the two samples differed in their demographic properties, such as their age. This implies that the overall pattern of results is likely to occur in other samples as well.
Second, the short vignettes describing the different contextual parameters in both Experiments 1 and 2 may in some cases differ in more respects than intended by the experimental factors. This problem is inherent in the research question and hypotheses posed in the beginning, because online contexts simply are very different from offline ones. In that way, it might not actually be possible to describe an offline situation that only differs from an online situation in the described respects.
Third, it can be argued that the interpretation of quantifiers is not only based on context, but also on the individual experiences people have made in these contexts. Thus, due to cross-linguistic and cultural differences, individual usage patterns or privacy preferences, people may be used to very different audience sizes in online communication. While this leaves room for future research, usage patterns and privacy preferences do not necessarily confound our findings: First, since people rarely act in accordance with their privacy preferences, these should not systematically lead to different experiences online (so-called privacy paradox; for a critical review, see Solove, 2020). Second, people in a similar age group with roughly similar usage patterns (which are implied by our sample description) should—regardless of their individual experiences—have shared knowledge of the most basic conditions of online communication, and should thus at least roughly agree on the meaning of “many” and “few.” This notion is supported by results from Experiment 2, where distracters are systematically rated as less appropriate than targets.
Conclusion and Future Research
The present study shows that people’s understanding of online audiences differs from offline ones in that the former has increased people’s expectation of larger audiences. This shift in scalar concept can also be observed for situations that are considered to be private. We might argue that this shift is a simple reaction to the reality of online communication, specifically on SNS, which was the context mostly underlying our rationale. However, a shift in scalar concept also implies a less clearly determined concept of a
Future research could, on the one hand, explore the influence of users’ individual experiences on their expectancy values (see section “Limitations”). On the other hand, it would be interesting to assess the exact scalar limits that people apply in different situations: Which contextual attributes are necessary for a message to still be validly called “private”? Is it more important how many people can potentially read a message, or rather, how many people have actually done so? This line of research would lead to a clearer picture of how people understand the concept of privacy and whether the public-by-default nature of online environments has changed crucial parameters of this concept. In this regard, future research could assess, in how far quantifiers in online and offline events are actually mapped onto one and the same scale, or whether online contexts represent their own ontological category of communication. This again would imply that the concept of privacy itself is not changing per se, but can simply not be applied to online contexts.
Both interpretations may become particularly relevant with regard to online users’ risk perception when disclosing personal information via SNS: When they judge an actually large audience to be “not that large” in that particular context, their risk perception may be constantly lowered—which may have consequences for the way users regulate their privacy.
In that way, our study also has implications for educational practice in the context of digital literacy: When explaining why it might be harmful to disclose personal information in an SNS context, scholars and practitioners should be aware that their educational attempts may fail, because younger and potentially more experienced SNS users have a different conceptual understanding of audiences in online contexts. Thus, digital literacy education aiming to increase users’ awareness of potential privacy problems when communicating online could—as one part of a training—discuss people’s conceptual understanding of audiences, audience sizes, and in how far consequences of online publicity may easily affect one’s “offline” life. One possibility to enhance users’ risk awareness thereby directly relates to the numerical representation of audiences. Thus, providers should, whenever possible, include displays of actual audience sizes to facilitate individual privacy regulation.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by the research training group 1712/01 “Trust and Communication in a Digitized World” of the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG).
