
Abstract

In this paper we explore human communicative behaviour in unsolicited commercial telephone calls between human telemarketers and ‘bots’ that exhibit human characteristics. Drawing on a corpus of recorded telephone conversations between telemarketers and a spam-interception service, we examine some of the communicative dimensions through which telemarketers make sense of their interactions with this technology as trust, or rather the illusion of it, is established, severed and restored. The analysis shows how trust is established early in the calls through an authentic human voice, the illusion of progressivity and purported intersubjectivity, including ‘doing-being-human’ excuses. In cases where telemarketers realise they had not been talking to a human, verbal abuse towards the bot, and expressions of surprise and embarrassment oriented to their professional face are articulated as the call is used as a training opportunity to identify bots. The article contributes to understanding some of the technology-enabled contemporary communicative practices human beings engage in as part of their everyday lives. It raises questions about how humans negotiate trust and validate authenticity in an increasingly automated and technologically driven world.

Keywords

bots, excuses, face, intersubjectivity, trust, progressivity, scam calls, spam calls

Introduction

Unsolicited phone calls from legitimate and illegitimate organisations are a significant global issue. The prevalence of spam calls has greatly increased in recent years due to technological advancements such as VoIP, which enables access to telephone networks through the internet (Saunders & Frascella 2022), making it easier and more cost-effective to make huge volumes of calls.

Efforts to counter nuisance calls include national ‘do not call’ registers that are available in some countries (e.g., Telephone Preference Service in the UK). Individuals can register their number to opt out of unwanted calls, and in order to try and protect consumers from fraud, many network providers have spam blocking measures in place. Regulatory bodies are introducing new rules for telephone providers that require them to identify and subsequently block spam numbers (Ofcom 2022, Povich 2022). However, fraudulent companies are using more sophisticated tactics to avoid detection, such as ‘spoofing’ caller ID in order to impersonate a trusted organisation, for instance, a bank or insurance company (Ofcom 2022).

There are now a number of companies which offer a service that intercepts spam calls¹ received by landlines or mobile phones. In this paper we examine a sample of calls from one of these companies. For an annual subscription fee, the Respond and Protect Telephone Company (R&P)² offers ten different ‘robots’ that subscribers can choose to intercept any spam/scam calls they receive. These ‘bots’ are in fact recordings that have been created based on analyses of telemarketing calls. Their purpose is to give the illusion that the caller is conversing with another human and to hook potential spammers or scammers into rounds of interaction without getting out of the loop to waste as much of their time as possible. The bots are, however, elementary. There is no speech recognition or AI to modify the bots’ answers; the same set of utterances is used at the same time and in the same order, akin to the chatbot ‘Lenny’ in Relieu et al (2019). Unlike human interactants, who work towards mutual understanding, the bots are incapable of producing a (non-answer) response (Stivers & Robinson 2006) or of parroting back. Their utterances are not adjusted progressively or incrementally. Despite their basic technology, the formulaic nature of the telemarketers’ (TMs) scripts (Lockwood et al 2009) is a key factor which enables the bots’ utterances to be effective and keep the TM on the line.
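The fixed-playback design described above can be illustrated with a minimal sketch in Python. It is not R&P's implementation, which is not documented beyond the points made here. The utterances are adapted from the prospect's turns in Excerpt 1; the timing offsets and the names `Cue`, `SCRIPT` and `bot_turns` are hypothetical and serve only to show that the caller's talk has no bearing on what the recording plays.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cue:
    offset_s: float  # hypothetical playback time, in seconds from call start
    utterance: str   # pre-recorded turn (wording adapted from Excerpt 1)


# Assumed script: the order is fixed and the offsets are invented for illustration.
SCRIPT = [
    Cue(0.0, "Hello?"),
    Cue(8.0, "Yes."),
    Cue(12.0, "I'm sorry, could you say that again?"),
    Cue(25.0, "Oh"),
    Cue(33.0, "Sure"),
    Cue(40.0, "Okay?"),
]


def bot_turns(elapsed_s: float, caller_speech: str = "") -> list[str]:
    """Return every utterance the recording has played by `elapsed_s`.

    `caller_speech` is accepted but deliberately ignored: with no speech
    recognition, the caller's talk cannot alter what is played or when.
    """
    return [cue.utterance for cue in SCRIPT if cue.offset_s <= elapsed_s]


if __name__ == "__main__":
    # Two different caller turns yield the same bot output at the same moment.
    print(bot_turns(12.0, "how are you"))
    print(bot_turns(12.0, "can I confirm your last name?"))
```

Under these assumptions, any apparent fit between a bot utterance and the preceding turn arises from where the recording happens to fall in the TM's script, not from any adjustment by the bot.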

Any form of social interaction involves a level of implicit or explicit trust, expressed by information that is relayed through a communication channel between individuals (Hancock et al 2023). Trust is “the extent to which a person is confident in and willing to act on the basis of the words, actions, and decisions of another” (McAllister 1995: 25). In the calls we examine, from the TMs’ perspective, trust is plausible given that there is an authentic human voice. It is contingent on action and grounded in the fact that throughout the calls each of the TMs’ utterances receives a reaction. Even if the responses are ill-fitting, they are often ignored by the TMs in the hope that the called party (i.e., the bot) will focus on the business in hand and some value (i.e., a sale/scam) will arise from the trust invested by the TM.

Our study aims to provide a first exploration into human communicative behaviour in unsolicited and often fraudulent commercial telephone calls enabled by bots. We focus our attention on some of the communicative dimensions through which humans make sense of their interactions with these technologies as trust, or rather the illusion of it, is interactionally established, severed and restored. In so doing, we identify the conversational mechanisms through which recordings perform a human persona and examine the TMs’ reactions to the realisation they had not been interacting with a human.

In the next section we discuss the interactional principles through which trust is established. In section 3, the data and methods are introduced. Section 4 presents detailed analyses of the interactions followed by the conclusion.

(Re)establishing trust in pursuit of prospect engagement

Recent studies on nuisance and scam calls have explored methods to detect and prevent abusive and misleading calls in human-human interactions (e.g., Javed et al 2021, Wood et al 2023). They have also identified some of the mechanisms through which digital deception and cybercrime are enacted (e.g., Dynel & Ross 2020, Rowe 2009). The rise in cyber fraud has led to organised online vigilantism or scambaiting (Loveluck 2020, Button & Cross 2017) whereby the scam-baiter engages with the scammer and exposes their fraudulent activity. Whilst there are similarities between R&P’s project and scambaiting, for example, attempting to frustrate, prolong interactions and waste time (Chia 2020, Smallridge et al 2016), an important distinction is that scambaiting involves online human-human interaction, whereas R&P’s service is based on human-bot interaction; the bots are designed to intercept all unsolicited calls but not to commit any sort of (cyber)crime.

Telemarketing calls, legitimate or otherwise, follow a detailed organisational script through which businesses strive to direct and maintain the sequential organisation of the calls and their direction (e.g., Jagodzinski & Archer 2018, Tovar 2020, Márquez Reiter 2011) with a view to achieving efficiencies and increasing sales. The calls we examine in this paper are characterised by the fact that their organisation is disrupted, with the TMs losing the direction of the call and displaying negative emotional reactions towards the bots or embarrassment upon realising they had not been talking to a human being. Even though the interactional projects of the TM and the R&P bots are related in the sense that they both seek to maintain interactional engagement, the TM’s objective is to sell an alleged service to the ‘prospect’ and the bot is designed to waste the TM’s time.

For the bots to waste as much of the TMs’ time as possible, they need to project personhood, that is, they need to display some self-awareness, capacity to reason, capacity to communicate, and some self-motivated activity. It is the very idea or illusion of personhood and the convenient communicative setting that helps to establish trust, at least for the TMs who think they are talking to a human.

Trust has received a great deal of attention in the humanities and social sciences. It is generally considered to be a precondition for social interaction (e.g., Weber & Carter 2003 on the social construction of trust in everyday life) and cooperation (e.g., McCabe, Rigdon & Smith 2003 on trust and reciprocity in games). In human-human interaction, conversational participants assume others to be cooperative (e.g., Grice 1975), honest (e.g., Bellucci & Park 2020), able to fulfil their promises (e.g., Searle 1975) and a relatively trustworthy source of information (e.g., Sheard 2015).

Ullman and Malle’s (2018) analysis of the semantic space of trust showed that the human-human interaction literature often conceives trust as an agent’s acceptance of vulnerability in an interaction or relationship; that is, the belief that the other will not exploit their vulnerability. By contrast, when humans interact with machines, trust appears to be grounded in the machine’s capacity to do what its label says. Based on this, Ullman and Malle distinguished between “relational” trust and “capacity” trust. Most measures of trust, especially in human-robot interaction, focus on the person’s belief that the robot can complete a given task, while most measures of trust in human-human interaction focus on integrity, loyalty, and other social-moral constructs. The results of their study suggest that trust has a multidimensional structure with four distinct dimensions: Being Capable, Ethical, Sincere, and Reliable. We shall return to these in the analysis.

In the calls examined here, the bot’s utterances, in the initial stages of the call, broadly index a degree of conversational cooperation (Grice 1975). This is because every contribution by the TM receives a reaction from the bot. At first sight, these reactions suggest some level of informativeness and truthfulness, and they appear to be relatively relevant and clear, especially as far as the call’s progressivity is concerned.

As the call progresses, however, this transpires not to be the case. The premise of R&P’s service is to waste the TMs’ time and keep them on the call as long as possible. Trust is vital to the continuity of the call; if the TMs suspect they are not interacting with a human, they will end the call and move on to the next number on their list.

Trust is thus implicit early in the conversation given the bots’ authentic human voice, willingness to engage and the display of purported intersubjectivity, and it is sustained by the TMs’ need to maintain the prospect on the line and progress the call with a view to pursuing their goal (i.e., to make a sale or defraud).

Intersubjectivity, or the state of mutual understanding between interactants, is achieved through the sequential organisation of talk (Schegloff 2007) whereby each new turn continuously updates interactants’ understanding of the previous turn (Heritage & Atkinson 1984: 11). The principle of progressivity refers to the advancement of conversation within turns and sequences (Schegloff 2007). In interaction, there is a preference for progressivity (Heritage 2007, Stivers & Robinson 2006), and shared understandings are designed to achieve some action, such as agreement or response (Schegloff 1992). Intersubjectivity and progressivity are, therefore, two of the principles through which conversational cooperation gets practically done as “actions are co-created by the concerted effort of the participants” (Mey 2010:2887). One way in which participants enact cooperation is by supporting the progress of interaction (Stivers & Robinson 2006), understood as showing alignment, e.g., responding to a question for information with an answer, rather than with arbitrary tokens, such as ‘mmm hmm’ (Stivers 2008). Thus, progressivity is cooperation at the structural level of the interaction (Duranti & La Mattina 2022) and participants typically adjust their actions to achieve intersubjective attunement (Rommetveit 1988).

The calls we analyse illustrate how the TMs hear the level of intersubjectivity that has been achieved as ‘good enough’ (Garfinkel 1967:8) to move the conversation forward, and how the bots hold themselves accountable for the suspension of progressivity. They do this by requesting the repetition of a previous sequence based on excuses (Scott & Lyman 1968) for temporary lapses in understanding. The excuses are grounded in human traits and the TMs readily orient to this portrayal of personhood.

Data and methods

The data for this article draw on a corpus of 140 telephone conversations recorded by the R&P Telephone Company and uploaded to YouTube between 2015 and 2022. Forty-seven calls were randomly selected from the corpus; all recordings were listened to and poor-quality calls, very short examples or extremely offensive calls were discarded. Calls were transcribed using Jefferson transcription conventions (Jefferson 2004). We present here an analysis of a sample of these calls depicting recurrent patterns of communicative phenomena observed across the dataset to give a flavour as to how they unfold and the TMs’ verbal reactions to realising they had not been interacting with a human. The data are analysed from an interactional pragmatics approach (Márquez Reiter 2009, 2019, cf. Chang & Haugh 2011, Haugh & Culpeper 2018). The analysis draws on concepts from conversation analysis (Schegloff 2007) to identify the stages in the calls where marked sociopragmatic phenomena unfold, such as third-party interventions, issues of professional face and emotional responses when realising the called party is not human, and to show how they are interactionally constructed by the humans. This is coupled with analytic interpretations which consider the larger economic context in which the pragmatic phenomena are embedded: telephone service provision continues to be a prevalent commercial practice with agents working under pressure and surveillance (Brophy 2017; Tovar 2020).

Whilst the recordings are publicly available on YouTube, we acknowledge the ethical dilemmas of using publicly accessible online data (boyd & Crawford 2012). It was not possible to obtain informed consent as details of the companies and individuals were not available. In view of this, all names and any other identifying information have been removed from the dataset and, where appropriate, replaced with pseudonyms (franzke et al 2020).

Analysis

The first part of the analysis focuses on how the TMs deal with a series of ill-fitting reactions in the first two stages of the calls: the opening and the middle of the exchange (Zimmerman 1992). We then turn to the TMs’ emotional reactions once they realise something is not quite right with the interaction.

Establishment and restoration of trust

Opening sequence

In this section we analyse the techniques used to establish trust. In the dataset, these are recurrent practices that occur in the opening sequence of the call and at the beginning of the business exchange. The first examples illustrate the opening of a typical telemarketing call. Excerpt 1 is from a call between a newspaper company and the R&P bot ‘Susan’. The TM is trying to get the ‘prospect’ to sign up to a year’s newspaper subscription.

Excerpt 1: Newspaper subscription

1  PRO: Hello?
2       (3.3)
3  TM:  <Hi good morning this ((bleep) is with the ((bleep)) News how
4       are you,
5       (1.6)
6  PRO: Y:es.
7       (1.8)
8  TM:  HEL:LO,
9       (1.1)
10 PRO: I’m sorry Could you say that again?
11      (1.1)
12 TM:  <SURe this is ((bleep)) you with the ((bleep)) News .hhh I
13      was just calling because you are former subscribers so we’re
14      just reaching out with our special promotion .hhh to [let]
15      you knOW
16 PRO: [Oh ]
17 TM:  that we would like to give you the Wednesday and next Sunday
18      paper for just a dollar fifty a -week? .hhh Erm this offer is
19      still good for a [full ye]ar just something nice we want you
20      to enjoy
21 PRO: [Sure ]
22 TM:  to get you sign back -up .hhh I can actually have [unclear
23      00:00:36] before eight am if that's early enough_
24      (0.8)
25 PRO: Okay?
26      (1.3)
27 TM:  Okay so let's see what your pricing would -be he::re ch ch
28      .hhh so that also does include the f- er full seven day
29      digital access as [well] .hhh so you would get your Sunday
30 PRO: [sure]
31      and Wednesday paper

The opening sequence follows the trajectory of a typical marketing call. The TM first provides self and organisational identification (Zimmerman 1992, Márquez Reiter 2011) followed by the first part of a how-are-you exchange, suggesting that the call is not business as usual (Tracy & Agne 2002) by way of the synthetic personalisation (Fairclough 1989) it indicates. The 1.6s gap and the prospect’s non-fitting response (Thompson et al 2015) (Y:es) in line 6 to the TM’s ‘how are you’ alert the TM to a potential connection problem, confirmed by the prospect’s request in line 10. The TM then introduces an extended reason for the call in the anchor position (Schegloff 1986) in lines 12-23. The prospect’s overlapping go-ahead token in line 21 (Sure) is trust-implicative, for it can be understood to signal compliance with the reason for the call and helps to underline her identity as the right ‘person’ to talk to (see also line 16). In view of this, in lines 22-23, the TM tries to ascertain the prospect’s preference for the newspaper delivery. The first sign of trouble comes with the prospect’s response in line 25, preceded by a gap of 0.8s in line 24. The sharp rising intonation in the token ‘Okay?’ is prosodically and lexically non-fitting. However, it is heard by the TM as indicating potential interest, evidenced by her okay-prefaced response (l.27) that functions as a bridge (Merritt 1978) between the different stages of the attempted sale – the initial explanation and the pricing of the offer. The characteristic opening sequence of sales calls is further demonstrated in the following excerpt from a car insurance company.

Excerpt 2: Auto warranty call

32 TM:  Hello,
33      (1.7)
34 PRO: Ye:p_
35 TM:  Hello:, [this ]is ((bleep)) from ((bleep) how are you doing
36 PRO: [Yes,]
37 TM:  today,
38 PRO: Ye:p_
39 TM:  £Oka:y£ (nervous laughter) er sir er the reason of this short
40      call is to inform you that ((bleep)) have recently dropped down
41      by a huge margin and I believe that you do have a good auto
42      insurance right?

The same opening sequence seen in Excerpt 1 is also evident here – summons-answer, exchange of greetings, identification of the caller and reason for the call (Schegloff 1986). Nevertheless, there are early signs of trouble: for example, the prospect responds to the TM’s greeting with an elongated ‘Ye:p’ (l.34) and does so again following the question ‘how are you doing today’ (l.35). The TM’s nervous laughter (Glenn & Holt 2013) (£Oka:y£ – l.39) and hesitation marker (er) could be a sign that he considers this an unusual response; however, he continues with the reason for the call.

These examples show how the opening sequence of the interactions mostly follows the normativities of a telemarketing call (Lockwood et al 2009, Márquez Reiter 2011, Jagodzinski & Archer 2018). Although some of the acknowledgement tokens interspersed throughout the calls mark compliance, agreement or confirmation with the TMs’ sales itinerary, others are prosodically and lexically non-fitting and often produced after significant gaps. These inconsistencies seem to go largely unnoticed or are ignored by the TMs in their intent to progress the call.

Middle of the business exchange

We now turn to the middle of the business exchange. This part of the call is where the main negotiation takes place and the TMs attempt to maintain progressivity to achieve their objective of a successful sale or scam. In Excerpt 3, the TM introduces the newspaper subscription price.

Excerpt 3: Newspaper subscription

58 PRO: [Mmm hmm]
59 TM:  seventy-eight dollars for just the entire ↑year.
60      (2.2)
61 PRO: Ok:ay?
62      (1.3)
63 TM:  Er is that something we can sign you up for today?
64      (0.7)
65 PRO: Uh-Huh,
66      (0.9)
67 TM:  .hhh Awesome and so we d::o also erm take all major credit
68      cards to process over the phone and we do erm send bills out
69      as well.
70      (0.6)
71 PRO: [Mmm hmm ]
72 TM:  [What would b::e] better for you,
73      (0.5)

The TM informs the prospect about the price of the offer in line 59 with an extreme case formulation (‘entire’) (Pomerantz 1986) and sharper intonation rise in ‘↑year’ with which she intensifies the alleged good value of the offer. The details of the offer are finalised here with falling intonation (‘.’) that signals a transition relevance place and completeness. At this point, an assessment of some sort would be expected from the prospect; however, the offer is reacted to with silence in l.60 and a repeated non-fitting token (ok:ay? – this time prolonged). It is this and the silence in l.62 that lead the TM to check the prospect’s interest with a yes/no interrogative at l.63. The lax token (Uh-huh) and subsequent silence in lines 65-66 are taken to be ‘good enough’ (Garfinkel 1967) by the TM to continue her pursuit. Similarly, the TM disattends the rather weak overlapping acknowledgement ‘Mmm hmm’ in line 71 with a specifying question (Fox & Thompson 2010) in an attempt to close the deal and secure payment. Therefore, the TM ignores repeated signs of interactional trouble (lines 62, 65, 66, 70, 71) in her efforts to hook the prospect and progress the call. In Excerpt 4, we return to the air duct cleaning call where the TM attempts to secure an appointment for a technician’s visit.

Excerpt 4: Air duct cleaning call

88  TM:  Yeah I got you down here at [(addre]ss,)
89  PRO: [Yeah_ ]
90       (0.5)
91  TM:  Yeah. Erm I got the- we’ll have a technician out in that area
92       tomorrow. I wanted to know erm what would you prefer the
93       morning or afternoon?
94  PRO: Oka:y_
95  TM:  Which do you prefer?
96  PRO: Right_
97       (3.7)
98  TM:  Mr [((name ))]?
99  PRO: [Hello: ]
100 TM:  Yes_
101 PRO: Yes_
102 TM:  Can you hear↑ me,
103 PRO: Er: ↑yes
104      (1.2)
105 TM:  Okay. So I have you here at ((address)),
106      (0.8)

In line 88, the TM uses the declarative form as a question to confirm the prospect’s address to which the prospect reacts with an expected affirmative reply (Yeah – l.89). The TM then poses an alternative wh-question (morning or afternoon) at lines 91-93. The prospect’s non-fitting response (Oka:y) leads the TM to repeat the question (l.95) as a way of testing the reliable dimension of trust and checking for intersubjectivity; the emphasis on the ‘do’ indicates that the response received was not entirely clear. The prospect then produces another non-fitting response (Right – l.96). The lengthy silence that follows alerts the TM to a potential problem and he checks that the prospect is still on the line (l.98) and again verifies that there are no connection problems (Can you hear↑me – l.102). Whilst this time the prospect’s response to this polar interrogative is type-conforming, there is another 1.2s gap before the TM repeats his initial address inquiry as he endeavours to restore progressivity.

In addition to the non-fitting reactions which suspend the call’s progressivity for they suggest that mutual understanding has not been fully achieved, prospects attempt to maintain trust by displaying stereotypical human behaviour and reflexively account for presumed inattention.

Accounting for lapsed intersubjectivity – traditional gender stereotypes as credible excuses

All the R&P recordings contain distractions that are intended to waste time and derail the TM’s project. One way in which the bots do so is by accounting for their lack of attention. Excerpt 5 from the newspaper subscription call directly follows on from Excerpt 3 when the TM asks, ‘what would b::e better for you’ (see Ex. 3, line 72).

Excerpt 5: Newspaper subscription

107 PRO: Yeah I'm so- I'm sorry could you repeat that (.) again I I
108      totally distracted what (.) what were you calling about?
109      (1.5)

The Excerpt shows the bot’s request for repetition couched as an apology. The apology is articulated in a polite and expected manner, with a self-repaired intensified apologetic formula followed by an explanation (e.g., Márquez Reiter 2000). In so doing, Susan accounts for her behaviour in line 108 (I I totally distracted). The last turn construction unit of that contribution, ‘what were you calling about?’ should have provided the TM with a clear sign that something was very wrong. However, she agrees to repeat the reason for the call (not shown). In Excerpt 6, the prospect accounts for her behaviour thus far.

Excerpt 6: Newspaper subscription

110 TM:  Er your last name I have (.) I have as ((bleep))is that correct?
111      (2.6)
112 PRO: Hold on honey (.) mommy,
113      (0.4)
114 PRO: Just just ye:ah keep talking I'm listening this is- it's all
115      very good I'm just .hhh I’m multitasking. Yes okay yeah yeah
116      mommy will [unclear 00:04:18] fine go ahead [unclear
117      00:04:21]

The scripted side dialogue between the prospect and her daughter draws on traditional gender stereotypes of women and mothers and their capacity for multitasking (Lui et al 2021). The prospect’s metapragmatic articulation of her behaviour and her role as a busy mum (lines 115-117) adds authenticity and helps to restore trust. This is done by producing an account in the form of an excuse (Scott & Lyman 1968) for the distraction. The excuse underlines the prospect’s human qualities; as far as the TM is concerned, she reacts as if she is interacting with a real person. Each time a side dialogue is introduced, the progressivity of the course of action is impeded. Nonetheless, perhaps convinced that she has succeeded in securing a new subscriber, the TM persists, constantly pursuing a (specific and fitting) response (Pomerantz 1986), as she makes four attempts to confirm the prospect’s last name, here in line 110 and on three more occasions (not shown). These contingent questions (Zimmerman 1992) are essential to progress the transaction, hence the TM’s insistence. The distracted mum stereotype (Odenweller & Rittenour 2017) is also found in Excerpt 7, taken from a call between a credit card scammer and the R&P recording ‘Emma’ where the TM is attempting to fraudulently elicit Emma’s credit card number. The call also includes a conversation between ‘Emma’ and her ‘teenage daughter’.

Excerpt 7: Credit card scam

TEEN: TEENAGE DAUGHTER

119 TEEN: [Mo:::::m. ]
120 TM:   [I hope you know that.]
121 PRO:  [Hm mm? ]
122       (0.2)
123 PRO:  Yah?
124 TEEN: Are you [almost done on the pho::ne?]
125 TM:   [That’s why according- ] (0.2) yeah.
126       (0.5)
127 TM:   Yah_
128 PRO:  =Hang [on a second. ]
129 TM:   [That’s why acc-]
130 PRO:  [Yah.]
131 TEEN: [How ] long can you keep this show [paused!]
132 PRO:  [G- ] go on go on,
133       (0.7)
134 TM:   That’s why according to yo::r payment history we going to
135       drop down your interest rate less than six [per cent]
136 PRO:  [Ye::ah. ]

This Excerpt is from early in the conversation following the TM’s extended reason for the call (not shown). This is derailed by an interruption from Emma’s teenage daughter demanding to know how long the call will go on for (l.124). However, in pursuit of a response, the TM is undeterred by the interruption and attempts to maintain progressivity (l. 125 and 129). In line 132, Emma’s ‘go on go on’ is a request for action. It implies that her attention has been diverted by the apparently legitimate interruption and invites the TM to continue; the recognisable human conduct accounts for the distraction, which appears to convince the TM to continue. Finally, in line 134, the TM is able to state his offer to incentivise the prospect and the continuer ‘Ye::ah’ (l.136) can be heard as acknowledgement of interest in the offer.

A final example of traditional gender stereotypes can be seen in Excerpt 8, taken from a call between a pyramid scheme scam and the bot ‘Bob’, who is presented as an older person. Earlier in the call, Bob implied that he had trouble moving (it’s ↑not easy for me to get off the chair every two seconds and run to the do:r..), and he also asks the TM to speak louder, thus evoking an ageist stereotype of someone with mobility and hearing issues (Hummert et al 2004). The Excerpt begins three minutes into the call. The TM has been trying to give Bob a website address and asking if he has an ink pen, a request he made five times throughout the call.

Excerpt 8: Pyramid scheme

137 TM:  .h D-Do you have an ↑ink pen_
138      (0.5)
139 PRO: Oh er HOLD ON a minit >okay there< hold on I’m watching the
140      (0.2) television here=there’s .h a HOCKey game o:n,
141      (0.5)
142 PRO: Ho-Hold on there’s a [penalty ] hold on.
143 TM:  [Oh:: okay.]
144 PRO: Let me- they’r gonna play it back to you [just hang] in
145 TM:  [Okay. ]
146 PRO: there for a minit_
147      (0.7)
148 TM:  Okay.
149      (1.5)
150 PRO: Hello::oh yah .h er do you know anything about HOC↑↑KEY?
151 TM:  =Hello::oh,
152 PRO: Let me hol- let me clo- let me lower the TV for a minit_ cos
153      I don’t hear you okay?
154      (1.0)
155 TM:  ↑O↓kay.
156      (2.0)
157 PRO: Heh (0.5) he’s just got a penalty.
158      (0.5)
159 PRO: My man’s in the box.
160      (0.5)
161 PRO: .h So er what did you call me about.
162      (1.0)

Bob’s failure to reply with a type-conforming response to the TM’s yes/no interrogative (l.137) is accounted for as Bob gets distracted by a hockey game on the television (l.139-140), endorsing another male-watching-sport stereotype (e.g., Burstyn 1999). The acknowledgement tokens in lines 143, 145, 148 and 155 indicate compliance with Bob’s imperative requests to wait (Thompson et al 2015). As shown in Excerpt 5, the last turn construction unit in line 161 (.h So er what did you call me about) should have been heard as a sign of interactional trouble. However, Bob’s account of the hockey game, together with the accounts earlier in the call, combine to bring to the fore the prospect’s human identity (older man, distracted by sport on the television). The placement of these side dialogues, whilst not deliberate, often occurs when the TMs are trying to secure some agreement or information; that is, towards the end of the middle of the exchange.

So far we have shown how the TMs’ implicit expectation of trust is maintained through the interactional principles of intersubjectivity and progressivity. This was done through the arbitrary placement of particles that typically function as acknowledgement tokens and continuers. These were found to occur in approximately appropriate interactional spaces but to display some sort of ill-fitting format, such as their prosody or lexis, or by a delay in response. These allow for the creation of trust and progressivity which is then temporarily suspended by integrating derailments. Derailments were principally effected by integrating side dialogues into the recorded script. These consisted of traditional biased gender stereotypes through which the prospects made themselves metapragmatically accountable for their distraction with a view to keeping the TM on the line. The enactment of human authenticity helped to rebuild lost trust and the TM is then invited to repeat a previous sequence for progressivity to be regained. In the next section we examine TMs’ reactions upon realising their interaction was not with a human.

TMs’ reactions on realising they have not been interacting with a human

Across the calls, three meaningful reactions were identified: using the call as a training opportunity, perceived professional face threats, and verbal abuse.

Training opportunity

The job of telemarketers entails making hundreds of unsolicited telephone calls every day (Woodcock 2017) to uninterested prospects. This explains why they may become unfazed by rejection. Alongside this, the neoliberal conditions of their work (Heller & Duchêne 2012) and the need to secure a sale may explain why the TMs fail to unpack non-fitting responses. Moreover, the productivity of TMs is controlled by management surveillance, and their performance, which includes the number of calls they make and their interaction with customers, is policed (Brophy 2017). Excerpt 9 below is taken from a 14-minute call from an illegitimate holiday company and illustrates the supervisor’s intervention.

Excerpt 9: Holiday credit scam

SUP: SUPERVISOR

163 SUP: ºWe’re on our last call,º
164 PRO: Mmm:
165 TM: =ºI know,º
166 (1.5)
167 SUP: ºWhat time did you get on that [call,º]
168 PRO: [Hello?]
169 TM: ºI don’t know,º Yes [why don't we try callin]g you back next

The supervisor joins the call in the ninth minute and encourages the TM to end the conversation through a complaint-implicative statement regarding the TM’s performance thus far (l.163). The statement is constructed in the plural (We/our) and produced at a slower pace. This minimises its potential threat to the TM’s professional face by constructing the effort as collective. The TM reacts immediately (see latch in line 165), displaying agreement and a sense of frustration (i.e., slowed pace and continuing intonation). Following the silence that ensues, the supervisor questions the length of time she has been on the phone (line 167) and the TM politely tries to end the conversation. In the dataset, interactions are often lengthy as the TMs persist in trying to reach the objective of their call. At some point, non-fitting responses and lack of progressivity lead to the realisation that something is not quite right. Excerpt 10, from the newspaper subscription call, shows another supervisor intervention towards the end of the seven-minute call, when the realisation that the prospect is not human occurs.

Excerpt 10: Newspaper subscription

170 SUP: Okay I know that [name] went over everything with you but I
171 just want to make sure I’ve got an email address for your
172 digital access.=Can you give me your email please.
173 (0.8)
174 SUP: .hhh (.) Now I- is this a right person I’m speaking with?
175 (1.3)
176 PRO: Sure [unclear 00:05:40]
177 SUP: [I don’t think so]

In line 172, the supervisor asks the prospect a specifying question with a request for information, namely whether she can supply her email address. After a gap where a response would have been expected, the supervisor checks whether he is talking with a real person (l.174). Another 1.3s gap follows and the prospect’s non-fitting acknowledgement token ‘Sure’ in line 176 is ignored as the supervisor responds to his own question with the assertion ‘I don’t think so’ (l.177), asserting his epistemic authority versus the TM on recorded answering services. In Excerpt 11, we join the interaction a few turns later.

Excerpt 11: Newspaper subscription

178 SUP: But then like hold on (.) she was like hold on [unclear
179 00:06:21]=It’s been reported that people have (.) f- for sale
180 calls.
181 (0.3)
182 TM: Rea::lly.
183 SUP: For unknown numbers yes_
184 (0.2)
185 PRO: [Mm hmm ]
186 TM: [All right] ((laughing)) Ok(h)ay
187 SUP: [There’s some weird stuff out there.]

At the end of the call, while the prospect continues to produce arbitrary acknowledgement tokens, the supervisor uses the incident as a training opportunity for the TM (lines 178-180, 183). The elongated vowel and slight rising intonation of the response particle ‘Rea::lly’ (l.182) express an element of disbelief. The TM’s response is also a news mark (Heritage 1984). These tokens are understood to make a response relevant and here invite the supervisor to elaborate with further explanation (l.183) (Thompson et al. 2015). The supervisor’s response at line 187 signals understanding for how the TM assumed the prospect was human and in this way reduces the threat to the TM’s professional face.

Perceived professional face threats

The supervisor’s intervention, together with the realisation that they have been interacting with a non-human party, constitutes a threat to TMs’ professional face – “the ‘professional persona’ on loan to the agent” (Márquez Reiter 2011: 3863, Márquez Reiter 2009).

Excerpt 12: Newspaper subscription

188 SUP: [I don’t think so]
189 TM: She’s answered all my questions.
190 SUP: I know but it’s just literally answers.
191 TM: Oh I’ve been talking to her,

Whilst the TM’s response in line 189 ‘She’s answered all my questions’ is a tacit disagreement with the supervisor’s assertion in line 188, it is also an attempt to save face (Goffman 1967) and demonstrate that she has done her job properly. The supervisor’s reply in line 190, in an agree+disagree format (Pomerantz 1984), acknowledges the TM’s response as a remedial exchange (Goffman 1971), but also restates his judgement of the trouble following the conjunction ‘but’. The TM prefaces her next turn with the particle ‘Oh’ (l.191). Heritage (2002:196) attests that oh-prefaced turns can be used to ‘convey what might be termed ‘ownership’ of knowledge’. In this instance, the TM attempts to justify her performance thus far, based on her experiential knowledge of her interaction with the prospect. In other words, the response ‘Oh I’ve been talking to her’ serves to implicitly question the supervisor’s observation of the prospect as non-human and justify the amount of time spent on the call. Excerpt 13 is a continuation of the call a few turns later.

Excerpt 13: Newspaper subscription

192 SUP: Yeah she just keeps saying Uh-Huh.
193 TM: Really?
194 SUP: That’s not live person it’s a recording.=That’s a good one
195 though_
196 (0.6)
197 PRO: Okay.
198 TM: O::h £my god I'm so sorry£.
199 (0.7)
200 TM: ((laughs))
201 TM: £I did not£ (h) I-=
202 SUP: =It’s a good [one [unclear 00:06:07 ]
203 TM: [Because she was saying] more like she was talking
204 to her son.=I [just ]

In line 193, the TM reacts to the supervisor’s assertion with a change-of-state token of ritualised disbelief (Heritage 1984:339), ‘Really?’, with rising intonation. The supervisor then confirms that the TM has been speaking with a recording (‘That’s not live person it’s a recording’, l.194); however, he produces a positive assessment (Pomerantz 1984), ‘That’s a good one though’, as a way of explaining how the TM may have been misled and mitigating any potential professional face threat. Consequently, the TM finally accepts the prospect is not human and responds with a surprise token and apology in line 198, ‘O::h my god I’m so sorry’, with prosodic elongation of the ‘O::h’ particle and suppressed laughter displaying a state of embarrassment and attempting to save her professional face.

Previously in Excerpt 9, we showed the supervisor intervention from the holiday company. The company is trying to persuade the prospect ‘John’ that he has unused travel credits that will expire unless action is taken. There have been a number of derailments earlier in the call. The call continues with further random distractions until a few turns later the TM seems to display a metapragmatic awareness that something is not right with the call and asks for John to hold the line. Following a pause, Excerpt 14 begins with the supervisor taking the phone.

Excerpt 14: Holiday credit scam

205 SUP: Yeah hello:,
206 (5.6)
207 PRO: Mmm hmm
208 (3.7)
209 PRO: Hell:oh↑
210 (2.8)
211 SUP: Are you there sir?
212 PRO: Hey, <hey hey> I keep losing you can c-come again?
213 SUP: ºYes, he’s talking (inaudible) it’s a voice activated [machi]ne
214 so when you talk and it responds with something
215 PRO: [okay,]
216 SUP: weird_º
217 TM: =ºNo answering mach[ine c]ould respond like he’s been doing_º
218 PRO: [okay_]
219 TM: ºNo I’m really serious_º
220 SUP: =ºYeah, I’m letting you know itº is, yeah_
221 PRO: [I don’t want ANY- >hold on hold on< I ]don't want any
222 TM: [ºBut they were talking about toilets,º]
223 SUP: [Yeah but (inaudible) voice activated. They’ll never give you
224 a direct answer, [just some off-the-wall shit.]
225 PRO2: [there you go it’s pretty good] right,
226 SUP: ºif you say “are you still in New York?” [it’s voice-
227 PRO: [yeah that’s pretty
228 darn good,]
229 SUP: activated ]so every time you talk it's gonna start saying
230 something else randomly_º
231 TM: =He’s talking to someone else, he’s not talking to me_
232 PRO: Can you-I’m sorry can you con-con back up a little bit and
233 tell me what [you were_]
234 SUP: [ºI know_ ]he’s acting like and then he’ll come
235 back and say “oh wait I’m here sorry what were you [saying?”]
236 then he’ll start talking again and he’ll act
237 PRO: [O::h? ]
238 SUP: like he’s doing something else again [or (.) t]alking (.)
239 PRO: [Hello::?]
240 SUP: no it’s an answering machine_º
241 TM: Okay_ (.) listen we’ll talk again next week, okay?
242 PRO: Hello:: is anybody- is anybody here on the [phone with me?]
243 TM: [I’m here but ]
244 we’ll talk next week okay, (.) have a great evening_

The supervisor’s intervention confirms his suspicions that John is in fact a bot. As in Excerpt 12, in lines 217 and 219 the TM manifests resistance to the supervisor and counters his assessment based on her own knowledge of the prospect during the call. However, in contrast to Excerpt 12, where the supervisor addresses any potential face threat with a remedial exchange, here the supervisor dismisses the TM’s claims (l.220). The TM responds with a but-prefaced response (l.222) (Jackson & Jones 2013). But-prefaced components are often employed as a defence to make a point (Schiffrin 1987) or to convey the accuracy of an assertion that has been questioned by the previous speaker (Bolden 2010). With her response, the TM negates the supervisor’s comments and maintains her claim that she has been talking to a real person. The TM continues to reject his assessment insisting ‘He’s talking to someone else, he’s not talking to me’ (l.231), perhaps as a way of justifying the amount of time invested in the call. To further underline her defiance, and perhaps to save her professional face, she returns to the call to say goodbye and confirms she will call John the following week.

Verbal abuse as an emotional response

While the TMs in the above calls were concerned about saving their professional face, in other calls we found two contrasting emotional reactions when they realised they had been interacting with a non-human prospect: expressions of abuse towards the ‘prospect’ through swearing, and expressions of anger through vulgar lexemes, which could be considered a form of ranting (Thorson & Baker 2019). In the following examples, the TMs make their own assessment of the prospects’ odd behaviour:

Excerpt 15: Credit card scam 2

245 TM: .hhh how much credit card debt would you say you are carrying
246 today_ is it ten thousand [dollars] or more,
247 PRO: [Yes ]
248 PRO: Yes
249 TM: Okay so about how much,=
250 PRO: =Yes
251 (1.5)
252 TM: This bitch must be retarded as any mother fucker

Excerpt 16: Air duct cleaning scam

253 PRO: er >you know I was having trouble concentrating< ‘cause| you
254 sound exactly like somebody I went to high school with e:rm
255 so sorry can you say that part again,
256 TM: Yeah so_ are you fucking with it or not,
257 PRO: Okay_

In Excerpt 15, the TM begins to ask a wh- question and switches to a polar interrogative to which the prospect responds with a type-conforming affirmative (l.247). However, when the TM repeats the specifying question in line 249, the subsequent affirmative response is non-fitting. Following a pause, the TM accounts for the prospect’s behaviour by questioning their mental capacity, referring to them in the third person with the declarative at line 252, as if they were not present in the interaction (see Rehm 2020) or perhaps addressing a co-worker in the call centre. Excerpt 16 shows a similar response. In this recording, the ‘high school’ narrative has already been introduced earlier in the call. When the same distraction occurs again, the TM queries whether the prospect is ‘with it’ (l.256). We cannot know whether at this point in the calls the TMs think they are talking with a recording or whether they actually believe the prospects have impaired cognitive capabilities; nonetheless the TMs appear to be questioning the prospects’ competence. The TMs may realise that there is no value in the time they have invested in the call and their confidence in making a sale is low, which leads to the abusive language (cf. De Angeli & Brahnam 2008).

More sustained verbal abuse is found in Excerpt 17, taken from the first credit card scam call, which we present in detail. The whole interaction lasts for nearly 13 minutes and during this time the TM calls back three times. The Excerpt begins approximately 3:30 minutes into the call. Up until this point, there have been three side dialogues between Emma and her daughter (see Excerpt 7). This is the fifth time the TM has requested information about Emma’s credit card and he is beginning to show signs of frustration.

Excerpt 17: Credit card scam

258 TM: Which credit card you use most your Visa card or Master mam.
259 (0.2)
260 PRO: Hm mm?
261 (1.0)
262 TM: What do you mean hm mm hm mm_
263 (0.2)
264 PRO: °Yeah_°
265 (1.7)
266 TM: Hello_
267 (Lines omitted)
268 TM: Uh-let me tell you one thing.
269 (.)
270 TM: Let me clear you one thing.
271 If you are:: playing with me if you are just trying to waste
272 [my:: time or your] time .h I’m going to fight
273 PRO: [Mmm right. ]
274 TM: your whole fa:mily okay?
275 (.)
276 TM: You -don’t have any ide::a.
277 (0.2)
278 TM: If you just trying to kidding me::
279 PRO: [Hm mm? ]
280 TM: [so like] uh let me tell
281 you one thing right now you are getting -wrong .one person
282 okay?
283 (0.2)
284 PRO: Hm mm?
285 TM: =And I will -proo::ve it.

Following a non-fitting response (Hm mm? – l.260) to the specifying question, we can see a breakdown in trust as the TM proceeds to mimic Emma in line 262 and, in this way, also questions the capability and sincerity dimensions of trust. The TM uses message enforcers, constituting preliminaries to the ensuing threat, in lines 268, 270 and 271 (e.g., Uh-let me tell you one thing) and makes an explicit conditional threat of violence towards Emma’s family in lines 272 and 274. For reasons of space, we have not included all the turns in this section of the interaction. However, we can see how it follows what Culpeper (2011: 224) classifies as typical rhetorical patterning of impoliteness, for example, reformulation (e.g., if you are just trying to waste my:: time / if you just trying to kidding me::) and repetition (e.g., And I will proo::ve it, repeated again twice in the call [not shown]). The swearing and abuse are clearly oriented interpersonally with the intention to offend and undermine Emma’s face (Dynel 2012) and her failure to counter the abuse and reciprocate in a similar manner seems to further frustrate the TM, as shown in Excerpt 18.

Excerpt 18: Credit card scam

286 TM: Like (0.2) >so< er: are you just doing this as well a- I’m
287 sarry.
288 (0.2)
289 TM: I [-didn’t (hear)] you can (0.2) can you start over?
290 PRO: [Oh yeah. ]
291 TM: Put- fuck my foot motherfucker,
292 PRO: =Mmm: right.
293 (0.5)
294 TM: Have- -ri::ght .right what d’you mean mm.
295 (0.2)
296 TM: Hm -mm uh -huh ah hah .mm [ah hah? ]
297 PRO: [Oh ye::s?]
298 (0.5)
299 TM: Do you have any idea how much calls we receive in a day.
300 (1.2)
301 TM: Do you have [any idea ] how many (0.2) many fucking
302 PRO: [Yeah oka::y?]
303 TM: foo::l (0.2) we-er we face in a day like -you you bitch.
304 PRO: =Yes.

This Excerpt follows another derailment where Emma explains her difficulties trying to record a TV show. The TM once again starts to mimic Emma (lines 289, 294 and 296). Emma’s affirmative acknowledgement tokens appear to further incite the TM and his aggression begins to escalate. He produces personalised negative vocatives in lines 291 and 303 and a third person negative reference (Culpeper 2011) in lines 301/303 with the strong intensifier (fucking foo::l). The following few minutes of the interaction consist of the TM threatening to harass Emma with multiple phone calls. The call has now been going on for a long time; progressivity has never really been achieved, but the TM persists. Operators in call centres are not allowed to terminate calls and must attempt to close a sale unless the prospect ends the call (Woodcock 2017). This may explain the TM’s persistence, as approximately 8:30 minutes into the call he tries to get Emma to hang up (you can hang up the call=I will call you back right no:w). As this obviously does not happen, the TM, whose first language is not English, continues to mimic Emma and to be abusive and threatening. This type of intrapersonally oriented ranting and swearing (Stapleton et al. 2022; Wajnryb 2005) has been observed in human-machine interaction (Brahnam & De Angeli 2008), especially when technology is endowed with specific human characteristics (language, a human voice and the assumption of human roles), and despite the human being aware from the start that they were interacting with a machine (Reeves & Nass 1996).

Their reactions are akin to human reactions when technology is not working, though admittedly at a different scale of abusive language. As pointed out by Ullman and Malle (2018), trust is multidimensional; it involves relational (assuming the bot was human, e.g., sincere) and capacity (the bot’s elementary abilities, e.g., task accomplishment) dimensions, and it appears to be allowable for humans to abuse tools in ways that would be unacceptable if they were human (Parasuraman & Riley 1997).

Conclusion

In this article we provided a first analysis of human communicative behaviour in outbound marketing calls intercepted by bots. The calls in question are unsolicited and the products or services being sold are not necessarily wanted or indeed real. The bots perform a safeguarding function: detection and protection from all too pervasive nuisance calls. They are aimed at protecting humans from time-wasters or potential criminal activity at the hands of other humans by taking revenge on them.

Whilst the telephone company may have uploaded the most successful examples of calls to promote and sell their service, our analysis of a sample of calls between human telemarketers and bots has identified the main interactional principles and resources that bots ‘used’ to project personhood to give TMs the illusion that they were interacting with another human. We have contended that the communicative environment itself – telephone conversations where the audible rather than visual takes pre-eminence and distractions or deviations from established conversational rules are common and tolerated – and the working conditions of TMs offer a convenient setting in which doing being a human can successfully develop.

The formulaic nature of the TMs’ scripts coupled with the conditions of an industry which requires telephone operators to make hundreds of calls per day, be patient, listen to prospects and avoid putting the telephone down so as not to lose potential customers, enables the bots to be relatively effective despite the general ill-fittedness of their contributions.

We have noted delays in the bots’ utterances, lexical and prosodic arbitrariness of tokens, which are nevertheless heard as acknowledgement or continuers by the TM. These were, however, produced at approximately appropriate times, well into the opening and middle stages of the calls, allowing the bots to project enough ‘intersubjectivity’ for the TM to progress the call.

The bots’ intermittent commitment in the calls, as illustrated by ill-formatted and ill-placed tokens, was nonetheless accounted for. “Doing-being-human” and engaging in typical human activities (e.g., laundering clothes, watching sports) were brought forward as excuses for lapsed intersubjectivity and purported lapsed attention was used to prevent the TMs from advancing their agenda and the bots to maintain (relational) trust. The calls comprise a combination of interactional contributions in the way of utterances and the annexing of side dialogues through which bots reflected on their behaviour thus far in the calls. The calls never progress, despite engagement with alleged conversation.

The illusion of personhood was further conveyed by authentic human voices, which unambiguously distinguished between genders and age. These were accompanied by traditional stereotypes which were relationally oriented, both in terms of their content (e.g., the distracted mum and their capacity for multitasking) and the way in which they were constructed (e.g., inviting TM engagement). The integration of bots’ accounts invited the TMs to repeat a previous sequence for progressivity to be regained and the whole charade started again.

The TMs all display some form of metapragmatic awareness that there is something strange about the interactions. This awareness is expressed in a number of ways: through embarrassment, anger or frustration. Given the preference for progressivity and the pressure to keep telephone calls short in call centres (Woodcock 2017), these prolonged interactions do not conform with prescribed call centre interactions. It could be argued that while the TMs may consider some of the continuous disruptions in the calls rather odd, the derailments successfully account for this behaviour.

We have seen how intersubjectivity and progressivity are woven together in pursuit of the establishment and restoration of trust and how the working conditions of telemarketing agents responsible for making unsolicited calls to sell a given product (legitimate or not) provide fertile ground to ignore potential troubles in the calls or try to resolve them. Despite the TMs’ intent, ultimately communication failed because no intersubjective attunement or togetherness could be achieved between the TM and a non-responsive recording.

Since collecting the data, R&P has incorporated text-based ChatGPT-4 and other speech recognition and voice cloning technology into their system. The new service is an embryonic example of a proper bot. There are only two example calls available on their website; however, the original recordings were so well honed to the formulaic nature of the telemarketing script that they did a better job of doing being a human than this first version of a bot. This highlights a current contextual limitation of AI, as this new bot is unable to recognise the context of a telemarketing call or relate to its situated interactivity. AI would need to evolve before it could be used effectively in this type of call.

Services offered by companies such as R&P can be useful to protect vulnerable members of the public (e.g., the elderly) from nuisance and scam calls. Notwithstanding this, there are clear ethical and moral implications in using AI which does not identify as such to target humans. Humans’ implicit assumption of trust by the very fact that the bots engage in interaction, their authentic human voice, legitimate backstories, reactions to the TMs’ every utterance, coupled with the TMs’ constraint of being unable to end the call, sheds new light on further ethical dilemmas that emerge in a communicative arena characterised by (socioeconomic) inequalities. This is especially poignant in the case of telemarketing agents who often work under slave-like conditions with clear implications for their livelihoods.

Footnotes

Declaration of conflicting interests

The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author received no financial support for the research, authorship, and/or publication of this article.

Author biography

Dr Rosina Márquez Reiter is Full Professor of Pragmatics and Interaction in the School of Languages and Applied Linguistics at The Open University, UK.

Dr Mandie Iveson is an Independent Researcher based in the UK.

References

Bellucci

Park

(2020) Honesty biases trustworthiness impressions. Journal of Experimental Psychology: General 149(8): 1567.

Bolden

(2010) ‘Articulating the unsaid via and-prefaced formulations of others’ talk. Discourse Studies 12(1): 5–32.

Boyd

Crawford

(2012) Critical questions for big data: Provocations for a cultural, technological, and scholarly phenomenon. Information, Communication & Society 15(5): 662–679.

Brahnam

De Angeli

(2008) Editorial. Special issue on the abuse and misuse of social agents. Interacting with Computers 20(3): 287–291.

Brophy

(2017) Language Put to Work: The Making of the Global Call Centre Workforce. London, UK: Palgrave Macmillan.

Burstyn

(1999) The Rites of Men: Manhood, Politics and the Culture of Sport. Toronto: University of Toronto Press.

Button

Cross

(2017) Cyber Frauds, Scams and Their Victims. London, UK: Routledge.

Chang

W-LM

Haugh

(2011) Strategic embarrassment and face threating in business interactions. Journal of Pragmatics 43(12): 2948–2963.

Chia

(2020) Seeking justice on the web: How news media and social norms drive the practice of cyber vigilantism. Social Science Computer Review 38: 655–672.

10.

Culpeper

(2011) Impoliteness: Using Language to Cause Offence. Cambridge: Cambridge University Press.

11.

De Angeli

Brahnam

(2008) I hate you! Disinhibition with virtual partners. Interacting With Computers 20(3): 302–310.

12.

Duranti

La Mattina

(2022) The semiotics of cooperation. Annual Review of Anthropology 51: 85–101.

13.

Dynel

(2012) Swearing methodologically: The (im)politeness of expletives in anonymous commentaries on YouTube. Journal of English Studies 10(1): 25–50.

14.

Dynel

Ross

(2021) You don’t fool me: On scams, scambaiting, deception, and epistemological ambiguity at R/scambait on Reddit. Social Media+Society 7(3): 20563051211035698.

15.

Fairclough

(1989) Language and Power. London, UK: Longman.

16.

Fox

Thompson

(2010) Responses to wh-questions in English conversation. Research on Language and Social Interaction 43: 133–156.

17.

Franzke

Bechmann

Zimmer

, et al. (2020) Internet Research: Ethical Guidelines 3.0. https://aoir.org/reports/ethics3.pdf

18.

Garfinkel

(1967) Studies in Ethnomethodology. Englewood Cliffs, NJ: Prentice Hall.

19.

Glenn

Holt

(2013) Studies of Laughter in Interaction. London, UK: Bloomsbury.

20.

Goffman

(1967) Interaction Ritual: Essays on Face-to-Face Behaviour. New York, NY: Pantheon Books.

21.

Goffman

(1971) Relations in Public: Microstudies of the Public Order. New York, NY: Basic Books.

22.

Grice

(1975) Logic and conversation. In Cole

Morgan

(eds.) Syntax and Semantics, Vol. 3: Speech Acts. New York, NY: Academic Press, pp. 41–58.

23.

Hancock

Kessler

Kaplan

, et al. (2023) How and why humans trust: A meta-analysis and elaborated model. Frontiers in Psychology 14: 1081086.

24.

Haugh

Culpeper

(2018) Integrative pragmatics and (im)politeness theory. In: Illie

Norrick

(eds.) Pragmatics and Its Interfaces. Amsterdam, The Netherlands: John Benjamins Publishing Company, pp. 213–239.

25.

Heller

Duchêne

(2012) Pride and profit: Changing discourses of language, capital and nation-state. In Duchêne

Heller

(eds.) Language in Late Capitalism: Pride and Profit. New York, NY: Routledge, pp. 1–21.

26.

Heritage

(1984) A change-of-state token and aspects of its sequential placement. In: Maxwell Atkinson

Heritage

(eds.) Structures of Social Action. Cambridge: CUP, pp. 299–345.

27.

Heritage

(2002) Oh-prefaced responses to assessments. In: Ford

Fox

Thompson

(eds.) The Language of Turn and Sequence. New York, NY: Oxford University Press, pp. 196–224.

28.

Heritage

(2007) Intersubjectivity and progressivity in person (and place) reference. In: Stivers

Enfield

(eds.) Person Reference in Interaction: Linguistic, Cultural and Social Perspectives. Cambridge: CUP, pp. 255–280.

29.

Heritage

Atkinson

(1984) Introduction. In: Maxwell Atkinson

Heritage

(eds.) Structures of Social Action. Cambridge: CUP, pp. 1–16.

30.

Hummert

Garstka

Ryan

, et al. (2004) The Role of Age Stereotypes in Interpersonal Communication. Mahwah, NJ: Lawrence Erlbaum Associates.

31.

Jagodzinski

Archer

(2018) Co-creating customer experience through call centre interaction: Interactional achievement and professional face. Journal of Politeness Research 14(2): 257–277.

32.

Javed

Toumi

Alharbi

, et al. (2021) Detecting nuisance calls over internet telephony using caller reputation. Electronics 10(3): 353.

33.

Jefferson

(2004) Glossary of transcript symbols with an introduction. Conversation Analysis 13–31.

34.

Jackson

Jones

(2013) Well they had a couple of bats to be truthful: Well-prefaced, self-initiated repairs in managing relevant accuracy in interaction. Journal of Pragmatics 47(1): 28–40.

35.

Kigerl

(2020) Spam-based scams. In: Holt

Bossler

(eds.) The Palgrave Handbook of International Cybercrime and Cyberdeviance. Cham: Palgrave Macmillan, pp. 877–897.

36.

Lockwood

Forey

Elias

(2009) Call centre communication: Measurement processes in non-English speaking contexts. In: Belcher

(ed.) English for Specific Purposes in Theory and Practice. Ann Arbor: University of Michigan Press, pp. 143–165.

37.

Loveluck

(2020) The many shades of digital vigilantism: A typology of online self-justice. Global Crime 21(3–4): 213–241.

38.

Lui

Yip

Wong

(2021) Gender differences in multitasking and performance. Quarterly Journal of Experimental Pyschology 74(2): 344–362.

39.

Márquez

Reiter R

(2000) Linguistic Politeness in Britain and Uruguay. Amsterdam/Philadelphia: John Benjamins.

40.

Márquez

Reiter R

(2009) ‘How to get rid of a telemarketing agent: Face-work strategies in a Spanish intercultural service call’. In: Bargiela-Chiappini F and Haugh M (eds). Face, Communication and Social Interaction. London: Equinox, pp. 55–77.

41.

Márquez

Reiter R

(2011) Mediated Business Interactions: Intercultural Communication between Speakers Spanish. Edinburgh: Edinburgh University Press.

42.

Márquez Reiter

(2019) Navigating commercial constraints in a Spanish service call. In: Garcés Conejos Biltvich

Hernández López

Amaya

(eds). Mediated Service Encounters. Amsterdam: John Benjamins, pp. 121–144.

43.

McAllister

(1995) Affect-and cognition-based trust as foundations for interpersonal cooperation in organizations. Academy of Management Journal 38(1): 24–59.

44.

McCabe

Rigdon

Smith

(2003) Positive reciprocity and intentions in trust games. Journal of Economic Behavior & Organization 52(2): 267-275.

45.

Merritt

(1978) On the use of ‘OK’ in service encounters. In Fasold

(ed.) Variation in the Form and Use of Language. Washington: Georgetown University Press, pp. 294–304.

46.

Mey

(2010) Reference and the pragmeme. Journal of Pragmatics 42: 2882–2888.

47.

Odenweller

Rittenour

(2017) Stereotypes of stay-at-home and working mothers. Southern Communication Journal 82(2): 57–72.

48.

Ofcom (2022) Improving the accuracy of Calling Line Identification (CLI) data. Available at: Statement: Improving the accuracy of Calling Line Identification (CLI) data - Ofcom.

49.

Parasuraman

Riley

(1997) Humans and automation: Use, misuse, disuse and abuse. Human Factors 39(2): 230–253.

50.

Povich

(2022) State Attorneys General Unite Against Robocalls. Pew Charitable Trusts. Available at: State Attorneys General Unite Against Robocalls | The Pew Charitable Trusts (pewtrusts.org).

51.

Pomerantz

(1984) Agreeing and disagreeing with assessments: Some features of preferred/dispreferred turn shaped. In Atkinson

Heritage

(eds.) Structures of social action. Cambridge: CUP, pp. 79–112.

52.

Pomerantz

(1986) Extreme case formulations: A way of legitimizing claims. Human Studies 9: 219–229.

53.

Reeves

Nass

(1996) The Media Equation: How People Treat Computers, Television, and New Media like Real People and Places. Stanford, CA: CSLI Publications and Cambridge University Press.

54.

Relieu

Sahin

Francillon

(2019) Lenny the bot as a resource for sequential analysis: Exploring the treatment of Next Turn Repair Initiation in the beginnings of unsolicited calls. In: Mensch und Computer 2019 – Workshopband, Bonn: Gesellschaft für Informatik e.V., https://doi.org/10.18420/muc2019-ws-645

55.

Rommetveit

(1998) Intersubjective attunement and linguistically mediated meaning in discourse. In: Bråten

(ed.) Intersubjective Communication and Emotion in Early Ontogeny. Cambridge: Cambridge University Press, pp. 354–371.

56. Rehm (2020) “She is so stupid” analysing user-agent interactions in emotional game situations. Interacting with Computers 20(3): 311–325.

57. Rowe (2009) The ethics of deception in cyberspace. In: Luppicini (ed.) Handbook of research on technoethics. Hershey, PA: IGI Global, pp. 529–541.

58. Saunders, Frascella (2022) Scam robocalls: Telecom providers profit report. National Consumer Law Center. Available at: Scam Robocalls (nclc.org).

59. Schegloff (1986) The routine as achievement. Human Studies 9: 111–151.

60. Schegloff (1992) Repair after next turn: The last structurally provided defense of intersubjectivity in conversation. American Journal of Sociology 97(5): 1295–1345.

61. Schegloff (2007) A Primer in Conversation Analysis: Sequence Organisation in Interaction. Cambridge: CUP.

62. Schiffrin (1987) Discourse Markers. Cambridge: CUP.

63. Scott, Lyman (1968) Accounts. American Sociological Review 33(1): 46–62.

64. Searle (1975) Indirect speech acts. In: Cole, Morgan (eds.) Syntax and Semantics, Vol. 3. Speech Acts. New York: Academic Press, pp. 59–82.

65. Sheard (2015) Truth and trustworthiness. In: Achourioti, Galinon, Fernández, et al. (eds.) Unifying the Philosophy of Truth. New York: Springer, pp. 107–115.

66. Smallridge, Wagner, Crowl (2016) Understanding cyber-vigilantism: A conceptual framework. Journal of Theoretical & Philosophical Criminology 8(1): 57–70.

67. Stapleton, Fägersten, Stephens, et al. (2022) The power of swearing: What we know and what we don’t. Lingua 277: 103406.

68. Stivers (2008) Stance, alignment, and affiliation during storytelling: When nodding is a token of affiliation. Research on Language and Social Interaction 41: 31–57.

69. Stivers, Robinson (2006) A preference for progressivity in interaction. Language in Society 35(3): 367–392.

70. Thompson, Fox, Couper-Kuhlen (2015) Grammar in Everyday Talk: Building Responsive Actions. Cambridge: CUP.

71. Thorson, Baker (2019) Venting as epistemic work. Social Epistemology 33(2): 101–110.

72. Tovar (2020) Call center agents’ skills: Invisible, illegible, and misunderstood. Sociolinguistic Studies 14(4): 437–458.

73. Tracy, Agne (2002) ‘I just need to ask somebody some questions’: Sensitivities in domestic dispute calls. In: Cotterill (ed.) Language in the Legal Process. Basingstoke: Palgrave Macmillan, pp. 75–89.

74. Ullman, Malle (2018) What does it mean to trust a robot? Steps toward a multidimensional measure of trust. In: Companion of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, pp. 263–264.

75. Wajnryb (2005) Expletive Deleted: A Good Look at Bad Language. New York: Simon and Schuster.

76. Weber, Carter (2003) The Social Construction of Trust. New York: Kluwer Academic.

77. Wood, Kepkowski, Zinatullin, et al. (2023) An analysis of scam baiting calls: Identifying and extracting scam stages and scripts. arXiv preprint arXiv:2307.01965.

78. Woodcock (2017) Working the Phones: Control and Resistance in Call Centres. Chicago, IL: Pluto Press.

79. Zimmerman (1992) The interactional organization of calls for emergency assistance. In: Drew, Heritage (eds.) Talk at work: Interaction in Institutional Settings. Cambridge: CUP, pp. 359–469.
