Abstract
There is a driving need computationally to interrogate large bodies of text for a range of non-denotative meaning (e.g., to plot chains of reasoning, detect sentiment, diagnose genre, and so forth). But such meaning has always proven computationally allusive. It is often implicit, ‘hidden’ meaning, evoked by linguistic cues, stylistic arrangement, or conceptual structure – features that have hitherto been difficult for Natural Language Processing systems to recognize and use. Non-denotative textual effects are the historical concern of rhetorical studies, and we have turned to rhetoric in order to find new ways to advance NLP, especially for sophisticated tasks like Argument Mining. This paper highlights certain rhetorical devices that encode levels of meaning that have been overlooked in Computational Linguistics generally and Argument Mining particularly, and yet lend themselves to automated detection. These devices are the linguistic configurations known as
Introduction
Rhetorical Figures are cognitively moulded linguistic devices that serve functional, mnemonic, and aesthetic purposes. Take the famous maxim from John F. Kennedy’s 1961 United States Presidential inaugural address:
Ask not what your country can do for you. Ask what you can do for your country. [34]
This expression quickly became proverbial in the American consciousness for the way it captures the spirit of a particular historical moment, the ethos of a particular administration, and the aspirations of a particular generation. Countless more prosaic formulations, by Kennedy and others, expressed that same confluence, but they left a distinctly less memorable impression. Why? Two reasons. Firstly, the formal structure and the functional structure are virtually isomorphic: Kennedy (and speechwriter Ted Sorensen) expressed the rejection of one civic attitude and its replacement by an opposite one, in the iconicity of identical terms of reference repeated in reverse order. Secondly, the form in that snug form/function coupling, on its own, is cognitively very sticky. The Kennedy–Sorensen phrase has become so widely known, in short, so easily shared, so frequently invoked and quoted and recited because of (1) the schematic congruence with which the form matches the Reject–Replace function its arrangement serves, and (2) the cognitive affinities humans have for its structural properties (opposition, repetition, sequence, and symmetry).
The cognitive affinities explain its mnemonic and aesthetic effects, but its tight form-functional correlation, studied in an approach known as
There is a growing interest in the convergence of rhetoric, argumentation, and Advanced Natural Language Processing (ANLP) – that is, in the new field of Computational Rhetoric – sparked by such works as [10,21–24,53,61,62].1 We do not put Mann and Thompson’s [41] Rhetorical Structure Theory (RST) in this category because, while it has made some valuable insights into text linguistics, it is simply incorrectly named, by scholars who appear to know little or nothing about rhetoric. RST has really to do with
Meanwhile, over in Computational Linguistics, there is an equally surprising omission. Computational Linguists have had success at detecting some Rhetorical Figures, but have shown little interest in the Rhetorical Functions those figures serve. We are building an approach to Computational Rhetoric that combines the insights of Figural Logic with the goals of Argument Mining – namely, finding patterns of reasoning in texts – which promises equally rich payoffs for both computer science (specifically, in Argument Mining and Computational Linguistics) and for rhetorical theory (specifically, the understanding of Rhetorical Figures).
Serious challenges need to be resolved in Computational Rhetoric for us to reach the Argument Mining pot of gold. We concentrate on one of the most immediate of those challenges in this paper: the “serious bottleneck” in Computational Rhetoric, identified by Marie Dubremetz and Joakim Nivre, stemming from “the lack of annotated data” [12]. This problem is especially acute for Machine Learning (ML), where we believe rich insights will come in the future, about which they note “there has been no training data available” [14]. They are clever researchers, so Dubremetz and Nivre manage some interesting results all the same, using a tiny training set, but the lack of training data remains urgent – in particular, for more significant and complex Computational Rhetoric problems than they address, the form/function correlations of Rhetorical Figures, and the combinatoric properties of Rhetorical Figures (what we call
Our approach involves a more sophisticated conception of Rhetorical Figures than has been adopted heretofore – in computational approaches, but also in humanistic and social science approaches. Our research operates at layers of formal and functional abstraction that have not been previously explored (though see [37] for promising new advances). It depends fundamentally on an annotation format for Rhetorical Figures, which we report in this paper.
In this paper, we argue for the importance of Rhetorical Figures for ANLP generally and Argument Mining specifically; we identify the challenges and opportunities of integrating a rich knowledge of Rhetorical Figures into ANLP; and, most specifically, we offer an eXtensible Markup Language (XML) annotation scheme for Rhetorical Figures that meets some of these challenges and therefore opens up new opportunities for Argument Mining and Computational Linguistics, especially in concert with ML.
Computationally, Rhetorical Figures are critical to the understanding of natural language for three central reasons. First, they are endemic to human language. This fact is established beyond dispute for a few tropes, such as metaphor, which is a central focus of Cognitive Linguistics and deeply entrenched in ontologies like FrameNet and WordNet. But it is equally true of literally (a word we do not use lightly) hundreds of other figures. If we want to develop language-perceptive algorithms, they must have knowledge of the figurative dimensions of language. Secondly, Rhetorical Figures epitomize argument structure, increasingly a prime concern for ANLP. Again, this is well understood for metaphor (and simile, though it gets much less overt attention), which epitomize analogic argumentation, but is largely unrecognized for most figures. Thirdly, many figures (especially the ones called
The contemporary scholar most responsible for the claim that Rhetorical Figures are constructions with especially tight couplings of form and function is Jeanne Fahnestock, whose Figural Logic is brilliantly articulated in
Rhetorical Figures are not without their computational challenges, of course. Metaphor remains elusive, for instance, despite all the attention it has attracted in Cognitive Science, Artificial Intelligence (AI), and linguistics, including Computational Linguistics, over the last several decades. Metaphor is a type of figure known as a Some rhetoricians, including Fahnestock, regard antithesis as a scheme. See [27] for an argument that it is a trope. The young would choose an exciting life; the old a happy death. (
But most tropes are far from tractable computationally.
Another type of Rhetorical Figure, called
If it doesn’t fit, you must acquit. (
The computational detection of figures, including antimetabole, is finding success [2,12–15,19,20,33,37,48–50]. Virtually none of this work, however, has gone beyond detection to consider the argumentation context. Dubremetz and Nivre, for instance, in their impressive work with antimetabole [12–15], appear to be unfamiliar with the Rhetorical Functions antimetabole serves. But antimetabole is a parade example of form/function correlation. It has a small set of Rhetorical Functions, keyed to the iconicity of its formal structure (which evokes balance and opposition, as well as sequence or priority). We have very limited space in this paper to demonstrate these Rhetorical Functions, so a few examples will have to suffice.
One paradigmatic function of antimetabole is to convey Reciprocal Force, illustrated by Sentence 4, Newton’s third law of motion.
If you press a stone with your finger, the finger is also pressed by the stone. (
Newton’s third law is often expressed as “for every action, there is an equal and opposite reaction” (which features another Rhetorical Figure, polyptoton – a repetition of the same lexical stem with different morphology – here,
A very similar Rhetorical Function of antimetabole is to convey Reciprocal Specification, a kind of mutual definition, illustrated by Sentence 5:
Gay rights are human rights, and human rights are gay rights. (
In this phrase the notions of ‘human rights’ and ‘gay rights’ are reciprocally identified with each other. Sentence 5 says not only that you can’t have one without the other, neither makes sense without the other.
A third Rhetorical Function of the antimetabole is to convey Comprehensiveness, illustrated by the ordinary-language example, Sentence 6:
A place for everything, and everything in its place. (
The reverse repetition in Sentence 6 implicates a kind of a reciprocal
A fourth Rhetorical Function of the antimetabole is to convey Irrelevance-of-Order, well known from algebra and predicate calculus:
There are other ways to express the principle of commutation, but none as natural and iconic as formulae like 7. Opposite sequences of the same variables, on either side of the same operator, pivoted by a predication of identity, equivalence, or equality inescapably
We have built a curated list of over 400 antimetaboles illustrating these functions, but only have space for a few more representative examples:
A corollary of PHC [the Principle of Hierarchical Coincidence] is that resources flow toward political power, and political power flows toward resources; or, the power of state and of capital typically appear in conjunction and are mutually reinforcing. (
Women are changing the universities and the universities are changing women. (
The negation of a conjunction is the disjunction of the negations and the negation of a disjunction is the conjunction of the negations. (
Anger and depression, the pop-psych books tell us, are two sides of the same coin: depression is anger suppressed, anger is depression liberated. (
I meant what I said and I said what I meant. (
Whether we bring our enemies to justice or bring justice to our enemies, justice will be done. (
With a similar qualification, in the Cambridge Grammar of the English Language, a head ‘plays the primary role’ in ‘determining the distribution of the phrase’ (introductory chapter signed by Pullum and Huddleston, in Huddleston and Pullum 2002:24) (
“Spanglish,” [is] the combination of Spanish and English (or English and Spanish) (
It is such clearly identifiable Rhetorical Functions as these, coupled with the relative ease of rhetorical-scheme detection, that make Rhetorical Figures so promising for computational tasks in which comprehension is central, like Argument Mining and text summarization.
Again, however, there are challenges. They are not as thorny as the challenges of most tropes because they concern formal surfaces, not semantic depths. But they exist. In particular, figures rarely come in isolation. Sometimes figures get in each other’s way. Take an example like 16:
We didn’t land on Plymouth Rock. Plymouth Rock landed on us. ( A version of Example 16 predates its use by Malcolm X. It first shows up in a Cole Porter song, “Anything goes” [52], where it signals topsy-turviness of sexual morés rather than of progress and freedom.
These two sentences look like an antimetabole (
But often the combinatorics of Rhetorical Figures work the other way; in such cases, which are frequent, they reinforce and constrain each other in predictable ways. In some combinations of Rhetorical Figures these effects lead to clearer, even monovalent Rhetorical Functions; often, they aggregate into robust nuggets that offer particularly rich rewards for Argument Mining. When this happens – in figural collocations – the functions of two or more figures work in concert to narrow the range of meaning. We’ve already seen a clear example of such a collocation, in fact: the Kennedy–Sorenson maxim (Example 1). It instances antimetabole (
Figural collocations present a technical challenge that has not been met, but which strikes us as quite soluble, and they present rich opportunities. The challenge is to detect multiple overlapping figures; so far, researchers have only detected individual figures. The opportunities arise because Rhetorical Functions are often enhanced and stabilized under the combinatorics of collocation; some collocations are much more common than others, because of their cognitive similarities and functional implications. When two or more figures combine in certain recurrent collocations, the functions they convey tend to be highly consistent. Collocations, that is, often lead to functional conspiracies.
For instance, when antimetabole collocates with mesodiplosis and antithesis, the combined function is primarily to reject the negated predication utterly and replace it with the positive predication. Example 1 is our paradigm, but here are two more:
We don’t build services to make money; we make money to build better services. ( “Nonviolence has not failed us,” he said, “we have failed nonviolence.” (
Unpacking these, we have reciprocal causal relationships – between making money and building services in 17; between nonviolence and human agents in 18 – with one causal relationship rejected, the other promoted in its place. All three figures are important here. Antimetaboles give us two terms in reverse order. Mesodiploses join two sets of terms in the same grammatical and semantic relationship, a Rhetorical Function of Equivalent Relation; it is not exactly proportional or analogical, but it puts two sets of terms (A&B and C&D) into something very much like the familiar Aristotelian formula, A:B::C:D. That is, A has the same grammatico-semantic relations to B as C has to D. In 17, we have two Subject/Complement causal relationship mediated by
We see inescapably now, too, that mesodiplosis is central to the Rhetorical Functions of 4, 5, and 7–15. We wouldn’t get Reciprocal Force without the Equivalent Relation enforced by the mesodiplosis of verbs like
Ultrabooks are laptops after all, but not all laptops are ultrabooks. ( All compounds are molecules (since compounds consist of two or more atoms), but not all molecules are compounds (since some molecules contain only atoms of the same element). (
Here the Equivalent Relation of the copulative mesodiplosis is crucial, exactly as it is with Reciprocal Specification – the most natural coding of the IsA relation – but the addition of antithesis negates one direction of the predication. In predicate calculus terms, the antithesis turns a bidirectional (mutual) entailment into a unidirectional entailment.
We do not pretend to have a full and complete mapping of form to function, nor a chart of Rhetorical Figure combinatorics. This work is still in the very early stages, but these examples show that it holds considerable promise, and we believe ML corpus studies can be extremely helpful, especially for figural collocation.
Figural collocation, as we come to understand the functional combinatorics better, holds perhaps the greatest promise of Rhetorical Figures for computational understanding of natural language. Our paradigm example (1), a collocation of the schemes antimetabole and mesodiplosis with the trope antithesis provides a pitch-perfect example of the Rhetorical Function, Reject–Replace. A computational analysis of Kennedy’s inaugural address tuned to the workings of Rhetorical Figures could tell us what the address was about – namely, the rejection of an ethos of entitlement and its replacement with an ethos of responsibility – virtually on the basis of this particular collocation (along with, of course, the lexical semantics of
We can, and should, rely on rhetoricians to guide us in the functions of certain figures and certain figure-bundles, at least in these early stages. But the rhetorical tradition is haphazard at best, and too frequently conflictual. The terminology alone is forbidding, not just because of its Greek and Latin roots but because of its historical inconsistency. As much as Computational Argumentation studies can benefit from a better understanding of Rhetorical Figures, Rhetorical Figure theories can benefit from computational studies of form and meaning. (And, yes, that sentence was an antimetabole, bundled with mesodiplosis; the Rhetorical Function is Reciprocal Force, modulated by the possibility-modality of
The path forward is to bootstrap rhetoricians’ knowledge by way of annotated corpora and Machine Learning, so that computationally mined data can start to clarify what functions certain figures and figural collocations have, through confirmation, through refinement, and through new discoveries – all of which we have good reason to anticipate.
We can discover the proportionality of certain collocations (observationally, both antithesis and mesodiplosis seem strongly to co-occur with antimetabole), and the correlation of the collocations with the Rhetorical Functions (as sketched above, on the basis of limited and anecdotal research). At its best, this work can revolutionize Computational Argumentation studies and rhetoric in the way corpus linguistics revolutionized lexicography and generated ontologies like WordNet and FrameNet. But even if it does not prove entirely revolutionary, we are very confident of finding important form/function correlations that can significantly inform Computational Argumentation and Rhetorical Theory in novel ways.
Linguists have a role to play here as well. We have seen that grammatical features such as transitivity and copularity interact with Rhetorical Figures to determine their Rhetorical Functions, but Grammatical Constructions also interact with Rhetorical Figures intriguingly ([6]:55–60). For instance, the well-known chiastic Easier-to-take-the-A-out-of-B-than-the-B-out-of-A catchphrase clearly fits the criteria of Construction Grammarians [32]:
[I]t was easier to take the girl out of the brothel than to take the brothel out of the girl. ( It was much easier to take Kuhn out of Harvard than Harvard out of Kuhn. ( It was found easier to take the evacuee out of the slum than to take the slum out of the evacuee. ( After twenty-five years in the field. I’ve traded the front seat of a 4 × 4 for a swivel chair and a desk. The change did not come easily for me. As the old saying goes – it’s a lot easier to take the man out of the field than to take the field out of the man. ( I could take Tarzan out of the jungle. Could I take the jungle out of Tarzan? (
The tricky aspect of these examples is the presence of a trope, metonymy. The word in the first A location (A1) in all of these cases is the physical person (e.g.,
There have been limited successes in figure detection over the past several years [2,12–15,19,20,33,37,48–50]. But they have been restricted both in method and in scope, with no concern for figural combinatorics.
Hromada’s work [33], for instance, is very successful at the detection of antimetabole, but he defines antimetabole in a narrow, overdetermined way. Using the Waterloo Rhetorical Figure Notation (WRFN) [30]4 Hromada calls this notation,
Little of the figure-detection work looked for more than one figure at a time, and none looked for figures specifically in terms of recurrent collocations. Hromada also searched for three other repetition figures (anadiplosis, epanaphora, and epiphora), but only in isolation. Dubremetz and Nivre regularly find antitheses collocated with antimetabole [12–15], because they use negation as a correlative of antimetabole (which markedly improved their detection success), and they regularly find parison (syntactic parallelism, discussed in more detail below) as well. But they were not looking for those figures and do not report their results. Only Gawryjołek [19,20] was genuinely looking for collocated figures, but again that was largely incidental to his focus. He did not interpret the collocations at all, nor report on the statistics. He was merely looking for multiple figures in the same corpus, many of which overlapped.5 Anadiplosis is clause-final-clause-initial lexical repetition (< …Wx >< Wx …>). Epanaphora is clause-initial lexical repetition (< Wx …>< W x …>). Epiphora (also known as
But detecting Rhetorical Figures must be only the beginning of the story. We know from millennia of humanistic research that linguistic forms correlate with Rhetorical Functions – that figures do communicative work beyond ‘mere aesthetics’ – and we can thank Fahnestock for collating and expanding this research so clearly in the contemporary era [17]. On the basis of this work, Fahnestock’s Figural Logic, we can use the detected figures to help chart meanings – sometimes very fundamental meanings.
But it is still an open question how well the form–function couplings that humanists have found stand up beyond the small sampling of discourse they have cared to explore – the orations, poetry, and killer ripostes that stock their examples. Worse, much of the collection and curation of figures has also been quite lazy, with the same examples propagating century after century for the same figures. The data is highly skewed.
Do these form–function couplings hold up in the conversations, news stories, opinion pieces, blogs, review articles, short stories, tweets, scientific arguments, and so on, that fill the vast sea of everyday and specialist human discourse? We have our hunches, but we don’t know. Corpus studies can tell us. Do Reciprocal Force antimetaboles always occur with transitive verbs, for instance? Do Reciprocal Specification and Subclassification antimetaboles always occur with copulas? Do Irrelevance-of-Order antimetaboles always show up with conjunctions and disjunctions? How frequently does mesodiplosis collate with antimetabole? What other robust collocations are there, with what functional implications? We have intuitions, and there is a good deal of particularized research (that is, specific works of rhetorical criticism), but intuitions and particularized research need to be tested on corpora.
How do figures cluster in terms of genres? Do individual authors have identifiable figural proclivities? Is sentiment a trigger for certain figures? Do certain argument types favour certain figures? Are there author-genre figural effects? Argument-sentiment figural effects? Author-sentiment? Again, intuitions and particularized research suggest answers; again, these need to be tested.
When multiple figures co-occur, as they almost always do, which functions conspire, which remain independent, which ones take precedence in the possibility of a conflict? Are there functional differences between “accidental” figures and “designed” figures? If figures are form–function couplings, does it even make sense to speak of ‘accidental’ figures?
This work can undoubtedly be strengthened by ML, one of the motivations for the format we have developed for annotating Rhetorical Figures, to complement (and interact with) the annotation formalisms developed for part-of-speech tagging, speech-act tagging, and so on. Corpora annotated with Rhetorical Figures can be used to train systems on new and more sophisticated detection tasks, especially for collocations and functional correlations.
Marie Dubremetz [12]:25 calls the work of Jakub Gawryjołek [19,20] and Daniel Devatman Hromada [33] “pioneering;” we might choose, rather, the term “exploratory,” in the sense of crucial preliminary work charting landmarks and features of a new territory, a term we would also extend to the research of Claus Strommer [60] and Mohammad Alliheedi [2,3], and certainly to Dubremetz herself, who has gone (with her doctoral main advisor, Joakim Nivre) deeper into the territory of Rhetorical Figures than any other Computer Scientist. Dubremetz’s work inaugurates a second phase of figure detection, the systematic refining of methods in recurrent studies. She has been particularly preoccupied in this work with two issues, annotation and ranking. She has redefined figure detection “as a ranking task” [12]:27, rather than a binary task. Examples 26 and 27 are from her data: 26 gets a rank of 1, the premier instance of a chiastic figure in her corpus, and a chiastic score of 99.77%; 27 is ranked 3000, the worst chiastic instance in the corpus, with a chiastic score of 0.01% (both examples, with their ranking, along with the 2,998 between them, can be found at stp.lingfil.uu.se/~marie/corpus/quote_chiasmus.txt):
There are only two kinds of men: the righteous who think they are sinners and the sinners who think they are righteous. (
You hear about constitutional rights, free speech and the free press. Every time I hear these words I say to myself, ‘That man is a Red, that man is a Communist!’ (
All figure-detection researchers, like Dubremetz, worry about what is a ‘true figure’ and what is an ‘accidental figure,’ or a ‘nonfigure’ that just happens to resemble a ‘true figure.’ But they are not thinking in terms of instances like Example 16 above (the antanametabole), which resembles one figure but is really a different figure. They are thinking, rather, about rhetorical intension. Did the speaker/writer intend their utterance to be a Rhetorical Figure, they worry, or did the words just fall into a pattern that looks like a Rhetorical Figure “Is the figure ‘accidental’?” We think this is a non-issue, but Dubremetz’s ranking strategy is an important way of showing how figures operate. In particular, it illustrates clearly that figural activity is not just a binary on/off situation. We can see instantly that the reverse repetitions of 26 (along with the medial repetitions, the mesodiplosis, of “who think they are”) serve a clear Rhetorical Function – the one we call
Dubremetz has also improved our understanding of annotation, by reflecting the same themes of ‘true figures’ and ‘accidental figures’. She develops a protocol for ensuring reliable human annotation of figures to build ML training corpora, involving the comparison to curated exempla, substitution tests, and translation tests [12]:33. Her protocols are valuable because they do indeed help to find figurative instances that align with the most established and venerable examples from the rhetorical tradition – as one would expect, since her instances are chiefly calibrated against such examples.
But she helps (unintentionally) to expose how much the curating rhetoricians and literary critics have fallen short on this matter, and setting figure detection researchers from Gawryjołek to Dubremetz on the wrong path. A closer look at Examples 26 and 27 should make this point. You already know, from previous discussions, that 26 contains a mesodiplosis, the clause-medial repetition of “who think they are” – contributing to the referential stability of the instance. But there’s more. There are two figures of parallelism in 26 as well, parison and isocolon, figures which frequently collocate with chiastic figures. Parison, mentioned briefly above, is syntactic repetition: both of the clauses have the same syntactic structure; that is, the second clause repeats the structure of the first clause:
[Det N]NP [Wh-Pro]NP [V [[Pro]NP [V[Adj]AP]VP]S]VP
[Det N]NP [Wh-Pro]NP [V [[Pro]NP [V[Adj]AP]VP]S]VP
Isocolon is prosodic repetition: both of the clauses have precisely the same prosody; the second clause repeats the prosodic contour of the first clause:
Woes unite foes.
Woes unite enemies.
The moral of the story, then, is that Example 26 is not so much a ‘true’ antimetabole in comparison with the ‘accidental’ or ‘false’ or ‘non-’ antimetabole 27, as it is a more effective collocation of figures in which antimetabole is made more salient and cohesive through figures of parallelism and referentially stabilized by mesodiplosis. Rhetoricians and literary scholars have routinely catalogued instances like Example 26 without identifying – for the most part, without even noticing – all the other figural activity that collectively contributes to their aesthetic and suasive effects.
Example 27 also has additional figuration, but none of that figuration collaborates with the reverse repetitions. It has, for instance, some routine metonymy (
The absence of parison is particularly crucial. Indeed, now that you know what parison is, you can look back over previous examples and see how regularly it collocates with antimetabole: the relevant phrases or clauses of Examples 5 and 7–15 are all syntactically parallel; and it is only the presence of negation that puts 1, and 16–20 slightly out of syntactic parallel. One particularly central job that parison does with antimetabole is to ensure that the terms repeating in reverse order swap syntactic roles. The syntactic (and often semantic) roles almost always interchange when terms reverse under parison: subjects become objects, objects become subjects, and so on. Dubremetz does not appear to be aware of the figure parison, but she notes that “the first salient common point we notice [with paradigmatic cases of antimetabole] is their perfectly symmetrical switch of syntactic roles” [12]:37. Accordingly, she tunes her detection algorithms to look for such “syntactic features,” and (unwittingly) searches for parison.
Example 27 does not exhibit parison. There is no symmetrical switch of syntactic roles. Indeed, the two clauses are so far out of parallel that repetitions of the relevant words are interrupted by a clause boundary (the first
We stress again that Dubremetz, Hromada, Strommer, and the other computational figure detectors, back to the beginnings of this work with Gawryjołek, are not to be faulted on their true/false or graduated treatment of figures. They have simply and reasonably followed what rhetoricians and literary scholars propagated, rather negligently, for millennia. Rhetoricians have consistently defined figures in terms such as “two or more words are repeated in inverse order” [66]:492 and “inverting the order of repeated words (ABBA)” for antimetabole [36]:14, and then placed those definitions alongside representative examples like 1, without noting the other figures in the instance, without identifying the importance of collocation, and without tying Rhetorical Functions to the collective operation of Rhetorical Figures. Indeed, computational figure detectors like Gawryjołek and Dubremetz are to be commended. Were it not for the issues they have kicked up in their research, attempting to find the essential, computationally tractable character of certain figures, working from the paradigmatic examples that Rhetoricians and literary scholars assembled, these fundamental aspects of Rhetorical Figures may never have surfaced.
If we want to resolve the serious annotated-data bottleneck Dubremetz and Nivre identify for Computational Rhetoric [12], especially if we want to tap into the resources of ML to meet the challenge of Rhetorical Figure combinatorics, we will need texts annotated for occurrences of multiple figures – mutually re-enforcing, often interpenetrating, sometimes wholly overlapping multiple Rhetorical Figures, but also independent, or even mutually interfering, multiple Rhetorical Figures as well. That prospect requires a standardized annotation scheme. We have developed such a scheme.
The XML Waterloo Annotation Scheme (WAS) for Rhetorical Figures is quite straightforward, crucially identifying the figure-span and its defining elements, with standard start (<x>) and end (</x>) tags. For the figure-span, we just use the name of the figure itself; for the elements, it is the name of the figure; followed by an alphabetic variable, to distinguish the elements; followed by a numeric designation, to mark the sequence in which the elements occur; separated by dashes; all terms mandatory. Antimetabole, for instance, includes the tags <antimetabole>, <antimetabole-A-1>, <antimetabole-B-1>, <antimetabole-A-2>, and <antimetabole-B-2>. If we take the classic example of antimetabole, familiar from All for one, one for all. [16]:80 <antimetabole> <antimetabole-A-1>All </antimetabole-A-1> for <antimetabole-B-1>one </antimetabole-B-1> <antimetabole-B-2>one </antimetabole-B-2> for <antimetabole-A-2>all </antimetabole-A-2> </antimetabole>. All <mesodiplosis> <mesodiplosis-A-1>for </mesodiplosis-A-1>one, one <mesodiplosis-A-2>for </mesodiplosis-A-2> </mesodiplosis> all. <parison> <parison-A-1>All for one </parison-A-1>, <parison-A-2>one for all </parison-A-2> </parison>. <isocolon> <isocolon-A-1>All for one </isocolon-A-1>, <isocolon-A-2>one for all </isocolon-A-2> </isocolon>. A most beastly place. Mudbank, mist, swamp, and work; work, swamp, mist, and mudbank. [11]:216 A most beastly place. <antimetabole> <antimetabole-A-1>Mudbank </antimetabole-A-1>, <antimetabole-B-1>mist </antimetabole-B-1>, <antimetabole-C-1>swamp </antimetabole-C-1>, and <antimetabole-D-1>work </antimetabole-D-1>; <antimetabole-D-2>work </antimetabole-D-2>, <antimetabole-C-2>swamp </antimetabole-C-2>, <antimetabole-B-2>mist </antimetabole-B-2>, and <antimetabole-A-2>mudbank </antimetabole-A-2> </antimetabole>. A most beastly place. <parison> <parison-A-1>Mudbank, mist, swamp, and work </parison-A-1>; <parison-A-2>work, swamp, mist, and mudbank </parison-A-2> </parison>.
This scheme is scalable and adaptable, allowing for expansion. So, while antimetabole so frequently involves two inversely repeated words that rhetoricians use the shorthand ABBA for it, sometimes there are three (which would technically be ABCCBA), four (ABCDDCBA), or more inverse repetitions, as in 31a (antimetabole annotation, 31b; there is also a parison, 31c).
Phenomena such as 31 have led us, incidentally, to redefine antimetabole as “reverse repetition of at least two words.”
Note that we use a system of standoff markup, as did Gawryjołek [19,20], with each figure annotated individually, rather than tagging all figures concurrently in the same text. Not only is this a cleaner approach than inline markup with respect to the data structure, it avoids one particularly troublesome problem for XML – the complication of overlapping tags, where one figure starts inside another’s tags but ends outside them. In our first attempt to deal with this problem we used XML attributes [55], but standoff markup is a more efficient solution. We illustrate this with another instance, while also introducing a new suite of figures, the zeugmas. Zeugmas are figures in which one term or lexical string, very frequently implicating a verb, ‘controls’ at least two other terms or phrases, as in 32a, in which “He hated” applies to both Noun Phrases (NP) in the first clause (“white oppression” and “white domination”) as well as to the NP in the final clause (“white people themselves”). (The zeugmatic suite of figures is divided in terms of the location of the governing term or phrase – in prozeugmas, as in 32, the governing term appears before any of the governed elements; in mesozeugmas, it is between the governed elements; in hypozeugma, it follows all governed elements.) Example 32a also exhibits epanaphora, the repetition of the adjective
He hated white oppression and white domination, not white people themselves. [40]:108b
<prozeugma>
<prozeugma-A-2> [he hated] white domination, not
</prozeugma-A-2>
<prozeugma-A-3> [he hated]
</prozeugma-A-3>
</prozeugma> white people themselves.
He hated
<epanaphora>
<epanaphora-A-1>white
</epanaphora-A-1>oppression and
<epanaphora-A-2>white
</epanaphora-A-2> domination
</epanaphora>not
<epanaphora-A-3>white
</epanaphora-A-3>
</epanaphora>people themselves
<antithesis>
<antithesis-A-1>He hated white oppression and white domination
</antithesis-A-1>
<antithesis-B-1> not white people themselves
</antithesis-B-1>
</antithesis>.
He hated
<parison>
<parison-A-1>white oppression
</parison-A-1> and
<parison-A-2>white domination
</parison-A-2> not
<parison-A-3>white people
</parison-A-3>
</parison>themselves.
It is common to treat zeugma as ‘omitting’ a version or versions of the governing term. Indeed, aficionados of the recent history of formal linguistics will remember that Transformational Grammar treated zeugmas with Raising operations that deleted the relevant terms, and various forms of Lexical Theories (like Government-Binding Theory) handled zeugmas with null proforms, such as traces or PROs. We have adopted a similar convention, signaling the governing structure in zeugmas with the kind of ghost repetition, or echo, you see in 32b (“[he hated]”).
We have developed an interface for displaying multiple figures in the same instance, an evolution of Gawryjołek’s JANTOR interface [19,20]. Gawryjołek’s interface uses different colours simultaneously, turning them on with check boxes, to signal individual figures in the same passage of text. But since Rhetorical Figures do not respect one another’s boundaries, since they overlap and interpenetrate, the colours quickly cancel each other out just when the signaling is most valuable, and the passage can become a wash of brown; some figures (as with Example 30’s parison and isocolon) overlap completely. We avoid representing them simultaneously therefore, and provide a radio-button toggling interface to display them. Figure 1 illustrates this approach for Example 32.

Depiction of interface for displaying multiple figures in a single instance, representing Example 32. For printing reasons, this depiction substitutes bold italics for colours.
We can now return to the standoff-markup solution to overlapping tags: Example 32 demonstrates the sort of problem that interpenetration of figures cause for inline-markup using XML. Figure 2 is a diagram of the figural elements of Example 32, showing that it is possible to tag some of the Rhetorical Figures in nesting ways – for instance, by marking the open and close of epanaphora inside the tags for parison; and both of those Rhetorical Figures naturally nest within antithesis and prozeugma. However, the latter two figures interpenetrate, with prozeugma starting within the text-span of antithesis but ending outside its text span. That is, inline markup would result in a syntax error for antithesis and prozeugma: <antithesis> <prozeugma> …</antithesis> </prozeugma>. Standoff markup avoids this problem altogether.

The interpenetration of figures can lead to syntax errors for inline XML markup, illustrated by the crossing antithesis and prozeugma arcs in this representation of Example 32.
There are other solutions to the interpenetration problem – most notably using an inline XML annotation scheme utilizing attributes, and avoiding XML altogether, with something like a Resource Description Framework (RDF) approach – which have certain advantages. We outlined an attribute-based solution in the previous iteration of this paper, delivered at CMNA 16 [55] and we have explored an RDF approach in unpublished work. But the standoff annotation scheme is the cleanest and mnemonically most workable for human annotators and by keeping the scheme stable we can use and maintain conversion tools for other formats, for various purposes, as it becomes necessary. For instance, conversion to RDF makes the instances highly definable in an ontology (for instance, in OWL or RDFS) that can capture much more detail about constraints, relationships and categorization, as that data becomes available.
Our approach also makes use of a unique and persistent URI for each Instance in our database, so that, for instance, algorithms can easily recognize the
Daniel Devatman Hromada concludes his important computational exploration of Rhetorical Figures by suggesting that the goal of such work is not just to produce yet “another tool for stylometric analysis of textual corpora,” but to get at a fundamental question of Natural Language: “What is the essence of figures of speech and how could they be represented within&by an artificial and/or organic symbol-manipulating agent?” [33]. While Hromada doesn’t go there himself, his goal is precisely the objective of the kind of research we have labelled
Our current annotation scheme does not solve all the problems. The eventual target is to develop a markup scheme that provides computationally accessible information for all Rhetorical Figures, but we have concentrated almost wholly on Rhetorical Schemes in our research to this point, figures of surface phenomena (phonology, morphology, lexis, and syntax). Tropes, which concern the trickier phenomena of semantics, provide significant complications that we are just beginning to tackle. But we believe they are largely tractable within our annotation scheme. Example 33, for instance, exhibits a general metaphor (“mattresses of kelp”) and a particular subtype of metaphor known as
The advancing tide pushed mattresses of kelp up the scarp of the beach. [7]:143
The advancing tide pushed
<metaphor>
<metaphor-S-1>mattresses
</metaphor-S-1> of
<metaphor-T-1>kelp
</metaphor-T-1>
</metaphor> up the scarp of the beach.
The
<personification>
<personification-S-1>advancing
</personification-S-1>
<personification-T-1>tide
</personification-T-1>
<personification-S-2>pushed
</personification-S-2>
</personification>mattresses of kelp up the scarp of the beach.
What is important about the annotations of 33 is that sequence is much less important for metaphor, perhaps irrelevant, though some scalar tropes (such as
Footnotes
Acknowledgements
We would like to thank our colleagues at the University of Waterloo, including Elena Afros, Tyler Black, Adam Bradley, Kyle Gerber, Alice Leung, Isabel Li, Ashley Mehlenbacher, Devon Moriarty, Omar Nafees, Ricky Rong, George Ross, Terry Stewart, Katherine Tu, and Yetian Wang; our international colleagues, including Marie Dubremetz, John Lawrence, Jelena Mitrović, Chris Reed, Jacky Visser, and James Wynn; and the Social Sciences and Humanities Research Council of Canada for financial assistance. We also thank four anonymous reviewers for CMNA and
