Abstract
Dung’s abstract argumentation frameworks have had a very significant role in the rise in interest in argumentation throughout this century. In this paper we will explore the impact of this seminal idea on a specific application domain, AI and Law. Argumentation is central to legal reasoning and there had been a considerable amount of work on it in AI and Law before Dung’s paper. It had, however, been rather fragmented. We argue that the abstract argumentation frameworks had a unifying effect by offering a means of relating previously diverse work. We also discuss how the particular demands of legal systems have led to developments building on the basic notions of abstract argumentation.
Introduction
From its very beginnings Artificial Intelligence and Law has been interested in argumentation. One of the first major AI and Law projects was McCarty’s TAXMAN [74], which attempted to reproduce the argumentation in the majority and minority opinions in
Law (at least in US and UK) is
Legal decisions must be
Law is
Law is
For these reasons AI and Law has always needed to concern itself with argumentation and there was a good deal of work before [42] appeared in 1995. AI and Law therefore offers an excellent domain in which to explore the impact of Dung’s seminal paper [42] by examining the differences his ideas made.
Argumentation in AI and Law before Abstract Argumentation
In [13], the journal version of a 1995 Jurix keynote speech [12], we can find a survey of argumentation in AI and Law before Dung. It made a distinction between arguments based on a representation of cases and arguments based on rules. The former concentrated on the
Legal case based reasoning in the early 1990s
Thinking about reasoning with legal cases was dominated by HYPO [102] and [7], and subsequent work deriving from it, including CABARET [112], CATO [4] and BankXX [103]. For an overview of HYPO and its successors see [20]. In these systems cases could be seen as collection of factors.1 A factor may be taken as a stereotypical pattern of facts with legal significance. As such they are Boolean: either present in, or absent from, a case. As discussed in [20], several of the systems use dimensions, which have magnitudes and range from an extreme pro-plaintiff point to an extreme pro-defendant point, rather than factors. For simplicity we will just use
In the first ply the proponent cites the precedent with the desired outcome which most closely matches the current case, arguing that, on the basis of the similarities, the decision should be the same.
In the second ply the opponent can respond in two ways:
By citing the precedent with a different outcome which most closely matches the current case as a
By citing features in the current case but not the precedent, or in the precedent but not the current case, to
In the third ply the proponent attempts to rebut the opposing arguments by distinguishing the counter examples and down playing the differences, arguing that they are not significant.
This structure is a very natural way to organise legal arguments of the sort found in the high level US court decisions used by HYPO since it closely follows the structure of Oral Hearings in the US Supreme Court [1]. In [4] and [112] some additional structure was provided by the identification of the argument moves that the participants use to present their arguments within these three plies.
A key point about this approach is that there was no formal, theoretical, underpinning. Even though the model is computational (the algorithms involved in HYPO were given in Appendix A of [4]), the analysis relates to the informal argument tradition [46] which attempts to provide a description of legal argumentation from consideration of examples of actual practice. HYPO drew especially on the oral hearing stage of the US Supreme Court. Typical of this style of analysis we see the dialogue divided into a sequence of
CATO [4] has eight argument moves. Essentially in the first stage a case is
A second feature of this kind of argumentation is that there was no attempt made to evaluate arguments: all the emphasis was on the generation of arguments, and the onus was on the user to decide which ones were strong and convincing. This stress on generation of arguments rather than their evaluation continued in CABARET [112] and BankXX [103] which generated its arguments through heuristic search of a collection of
Initially the output of rule based models of law relied on the standard
Argument for modelling legal processes: Dialogue games
Dialogue Games were introduced in philosophy, mainly with the intention of modelling fallacies [65] and [72]. These were taken up in AI as a way of interacting with knowledge bases [25] and [80]. Dialogue games were popularised in AI and Law in by [64], [51] and [63]. Again, the point was not to predict the outcome of cases. In [63] the purpose was to model legal dialogues so that hard cases could be specified in dialogical terms. In [51] arguments were evaluated to identify which issues were in dispute so as to model the process of pleading.2 “The process performed by the parties to a suit or action, in alternately presenting written statements of their contention, each responsive to what precedes, and each serving to narrow the field of controversy, until there evolves a single point, affirmed on one side and denied on the other, called the “issue”, upon which they then go to trial” [36].
Legal reasoning is inherently defeasible. In the court of first instance the status of arguments may change in the light of new facts or new arguments. Even when the facts are agreed, a new interpretation of a rule may reverse a decision. To account for this legal systems typically offer a right of appeal, and often the right to appeal again against the appeal decision. Thus legal reasoning may be seen as non-monotonic. Moreover, we often find norms in conflict, often, but not always, because a legal norm is promulgated as a general rule and a series of exceptions. Further, since legal norms always require interpretation to apply them to specific fact situations, we may find conflicting interpretations, as in the case of open-textured concepts [66]. It is clear therefore that legal systems need to be able to handle non-monotonicity and conflicts between norms and between interpretations.
This could be accommodated by using a programming language that supported non-monotonicity, such as Prolog with negation as failure [68] or some more specialised language [49].3 Although Prolog was able to handle the exceptions in [109], there are aspects of non-monotonicity that it cannot deal with [50].
The notion of argumentation seemed to some to provide excellent potential as a means of handling these problems. The idea is that the conflicting arguments can be generated and the user can be called upon to decide between them (effectively playing the role of judge). Presentation as arguments is preferable to presentation as rules, because it provides the surrounding context, support for the rule antecedents and the like. This was essentially the purpose of McCarthy’s TAXMAN project [74], which was intended to generate the competing arguments in a particular case. Presentation of competing arguments was suggested as a general solution to problems of open texture in [32]. Although that paper only generated arguments, the desirability of computationally supporting the choice between them was acknowledged, suggesting that attention should be paid to producing “a representation in computer intelligible terms of what it is that makes an argument persuasive”. The responsibility for deciding which argument to accept, however, remained for the present squarely on the user.
This role for argumentation can also be found in the hybrid rule and case system of CABARET [112]. In that paper, excellently summarised by Loui in Section 3.4 of [21], there is a top layer of rules, but the interpretive work required to determine whether or not the antecedents apply in a particular case is done by HYPO style case based reasoning. Thus both case-based and rule-based traditions saw argumentation as the key to resolving the problem of open texture.
Argumentation in AI and Law had a number of features before Dung.
Its main inspiration came from informal logic traditions: notions of argumentation schemes, context, stages of dialogues and argument moves all come from that tradition. This imposes attention on the particular content and context of arguments. Arguments were rarely seen as convincing on the grounds of their form alone.
The main role for technology was the generation of arguments, whether using case based reasoning, logic programs, non-monotonic logics, neural networks or through instantiating argumentation schemes. The evaluation of the status of arguments was normally left to the user.4 Casting technology in the role of supporting rather than making legal decisions was in part a consequence of resistance to the idea of a “computer judge”. Some work on evaluation did go on, however. Arguments were evaluated in [52], although the purpose there was to identify disputed issues rather than make decisions on cases. There was also interest in using the principles described in Section 2.4. For example Prakken advocated preferring the more specific argument [83].
In rule based approaches, argumentation was usually5 Again there were exceptions, notably the work of Tom Gordon [52] and Henry Prakken [84].
The result was that argumentation in AI and Law taken as a whole was rather lacking in coherence. Arguments were generated in a variety of ways including: using instantiations of arguments schemes, ranging from generic schemes, such as that of Toulmin, to the highly specialised schemes such as those found in HYPO and its progeny; using the proof traces of logic programs; using other rule instantiations. This meant that arguments tended to take a rather diverse form, and that attacks on them could be highly application specific. The use of arguments, whether as the basis of dialogues or for explanation, tended to be very project specific, and approaches were often difficult to compare. What was lacking was a means of relating this diverse work within a common framework. This was achieved with the advent of Abstract Argumentation.
The idea that abstract argumentation could offer a way to relate the various strands of AI and Law on argumentation was first proposed in an important paper by Henry Prakken [85]. This paper draws its notion of abstract argumentation not from [42] but from the pre-journal version presented at IJCAI [41]. There had been argumentation frameworks before Dung: Prakken cites several examples in [85], including Simari and Loui [111] from general AI, and, from AI and Law, Sartor [106], Gordon [52] and his own [84]. But all of these are tied to a particular logic: in AI and Law, Gordon uses conditional entailment [48] while Sartor and Prakken rely on a version of default logic. Dung’s insight was that arguments could be
Levels in argumentation
The main point of [85]. is to distinguish three levels required for adversarial legal reasoning.
The logic level, which generates arguments.
The argument level, which organises these arguments and identifies attack relations between them. This determines the acceptability of arguments at a given point in the debate.
The dialogical level, which determines how the arguments and attacks from the level below can be deployed in a dispute. This level moves the debate forward and refers to the level below to determine the acceptability of arguments at the current stage.
The use of In the early 90s, Toulmin’s scheme was almost the only explicit scheme used. The explicit use of a variety of argumentation schemes to represent legal rules was proposed in [117], drawing on [127]. After that the explicit use of schemes became widespread: e.g. [29,53,56,95,99,120,131]. But the use of argument schemes was also implicit in earlier work: [89] argues “that much AI and Law research in fact employs the argument-scheme approach, although it usually is not presented as such” and provides an excellent retrospective discussion of this earlier work.
The second level is the argumentation framework itself, and it here that Dung’s framework makes its impact. Whereas in previous work the argument framework had been dependent of the way in which the arguments were generated, Dung’s proposal was to abstract away all these generation-specific aspects. Thus, using Dung’s framework at this level, enabled arguments to be abstracted from the means used to produce them and from their particular forms. Dung’s framework does, however require the identification of the attack relations between the arguments (in [42] an argument is “abstract entity whose role is solely determined by its relations to other arguments” and the attack relation is the only such relation in [42]). This means that nature of attack is abstract, and no distinction is made between different kinds of attack. In [85] only two types of attack, rebuttal and undercut, were recognised. This is in keeping with the focus on logic, and excluded underminers (attacks on arguments represented by attacks on their premises) because they would be attacks on sub-arguments. But there is no need to restrict the nature of attacks. If we are using the instantiation of argumentation schemes to generate arguments then it is appropriate to include attacks deriving from so called
The third level, the level of dialogue, is able to draw on the abstract argumentation framework for its moves. Although in many cases the dialogue operates on a fixed argumentation framework, it is possible to return to the lower levels to seek additional arguments if they are needed. Because the arguments are abstract, different classes of dialogue such as persuasion, deliberation and inquiry [128] can all be implemented on the same structure. Using argumentation frameworks gave rise to a new type of dialogue game (e.g. [124]). When the dialogues are based on abstract frameworks, the content of the move is changed from that found in most earlier dialogue games7 There were some exceptions, including [52]. A similar effect is achieved, by different means, in [52].
Thus while some dialogue games such as [124] and [28] operated at the argument level, and were based on complete argumentation frameworks derived from the underlying knowledge base, issues relating to dialogical aspects, such as the structure of arguments and the way in which information is introduced into the discussion, required that arguments retain their structure. Therefore several researchers continued to use the more traditional style of games to explore these important issues. These include, amongst others, Prakken [91,98], Lodder [69] and Verheij [118].
A detailed examination of the three level approach is given in [96], and the approach developed in that paper was used to model reasoning with legal precedents in [98]. The second of these papers is especially significant since it enables reasoning with precedents to be rendered in terms of arguments. The key idea, reflecting the adversarial nature of legal dispute, and the fundamental principle of precedential reasoning,
The strongest argument in favour of the plaintiff.
The strongest argument in favour of the defendant.
An argument that the precedent shows that one of the preceding arguments is preferred, depending on the outcome of the precedent case.
If we represent cases as sets comprising the factors in a case in the manner of CATO [4], the three arguments come from three rules (if Plaintiff factors then find for Plaintiff, if Defendant factors then find for Defendant and a priority rule between these two rules). In this way a body of case law can be represented as a knowledge base of rules or a logical theory. This representation has proved fruitful in developing a logic of precedential reasoning in [67] and [100].
While the structure of arguments was important in [98], a body of case law was represented as a framework of purely abstract arguments in [16].9 This is, of course, not an example of the three level model of [85] (unless one counts the analyst as constituting the logic level).

Abstract Argumentation for the wild animals domain as represented in [16]. Shaded nodes indicate cycles.
Depending on which arguments can be made in a given case, argument
In Dung’s abstract argumentation framework of [42] all attacks always succeed. This gives rise to several possible semantics to determine acceptance. Grounded, preferred and stable were mentioned in [42], and since then there have been many other proposals [11]. Among the properties of the original semantics are:
The grounded extension is unique, but may be the empty set.
If there are no odd length cycles, there will always be at least one non-empty preferred extension, but there may be many preferred extensions. Multiple preferred extensions arise out of even length cycles.
There may be no stable extension.
If we consider a legal dispute, we will have arguments presented by two opposing parties. These opposing arguments may manifest themselves as cycles, giving rise to competing positions. In Fig. 1, for example we see two even length cycles: M and O represents a difference of opinion as to whether Justian is too old be an acceptable authority, and B-T-S-E represents a dilemma turning on whether deciding what constitutes unfair competition is within the remit of the Court. In courts where the facts are at issue, 2-cycles naturally arise from conflicting testimony and other evidence. Even length cycles are almost inescapable in an AF representation of a legal dispute. Odd length cycles almost never occur since odd length cycles represent paradoxes and so suggest a problem with the representation [19].
If the dispute is live, either side
In this sense legal reasoning has more in common with practical reasoning than theoretical reasoning ([61] and [9]). Consider Searle’s notion of
For this reason it is important that there is a means of presenting arguments for and against accepting one argument against another: essentially these are the arguments that will be presented in the judge’s decision. This means that some attacks will fail. The need to enable a distinction between successful and unsuccessful attacks was recognised by several researchers, both before and after Dung: for example by Pollock [82], Vreeswijk, [123] and [122], Simari and Loui [111] and Prakken and Sartor [97]. A way of distinguishing between successful and unsuccessful attacks as a direct extension of Dung frameworks was given in [5]. This approach augmented abstract argumentation frameworks by the addition of a preference relation between arguments. Now an attack fails if the attacked argument is preferred to its attacker. The effect is essentially the same as [98] which derived its preference relation from precedents. There are some problems with this approach, however, in that the preferences are simply given, not explained. Although preferences derive from many sources, such as social and moral values, and these motivated the precedent decisions, in [98] there is no scope to give reasons for the preferences other than the precedents themselves. Moreover the preferences are between arguments at a very coarse granularity: each case gives rise to only two arguments each amalgamating
To address this, an alternative extension to Dung’s framework was provided by value based reasoning [17], with the intention of avoiding the reduction of reasoning with precedents to
There are, however, as shown in [78], some strong limitations on the approaches of [5] and [17], namely that they only work correctly if all attacks are preference-dependent (which is why most work on value based argumentation follows [10] and resolves factual disputes before forming the value based argumentation framework) and if they are all direct (so that can be no subargument attacks). Moreover, while value preferences gave reasons for preferring arguments, no reasons for preferring values (other than the audience) could be given in the framework of [17], and so the problem had been merely moved up a level. Moreover as well as precedents and values, there may be other reasons for preferring a precedent: thus a general way of arguing for a preference is needed.
Arguments about preferences had been recognised as necessary in AI and Law, and addressed in [97], [86] and [54]. The problem of relating arguments about preferences to Preference and Value Based Argumentation Frameworks was solved by the notion of Extended Argumentation Frameworks [76], which allowed arguments to attack attacks as well as other arguments, and so enabled reasoning about preferences. These ideas were applied to reasoning with legal cases in [28], which applied the approach to the abstract framework of [16]. In fact, as was shown in [77], Extended Argumentation Frameworks can be rewritten as standard abstract argumentation frameworks, using arguments with statements about arguments such as “
The above discussion refers to attempts to handle preferences as an extension to Dung’s framework. There are also, approaches in which, like the pre-Dung work cited above, attacks are defined in terms of a more basic notion of conflict plus the use of preferences. The influential ASPIC+ [92] and [79] represents a contemporary example of that approach.
Argument structure
Another line of investigation in AI and Law is the structure of arguments. Although abstraction for use in a Dung style or Value Based Argumentation Framework is very useful, especially for evaluating competing positions, law does need to deal with particular concrete arguments. This can be very important: knowing the type of attack can be crucial since while preferences can be used to defend against rebuttals and underminers, they provide no defence against undercutters [92].
While there have been proposals to include a support relation as well as an attack relation [6] in abstract argumentation frameworks, the arguments in such a framework do not capture sufficient structure for successful modelling of legal argument. Another suggestion to capture the structure of arguments in an abstract argumentation framework was made in [133], but this had limitations as shown in [94]. Instead the approach taken has been to model arguments with structure (at least premises and conclusions). Although this represents a step back from abstraction, one popular approach, ASPIC+ [79], is intermediate in its level of abstraction between concrete logics and the fully abstract level and accommodates a broad range of instantiating logics. ASPIC+ is intended to generate abstract argumentation frameworks to enable its arguments to be evaluated. Another approach to capturing structure, Carneades [55], has its own method for evaluating arguments and does not typically use abstract argumentation frameworks. The relation of Carneades to abstract frameworks (and to ASPIC+) is shown in [116].
These frameworks have been used for some quite concrete modelling. Different argumentation based approaches to representing a particular case was the topic of a special issue of
Concluding remarks: Dung and AI and Law
The impact of Dung on AI in general is unarguable. Before [42] argumentation was very much a minority taste. It rarely appeared as a keyword in AI conferences and there were no specialist conferences or workshops. Now, however, it is an established subfield of AI: argumentation sessions are a feature of AI conferences, there is a specialised conference (COMMA) and several regular workshops, such as Computational Models of Natural Argument (CMNA) and Theory and Applications of Formal Argument (TAFA), and a specialist journal in On 30th September 2019. The second most cited computer science paper with argumentation in its title, abstract or keywords [17] has only 505 citations.
Before abstract argumentation there had been a lot of work directed towards argumentation in AI and Law, as described in Section 2. However, this work was lacking in coherence and unity. Projects were pursued independently and were not easy to relate to one another. Even projects with a common origin such as Aleven and Ashley’s CATO and Skalak and Rissland’s CABARET went in separate, quite different, directions, as described in [20]. This diversity manifested itself in a variety of approaches, each of which were individual to a particular project, in source material (cases or statutes or commentaries), and in the particular part of the process addressed (generation, conflict identification, explanation, etc). What abstract argumentation offered was a level of abstraction at which it was possible to relate these strands. In addition to the elegance and accessibility of Dung’s framework, with an abstract argumentation framework as the target differences between approaches such as the source of the arguments, the means used to generate them, the particular aspect of the task addressed, and the use to which they would be put could be ignored. This greatly reduced the idea that different approaches were in conflict. Different approaches could even contribute to the same framework, and different approaches could be compared at the framework level. The result was that debates as to the “correct” logic to use largely disappeared and the exploration of legal reasoning with rules and cases could be seen as synergistic rather than competitive alternatives.
This influence lay more in the perspective it offered than the explicit use of Dung’s frameworks. Of course AI and Law research on argumentation continues to exhibit a great deal of diversity, and by no means everyone uses argumentation frameworks. Defeasible logic has continued to be the basis of the work of Governatori and his colleagues (e.g. [58] and [105]) and Carneades has its own particular approach. Verheij has developed two novel approaches using DEFLOG [119] and case models [121]. Abstract Dialectical Frameworks (ADFS: see [39] and [38]) form the basis of the methodology advocated in [2]. None the less it is possible to view much of this work from the perspective of abstract argumentation: defeasible logic can provide arguments in an abstract argumentation framework, and the relation between Carneades and abstract argumentation frameworks was shown in [116]. Verheij’s DEFLOG focusses on statements rather than arguments, but is said in [119] to have “close formal relations” to Dung, although providing a richer language. ADFs can be seen as a generalisation of AFs [39].11 Abstract Dialectical Frameworks may in future, like Abstract Argumentation Frameworks, provide a way of bringing together different strands of AI and Law research. The methodology of [2] attempts to encapsulate case based knowledge and is used to represent the knowledge of CATO [4] (and other domains) in an ADF. Subsequently the ADF was extended to represent the US Trade Secrets domain with factors with magnitudes [22]. Recently it has been successfully applied to the domain of compensation for Noise Induced Hearing Loss [3]. ADFs have also been used to reconstruct Carneades [37]. They may provide a fruitful avenue for future exploration in AI and Law.
This diversity is to be expected, and welcomed. The unifying perspective of abstract argumentation applies at the middle level of a three level model. Different approaches are still to be expected at the argumentation generation and argument deployment levels, and some may choose to use a less abstract representation at the second level also. Nevertheless the abstract level remains a way of seeing connections between different approaches.
AI and Law has also been able to contribute to abstract argumentation in general. Much of the work on abstract argumentation is at a theoretical level: comparison of different semantics [11] and their complexity [43] for example. Any examples used in such work tend to very simple, even simplified, sometimes even simplistic. Law, in contrast, demands the ability to represent substantial frameworks, and requires them to be reasonably faithful to the cases they represent which militates against simplifications. Thus many of the more substantial representations of domains using argumentation frameworks have been in the domain of law. For this reason, legal instantiations have proved useful to those wishing to provide computational implementations of abstract argumentation, e.g. [45]. Three other developments were driven by the needs of legal applications. First, the need to provide reasons for choosing between positions and determining which arguments should be preferred led to value-based [17], extended [76] and metalevel [77] argumentation frameworks. Second, the need to examine the structure of arguments and to determine the nature of the attacks between them led to structured argumentation frameworks such as ASPIC+ [79] and Carneades [55]. Third, the widespread use of informal logic techniques already a feature of AI and Law, especially argumentation schemes and dialogue games, could now be used in the context of abstract argumentation, with mutual benefit.
Thus Dung’s landmark paper [41] played an important role in the development of the study of argumentation in AI and Law, by offering a level of abstraction at which the previously disparate lines of work could be related, for comparison and common enlightenment, even where subsequent work did not explicitly use Dung’s frameworks. In return AI and Law offered the opportunity to explore and develop abstract argumentation is a domain where faithful modelling of argumentation and debate is of central importance.
