Abstract
This paper presents OpMAP: a tool for visualizing large scale, multi-dimensional opinion spaces as geographic maps. OpMAP represents opinions as labelings on a structured deductive argumentation framework. It uses probabilistic degrees of justification and Bayesian coherence measures to calculate how strongly any two opinions cohere with each other. The opinion sample is, accordingly, represented as a weighted graph, a so-called opinion graph, with opinion vectors serving as nodes and coherence values as edge weights. OpMAP partitions the nodes of the opinion graph by using clustering methods. Finally, the graph is visualized as a geographic map using a method based on a particular (e.g., force-directed) layout.
Keywords
Introduction
This paper presents OpMAP: a tool which applies formal argumentation theory to structure and visualize large scale, multi-dimensional opinion spaces as geographic maps. The purpose of this paper is to provide a proof of concept. It describes the design of the tool and applies it to a simple illustrative example.
To understand how people judge a policy proposal, it is important to know what they think about other related issues. For example, one’s stance towards
The theoretical background and basic outline of OpMAP can be summarized in five steps:
A simplified
An opinion vector is modeled as a
The structured argumentation framework allows one to calculate the extent to which any two opinion vectors, i.e. labelings, cohere with each other. The opinion sample is represented as a weighted graph, with opinion vectors serving as nodes and coherence values as edge weights.
The nodes of the opinion graph are partitioned by using clustering methods.
The graph is visualized as a map using a method based on a particular (e.g., force-directed) layout.
The paper proceeds as follows: The remainder of this introductory section reviews related work on argument visualization, visualization of opinion data, as well as formal argumentation theory and graph drawing. Section 2 describes the main theoretical components of OpMAP, it is organized in accordance with the five steps specified above. Following the same structure, Section 3 develops the proof of concept by means of a simple case – a stylized reconstruction of the debate about eating meat – which serves to illustrate OpMAP’s different components as well as the theoretical framework it builds upon. Finally, Section 4 identifies challenges for future research; it concludes this paper by setting forth a list of general criteria for evaluating OpMAP and assessing the quality of visualizations generated with this tool.
Visualization tools for argument analysis
Visualizing argumentative reasoning nearly always aims at depicting the structure of argumentation [31]. Visual devices such as debate graphs, argument maps, and inference diagrams lay out the structure of arguments or debates with varying degrees of detail and scope. Software tools such as
Argument maps, which are the state-of-the-art for presenting the results of complex argumentative analyses, are no doubt useful visual devices for experts, argument analysts and critical thinking instructors [18,43]. But there is more to a debate than its arguments and their interrelations – namely the way proponents, stakeholders, or citizens assess the arguments and claims. What a debate’s proponents believe is no less important for decision-making than the debate’s argumentative structure. Against this background, this paper suggests to explore a visualization tool that complements argument maps: opinion maps. Rather than visualizing the argumentative structure (of a debate) itself; opinion maps visualize the argumentatively structured opinion landscape of a debate. OpMAP can hence be understood as a novel species in the “ecosystem of argumentation technologies” [28,34].
Visualization tools for opinion analysis
While charts (timelines, pie charts, bar charts etc.) seem to be the standard device for visualizing opinion data, three more specific visualization methods that are related to the OpMAP concept can be identified: (1) geographic mapping of regional opinion data; (2) mapping opinions in a topic space; (3) social network visualization.
Re (1): If a survey yields opinion data for different geographic regions (for example, in the U.S., counties or states), this data can be visualized by coloring an actual geographic map accordingly. Presenting the degree of public ascent to key findings of climate science across the U.S. (different map for each finding), the Yale Climate Opinion Maps [20] illustrate this approach. Unlike these visualizations, however, OpMAP does not color a real map; it rather constructs a geographic-like map fully on its own; moreover, OpMAP visualizes not only one belief dimension but complex opinion vectors.
Re (2): ConToVi is a digital research instrument for policy and discourse analysis [10]. One of its central features is the so-called topic space, a radial representation of all topics that are touched upon in a discussion. Speakers and their individual utterances can be mapped into that topic space according to how strongly they relate to the different topics. So, the topic space from ConToVi is a 2D representation of the opinion landscape of a debate. However, unlike OpMAP, this opinion space is not argumentatively structured at all; it is merely based on results from topic analysis.
Re (3): It is standard to represent social networks as graphs, to carry out cluster analysis on these graphs and to draw these graphs (typically by means of force-directed methods) [36]. OpMAP builds on a particular kind of social network, which represent agents only in terms of their complex belief states and which determines the relations between agents just with regard to argumentative relations between their belief states. OpMAP therefore uses methods from social network analysis to study and visualize argumentative networks and epistemic communities [46].
Theoretical background
OpMAP builds on a variety of well-studied formal models and algorithms. In particular, it construes policy debates as deductive argumentation [2–4,6], i.e., as arguments whose inferences are necessarily truth-preserving. This requires that arguments which instantiate standard, non-deductive schemes for practical reasoning [29,45] are reconstructed as deductively valid, e.g. by adding a corresponding general principle as connecting premiss [25]. Likewise, defeasible arguments can be represented as deductive inferences by adding the defeasible inference principle as premiss. Yet, formally speaking, deductive argumentation frameworks are a special case of structured argumentation frameworks such as ASPIC+ [32]. There are seemingly no principal reasons why the tool presented in this article shouldn’t be generalizable to structured argumentation frameworks, which allow for deductive as well as defeasible arguments.
Moreover, OpMAP adapts and uses Bayesian measures of coherence [9,12,38] in conjunction with probabilistic degrees of justification defined on complex deductive argumentation [5].
Finally, OpMAP relies upon – and might ultimately have to improve – the so-called GMap algorithm [15] in order to cluster and draw the weighted opinion graphs.
Opinion map
This section describes the main theoretical components of OpMAP.
Simple structured argumentation framework
The sentence pool
Arguments are inferences from premisses to a conclusion, where both premisses and conclusions are sentences in
A set of arguments
Multi-dimensional opinions in the structured AF
The opinions of an agent in a debate – i.e., her position – is modeled as a partial truth-value assignment to the sentences that appear in the debate.
(Position).
Let
A position
A position complete iff partial iff atomic iff tautological iff
The tautological position is denoted with ⊤.
Equivalently, a position can be expressed as a two-valued index-function over
(Minimally consistent position).
Let
A partial position
A consistent position, besides being minimally consistent, accounts for the inferential relations between sentences that are established by the various arguments in the SSAF.
(Consistent position).
Let
A partial position
Finally, we define: Let (Mutually consistent position).
Mutual agreement between belief states
An opinion landscape is given through a set of positions plus information about how strongly any two positions agree / disagree with each other. But how can one measure mutual agreement / disagreement? Such a formal measure should meet the following desiderata:
It represents how well two positions fit together given the arguments of the debate.
It is able to distinguish different degrees of disagreement for mutually inconsistent positions.
It assigns maximal degree of mutual agreement to two identical positions.
Desideratum (ArgAgre) excludes standard distance measures – such as, e.g., Hamming distance, or edit distance, which work on vector representations of positions only – as suitable measures of mutual agreement. That is because such standard distance measures ignore argumentative relations between the sentences.
Bayesian epistemology is arguably the most influential and advanced research program in contemporary epistemology [41]. It therefore seems promising to rely on Bayesian epistemology in constructing a measure of argumentative agreement between positions. This requires one, however, to introduce a probabilistic measure of degree of justification in the first place, which the following three definitions accomplish [5].
(Set of all complete positions which extend a position).
Let
The
(Spread of positions).
Let
The degree of justification of a position is simply its spread, divided by the tautological position’s spread.
(Degree of justification).
Let
As degrees of justification are probabilities (i.e., satisfy the probability axioms), they enable one to import the formal concepts and mathematical results of Bayesian epistemology into the deductive argumentative framework [5]. In particular, a range of Bayesian coherence measures [9,12,38] are now available for setting up an argumentative measure of mutual agreement/disagreement between any two positions.
Desideratum (IncoDis), according to which a suitable agreement measure is able to assign different degrees of disagreement to pairs of mutually inconsistent positions, rules out simple coherence measures, such as the naive Shogenji measure [38]. The question which Bayesian coherence measures are best equipped to deal with inconsistent sets of statements has been discussed extensively in the literature [38,40]. For the purpose of this article, a particular so-called mutual support measure is chosen as starting point [12]; this measure, in turn, relies on the Kemeny–Oppenheim measure of confirmation (see below). These choices seem to account for desideratum (ArgAgre) in a natural way, because mutual support measures assume that coherence depends on the degree to which various parts of a position support each other, which in turn is a function of the given arguments in the debate. Yet, the use of alternative coherence measures should be explored and assessed in future research.
Now, standard Bayesian coherence measures are supposed to assess the
(Mutual coherence).
Let
In calculating mutual coherence, the following confirmation measure is relied upon.
(Kemeny–Oppenheim confirmation).
Let
Note that, according to these definitions,
The mutual coherence measure can be applied either to original positions in the opinion sample or to their deductive closures (which include all sentences implied by an original position).
(Deductive closure).
Let
This means that MutCoh can spell out mutual agreement in at least two different ways (i.e., with respect to the original or the deductively closed positions). It is up to future research to figure out how this choice affects the analysis and visualization of the opinion landscape. As far as the proof of concept in Section 3 is concerned, MutCoh will be applied to the original positions.
The concept of an opinion graph explicates the metaphor of an opinion landscape and fuses the results of this subsection. Let (Opinion graph).
Network analysis
In order to understand how the opinion landscape is structured – i.e., which belief states form clusters, and how big these clusters are, or how influential these belief states are – , network analysis methods are applied to the opinion graph.
When applying network analysis methods to the opinion graph, two basic challenges arise:
The opinion graph contains edges with negative weight.
The opinion graph is complete.
Most network analysis and clustering methods are per se not able to cope with negative edge weights. Moreover, the internal structure of complete graphs is comparatively hard to discover, and most network analysis algorithms primarily consider sparse graphs.
Both challenges can be addressed by transforming edge weights and filtering edges. In principle, this can be done in many different ways. And it would be worthwhile to investigate these alternatives in the future. In order to come up with a proof of concept, this paper relies on a simple linear transformation of edge weights (namely: add 1); as regards filtering, all edges with a transformed edge weight below 0.2 are removed.
Graph drawing
Network visualization methods can further enhance one’s understanding of the structure of the opinion graph. The most straight-forward network visualization metaphor represents nodes of the graph as points on the plane and edges as curves connecting the corresponding nodes. So-called force-directed algorithms construct such visualizations; these methods aim at placing connected nodes close to each other (see e.g. [23]). This complies with an intuitive depiction of relation among objects. (Thus, within Gestalt theory of human perception, it has been verified numerous times that humans tend to think about objects as related when the distance between them is relatively small.)
While force-directed methods are very popular in practice, since they produce fairly readable drawings for sparse graphs, dense graphs represent a challenge for these methods. Also, the methods do not allow for negative edge weights. Thus, in order to make standard force-directed algorithms applicable, the edge weights are transformed so that all values are positive, and the low-weighted edges are filtered, as already discussed.
A simple and classical way to show the results of the clustering algorithm on the visualization is to color the nodes according to the cluster they belong to. To emphasize the clustering even more and to explore the human familiarity with geographic maps, Gansner et al. [14] have introduced the algorithm GMap, which extends force-directed algorithms and constructs visualizations resembling a geographic map. In this visualization each cluster is depicted by a polygon (country), the nodes that belong to the cluster are represented by points lying within this polygon, and two polygons that contain many pairs of connected edges share a boundary (i.e., are neighboring countries). It has been verified empirically that humans perform better with this map-resembling type of visualization than with simple colored visualizations when it comes to so-called group-based tasks, which are tasks that can be performed by considering nodes, edges, and clusters. Additionally, the performance for other types of tasks, where only nodes or only nodes and edges have to be considered, does not deteriorate [35]. These findings suggest using the GMap method for visualizing the clustered opinion graph.
Applying the GMap algorithm to the opinion graph poses new challenges for the method which stem from the semantic structure of the data. They will be outlined in Section 4.
Proof of concept
This section illustrates the formal components of OpMAP by means of a stylized reconstruction of the debate about eating meat – in short: the veggie debate. It develops, in structural analogy to Section 2, a simple case study, which establishes the proof of concept.

Visualization of the stylized reconstruction of the veggie debate as argument map. Besides arguments (filled boxes), the three core claims are also shown.
In the given simplified argumentative analysis of the veggie debate, three propositions take center stage:
The proposition
All arguments are reconstructed as premiss-conclusion structures with deductively valid inferences. For instance, the detailed reconstruction of the
The most delicious meals involve meat.
It is permissible to eat the most delicious meals.
If it is permissible to eat some meal, then it is permissible to eat whatever is included in that meal.
The number of premisses per argument varies, in the debate, from 2 to 6; in total, the sentence pool contains
Multi-dimensional opinions in the structured AF
In future applications of OpMAP, opinions of participants, stakeholders and citizens will be elicited by means of a survey, will be manually reconstructed from a discussion [1], or could possibly be identified by means of stance detection and opinion mining algorithms [7]. In order to demonstrate the feasibility of OpMAP, this subsection, however, simply sets up a small fictitious opinion sample. Table 1 defines and describes seven consistent partial positions in the veggie debate’s SSAF. In terms of core claims, positions 1 and 2 accept
Seven consistent partial positions defined on the veggie debate SSAF. “T”, “F” and blank space stand for true, false and judgement suspension, respectively. Sentences in the sentence pool are referenced and arranged row-wise in accordance with their function (premiss/conclusion) in the various arguments; in case one sentence appears in several arguments, different rows will represent one and the same sentence
Seven consistent partial positions defined on the veggie debate SSAF. “T”, “F” and blank space stand for true, false and judgement suspension, respectively. Sentences in the sentence pool are referenced and arranged row-wise in accordance with their function (premiss/conclusion) in the various arguments; in case one sentence appears in several arguments, different rows will represent one and the same sentence
Mutual coherence, as defined in the Section 2.3, can be calculated for every pair of positions in the opinion sample (Table 1). This yields the edge weights of the corresponding opinion graph. Figure 2 displays the weighted adjacency matrix – visualized as a heat map – of this opinion graph.
Network analysis
As explained above, the original opinion graph includes edges with negative weights; hence the edge weights are transformed (simply by adding 1); and the graph is optionally filtered by removing all edges with a transformed edge weight below 0.2. For illustrative purposes, different clustering methods available in Mathematica are applied to both the unfiltered as well as the filtered transformed opinion graph. As Table 2 shows, filtering affects the results and tends to improve the ability to find clusters, as expected. Especially the clusters obtained with
Graph drawing
Figure 3 displays the opinion graph and two clusterings found by
Figure 4 shows the result of the application of the GMap algorithm to the filtered opinion graph with the clustering found by
Discussion
This concluding section highlights challenges and directions of future research; it suggests, in particular, criteria for assessing opinion graph drawings which may be used in future investigations.

Heatmap representation of the weighted adjacency matrix of the illustrative opinion graph. Shading corresponds, in accordance with the right-hand legend, to the degree of mutual coherence between a given pair of positions.
Different clustering methods (

Visualization of the clustered unfiltered (left) and filtered (right) illustrative opinion graph. Communities are identified with

Geographic visualization of the filtered illustrative opinion graph; generated with the GMap implementation from
The simple case study presented in Section 3 establishes the feasibility of OpMAP and constitutes, accordingly, a proof of concept. A basic follow-up question is, obviously, whether OpMAP
Large debates and opinion samples pose specific problems for OpMAP’s different components:
The calculation of mutual coherence values is computationally demanding: To start with, determining a single degree of justification requires the solution of a specific #SAT problem (model counting). When mutual coherence between two positions is calculated, ever new degrees of justification have to be determined
Under the assumption that opinion graphs are relatively dense, large opinion graphs will be difficult to draw in a meaningful way; especially force-directed layouts, which lie at heart of the GMap algorithm, are prone to yield mere “hair-ball” plots. Future research has to investigate how this issue can be overcome [26].
Clustering, too, will become more difficult for large opinion graphs. That is because, in large graphs, one may expect more so-called hub nodes. These hub nodes are generated by ‘small’ partial positions, which suspend judgement vis-à-vis most of the sentences of the debate. These positions will cohere strongly with many other positions, which translates, in terms of the graph, into many heavily-weighted edges. Tailored filtering methods or other techniques might be needed to handle such hubs [37].
Preliminary evidence from an ongoing exhibition project suggests that the scalability issues can be overcome [27]. It is left to future work to analyze this large-scale implementation of OpMAP in detail.
Two further challenges do not relate directly to scalability. First, a useful opinion map of a debate will have to display meaningful labels – both for individual positions but also, and more importantly, for the different clusters, or “countries”, in the map. It should be investigated whether existing techniques, e.g. from topic labeling [21], can be applied and adapted to generate opinion cluster labels. Second, some foreseen applications of OpMAP involve dynamic updates of the opinion map. This poses novel challenges for clustering and drawing in terms of stability, i.e., neither the clusters nor the drawing should change completely once a new opinion is inserted into the sample [17,19].
Future research will have to study how OpMAP must be extended or modified in order to cope with the aforementioned challenges. For this sort of investigation, it is important to have explicit normative standards against which different versions and implementations of the OpMAP concept are evaluated.
The analogy between opinion maps and geographic maps suggests the following criteria:
Opinions in one “country” (cluster) should cohere with one another much more strongly than with opinions outside that “country”.
The size (area) of a “country” in the map should correspond to the number of agents whose opinions are located in that “country”.
The Euclidean distance between the locations of any two opinions in the drawing should correspond to the degree of mutual coherence between the two opinions.
The informal description of these criteria allows for different mathematical explications (e.g., rank correlation versus standard correlation). (Meaningful Countries), (Country Size) and (Distance) should be seen as umbrella criteria that organize a broad range of different statistical measures. Such precise statistical measures should be applied, in future research, in a context-sensitive way and in function of the corresponding investigation’s purpose.
Footnotes
Proof that MutCoh satisfies desideratum (EquiMax)
As function MutCoh is the average of multiple confirmation-terms, it is maximal (equals 1) iff each confirmation term is maximal (equals 1), which is in turn the case iff both positions entail each other. More precisely, the following statements are, in view of Definitions 8 and 9, pairwise equivalent.
Theses and arguments of the illustrative debate (“veggie debate”)
The following argdown-snippet [44] contains all theses (introduced by square brackets) and arguments (introduced by angle brackets) of the debate about eating meat, which is used for illustrative purposes in this paper.
