Uncovering interactions in multivariate contingency tables: a multi-level modelling exploratory approach

Abstract

Much quantitative behavioural social science – a great deal of it exploratory in nature – involves the analysis of multivariate contingency tables, usually deploying logistic binomial and multinomial regression models with no exploration of interaction effects, despite arguments that this should be a crucial element of the analysis. This article builds on suggestions that the search for interaction effects should employ multi-level modelling strategies and outlines a procedure for modelling patterns in data sets with small numbers of observations in many, if not all, of their multivariate contingency table cells; all expected cells must be non-zero. The procedure produces precision-weighted estimates of the observed:expected rates for each and every cell, together with associated Bayesian credible intervals, and is illustrated using a large survey data set relating voting (and abstaining) at the 2015 UK general election to age, sex and educational qualifications. Crucially, while fine detail can be explored in the analysis, unreliable rates for particular subgroups are automatically down-weighted to what is happening generally. The identification of reliable differential rates then allows a simpler hybrid model that captures the main trends to be fitted and interpreted.

Keywords

Multivariate contingency tables interactions multi-level modelling Bayesian inference voting in Britain

Much quantitative behavioural social science involves the analysis of contingency tables, many of them multivariate in structure with large numbers of cells. A great deal of that work is also exploratory in character: although there are general expectations regarding the relationship of one variable with another, there are rarely firm hypotheses, particularly regarding interactions or equivalent subgroups of people. Thus, in any analysis of such tables, it is desirable to explore the relationships in some depth. That is too often not the case, however: many studies fit binomial or multinomial regression models, but these are specified with the main effects only. Interactions are very frequently unexplored. Hypotheses (some of them implicit only) are tested through such regression models, but they fail to address the potential full richness of patterns and differences across the cells of a multi-way contingency table.

Several authors have pressed for the exploration of interaction effects. Elwert and Winship (2010), for example, argue that

Most social scientists would probably agree that the assumption of constant effects that is embedded in main-effect only regression models is theoretically implausible. Instead, they would maintain that regression effects … vary across individuals between groups, over time and across space. In other words, social scientists doubt constant effects and believe in effect heterogeneity. (p. 328)

Similarly, Gelman (2008) has argued that

… interactions are important, but we should look for them where they make sense … (p. 1)

Few social scientists have followed up that argument, however, and the basic textbooks do not encourage it. For example, interactions are mentioned only four times in the index to the second edition of Agresti’s (2002) text Categorical Data Analysis, and the main concern with each of the entries is to test whether there are any statistically significant interactions rather than to identify and interpret their intensity, that is, the size and nature of the effects. There are many more mentions of interactions in the second edition of his An Introduction to Categorical Data Analysis (Agresti, 2007): it includes a section titled ‘Allowing interaction’ and interaction terms are included in a number of the worked examples throughout the book (most of which involve relatively small tables with few categories for each of the variables). Nevertheless, there is little focus on what the interaction terms show let alone an emphasis on their potential importance relative to the main effects.

At this point, a note of caution is needed. There are difficulties with unbridled exploration for interaction effects, a point noted by Elwert and Winship (2010) who continue their argument by pointing out that

… sample sizes in the social sciences are often too small to investigate effect heterogeneity by including interaction terms between the treatment and more than a few common effect modifiers (such as sex, race, education, income, or place of residence). (p. 328)

A key concern, even with large and complex data sets, is that their decomposition can rapidly reach the point where some of the table’s cells have only small counts. This situation can be likened to the ‘Texas sharp shooter’ problem of drawing the target on the barn door after the shots have been fired (see Nuzzo (2015) among others). More formally, in the modelling context, this involves the formulation of a hypothesis only after data have already been analysed – introducing the problems of induction, multiple hypothesis testing and finding chance results that are not generalisable due to not having specific hypotheses to hand before analysing the data. Nevertheless, as argued here, exploratory analysis of a complex table can have very beneficial and illuminating results.

If the analysis incorporates a clear hypothesis regarding an interaction effect, this can generally be built into a logistic regression analysis without taking up too many additional degrees of freedom, but in exploratory studies that may not be feasible – probably one reason why Gelman (2011) argued that treatment interactions

… should be estimated using multilevel models. If you try to estimate complex interactions using significance tests or classical interval estimation, you’ll probably just be wasting your time. (p. 1)

That is the basis for the procedure introduced here, which is explicitly exploratory in its nature: once the variables for analysis have been selected – presumably based on either theory, or other empirical findings, or ‘researcher’s hunch’ – it imposes no pre-fixed, often restrictive, structure on the analysis but simply ‘lets the data speak for themselves’ (Gould, 1981), thereby maximising the potential for discovering substantial, and significant, findings.

There are, however, other problems involved in the search for interaction effects using logistic multinomial regression to explore multivariate contingency tables with several outcome variables (see Brambor et al., 2006; Mood, 2010). For example, several papers (Ai and Norton, 2003; Greene, 2010; Karaca-Mandic et al., 2012; Norton et al., 2004) have pointed to difficulties in the interpretation of interaction effects in such models, including (1) although the coefficient in the regression model may be zero, this does not mean it will be zero for every observation – whether there is an interaction effect has to be evaluated separately for every observation, which leads to (2) the statistical significance of an interaction effect cannot be evaluated by a single t-test because it can vary across the observations; (3) the interaction effect is conditional on the full set of predictor variables (as Brambor et al., 2006, showed); and (4) the sign on the interaction effect can vary depending on the covariates included in the model. Furthermore, as Agresti (2007) notes, ‘Interpretations are more complicated when a model contains three-factor terms’ (p. 218) – that is, the analysis is seeking interaction effects where there are three ‘independent’ variables (e.g. the effects of sex, age and education on voting choice, as used here).

The coefficients in a logistic regression model are ratios of ratios. If the effect of sex on voting either Conservative or Labour is being studied, for example, the regression coefficient would indicate the probability of a male rather than a female voting Labour rather than Conservative. This is fairly straightforward to interpret – together with its associated odds ratio – but if there is a third ‘independent’ variable, class perhaps, then the model must also include not only the three-way interactions (the ratio of young, middle-class females to that for young, middle-class males voting Labour rather than Conservative, say) but also the underpinning three two-way interactions (sex and age, sex and class, age and class): the output is extremely difficult to interpret – other than merely whether the observed coefficient is statistically significantly different from zero (either positive or negative). In this context and using standard approaches, one can understand Agresti’s limited interpretation of results from models with many interactions discussed earlier, but that of course is what is wanted when a large contingency table is being explored.

To circumvent these problems, this article introduces an alternative, explicitly exploratory, procedure, set in a multi-level modelling framework. The key aspect of this approach is that it examines detailed differences across subgroups but in such a way that there is an ex ante prior expectation that each of multiple subgroups does not differ from what is happening generally across all subgroups. There is thus an anchoring of the results to the null hypothesis of no effect. This prior expectation is only overturned where there is reliable statistical evidence to the contrary. This approach has been developed out of the statistical geographical analysis of mortality data where small counts (of death) are the norm when the data refer to relatively small geographical areas, with many to be examined for ‘hotspots’ (areas with especially high or low rates), and there is a need to identify these without unduly alarming the public with false positives of high risk.¹ Full details of the approach’s statistical properties have been published elsewhere (Jones et al., 2015), using a very different example. This article provides an overview of the procedure, and illustrates its use with a relatively small multivariate contingency table (four variables and 288 cells), but one that nevertheless illustrates Elwert and Winship’s (2010) point regarding sample sizes.

A multi-level random-effects model approach to multivariate contingency tables

The approach introduced here develops from the literature on disease mapping where rates are often based on relatively small numbers of observations and so are inherently unstable – a small change in either or both of the numerator and denominator can substantially alter the rate (Clayton and Kaldor, 1987; Jones and Kirby, 1980). Any modelling of such rates must take into account the stochastic variation associated with small counts in some of the table’s cells by stabilising the incidence rates (Manda and Leyland, 2006).

For each cell in the contingency table, we have an observed value. We can also calculate an expected value from a null hypothesis – the standard in much statistical hypothesis testing – that there are no differences across the cells in the rate being considered: thus, if we are looking at the propensity of individuals in the United Kingdom to vote either Conservative or Labour by sex and by age, if across the entire sample 45% vote Conservative, we would expect the same percentage in each age-sex group (e.g. both males aged under 25 and females aged over 65). We can then derive an observed relative rate for each cell, as the observed number divided by the expected: a ratio of 1.0 would indicate no deviation from the null hypothesis; rates above and below 1.0 would indicate cells with more or less persons than expected in them respectively if each subgroup voted the same way as all subgroups. Thus, by setting up the expected values in this way, we are identifying rates for voting – our outcome variable – in relation to age and sex as explanatory variables; if there were no differential effects of demography on voting, all the rates would be 1.0.

But how many more or less: are the differences from 1.0 likely to have occurred by chance or are they, in the standard statistical terminology, significantly different from 1.0 at some predetermined probability level? To address this question, we formulate a saturated (in the sense of a parameter for each and every cell of the table) Poisson regression model

O_{i} ~ P o i s s o n (π_{i})

π_{i} = e^{(β_{1} x_{1 i} + β_{2} x_{2 i} + \dots + β_{n} x_{n i})}

{Log}_{e} (π_{i}) = L {og}_{e} (E_{i}) + β_{1} x_{1 i} + β_{2} x_{2 i} + \dots + β_{n} x_{n i}

V a r (O_{i} | π_{i}) = π_{i}

In this, $O_{i}$ is the observed count for each cell of the table which is indexed by the subscript i; the table has n cells (i.e. n = 12 if we have two sexes, three age groups and two options for voting). As is common with count data, these are assumed to come from an underlying Poisson distribution with a mean of π. This mean rate is non-linearly related to the set of predictors that defines the cells of the table as an exponential relationship with the n cells represented by a set of 1/0 dummy indicator variables. This exponential relationship is transformed to a linear model by taking the natural logarithm (the log link). ${Log}_{e} (E_{i})$ is known as the offset (McCullagh and Nelder, 1989) which has its parameter constrained to 1. The effect of this is that the model analyses differences not in the log of the counts but rather the log relative rate taking account of $E_{i}$ , the expected values. The $β$ terms, once exponentiated, give the relative rate for each cell. The final line of the specification (as is the norm in Poisson models) states that the variance of the observed counts conditional on the underlying rate is equal to the underlying rate. Importantly, the standard errors as well as the coefficients can be estimated in this generalised linear model taking account of the Poisson stochastic nature of the underlying counts.

With the number of model parameters equal to the number of cells, this saturated model is not only unwieldy – especially in the case of a large table with many cells – but also little is gained as the exponentiated estimates are the same as the simple ratios of the observed to expected values in each cell; there is no pooling of information (Gelman and Hill, 2006) and each cell’s value is separately estimated – and there will be problems of fitting the model where the observed count is zero. Hence, instead we fit a random-effects two-level null or empty Poisson model

O_{i j} ~ P o i s s o n (π_{i j})

π_{i j} = e^{(β_{0} + u_{j})}

{Log}_{e} (π_{i j}) = {Log}_{e} (E_{i j}) + β_{0} + u_{j}

u_{j} ~ N (0, σ_{u}^{2})

V a r (O_{i j} | π_{i j}) = π_{i j}

where individuals, i, are placed in the n cells, j, with an overall intercept, $β_{0}$ (with no other predictors in the fixed part of the model), and $u_{j}$ are an allowed-to-vary differential for each type of person – the vote by age by sex combinations. If this differential is positive, the cell has a higher rate of voting for that party than expected; if negative, a lower rate. Assuming that these differentials come from a normal distribution, they are summarised by the variance term $σ_{u}^{2}$ , which summarises the differences between all cells and is based on information for all the different subgroups. Because the overall sums of the observed and expected values are the same, the intercept can be expected to be zero, which becomes 1.0 when exponentiated: this is the all-group average and the differentials show the relative rates for each of the table’s n cells. This level-2 variance allows for and estimates the degree of over-dispersion between different types of people after taking account of the expected stochastic Poisson variation of the counts. Thus, the higher level variance summarises the ‘true’ differences between cells over and above those expected from a random variation due to the absolute size of the counts.

In the saturated fixed-effects model, normally specified by having a dummy variable for each cell in a regression framework, each cell’s value is separately calculated – only information from that cell is used to estimate the size of the effect – whereas in a random-effects model, as specified here, the estimates are precision-weighted, so that if they are based on small counts, the estimated rates are shrunk back towards the overall rate of 1.0 (that for the intercept) of the null hypothesis of no effect. The more unreliable the rate (i.e. the smaller the number of observations on which it is based), the closer it will be to 1.0 (Jones and Bullen, 1994). This is equivalent to the Bayesian pooling of information (Beck and Katz, 2007; Jones and Spiegelhalter, 2011), and represents a data-driven adaptive procedure for handling the uncertainty associated with sparse data (Gelman, 2014). Furthermore, the estimated rates for each cell have associated Bayesian credible intervals (CIs), and here we have used the 95% intervals to summarise the degree of uncertainty around the estimates. We call an effect ‘significant’ if the 95% CIs do not include the value of 1, so that the weight of the evidence is for a distinctively high or low rate.² If the CIs on the exponentiated scale include the value 1, we do not have strong evidence that the rate for such a cell differs from that for the overall relationship; there is no credible evidence for a particular age-sex combination to have a preference for a particular political party. The exponentiated estimates provide a natural interpretation of effect size so that 2 represents a doubling of the rate of vote for that group. This is much easier to interpret than the contortions required by the multinomial logit model.

Another important advantage of this random-effects shrinkage approach is in relation to multiple comparisons, which is at the heart of the induction problem of standard exploratory procedures. If you do enough testing, the chances of finding significant results increase rapidly. However, as demonstrated by Gelman et al. (2012), it is much more efficient to shift estimates towards each other rather than try to inflate the usual confidence intervals through a Bonferroni correction to control the overall error rate. Thus, shrinkage automatically makes for more appropriately conservative comparisons while not reducing the power to detect true differences. The final advantage is dealing with zero counts. With raw rates, if the numerator of a cell is zero, then the associated rate can only be zero. Moreover, if you calculate a saturated fixed-effects model with such a count, the estimate will fail to converge and you will get impossible, uninterpretable values (accompanied and signalled by exceptionally large standard errors).³ But this does not happen in the random-effects approach; what does happen depends on the expected value. A zero based on an expected value of 1 means a quite different thing from a zero based on an expected value of 100 – for the former, it is uncertain whether the rate is really zero; for the latter, we can be quite confident in this inference. The random-effects estimate shrinks more towards the overall rate for the former than for the latter.

The models are run using Markov Chain Monte Carlo (MCMC) estimation (Jones and Subramanian, 2014) within the standard multi-level modelling software (MLwiN), and we have checked that the models have been run for sufficient time (a discarded burn-in of 5000 simulations followed by a monitoring phase of 500,000) to ensure convergence and to obtain reliable 95% CIs. We have used default priors to impose as little information as we can on the estimates so that the results are data-driven.

The approach illustrated: voting at the 2015 British general election

To illustrate the argument, we use the example of voting at the 2015 UK general election, by age, sex and educational qualifications, using data from the post-election wave of the British Election Study (BES) Internet panel survey.⁴ Most studies have found that age and sex are related to party choice. Educational qualifications are used here as a proxy for social class – in part because of incomplete data on other possible measures (such as occupation and individual or family income); those with higher level qualifications in general are in higher status occupations and have higher incomes. It is generally accepted that social class has become a less important influence on voting choice in recent decades (see Whiteley et al., 2013), but it remains relevant (as the results presented here show). In any case, the goal of the present analysis is not to contribute substantially to appreciation of British voting patterns but rather to illustrate the advantages of the method proposed for uncovering substantively interesting and well-supported patterns that might not otherwise be found. Respondents were placed into four groups: those with no qualifications, those whose qualifications were only those associated with the official school-leaving age (in England formerly termed ‘O levels’ and now General Certificate of Secondary Education (GCSE)), those with post-school-leaving-age qualifications below the status of university degrees and diplomas, and those with degrees or diplomas.

For the analysis, we have excluded all respondents living in Scotland and Wales, where voters had a different set of outcome choices – the respective nationalist parties (the Scottish National Party (SNP) and Plaid Cymru) contested all seats there in 2015. (Northern Ireland is not included in the BES.) For England alone, therefore, we study those who voted for one of the five parties which contested virtually all of the seats,⁵ plus those who reported that they did not vote.⁶ The small number of BES respondents who voted for another party are excluded. This gives a sample size of 20,966 and a multivariate contingency table comprising six vote choices, two sexes, six age groups and four qualification categories – a total of 288 cells, giving an average of 73 respondents per cell. Nevertheless, many of the cells contained small numbers of respondents, as shown in Figure 1; the distribution is highly skewed (the median number of observations per cell was 44) – hence the value of the modelling approach adopted here that pools information to produce weighted estimates of rates in cells with small observed and/or expected values. To be clear, the expected value is derived for each age by sex by qualification group if the overall national rates of vote choice applied.

Figure 1.

Histogram of the number of observations in each of the 288 cells of the sex by age by qualifications by vote contingency table.

The baseline

To provide a baseline for the application of our proposed method, we conducted multinomial logistic regression analyses with voting choice as the dependent variable, as would be the norm in studies of such contingency tables. Five models were run: the first three incorporated each of the ‘independent’ variables separately, the fourth included both age and sex and the fifth all three. The results – giving the regression coefficients and their standard errors, with significant differences from zero shown in bold – are presented in Table 1. The reference category for the dependent variable is voting Conservative, so each coefficient shows the probability of voting for another party rather than Conservative for the chosen group relative to its comparator. Thus, in Model 1, the first significant coefficient of −0.103 indicates on the logit scale that males were significantly less likely than females to vote Labour rather than Conservative.

Table 1.

Multinomial logistic regression models of voting in England in 2015, by sex, age and educational qualifications.

	Labour	LibDem	UKIP	Green	DNV
Model 1
Sex (comparator: Female)
Male	−0.103 (0.031)	0.129 (0.047)	0.195 (0.046)	−0.102 (0.066)	−0.212 (0.48)
Nagelkerke’s R²	0.003
Chi-square	78.129
Model 2
Age, years (comparator: 66+)
18–25	0.632 (0.060)	0.395 (0.085)	−0.482 (0.100)	2.279 (0.148)	1.825 (0.102)
26–35	0.555 (0.056)	0.285 (0.079)	−0.321 (0.085)	1.628 (0.151)	1.617 (0.099)
36–45	0.593 (0.057)	0.199 (0.083)	0.045 (0.081)	1.407 (0.157)	1.187 (0.106)
46–55	0.548 (0.055)	0.143 (0.080)	0.371 (0.072)	0.983 (0.161)	0.980 (0.105)
56–65	0.309 (0.053)	−0.002 (0.077)	0.245 (0.069)	0.599 (0.163)	0.664 (0.105)
Nagelkerke’s R²	0.053
Chi-square	1272.788
Model 3
Qualifications (comparator: Degree)
None	0.408 (0.063)	−0.664 (0.115)	0.996 (0.085)	−0.905 (0.184)	0.762 (0.090)
Leaving	0.105 (0.038)	−0.882 (0.063)	0.740 (0.056)	−0.877 (0.093)	0.477 (0.059)
Leaving+	0.102 (0.041)	−0.201 (0.058)	0.254 (0.065)	−0.021 (0.077)	0.405 (0.063)
Nagelkerke’s R²	0.035
Chi-square	837.434
Model 4
Sex (comparator: Female)
Male	−0.110 (0.032)	0.123 (0.047)	0.200 (0.046)	−0.122 (0.066)	−0.226 (0.048)
Age, years (comparator: 66+)
18–25	0.632 (0.060)	0.394 (0.085)	−0.483 (0.100)	2.280 (0.148)	1.826 (0.102)
26–35	0.554 (0.056)	0.286 (0.079)	−0.319 (0.085)	1.627 (0.151)	1.615 (0.099)
36–45	0.593 (0.057)	0.200 (0.083)	0.046 (0.081)	1.406 (0.157)	1.187 (0.106)
46–55	0.550 (0.055)	0.141 (0.080)	0.368 (0.072)	0.985 (0.161)	0.983 (0.105)
56–65	0.303 (0.053)	0.004 (0.077)	0.255 (0.069)	0.593 (0.163)	0.652 (0.105)
Nagelkerke’s R²	0.057
Chi-square	1354.807
Model 5
Sex (comparator: Female)
Male	−0.102 (0.032)	0.098 (0.047)	0.226 (0.046)	−0.135 (0.067)	−0.202 (0.049)
Age, years (comparator: 66+)
18–25	0.761 (0.062)	0.215 (0.088)	−0.228 (0.103)	2.148 (0.152)	2.209 (0.107)
26–35	0.681 (0.057)	0.101 (0.081)	−0.065 (0.088)	1.497 (0.154)	1.983 (0.103)
36–45	0.699 (0.058)	0.082 (0.085)	0.221 (0.083)	1.331 (0.159)	1.455 (0.108)
46–55	0.622 (0.055)	0.111 (0.081)	0.449 (0.074)	0.976 (0.162)	1.131 (0.107)
56–65	0.337 (0.053)	−0.023 (0.077)	0.300 (0.070)	0.576 (0.163)	0.729 (0.106)
Qualifications (comparator: Degree)
None	0.651 (0.066)	−0.600 (0.108)	0.987 (0.088)	−0.294 (0.189)	1.434 (0.097)
Leaving	0.193 (0.040)	−0.739 (0.064)	0.679 (0.057)	−0.564 (0.096)	0.831 (0.062)
Leaving+	0.084 (0.042)	−0.204 (0.059)	0.293 (0.066)	−0.135 (0.079)	0.342 (0.065)
Nagelkerke’s R²	0.091
Chi-square	2208.197

UKIP: UK Independence Party; DNV: Did Not Vote.

Statistically significant coefficients at the 0.05 probability level or better are shown in bold.

The overwhelming conclusion to be drawn from Table 1 is that all three independent variables are significantly related to voting choice – both separately and together. For example, males were less likely than females either to vote Labour rather than Conservative or to abstain rather than vote Conservative, and more likely to vote for either the Liberal Democrats or UK Independence Party (UKIP); there was no significant difference between males and females in their propensity to vote for the Green party. In general, younger people were more likely to vote Labour, Liberal Democrat or (especially) Green, or not to vote, rather than vote Conservative, compared to those aged 66 and over: those aged under 36 were significantly less likely to vote for UKIP rather than Conservative than those in the oldest age group, whereas those aged between 46 and 65 were more likely. There were also statistically significant and, given the varying size of the coefficients, substantial differences in voting by class. Those with no qualifications were more likely than those with degrees either to vote UKIP or to abstain rather than vote Conservative, for example.

For many studies, Table 1 would be the conclusion: significant patterns had been identified – very much in line with previous work and expectations. But two elements of that table raise questions. The first is the low level of ‘explanation’ provided by all the models: even Model 5, with all three independent variables included, accounts for less than 10% of the variation in voting choice – which leaves a great deal ‘unexplained’. That may well be because we have an under-specified model which excludes a large number of extra variables that other studies have shown to be related to voting choice (Whiteley et al., 2013). The second is the change in some of the coefficients – notably between Models 1–3 and 5. This is particularly the case with the age variables. Those for voting Labour rather than Conservative are substantially different between Models 2 and 5; those for voting Liberal Democrat rather than Conservative even more so – indeed, two that were highly significant in Model 2 are insignificant in Model 5. This suggests a degree of collinearity between age and qualifications, which may be confounding the ‘true’ relationships in the final model: members of some age groups may be more likely to vote Liberal Democrat in some classes than others.

These two issues together suggest that there may be interactions – that there are, for example, not only differences in voting by age and by sex separately but also by age and sex together. But an attempt to fit a Model 6 – introducing all of the possible two- and three-way interactions among the variables – failed because of singularities in the Hessian matrix resulting from the large number of cells with zero or near-zero values. It was not possible, therefore, to compare the outcome of our multi-level modelling (discussed later) with that of a multinomial regression incorporating the full set of possible influences on voting patterns – such as differences between age-sex groups within those with the same qualifications.⁷ Furthermore, even if a Model 6 could be fitted, all of the one-way ratios together with the two-way and three-way interaction ratios have to be taken into account when determining the size of any significant differentials; interpretation is far from straightforward. Hence, the value of the approach adopted here, which generates a modelled rate, with CIs, for each cell – that is, for each of the six voting options, including Conservative, which is a comparator only in the multinomial regression model and for which there is no direct indicator of where its support is strongest and weakest. The method introduced here avoids those difficulties by providing a single coefficient for each cell of the n-way table being analysed, with an indication of whether it is significantly different from the null effect of 1.0 on the exponentiated scale; the output from the multi-level modelling is more readily interpretable.

Unpacking the table

Rather than explore the interactions through the multinomial logistic regression framework, therefore, we have analysed the 6 × 2 × 6 × 4 contingency table using the method introduced above. The output from this is 288 modelled rates, each with its Bayesian CIs, and these rates are shown in Tables 2 to 7 – one each for the voting choices. Table 8 and Figure 2 provide summaries of the number of significant rates – that is, those that are reliably different from 1.0, according to their CIs, for each category in each of the independent variables and for each of the voting choices.

Table 2.

The modelled values (with their CIs) for the rates of voting Conservative by sex, age and educational qualifications.