Abstract
In this visualization, we demonstrate how to use radar plots to represent the class-specific posterior response probabilities from latent class analysis results. These plots allow for a simple representation of the class differences in the distributions across the modeled indicators. We demonstrate the utility of this approach with results from a published model of women’s employment and family life circumstances. In doing so, we demonstrate how to avoid some of the pitfalls common to radar plots and provide example code to allow other researchers to readily adapt this approach to present their own results.
Latent Class Analysis
Sociologists have increasingly utilized a variety of cluster-based analytic strategies, especially those that incorporate the estimation of relationships between latent traits and observed behaviors. One particularly common form of this type of model is latent class analysis, or LCA (McCutcheon 1987).
LCA uses observable indicators to yield unobserved, or latent, probabilities of endorsing y1 . . . yk response patterns accounting for the covariance among observed indicators. Its basic form is defined by
where
In reporting LCA results, it is common to disclose both the probability of class membership (gammas) and the distribution of class-specific item-response probabilities (rhos). For the latter, it is difficult to concisely and clearly represent the full pattern of response probabilities across classes, any time there are more than a few variables with a limited number of response items, combined with few classes.
Alternatives for presenting LCA results have typically included (a) tables with each class-item-response category probabilities reported, sometimes highlighting patterns across the observed variable distributions and item probabilities; (b) line/bar charts that group classes together; and occasionally (c) approaches that summarize some of that information, for example, with ternary plots separating out the differences in response patterns across classes (Bakk and Roux 2017). While rhos inherently include three dimensions of interest—(1) response-item distributions, (2) class memberships, and (3) the differences in (1) by (2), none of the existing approaches for presenting these results optimize across all three of these dimensions simultaneously, instead prioritizing some over the others. For example, tables can provide detailed item-response distributions but make seeing the class differences across those difficult. Simple line or bar charts can make the distributions visible but can only cluster classes together (making comparisons across them difficult) or variables together (making class compositions difficult to see). Alternatively, while ternary plots make the differences between classes easily perceptible, the underlying item-response distributions are obfuscated.
Visualizing LCA Results with Radar Plots
Here, we suggest radar plots (Nakazawa 2018) accomplish all three of these aims simultaneously, demonstrate their use as applied to a previously published result, and provide replicable code (in ways that incorporate these optimizations) for others to adapt for presenting their own results.
The radar plots presented in Figure 1 use the LCA solution from Lippert and Damaske (2019) on young adult women’s work and family formation trajectories. 1 For presenting results in the radar plot format, we choose to employ two options. 2 First, we normalize each variable’s value into a single index ranging from 0 to 1. Normalization requires multiplying the item-specific response probabilities by a scaling factor determined by the order and number of categories within each item. For illustration purposes, the variables in the presented model include dichotomous, trichotomous, and other ordered variables. Second, we reverse code indicators as necessary to ensure all normalized values are oriented in the same direction. 3

For the seven classes identified, four are characterized by full-time employment (left panel) and three by lower engagement with paid labor (right panel). Two classes—professional workers with and without children—were similar with respect to their employment status, propensity for well-paid high-skilled work, and job decision latitude but differed by their relationship status and whether they had children in their care (see blue vs. orange polygons in the left panel). Further distinctions between the identified classes are visible by comparing across the distribution of item posterior response probabilities between the classes.
Our approach provides a means for visually representing LCA results 4 that allows readers to easily compare class-level differences in the pattern across distributions of the item-specific response categories. We hope that the companion code will allow future researchers to more easily present LCA results in their own research.
Footnotes
Supplemental Material
1
For the complete table of the posterior probabilities included in this solution, see the .csv file in the Appendices.
2
These options can be turned on/off in the provided code.
3
In the case presented here, this required subtracting the result of the normalization step from 1 for the variable indicating number of children in one’s care.
4
This approach is likely also readily adaptable to the presentation of results from other clustering-based approaches.
