Exploring the relationship between personality,sociodemographic factors,and mental health: A design study

Abstract

Mental health research increasingly considers individual differences such as personality traits and sociodemographic factors, yet existing analytical practices rely largely on statistical software, which limits exploratory analysis and hypothesis generation. From a visualization perspective, there is a lack of domain-informed design studies that investigate how interactive visual analytics can support the joint exploration of these factors. We present a visualization design study conducted in close collaboration with personality psychology experts, aimed at supporting exploratory analysis of relationships between sociodemographic, personality, and mental health data. Through an iterative co-creation process involving interviews, design workshops, and prototyping, we designed FFM-MHI Vis, an interactive visualization system integrating coordinated views such as parallel coordinates, boxplots, scatterplots, and Sankey diagrams. The system was evaluated with domain experts through a task-based study comprising six analytical tasks. Results show high perceived usability (84.92 ± 1.94), perceived usefulness (91.42 ± 1.65%), and ease of use (87.14 ± 1.67%). We contribute with reflections and lessons learned from the design process, demonstrating how visualization can effectively support exploratory analysis and hypothesis generation in personality and mental health research.

Keywords

design study psychology personality five-factor model mental health sociodemographic factors

Introduction

The topic of mental health has been gaining more attention in recent years, focusing on how we promote, protect, and care for mental health.¹ According to the World Health Organization (WHO), mental health can be defined as “a state of complete physical, mental, and social well-being,”² underlining that mental health is integral to overall health.³ Several factors affect mental health.^4–7 They can range from external factors, such as society as a whole or the professional occupation, to individual differences like personality. In particular, individual differences are the primary contributor to mental health levels.^8,9 Personality is the agglutination of habits, traits, attitudes, and ideas.¹⁰ On the one hand, neuroticism is linked to higher engagement with mental healthcare, even in people who may hold negative attitudes about seeking treatment.¹¹ Additionally, traits like extraversion have been weakly linked to higher social support seeking, while conscientiousness may also play a positive role in support networks, both of which are protective for mental health.¹¹ On the other hand, personality disorders (e.g. borderline or avoidant) are associated with wide-ranging mental health difficulties, poorer psychosocial functioning, and diminished quality of life.¹² Considering the significant role of personality as central and dynamic factor influencing mental health, its assessment and consideration are increasingly integrated into prevention, diagnosis, treatment, and recovery strategies in modern mental health care.^9,12,13

Social and demographic contexts also have a strong influence on personality by defining one’s decision-making process.¹⁴ Several studies address the interaction between personality and sociodemographic factors to verify how the latter shapes personality disorders¹⁵ or parental burnout,¹⁶ among others. Although there is access to these types of data, further research that leverages personality and sociodemographic factors in their relationships with mental health is needed to extend our understanding of these synergies.¹⁷ Moreover, the results of these studies are aggregated usually into tables.¹⁸ A tabular presentation is not suitable for efficiently exploring multidimensional or temporally rich mental health data, although it presents informative data as well as precise.¹⁸ A recent research agenda highlights the lack of tools to digest and visualize such amounts of data.^19,20 This problem motivated the present study. Our goal is to leverage visual analytic tools to allow the user to “comprehend large amounts of data immediately and perceive emergent, unanticipated properties” while facilitating “the understanding of large-scale and small-scale data features.”²¹

This work described a visualization design study with personality psychology experts based on a 16-month collaboration. We explored which information visualization techniques are more suited to analyze the relationship between sociodemographic, personality, and mental health factors. Our objective is that, by exploring data through visualization, experts can generate research hypotheses that later can be tested with the data. Our work follows the nine-stage framework²² design study methodology. In particular, we conducted semi-structured interviews with domain experts to design and prototype an information visualization tool. The tool was suggestively named FFM-MHI Vis and its main objective is to support domain experts focused on the relationship between the three categories of data (sociodemographic, personality, and mental health factors) through well-known chart types: parallel coordinates, boxplot, scatterplot, and Sankey diagram. We also ran user tests to assess both the utility and the usability of the prototype. Finally, we summarize the findings of this design study.

The contributions of this paper are threefold. First, we provide a problem characterization and abstraction of analytical practices in personality psychology, identifying challenges faced by domain experts when exploring relationships between sociodemographic, personality, and mental health data. Second, we present and validate an application-oriented visualization design, developed through participatory design with psychology experts, which integrates established visualization techniques to support exploratory analysis and hypothesis generation. This validation includes task-based utility and usability evaluations, reporting performance as well as perceived usability, usefulness, and ease of use. Third, we offer reflections from a long-term, real-world collaboration with domain experts, highlighting lessons learned and design considerations for applying information visualization methods in psychological research contexts.

Related work

Recent advances in interactive information visualization systems have increasingly focused on integrating individual differences, in particular personality, mental health status, and socio-demographic context, to support psychological research and clinical insight. Tools like the Tulsa Life Chart¹⁹ offer life-course visualizations of mental health trajectories, enabling transdiagnostic comparisons across age, sex, and diagnostic groups. HealthPrism²³ extends this approach by incorporating behavioral sensor data and contextual variables to explore factors influencing children’s well-being. Similarly, mobile applications²⁴ using the Omaha System facilitate the visualization of complex interdependencies between psychosocial, environmental, and physical health domains through techniques like alluvial diagrams and parallel coordinates. Shiono et al.²⁵ further emphasize dynamic exploration of individual psychological states over time, providing a novel lens on personality and mood evolution through interactive utterance-based representations with an interactive visualization system for psychological topology. TherapyView²⁶ is another relevant example focused on mental health conditions, including anxiety, depression, and schizophrenia. The system analyzes psychotherapy session transcripts to generate visual representations that provide a concise summary of the content covered in each session. Finally, MBTIviz²⁷ analyzes a MBTI-demographic dataset to calculate age, gender, and occupation percentages for each MBTI and study correlations between those variables. Together, these systems illustrate a growing emphasis on holistic, person-centered analytics in psychological visualization research.

While these systems highlight the potential of visualization tools to support personalized mental health insights, most remain researcher-driven and lack direct involvement from end users in their design. Addressing this gap, Sama et al.²⁸ present the only known study to adopt a co-design methodology. The authors extended the mindLAMP platform²⁹ by implementing a clinician-facing data visualization dashboard named mindLAMPVis. It integrates both active (self-reports, cognitive assessments) and passive (digital phenotyping) patient data, including socio-demographics, clinical phenotypes, and behavioral patterns. Furthermore, the dashboard is used to detect patterns (such as relapse in schizophrenia) and compare complex trajectories across various patient-level factors. This research gap clearly shows the lack of empirical data to provide robust guidelines for practitioners to leverage psychological factors on the design pipeline of visualization systems.

Among the state-of-the-art research in human–computer interaction, user-centered design³⁰ has proven to be an effective framework for increasing system usability, acceptance, and relevance.³¹ Central to this approach is the explicit consideration of stakeholders’ needs, abilities, and practices throughout the design process.^32,33 Beyond improving usability, user-centered methods support rapid iteration^34,35 and foster stakeholder engagement, which has been shown to improve both system quality and long-term adoption.³⁶ Within this paradigm, participatory design (PD) has been widely adopted to surface domain-specific requirements and to co-create solutions with end users.^37,38 Grounded in principles such as mutual learning, empowerment, transparency, and shared ownership,³⁹ PD has proven particularly valuable in sensitive domains, including mental health, where contextual knowledge and trust are critical.^40–42 However, while these studies demonstrate the benefits of involving users, they often focus on usability outcomes rather than on articulating transferable design knowledge.

Visualization design studies provide a complementary methodological lens by explicitly connecting domain problems, design decisions, and reflective insights.²² As defined by Sedlmair et al., design studies aim not only to produce a working system but also to contribute generalized lessons that refine visualization design guidelines. Prior work has successfully applied this approach in diverse domains,^43–45 demonstrating its effectiveness in uncovering latent domain needs and shaping visualization idioms accordingly.⁴⁶ Despite known challenges, including managing scope and balancing generality with specificity,^22,47 sustained collaboration in design studies has been shown to increase trust, rapport, and co-ownership between visualization researchers and domain experts.⁴³

While there is a growing body of work on visualization systems for psychological and mental health data, most existing approaches emphasize analytical outcomes or system functionality, with limited attention to how design decisions emerge through iterative collaboration with domain experts. In particular, we found no prior visualization design study that systematically addresses the joint exploration of sociodemographic, personality, and mental health factors through a co-creation process. This absence limits the availability of empirically grounded design rationales that could inform visualization practitioners working in personality psychology and related domains. To address this gap, we conducted a design study that documents the iterative development of FFM-MHI Vis, a visualization system co-designed with a personality psychology expert to support pattern analysis across these intertwined dimensions. The following section details the iterative design process that guided this work.

Design study methodology overview

Sedlmair et al.’s nine-stage framework for visualization design studies²² offers a well-established methodological foundation for addressing complex, real-world problems through close collaboration with domain experts. Rather than prescribing a linear process, the framework characterizes design studies as iterative and reflective, spanning phases concerned with domain understanding, problem framing, design exploration, system realization, and knowledge dissemination. As such, it has been widely adopted to support visualization research in domains where analytical practices, data characteristics, and user needs are not yet well understood.

We adopted this framework as a guiding structure for our collaboration with personality psychology experts, using it to organize and reflect on the progression of our design activities. Our work primarily engaged stages from learn and winnow, focused on understanding the experts’ analytical practices and challenges when studying relationships between sociodemographic, personality, and mental health factors, through cast and discover, where we abstracted domain questions into analytical tasks and explored candidate visualization solutions through iterative prototyping. These stages informed subsequent design and implement activities, during which the visualization concepts were refined and realized as a functional system.

Throughout the process, the framework supported a continuous dialog between usability considerations and analytical expressiveness, ensuring that the resulting tool remained both accessible to domain experts and capable of supporting in-depth exploratory analysis. In line with Sedlmair et al.’s emphasis on reflection, insights gained during deploy and evaluation were used to consolidate design decisions and articulate lessons learned. In particular, we carried out a visualization design study for 16 months in close collaboration with personality psychology experts. Figure 1 depicts the co-creation phases. We conducted four formal discussion meetings with a focus on paper and rapid prototyping^48,49 based on related literature.^22,46,47 Although our sample was small, the co-design activities offered an overview of the mental model and experience of the experts throughout the design study. Moreover, the constant discussions allowed us to adjust the prototype to the requirements of the experts. Together, these stages structure the design study reported in this paper and provide the foundation for the development of FFM-MHI Vis.

Figure 1.

Overview of the iterative design study process.

Learn, winnow, and cast: Establishing the collaboration and problem frame

The objective of this phase was to gain a deep understanding of the target domain, identify appropriate collaborators, and frame a tractable research problem aligned with expert needs. Through initial engagement with the (Omitted Research Center) at (Omitted University), we identified a group of domain experts working at the intersection of sociodemographics, personality, and mental health. Discussions focused on the structure of their database, the types of variables collected, current analytical practices, and practical constraints in data exploration. It included the mentioned factors, and the expert used the data in their studies. However, such a large amount of data hinders the analysis process, which led to psychology experts constantly seeking alternative ways to process the data in collaboration with computer science experts through visualization techniques. Three experts took part in the co-creation process to design FFM-MHI Vis, whose inclusion helped eliciting requirements and design ideas and improving our design decisions in every development stage.⁴⁶ One expert was a PhD candidate in personality psychology and the other two were professional personality psychology clinicians. These three experts are co-authors of this paper.

Key takeaways

The key takeaways from this phase included the establishment of a clear collaboration with defined expert roles, agreement on which subset of the data and tasks would be most relevant for visualization support, and an understanding of expert needs and pain points that would guide the design study.

Discover: Problem characterization and abstraction

The objective of the discover phase was to elicit expert analytical questions, explore potential visualization solutions, and identify candidate tasks that a visualization tool should support. We conducted semi-structured interviews to understand the experts’ reasoning, priorities, and current challenges in exploring their dataset. Moreover, we assessed whether data visualization could provide a beneficial approach to their research pipeline and, in that case, which features must be present to fulfill the necessary tasks.⁴⁶ This work allowed us to position the design study regarding information location and task clarity.²² The following sections present the problem characterization and our reflections from the interview.

Domain

Personality includes stable differences that exist in individuals.⁵⁰ Nevertheless, one cannot define personality without also defining personality traits⁵¹ and checking what can be drawn from them to analyze someone as a whole. Over the years, researchers clinically studied several theories about personality. In this study, we follow the Five-Factor Model 288 (FFM),⁵² also known as “Big Five,” since it is composed of the five most important personality traits existent in one’s personality⁵³: neuroticism, extraversion, openness to experience, agreeableness and conscientiousness. Several distinct approaches have commonly measured these constructs using surveys or questionnaires.⁵⁴ One of these apparatus is the NEO-FFI, which consists of 60 questions (12 per trait) to evaluate the FFM. It is a reliable version, showing an internal consistency between 0.68 and 0.86.⁵⁵

Regarding mental health, Veit and Ware Jr.⁵⁶ developed the Mental Health Inventory, also named MHI-38. Its primary goal is to measure general psychological distress and well-being with 38 questions that evaluate five different factor groupings: anxiety, depression, behavioral/emotional control, general positive affect, and emotional ties. The first three factors are related to psychological distress, while the last two are associated with psychological well-being.⁵⁶ This questionnaire allows researchers to measure the Mental Health Index (MHI), and several subsequent smaller questionnaires were created to ease data collection like the MHI-5.⁵⁷ In particular, the MHI-5 has shown a correlation between 0.93 and 0.95 with the original MHI-38.^58,59

Finally, sociodemographic factors include a combination of social and demographic factors. While social factors (e.g. wealth, household, education, social mobility, employment status, income inequality, and community safety⁶⁰) affect the lifestyle of oneself, demographic factors (e.g. age, gender, and marital status) are more related to the statistic study of human populations.

Semi-structured interview

We conducted a semi-structured interview with a personality psychology researcher in the discover phase.²² Its purpose is to characterize and abstract the domain problem met by the experts. In addition, we can better understand the types of tasks and information we need to make available. The interview started with an introduction by the expert regarding the domain. Then, the expert explained how they use the dataset currently and the most frequent tasks of the experts in a domain-specific language. Afterward, we asked 10 questions to understand which requirements the visualization tool should meet. The questions translated from (Omitted Nationality) are the following:

Q1. Should we focus on all sociodemographic variables in your dataset? Are there any relevant subsets of variables?

Q2. Should the tool display responses to individual items? Should it only consider the overall scores for the psychological constructs?

Q3. Should the tool provide an overview of the data and subject-wise? Should it allow to compare subjects?

Q4. Should the tool support conditional filtering, for example, only show the individuals with a conscientiousness score higher than 30?

Q5. Should the tool display sociodemographic, personality, and mental health data at the same time? Should they be presented in the same container or separated?

Q6. Should the tool have a single page or multiple pages?

Q7. While showing a single subject, which sociodemographic, personality and mental health factors should be presented? Should individual items of the questionnaires also be displayed?

Q8. Do you have any preference for data charts to display these types of information?

Q9. Should the tool have color encodings? Should they have a dichotomy semantic in personality traits and mental health?

Q10. In which devices do you intend to use the tool?

Data

We used a novel research-orientated database comprising 814 adults from the general population (mean age = 40.09 years, SD = 14.25 years; 60.8% female; 48.8% with higher education). The co-authors of this article collected the data to support the study of multidimensional psychological phenomena by jointly collecting standardized measures of personality traits, mental health indicators, and sociodemographic variables in a unified, linkable data structure. Table 1 presents each variable and its data type. Data pre-processing included calculating the values for each of the FFM traits, resulting in five more variables for personality (continuous), as well as a MHI score (continuous). We also created a script to convert an SPSS file into a .csv. For the purposes of this work, we sampled 200 individual records (70 male, 130 female) ranging from 18 to 76 years old (mean age 41, SD = 13.25; 65% female; 42.5% with higher education).

Table 1.

Variables and measurement scales included in the dataset.

Category	Variable	Data type
Sociodemographic	Age	Continuous
	Gender	Nominal
	Nationality	Nominal
	Residence	Nominal
	Schooling level	Ordinal
	Profession	Nominal
	Current economic situation	Nominal
	Marital status	Nominal
	Household	Nominal
	Religion	Nominal
Personality (NEO-FFI)	Item 1–Item 60	Ordinal
Mental health (MHI-5)	Item 1–Item 5	Ordinal
Personality (derived)	Five FFM traits	Continuous
Mental health (derived)	Overall MHI score	Continuous

Use cases

The domain expert reported that the purpose of the database is to study the population from a macro perspective. Based on personality psychology, the domain experts create hypotheses to verify whether there is any relationship between a set of psychological variables, such as FFM traits or the MHI. One familiar use case is to collect and compare data from two samples. For instance, psychology researchers collect data from the general population to serve as the ordinary sample, and the second from individuals diagnosed with depression. Then, domain experts study whether there are moderation effects between the psychological variables through quantitative statistical methods. Finally, the domain expert explained exemplary use cases concerning the database we have at hand. For instance, experts want to check whether there are clusters based on age groups or gender that show inter-differences in mental health or personality characteristics.

Expert tasks

Following the discussion with the domain expert, we understood that the researcher actively wants to comprehend whether intersections between sociodemographic, personality, and mental health factors exist. Although this high-level domain task looks exploratory, we actively promoted the researcher to narrow it into low-level tasks. This approach allows us to focus the design process and, at the same time, contextualize the expert intentions into primitive visual analytical tasks.⁶¹ Additionally, we include a translation of the domain tasks into abstract forms of visual analytical. The following Expert Tasks (ET1–ET3) were derived directly from the semi-structured interview questions (Q1–Q10):

ET1. Quantify and visualize the distribution of psychological variables and questionnaire responses. The domain experts know each item score from the NEO-FFI and MHI-5 questionnaires for the subjects in the database. Based on these scores, it is also possible to calculate the general score of the FFM and the MHI psychological variable. In addition, the experts know how each individual reported their sociodemographic factors. It poses a very appealing opportunity to study whether the distribution of a variable correlates with another one’s distribution or whether there are substantial differences based on sociodemographic factors. In particular, the responses to the items in the questionnaires allow the domain experts to study with finer granularity these relationships. In particular, experts want to examine related data to understand observation.

ET2. Filter the sample through multiple variables at the same time. With many psychological variables, the domain expert raised the need to cross several distributions and investigate clusters of individuals that satisfy a set of parameters. For instance, the dashboard should support tasks such as “Do neurotic female individuals have poor mental health scores?” In this case, experts want to iteratively identify attribute differences and similarities between populations.

ET3. Show an aggregated view of the personality variables by age groups. To provide a broader analysis of the effect of age on the remaining factors, the domain expert highlighted that part of their data analysis includes grouping individuals based on their age. These groups should be pre-defined and allow an aggregated view of the data. In terms of visual analytics, experts want to iteratively identify attributes at an aggregated level.

In particular, ET1, focused on quantifying and visualizing psychological variables, was informed by questions regarding individual and overall scores (Q2, Q5, Q7); ET2, concerning filtering across multiple variables, was guided by questions on comparison and conditional filtering (Q3, Q4, Q5); and ET3, regarding aggregated views by age groups, was derived from questions about grouping and overview (Q1, Q5, Q7). Additional questions on interface layout, visualization idioms, color encoding, and platform preferences (Q6, Q8, Q9, Q10) further informed the design decisions. By abstracting these tasks into common terminology in the visualization domain, we hope that researchers in other domains can more easily understand this domain study and apply the design outcomes to other situations.⁶² Additionally, these tasks suggest that we follow a strategy such as the Focus-plus-Context rule, where viewers can check the object of interest in detail while getting an overall impression of all the surrounding information.⁶³ The information location should be predominantly in the visualization. However, the interpretation of personality and mental health scores is implicit knowledge of the expert.

The researcher showed a strong preference for scatterplots when asked about their knowledge of information visualization methods. We believe that the core academic education of the expert may be responsible for this preference since their current data analysis procedure leverages only scientific software such as spreadsheets or SPSS. Both these tools commonly use scatterplots and line charts to display information based on a query from the researcher to perform a statistical test. Therefore, these practices hinder the interaction with other types of data visualizations, emergent data patterns, and preemptive data analysis. Indeed, they may decrease the quality of data analysis since performing one statistical test at a time with such a multidimensional dataset may be too cumbersome and require a lot of time to be conducted. In addition, the researcher highlighted that it would be relevant to have a color scheme that favors an immediate assessment of an ordinal item and that a dichotomy semantic was not appropriate. Finally, the researcher asked that the tool should have a single page and be preferably a website available in a desktop setting since they were more familiar with this technology.

Key takeaways

In the discover phase, we identified the core analytical needs of the personality psychology expert and clarified how a visualization tool could support their research. Three expert tasks were derived from the semi-structured interview: visualizing and quantifying psychological variables (ET1), filtering samples across multiple variables (ET2), and examining aggregated distributions by age groups (ET3). These tasks were informed by specific interview questions regarding data scope, comparison, grouping, and interaction preferences. The expert emphasized scatterplots for individual-level analysis, the use of color encoding for rapid ordinal assessment, and a single-page, desktop-oriented interface. Collectively, these insights established both the functional requirements and the visual strategies that guided subsequent design iterations, while also confirming the relevance of integrating sociodemographic, personality, and mental health factors into coordinated visualizations.

Design: Converging on a coherent visualization concept

The objective of the design phase was to refine candidate visualization solutions into an integrated system concept that effectively supports the experts’ analytical needs. This section includes the design of the prototypes, the workshops to refine them, and our reflections on the activities.

Choosing the idioms

By using analogy-based creativity methods⁶⁴ as well as the visualization awareness method,⁶⁵ we designed a curated collection of possible visualizations focused on the data and task types. In particular, we chose the previous set of idioms based on their ability to depict relationships between continuous/ratio (e.g. the FFM traits and the MHI) and categorical data (e.g. sociodemographic factors like marital status and gender). The main goal of this phase was to understand the stakeholders’ preferences regarding the implementation of the idioms. We considered the following idioms: radial bar charts, radar charts, parallel coordinates, Cleveland dot plots, scatterplots, boxplots, histograms, and heatmaps (see Figure 2).

Figure 2.

Examples of sketches of different charts designed for the converging workshop. (a) Radial bar chart. (b) Radial bar chart. (c) Scatterplot. (d) Boxplot. (e) Radar chart. (f) Cleveland dot plot. (g) Histogram. (h) Parallel coordinates.

We conducted another semi-structured interview with the same expert from the problem characterization session. In contrast, we started by showing the idioms one at a time to the researcher and explaining how they could visualize information. Then, we encouraged the researcher to externalize their ideas regarding which visualizations looked easier to understand and how they could apply them to their domain. In addition, we asked six questions based on the experts’ tasks to check how much information density the researcher would like to find on the screen and point out their preferred interaction method. The questions translated from (Omitted Nationality) were the following:

Q1. To compare the FFM traits and MHI of different subjects, which of these sketches is/are more suitable for the task?

Q2. Would it be interesting to have a scatterplot that allows the user to show the distribution of the FFM traits and MHI between genders and ages?

Q3. To filter results, when applicable, would you rather have text boxes (to insert values manually), or a slider?

Q4. Would it be of your interest to check the individual answers to the NEO-FFI and MHI-5 questionnaires immediately (using a color scheme to detect the answer visually), or would you rather use the mouse and hover to check each answer when necessary?

Q5. How would you distribute these idioms chosen across the screen?

Q6. Do you have any more suggestions to make to show interactivity between idioms?

The expert’s answers to questions Q1–Q6 provided concrete guidance for narrowing down the set of idioms and defining their intended use. The expert consistently favored representations that are commonly employed in psychology research and perceived as analytically trustworthy, which motivated the selection of scatterplots and boxplots over more decorative or unfamiliar alternatives. Responses to questions concerning interaction (Q3 and Q4) highlighted the importance of direct manipulation and detail-on-demand, leading to the use of sliders for filtering and color-encoded heatmaps complemented by hover interactions for precise item-level inspection. Feedback on layout and coordination (Q5 and Q6) emphasized the need to separate global overviews from detailed individual-level views, while maintaining visual linkage across idioms. These insights directly informed the configuration choices detailed in the following reflections, where each selected idiom was refined in close collaboration with the expert.

Results

The sketches of the possible visualizations rendered many discussions, providing the experts an initial view of how their ideas could lead to a concrete tool. In line with the expert’s responses to Q1 and Q2, which emphasized analytical familiarity and interpretability, the expert showed interest in the following idioms: a parallel coordinates chart, a scatterplot, a heatmap, a boxplot, and a histogram.

Regarding the parallel coordinates, we agreed, in line with the need to compare multiple psychological variables simultaneously (Q1), that it should contain six axes corresponding to each of the FFM traits and the MHI score. Each line would represent the scores of a subject across variables. In addition, the axes should include a drag-and-drop behavior to organize them in the most convenient order and support filtering through brushing, addressing the expert’s preference for direct manipulation expressed in Q3. It must also be easy to compare the values of a single individual with the global average of the sample.

We also agreed, following the discussion prompted by Q2, that the scatterplot should present each individual as a circle, with position encoding age and a selected psychological variable among the FFM traits or the MHI. This configuration allows experts to explore relationships between continuous variables while maintaining a familiar analytical representation. The heatmap was also straightforward, particularly in response to Q4, as it should contain all answers to the NEO-FFI and MHI-5 questionnaires. Each column corresponds to an FFM trait or the MHI, and each line represents an individual item. A user can perceive response patterns through color encoding while retaining the possibility of inspecting exact values on demand.

In contrast, we brainstormed whether representing each psychological variable with an individual boxplot or histogram would be advantageous. Since both idioms resulted in similar perspectives of the data, and following the expert’s interest in preserving both summary statistics and distribution shape (Q1), we opted for a violin chart. This idiom allows inspection of quartiles while also conveying distribution density. Two y-axes were used (one for the FFM traits and one for the MHI) to avoid misleading comparisons between variables with different ranges, while allowing the expert to focus on distribution shapes. Additionally, the expert could not initially identify a preferred idiom to analyze sociodemographic data. Based on the discussion initiated by Q6 regarding coordinated views, we proposed a Sankey diagram, whose flow-based structure supports the identification of distributions across categorical variables. Regarding layout (Q5), the expert emphasized the importance of dedicating a section of the interface to presenting all sociodemographic factors of a single individual, while allowing flexibility in the placement of the remaining idioms.

Key takeaways

In the design phase, we refined candidate visualization idioms into an integrated concept tailored to the expert’s analytical workflow. The semi-structured co-design activities clarified both the preferred visual representations and interaction strategies. Scatterplots and boxplots were selected for individual-level analysis due to their familiarity and interpretability, while parallel coordinates supported multi-variable comparisons. Violin charts were chosen to convey distributions alongside summary statistics, and a Sankey diagram was introduced to explore sociodemographic relationships. Interaction design emphasized direct manipulation, detail-on-demand, and coordinated views, with layout choices separating individual-level details from global overviews. These insights ensured that the final prototype aligned with domain-specific reasoning while remaining flexible and analytically robust.

Implement: Realizing the design

The objective of the implement phase was to develop a fully functional visualization tool based on the refined design. According to the previous visualization choices, we implemented a high-fidelity prototype, which we developed across two iterations concerning interface design principles based on Kachkaev et al.,⁶⁶ including consistency of encoding, transitions between views, and the data are the interface.

First Iteration The first version of the prototype contained a parallel coordinates chart, a scatterplot, a heatmap, a violin chart, and a Sankey diagram (Figure 3). We then conducted a semi-structured interview with three domain experts. We asked the expert in each interview to explore the prototype by themselves to support the development of a minimal mental model of the tool. When the expert felt proficient with it, we asked them to perform a set of tasks based on the experts’ tasks. In particular, we decided to cover both ET1 (T1 to T5) and ET2 (T6 and T7), as well as verifying whether the domain experts can manipulate the visualizations (T8 and T9):

T1. Determine the marital status and religion of Subject #78.

T2. Determine how much did Subject #78 score in the Openness trait.

T3. Identify how many male subjects are practicing catholics.

T4. Filter Subject #18. After filtering it, identify their scores in each of the FFM traits and the MHI.

T5. Filter Subject #16. Press the point in the scatterplot that represents this subject. Verify what is their economical and work status.

T6. Filter all subjects between 40 and 50 years old, and identify what is their average score for the Extraversion trait.

T7. Filter all subjects with the following conditions: females, 40 years old or less, with an Agreeableness score beneath 25. Count how many subjects are currently being queried. Find the largest MHI score between this sample’s individuals. Ascertain if Subjects #66 and #114 belong to this group.

T8. Reorder the parallel coordinates’ axes in the following sequence, from left to right: MHI, Extraversion, Conscientiousness, Openness, Agreeableness, Neuroticism.

T9. Reorder the Sankey diagram with the following sociodemographic factors, from left to right: Residence, Gender, Religion, and Schooling.

Figure 3.

Prototype used in the first refining workshop.

The three experts (E) were able to complete all tasks with little to no help from the developer. After accomplishing all tasks, they verbally evaluated the system, enumerating its pros and cons and making some suggestions. In particular, E1 stated that the parallel coordinates chart should present the average scores of each group of subjects of the same age extent, instead of doing so for each subject. Although this aggregated view of the data is one of the expert tasks, we did not implement it at the time. E1 also reported that they preferred a simple boxplot rather than a violin chart since they did not think that showing the overall distribution in-depth is important. E2 said they would not modify the dashboard since they enjoyed working with all the idioms implemented except the violin chart. Additionally, E2 reported that a help menu would be helpful to know how to work with the charts if the developer did not explain them and their purpose beforehand. Finally, E3 stated that the violin plots should have tooltips to inform which were the quartile values and the sample mean value while hovering over a trait, instead of looking at these measures with the naked eye. This user also thought the Sankey chart was valuable and easy to understand; however, the scatterplot was somewhat confusing, with a tooltip while hovering a particular dot being the most relevant component of this chart. Overall, E3 reported that the system was easy to understand and use. Indeed, the implemented transitions were a positive approach to detecting the changes while applying filters.

We should replace the violin chart with a boxplot with tooltips for the quartiles based on this feedback. Moreover, the comments on the age aggregation allow us to conclude that the parallel coordinates chart is the privileged idiom to contain this representation. Aside from these changes, the number of filtered subjects currently should also be added to the left drawer.

Second Iteration After iterating the prototype, we conducted an additional refining workshop with the domain expert from the problem characterization session. Again, the participant started by exploring the prototype by themself, followed by a set of tasks. Similar to the previous workshop, we address the experts’ tasks: ET1 is covered from T1 to T3, ET2 from T4 to T6, and ET3 by T7 and T8. The last question (T9) focuses on settings. The tasks were the following:

T1. Identify the gender and age of Subject #10.

T2. Select Subject #8. Verify if this subject has an Agreeableness score between the first and third quartiles. Identify the Agreeableness values for the first and third quartiles of the sample.

T3. Select Subject #88. Identify what is NEO-FFI’s Question #7 related to Conscientiousness, as well as the answer given to that question by Subject #88.

T4. Filter all male subjects between 25 and 40 years old. Report what is the minimum Extraversion score in this group. Assess if Subject #140 belongs to this sample. Identify this individual’s economical status. Verify if Subject #12 is a member of this group.

T5. Assess how many subjects with only the sixth grade are employed.

T6. Report how many people who live with their spouse only have no more than the sixth grade.

T7. Identify the MHI score average of all the subjects with ages between 18 and 30 years.

T8. Filter all subjects between 40 and 50 years old, and identify what is their median for the Extraversion trait.

T9. Open the settings menu. Alter the colors that represent both male and female subjects. Verify that the colors were used in the system. Restore these colors to default. Again, verify that the colors changed again in the system.

Once again, the expert completed all tasks in the presented order with little help from the developer. After finishing them, we asked for new feedback about this version. The researcher referred that each node of the Sankey diagram could present a tooltip, thus identifying the number of subjects that node links. Overall, the tester thought the system had a clean and intuitive design.

To consolidate the outcomes of the two refinement iterations, we summarize the role of each visualization idiom, along with its main advantages and limitations as identified during the expert workshops, in Table 2. This summary highlights how each idiom supported specific expert tasks, as well as the design trade-offs that motivated subsequent modifications, replacements, or enhancements. By explicitly contrasting the idioms, this overview clarifies the rationale behind the final set of visualizations included in the system and illustrates how expert feedback guided the refinement process, which we introduce in the next section.

Table 2.

Summary of visualization idioms, their analytical role, and expert feedback.

Idiom	Primary analytical role	Advantages identified by experts	Limitations and design decisions
Parallel coordinates	Multivariate comparison of personality and mental health traits	Enables simultaneous comparison of multiple traits; supports filtering and pattern detection across variables	Individual-level view caused visual clutter; extended to support aggregated views by age groups
Scatterplot	Exploration of relationships between two variables	Familiar representation; effective for identifying individual subjects and outliers	Interaction required to reduce ambiguity; tooltips identified as essential for interpretation
Sankey diagram	Exploration of relationships between sociodemographic factors	Intuitive and easy to understand; effective for examining distributions and flows	Lacked quantitative detail; tooltips added to display counts per node
Violin chart (first iteration)	Distribution analysis of psychological traits	Provided detailed distribution shape	Considered unnecessary for expert goals; replaced with boxplots
Boxplot (second iteration)	Summary statistics and group comparison	Clear and familiar representation; aligned with expert analytical practices	Required tooltips to expose quartiles and summary values
Heatmap	Fine-grained inspection of questionnaire item responses	Allows detailed exploration of NEO-FFI and MHI-5 item-level data; color encoding supports magnitude comparison	High information density; contextual placement in the side drawer limits visual overload

Key takeaway

The implementation phase translated the refined design into a fully functional prototype through two iterative workshops with domain experts. The first iteration validated the overall selection of visualization idioms and identified improvements, such as replacing the violin chart with boxplots, adding tooltips, and supporting age-based aggregation in parallel coordinates. The second iteration confirmed these refinements and further highlighted enhancements for interactivity and interpretability, specially regarding tooltips for the Sankey diagram. Across both iterations, expert feedback guided critical design adjustments, ensuring that the tool effectively supported the intended analytical tasks while maintaining usability and clarity.

FFM-MHI overview

Figure 4 depicts a screenshot of the interface. FFM-MHI is composed of three main components: (i) an observable view, (ii) side drawers, one for each side, and (iii) a filters and settings panel. This section provides an overview of each FFM-MHI component, including our reflections from the semi-structured interviews conducted with the domain experts and the design decisions.

Figure 4.

Screenshot of FFM-MHI Vis. The dashboard contains four main idioms: a parallel coordinates chart (top left corner), a scatterplot (top right corner), a boxplot (bottom left corner), and a Sankey diagram (bottom right corner). The lefthand drawer has a list of the individuals that match the conditional filters. The righthand drawer displays an heatmap when there is at least one individual selected.

Observable view

The Observable View (OV) contains four idioms that allow the researcher to explore the data through visualization. The design of the OV is a result of an intensive collaboration with the domain experts, comprising a total of four semi-structured interviews with three personality psychology. The idiom in the top-left is the parallel coordinates chart, which has an axis for each of the FFM traits and the MHI. The axes have a drag-and-drop behavior to brush data and change the order of the axes as requested. Each line has a color encoding where blue and orange stand for female and male participants, respectively. Moreover, the line of a selected individual has a larger width, while the remaining lines lose opacity. The user can also aggregate age groups the general trends of the sample, as depicted in Figure 5(a). This idiom directly supports ET1 and ET3.

Figure 5.

Screenshots to showcase features of FFM-MHI. (a) Parallel Coordinates by age group. (b) Heatmap for Subject #9. (c) Filtering all female subjects with ages between 35 and 60 years.

The scatterplot represents individuals in a two-dimension space defined by the value in a FFM trait or the MHI and their age. Each dot has a similar color encoding to the parallel coordinates, and selected individuals have their circle with a larger radius compared to the other ones. For both idioms, one can select items by clicking on the respective mark, which the user can hover to see a tooltip with more details. Regarding the boxplots, there is one instance for each of the FFM traits and the MHI. One can hover over each instance to depict the quartiles and the minimum and maximum values of the sample. If a particular subject is selected, an orange diamond shape appears on each axis, graphically showing the scores of that person in each of the boxplots. Both the scatterplot and the boxplot approaches support ET1. Finally, the Sankey diagram shows the distribution relationships between the sociodemographic factors. Therefore, experts can analyze how the chart divides the sample into a relatively small part of the screen at first glance. We took the design decision to, when the system loads, only display four of these factors (gender, residence, economical status, and religion), as the deemed them as more important compared to the other ones. However, it is possible to add up as many factors as the user wants and rearrange their order with a drag-and-drop behavior, thus supporting ET2.

Side drawers

The heatmaps are placed in the right drawer of the screen when at least one individual is selected (Figure 5(b)). This idiom contains all the answers to the NEO-FFI and MHI-5 questionnaires. In particular, each column corresponds to an FFM trait or the MHI, and each line corresponds to a different item. We added a color encoding to show the magnitude of the value of a cell, which can be hovered to find the particular answer to that item. Additionally, the left drawer consists of a listing of subjects that meet all filters applied by the user. If no filter is selected, it shows all the subjects. Their gender is encoded again by color, and clicking on a specific subject displays all sociodemographic values of the user. It is also possible to filter the list of subjects by ID (Figure 6), which does not have a selection semantic, that is, a highlight in the OV.

Figure 6.

Example of a subject with ID #1 highlighted in FFM-MHI Vis. In the parallel coordinates chart, the other lines lose opacity; in the scatterplot, the subject’s circle has a thicker border; each boxplot shows a mark on top indicating the ID; in the Sankey diagram, the edges corresponding to this participant are darker; the heatmap in the right-hand drawer displays the individual’s NEO-FFI and MHI scores; and the left-hand drawer presents all of the subject’s demographic values.

Filters and settings

We included several menus to interact with the idioms to help the user explore data and filter the database. On the top bar, there is a menu for the sociodemographic factors. It shows a checkbox for each factor which enables axis selection on the Sankey diagram. It is also possible to change the order of the axes by a drag-and-drop behavior. In addition, there is a menu of filters where the researcher can filter a subject by ID or each sociodemographic factor (Figure 5(c)). While filtering for the latter, the user can apply multiple factor filters simultaneously. Finally, the researcher can change the gender color encoding with RGB or HEX codes or a color picker. In addition, there is a help menu with a compilation of slides containing text that helps the user to understand how to interact with the system.

System implementation

FFM-MHI Vis is a single-page application running on a web browser. We used HTML5, Vue.js, Quasar, and D3 to program the client-side to render the page and its elements. The server-side uses a simple Node.js to load, save and change the user preferences on-the-fly. The tool is able to load other datasets with the same data structure. It is open-source and available at (Ommitted URL).

Deploy: Tool evaluation

The deploy phase focused on presenting the implemented system to the domain experts for real-world use and gathering feedback to refine the tool. One pitfall of design studies is the experimental demand characteristic effect,^67,68 which leads domain experts to be very likely to offer favorable feedback. As such, we decided to perform both utility and usability tests. The former is a case study where experts are concerned with the usefulness of the prototype to their work. In contrast, we can conduct the latter with any subject to evaluate not only utility, but also efficiency, safety, memorability, learnability, and satisfaction.⁶⁹

Participants

A total of 30 participants (21 male, 9 female) ranging from 21 and 60 years old ( $μ \approx$ 27.63, $σ \approx$ 11.09) volunteered to participate in our user tests. Of these 30 subjects, three of them are psychologists or experts on the theme of psychology and participated in the early design stages of the tool. Participants have a computer science (76.67%), psychology (10.0%), civil engineering (3.33%), tourism (3.33%), electrical engineering (3.33%) or graphical design (3.33%) background. Participants were recruited by direct contact and word of mouth and, upon completing the study, they were eligible to a raffle of three 20€gift cards.

The participants’ familiarity with the domain, visualization, and each idiom ranges from average to familiar, as depicted in Figure 7. These results indicate that users were more familiar with chart types commonly used in their research area (e.g. scatterplots and boxplots), and less familiar with less frequently encountered idioms (e.g. parallel coordinates and Sankey diagrams). In addition, we were able to verify that participants were not well familiarized overall with the FFM and MHI models. We believe that this factor has a positive impact, allowing the information visualization techniques to be evaluated instead of the knowledge of the psychology topics.

Figure 7.

User familiarization with the topics of interest for this work.

Apparatus

We conducted the user studies through videoconference software to share and record the screen. We used a timer to collect task completion time. Additionally, three questionnaires were used to evaluate the system: System Usability Scale (SUS),⁷⁰ Technology Acceptance Model 3 (TAM-3),⁷¹ and NASA Task Load Index (NASA-TLX).⁷² While we measure the perceived usability through the SUS, we opted to collect only the perceived usefulness and ease of use from the TAM-3. Finally, the NASA-TLX assesses six dimensions regarding user interaction: mental, physical, and temporal demands, as well as performance, effort, and frustration. All three dimensions have been discussed in related works to assess the quality of the apparatus.^73–75

Procedure

Each usability test consists of three parts: (i) fill in a form to collect demographic data, including familiarity with the topics of interest for this work; (ii) explore the tool until the participant is ready to conduct tasks; (iii) perform six tasks, in a random order, with a think-aloud protocol; and (iv) fill in another form consisting of the SUS, TAM-3, and NASA-TLX scales. This new set focuses on prompting the user to interact with all elements in the tool. In particular, ET1 is covered by Tasks T1, 5, and T6, while ET2 has T2, and ET3 has T3. We also added T5 to cover the settings of the dashboard. Each subject has up to 5 min to complete a task. The set of tasks is the following:

T1. Filter Subject #188. Indicate its age, gender, schooling level, and religion. Verify what the average score is, in the age range, for the Extraversion trait. Ascertain if the MHI score of this individual is between the second and third quartiles of the full sample. Clean the filters.

T2. Show only and exclusively the following sociodemographic factors: gender, economic status, and marital status. Reorder these factors so that the order presented in the Sankey diagram is the following, from left to right: economic status, marital status, and gender. Find out how many people that are married are also male. Finally, discover how many economically unsatisfied individuals are also divorced or separated.

T3. Start by opening the Filters menu. Filter all-female subjects. Indicate how many subjects are filtered. Filter all female subjects with their ages ranging between 25 and 40 years. Identify how many subjects are filtered. Ascertain if Subject #13 belongs to the queried group. Now, check if Subject #73 belongs to the same group. Remove the filter by gender, and, this time, filter all subjects with their ages ranging between 25 and 40 years. Identify how many subjects are filtered. Verify what the minimum, maximum, and median values of the Openness trait are for this selection. Restore the filter selection.

T4. Start by opening the Settings menu. Change the colors for the male and female genders to ones of your choice, and apply this new palette to the system. Confirm that the colors changed. Restore the system to its original colors. Confirm that the colors changed once again.

T5. Identify which is the subject identifier with the lowest Conscientiousness score of the entire sample. Indicate what its age is, and what its score in this FFM trait is.

T6. Filter Subject #74. Identify Question #7 related to the Neuroticism trait, as well as the answer to this question. Identify Question #19 related to the MHI, as well as the subject’s answer to this question.

Regarding the utility tests, we first allowed the participants to explore the tool and try one possible use case that they would apply through statistical methods. We then asked them to perform the mentioned tasks following a think-aloud protocol. We only collected qualitative feedback in a follow-up interview.

Usability testing

We conducted usability tests with the 30 participants to complement our prototype evaluation. The prototype scored 84.92 ± 1.94 in perceived usability, 91.42% ± 1.65% in perceived usefulness, and 87.14% ± 1.67% in perceived ease-of-use (Figure 8). These scores show that participants assess the quality dimensions of the prototype with high scores. Regarding the NASA-TLX, results are also positive since the lower the score for each metric (except for performance), the better the system is⁷² (Figure 9). We can see that the workload has most of its dimensions in the bottom half, and that the self-assessed performance was highly scored.

Figure 8.

Distribution of self-assessment metrics regarding the apparatus.

Figure 9.

Distribution of NASA-TLX metrics regarding the apparatus.

Regarding user efficiency, Tasks T1 to T3 are more complex and have more steps to be accomplished. Therefore, we expected that the times for each of these would be higher. Their time distributions were very similar, taking around 3 min to complete each task. In contrast, Tasks T4 to T6 are easier to fulfill, thus being expected at smaller time intervals. Indeed, this tendency held with the measured time intervals being around just 1 min. These results suggest that participants were able to complete multi-step analytical tasks within a few minutes, indicating that the system supports efficient exploratory analysis workflows (Figure 10).

Figure 10.

Distribution of task completion times.

Utility testing

We conducted utility tests with three psychologists. While applying a use case, one expert stated that the visualization dashboard was a good starting point to explore the data before running any statistical method. In particular, the charts provided “clues” and a “visual expectancy” regarding the data by offering a “configuration of the data from an aesthetic scope.” Hence, these visualizations supported a “pre-analysis of the data” to assess whether other variables could provide interesting findings on not preregistered relationships in the study. The expert also added that the graphical analysis supported the “identification of outliers in the sample.” The major drawback of solely using visualization is that “the dashboard could not provide the certainty of statistical analysis” to find group differences. Despite the boxplots providing a “more statistical, quantitative visualization,” the remaining charts were more focused on a “visual arrangement of the data.”

Concerning the tasks, only one user suffered from internet connection problems for a period and did not complete a task. Testers completed the remaining tasks in less than 4 min. Despite finding the system confusing at the start of the experiment, one tester expressed satisfaction with the global coherence and general aspect of the interface after completing the tasks. Other experts stated that the system was fun to work with. However, a first-time user may feel difficulties interacting with the interface conveniently. In particular, they would need to decipher what the meaning of more uncommon idioms is first. Participants also said the tool was clean and suggestive. The third psychologist claimed that the system was very intuitive, even stating that “it is much easier to work with data via charts and direct manipulation than with statistical software such as SPSS.” These observations indicate that the system complements traditional statistical analysis by enabling a rapid, visual pre-analysis phase, where experts can identify patterns, outliers, and candidate relationships before formal hypothesis testing.

Key takeaways

The deploy phase evaluated the visualization tool’s utility and usability with both domain experts and general users. Usability tests with 30 participants demonstrated high perceived usability, usefulness, and ease of use, with task completion times aligning with expected complexity levels. Utility tests with psychologists confirmed that the dashboard effectively supported pre-analysis, exploratory insights, and identification of outliers, complementing traditional statistical methods. Participants emphasized the tool’s intuitive design, clarity, and interactivity, while noting that less familiar visualization idioms might require initial guidance. Overall, feedback validated that the system meets the intended analytical objectives and provides a practical, engaging means for exploring multidimensional psychological data.

Case study example

To demonstrate the novelty and utility of the tool, we present a case study illustrating how FFM-MHI Vis can support expert analysis (Figure 11). We focus on the personality trait Conscientiousness, which is associated with goal-directed behavior, organization, and reliability.⁷⁶ On one hand, people with high scores on conscientiousness are more likely to be well organized, reliable, and consistent. On the other hand, individuals with low conscientiousness scores are generally more easy-going, spontaneous, and creative. In particular, these individuals tend to be more tolerant and less bound by rules and plans, which explains why they are also more likely to engage in antisocial and criminal behavior.⁷⁷ Experts are interested in understanding how this trait interacts with sociodemographic factors and mental health outcomes to answer various questions, such as: Do people with similar conscientiousness scores have the same job? Do constantly planning ahead and seeking achievements affect mental health? Does the conscientiousness score positively influence the female schooling levels? Does this effect apply to all age groups?

Figure 11.

Screenshots to showcase the case study example. (a) Filtered conscientiousness scores. (b) Work status distribution via Sankey. (c) Female participants by education and age. (d) Multiple traits in parallel coordinates.

The analysis begins by filtering the dataset to display only the range of Conscientiousness scores of interest (Figure 11(a)). Using the Sankey diagram, experts can observe the distribution of work status for individuals within the selected score range (Figure 11(b)) or filter for female participants to observe how Conscientiousness scores intersect with educational attainment and age groups (Figure 11(c)), providing insight into demographic patterns and potential psychological implications. Experts can simultaneously examine how mental health scores vary relative to the general population in the boxplots (Figure 11(c), bottom left chart). The parallel coordinates view allows experts to explore multiple variables at once (Figure 11(d)), such as the other personality traits, to identify trends and potential correlations across subgroups. Throughout the analysis, the tool supports experts in iteratively refining their queries, applying multiple filters, and visually comparing distributions. This step-by-step exploration enables conclusions to be drawn based on the observed patterns in the data. Experts can identify clusters, detect outliers, and compare subgroups without relying solely on traditional statistical methods, which typically require multiple sequential analyses. Instead, the system supports rapid exploratory analysis and hypothesis generation through interactive visual inspection. The strength of FFM-MHI Vis is that it offers many potential scopes through which the experts can analyze the data.

Reflect: Lessons learned from expert collaboration

The objective of the reflect phase was to analyze the outcomes of the design study, evaluate how effectively the visualization tool supported the experts’ analytical tasks, and extract lessons for future design work. In this design study, we changed the way experts look into a problem regarding the intersections between sociodemographic, personality, and mental health factors. Indeed, while the most common approach to analyzing highly informative tables is through statistical software such as SPSS, in this design study, we introduced visualization techniques to the domain experts. The nine-stage design study methodology structured the development of FFM-MHI Vis throughout the collaboration. Early semi-structured interviews revealed that experts typically relied on tabular data and regression models, motivating the need for visual overviews that support pre-analytical exploration. Co-design workshops helped identify which relationships experts most frequently investigated (e.g. between personality traits and sociodemographic factors), directly informing the choice of coordinated views such as parallel coordinates and Sankey diagrams. Iterative prototyping and feedback sessions guided interaction design decisions, including the emphasis on direct manipulation through filtering, reordering, and brushing, which experts described as more intuitive than parameter-based statistical queries.

When we deployed the tool to evaluate whether it met the experts’ goals, each evaluation task (T1T6) was designed to exercise one or more of the expert tasks (ET1ET3) identified in the design study. Tasks T1, T2, T5, and T6 primarily addressed ET1, as they required participants to explore individual and aggregated questionnaire scores, examine distributions of psychological variables, and analyze relationships with sociodemographic factors. Tasks T2 and T3 supported ET2 by having participants apply multiple filters across psychological and sociodemographic variables to identify subgroups and clusters. Finally, ET3 was exercised through T3, where participants aggregated data by age groups and analyzed trends across these defined groups. We also created tasks related to customization (T2 and T4) to assess learnability and configurability, reflecting experts’ interest in adapting visual encodings to their analytical preferences. This explicit mapping ensured that each core interaction in the system—including filtering, reordering, direct manipulation, and aggregation—was exercised in a realistic analytical context. It also demonstrates how the design study methodology directly guided both the evaluation task design and the functionality of the tool, aligning the study with real-world analytical workflows in personality psychology.

Utility results also showed positive qualitative feedback by the psychology experts, highlighting how relevant participatory design techniques are to design tools with the end users.⁷⁸ In particular, this new data analysis strategy allows experts to visually explore large volumes of data instead of running multiple regression models before diving in-depth into the data. To diminish the experimental demand characteristic effect,^67,68 we also conducted usability tests with participants outside this domain of knowledge. Our system scores 84.92 ± 1.94, 91.42 ± 1.65%, and 87.14 ± 1.67% in perceived usability, usefulness, and ease of use, respectively, which shows that we designed the system well and that even participants without psychology expertise were able to understand the information and interact with the system. Participants were also efficient with the low task completion times. Therefore, the system met the expectations, and it can be of great help for experts and people outside the area of expertise to analyze these types of data.

Notes from computer scientists

Similar to León and Breiter,⁴⁶ we noticed that social science experts were focused mainly on publishing their research, while their interest in the design study was to check how the tool could help them with their data analysis. Although our prototype was well received, we believe that it would still take some time for personality psychology experts to migrate from their typical statistical software to visualization tools. This inertia to adopt other data analysis strategies may arise from the core academic education of the experts since SPSS is the obligatory software for data analysis in the psychology courses at their university. Consequently, psychology experts may be entrenched in analyzing data using advanced descriptive statistics methods. It may explain the final relatively-simple visual analytics system since this type of software works with well-known idioms such as scatterplots, line charts, and boxplots.

However, we found that participants invested in collaborating with us, providing valuable and necessary domain expertise for a successful design process. They were also interested in learning how to apply visualization techniques to their research. We believe that the most valuable lesson learned is that personality psychology is a rich, underexplored domain that holds promise for visualization if experts invest in educating experts in information visualization to produce novel techniques.

Notes from personality psychologists

The system is pragmatic, functional, and useful from the psychologists’ perspective. Moreover, the psychologists claimed that FFM-MHI is technically simple to use and can be generalizable. Quoting the domain expert, “It has the potential to be applied in the research area, in terms of nomothetic analysis, supporting a graphical and numerical analysis. There are also advantages in idiographic assessments terms, allowing the exploration of a wide range of partial references.” She also highlighted the broad possibility of analyzing several interacting data simultaneously. We believe that information visualization techniques can be a set of innovative auxiliary instruments in the personality psychology area, with possibilities for expansion.

Limitations

The major limitation of this study is the sample size. A larger sample of experts per workshop would allow us to gather more feedback and enrich the prototype development process. Regarding the tool itself, it presents scalability issues when the dataset contains more than 250 subjects. This effect is present in the scatterplot especially. Moreover, we did not push the development of other features such as responsiveness or user customization. Although the prototype only presents three data types (FFM personality traits, MHI and sociodemographic factors), other variables with similar data characteristics like the HEXACO⁷⁹ personality model can replace the mentioned variables with a few changes. Furthermore, the system uses a static database, meaning that users must reload the tool to modify the dataset. Beyond this technical limitation, our evaluation also presents methodological constraints, namely the absence of a controlled comparison between visualization-based exploration and traditional statistical workflows (e.g. regression analysis in SPSS). While qualitative feedback suggests that the system accelerates early-stage data exploration, future work should quantify this effect through controlled experiments measuring analysis time, accuracy, and insight generation. Finally, a further limitation of the prototype is the absence of explicit graph titles, which may make the interpretation of some views more difficult, particularly for first-time users or for less familiar visualization idioms. While this choice was motivated by a clean, dashboard-style design to encourage exploratory analysis, future iterations should incorporate concise titles or lightweight annotations to improve immediate interpretability without compromising visual clarity.

Conclusions and future work

This design study has highlighted the need for visualization tools to help personality psychology experts analyze the intersections between sociodemographic, personality, and mental health factors. We followed a co-creation approach to identify opportunities where visualization can improve data analysis techniques. We started with workshops to elicit the requirements as well as to design and develop an visualization tool. We evaluated the prototype with utility and usability tests. We included our reflections and findings on the different phases of the design study.

Future work includes repeating the study with a larger sample of experts, specifically addressing a wider range of visualization knowledge among the participants. In addition, we would like to conduct several creative visualization opportunities⁴⁷ to explore more methodologies to elicit design requirements. A more heterogeneous sample may allow experts to explore a broader set of initial ideas and collect feedback from multiple distinct scopes. Moreover, we would like to conduct a study to compare whether using our tool in their daily work routine affects the workflow of the experts. Finally, recent advances have explored the use of artificial intelligence-powered tools to address mental health,⁸⁰ motivating future investigations into how AI-driven methods could complement interactive visualization tools in supporting experts’ analytical workflows.

Footnotes

ORCID iDs

Tomás Alves

Sandra Gama

Author contributions

Diogo Mestre: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Writing–Original Draft, Visualization.

Rute Pires, Bruno Gonçalves, Joana Henriques-Calado: Methodology, Resources, Writing–Review & Editing, Supervision.

Tomás Alves: Conceptualization, Methodology, Formal analysis, Writing–Original Draft, Visualization, Funding acquisition.

Sandra Gama: Methodology, Writing–Review & Editing, Supervision, Funding acquisition.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by national funds through Fundação para a Ciência e a Tecnologia (FCT) with references SFRH/BD/144798/2019, UID/50009/2025, UID/50021/2025, UID/PRR/50021/2025, and UIDB/315/2025.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References

Kestel

. Transforming mental health for all: a critical role for specialists. World Psychiatry 2022; 21(3): 333–334. https://doi.org/10.1002/wps.21030

World Health Organization. World mental health report: Transforming mental health for all. World Health Organization, 2022.

Hancock

. Mental health promotion must be a priority. CMAJ 2018; 190(16): E522. https://doi.org/10.1503/cmaj.180464

Kirkbride

Anglin

Colman

, et al. The social determinants of mental health and disorder: evidence, prevention and recommendations. World Psychiatry 2024; 23(1): 58–90. https://doi.org/10.1002/wps.21160

Magomedova

Fatima

. Mental health and well-being in the modern era: a comprehensive review of challenges and interventions. Cureus 2025; 17(1): e77683.

Ayodeji

. Examining the impact of technology-facilitated gender-based violence on the mental health and wellbeing of adolescents. Curr J Appl Sci Technol 2025; 44(5): 66–77. https://doi.org/10.9734/cjast/2025/v44i54537

Humer

Pieh

Probst

, et al. Mental health amidst multiple crises: trends and sociodemographic risk factors in Austria’s general population. Front Psychiatry 2025; 16: 1534994–1542025. https://doi.org/10.3389/fpsyt.2025.1534994

Ferguson

. Personality is of central concern to understand health: towards a theoretical model for health psychology. Health Psychol Rev 2013; 7(Suppl 1): S32–S70. https://doi.org/10.1080/17437199.2010.547985

Waszczuk

Hopwood

Luft

, et al. The prognostic utility of personality traits versus past psychiatric diagnoses: predicting future mental health and functioning. Clin Psychol Sci 2022; 10(4): 734–751. https://doi.org/10.1177/21677026211056596

10.

Perlman

. Persona: Social role and personality. University of Chicago Press, 2018.

11.

Beghi

Sanza

. Editorial: reviews in psychiatry 2023: personality disorders. Front Psychiatry 2025; 16: 2025 1567144. https://doi.org/10.3389/fpsyt.2025.1567144

12.

Sayar

Wilberg

Eikenæs

, et al. Improvement of alexithymia in patients treated in mental health services for personality disorders: a longitudinal, observational study. Front Psychiatry 2025; 16: 2025 1558654. https://doi.org/10.3389/fpsyt.2025.1558654

13.

Harrer

Miguel

Van Ballegooijen

. Supplemental material for effectiveness of psychotherapy: Synthesis of a “meta-analytic research domain” across world regions and 12 mental health problems. Psychol Bull 2025; 151(5): 600.

14.

Krahé

. Personality and social psychology: towards a synthesis. Sage Publications, Inc, 1992.

15.

Ekselius

Tillfors

Furmark

, et al. Personality disorders in the general population: DSM-IV and ICD-10 defined prevalence as related to sociodemographic profile. Pers Individ Dif 2001; 30(2): 311–320.

16.

Lindström

Aman

Norberg

. Parental burnout in relation to sociodemographic, psychosocial and personality factors as well as disease duration and glycaemic control in children with type 1 diabetes mellitus. Acta Paediatr 2011; 100(7): 1011–1017.

17.

Volk

Brazil

Franklin-Luther

, et al. The influence of demographics and personality on covid-19 coping in young adults. Pers Individ Dif 2021; 168: 110398.

18.

Sapsford

. Survey research. Sage, 2006.

19.

Aupperle

Paulus

Kuplicki

, et al. Web-based graphic representation of the life course of mental health: cross-sectional study across the spectrum of mood, anxiety, eating, and substance use disorders. JMIR Ment Health 2020; 7(1): e16919. https://doi.org/10.2196/16919

20.

Polhemus

Novak

Majid

, et al. Data visualization for chronic neurological and mental health condition self-management: systematic review of user perspectives. JMIR Ment Health 2022; 9(4): e25249. https://doi.org/10.2196/25249

21.

Ware

. Information visualization: perception for design. Morgan Kaufmann, 2020.

22.

Sedlmair

Meyer

Munzner

. Design study methodology: Reflections from the trenches and the stacks. IEEE Trans Vis Comput Graph 2012; 18(12): 2431–2440.

23.

Jiang

Chen

Zhou

, et al. Healthprism: A visual analytics system for exploring children’s physical and mental health profiles with multimodal data. IEEE Trans Vis Comput Graph 2023; PP(1): 1205–1215. https://doi.org/10.1109/TVCG.2023.3326943

24.

Austin

Mathiason

Monsen

. Using data visualization to detect patterns in whole-person health data. Res Nurs Health 2022; 45(4): 466–476. https://doi.org/10.1002/nur.22248

25.

Shiono

Kato

Aoki

, et al. Interactive visualization system for psychological topology. Indones J Electr Eng Inform 2019; 7(4): 779–788.

26.

Lin

Zecevic

Bouneffouf

, et al. Therapyview: visualizing therapy sessions with temporal topic modeling and ai-generated arts. arXiv preprint arXiv:230210845, 2023.

27.

Yang

Dong

Tian

, et al. Mbtiviz: a visualization system for research on psycho-demographics and personality. In: Yu

Han

Wang

(eds.) Data Science. Springer Nature Singapore, 2023, pp.259–276.

28.

Sama

Sreevalsan-Nair

Choudhary

, et al. mindlampvis as a co-designed clinician-facing data visualization portal to integrate clinical observations from digital phenotyping in schizophrenia: User-centered design process and pilot implementation. JMIR Form Res 2025; 9: e70073. https://doi.org/10.2196/70073

29.

Vaidyam

Halamka

Torous

. Enabling research and clinical use of patient-generated health data (the mindlamp platform): digital phenotyping study. JMIR Mhealth Uhealth 2022; 10(1): e30557.

30.

Norman

Draper

. User centered system design; new perspectives on human-computer interaction. L. Erlbaum Associates Inc, 1986.

31.

FDIs I. 9241-210. 2009). ergonomics of human system interaction-part 210: Human-centered design for interactive systems (formerly known as 13407). International Organization for Standardization (ISO) Switzerland 2009.

32.

Kopeć

Skorupska

Jaskulska

, et al. Livinglab pjait: towards better urban participation of seniors. In Proceedings of the International Conference on Web Intelligence, 2017, pp.1085–1092.

33.

Wilkinson

De Angeli

. Applying user centred and participatory design approaches to commercial product development. Des Stud 2014; 35(6): 614–631.

34.

Signoretti

Martins

Almeida

, et al. Trip 4 all: A gamified app to provide a new way to elderly people to travel. Procedia Comput Sci 2015; 67: 301–311.

35.

Ferreira

Almeida

Rosa

, et al. Elderly centered design for interaction – the case of the s4s medication assistant. Procedia Comput Sci 2014; 27: 398–408.

36.

Lindsay

Jackson

Schofield

, et al. Engaging older people using participatory design. In: Proceedings of the SIGCHI conference on human factors in computing systems, 2012, pp.1199–1208.

37.

Frishberg

. Interactive sparklines: A dynamic display of quantitative information. In: CHI’11 Extended abstracts on human factors in computing systems, 2011, pp.589–604.

38.

Kahng

Andrews

Kalro

, et al. ActiVis: Visual exploration of industry-scale deep neural network models. IEEE Trans Vis Comput Graph 2018; 24(1): 88–97.

39.

Bossen

Dindler

Iversen

. Evaluation in participatory design: a literature survey. In: Proceedings of the 14th Participatory Design Conference: Full papers-Volume 1, 2016, pp.151–160.

40.

Rothmann

Mouritsen

Ladefoged

, et al. The use of telehealth for psychological counselling of vulnerable adult patients with rheumatic diseases or diabetes: explorative study inspired by participatory design. JMIR Hum Factors 2022; 9(1): e30829.

41.

Mozgai

Hartholt

Leeds

, et al. Iterative participatory design for VRET domain transfer: From combat exposure to military sexual trauma. In: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems, CHI EA ’20, New York, NY, USA, 2020, pp.1–8. Association for Computing Machinery. https://doi.org/10.1145/3334480.3375219.

42.

van Hierden

Dietrich

Rundle-Thiele

. Designing an ehealth well-being program: a participatory design approach. Int J Environ Res Public Health 2021; 18: 7250. https://doi.org/10.3390/ijerph18147250

43.

Goodwin

Mears

Dwyer

, et al. What do constraint programming users want to see? exploring the role of visualisation in profiling of models and search. IEEE Trans Vis Comput Graph 2017; 23(1): 281–290.

44.

Kerzner

Lex

Sigulinsky

, et al. Graffinity: visualizing connectivity in large graphs. Comput Graph Forum 2017; 36(3): 251–260. https://doi.org/10.1111/cgf.13184

45.

Nobre

Gehlenborg

Coon

, et al. Lineage: Visualizing multivariate clinical data in genealogy graphs. IEEE Trans Vis Comput Graph 2019; 25(3): 1543–1558.

46.

León

Breiter

. Co-creating visualizations: a first evaluation with social science researchers. Comput Graph Forum 2020; 39(3): 291–302. https://doi.org/10.1111/cgf.13981

47.

Kerzner

Goodwin

Dykes

, et al. A framework for creative visualization-opportunities workshops. IEEE Trans Vis Comput Graph 2018; 25(1): 748–758.

48.

Sedlmair

Isenberg

Baur

, et al. Cardiogram: visual analytics for automotive engineers. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 2011, pp.1727–1736.

49.

Lloyd

Dykes

. Human-centered approaches in geovisualization design: investigating multiple methods through a long-term case study. IEEE Trans Vis Comput Graph 2011; 17(12): 2498–2507.

50.

Mischel

Shoda

Ayduk

. Introduction to personality: toward an integrative science of the person. John Wiley & Sons, 2007.

51.

Hofstee

WKB

. Who should own the definition of personality? Eur J Pers 1994; 8(3): 149–162.

52.

Costa

McCrae

. The revised neo personality inventory (neo-PI-R). In: Boyle

G. J.

Matthews

Saklofske

D. H.

(eds.) The revised NEO personality inventory (NEO-PI-R). SAGE Publications Ltd, 2008, pp.179–198. https://doi.org/10.4135/9781849200479.n9

53.

Goldberg

. An alternative” description of personality”: the big-five factor structure. J Pers Soc Psychol 1990; 59(6): 1216–1229.

54.

McCrae

John

. An introduction to the five-factor model and its applications. J Pers 1992; 60(2): 175–215.

55.

Pedroso de Lima

Gonçalves

Salgueira

, et al. A versão portuguesa do NEO-FFI: Caracterização em função da idade, género e escolaridade. Psicologia 2014; 28(2): 1–10. https://doi.org/10.17575/rpsicol.v28i2.534

56.

Veit

Ware

Jr . The structure of psychological distress and well-being in general populations. J Consult Clin Psychol 1983; 51(5): 730–742. https://doi.org/10.1037/0022-006X.51.5.730

57.

Ware

Snow

Kosinski

, et al. Sf-36 health survery. Manual and interpretation guide The Health Institute, New England Medical Center Boston, Mass 1993; 136.

58.

Ware

Jr Gandek

. The sf-36 health survey: Development and use in mental health research and the iqola project. Int J Ment Health 1994; 23(2): 49–73.

59.

Santos

AMS

Novo

. Mental Health Inventory: sensitivity and specificity of the Portuguese version of the MHI-38 and MHI-5. Psychol Rep 2020; 123: 1452–1469. https://doi.org/10.1177/0033294119850490

60.

Council NR, on Population C. US health in international perspective: Shorter lives, poorer health. National Academies Press, 2013.

61.

Chen

Yang

Ribarsky

. Toward effective insight management in visual analytics systems. In 2009 IEEE Pacific Visualization Symposium, 2009, pp.49–56. IEEE.

62.

Lam

Tory

Munzner

. Bridging from goals to tasks with design study analysis reports. IEEE Trans Vis Comput Graph 2018; 24(1): 435–445.

63.

Card

. Readings in information visualization: using vision to think. Morgan Kaufmann, 1999.

64.

Gordon

. Synectics: the development of Creative Capacity. Collier-Macmillan, 1970. URL http://google.cat/books?id=9g9InwEACAAJ

65.

Koh

Slingsby

Dykes

, et al. Developing and applying a user-centered model for the design and implementation of information visualization tools. In: 2011 15th International Conference on Information Visualisation, 2011, pp.90–95. IEEE.

66.

Kachkaev

Wood

Dykes

. Glyphs for exploring crowd-sourced subjective survey classification. Comput Graph Forum 2014; 33(3): 311–320. https://doi.org/10.1111/cgf.12387

67.

Intons-Peterson

. Imagery paradigms: How vulnerable are they to experimenters’ expectations? J Exp Psychol Hum Percept Perform 1983; 9(3): 394–412.

68.

Brown

Reeves

Sherwood

. Into the wild: challenges and opportunities for field trial methods. In Proceedings of the SIGCHI conference on human factors in computing systems, 2011, pp.1657–1666.

69.

Jordan

. An introduction to usability. CRC Press, 1998.

70.

Brooke

. Sus: a “quick and dirty” usability scale. Usability evaluation in industry 1996; : 189.

71.

Venkatesh

Bala

. Technology acceptance model 3 and a research agenda on interventions. Decis Sci 2008; 39(2): 273–315.

72.

Hart

. Nasa-task load index (nasa-tlx); 20 years later. Proc Hum Factors Ergon Soc Annu Meet 2006; 50(9): 904–908. https://doi.org/10.1177/154193120605000909

73.

Roa Romero

Tame

Holzhausen

, et al. Design and usability testing of an in-house developed performance feedback tool for medical students. BMC Med Educ 2021; 21(1): 354–359.

74.

Thayer

Ferro

Miller

, et al. Human-centered development of an electronic health record-embedded, interactive information visualization in the emergency department using fast healthcare interoperability resources. J Am Med Inform Assoc 2021; 28: 1401–1410.

75.

Castro

Quinan

Hosseinpour

, et al. Examining effort in 1d uncertainty communication using individual differences in working memory and nasa-tlx. IEEE Trans Vis Comput Graph 2022; 28(1): 411–421.

76.

Halko

Kientz

. Personality and persuasive technology: an exploratory study on health-promoting mobile applications. In International conference on persuasive technology, 2010, pp.150–161. Springer. https://doi.org/10.1007/978-3-642-13226-1_16

77.

Ozer

Benet-Martínez

. Personality and the prediction of consequential outcomes. Annu Rev Psychol 2006; 57: 401–421. https://doi.org/10.1146/annurev.psych.57.102904.190127

78.

Sanders

. An evolving map of design practice and design research. Interactions 2008; 15(6): 13–17.

79.

Ashton

Lee

. The HEXACO-60: a short measure of the major dimensions of personality. J Pers Assess 2009; 91(4): 340–345.

80.

Yang

Zhang

, et al. Towards interpretable mental health analysis with large language models. In: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023. pp.6056–6077.