Abstract
Emotional stimuli such as images, words, or video clips are often used in studies researching emotion. New sets are continuously being published, creating an immense number of available sets and complicating the task for researchers who are looking for suitable stimuli. This paper presents the KAPODI-database of emotional stimuli sets that are freely available or available upon request. Over 45 aspects including over 25 key set characteristics have been extracted and listed for each set. The database facilitates finding of and comparison between individual sets. It currently contains sets published between 1963 and 2020. A searchable online version (https://airtable.com/shrnVoUZrwu6riP9b) allows users to select specific set characteristics and to find matching sets accordingly, as well as to add new published sets.
Introduction
One of the most challenging aspects of designing a study involving emotions can be the selection of suitable emotional stimuli (ES) for the research goals. The aim of the present paper is to reduce this challenge and assist the researcher in making a more informed choice about which stimulus set best suits such goals, via the creation of a new tool – the searchable KAthrin POs. DIconne (KAPODI) database of emotional stimuli.
Choosing appropriate stimuli is no simple matter as the range of examples of ES include images, words, music, speech, or even video-clips. Their application is diverse; for example, word- and speech ES have been used to aid diagnoses in a clinical context (Haro et al., 2017; Nieuwenhuis-Mark et al., 2009), while video and image stimuli have been applied in therapy for alcohol addiction (Pronk et al., 2015), and eating disorders/obesity (Miccoli et al., 2014, 2016). Furthermore, emotional speech and music stimuli have been used for advertisements (Zander, 2006), learning aids (Schön et al., 2008), and political and linguistic research (Cullen & Harte, 2018; Edelman et al., 1992). Images of facial expressions are also useful for training emotion recognition and emotion expression in children (Cardos et al., 2017), individuals with autism (Wingenbach et al., 2016), and schizophrenia (Gutiérrez-Maldonado et al., 2014). Additionally, ES can be used in intercultural research (Sacco et al., 2016), human-machine interaction (Battocchi et al., 2005), and machine learning (Zafeiriou et al., 2016) within artificial intelligence. They are also useful for automatic recognition of anger in speech (Neiberg & Elenius, 2008), for instance, to indicate the customers’ emotion to staff in call-centres, as well as for emotion detection within texts such as online messages, comments on websites, and blog entries (e.g., Ramalingam et al., 2018). Finally, they can be used for automatic emotion recognition in human movement (Crane & Gross, 2007) such as detecting aggression, which can be applied within video surveillance of prisons or public spaces for security reasons. The ability to detect emotions automatically is thus of great interest to many organizations, including private businesses and the government. For algorithms training machines, ES are used as the ground truth.
Therefore, as well as the actual ES themselves, validation data is just as important. These assessed rating data provided along with stimuli are called
Commensurate with a rapidly growing interest in emotion research (e.g., emotion regulation [Gross, 2015] and emotional development [Pollak et al., 2019]), the need for ES has also increased. This is reflected in the extensive number of ES sets that have been steadily developed over the past few decades which have increased and broadened the range of available stimuli. The result is that when attempting to find suitable ES, one is faced with the daunting task of evaluating a vast number of sets which can be time consuming and unrewarding. Additionally, some sets may be outdated, difficult to access, or unavailable.
Interestingly, despite the abundance of ES, emotion-related studies tend to use only a few well-known stimuli sets that have, as a result, established research credentials. Examples of these include the
Nevertheless, relying on normative data can be a double-edged sword. For instance, although researchers may assume that selected stimuli will possess the characteristics specified by the normative data to bring about the intended effects, the ratings themselves may be affected by a plethora of external factors. These could for instance be the participants’ age (Grandy et al., 2020; Isaacowitz et al., 2007), gender (Hall & Matsumoto, 2004; Lithari et al., 2010; Nater et al., 2006), ethnicity (DeBusk & Austin, 2011; Deng et al., 2017), mental health (Mathews & MacLeod, 2005), hormone levels (Little, 2013), or social and cultural background and environment (Boiger et al., 2018; Kring & Gordon, 1998; Matsumoto et al., 2008). Furthermore, stimuli characteristics such as stimuli model's gender (Adolph & Alpers, 2010), or study characteristics such as order of stimuli presentation (Thayer, 1980a, 1980b), and evaluation context (Delatorre et al., 2019) can also influence the perception of ES. In other words, a stimulus can elicit different emotions or physiological reactions between different individuals, or even within one individual, and is also dependent on the context (Frijda, 2007).
Hence, a set validated for one population or context may not have the same effect - and thus not be valid - in another population or context. For example, research investigating the emotional perception of images in countries suffering from violence showed that Israeli adults rated images differently than adults in the United States (Okon-Singer et al., 2011). Moreover, the year the study was conducted may play an important role: For example, an image of the World Trade Center in New York City, presented to participants prior to the 11 September 2001, may have elicited different emotions compared to after that date. Images or video stimuli may include cues such as hair style or fashion that can easily be associated with a specific decade and thus seem outdated when seen today.
Unfortunately, not all available stimuli sets document these factors. Therefore, it is important to consider small details of ES set construction such as characteristics of the rating population, date of created stimuli, or country of research, when selecting stimuli and/or relying on normative data. Hence, the availability of a large pool and broad range of assessed ES is of great importance.
Although short overviews of existing sets have previously been provided in the research literature, these are typically not comprehensive (e.g., Grühn & Sharifian, 2016; Krumhuber et al., 2017), as they have not systematically searched for sets, or the overviews focus only on specific types of ES such as
Therefore, a step forward in establishing a more extensive and inclusive tool for emotion researchers is a central searchable database that can provide access to pre-validated stimuli and that lists key characteristics such as gender, age, and ethnicity of the rating population, type and length of rating scales, and assessed emotions and dimensions. It will enable researchers to search for suitable stimuli, facilitate comparison across sets, and save time. Moreover, such a database would be useful in allowing more effective study replication as well as more successful manipulation or control of a multitude of external (e.g., country of the survey, year of the study, ethnic background of participants) as well as internal (e.g., luminance, color, duration, or video/audio speed of presented stimuli) factors that are important in emotion research.
With no existing comprehensive review to date, the objective of this study was to systematically review existing and freely available sets of ES and provide an overview by documenting the central characteristics of each set. A searchable online version of the results will serve as a tool allowing specific set characteristics to be selected and display filtered results. The hope is that the resulting database will be a useful resource for researchers planning studies and in need of stimuli and/or assessed emotional rating data. Additionally, it may be directly beneficial to other contexts such as in therapeutic settings, or human-machine interaction.
Method
The methodological procedure of the current systematic literature review consisted of three main stages: first, conducting a systematic literature review aiming to detect all existing ES sets; second, coding and listing of key characteristics of all included sets; and third, creating a searchable online database.
Stage One: Systematic Literature Review
To capture the greatest possible number of papers proposing ES sets, an appropriate keyword selection had to be made and inclusion as well as exclusion criteria determined. In two consecutive steps, publications not meeting inclusion criteria were excluded.
Information sources
The keyword search was conducted in April 2019, on
Due to a long time-spanning coding process after the first search, the same search was conducted a second time in June 2020 aiming to detect all studies presenting ES published between January 2019 and June 2020. Literature search included emotion- and stimuli-related keywords and were kept limited to the six basic emotions (Ekman et al., 1969). The applied exact keywords were:
The keyword
Eligibility criteria
Papers were selected according to the criteria below:
To be included, papers had to
Excluded were all sets containing ES such as heat, pressure, or odor, as well as ES created for animal studies. Further, sets providing solely emotional physiological data unaccompanied by stimuli were excluded.
As discussed earlier, various factors can influence the perception of ES; therefore, in this systematic literature review validation of presented stimuli through participants was not considered a prerequisite. Publication without validation was sometimes the case for studies where models were asked to express certain emotions, (e.g., Minear & Park, 2004; O’Toole et al., 2005; Yingliang et al., 2006), or also for word lists created by the researchers themselves (Barrington, 1963).
Study selection
Literature search results were uploaded to Rayyan Software (Ouzzani et al., 2016), an internet-based software program that facilitates the study selection process. All search results were manually and independently screened against inclusion criteria, based on title and abstract. Uncertainties concerning inclusion were resolved through discussion and consensus of two to four researchers. When necessary, additional information was sought directly from study authors. Concerning inquiries regarding availability, authors were contacted twice via e-mail within approximately two to three months. If they did not respond after the second enquiry, the set was considered unavailable.
Stage Two: Data Collection Process and Data Items
Each publication that met inclusion criteria was then read two times independently and assigned to one of six subfolders:
Coded characteristics for each subfolder.
Stage Three: Creation of a Searchable Online Database
All extracted information was coded in an Excel sheet. An online version of the database was created. It serves as a search tool in which specific criteria such as type of stimuli, models’ age, rating scales, included emotions, etc. can be selected, leading to the display of solely sets containing these characteristics. The searchable database can be found at https://airtable.com/shrnVoUZrwu6riP9b.
Results
The first keyword search yielded 1,877 pieces of published work (443 in PsychInfo, 393 in Medline, and 1,041 in Web of Science). Duplicates (

PRISMA flow diagram of the research procedure.
A few publications mentioned further stimuli sets that were not detected by our initial key word search. Therefore, we decided to further verify the availability of all sets presented in tables within the publications that were already part of our database (extended table-search). All additional sets conforming to the inclusion criteria as mentioned above were included: Another
This updated keyword search yielded another
With 311 publications from the first search and 53 publications from the updated search, at the point of creation the database contains a total of
In the following section, a brief explanation of the subfolders is included. Currently, the database contains publications from 1963 to 2020. Note that the final keyword search in this study was conducted in June 2020, therefore stimuli sets published after that date are not included in the discussion below.
Stimuli Subfolders
For easier comparison across stimuli, each publication was allocated to one of six subfolders describing the type of stimuli:
A short description of results per subfolder follows:
Audio
Faces
A specific focus within video and image stimuli are facial expressions. This subfolder contains the most publications in relation to the other folders and accounts for almost one third (32.14%) of all sets. A growing interest in automatic emotion detection within artificial intelligence and progress in human-machine interaction are pushing researchers to continuously adjust and improve algorithms; these strongly depend on standardized and/or validated sets of face-stimuli. Proposed sets contain both grey-scale stimuli and color-stimuli, in 2D as well as 3D. Further, still-image stimuli, as well as video and dynamic stimuli are included. Dynamic stimuli can be constructed artificially based on individual images. Similarly, this folder also includes sets with morphed stimuli which are stimuli that are created by superimposing multiple images or video frames. In some cases, researchers provide video streams, as well as individual frames (image-stimuli) of the recorded videos. Standardization such as fixed pupil position across all models from one set, or removal of potentially distracting information (e.g., jewelery, hair, makeup, clothes) is coded. Both posed, as well as natural/spontaneous emotion expressions are included. Sets also vary in the degree of homogeneity regarding models’ gender, age, and ethnicity.
Images
Video
Words
Mixed
The
Navigation of the Database
In addition to listing all emotional stimuli sets with their key characteristics as presented in the Excel sheet version (see

Exemplary view of the KAPODI searchable database I.

Exemplary view of the KAPODI searchable database II.
Within each subfolder (e.g., audio stimuli), (see Figure 3, [
Discussion
The searchable KAPODI database was constructed in response to an increasing number of publications providing ES sets to the research community. The numerous publications reflect a growing demand for tailored stimuli as well as a growing interest in the field of emotion research. In the present work, the first comprehensive systematic review of emotional stimuli sets was conducted, and key set characteristics were coded to allow for comparison between sets and aid researchers in choosing appropriate sets for their research. The resulting KAPODI database contains the largest list of available stimuli sets to this date and is therefore a useful contribution to research on emotion and beyond.
In the following section, the database will be discussed regarding its use and its limitations. Finally, the creation and publication of emotional stimuli sets will be discussed with recommendations regarding how these should be reported.
Using the Database
Researching the availability of different ES sets is a time-consuming task and it often leads researchers to resort to well-known and widely used stimuli sets despite a wide range of other stimuli sets being available. At the time of publication, the KAPODI database comprises 364 publications from 1963 to 2020 that cover various types of stimuli such as audio-, image-, video-, and word-stimuli, or a combination thereof (e.g., audio-visual). Six sub-folders were created within the database for easier comparison of similar stimuli and to facilitate stimuli search for researchers in the future. Over 25 key characteristics have been coded for each set, leading to over 45 criteria that researchers can use to find appropriate stimuli (e.g., the characteristic
Information coded about the included emotions may for instance allow researchers to select stimuli of a distinct emotion, or select stimuli rated as neutral, for comparison. Researchers may search for stimuli sets based on applied rating scales (e.g., SAM-scale, Likert scale, visual analogue scale) or length of used scales, which may be important information for choosing new stimuli for replication studies. Researchers can also select sets that include a minimum number of stimuli, a minimum number of included models (e.g., within face-stimuli), or choose stimuli sets that include a specific type of content (e.g., images of food, or fear-inducing images). Moreover, it is possible to search for stimuli of a specific language or sets that include an additional type of data (e.g., physiological recordings).
The coded number of raters per stimulus allows researchers to search for stimuli assessed by a minimum number of raters, which is relevant for discerning the reliability of the ratings. Additional information regarding the rating population allows selection of stimuli based on the type of assessors (e.g., sets normed with student populations or via crowdsourcing). For certain types of stimuli, further information is coded to allow researchers to find exactly what they are looking for. For example, a researcher may wish to find audio stimuli which include natural (non-acted) expressions of happiness. The selected filters would be as follows: within the
Finally, further information is provided regarding the context of creation for each ES (e.g., researchers’ affiliation and country of study), as we believe that this may provide valuable information for researchers interested in the development of different types of stimuli from a geographical perspective.
In summary, the database provides the researcher with more flexibility in selecting an appropriate stimulus set and provides a systematic basis for going beyond classic ES sets (e.g., ANEW). This central database facilitates access to, and eases comparison between stimuli and/or sets for a wide range of applications and for researchers from a wide range of disciplines.
Strengths and Limitations
Despite its benefits for research, the database described is not without limitations. These, along with a few examples, are outlined in the following section.
Similar to the
The root of this difficulty could be due to incompatible theories of emotion; for instance, there is no agreement for a unitary definition of emotion (Izard, 2007) to this date. Therefore, the current database employed categories that were thought to best represent the various facets of emotional information and allow comparison between sets without subscribing to a specific overall theory of emotion categorization. In summary, criteria from different sets were coded according to our best understanding of set content and study procedure, while also keeping in mind the usability of our created database (e.g., set search by key words) for interested researchers.
To give an example, some scientists accept only the
Comparably, other terms may have differed in some publications, though we did not modify terms while coding: despite most publications naming happiness as one of the basic emotions, in a few cases, happiness was replaced by
Another example is that of the three dimensions
Recommendations for the Creation and Publication of Stimuli Sets
To offer a large applicability of good quality ES, researchers creating and presenting stimuli in the future should generally consider three aspects: high-quality stimuli, good validation procedure, and clear reporting. That is, researchers should
The issue of clear reporting is particularly important, we will therefore expand on this. Especially among video stimuli, information regarding sex and age of models is frequently missing. Similarly, stimuli sets within the
More recently published stimuli sets are often good examples of comprehensive reporting and high-quality stimuli, reflecting an increasing understanding of the need for relevant information to be included, but also of improving technological abilities (e.g.,
A central aim regarding future research conducted in relation to ES sets should be to improve the uniformity in reporting the characteristics of the set. Hence, it is suggested that researchers developing stimuli sets in the future should include information regarding all the key characteristics established through this current systematic review. For guidance, researchers may use the KAPODI submission form (see the link in the
Finally, researchers should ensure that their ES sets are made available freely and openly to other researchers, which will substantially contribute to transparency and reproducibility of research procedures (see Munafò et al., 2017).
With the central aim of supporting the efficiency of scientific research and knowledge accumulation, we took the decision to include solely stimuli sets that are publicly available/ freely available upon request. The information regarding availability of the set was taken from the original source. Some publications include an internet link, directly leading to the freely accessible set or to a compliance form for researchers. Others provide an e-mail address through which sets and data can be requested to the author(s) directly. Nevertheless, links and/or e-mail addresses may have changed, or sets may no longer be available. We therefore cannot guarantee that sets are truly available at present, even if stated so in the original source. Rather than relying on requests for access via email, we recommend that in the future, researchers should upload their created sets to a website or repository granting availability and easing access to stimuli to colleagues. An automatic validation system (for instance through a form requiring assurance regarding the academic purpose of accessing the set) could restrict access to researchers only. This further ensures access to the stimuli to remain the same, even if the set creator has changed or left their institution.
Final Remarks
Using a systematic review methodology, the current study aimed to identify as many available ES sets as possible. The resulting searchable database, which can be found on https://airtable.com/shrnVoUZrwu6riP9b, currently contains 364 different stimuli sets. It is available to the research community and all included stimuli sets are freely available or available upon request. By making all extracted and listed set key characteristics available in an Excel sheet as well as through the website Airtable.com, we further ensure its permanent availability. We aim to maintain the searchable online version of the database updated. By offering researchers the opportunity to add new stimuli sets that they wish to provide to the research community, we allow continuous content extension. Researchers who wish to add their new stimuli set to the searchable KAPODI database can fill out the corresponding set form available through https://linktr.ee/KAPODI_database. The submitted information will be uploaded to the database upon verification by one of the authors.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship and/or publication of this article.
