Abstract
Research on citizen science volunteers has historically focused on single projects, but emerging research suggests many volunteers engage in multiple projects. Platforms that host thousands of projects, like SciStarter.org, enable exploration of volunteer activity across multiple projects. To learn more about the phenomenon of multi-project engagement, we carried out a descriptive social network analysis using digital trace data depicting volunteer activity on SciStarter.org from 2017 to 2018. During this time period, our sample included 624 citizen science projects and 3,650 unique volunteers that engaged in these projects. We used these data to visualize and analyze project connection networks formed when volunteers join multiple projects. Volunteers joined an average of 2.93 projects spanning many different scientific disciplines (e.g., topics such as Health & Medicine, Ecology & Environment) and modes of participation (e.g., online, offline); 73% of volunteers joined 2 or more projects. Volunteer engagement in citizen science produced a complex network of project connections with low network centrality, low levels of homophily and clustering, and ample evidence of boundary spanning (e.g., based on topic or mode). The projects most central in the network, which were also the most popular, were those featured as affiliates on the website or in promotional email campaigns. By using a network approach to analyze digital trace data, our research illustrates the extent of multi-project, multi-disciplinary engagement on a third-party platform, laying the groundwork for researchers and platform managers to explore and facilitate multi-project engagement and its implications for the larger field of citizen science.
Plain language summary
Research on citizen science volunteers has historically focused on single projects, but recent studies suggest many volunteers engage in multiple projects. Platforms that host thousands of projects, like SciStarter.org, enable exploration of volunteer activity across multiple projects. To learn more about the phenomenon of multi-project engagement, we analyzed digital trace data on SciStarter.org using social network analysis. From 2017-2018, our sample included 624 citizen science projects, and 3,650 unique volunteers. Volunteers on the platform joined an average of 2.93 projects spanning many different scientific disciplines (e.g., topics such as Health & Medicine, Ecology & Environment, Astronomy) and modes of participation (e.g., online, offline). Citizen science projects were connected via shared volunteers in many different ways that spanned topic and mode boundaries. For example, volunteers who worked on ecology projects also worked on health projects, and volunteers who engaged in online projects also worked in offline projects. The projects most central within the network, which were also the most popular, were those featured as affiliates on the website or in promotional email campaigns. Our research revealed the extent of multi-project, multi-disciplinary engagement on a third-party platform, laying the groundwork for researchers and platform managers to explore and facilitate multi-project engagement among volunteers and its implications for the larger field of citizen science.
Keywords
Introduction
As a collaboration between the science community and members of the public, citizen science enables researchers to answer questions and collect data that would be difficult to answer and access with traditional research methods (e.g., Bonney et al., 2009; Chandler et al., 2017; Cooper et al., 2007; McKinley et al., 2017). Citizen science also functions as an outreach tool to accomplish educational goals (Bonney et al., 2016; Jordan et al., 2012; Peter et al., 2021; Phillips et al., 2018) and may influence volunteer behavior (e.g., Toomey & Domroese, 2013; Peter et al., 2019). Over the past few decades, the number of citizen science projects has rapidly grown (Parrish et al., 2018), as has research on the practice of citizen science (Jordan et al., 2015). Although citizen science research does look across multiple citizen science projects, it can be difficult to access sufficient data to get a landscape view of the field. Because of this, researchers are still learning about citizen scientists themselves (Peter et al., 2021), patterns of volunteer participation across different projects (Allf et al., 2022), and how that might impact public participation in science as whole. In this paper, we explore the nature of volunteer participation across the citizen science landscape using an innovative approach: social network analysis of digital trace data on the popular platform SciStarter.org (2024).
The Growth of Multi-Project Participation and Platforms
The conventional paradigm for managing citizen science volunteers is at the individual project level, focused on managing volunteers while they are in service to a particular project (Sharova, 2020). Additionally, most investigations of the practice of citizen science have generally focused on single projects, with a few notable exceptions that examined volunteer activity patterns across multiple projects on an online platform (Allf et al., 2022; Herodotou et al., 2020; Ponciano & Pereira, 2019). Compared to individual project-based management, platform-based management is more volunteer-centric, focused on managing volunteers holistically and beyond the bounds of a single project (Allf et al., 2022; Stein et al., 2023). And, although many citizen science projects operate in isolation (Bonney et al., 2014), many volunteers are already engaging in multiple projects (Allf et al., 2022; Hoffman et al., 2017), a behavior that is associated with increased durations of engagement with citizen science compared to single-project volunteers (Ponciano & Pereira, 2019). Platform-based research therefore provides the landscape-level view needed to generate broader insights into the volunteer experience as well as the potential for coordinating the management of volunteers (Stein et al., 2023). Many of these insights can be generated by analyzing digital trace (i.e., web tracking) data.
With widespread availability of information and community-oriented technologies, such as Web 2.0 and smartphones, the operation of modern citizen science projects requires a sufficient amount of digital infrastructure (Brenton et al., 2018). Web and phone application (app) platforms provide many citizen science projects with the necessary digital infrastructure for obtaining data from volunteers as well as managing subsequent databases. Platforms that support project capacity in these ways include Citsci.org, Zooniverse.org, Anecdata, and iNaturalist.org. Thus, the landscape of citizen science includes projects using custom-created digital infrastructure as well as the shared infrastructure of various platforms and apps. Some prior studies have examined volunteers engaged in different projects across shared infrastructure of a platform (Herodotou et al. 2022). The use of application programming interfaces (APIs) allows the secure transfer of user information between platforms and apps. The SciStarter.org web platform is the largest clearinghouse of citizen science projects, the majority of which operate with their own infrastructure outside their posting on the website. SciStarter members navigate from SciStarter to the projects they choose to join. In addition, some citizen science projects use the SciStarter API to share information about the activity of SciStarter members in their projects. Digital trace data from SciStarter contains information about volunteer engagement across a broad range of thousands of projects, irrespective of their source infrastructure. Similar to “big data” approaches gaining popularity in fields such as social media analytics (Rathore et al., 2017), these data allow managers and researchers to map users’ activity patterns on the website over time and represent a unique source of readily accessible data for analyzing volunteer activity across multiple projects.
Using Social Network Analysis With Digital Trace Data to Understand Project Connections
Although online platforms already play an important role in the citizen science landscape, researchers just beginning to understand what this role is, how it can be assessed, and how volunteers benefit from participating in citizen science through online platforms (Herodotou et al., 2022). Using innovative tools and approaches, researchers can visualize and analyze the resulting network of engagement across diverse projects, such as those in the large SciStarter.org repository. Social network analysis (SNA) is a research paradigm that allows for formalization and characterization of social units within a larger network (Borgatti et al., 2009; Wasserman & Faust, 1994). A short glossary of SNA terms relevant to this research is included in S1 Appendix. SNA has been used to examine connections and interactions among citizen science volunteers, but often within the context of a specific project (Amarasinghe et al., 2021). Few studies have examined volunteer networks across multiple projects (Herodotou et al., 2020).
Social network analysis is based on the premise that social networks are made up of actors (nodes) and the relationships between them (edges). In one-mode networks, all nodes in the network are of the same category; for example, families connected by marriage (Padgett, 2010). In two-mode networks, also called bipartite networks, nodes are of two different categories; for example, parties and the people who attend them (Davis et al., 1941; Freeman, 2003) or, in the current case, citizen science projects and the volunteers who join them. Bipartite networks can also be projected into one-mode networks (Herodotou et al., 2020; Pettey et al., 2016; Sankar et al., 2015) that, with thoughtful interpretation (Opsahl, 2013), can offer insights into patterns of co-incidence in a large network (Pettey et al., 2016). For example, in the citizen science landscape, we can project the bipartite network of project and volunteer connections into a network to show the project connections formed when a volunteer joins multiple projects. Descriptive analysis of this network can reveal what characteristics unite the different projects that a volunteer joins. Researchers can test for homophily (similar projects connected to each other) and clustering (distinct groups of interconnected projects) to determine if volunteers tend to join similar groups of projects. Researchers can also test for centrality to determine what projects are the most connected and show the most multi-project engagement. Exploring connections between projects is a critical new direction in citizen science research, where multi-project engagement by volunteers is already happening but the factors influencing this engagement and the factors associated with it are poorly understood (Allf et al., 2022; Herodotou et al., 2020; Hoffman et al., 2017).
Attributes Associated With Project Connectivity
When volunteers view projects on a citizen science platform, they may be attracted to intrinsic attributes of the project, such as the scientific topic of the project or the process for participating. Additionally, they may be affected by extrinsic attributes of the project, such as how the project is featured on the online platform. We were interested in the influence of intrinsic and extrinsic project attributes on volunteer activity across the platform.
We investigated two types of intrinsic project attributes (topic, mode) and two types of extrinsic project attributes (affiliate status, campaign feature status) to determine how volunteers are currently engaging with projects across and within attributes. Topic refers to the scientific discipline of the project, and mode refers to the means of accessing the project. Projects on platforms such as SciStarter.org and Zooniverse.org are commonly grouped by topic or scientific disciplines, which might include themes such as the environment, public health, and astronomy. Other typologies of citizen projects often focus on methods of participation, although the specific words used to describe these different methods may vary. For instance, Parrish et al. (2018) refers to the division as “hands-on” and “virtual,” and Sharova (2020) refers to “outdoor” and “indoor” projects. In the current research, we chose online versus offline to reflect the online status of SciStarter.org. Topic and mode are attributes intrinsic to the project and are not manipulated by the SciStarter Platform.
Alternatively, project affiliate status and campaign feature status refer to the statuses given to projects by the SciStarter platform and are thus extrinsic to the projects. Affiliate projects on SciStarter.org share data contribution information with SciStarter and in return are marked with an icon indicating their status as affiliates. A project’s campaign feature status is a result of whether it has been featured in an email campaign to SciStarter members. Collectively, these intrinsic and extrinsic attributes could influence overlap of volunteers across the larger citizen science landscape.
Research Objectives
Our research sought to fill critical research gaps by mapping and analyzing the extent of multi-project engagement, and the project attributes linked to it, across the citizen science landscape. The first objective in our exploratory study was to describe the structure of the project connection network, based on shared volunteers, from a snapshot in time on SciStarter.org (Obj. 1). For our second objective, we sought to quantify the patterns of participation within and across project attribute groups to determine if homophily (or clustering) is present in the network and if intrinsic (topic, mode) and extrinsic (affiliate status, campaign status) project attributes influence volunteer overlap among projects (i.e., multi-project participation; Obj 2). Our third and final objective was to discover the projects central to the network, or those with the most shared volunteers. We explored project centrality in two ways: first determining which projects are central to the projection network (Obj. 3a) and then examining project attributes associated with centrality (Obj. 3b). By addressing these objectives, we sought to describe and map the patterns of multi-project participation on SciStarter.org, building important foundational knowledge about the current state of multi-project participation in citizen science.
Materials and Methods
SciStarter Website
Our analysis centered on SciStarter.org, a website that hosts a platform and database of thousands of citizen science projects. The SciStarter platform provides volunteers with a wide choice of projects and creates opportunities for volunteer-centric management. At the time of data collection (the website has been updated since), SciStarter managed a database of user-generated content about each citizen science project, a Project Finder application that provided advanced functionality for searching for projects, and a syndicated blog to promote citizen science projects, often facilitating volunteer connections to and participation in multiple projects (Allf et al., 2022). The Project Finder application allowed interested volunteers to search for projects by various tags including keyword, topic (scientific discipline), mode of participation (“exclusively online,” “on a lunch break,” “at school,” etc.), location, and target age group. Volunteers can become SciStarter members by creating a free account, and making a user profile in which they click projects to “join.” These are saved, thus allowing for tracking of engagement in projects within or separate from the SciStarter web platform.
All projects on the site have a project page that shares information about the project and how to participate. Project pages are searchable through the project finder tool. However, not all projects on the SciStarter website are presented in the same way. Some projects are highlighted because they are SciStarter “affiliates,” which means that they use SciStarter’s API to share information about the frequency of volunteers’ data contributions to their project. This is then displayed on the volunteer’s profile on SciStarter. These affiliate projects are marked with an icon in the search results and on the project page and appear first in search queries. Another way that members find projects is through monthly emails that feature certain projects that fit the theme of that month’s campaign. For instance, an email in October might feature projects that align with a Halloween theme, such as projects about bats or spiders. These differences in how projects are viewed or appear on the website could affect volunteer choices of projects, and thus the project’s position in the larger project connection network.
Defining Project Attributes
We wanted to learn more about how project differences may affect their position in the larger project network, so we collected information related to four project attributes: topic, mode of participation, SciStarter affiliation status, and campaign feature status. To code project topics (topic), we applied the typology of 14 project topics (Supplemental Appendix S2) used in Allf et al. (2022) and Sharova (2020). Postings that were determined to not be citizen science projects, including postings for tools (such as spectrometers) and postings for other citizen science platforms (such as for the website CitSci.org), were dropped from the sample. In some analyses, we grouped project topics into four larger groups (super-topics) so that we could more clearly test differences between these larger disciplines: Earth & Life Sciences, Behavior & Social Sciences, Engineering & Physical Sciences, and Health & Medicine. The compositions of these super-topics are also shared in S2 Appendix.
Mode of participation (mode) was coded by the authors using expert review with the following definitions. Offline projects are projects where the primary mode of data collection is offline. Projects in which data collection is offline but submitted through an online platform (e.g., iNaturalist) are considered offline projects. Online projects are projects where the primary mode of data collection or classification is online (e.g., Stall Catchers). For both topic and mode classification, we used project descriptions available on SciStarter.org. Additional information from project websites was used if needed. Projects with no further information and three or fewer instances of engagement (joins or bookmarks) during the year of data collection were excluded from the sample.
We coded a project’s affiliate status and campaign feature status using records from SciStarter.org. Because of the longitudinal nature of data collection, we assigned different status levels based on when the project became an affiliate or when it was featured in monthly email campaigns. Affiliate status was separated into three groups: (a) projects that were never a SciStarter affiliate (Not Affiliate), (b) projects that became an affiliate sometime during the data collection period (Part-time Affiliate), and (c) projects that were an affiliate throughout the entire data collection period (Full-time Affiliate). A project’s campaign feature status tells us when that project was featured in one of the monthly emails that go out to volunteers. The campaign feature status was separated into four categories: (a) projects that were never featured in a SciStarter promotional campaign, (b) projects featured only before data collection, (c) projects featured only during data collection, and (d) projects featured in promotional campaigns both before and during data collection.
SciStarter Digital Trace Data
Digital trace data for this research was extracted from SciStarter.org (which existed as SciStarter.com at the time data collection was initiated). On Dec 6th, 2018, our research team received anonymized digital trace data of SciStarter members’ activity on the website between Sept 19th, 2017 and Dec 3rd, 2018. Sept 19th, 2017 marked the launch of SciStarter 2.0 (Hoffman et al., 2017), which introduced the added functionality of member accounts, dashboards, and profile pages. Use of these secondary data was approved by the NC State University Institutional Review Board (IRB Protocol # 20934) prior to analysis. There was very little demographic data included, as we only had access to what volunteers willingly entered into their volunteer profile; 6.9% of volunteers provided their birth year, and 6.3% provided a gender. Consequently, we were unable to investigate associations between volunteer demographic characteristics and multi-project participation in this analysis, though that remains an important direction for future research.
At the time of data collection, volunteers could click buttons to either “join” or “bookmark” projects of interest on the SciStarter website (Figure 1; SciStarter.org, 2024). Additionally, volunteers could check a box to indicate that they had previously joined a project. Clicking “join” sent the volunteer to the project’s website (unless that project was exclusively hosted on SciStarter), and it automatically added the project to a list of joined projects on a volunteer’s profile. Thus, the join function in SciStater could be considered “conversion” (Crall et al., 2017), or an expression of interest in joining a project. Clicking “bookmark” added the project to a list of bookmarked projects on the volunteers’ profile. Volunteer activities like “joins” and “bookmarks” were recorded in the digital trace data, along with an anonymized participant ID number.

At the time of data collection, interested volunteers could either click to “join” or “bookmark” a project on SciStarter. When volunteers clicked the “join” feature depicted here, they navigated to the project website and the project was added to their volunteer dashboard. Clicking the “join” button was used as a proxy for project engagement. Sparrow Swap logo and project overview are reprinted with permission from the NC Museum of Natural Sciences and SciStarter.org, and can be viewed at https://scistarter.org/sparrow-swap.
As noted, some citizen science projects are affiliates that are directly hosted on the SciStarter website or use SciStarter’s API. Because of this, we were able to track individual contributions to these affiliate projects—a deeper level of project engagement than we could track in non-affiliate projects. However, because a record of data contributions was only available for 30 of the 624 projects that were active during data collection, we used volunteers selecting the “join project” button as an indication of volunteers’ behavioral intent to join a project. “Bookmarking” a project for later reference was not used as a proxy. Although tangible data contributions might have been a more accurate measure of project participation, previous research suggests that behavioral intent is strongly linked to actual behavior (Ajzen, 1991). Analysis of the data contributions showed that “joining” a project is a better predictor of subsequent contributions than “bookmarking” a project; we were able to confirm that at least 30% of volunteers who clicked “join” on an affiliate project later contributed to that project, while only 10% of volunteers who clicked “bookmark” ended up contributing. These estimates may be low, however, as volunteers would have to be logged in through SciStarter for their keystrokes to be accurately recorded. Thus, given the SciStarter metrics available through the API at the time of data collection, “join” was the best proxy for project participation.
Data Analysis
We conducted analyses in R (R Core Team, 2019) using RStudio (RStudio Team, 2016) and in SPSS (IBM Corp, 2017). We used social network analysis to better understand the relationships between projects connected by shared volunteers, which reveal volunteers’ patterns of engagement across the citizen science landscape. To get deeper insight into project connectivity across this landscape (Obj. 1), we used the R packages igraph (Csardi & Nepusz, 2006) and sna (Butts, 2019) to create a one-mode projection of citizen science projects (nodes) connected by shared volunteers (edges), with weighted connections in proportion to the number of volunteers that two projects share. Whole network descriptive statistics such as component analysis, clustering, and centralization revealed the structure of the network (definitions provided in Supplemental Appendix S1).
We used an exponential random graph model (ERGM) to test for homophily in the network. Homophily is the tendency in networks for similar nodes to be connected to each other. We tested for homophily to determine if volunteers tend to join projects with similar attributes, such as the same topic or method of participation (Obj, 2). ERGM is an approach to data analysis that allows researchers to statistically test the impact of connectedness and project attributes on how volunteers move among projects. We used the ergm R package (Handcock et al., 2018; Hunter et al., 2008) to develop a predictive model that allowed for link prediction, predicting which projects would be connected in the future, based on node attributes (Robins et al., 2007; van der Pol, 2019). We followed this with a Chi-square analysis of connections between projects of varying attributes to determine if observed instances of connection differed from what would be expected in a random distribution. This demonstrates whether certain attributes are more or less connected than would be expected in a random network.
We then determined which projects were central to the network (Obj. 3a) by calculating the centrality of the nodes in the project connection network. Using degree centrality (the number of connections from one project to other project nodes) as the dependent variable, we used a negative binomial regression analysis to explore how project attributes influenced project centrality (Obj. 3b). We chose a negative binomial regression for this analysis because the data were over-dispersed, with the variance exceeding the mean (Hilbe, 2011).
Results
Dataset Description and Project-Based Network Projection
During the period of data collection, digital trace data revealed that 3,650 SciStarter members joined 624 projects, resulting in 9,678 instances of project joins. The 624 projects spanned all project attributes, representing 14 different topics (scientific disciplines). The breakdowns of the attributes are shown in Table 1.
Counts and Ratios of the Number of Citizen Science Projects Displaying Each Attribute in the Project Connection Network From SciStarter.org.
In 98.4% of these projects, at least one volunteer also joined other projects. In 93.9% of projects, at least three-fourths of the volunteers joined other projects. The 624 projects had an average of 17.1 volunteers, with a minimum of 1 and a maximum of 480 volunteers (note that these numbers do not represent the total number of volunteers within a given project, but only those with SciStarter accounts). The distribution of projects was skewed, however, as 476 projects had 10 or fewer volunteers (Figure 2). The digital trace data showed that volunteers joined an average of 2.9 projects, with a median of two joins, minimum of 1 and maximum of 34 project joins; 73% of all active volunteers joined 2 or more projects, and 46% joined three or more projects. These patterns align with self-reported participation among SciStarter members (Allf et al., 2022).

Number of volunteers participating in different projects based on digital trace data available through the SciStarter platform. Data are skewed, showing that most projects have a small number of volunteers.
The project connection projection network (hereafter project network) showed relationships among the various citizen science projects on SciStarter (Figure 3). In this network, the nodes represent the 624 projects, and an edge between two nodes represents the connection made when one or more volunteers joins both projects. Node color represents project topic, and node shape represents project mode. Only 10 of the projects in the project network were isolates, meaning that they are not connected to any other projects. The remaining 614 projects (98.3% of all projects) shared volunteers with at least one other citizen science project. The 614 project nodes made up one connected component. The longest geodesic distance between any two projects was 7; this means that there was a maximum of seven edges, or different links representing shared volunteers, between projects. A modularity-optimization-based cluster analysis did not reveal significant clustering in the network (Blondel et al., 2008). Clauset et al. (2004) suggested the result of such an analysis be at least 0.3 to indicate clustering; the SciStarter network had a modularity of 0.2, meaning that we were not able to detect distinct communities of densely connected nodes. Table 2 displays descriptive statistics for the connected component in the project network. These descriptive statistics again highlight the high interconnectedness of the network. In particular, the relatively high transitivity compared to the density (Supplemental Appendix S1) emphasizes that projects at the center of the network are often connected to each other in triads, where three projects are connected by all possible edges (i.e., shared volunteers); the same is not true throughout the network. These triads do not form distinct communities though, as represented by the lack of significant clustering.

Project connection network for all citizen science projects on SciStarter. This is a one-mode projection of projects connected by shared volunteers. Node shape represents project mode. Node color represents project topic.
Descriptive Statistics for the Single Large Component in the SciStarter Project Network.
Exploration of Network Homophily
We used exponential random graph modeling (ERGM) to determine if links in the network could be predicted using project attributes. We aimed to determine if projects with similar attributes (topic, mode, affiliate status, and campaign feature) were more likely to share volunteers, which would result in homophily in the network. Neither the original bipartite (two-mode) nor the projection (project-mode alone) network produced a viable model (see Supplemental Appendix S3 for a more in-depth explanation). The persistent degeneracy of the models suggests a lack of statistically significant homophily in the network, meaning either that volunteers were regularly joining projects across different attributes, or that volunteer behavior cannot be reliably predicted based on the project attributes investigated in this study (topic, mode, affiliate status, and campaign feature).
Distribution of Direct Connections Among Diverse Project Attributes
To dig deeper into the patterns of connections across and within different groups of attributes, we calculated the number of connections between projects with different attributes (e.g., connections between online and offline projects based on shared volunteers). For this analysis, we compared the observed percentage of connections between each attribute type with what would be expected in a random distribution of connections (Table 3).
Connections Between and Across Citizen Science Project Attributes Based on Percentage of Observed Connections (via Shared Volunteers) Relative to What Would be Expected in a Perfectly Random Distribution (Bottom Row of Each Matrix) for (a) Project Topic, (b) Project Mode, (c) Affiliate Status of Project, and (d) Campaign Feature of Project.
Chi-square analyses revealed that the observed percentages differ from what would be expected in a random distribution for the project topics,
Centralization and Centrality in the Project Network
We analyzed the centralization and centrality of the large component of the project network (see Figure 2), excluding the ten isolate projects, to see if certain projects were more prominent nodes of connectivity than others. These scores are normalized measures—the closer they are to 1, the more centrality is present around one node (Supplemental Appendix S1). The degree centralization for the project network was 0.43, demonstrating that degree centrality was not centered on only a few projects, and that many projects were well-connected in the network. Betweenness centralization in the project network was only 0.08, suggesting that most projects played an equal role in connecting other projects. This supports the lack of clustering we found, suggesting there were not disparate groups of projects in the network. The project network presented high eigenvector centrality at 0.84, meaning that a few highly connected projects were well connected to each other. Figure 4 highlights the strongest connections in the network, showing only edges with 10 or more shared volunteers. This network illustrates the high interconnectedness, and high degree of boundary spanning based on topic and mode, at the center of the network.

Projection of the largest component of the SciStarter citizen science project connection network using modularity-optimization clustering algorithm (Blondel et al., 2008). Node shape represents project mode. Node color represents project topic. Modularity of the fit of this analysis is insignificant at .181, meaning that there is insignificant evidence for clustering in this network.
The most connected projects in this network were also the most popular projects. We found a significant correlation,

Relationship between number of volunteers in a project and the number of connections that has project has with other citizen science projects on the SciStarter platform (based on shared volunteers).
Attributes Associated With Project Centrality
We ran a negative binomial regression model to determine the influence of project topic, mode, affiliate status, and campaign promotion status on the degree centrality of projects in the network. All attributes were significant in predicting degree centrality (Table 4). Online projects were more connected to other projects than offline projects (
Negative Binomial Regression a Results Showing Project Attributes (Topic, Mode, Affiliate Status, Campaign Features) Associated With Degree Centrality b for Citizen Science Projects on the SciStarter.org platform.
Omnibus Test
Degree centrality represents the number of connections from one project to other project nodes in the network (
Indicates reference category for each attribute group.
Discussion
Using a social network analysis with digital trace data from the SciStarter platform, our study explored the degree to which projects are connected by shared volunteers and the role that third-party platforms can play in facilitating connectivity across the citizen science landscape. Our results support recent studies showing that citizen science volunteers engage across multiple projects (Allf et al., 2022; Herodotou et al., 2020; Hoffman et al., 2017; Ponciano & Pereira, 2019). Volunteers on SciStarter joined more than two projects on average, and 73% of volunteers joined 2 or more projects.
The phenomenon of multi-project engagement in citizen science is important to consider for the sake of both volunteer experience and project management. Previous research on platforms has shown that volunteers engaging with multiple projects tend to stay on platforms longer than those volunteers that engage with a single project (Ponciano & Pereira, 2019). Longer engagement with a platform could indicate more prolonged engagement with citizen science as a whole. Additionally, multi-project engagement presents more opportunities for volunteers to build skills across projects and advance learning outcomes (Phillips et al., 2018; Peter et al., 2019; Smith et al., 2024). Platforms also benefit project managers, providing cyber-infrastructure, access to volunteers, and a chance for collaborative learning to advance best practices across projects (Newman et al., 2012). For example, research suggests that third-party “enabler” organizations - whether in-person institutions or online platforms - can strengthen volunteer engagement with science in multiple forms (Salmon et al., 2021; Smith et al., 2024). Given the potential benefits of multi-project engagement, it is important to understand how projects are connected by shared volunteers.
Our projection depicting the network of citizen science projects connected by shared volunteers revealed an interconnected network with one large component. This network presented minimal clustering and homophily, meaning that at a whole network level, similar projects are not predictably sharing volunteers, and there are not clear groups of projects that all share the same volunteers. It therefore appears that citizen science volunteers are open to exploring new project types and often cross over to participate in projects with different attributes such as topic, mode, affiliate status, and campaign feature status, opening the opportunity for transdisciplinary citizen science (Spasiano, et al., 2021). A cluster analysis on a similar dataset from the citizen science platform Zooniverse also found minimal evidence for clustering (Herodotou et al., 2020), suggesting that the lack of clustering among projects may be a common trait shared by many citizen science platforms.
Looking more closely at this intermixing, we did see some patterns of participation emerge on an individual attribute level (Table 3). For the intrinsic attributes of project topic and project mode, project connections between two projects with matching topics or modes were proportionally more likely than connections between projects with different topics or modes. In other words, there was a slight tendency for volunteers to join multiple projects within the same general topic or mode of projects, but only within intrinsic attributes (topic and mode) at the individual attribute level. When looking at the whole network, this tendency fades away. More research is needed to investigate the strength of these patterns when multiple attributes are considered.
For all extrinsic attribute levels, the most likely connections were with projects that hold Full-time Affiliate status or projects that were featured in campaigns Before and During Data Collection. We were not able to investigate whether there was a causal relationship between connections and project features as part of this research. The high connectivity of featured projects may help explain the influence that platforms can have on multi-project engagement, potentially allowing volunteers to discover and connect with new types of science. However, it is also possible that SciStarter may only promote particularly popular projects. Future research using experimental manipulation of platform management strategies (e.g., systematic selection of projects for campaigns) could reveal causal pathways that help to better explain volunteer recruitment and retention and create new multi-project engagement pathways.
The significant correlation between centrality and popularity (based on number of joins) showed that popular projects were central to the SciStarter network. The skewed distribution of joins revealed that a few projects were responsible for most of the activity on SciStarter. High eigenvector centrality and high clustering coefficients compared to density measures also demonstrated that these central, popular projects were highly connected to each other. Regression results suggested that the centrality of these projects was in part due to their topic (health and medicine projects were most popular) and mode (online projects were popular). Project popularity was also linked to SciStarter featuring these projects as affiliates and in email campaigns. Projects featured by SciStarter received higher numbers of joins than projects that were not featured. However, as noted above, we were not able to investigate whether project features led to popularity or popularity led to a project being featured. A more advanced understanding of what makes projects popular could address the “nibble and drop” problem in citizen science (Fischer et al., 2021), where many volunteers try a project once but do not return. Since the collection of the data in this study, SciStarter has implemented an intelligent recommendation tool that recommends projects that volunteers might be interested in based on past data, which has led to increased participation among volunteers (Zaken et al., 2021). Future research could explore the patterns of participation resulting from the new recommendation tool to learn more about the influence of third-party suggestions and features.
The interconnected network on SciStarter and the suggested influence of platform activity on project popularity demonstrates the increasingly important role that platforms play in the citizen science landscape. Many of the projects connected by shared volunteers in this network are not featured together in any location other than the SciStarter website. By sharing the cyber-infrastructure, volunteer recruitment efforts, and volunteers themselves, platforms like SciStarter not only serve as “enabler” organizations (Salmon et al., 2021), but also foster shared management practices that allow for conservation of resources, including the resource of volunteer energy and maximization of scientific and volunteer outcomes (Brudney & Meijs, 2009; Newman et al., 2012; Sharova, 2020). By acknowledging and encouraging the boundary-spanning engagement revealed by our social network analysis, project and platform managers can effectively leverage the phenomenon of multi-project participation in citizen science, potentially extending the impacts of citizen science in society (Lynch-O’Brien, et al., 2021; McKinley et al., 2017; Stein et al., 2023).
Limitations and Future Research
Digital trace data is a helpful source of information, but it is “found” data that does not expressly match the intent of any particular study (Howison et al., 2011). As such, there are several limitations of our research that should be acknowledged. We only had access to data from SciStarter members, which excluded the activity of an unknown number of volunteers that use the SciStarter website without a membership, and who may represent the majority of volunteers in a given project. For example, large projects such as iNaturalist have thousands of volunteers, only a small portion of whom are also SciStarter members (our study only captured activity for 182 iNaturalist volunteers). Because we were limited to monitoring activity on the SciStarter website, our best measure of project engagement came through the use of the “join” button on SciStarter. Joining a project in this way does indicate behavioral intention to engage with a project, and behavioral pledges are associated with actual behavior in other contexts (Costa et al., 2018; Katzev & Wang, 1994). Thus, project joins could be considered a reliable path to conversion and ultimate participation (Crall et al., 2017). Nevertheless, for the majority of projects on SciStarter, we had no record of volunteers’ data contributions after they clicked the join button. However, across other citizen science projects, a significant portion of active participants do not contribute data on a regular basis, confounding efforts to estimate participation rates (Cooper et al., 2017; Fischer et al., 2021). Future research could investigate the citizen science project connection network using data that accounts for a deeper level of engagement (beyond project “joins”). This might include self-reports of project engagement or digital trace data that includes project contributions.
We also have little data on volunteer demographics, as we only have access to what volunteers provided in their SciStarter profiles (most of which were incomplete). This prevented us from exploring demographic attributes in the bipartite network. Research that is able to incorporate volunteer demographics would reveal who is currently engaging with online citizen science platforms (and who isn’t) and should be conducted to guide future project design and outreach efforts (Allf et al., 2022; Pateman, et al., 2021). Other studies have demonstrated that, within single projects, information sharing within social networks varies based on the type of people participating (e.g., volunteers vs. moderators vs. scientists), underscoring the importance of accounting for heterogeneity across networks (Amarasinghe et al., 2021). Attention to other project attributes, such as location or type of engagement (contributory style, co-created, etc.; Shirk et al., 2012) could reveal additional influences on project popularity and centrality that influence volunteer recruitment and movement among projects. Additional research and practice could also incorporate the temporal component of citizen science project engagement, investigating the existence of “gateway” projects, or volunteers’ first projects that lead them into the world of multi-project engagement.
Data for this project were collected from 2017 to 2018. Since this time, SciStarter has migrated to a new url (from .com to .org) and has expanded considerably, in terms of number of the registered users and projects, as well as platform capabilities. For example, as of July 2022, SciStarter.org had approximately 135,000 members, growing from around 51,000 members in December 2018. Additionally, SciStarter.org has added several new features that may influence the connections among projects, including AI-generated project recommendations (Zaken et al., 2021; Zaken et al., 2022), new organization-specific portals and microsites customized to feature combinations of citizen science projects for school systems or companies (e.g., see the page built for Girl Scouts at https://scistarter.com/girlscouts/info; Smith et al., 2024), and a feature called Lists which creates a page with citizen science projects of one’s choosing, adds personalized instructions, and incorporates analytics to track progress of those accessing each project page. SciStarter also updated the functionality to verify project joins and track more volunteer contributions across the rapidly growing number of affiliate projects. Patterns observed and described in our current social network analysis might therefore be expanded with the digital trace data from over 100,000 current SciStarter members. Lessons learned from these approaches, and similar analyses applied to other third-party platforms (Herodotou et al., 2020), could shed more light on the nature of multi-project, multi-disciplinary project participation across the broader citizen science landscape. Longitudinal research could also analyze individual learning trajectories to understand what multi-project engagement means in terms of sustained engagement with citizen science, skill development over time, and broadening connections to science (Allf et al., 2022; Ponciano & Pereira, 2019; Stylinski et al., 2020).
Implications for Research and Practice
Although research is beginning to reveal the prevalence of multi-project participation among citizen science volunteers (Allf et al., 2022; Hoffman et al., 2017; Ponciano & Pereira, 2019), our study is among the first, along with Herodotou et al.’s (2020) study of project connections on Zooniverse.org, to use SNA techniques to describe connections of projects within the larger citizen science landscape. Unlike Herodotou et al.’s (2020) study, our research is the first, to our knowledge, to use SNA to explore project connections across multiple apps, organizations, and platforms (via SciStarter), including numerous offline projects, simultaneously. Social network analyses based on digital trace data show that an interconnected landscape of citizen science projects exists, and that volunteers are active throughout it.
From a project management perspective, our results highlight abundant opportunities for coordinating the management of volunteers across projects in a more collectivist manner, rather than fearing competition withing a scarce pool of volunteers. This may be welcome news to the broader field of citizen science, which is struggling with recruitment challenges (Allf et al., 2022; Fischer et al., 2021). By strategically featuring and promoting certain groups of projects, online platform managers can facilitate and influence connectivity and shared management of volunteers. Just as a project brings together volunteers for a shared goal, platforms such as SciStarter can bring together projects and overlapping volunteers for goals that span individual projects, such as supporting volunteer learning or building social capital. Such a shift could enhance capacity for public participation in science and make science itself a more truly collective and impactful endeavor (von Gönner et al., 2023). Additionally, managers who set project spanning goals could seek to facilitate retention of volunteers within the citizen science field, as coordination among projects could allow for differing designs that respond to volunteers’ shifting motivations over time (Aristeidou et al., 2017; Larson et al., 2020).
Our work also underscores the need for continued research with landscape-level projections to investigate how the larger network of citizen science projects and volunteers interact to influence broader participation outcomes. Such an approach could move beyond descriptive mining and analysis of “big data” to build theories describing participation patterns and outcomes (Kar et al., 2023). For example, SNA has been employed in other contexts such as tourism to predict consumer choices and behaviors (Acharya et al., 2023). Perhaps future studies using SNA could identify the directionality and drivers of volunteer movement among citizen science projects. An enhanced understanding of connectivity across citizen science projects would help both project and platform managers leverage existing capacity and benefit from the resources (e.g., access to data, volunteer energy, and project infrastructure) this larger landscape provides, ultimately increasing the potential benefits that citizen science can provide to volunteers, science, and the larger society.
Supplemental Material
sj-docx-1-sgo-10.1177_21582440241298424 – Supplemental material for Exploring Project Connections Across the Citizen Science Landscape: A Social Network Analysis of Shared Volunteers
Supplemental material, sj-docx-1-sgo-10.1177_21582440241298424 for Exploring Project Connections Across the Citizen Science Landscape: A Social Network Analysis of Shared Volunteers by Sara E. Futch, Lincoln R. Larson, Caren B. Cooper, Bethany B. Cutts, Bradley Allf, Maria V. Sharova, Darlene Cavalier and Cathlyn Davis in SAGE Open
Supplemental Material
sj-docx-2-sgo-10.1177_21582440241298424 – Supplemental material for Exploring Project Connections Across the Citizen Science Landscape: A Social Network Analysis of Shared Volunteers
Supplemental material, sj-docx-2-sgo-10.1177_21582440241298424 for Exploring Project Connections Across the Citizen Science Landscape: A Social Network Analysis of Shared Volunteers by Sara E. Futch, Lincoln R. Larson, Caren B. Cooper, Bethany B. Cutts, Bradley Allf, Maria V. Sharova, Darlene Cavalier and Cathlyn Davis in SAGE Open
Supplemental Material
sj-docx-3-sgo-10.1177_21582440241298424 – Supplemental material for Exploring Project Connections Across the Citizen Science Landscape: A Social Network Analysis of Shared Volunteers
Supplemental material, sj-docx-3-sgo-10.1177_21582440241298424 for Exploring Project Connections Across the Citizen Science Landscape: A Social Network Analysis of Shared Volunteers by Sara E. Futch, Lincoln R. Larson, Caren B. Cooper, Bethany B. Cutts, Bradley Allf, Maria V. Sharova, Darlene Cavalier and Cathlyn Davis in SAGE Open
Footnotes
Acknowledgements
We thank Joe Heimlich and Bruce Lewenstein for their input, Lisa Lundgren for contributing to and guidance in coding projects for this study, and we thank the
team for providing access to the anonymized digital trace data that made this project possible. Thanks are also due to citizen scientists on SciStarter.org for their contributions to hundreds of science projects that supported these projects and allowed us to better understand the landscape of projects.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Darlene Cavalier is the founder of SciStarter and continues to oversee the growth and development of the citizen science platform. The remaining authors have no conflicts of interest to declare.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Funds from NSF AISL grant #1713562 supported this research.
Ethics Statement
Use of secondary data was approved by the NC State University Institutional Review Board (IRB Protocol # 20934) prior to analysis.
Data Availability Statement
Digital trace data used in the analysis for this project are available via Dryad at DOI: 10.5061/dryad.dfn2z34z3
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
