Abstract
Big data trends in health research challenge the oversight mechanism of the Research Ethics Committees (RECs). The traditional standards of research quality and the mandate of RECs illuminate deficits in facing the computational complexity, methodological novelty, and limited auditability of these approaches. To better understand the challenges facing RECs, we explored the perspectives and attitudes of the members of the seven Swiss Cantonal RECs via semi-structured qualitative interviews. Our interviews reveal limited experience among REC members with the review of big data research, insufficient expertise in data science, and uncertainty about how to mitigate big data research risks. Nonetheless, RECs could strengthen their oversight by training in data science and big data ethics, complementing their role with external experts and ad hoc boards, and introducing precise shared practices.
Introduction
In recent years, research using large volumes of data has drastically increased across a variety of fields including data science, physics, biomedicine, psychology, and the social sciences (Leonelli, 2020). This type of research, known as big data research, benefits from merging and harnessing data from multiple sources, generating new insights and unexplored scientific perspectives. In this paper, we refer to “big data research” as any research relying on large datasets, made of data heterogeneous in source, processed at high speed, and analyzed through novel computational techniques (Ienca et al., 2018).
In parallel with these changes in research practice, high profile cases of data misuse have emerged, exposing research participants to privacy breaches and risk of harm (Fuller, 2019). In response, debate has increased about the role and effectiveness of the Research Ethics Committee (REC) as the chief ethical research oversight mechanism in research, given the specific challenges presented by research with big data (Ferretti et al., 2020; Rennie et al., 2020). RECs, also known as Institutional Review Boards (IRBs) and Research Ethics Boards (REBs), were created in the 20th century to protect the safety and interests of human participants in research (Friesen et al., 2019). Today, the REC’s mandate—the regulation of human subject research and the evaluation of key ethics review principles—might fall behind the demands of data-intensive research (Vayena et al., 2016). In fact, big data research is characterized by novel ethical concerns, which can challenge traditional ethics oversight mechanisms and practices (Samuel et al., 2021).
Particularly in the biomedical and health fields, the increasing availability of digital health technologies enables the collection of an unprecedented amount of data (Car et al., 2019). The possibility of using artificial intelligence (AI) and extraordinary computational capabilities to merge, analyze, and harness these data offers great opportunities to improve individual and public health (Blasimme & Vayena, 2019). The potential of AI in medicine has emerged even more clearly during the COVID-19 pandemic, as differently structured data from heterogeneous sources were collected and processed for public health purposes, such as containment, mitigation, and vaccine development (Murray et al., 2020; Ngan & Kelmenson, 2020). A crucial benefit offered by AI technologies is improved prevention and personalized treatment. In fact, AI can extract information related to individual health status by combining data unrelated to health and wellbeing (e.g., location data, blog posts) collected through a variety of tools (e.g., social media, wearable devices) (Vayena & Gasser, 2016). Despite the mentioned benefits, these new research methods and technological developments have numerous downsides. First, they challenge traditional research principles such as data privacy, informed consent, scientific validity of research, risk assessment, and distribution of benefits (Price & Cohen, 2019; Rivas Velarde et al., 2020). Second, they introduce new epistemic challenges related to the assessment of scientific validity, technological reliability, accountability, fairness, and transparency (Friesen et al., 2021). Finally, they challenge the very notion of human participants in research, as they enable retrospective data processing without physical interaction with research participants (Metcalf & Crawford, 2016).
Several questions arise about whether existing regulatory and ethical governance tools, and current practices and expertise of RECs, are adequate to protect human participants and enable ethical research (Ferretti et al., 2021). While some authors argue that the ethical principles and frameworks that traditionally govern research need to be adapted considering new research contexts (Parasidis et al., 2019; Vayena & Blasimme, 2018), studies investigating the perspectives and needs of the involved stakeholders remain scarce. Recent studies (Favaretto et al., 2020b; Samuel et al., 2021) analyzing researcher views on the topic revealed both a lack of adequate expertise among REC members and the absence of clear and consistent criteria for evaluation. Similar conclusions were reached by empirical studies conducted in the UK, Canada, and the US, interviewing REC members about the ethics of social media research and research using pervasive sensing technologies (Hibbin et al. 2018; Nebeker et al., 2017; Samuel et al., 2018). REC members were able to identify emerging ethical challenges related to big data but reported feeling unprepared to address those challenges, and a lack of normative guidance. Although these studies are highly informative, their exploratory and context-dependent nature makes their claims difficult to generalize. Furthermore, it should be noted that ethical oversight practices and research ethics guidelines diverge at the international level because legal requirements differ from state to state (Vayena, 2021).
In Switzerland, research involving human subjects, biomedical data, and biological samples requires the approval of the REC. Most of the research projects conducted in biomedical and health fields are reviewed by Cantonal RECs (Coordination Office for Human Research, 2019). Switzerland counts seven of these committees organized under Swissethics, the association of Cantonal RECs (Swissethics, 2021). RECs apply the legal and ethical rules included in the Human Research Act (HRA), which ensure the dignity, privacy, and health of research participants, as well as the ethical value of the research. Each REC oversees projects in a specific geographical area of Switzerland: two in the French-speaking region, one in the Italian-speaking region, and four in the German-speaking region (Figure 1). While the HRA sets general standards about RECs’ composition, members’ requirements, and review procedures, each REC is organized and managed independently at the Cantonal level. Although the number of members varies across committees, RECs usually include a chair, vice-chair, managing director, and scientific secretary (Swiss Federal Council, 2021).

Distribution of Swiss Research Ethics Committees (RECs) in the Swiss territory and areas of authority.
Typically, research involving anonymous health data or biological samples is not subject to the HRA. Similarly, studies without direct implications for “the understanding of human diseases; the structure and function of the human body; or public health” (Swiss Federal Office of Public Health (FOPH), 2011) are exempted. As a consequence, human subject research in the fields of psychology, sociology, or marketing is exempted from HRA provisions. Several universities have introduced institutional ethics committees to review research projects falling outside the Cantonal RECs’ purview. Nevertheless, the implementation of such intra-institutional local ethics committees is uneven throughout the country, as federal law only provides for the establishment of Cantonal RECs, and universities have no legal obligation to introduce these committees.
While a recent study looked at the experience of Swiss researchers when submitting big data research for ethical review (Favaretto et al., 2020a), no study to date has investigated the opinions and perspectives of Cantonal RECs. Therefore, this study aims to fill this gap, complement existing research, and expand knowledge on the topic by engaging with members of Cantonal RECs. Their direct experience in evaluating big data projects can provide valuable insight into the current primary ethical oversight mechanism in Switzerland, shed light on existing gaps in the mechanism, and pave the way for needed reforms.
Methods
Recruitment and Sampling
For each Swiss Cantonal REC, we interviewed the chairperson (or vice-chairperson or managing director) and, whenever possible, one scientific secretary. Committees were identified through the Swissethics website. The invitation sent to each chairperson included the following: the outline of the research and research aims; the interview methodology and a preliminary timeline; the informed consent form and details about safeguards in place for data protection and confidentiality; and the research team contacts. The response rate was 100%. All Cantonal RECs (n = 7) responded to our email and participated in our study. Prior to recruitment, we obtained approval to conduct this study from the responsible REC.
Interviews
Between October 2018 and May 2019, MI, AF, and MRV conducted semi-structured interviews, either face-to-face or via telephone. After written and verbal consent, each interview was recorded, and lasted between 35 min and 1 h. Interviewees could specify their preference for the interview language (French, German, Italian, English, or a combination.) We completed a total of seven interviews with 13 interviewees. Across RECs, interviewees shared similar disciplinary backgrounds (Table 1).
Demographics.
Each interviewee can be expert in more than one field.
MI developed the interview guide (Appendix 1), which was vetted by AF and EV and approved by the research team. These interviews aim to investigate the perspective of the Cantonal RECs on (1) how to define big data research; (2) their experience with reviewing big data projects and with the ethical guidelines used for the assessment of big data research; (3) the peculiarities of big data research, namely, its benefits and challenges; and (4) the needs of RECS in order to adequately address big data research challenges (e.g., high-level recommendations, procedural good practices, education, training).
Analysis
We transcribed verbatim the audio files in the original language of the interviews with the support of Sonix online software. Three interviews were in English, three in German, and one in Italian. To increase data consistency and reduce selective bias, we translated the non-English transcriptions into English with the assistance of DeepL Pro online software and additional human review. AF, MI, and MRV thematically coded and analyzed the data with NVivo 11 Software. Each interview was coded independently by two researchers using a combination of inductive and deductive reasoning for theme development (Fereday & Muir-Cochrane, 2006). While the deductive analysis traced the themes listed in the interview guide, the inductive analysis allowed the expansion of the list of themes, by adding those that emerged from coding the interview content.
The data analysis was performed in two steps. First, major themes of interest were identified and categorized (please refer to the Results section). This phase was duplicated by two researchers, and any disagreement was resolved with a third researcher. Second, the themes were analyzed in depth through discussion among the researchers, and adjustments to the final thematic map were made to improve logical cohesion. The result of this analysis is detailed in the following section.
Results
Our analysis identified four recurrent themes and several subthemes, which are summarized in Table 2. These themes mirror the research questions addressed by this research, namely, (1) what is RECs’ understanding of the “big data” concept? (2) What is the ethics review process currently in place for big data research? (3) What are RECs’ perspectives about the benefits and challenges of big data research? (4) What are RECs’ needs in the big data era?
Overview of Interview Themes and Subthemes.
Note. REC = Research Ethics Committee.
Characteristics of Big Data Research
The interviewees displayed variation in interpreting the concept of big data research, often deviating from the definition proposed in the introduction. Consistent with existing literature on the topic (Ienca et al., 2018; Jin et al., 2015), Interviewee 10 observed “there is much talking about big data but no unanimous definition.” The majority of interviewees mentioned the three versus (volume, variety, and velocity) characterizing big data, particularly stressing volume and variety. To me, what is relevant is data volume… the fact that there is an increasing amount of data in research files or databases. In addition, it is important where and how these data come from (Interviewee 10) I think that what would qualify as big data approach […] is if data are being analyzed using artificial intelligence and other analytic approaches that usually are not used for the regular project that we are evaluating - where normal or ordinary [statistical] methodology is applied. Here, with the big data, you are getting into a new dimension. (Interviewee 12)
Although interviewees could formulate definitions of big data research, they were confused about the line between traditional and big data research (“when does a biomedical project start to be a big data project?” (Interviewee 3)). Interviewee 4 said that medical research always collected and relied upon voluminous datasets. Therefore, it is only a matter of interpretation whether traditional research is considered big data research: in cancer research it is common to integrate many patients’ pathology data with x-rays or other imaging data, and genetic information. This happened already in small projects; but now these projects are viewed as big data projects (Interviewee 4). When you talk to me about big data my idea goes more to databases, or biological sample banks that collect a huge amount of data and for which there is no purpose. […] I think big data means analyzing a huge amount of data from various sources but without a precise purpose in mind. (Interviewee 8)
Current State of Ethical Oversight in Big Data Research
Six of seven Cantonal RECs reported previously reviewing and assessing big data projects. Nevertheless, our respondents emphasized that, so far, this had occurred rarely, only a few times a per year. Moreover, none of these studies were explicitly labeled by researchers as a big data project. Interviewees acknowledged their limited experience in reviewing big data research and speculated that this is because a few of these projects had taken place in Switzerland so far. However, REC members anticipated that this trend would evolve in the future, especially with the creation of new biobanks and more medical data from electronic health records. Furthermore, interviewees highlighted the limits of their oversight power in the big data context. Their precisely defined mandate might be a reason they only rarely reviewed big data projects. In fact, Cantonal RECs’ research purview is restricted to biomedical and clinical projects involving humans, and human biological data and samples. For instance, big data studies collecting social media data or anonymized data in the fields of social sciences and psychology would fall outside Cantonal RECs’ review: They [the not-strictly biomedical projects] are, so to speak, in the grey area: the conventional ethics committees are not responsible for them, but it is still completely unclear which oversight mechanisms should be applied (Interviewee 1) I am really wondering actually whether the researchers doing research on big data are willing to come and ask for our opinion. I will not be so surprised to learn that there are researchers that have actually conducted research on big data without coming to us, and I believe that under the legal point of view they may have some arguments. (Interviewee 13) For now, there is no evaluation grid for analyzing these studies involving big data. […] The purpose of the study, the scientific question to which the study responds, is fundamental, and is one of the factors that we take into account. (Interviewee 8) I think for that what we are seeing at the moment… we have a law, we have data protection rules, we have the Human Research Act here and I think the regulations we have can apply for this kind of research [big data research] as well as for other types of research. So, we should not make any difference at the moment. (Interviewee 3) I mean, of course big data shows that issues are more pressing to answer. But the pending questions…we have identified them, even though from a different point of view. […] for each of those issues I can provide examples in traditional research that are already raising those questions. (Interviewee 12) I don't want big data to be defined any differently than other requests […] only because it’s called big data… for me it is not fundamentally different than a normal request. (Interviewee 4) …if you've done this [assessing projects] for so many years now, you have a certain routine. But with big data and AI, if we don't even know what the risks are, how can we assess and approve them? (Interviewee 1) One problem is that there is no control for data quality in self-collected self-tracked data. All those medical apps, all those devices. There is no quality control for that. Who is ensuring, checking the quality of the data they generate? Same for social media… The quality of those data is not, at least not always, identical as in conventional research. (Interviewee 10) If there is not a sufficient legal framework… projects involving big data are only interpreted from an ethical point of view. And ethical interpretations from one committee to another may vary. The lack of a precise frame allows you to have more interpretations – which are always interesting – but could create problems. (Interviewee 9)
Implications of Big Data Research
Overall, our respondents indicated a variety of benefits and challenges associated with big data research (summarized in Figure 2).

Benefits and challenges of big data research discussed by Swiss Cantonal Research Ethics Committee (RECs).
Concerning the former, nearly all REC members flagged the importance of big data studies for increasing scientific knowledge and generating public benefit; “I certainly believe that the public health dimension and the public benefit of big data has to be stressed and has to be encouraged” (Interviewee 12). On a similar note, Interviewee 3 viewed big data as a chance for the scientific community to tackle broad research questions: I think that the most important benefit is moving away from research on small data packages…I think if you merge these data together you will have a much better chance to have a good research. (Interviewee 3) Generally, as I have seen the projects so far, the individual does not benefit directly from the research. Data are used to improve prevention and find new therapy….so the benefit is shifted into the future. (Interviewee 4) …all of a sudden you notice you can find things that you shouldn't have. Big data linkage creates more problems to ensure people’ dignity and data confidentiality. These information are precious to people and should not be put in danger. (Interviewee 13) Is anonymization possible or not?…Or is it just a word that is not true anymore because it's so easy to identify people behind [the data]? (Interviewee 3) It's just hard to agree to a declaration of consent online. These terms and conditions just require you to click and accept, but nobody reads them. That is not an informed and good consent. (Interviewee 1) I think to do research in a responsible manner we should say which data are used for what, and give the owner of data or samples the chance to choose each time. (Interviewee 3) The law should be changed to resemble the system they have in Scandinavia where, for research purposes, the access to personal data and samples is guaranteed by a presumed consent. Of course, you need to have a democratic and human rights system in place to allow for presumed consent. (Interviewee 12) I am not so sure we need consent for all data. […] We should only protect sensitive data, hence make sure we obtain consent for those. […] People freely “leave their traces” around the web, giving their information for free to companies while using apps and online services without being concerned. Why should researchers be more concerned? (Interviewee 10)
First, the exploratory nature of big data research and the numerous possibilities for data linkage make anticipating the risks very complex (“another issue is the unforeseeable risk…because as of today we can't tell what we're going to find out about that person through big data analysis” (Interviewee 8)). Interviewee 3 spoke about the incremental risk of managing incidental findings (“How are you (researcher) dealing with incidental findings? Is there still a possibility to report them back to the patients or not?”), due to the large volumes of data which are combined and analyzed. This is especially critical since RECs review research intentions, but do not control the outcomes (“we only see a project on paper at the beginning and then actually implement it. That's kind of out of our hands then” Interviewee 5)
Second, the use of analytics tools like opaque AI algorithms “that nobody at the end understands” (Interviewee 3) increases the chance for unclear and incorrect data processes. In turn, these processes can result in “wrong conclusions” (Interviewee 7): “moving away from, let's say, a kind of research where you are looking for causality we are diving much more into an area where you just look for correlations, which may be coincidences” (Interviewee 3).
Third, the chances of data hacks and de-identification are hard to anticipate, especially when private companies are involved “and there are a lot of secrets around these (data protection strategies). (…) how do you evaluate the quality of protection when you don't know how much they are subject to attacks, how many of those attacks are successful, and what are the steps taken against those attacks?” (Interviewee 13).
Finally, the presence of private actors in big data research makes determining fair distribution of benefits more complex; “we have to be careful about the fact that big data (research) is not a way of monetizing on our data by big companies…you see it already…they take all of our data and so on and make profit out of that while it is a public good” (Interviewee 12)
Given the broad spectrum of potential but unclear risks emerging in big data research, the majority of interviewees were dubious when asked to define the threshold of minimal risk: I mean…if you see the potential for data abuse which is here…and what has already happened…then you can't even speak of minimal risk! (Interviewee 1)
Needs of RECs in Big Data Research
Overall, committee members agreed about not having sufficient experience or expertise in technical areas, such as big data analytics or computer science. These weaknesses emerged when trying to understand (“We can't understand that at all” (Interviewee 7)) or assess biomedical big data projects ((“with big data research we simply do not have the know-how” (Interviewee 6)). Interviewees’ concerns predominantly centered around the speed at which new technology evolves. This constant change makes it virtually impossible to have sufficient experience and insight to judge projects with a high degree of certainty: New algorithms with artificial intelligence… I have no experience with this and how to deal with this in the future this is an open question. (Interviewee 3)
Regarding the first, almost all respondents expressed an urgent need to fill the REC expertise gap, recognizing expertise as a crucial factor in effectively fulfilling the oversight mandate. Interviewees expressed interest in targeted trainings discussing characteristics, risks, and ethical implications of big data projects and AI applications in biomedicine. Interviewee 4 suggested conducting these trainings in a dynamic format, offering case studies and mock projects to analyze. Meanwhile, other respondents further highlighted that improving REC members’ knowledge and allowing for greater exchange could increase review standardization within and across committees: it would be really good to show examples of how the projects are built and which algorithms are on the back of the analysis and how to they are put together […] I need concrete examples… […]The case studies should come from the people who do this…the researchers….to get a proper understanding. […] then, there should be a discussion among the ethics committees on how to deal with these case studies (Interviewee 4) I think they have to be members, so that we do not have to go and get an expert for an opinion every time (Interviewee 6) I am not convinced that introducing a technical figure can be a solution. Rather get training for the whole committee. (Interviewee 9) It is important to define what is good big data research, what must be done when conducting this type of research, and what is optional […], which methodology is acceptable and which unacceptable. (Interviewee 6) I think we have to look whereby certain standards are fulfilled and sometimes we get research application where it's not clear what is the research question. (Interviewee 3) We say: OK let's look what these researchers want to do and how can we do it in the best way so that they do not hurt people. (Interviewee 12)
Finally at the regulatory level, interviewees discussed whether the scope and mandate of RECs should be expanded to cover those big data projects currently outside their purview. From the respondents’ perspectives, these projects might still carry negative consequences for individual health and wellbeing, as well as for broader society. Although REC members had a favorable view of the option of expanding the REC mandate, they also highlighted two crucial points. First, the aim of RECs should align with society’s expectations and values. Political and health authorities, as well as RECs, should engage with society to define the boundaries of RECs’ scope (what should be reviewed or not) in light of new technological advancements. Only as a result of this democratic debate should the law be adjusted. Our role is to protect the individual and to decide what is in the interest of a society…Committees should agree with the society about what should be permitted and what not […] we need a clear and harmonized understanding of the role of RECs and what is legally required. (Interviewee 3)
The second point is of a pragmatic nature. As RECs do not have the capacity and expertise to review highly technical studies or studies outside the biomedical field, most respondents agreed on the idea of introducing specific oversight boards to assess the technical features of projects involving big data and AI. These boards could complement—rather than substitute for—RECs and find their place alongside those already supervising data uses (such as data protection legal offices and data safety monitoring boards). I can imagine that an external body with certain skills could be useful….to evaluate the technical aspects that we do not consider […] its skills complements our evaluation. (Interviewee 8) Possibly on the long term we are going to need something like “big data board”…possibly. I do not think they could replace RECs…they will be rather complementary. (Interviewee 10) This should be carefully considered […]. I always struggle with too many parallel structures […]…in the end we have a forest of ethical institutions and nobody knows anymore what is really well reviewed. (Interviewee 1)
Strengths and Limitations
While the methodology of qualitative interview analysis allows for the detailed exploration of opinions and perspectives, the same study design challenges the generalization of the conclusions. However, although the findings of this study are confined to the Swiss context, the fact that we interviewed members of all seven Cantonal RECs made it possible to represent the full spectrum of cultural variation that exists within the country. Furthermore, since the Swiss ethical oversight mechanism partially resembles those of other European countries (e.g., neighboring Germany, Austria, France, and Italy) and internationally, a certain degree of generalization of results could be justified.
In this study, a selection bias may have arisen from including only the views of Cantonal RECs. Although other ethics committees exist in Switzerland (e.g., the national ethics committee and institutional review committees within universities), this study focused on big data research in the biomedical and health field, which is usually reviewed by Cantonal ethics committees. The fact that only the chairperson/vice-chairperson and one scientific secretary per REC were interviewed may also have introduced a bias into the study. Nevertheless, one must consider that chairpersons, in practice, set the agenda for the committee, and scientific secretaries first review and evaluate research protocols. Therefore, we believe that their perspectives and comments have provided valuable insights into the ethics of research with big data in biomedical and health settings.
Finally, the fact that the interviews were conducted before the COVID-19 pandemic can be interpreted as a limitation. Indeed, the pandemic has increased pressure on RECs, especially for reviewing public health projects that leverage the power of big data and AI. Nonetheless, the results of this study transcend the temporality of current research conditions, as they relate to the complex oversight system of Cantonal RECs, which is not evolving as rapidly. Future research could explore which processes and functions of Cantonal RECs have changed as a result of the COVID-19 pandemic.
Discussion
Our findings reveal four main areas of ethical significance. First, the lack of specific normative standards for the ethics review of big data studies. Second, epistemic challenges faced by REC members, specifically insufficient experience and expertise. Third, normative ethical challenges related to the scope of ethical reflection on big data, as conceptual tools traditionally used to assess biomedical research appear increasingly inadequate to assess unforeseeable and novel risks generated by big data studies. Finally, proposals for reform emerged from our analysis, including both conservative reforms (e.g., building capacity and promoting data literacy among REC members) and more radical reforms, such as complementing RECs with data-focused oversight bodies. In the following, we provide a detailed analysis of these themes.
Lack of Specific Review Standards
Although REC members share a general idea of what constitutes big data, they lack a precise common definition and clear guidance on how to recognize these studies in practice. As previous studies indicated, REC members’ uncertainty could result in inconsistencies across committees (Favaretto et al, 2020b; Vitak et al., 2017). Moreover, the way in which interviewees define big data can influence their assessment of the most pressing ethical challenges. By using narrow definitions of big data—namely focusing on one or few characteristics (e.g., data source, data volume)—RECs may be more sensitive to some ethical implications than others. It is relevant to note, however, that disagreement on a definition of big data is secondary to a lack of tailored standards for reviewing big data research. Our results, in line with previous research, highlight the lack of specific ethical guidelines for evaluating big data projects and thus the application of traditional ethical frameworks in the evaluation of all projects without distinction (Ienca et al., 2018). While some interviewees believed that the lack of specific ethical guidance does not negatively impact ethical review practices, others expressed concern about having to interpret and judge big data research on a case-by-case basis without guidance. These diverging opinions might mirror different RECs approaches to ethics review, as well as RECs’ members’ confidence levels, experience in managing big data projects, and expertise in the technical disciplines (i.e., data and computer sciences). RECs’ diverging interpretations may result in disharmonious evaluations and decisions across committees, which could negatively affect researchers’ trust in the oversight system, data sharing practices, and research collaborations (Ballantyne & Schaefer, 2020; Dove & Garattini, 2018; van den Hoonaard & Hamilton, 2016). Although a lack of transparency about evaluation procedures and inconsistencies across RECs’ judgments are not exclusive to big data research (Lynch et al., 2020), our findings show that these limitations in REC practices continue to hamper research.
Limited Experience and Expertise
REC members acknowledged their limited experience in dealing with big data projects and inadequate expertise about fundamental technical aspects characterizing these studies. REC members recognized that their narrow mandate diminishes their oversight function in big data research. In fact, the narrow boundaries of HRA result in only a portion of big data projects conducted at the national level coming to their attention (von Elm & Briel, 2019). However, unless the law is amended to expand the purview mission of the ethical oversight mechanism, Cantonal RECs have no choice but to invite researchers to submit their research voluntarily. Some studies have suggested that RECs should engage researchers in a dialogue to make them aware of and accountable for the consequences of their research (Holland, 2018). Concerning RECs’ insufficient expertise around technical features of big data research, our interviewees were aware of their shortcomings. Most REC members expressed both willingness and commitment to implementing strategies to overcome their weaknesses (e.g., involving data specialists in the assessment of big data research, or attending trainings to increase their technical competence). Furthermore, our findings partially align with the results of a recent study focusing on IRB staff’s perspectives in the United States (Vitak et al., 2017). The IRBs surveyed believed that over time they would surmount their shortcomings in assessing the technical aspects of big data research proposals, through cumulative experience. Yet, the rapidity with which AI technology and big data applications evolve further complicates RECs’ and IRBs’ attempts to get up to speed in their subject matter knowledge (Nebeker et al., 2019; Prosperi & Bian, 2019).
Scope of Ethical Reflection
Our findings reveal that REC members are overall well informed about the benefits and challenges brought about by the advent of big data and data analytics techniques. However, they disagreed on which challenges are the most pressing and which tools are best suited to address them. These different opinions among interviewees might be explained by their background, personal bias, and the lack of training in big data research. The fact that many interviewees focused on how to adapt and improve the informed consent tool, and implement in the most rigorous way the existing data protection regulations, may signal a problem. Some authors flag the risk of viewing these tools as ethics panacea (Babb, 2020; Corrigan, 2003). While regulating data re-uses and operationalizing informed consent remain unresolved issues, privacy-focused ethical oversight may be insufficient to address other challenges raised by big data, concerning, for example, justice, dignity, and fairness (Ballantyne, 2019; McKeown et al., 2021). Our results highlight this gap in current ethical oversight, as respondents expressed concern about how to balance the risks and benefits of projects. The traditional ethics tools used to assess biomedical research are inadequate and ineffective when assessing unforeseeable and novel risks (Sheehan et al., 2019). This concern, which remains unresolved for the time being, underscores the need for a broader conversation in society about the importance of big data research, and its uses in terms of our collective interest (McMahon et al., 2020).
Proposals for Reform
Our results shed light on the limitations of the current mechanism of Cantonal RECs, in terms of skills, practices, and guidelines. REC members—aware of these shortcomings—suggested possible solutions to tackle them. The interviewees’ request for training on big data and AI reveals interest in expanding their knowledge. In addition, the practice of involving experts to fill RECs’ expertise gap can be seen as an attempt to offer assistance to researchers (Huh-Yoo et al., 2021). Interviewees’ desire to improve the status quo of ethical review is further evidenced by their suggestion of creating complementary oversight mechanisms (e.g., big data boards), to review the technical aspects of projects and highlight inherent risks, while keeping pace with the fast-changing nature of research. Some interviewees imagined these boards serving as an accreditation mechanism, to certify the quality of a project’s technical features. These boards could operate across disciplines, to certify research in private and public sectors, regardless of data types and sources. Consequently, fewer big data projects would be left without any sort of oversight. Finally, our interviewees strongly defended the role of RECs as a key mechanism for ethical review in research and spoke against overturning the entire system by introducing new high-level principles or laws. Nevertheless, REC members would welcome more operationalizable guidance on what constitutes a good big data project. Therefore, future research and initiatives should aim to fill this gap by offering ERCs practical guidance for orienting their judgment in the field of big data research.
Best Practice
Swiss Cantonal RECs should be reformed if they are to be effective in the big data research context. In this paper, we argue that these reforms should involve not only the practices of REC members, but also their expertise and the regulations that define the mandate of RECs. Ethics oversight mechanisms outside the Swiss context might benefit from similar revisions. In addition, this study suggests that researchers be proactive in reaching out for RECs’ opinions and aware of their responsibilities when conducting research. However, the efforts of researchers must be supported by a system of clear rules and ethics training put in place by a network of actors (such as policymakers, universities, and funding bodies) (Samuel et al., 2019).
Research Agenda
In this paper, we reported the perspectives of Swiss cantonal RECs on the challenges they face in reviewing big data projects and their needs in order to adequately address these challenges. We believe this analysis contributes significantly to the existing literature as it is the first qualitative study to survey Swiss RECs about their experiences and views on this topic. Interestingly, our results align with the literature at the international level. More research is required to explore the need for globally shared ethical standards for conducting research with big data. In fact, as interdisciplinary and cross-country big data projects increase, the scientific community may need not only clear common data governance, but also a shared vision about what an ethically aligned big data project consists of (ÓhÉigeartaigh et al., 2020). The recent COVID-19 pandemic exemplified how divergent laws governing research, unclear ethical evaluation methods, and unrobust oversight mechanisms can slow down research processes, jeopardize efforts for public health, and reduce public trust in scientific institutions (Gardner et al., 2020).
Educational Implications
Our results emphasize the need for knowledge exchange and a more productive engagement among the various factors involved in big data research. These include and are not limited to RECs, researchers, research institutions, private enterprises, citizen science groups, and the public (Vayena & Gasser, 2016). In particular, if on the one hand REC members should acquire more technical skills about, for example, data analysis methodologies and AI-enabled technologies, researchers should also be more informed about the value of and the necessary steps for conducting research ethically. The dynamics of collaboration between RECs and researchers should not only be aimed at fulfilling the requirements imposed by law (i.e., ensuring compliance), but also at increasing mutual knowledge through an open dialogue and positive attitude towards learning. Scholars have argued that positive (although maybe not perfect) actions and responsible big data research can emerge only by asking difficult questions and through transparent confrontation on diverging perspectives (Zook et al., 2017). Finally, our research findings indicate the crucial importance of informing society about issues related to big data and the use of AI in research. Starting with this democratic engagement, the general public can clarify their expectations regarding research with big data and thus inform the decisions of other actors involved.
Supplemental Material
sj-docx-1-jre-10.1177_15562646211053538 - Supplemental material for The Challenges of Big Data for Research Ethics Committees: A Qualitative Swiss Study
Supplemental material, sj-docx-1-jre-10.1177_15562646211053538 for The Challenges of Big Data for Research Ethics Committees: A Qualitative Swiss Study by Agata Ferretti, Marcello Ienca, Minerva Rivas Velarde, Samia Hurst and Effy Vayena in Journal of Empirical Research on Human Research Ethics
Footnotes
Appendix 1: Interview guide
Acknowledgments
We are grateful to our interviewees who kindly spent their time to take part in this research. We also would like to thank Dorothee Caminiti who assisted MI and AF during the completion of the interviews. We are grateful to Shannon Hubbs for her editorial suggestions.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (Grant No. 407540_167223).
Ethics Statement
Prior to recruitment, we obtained approval (EK 2017-N-74) to conduct this study from ETH Zurich’s Research Ethics Committee.
Authors’ Contributions
MI, AF, and MRV conducted and analyzed the interviews and compiled the results. AF drafted the manuscript. All authors contributed to the study design and the development of the final manuscript and approved the submitted version.
Supplemental Material
Supplemental material for this article is available online.
Authors Biography
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
