
Keywords

nonresponse, respondent engagement, survey climate

1. To Start

The questions raised in the title are (of course) more teasers than real research questions. I am not a sociologist. But even if I were, such questions would be too bold and too general to answer. However, remarkably, response rates have dropped at a similar pace for all age groups. Today’s response rates for the elderly, when invited to interviewer-assisted surveys at Statistics Netherlands, resemble those of twenty years ago. For instance, the response rates for persons sixty-five to sixty-nine years old were typically around 65% just before COVID-19, which is similar to the overall response rates in the Netherlands in the nineties. Luiten et al. (2020) show that since the mid-eighties the average drop in international response rates has been 0.5% to 1% per year. This at least raises the speculation that we did not lose respondents, but failed to encourage new ones.

So what happened in the past forty years? At the birth of JOS I entered secondary school. Halfway, at JOS’ twentieth anniversary, I had just started at the methodology department of Statistics Netherlands. Today, I am still at Statistics Netherlands, and with Utrecht University researchers, I have tried to venture into new types of surveys, termed smart surveys. I would mark the first period as “the hunt for the last respondent” and the second as “the hunt for the best respondent.” I advocate labeling the new period as “the hunt for the engaged respondent.” Table 1 gives a summary of what the world looked like for me at the three time points. It is subjective, but there are some keywords to take away from it: digital, online, anywhere-anytime, multi-channel communication, individual, computational power, and population diversity. With some caution and some time lag, survey institutes have been trying to follow these changes.

Table 1.

Some Typical Features of the 80s, 00s, and 20s.

At the start of JOS (80s)	JOS’ 20th anniversary (00s)	JOS’ 40th anniversary (20s)
Two TV channels	Many TV channels	Digital TV anytime, anywhere; digital news(papers)
Paper newspapers	Newspapers in dire need	Social media as source of news and influencers
No Internet	Internet, but basic browsing	High-dimensional, relatively cheap online activity anywhere anytime
	Emergence of social media	Social media diverse and massive
Social clusters based on religion, income, region	Fading boundaries between traditional social clusters.	Even stronger cultural diversity
Relatively modest cultural diversity	Strongly growing cultural diversity and new clusters	Societies becoming more nationalistic
	The world is opening up to everyone in media	Internet and social media are universal
Postal mail	Mail goes hybrid, paper and digital	Postal mail is more or less gone
Holiday cards	Contact home from anywhere	Location tracking and contacting anywhere
Paper communication	Email/text communication	Chatting and other forms of instant, short communications
Landline phones	Landline phones on return	Landline phone mostly gone
	Mobile phones	Mobile phone becomes smart
Shops closed at night and on Sundays	Shops have wider opening hours	Some types of shops almost vanish
Mail-order shopping	Emergence of online shops	Large-scale online shopping
Physical banking only	Banking through secure websites	Banking smart, payments digital
Population registers	Digital registers but still manual/paper data entry forms	Digital data entry for registers; emergence of big data, sensor systems, IoT
Long computation times	Rapidly growing computer performance	Immense computational power
Very modest digital storage options	Personal digital storage options on portable devices	Almost unlimited digital storage options available to anyone
	Emergence of AI	AI entering daily life

But has all context changed? Hardly. What has not changed are societal issues, public debate, and complex political decisions. In fact, some of these issues, such as environmental concerns, societal segregation, and large-scale migration, have become more pressing. So the need to think jointly about where to go is, perhaps, greater than ever. Hence, one may question whether people really no longer care. The rise of more nationalistic and protective tendencies in many countries is indirect evidence that general populations are opinionated. There may be a belief, though, that survey stakeholders and survey designers are not the ones who will bring change to these issues.

In the following sections, I will look at each of the three time periods.

2. The Hunt for the Last Respondent

For quite some years, nonresponse was more of a nuisance than a big problem. It was the time of the adjusters. The established sampling theory was modified to include unit-nonresponse. A regression estimator became a modified regression estimator and a Horvitz-Thompson estimator became an inverse propensity weighting estimator. The focus was more on the technique itself than on the auxiliary information contained in the adjustments. It led to seminal works such as Bethlehem and Kersten (1985) and Lundström and Särndal (1999).

For some reason that I have never managed to fully understand, response rates in the Netherlands were already at 60% to 65% in the nineties, levels that other countries reached only much later. When I started, the Netherlands was called by some the “world champion” in nonresponse. It was an illustrious honor that prompted an upsurge of research activities and explicit programs.

However, response rates were on the decline in many countries. Only some five years after the birth of JOS, the International Workshop on Household Survey Nonresponse was established. It brought together the big names in survey research such as Bob Barnes, Bob Groves, and Lars Lyberg. The workshop had the sense of the “hunt for the last respondent,” a title that Ineke Stoop gave to her PhD dissertation in 2005. It brought together adjusters and reducers and led to a gradual shift from the adjusters to the reducers (who called the adjusters the “Greek people” because of the many formulas). In a number of field experiments “extreme” tactics were made available to interviewers. The objectives were to find the limit in what response rates could be achieved and whether that made any difference to statistics. Accounts of this are given in Voogt and Saris (2005) and Billiet et al. (2007). Interviewer tactics followed the ideas of tailoring and maintaining interaction, for example, Groves and Couper (1996).

Statistics Netherlands had its own version of the hunt for the last respondent. In 2005, it fielded an experiment in the Labour Force Survey (LFS) with two experimental arms. In one arm, the basic-question approach was applied: nonrespondents were offered a very short questionnaire. In the other arm, a call-back approach was applied: the best-performing face-to-face interviewers were selected, could offer large incentives, and were temporarily given a low workload so that they could schedule visits optimally. Table 2 shows the results in comparison to the regular LFS. The R-indicator measures the variation in response propensities across subgroups identified by linked administrative data. The coefficient-of-variation does the same, but penalizes low response rates. Statistics Netherlands had one source of comfort in its championship: the relatively large amount of linkable administrative data. In Table 2, the indicators are estimated for a basic and an extended set of auxiliary variables. The basic set consisted of age, migration background, urbanization, and type of household. The extended set also included registered (un)employment, region, average house value at ZIP-code level, various binary indicators for receiving forms of social allowance, and a binary indicator for having a registered landline phone. The call-back approach led to a strong improvement in representation, while the basic-question approach had little impact. In fact, the extended set of auxiliary variables showed that the basic-question approach further increased contrasts. The experimentally achieved R-indicator of 0.85 and coefficient-of-variation of 0.09 for the extended set led us to recommend at the time that these values be used as benchmark targets in data collection. This recommendation was not adopted. It was still too early for “hunting for the best respondent.”

Table 2.

Response Rate (RR), R-Indicator (R), and Coefficients-of-Variation (CV) for the Dutch Labour Force Survey (LFS) 2005 Regular, with a Call-Back-Approach and with a Basic-Question-Approach. R and CV Are Estimated for a Basic Set and an Extended Set.

Dataset	RR (%)	R (basic set)	CV (basic set)	R (extended set)	CV (extended set)
LFS2005	62.2	0.824	0.141	0.801	0.159
LFS2005 call-back	76.9	0.861	0.090	0.851	0.091
LFS2005 basic-question	75.6	0.828	0.114	0.780	0.146
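The two indicators can be made concrete with a minimal sketch. The essay does not spell out the formulas; the code assumes the definitions commonly used in the R-indicator literature, R = 1 − 2S(ρ) and CV = S(ρ)/mean(ρ), where S(ρ) is the standard deviation of the response propensities. It uses an unweighted population standard deviation, which is a simplification, and the two-subgroup propensities are hypothetical. Under these assumptions the basic-set CV values in Table 2 can be approximately recovered from the reported R and response rate.

```python
from statistics import fmean, pstdev


def r_indicator(propensities):
    """Assumed definition: R = 1 - 2 * S(rho); 1 means equal propensities."""
    return 1.0 - 2.0 * pstdev(propensities)


def coefficient_of_variation(propensities):
    """Assumed definition: CV = S(rho) / mean(rho)."""
    return pstdev(propensities) / fmean(propensities)


def cv_from_r(r, response_rate):
    """Back out S(rho) from R, then divide by the response rate."""
    return (1.0 - r) / 2.0 / response_rate


# Hypothetical propensities for two equally sized subgroups.
toy = [0.5, 0.7]
print(round(r_indicator(toy), 3), round(coefficient_of_variation(toy), 3))

# Basic-set rows of Table 2: (response rate, R, reported CV).
table2_basic = {
    "LFS2005": (0.622, 0.824, 0.141),
    "LFS2005 call-back": (0.769, 0.861, 0.090),
    "LFS2005 basic-question": (0.756, 0.828, 0.114),
}
for name, (rr, r, cv_reported) in table2_basic.items():
    # Derived CV next to the published one, for comparison only.
    print(name, round(cv_from_r(r, rr), 3), cv_reported)
```

The division by the response rate is what makes the CV penalize low response: the same spread in propensities yields a larger CV when fewer people respond. Published estimates typically use design weights and may include further adjustments, so only an approximate match should be expected.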

The slow but gradual shift from adjustment to reduction was paralleled by sophisticated programs aiming at total survey design. Surveys were becoming more costly, especially due to efforts to convert nonrespondents. Budgets needed to be weighed against other design choices. A broader scope, as explained, for example, in Dillman (1996) and Biemer (2001), was needed. It was advocated in the International Total Survey Error Workshop (ITSEW). It took a while, however, to move from the last respondent to the best respondent.

3. The Hunt for the Best Respondent

It became clear that a blind focus on more response was not going to work, if only because of survey budgets. It was the stepping stone to what I would call the era of the hunt for the best respondent. The ideas of tailoring were further advanced, and the notion that one size fits all was legitimately and explicitly abandoned. This shift was made possible by three developments. The first was the rapid rise of the Internet, which was adopted as a survey mode. It (allegedly) gave more “levers” to apply. The second was the digitalization of administrative data. These data allowed for the necessary differentiation in treatment, although they are not accessible in all country contexts. The third was the advance of survey case management systems, which allowed for faster and more manageable design changes. These systems also allowed for the structured collection of paradata, an alternative source for differentiation.

The interest in differentiation also implied the rise of more specific workshops. Workshops on adaptive and responsive survey designs and internet studies/online research came up. The number of conference sessions on mixed-mode survey design and the use of paradata grew substantially.

A first step, and one that is still ongoing, is forensics into the entire “(non)respondent journey.” Paradata were studied much more extensively and became important ingredients in survey monitoring dashboards. Good examples are Sakshaug and Kreuter (2011) and Lynn et al. (2014). While giving very useful insights, paradata tend to be only weakly informative for self-administered survey modes. There is simply no sign of life from nonrespondents, making it hard to break down nonresponse into its main causes: non-contact, inability to respond, language barriers, and refusal. This led to various comparative mode studies, for example, Vannieuwenhuyze et al. (2014).

A second step is the formalization of “the best” respondent. Tailoring fieldwork, apart from levers and differential respondent behaviors, needs indicators of yield and cost and a reproducible tactic to glue all of those together. Response rates were supplemented with other indicators. The new indicators formed the basis for adaptation strategies, ranging from case prioritization to full mathematical optimization protocols. Schouten et al. (2011) and Lundquist and Särndal (2013) give accounts. Work in this area is still ongoing, with growing sophistication, for example, Wagner et al. (2024). But, and this is an important achievement, it meant that reducers and adjusters became (re-)united. Both disciplines are needed.

Statistics Netherlands adopted R-indicators and coefficients-of-variation. The coefficient-of-variation became the default objective function in the mathematical optimization problems of adaptive survey design. From 2018 on, household surveys were gradually converted to adaptive designs with the survey mode as the main design feature. Table 3 shows the indicators for the Dutch Health Survey. The survey migrated from a face-to-face survey in 2010 to a web-face-to-face mixed-mode survey in 2011 and to an adaptive mixed-mode survey in 2018. Table 3 confirms that response rates gradually drop. In the Health Survey, the drop occurs in the interviewer part. This is to a large extent due to the adaptive survey design, which reduces effort for population subgroups that respond well to the web mode. The declining R-indicators and rising coefficients-of-variation for web, however, also hint at a web response that is becoming less representative. The interviewer modes make up for this in part. Representation deteriorates more slowly than response rates because of the adaptive design, but the coefficients-of-variation point to a growing risk of bias.

Table 3.

Response Rates, R-Indicators, and Coefficients-of-Variation for the Dutch Health Survey in 2010, 2019, and 2023 for Web Only and for Web Plus Face-to-Face. The 2019 and 2023 Health Surveys Are Adaptive. In 2010 Also the Former Full Face-to-Face Design was Fielded in Parallel.

Design	RR 2010	RR 2019	RR 2023	R 2010	R 2019	R 2023	CV 2010	CV 2019	CV 2023
Web	0.356	0.360	0.372	0.800	0.763	0.751	0.281	0.329	0.335
Web-F2F	0.632	0.546	0.494	0.804	0.828	0.824	0.155	0.157	0.179
F2F	0.610	—	—	0.811	—	—	0.155	—	—
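To illustrate what using the coefficient-of-variation as an objective function can look like, here is a toy sketch. It is not the optimization used at Statistics Netherlands: the strata, propensities, costs, and budget are all hypothetical, and the search is a plain enumeration over which strata receive a face-to-face follow-up after web.

```python
from itertools import product
from math import sqrt

# Hypothetical strata: (population share, web-only propensity,
# web-plus-F2F propensity, extra F2F cost per sampled unit).
STRATA = [
    (0.30, 0.50, 0.65, 20.0),
    (0.40, 0.35, 0.60, 25.0),
    (0.30, 0.20, 0.50, 30.0),
]


def cv(shares, propensities):
    """Coefficient of variation of stratum response propensities."""
    mean = sum(w * p for w, p in zip(shares, propensities))
    var = sum(w * (p - mean) ** 2 for w, p in zip(shares, propensities))
    return sqrt(var) / mean


def best_allocation(strata, budget):
    """Enumerate follow-up choices; return (cv, choice, cost) with minimal CV."""
    shares = [s[0] for s in strata]
    best = None
    for choice in product([False, True], repeat=len(strata)):
        # Expected extra cost per sampled unit for this allocation.
        cost = sum(s[0] * s[3] for s, follow in zip(strata, choice) if follow)
        if cost > budget:
            continue
        props = [s[2] if follow else s[1] for s, follow in zip(strata, choice)]
        value = cv(shares, props)
        if best is None or value < best[0]:
            best = (value, choice, cost)
    return best


value, choice, cost = best_allocation(STRATA, budget=12.0)
print(choice, round(value, 3), round(cost, 2))
```

With these made-up numbers, the search spends the follow-up budget on the stratum with the lowest web propensity rather than on the stratum where follow-up is cheapest. This mirrors the idea of reducing effort for subgroups that already respond well to web.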

Despite the increased sophistication in survey data collection design, the paradigm behind the “best respondent” design is cost efficiency. Most design changes, whether mixed-mode, adaptive-responsive, or both, reduce costs per sample unit. They do not aim at different or more effective strategies. Many implemented adaptive survey designs minimize the impact on representation when doing less, rather than doing more. Figure 1 reflects the vulnerability of such design efficiency. The adaptive survey design lost its power to improve representation during the COVID-19 pandemic.

Figure 1.

The coefficient-of-variation of the Dutch Health Survey between quarter 1 of 2019 and quarter 4 of 2022. 90% (dark gray), 95% (gray), and 99% (light gray) confidence intervals are given. The horizontal line is the historic benchmark value.

4. The Hunt for the Engaged Respondent

I would like to advocate that we move to a new era, the hunt for the engaged respondent. The key design recommendations, in my opinion, should be:

Be more relevant in the choice and format of statistics;

Become more explicitly and visibly disconnected from politics;

Use modern data collection tools but without being predictable and standard;

Adapt to the general population and not let the population adapt to us;

Collaborate internationally.

The last fifteen years have seen major developments in communication, computational power, and technology. Smart devices, sensor systems, big data, the Internet of Things, and social media are all a result of this. Survey institutions and researchers have not ignored these developments. They have started to incorporate them into the production of statistics. See, for example, Daas et al. (2015), McCool et al. (2021), Minnen et al. (2023), and Schouten, Lugtig, and Luiten (2025).

With the new developments, the focus of workshops changed again. There are still workshops on online research, but they now also include new ways of collecting data. The workshop on adaptive-responsive survey design lost its relevance. Instead, workshops such as Mobile App and Sensor Surveys (MASS) have gained interest.

But successes are still modest. Response rates of these new types of surveys are similar to those of their traditional counterparts. Schouten, Lunardelli, et al. (2025) performed a cross-country study to find out how general populations would like smart features to be integrated into surveys. Results are mixed. One conclusion is that tailoring and maintaining interaction cannot be done through invitation materials. The types of hesitation that respondents have and the guarantees that they look for vary greatly and depend strongly on the context. They cannot all be addressed without making materials dauntingly long. There is, however, also good news. Table 4 presents two Statistics Netherlands studies that included “smart” forms of data collection: a two-week smart household budget survey and a one-week smart travel survey. In the mailed invitation conditions, registration rates resembled those of traditional non-smart counterparts. There was the usual impact of the incentive amount, which flattened off after ten euros. However, the personal invitation had a strong and lasting impact. The impact would likely have been higher if the COVID-19 distancing measures had not been in place. The good news is that a personal element, although relatively modest, already had quite a strong impact.

Table 4.

Registration, Activity, and Completion Rates for Two Smart Survey Pilots, a 2018 Smart Travel Survey (TS) and a 2021 Smart Household Budget Survey (HBS). The TS Employed Randomized Incentive Conditions. The HBS Had a Randomized Mailed and F2F Invitation.

Survey	Contact mode	Incentive	Registration rate (%)	Activity rate (%)	Completion rate (%)
HBS21	Mail	5 + 0 + 20	16	14	12
	F2F	5 + 0 + 20	26	24	22
TS18	Mail	5 + 5 + 5	23	18	14
	Mail	5 + 0 + 10	27	22	16
	Mail	5 + 0 + 20	29	24	20

Note. The incentive condition is split into three steps: unconditional at invitation + conditional on being active during the diary survey + conditional on completion of the diary survey. Registration = installing the survey app and logging in. Activity = completing the intro questionnaire and starting the diary for at least one day. Completion = finishing all specified diary days.

In our hunt for the best respondent, we have been applying traditional tactics, but in a much cleverer way. This is not sufficient in a world that is full of communication all the time, anywhere, and in many diverse forms. It is a world in which the relevance and influence of survey institutions may be unclear and potentially linked to authorities that may not be considered authorities. The survey message gets overwhelmed. Getting back to the recommendations: we should be demonstrably more relevant and more explicitly separated from politics. One option is to let the general population co-decide about earmarked survey budgets and be involved in the evaluation of survey results. Furthermore, the respondent should be at the heart of data collection in a modern but also authentically interested way. We need modern tools but with a human touch. In my opinion, the personal touch is imperative in order to show that we care. In all this, with geographical and cultural boundaries becoming stronger, more international collaboration is needed in developing tools and in advocating the survey message. JOS should, and probably will, remain relevant in doing so.

To end, I cannot give empirical answers to the questions raised in the title. However, given the changing world and the encouraging results of Table 4, I believe that we did not lose respondents, but we did not manage to win them. While somewhat grim as a conclusion, I do believe that we can win respondents if we dare to break with trends of cost efficiency and standardization. I would very much welcome an international workshop on engaging respondents.

Footnotes

Funding

The author received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Barry Schouten

Received: December 23, 2024

Accepted: January 22, 2025

References

Bethlehem, J. G.; Kersten, H. M. P. 1985. “On the Treatment of Nonresponse in Sample Surveys.” Journal of Official Statistics 1 (3): 287–300. DOI: https://www.proquest.com/openview/1774a31d22ee43e0d399ecb0430e3b70/1?pq-origsite=gscholar&cbl=105444.

Biemer, P. P. 2001. “Nonresponse Bias and Measurement Bias in Comparison of Face to Face to Telephone Interviewing.” Journal of Official Statistics 17 (2): 295–320. DOI: https://www.proquest.com/openview/b75ed6369c49a12291be7d83033009a8/1?pq-origsite=gscholar&cbl=105444.

Billiet; Philippens; Fitzgerald; Stoop. 2007. “Estimation of Nonresponse Bias in the European Social Survey: Using Information from Reluctant Respondents.” Journal of Official Statistics 23 (2): 1–35. DOI: https://d1wqtxts1xzle7.cloudfront.net/92232468/estimation-of-nonresponse-bias-in-the-european-social-survey-using-information-from-reluctant-respondents-libre.pdf?1665392369=&response-content-disposition=inline%3B+filename%3DEstimation_of_nonresponse_bias_in_the_Eu.pdf&Expires=1738337766&Signature=fyehsvqHnOtHWWUPAYnXFrvjRKxMvfyHit8HbckIesVQoUSO3abSX~yIyNAglFnuVdrPGFrEs8Ihd3ftsyIZg5hnRMuI-kqDd2AsWJOqnD4UtJfUY4Ke64l3JDQzkmeYpBef40g0Ni4YwpVaO6~SOjCnG0TFUlxvfVPIMvrb78Z8QB1EdthnaD-LjstOHODl~O3VPZY2MVZ5FWo80qJCFKkR-8IDtkr5na~5FShWzgIvIi-fw2XMGfred2hepRa8bYcUSZ~XFSVrXt4ITWcJtAnES6EnjSPILbWvGgdEeEWKAXYvm6R2owecrp1CjNhadmZ1uas5qbtPGtnmWCh~yg__&Key-Pair-Id=APKAJLOHF5GGSLRBV4ZA.

Daas, P. J. H.; Puts, M. J.; Buelens; van den Hurk, P. A. M. 2015. “Big Data as a Source for Official Statistics.” Journal of Official Statistics 31 (2): 249–62. DOI: https://doi.org/10.1515/JOS-2015-0016.

Dillman, D. A. 1996. “Why Innovation is Difficult in Government Surveys.” Journal of Official Statistics 12 (2): 113–24. DOI: https://www.scb.se/contentassets/ca21efb41fee47d293bbee5bf7be7fb3/why-innovation-is-difficult-in-government-surveys.pdf.

Groves, R. M.; Couper, M. P. 1996. “Contact-Level Influences on Cooperation in Face-to-Face Surveys.” Journal of Official Statistics 12 (1): 63–83. DOI: https://www.scb.se/contentassets/ca21efb41fee47d293bbee5bf7be7fb3/contact-level-influences-on-cooperation-in-face-to-face-surveys.pdf.

Luiten; Hox; de Leeuw. 2020. “Survey Nonresponse Trends and Fieldwork Effort in the 21st Century: Results of an International Study Across Countries and Surveys.” Journal of Official Statistics 36 (3): 469–87. DOI: https://doi.org/10.2478/JOS-2020-0025.

Lundquist; Särndal, C. E. 2013. “Aspects of Responsive Design with Applications to the Swedish Living Conditions Survey.” Journal of Official Statistics 29 (4): 557–82. DOI: https://doi.org/10.2478/jos-2013-0040.

Lundström; Särndal, C. E. 1999. “Calibration as a Standard Method for the Treatment of Survey Nonresponse.” Journal of Official Statistics 15 (2): 305–27. DOI: https://www.scb.se/contentassets/ca21efb41fee47d293bbee5bf7be7fb3/calibration-as-a-standard-method-for-treatment-of-nonresponse.pdf.

Lynn; Kaminska; Goldstein. 2014. “Panel Attrition: How Important is Interviewer Continuity?” Journal of Official Statistics 30 (3): 443–57. DOI: https://doi.org/10.2478/JOS-2014-0028.

McCool; Lugtig; Mussmann; Schouten. 2021. “An App-Assisted Travel Survey in Official Statistics: Possibilities and Challenges.” Journal of Official Statistics 37 (1): 149–70. DOI: https://doi.org/10.2478/JOS-2021-0007.

Minnen; Rymenants; Glorieux; van Tienoven, T. P. 2023. “Answering Current Challenges of and Changes in Producing Official Time Use Statistics Using the Data Collection Platform MOTUS.” Journal of Official Statistics 39 (4): 489–505. DOI: https://doi.org/10.2478/JOS-2023-0023.

Sakshaug, J. W.; Kreuter. 2011. “Using Paradata and Other Auxiliary Data to Examine Mode Switch Nonresponse in a ‘Recruit-and-Switch’ Telephone Survey.” Journal of Official Statistics 27 (2): 339–57. DOI: https://www.researchgate.net/profile/Joseph-Sakshaug-2/publication/288713326_Using_Paradata_and_Other_Auxiliary_Data_to_Examine_Mode_Switch_Nonresponse_in_a_Recruit-and-Switch_Telephone_Survey/links/5c3e3bb2299bf12be3cb2d65/Using-Paradata-and-Other-Auxiliary-Data-to-Examine-Mode-Switch-Nonresponse-in-a-Recruit-and-Switch-Telephone-Survey.pdf.

Schouten; Lugtig; Luiten. 2025. “Do Smart Surveys Have a Positive Business Case? An Evaluation Based on Three Case Studies.” Journal of Official Statistics. Under Review.

Schouten; Lunardelli; Perez; et al. 2025. “How Does the General Population Think About Smart Surveys?” Journal of Official Statistics. Under Review.

Schouten; Shlomo; Skinner. 2011. “Indicators for Monitoring and Improving Representativeness of Survey Response.” Journal of Official Statistics 27 (2): 231–53. DOI: https://eprints.soton.ac.uk/158353/.

Vannieuwenhuyze, J. T. A.; Loosveldt; Molenberghs. 2014. “Evaluating Mode Effects in Mixed-Mode Survey Data Using Covariate Adjustment Models.” Journal of Official Statistics 30 (1): 1–21. DOI: https://doi.org/10.2478/jos-2014-0001.

Voogt, R. J. J.; Saris, W. E. 2005. “Mixed Mode Designs: Finding the Balance Between Nonresponse Bias and Mode Effects.” Journal of Official Statistics 21 (3): 367–87. DOI: https://www.scb.se/contentassets/ca21efb41fee47d293bbee5bf7be7fb3/mixed-mode-designs-finding-the-balance-between-nonresponse-bias-and-mode-effects.pdf.

Wagner; West, B. T.; Kim; Suolang; Engstrom; Sinibaldi. 2024. “Using a Stopping Rule to Optimize Cost-Quality Tradeoffs in a Large, Mixed-Mode Survey: A Simulation Study.” Journal of Official Statistics. Published electronically November 11, 2024. DOI: https://doi.org/10.1177/0282423X241287452.

Where Have the 20% Respondents Gone? Did We Lose Them or Failed to Win Them? And Is It Too Late?
