Abstract
This paper analyzes the effects of caseworker screening of doctors’ medical sick-leave certificates. The analysis uses data from people appealing caseworkers’ denial of their benefit claims. Caseworkers at four units made decisions on the appeals. The distribution of the cases to the four units was based on the appealing person’s birth date. One of the units was much stricter than the others (7.8% approved in contrast to 16.1% for the others). This allows us to use birth date as an instrument to estimate the effects of being denied sickness benefits. We find that the denial of sickness benefits, on average, has (i) positive effects on the labor-market outcomes and (ii) no negative effects on health outcomes. As the more liberal units deny sickness benefits for most screened medical certificates, we conclude that caseworker screening is very important in separating meritorious from non-meritorious claims. Doctors’ conflicting roles make it difficult for them to act in the best of interest of both their patients and society.
Introduction
It is well known that substantial moral hazard can exist in the sickness and disability insurance (SI and DI) programs (see e.g., Godard et al. (2022) and Eliason et al. (2019)). The implication is that there is a need to screen eligibility of the people applying for benefits. In most countries with SI and DI programs, doctors have a gate-keeping role in the form of providing written medical certificates to caseworkers who approve or deny (cf. Liebert, 2019). However, the primary objective of physicians is to be a healer. For patients presenting with symptoms, it can be difficult for doctors to question the reported symptoms—that is, to act as gatekeepers—since doing so may seem unethical and could interfere with providing appropriate medical care. 1
The two-sided role of treating physicians makes the role of a caseworker or an external specialist important. Evidence for this comes from Liebert (2019), who evaluates the impact of the introduction of external medical review of DI applicants in Switzerland. He finds that external medical review reduced DI inflow by 23% and increased labor-market participation of claimants. The positive effect is mainly derived from those with difficult-to-diagnose conditions. Liebert concludes that the reduction stems mainly from a reduction in type-II errors.
This paper analyzes effects on labor-market outcome and health of caseworker screening. The analysis exploits the fact that previously denied sickness benefit (SB) applicants appealing the decision in 2009–2012 were assigned to four different specialist units based on their birth date. Although following the same eligibility criteria, the units systematically differ in the level of stringency. Thus, the paper contributes to the literature on effects of screening in the SI program. However, as the Swedish SI program is intimately related to the DI program and almost all individuals on disability benefits (DB) have a history of long sick-leaves we also contribute to the literature on targeting and screening in the DI program.
Denied applications have previously been used to study targeting and classification errors in the DI literature. Bound (1989) is the classic study to analyze the earnings of rejected DI applicants. More recent contributions are French and Song (2014), Maestas et al. (2013), Low and Pistaferri (2015), and Manasi and Yue (2019).
Since assignment to different specialist units was based on birth date the two groups should be similar by design. If this is the case, then the analysis consists in comparing outcomes of those assigned to the restrictive unit to those of the three other units. The identification strategy is thus similar to that used in French and Song (2014) and Maestas et al. (2013), who analyze the effects of denial in the U.S. DI program based on differences in lenience among judges and examiners.
As we observe health of the denied person this allows us to also indirectly test the balance between Type-I and Type-II errors in awards. The idea is that, if the health- and labor-market outcomes are no worse for those assigned to the more restrictive units this provides evidence of the importance of caseworker screening in reducing Type-II errors.
As in Liebert (2019) we analyze the effect of external reviews. In the Liebert case, the external reviewer is a medical specialist examining doctor’s assessments. Thus, the effect in Liebert could be from differences in strictness but also from differences in quality of assessment. In our case, the external reviewer is a caseworker that is a specialist who is providing a second opinion—on a colleague’s denial—of a doctor’s certificate. However, as we are using variation in assessment—that can be considered as random—between specialist units, the effects we are estimating should be the effect of differences in screening strictness only.
All previous studies on increased screening strictness in the SI program that we have reviewed have found a reduction in take-up rate of sick-leave (see e.g., De Jong et al., 2011; Hartman et al., 2013; Johansson & Lindahl, 2013; Markussen et al., 2018). 2 In addition to focusing on analyzing caseworker screening, our contribution to this literature is that we are the first paper analyzing the effects of screening on health outcomes. Additionally, we contribute to this literature by analyzing more long-term effects than previously studied.
The analysis is based on administrative data from the Social Insurance Agency (SIA) and the National Board of Health and Welfare. Labor-market outcome is measured by labor income, number of days with SB, and disability benefits (DB) take-up rates. Health is measured by number of outpatient care visits, number of inpatient care days, and drug consumption. We estimate yearly effects up to 3 years after the appeal decision.
We find that denial of SB, on average, (i) reduced number of sick-leave days, (ii) reduced risk of receiving DB within 3 years, and (iii) increased labor income. We find no evidence that denial negatively impacted health on average. As no negative effects on health were found, it suggests that stricter screening of appealed cases for SB reduced Type-II errors in the SI program and thus that it produces a more deserving pool of DI beneficiaries. As more liberal units deny sickness benefits for almost 84% of their screened medical certificates, we also conclude that caseworker screening in general is very important in separation of meritorious from non-meritorious claims. Doctors’ conflicting roles make it difficult for them to act in the best interest of both their patients and society.
The paper unfolds as follows. Section 2 discusses Swedish institutions. Section 3 describes the quasi-experiment and data together with a descriptive analysis. The results are presented in section 4, and section 5 concludes with a discussion.
Institutions
This section provides a short description of the Swedish social insurance (section 2.1) and health care systems (section 2.2).
The Social Insurance System
Both employed and unemployed individuals are covered by the public SI and DI program administrated by the Swedish SIA. Most workers are also covered by an unemployment insurance (UI) program. Unemployed individuals (regardless of whether or not they are covered by the UI program) have access to the SI program.
The SI Program
The employer is responsible for sick pay the first 14 days of the individual’s sick-leave (the first day is not compensated). After that, the SIA pays SB. For an unemployed individual, the SIA pays SB starting from day 2 of the individual’s sick-leave. Compensation covers a little less than 80% of foregone earnings 3 up to a cap (USD 2,220 per month in 2012). Furthermore, collectively agreed compensation typically covers up to 90% of earnings during the first 3 months. From the fourth month onwards, total compensation is typically back at 80%, at least among highly paid workers. After a year, compensation decreases from 80% to 75% of earnings.
During the first 7 days of an illness, the individual decides if (s)he needs to be absent from work. The individual merely has to inform the employer (or the SIA if unemployed) that (s)he is ill. As of the eighth day, a medical certificate is required. The doctor provides medical evidence about applicants’ health condition related to work requirement of the individual. In the certificate, the doctor recommends length and extent of sick-leave. All doctors have the right, and are obliged, to write a medical certificate if they believe their patient requires one. There are no sanctions on doctors writing flawed certificates and they are not paid to write certificates by the county council (see the section below for the organization of health care).
The caseworker takes information in the medical certificate along with other factors in the application (e.g., the applicant’s occupation) and it is their duty to ultimately determine eligibility for SB—not the doctor. Thus, based on this information, the caseworker decides whether the illness has reduced the individual’s capacity to work. Most often full-time benefits are paid. However, sick-leave can also be part-time, which implies SB being paid for 75, 50, or 25% work limitations.
There are eligibility checks at three and 6 months in a sick-leave spell, stipulating increasingly higher requirements to qualify for SB. Workers are entitled to SB for 90 days if they are unable to perform their ordinary duties, or other temporary work, at their workplace. After 90 days, entitlements to SB hold only if workers are unable to perform any type of work at their work place. After 180 days, entitlement to SB holds only if the individual cannot do any type of work on the labor market. 4 However, this rule holds only if the worker is unlikely to return to work with their employer before day 366. For unemployed workers applying for SB, working capacity is judged against the labor market as from day 1. Between July 2008 and January 2016, there was a maximum time limit of 2.5 years with SB, which no longer exists. A worker who had received SB for 2.5 years was disqualified from further SB for at least 3 months.
SB is denied if the caseworker does not find the individual’s capacity to work is being reduced. The applicant can then appeal.
5
The appeal must be submitted within 60 days from the time period when (s)he was denied SB. Appeals concern both new sick-leave spells and ongoing sick-leave spells. An appeal of a denial in an ongoing sick-leave spell concerns a sickness certificate that extends and an “ongoing” spell. For new spells, the appeal concerns denied SB, that is, sick-leave spells longer than 14 days.
6
In general, it takes around 3 months from denial of SB until a decision of the appeal is taken in the reassessment. The mean number of days to reach a decision is 93 in our data. Figure 1 sums up essentials of the sick-leave process. The Swedish Sick-Leave Process. “CW” is Caseworker and “Sthlm” is Stockholm
The appeal assessment was conducted by specialized caseworkers in dedicated assessment units within the SIA. The caseworkers were instructed to perform a comprehensive new examination of each case. The reassessment was to be based on the same documentation as the original decision. However, if the applicant provided additional information in their appeal, further investigation might be necessary for a thorough examination.
If SB is denied in the reassessment, the applicant can appeal to administrative courts on different levels up to the supreme administrative court. These appeals are very rare and not considered in the analyses.
Adjoining Social Insurance Programs
Full income-related DB amounts to about 64% of previous earnings up to the same cap as in the SI scheme. The collectively agreed compensations typically cover up to 80% of earnings. Hence, going from SB to DB does not involve a drastic reduction in a compensation level. If the individual is younger than 30 years, (s)he is only granted DB for 3 years at a time. As in the SI, there is a possibility of receiving partial DB, which is benefits when a person can work for 75, 50, or 25%. As an employee with, say, 50% DB, can be on sick-leave from his or her job, this means that (s)he can, at the same time be on both SB and DB. If this person is not working any of the ordinary 50% working time, (s)he is considered as being full-time on sick leave. On the other hand, if (s)he is working 25%, (s)he is considered as being on part-time sick leave.
The payment from the UI program is based on previous income instead of foregone earnings as in the SI. The implication of this difference in basis for benefit payments is that benefits could be higher as sick listed than as unemployed.
The Health Care System
The local county councils are the major financiers and providers of Swedish health care. There are 20 county councils (23 at the time of the quasi-experiment), and each council is obliged to provide its residents with equal access to health services and medical care. Health care is mostly financed through local taxes. Each county council sets its own patient fees, but a national ceiling limits the total amount that a patient pays during a 12-month period (out-of-pocket) to 1,100 SEK (USD 120). Thus, patient fees only account for about 3% of total revenues. Doctors—not nurses—have the right to write medical certificates. The daily fee for staying in a hospital is a maximum of 100 SEK (USD 11). There is free choice of providers but referral is required in some cases, particularly when patients seek specialized care or when they choose health care in another county. The county councils are allowed to contract private providers but the majority of health care is performed by public agents. For prescribed medicines, there is a maximum amount that a patient needs to pay of 2,200 SEK (USD 240) over a 12-month period.
The Quasi-Experiment, Data, and Descriptive Statistics
Between 2009 and 2012, the reassessments were performed by caseworker specialists at four units. The allocation of cases across the four units was based on the day of the month that the individual was born: if the individual was born in the first half of the month, that is, between the 1st and the 15th, the reassessment was performed in Stockholm. If the individual was born in the second half of the month, that is, between the 16th and the 31st, the reassessment was performed in either Jönköping (2009–2010), Norrköping (2010–2012), or Göteborg (2011–2012).
Administrative data are gathered from the database at the SIA. The data includes information on all started sick-leave, unemployment and disability spells since 1994, as well as income, employment status and other socioeconomic variables. Data on yearly number of outpatient care visits, inpatient care visits and drug consumption are taken from registers administrated by the National Board of Health and Welfare. Given the low personal cost for a hospital visit and costs for drugs, these measurements should be relevant and valid health measurements (see e.g., O’Donnell, 2009).
We find that about two-thirds of appeals concern new sick-leave spells. These spells are most likely short as doctors very seldom issue a first medical certificate longer than 1 month. Total spell lengths of those appealing in an ongoing spell can, however, be very long (sometimes exceeding 2 years). Overall mean number of days to a decision of appeal in our data is 93 days with a standard deviation of 46 days. In the analysis the outcomes are measured after the decision from the appeal.
The allocation procedure was not followed 100%, possibly due to transfers caused by temporary peaks in workload. 7 As a consequence, we use date of birth as an instrument in an instrumental variable analysis.
Descriptive Statistics, Mean (Standard Deviation), of Those Starting a Sick Spell in 2009–2012, Appealing Applicants, and Study Population
aRefers to the status at sick-leave start (“All starting a sick spell”) and at the time of benefit denial (“Appealing”, “Study population”).
bRefers to spells started during and after 2005 and ended before benefit denial.
cThis information is only available for registered sick spells, which means that we can only observe it for i) all starting a sick spell (Column 1), ii) those appealing a negative decision concerning an extension of sickness benefits (Column 2 and 3).
From bottom panel in the table, we can see that of all approved sick-leave spells 41% are due to psychiatric (19%) and musculoskeletal (22%) diagnoses. In the appealing population we do not observe the diagnosis as the spells were not approved. However, for the sub-population of individuals with an ongoing sick spell, we observe the diagnosis code of earlier spell. In this population, we can see that musculoskeletal diagnoses are almost twice as common as compared to the overall population.
Column (3) displays the statistics for the study population. In this population we have excluded foreign born, those currently receiving DB, and those younger than 20 and older than 64. This leaves us with a total of 11,557 individuals in our analysis sample. The reason for excluding foreign-born individuals is that the date of birth is often assigned rather than recorded for refugee immigrants to Sweden. Since certain dates (e.g., the 1st, 10th, and 15th) are disproportionately chosen, this undermines the validity of using date of birth as an instrument. However, it should be noted that the main results—presented below—remain qualitatively the same when we include immigrants and the younger and older population. As can be seen from column (3) and (2), the study population—in comparison to the appealing population—is positively selected: they have higher income, are less unemployed, are more educated, and are less on sick leave. We can also see that this group is even more dominated by females.
Descriptive Statistics, Mean (Standard Deviation) of Those Appealing (Study Population)
aRefers to status at the time of the rejection decision.
bRefers to spells started during and after 2005 and ended before the rejection decision.
cThe number of ongoing spells are 3,331. Thus, 8,226 (=11,557–3,331) are new spells.
The Swedish Social Inspectorate interviewed caseworkers in 2013 on the exact nature of the appeal process. Results from the interviews surprisingly showed that two distinct approaches had emerged (ISF, 2013, p. 55): • Legal correctness approach: Caseworkers examined whether the original decision was legally sound. If not, the decision was amended. • Fresh assessment approach: Caseworkers conducted a new assessment based on the original documentation, independent of the previous decision.
We suspect that caseworkers at the Stockholm unit largely adhered to a “legal correctness” approach. Cases with minor documentation deficiencies were generally not deemed relevant. Instead, attention was focused on cases where the documentation was clearly flawed or insufficient as a basis for changing the decision. Caseworkers at the other units, on the other hand, seemed to have adopted the fresh assessment approach.
Empirical Strategy and Results
The main contribution to the screening effects literature is the simultaneous analysis of both health and labor-market outcomes. This dual focus inherently introduces a multi-significance problem.
With just two effect estimates this is easy to handle using a Bonferroni correction. However, with many observable variables measured over 3 years, we struggled with how to handle the multi-significance problem. The most usual way is that of having a main analysis to establish a few effects using Bonferroni corrected significance level. Subsequently, exploratory analyses can be carried out to investigate other potentially interesting effects without strict correction.
The problem we faced was that we could not determine which main analysis was most relevant. The core issue was how to measure an effect on health, and here we provide our rationale for not applying multiple comparison adjustments in the main analysis. 9
It was not obvious to us how a valid measurement of health could be constructed using the available registers at the National Board of Health and Welfare. For this reason, we decided to use three health measures: hospital care visits, inpatient care visits and drug doses. However, we also wanted to estimate separate effects for the 3 years. The argument for using the three variables is that we believe they measure the same thing (i.e., if there is a positive/negative effect on health the three effects should all be non-negative/non-positive). It is not obvious when the effect should be measured. However, based on Grossman (1972) we expect any long-term effect to be larger than any short-term effect. Thus, if we have positive/negative effects after 1 year we expect any of these effects to increase with time.
Also, with the analysis on labor-market outcomes it is reasonable to believe that we could learn more by using more than one outcome measure at one time period. It is also highly plausible that the consequence of being denied benefits differs with respect to duration of sick-leave and for this reason we would like to analyze the effects separately for those in new and ongoing spells, respectively.
All in all, we are interested in estimating 36 treatment effects. The implication is that we cannot interpret single statistically significant estimates as being evidence of an effect. It is a consistent pattern of effect sizes that provides support for an effect of the stricter screening on labor market and health. For instance, we expect (i) the short-term effect on sick leave would be larger than the long-term effect and (ii) the short-term effect on DB to be smaller than the long-term effect. Furthermore, if an effect on health exists, we would expect (i) a consistent pattern across the three outcomes, and (ii) the long-term effect to exceed the short-term effect. Finally, we expect the effects in general to be larger for the population with ongoing sick-leave spells than for the population with new sick-leave spells.
Labor income is defined as yearly labor income observed in each of the 3 years following the decision. The other outcomes are measured over a year from the date of the appeal decision. We measure SB as number of days with SB (sick-leave days) in this period. 10 DB is measured using an indicator of receiving any DB in this period. The health outcomes are measured—also in this period—as number of hospital care visits, inpatient care days, and daily drug doses.
The model used in the analysis of the yearly effects (δ
t
, t = 1, 2, 3) is the following:
Here y is any of the six outcomes discussed above, Denied takes on a value of 1 if the individual is denied in the reassessment and 0 otherwise, Early takes on a value of 1 if the individual was born in days 1–15 of the month, and 0 otherwise and ω i and ɛ it are error terms, where the latter is assumed to be independent with Early. 11 The other Greek letters are parameters to be estimated.
Parameters in model (1) are estimated using the two-stage least squares (2SLS) estimator. Inference is based on robust standard errors, using a robust covariance matrix estimator.
Estimates (est.) and Standard Errors (se) of the Effects of Being Denied Sick Leave on Labor-Market Outcomes for all Individuals, and Subdivided by New and Ongoing Sick-Leave Spells
Note. Estimated standard errors are corrected for the estimated instrument and using a robust covariance matrix.
*, **, and *** denote statistical significance at the 10, 5, and 1% level, respectively.
aMean of dependent variable in the control group.
From column (1), we can see that denial of SB reduces future SB (panel 2) and DB usage (panel 3) and increases labor income (panel 4).
The effect on SB is statistically significant (at the 5% level) in the first and the second year while the effect on DB is statistically significant in the third year. Given the time limit in the SI of 2.5 years, together with the possibilities to enter permanent DB, the non-significant effect in the third year on SB together with the significant effect on DB is expected. Labor income is increased by an average of 62,580 SEK (USD 9,605), 56,580 SEK (USD 8,684), and 56,740 SEK (USD 8,708) in the first, second, and third year, respectively. The magnitudes of these increases in labor income are 44% (=62,580/141,244*100), 37% (=56,580/149,285*100) and 37% (=56,740/154,055*100), respectively. The effect is statistically significant the first year and also weakly statistically significant the second and third year.
The analysis supports the hypothesis that individuals who were denied in their appeal were more likely to return to work than their counterparts who were approved in their appeal.
Column (3) and (5) present the corresponding results for new and ongoing cases. The pattern is consistent with the overall results, showing negative effects on SB (panel 2) and DB (panel 3) and positive effects on labor income (panel 4). However, ongoing cases are driving overall effects on especially SB and labor income. This result may not be that surprising. It is reasonable to believe that being denied benefits in an ongoing sickness spell would be a considerably larger change in circumstances for the person compared to a denial in a new sick-leave.
Estimates (est.) and Standard Errors (se) of the Effects of Being Denied Sick Leave on Health Outcomes for All Individuals, and Subdivided by New and Ongoing Sick-Leave Spells
Note. Estimated standard errors are corrected for the estimated instrument and using a robust covariance matrix.
*, **, and *** denote statistical significance at the 10, 5, and 1% level, respectively.
aMean of dependent variable in the control group.
As shown in Table 1, the study population differs from the overall population. A relevant question, then, is whether the results presented in Table 3 would still hold if we applied weights based on observable characteristics of the broader population on sick leave at that time. To address this, we used predicted values from an estimated probability of being denied sickness benefits in the full population as weights. 12 Results from this weighted analysis, presented in Table A2 in the Appendix, show lower precision but confirm that the inferences from Table 3 remain valid for the larger population. The results from this analysis shows lower precision but that the inference from Table 3 holds to this larger population (see Table A2 in the Appendix).
Thus, from this analysis we conclude that the behavioral effects are not explained by differences in observable characterizes in the two populations.
How to Interpret the Effect and Who are the Compliers?
Around 95% of all SB applications are approved. We can reasonably assume that the population of appealing individuals consists of individuals with problems that are more difficult to verify than in the overall population. Caseworkers screening the appeal follow strict guidelines, but they are allowed some discretion. Screening of the appeal follows the same guidelines as the initial assessment. Additionally, unless new information has been added to the case, the screening is based on the same documentation as the original application. The main difference is that the appeal is handled by another caseworker. It is unlikely that caseworkers consider any potential individual effect (positive or negative) in their decision. If this assumption is correct, it means that we are estimating the average treatment effects for a population with a working capacity determined by differences in discretion in screening by the less strict units and the stricter unit.
Share (in %) Who are Approved in the Reassessment, Overall, and Subgroups
aRefers to status at the time of the rejection decision.
bRefers to spells started during and after 2005 and ended before the last registered SB application.
The high compliance rate among those with a psychiatric diagnosis is no surprise. Compared to other diagnoses, psychiatric diagnoses are to a greater extent based on symptoms rather than on objective medical findings. This idiosyncrasy suggests that caseworker discretion is greater for psychiatric diagnoses. Some of the male-female difference in compliance rate is likely a direct consequence of females having a higher rate of psychiatric diagnosis (25% compared to 17%).
In the next section—focusing on analysis of heterogeneous effects—we also study differences in compliance for those classified as having good versus poor health (or good vs. bad working capacity), and those with stronger versus weaker labor-market attachment.
Heterogeneous Effects
In this section we estimate effects separately by type of labor-market attachment, health (or working capacity) and by men and women at the time of the appeal.
As potential measurements for labor-market attachment we have education and employment status (employed or not). 14 As potential measures of health we have three previous health measures, but measured before the decision. By combining the indicators of both latent variables into a score our hope is to use this to classify individuals into groups with good or bad labor-market attachment and good or bad health. To this end, we use principal component analysis to create these two scores. 15
Labor-market attachment index is created by using first principal component (explaining 60% of the variation). Employment status and education have the same weight in the score. The health index is created from the first principal component (explaining 39% of the variation) of three health indicators measured during the last 6 months up to the decision of the appeal. For this score number of hospital care visits is the most important (42%) and number of inpatient care days are the least important (17%). We classify individuals with values of the score above the median as having worse labor-market attachment and health than individuals with values below the median.
The same analysis as above is conducted by each of the two strata separately. We use the findings from the main analysis to restrict the analyses to a 2 year follow-up period in all cases except the DB, where a 3 year follow-up period is used.
Results from 2SLS Estimations. Stratification on Sex, Health and Labor-Market Status, and Direct Effects
Note. We use a robust covariance matrix to correct for estimated standard errors, shown in parentheses. Mean of the dependent variable in the control group within [ ] brackets. *, **, and *** denote statistical significance at the 10, 5, and 1% level, respectively.
Columns (1) and (2) present the results for those with good and poor health, respectively. We can see that reduction in SB days and DB take-up are more pronounced for those with relatively poor health. For those with relatively good health, effects on SB and DB are not statistically significant. For both groups, effects on income are positive and larger for the poor health group. However, none of the estimates is statistically significant. The non-negative effect on income for those with relatively poor health is interesting per se. In combination with negative effects on SB and DB, it suggests that individuals in this group are not leaving the labor force. Finally, health estimates are imprecise and none of the differences are statistically significant.
Columns (3) and (4) present the results for those with a strong and weak (i.e., more likely to be unemployed and low educated) labor-market attachment, respectively. We can see the effects on the labor-market outcomes are much more pronounced for those with weak labor-market attachment. For this group we see a reduction of 154 days of SB, a 43% point reduction in DB and an increase in labor income by 227,510 SEK (USD 34,917) (129%). Conversely, for those with relatively strong labor-market attachment we only find a statistically significant effect on SB. The effect on DB and labor income is insignificant (neither statistically nor economically). Interestingly, the signs of the coefficients for two of the health indicators suggest improved health (i.e., negative effects) from the denial for those with weak attachment but a health deterioration for those with good attachment. The differences in estimates are, however, not statistically significant.
With regard to sex differences, results shown for men and women in columns (5) and (6) are somewhat mixed. Men who are denied benefits reduce their sick-leave and increase their labor income by an average of 445,000 SEK (USD 68,298) (160%). Women on average reduce the probability of receiving DB but do not significantly increase their labor income. Finally, the health estimates are imprecise and none of the differences are statistically significant.
Our interpretation of the results regarding health and labor-market attachment is that consequences of a denial likely depend on individuals’ outside options. Those with favorable outside options—such as relatively good health and strong labor-market attachment—tend to return to work regardless of whether their application was approved or denied.
In contrast, for individuals with limited outside options, denial appears to have positive long-term effects. It may prevent people who are capable of working from becoming locked into long-term sickness insurance (SI) or disability insurance (DI) programs. Interestingly, we also observe positive health effects among individuals with weak labor-market attachment. This suggests that remaining connected to the labor market may have beneficial effects on health.
Discussion
This paper analyzes the effects on labor-market outcomes and health of caseworker screening of doctors’ medical certificates approving sick-leave. We find that benefit denial on average reduces sick-leave, increases labor income in following years, and reduces the risk of receiving disability benefits (DB) within 3 years. We also find that the denial has no impact on health on average. Thus, the results show the importance of caseworker screening in reducing the conflicting role of doctors. These conflicts should especially be prevalent for patients with mild or moderate mental illness and/or musculoskeletal disorders, which also make up the majority of the population in this study.
The results are consistent with earlier studies showing that stricter screening in both the sickness insurance (SI) and disability insurance (DI) process can be an efficient policy to increase work resumption and decrease DI take-up rates (cf. Bound, 1989; French & Song, 2014; Low & Pistaferri, 2015; Maestas et al., 2013).
French and Song (2014) and Maestas et al. (2013), who used similar identification strategies in the U.S. DI program, find effects on labor supply that are similar to our findings on labor income. French and Song (2014) use the differences in leniency of administrative law judges in the second stage of an appeal process and conclude that a denial would have increased the labor supply by 26%. Maestas et al. (2013), on the other hand, make use of differences in examiners’ allowances rates as an instrument for denial. They find a 28% increase in labor supply in a group of denied applicants who are on the margin of being approved (comprising 23% of the population). The effect on labor supply is found to be stronger for individuals with less severe impairments. We find an effect to be around a 40% increase on labor income. We, however, for completeness, used a measure of labor supply in an extended analysis (see Table A4 in the Appendix). As no employment indicator is available in the register, we define employment using the labor income data. We follow Johansson et al. (2016, 2017) who defined a person being retired at year t if he or she has no income above one price base amount (PBA) at least in the 2 years after year t 17 Here we use the same—but inverse—definition of being employed after the appeal decision. We find an overall effect of 31% in the first year after the denial and 20% in the second and third year. The effects in year one and years two and three are statistically significant at the (at least) 5 and 10% risk, respectively. As with income, the effect is substantially larger for individuals in ongoing sick-leave spells.
The study closest to ours is Liebert (2019), who concluded that external medical reviews in the DI process reduced DI inflow and classification errors due to incorrect awards (Type-II errors). Our results show that caseworker screening plays a similarly important role in separating meritorious from non-meritorious claims and that the screening on average improves welfare for the individuals. Furthermore, the results show that stricter screening of sick-leave spells reduces DB take-up. To this extent, the results correspond to findings in De Jong et al. (2011), who found that intensified screening in the SI program in two Dutch regions decreased long-term sickness absence and DI application rates. The results on all outcomes are also similar to results in a recent study by Ahammer and Packham (2023), who—using male workers aged 55–62—analyzed increased strictness for elderly in the Austrian DI program in 2013. They found a reduction in DB take-up rate, that the workers are more likely to remain in the labor force and no differences on their health outcomes.
Replacement rates in Swedish SI and DI programs are by no means extreme (Scruggs, 2006), and at the time of the study, the level of sick-leave in Sweden was also close to other comparable European countries. This suggests that the level of screening stringency was similar and that many other countries should face the same problems as Sweden with regards to monitoring and screening in social insurance programs. Autor and Duggan (2006) conclude that one important reason for the increase in DB claims after 1984 in the U.S. DI program is the extension of benefit eligibility to difficult-to-verify impairments like back pain and depression. From a policy perspective, this study shows clearly that caseworker screening of the medical certificates for sickness benefit claims are important in reducing the number of DB claims and that this is also beneficial for individuals on average.
Supplemental Material
Supplemental Material - Effects of Caseworker Screening on Employment and Health: Quasi-Experimental Evidence From the Swedish Sickness Insurance Program
Supplemental Material for Effects of Caseworker Screening on Employment and Health: Quasi-Experimental Evidence From the Swedish Sickness Insurance Program by Pathric Hägglund, Per Johansson, and Kristian Persson in Evaluation Review.
Footnotes
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
