Abstract
Devising effective cost-containment strategies in public insurance programs requires understanding the distribution of health care spending and characteristics of high-cost enrollees. The aim was to characterize high-cost enrollees in a state’s public insurance program and determine whether expenditure inequality changes over time, or with changes in cost-sharing policies or program eligibility. We use 1999-2011 claims and enrollment data from the Alabama Children’s Health Insurance Program, ALL Kids. All children enrolled in ALL Kids were included in our study, including multiple years of enrollment (N = 1,031,600 enrollee-months). We examine the distribution of costs over time, whether this distribution changes after increases in cost sharing and expanded eligibility, patient characteristics that predict high-cost status, and examine health services used by high-cost children to identify what is preventable. The top 10% (1%) of enrollees account for about 65.5% (24.7%) of total program costs. Inpatient and outpatient costs are the largest components of costs incurred by high-cost utilizers. Non-urgent emergency department costs are a relatively small portion. Average expenditure increases over time, particularly after expanded eligibility, and the share of costs incurred by the top 10% and 1% increases slightly. Multivariable logistic regression results indicate that infants and older teens, Caucasian children, and those with chronic conditions are more likely to be high-cost utilizers. Increased cost sharing does not reduce cost concentration or average expenditure among high-cost utilizers. These findings suggest that identifying and targeting potentially preventable costs among high-cost utilizers are called for to help reduce costs in public insurance programs.
Introduction
It is well established that overall health care expenditures in the United States are not uniformly distributed. One report from the Agency for Healthcare Research and Quality found that, among the US population, the top 5% of patients accounted for 49.5% of all spending, and the top 1% for almost 22% of all spending. 1 Concerns about continued growth in health care spending have generated discussions about “bending the curve” for health costs. However, standard approaches to cost containment aimed at all consumers, like premium increases, copayment increases, or higher deductibles, may not be effective if health care expenditures are concentrated among a small proportion of consumers. 2 A better understanding of the distribution of costs among publically insured children is necessary to develop more viable approaches to cost containment.
The extant literature in this field has largely focused on cross-sectional analysis of distribution of costs over relatively short periods. For example, using Medical Expenditure Panel Survey (MEPS) for 1996 to 1998, McCormick et al found that the top 10% of child enrollees accounted for 69% of total pediatric health spending, 3 and using MEPS data for 2000 to 2001, Liptak et al found that the top 10% accounted for 54% of all costs. 4 For Medicaid and CHIP child enrollees in 2002 to 2005, Kenney et al (2009) found that the highest 10% of enrollees accounted for 72% of all program expenditures, whereas 30% of enrollees used no services at all. 5 Similar cost concentrations have been found in studies focusing on children in public health programs in a single state, or special-needs children.6,7 We are not aware of any studies that have explored whether programmatic changes such as increased cost-sharing or changes in eligibility to include less disadvantaged families influence cost concentration. Nor are we aware of studies that have explored whether child characteristics that predict being a high-cost user in one period also predict being a persistent high-cost user.
Regarding cost sharing and cost concentration, much of the medical care provided to children is preventive care that parents may regard as relatively discretionary. Hence, a modest increase in premiums may mostly deter take-up or re-enrollment by children in good health with minimal health expenses.5,8-10 Furthermore, authors of one of the seminal RAND health insurance studies posited that many children’s illnesses are “acute but self-limiting,” and may be sensitive to cost-sharing in the form of higher copayments. 11 Their empirical results found that outpatient visits among children were sensitive to higher copayments, whereas inpatient visits were less so. Thus, increased cost sharing may increase inequality in health care spending if it reduces re-enrollment and health service utilization by low-cost users while having little impact on high-cost utilizers.
An expansion in eligibility that adds families that are less disadvantaged, both in terms of income and parental education, to the insurance pool should increase health care utilization and average enrollee costs across the board, as such families may be more conscious about acquiring preventive services for their children and willing to spend more in copayments on both low-acuity care and high-acuity care. It is not clear, a priori, how this would affect cost concentration. However, if less disadvantaged families disproportionately utilize preventive and low-acuity care, then this may reduce cost concentration.
In this study, we use detailed longitudinal claims data from the ALL Kids program and add to the literature in several ways. First, we examine the distribution of annual expenditures and cost concentration over a 12-year period, during which there were programmatic changes such as copayment and premium increases, as well as an expansion of eligibility to children from more affluent families. Second, we examine the individual characteristics that predict being a high-cost enrollee in 1 enrollment year, as well as a high-cost enrollee in several consecutive years. Finally, we use detailed claims data to assess what health services high-cost enrollees are most likely to utilize, and what percent of these are potentially preventable. This study complements previous work by this research team on changes in enrollment, health service utilization and average health expenditures following increased cost sharing and expansion in eligibility, and persistence of high costs for enrollees with specific chronic conditions.10,12-16
Methods
We use pooled claims data from 1999 to 2011 from ALL Kids–Alabama’s CHIP program. Alabama is one of 12 states with separate CHIP and Medicaid programs. At the start of our study period, ALL Kids coverage was available to Alabama residents under age 19 with family incomes between 100% and 200% of the federal poverty level (FPL). Enrollees face annual premiums and copayments that vary across 3 groups defined by family income and Native American status. Children in families with incomes 100% to 150% of the FPL (termed the low-fee group) face lower levels of cost sharing, whereas children in families with incomes 150% to 200% of the FPL (termed the fee group) face higher levels of cost sharing. The third group, comprising primarily Native American children (“no fee group”), is federally exempt from all cost sharing. There are no upfront annual deductibles in the ALL Kids program, and out-of-pocket costs per year may not exceed 5% of the family income. 17
The ALL Kids program underwent 2 major changes over our study period. In October 2003, ALL Kids raised premiums and copayments for most non-preventive services. Thereafter, beginning in October 2009, the eligibility level was expanded to 300% of the FPL. These changes allow us to explore whether the distribution of costs changes following increased cost sharing and expanding eligibility to children from less disadvantaged families.
We investigated the share of expenditures incurred by the lower 90% of enrollees versus the 90th percentile and above (ie, top 10%) and 99th percentile and above (top 1%). This is done separately for 1999 to 2003 (ie, before the increase in cost sharing), 2004-2009 (ie, after the increased cost sharing but before expanded eligibility), and 2010-2011 (ie, after the eligibility expansion). Individual enrollees are classified based on their total claims-based expenditures, including those enrolled who had no claims. Observations for the top 1% are also included in overall observations for the top 10%. In addition to the share of expenditure of each type, we also present the average per member per month (PMPM) costs incurred overall, and for service category, adjusted to 2011 dollars.
We also compare socio-demographic characteristics of enrollees in the lower 90% versus the high-cost utilizers in the top 1% and 10% of the expenditure distribution. Children are considered high-cost utilizers during the fiscal year if they were among the top 1% or 10%, though the same child can be a high-cost utilizer in multiple fiscal years.
To characterize trends in the distribution of expenditures over time, we construct Gini coefficients for each fiscal year. The Gini coefficient is a measure of statistical dispersion conventionally used to measure inequality, and is independent of unit of measure or absolute size of group.18-22 Appendix A gives details about how they are constructed.
Furthermore, we estimate multivariable logistic regressions to identify the characteristics associated with being a high-cost utilizer belonging in the top decile of the expenditure distribution, and in the top percentile of the expenditure distribution. For each outcome category (ie, 10%, 1%), we estimate logistic models with 2 alternative sets of covariates: (1) patient socio-demographic characteristics, and (2) patient socio-demographic characteristics plus previous enrollment plus previous years of being a high-cost utilizer. Thereafter, we leverage the longitudinal nature of our data to investigate what characteristics are associated with being a “persistent high utilizer,” namely, for 3 consecutive fiscal years or more. We present these latter results for the top 1% only. All results are presented in the form of “marginal effects”—that is, the percentage point change in the likelihood of being in the top 10% (1%) associated with a specific characteristic.
Finally, we briefly summarize the specific conditions that account for the most spending among the top 1%, and whether any of these appear to be potentially preventable (Appendix B). This can help provide information to program administrators about areas that can be targeted for cost savings.
This study was approved by the institutional review board for use of secondary data.
Results
As shown in Table 1, in fiscal years (FYs) 1999 to 2003, the top decile accounted for 63.6% of all health expenditures in the ALL Kids program; the top 1% accounted for nearly 20% of total expenditures. After the increase in cost sharing (FYs 2004-2009), the figures for the top 10% and top 1% were 64.2% and 21.1%, respectively. After the expansion in eligibility (FY 2010-2011), the corresponding figures were 65.5% and 24.7%. Figure 1 presents Gini coefficients calculated for each year from 1999 to 2011. The Gini coefficients range between 0.68 and 0.70, suggesting that the pattern of inequality in the distribution of health care expenditure does not change substantially over time.
Comparison of the Health Expenditures by the Lower 90%, Top 10%, and Top 1% of ALL Kids Enrollees Over 3 Periods.
Note. All expenditures are adjusted to 2011 dollars using the Consumer Price Index. CPI. ED = emergency department; PMPM = per member per month; FY = fiscal year. Percentage changes in PMPM changes represent changes from the previous period for the specific cost-category.
Denotes how the expenditures for each group of enrollees (lower 90%, top 10%, and top 1%) are distributed across the categories of health services.
ED use that did not lead to an inpatient admission.

The concentration of expenditures is illustrated from FY 1999 to FY 2011 by the Gini coefficient, a measure of equality.
For all 3 periods, inpatient and outpatient services are the 2 highest categories of expenditures among the top 10%. Inpatient services account for the highest share of expenditures for the top 1%. The share declines slightly following increased cost sharing, but increases again with expanded eligibility. In contrast, for non–high-cost utilizers (ie, those below the 90% percentile) the highest share of expenditures comes from outpatient claims, followed by dental claims and prescription drugs expenditure. Emergency department (ED) expenditures are a relatively small category for all groups, and its share is particularly small for the top 1%.
A more complete picture emerges when these numbers are taken in conjunction with the inflation-adjusted average PMPM expenditure in all categories. Overall, average PMPM expenditures increase for all groups of enrollees, but the increases are larger for the top 1%. For example, between FYs 1999 to 2003 and 2004 to 2009, average overall expenditures for the lower 90% increase by 18% ($489 to $577) whereas those for the top 1% increase by 28% ($22 619 to $28 918). Noticeably, there is no growth in PMPM for inpatient services for the lower 90%, and a 15% increase in PMPM for outpatient services. In contrast, for the top 1%, PMPM for inpatient services increases by 18% and for outpatient services by 33%. From FYs 2004 to2009 to FY 2010 to 2011, the average overall expenditure for the lower 90% also increases by 18% ($577 to $683), but that for the top 1% increases by almost 43% ($28 918 to $41 436), leading to a further concentration in spending among the most expensive enrollees. The PMPM expenditures for all services increase over time, but the dollar increases from FYs 2004 to 2009 to FYs 2010 to 2011 are sizably larger than the changes from FYs 1999 to 2003 to FYs 2004 to 2009. For example, inpatient spending among the top 1% increases by 56.3% (from $16 748 to $26 182) across the 2 periods.
Table 2 presents descriptive statistics on socio-demographic characteristics. High-cost utilizers tend to be more concentrated among the youngest (0-1) and oldest (17-19) ALL Kids enrollees. High-cost enrollees are more likely to be Caucasian, have a chronic disease, have been previously enrolled in ALL Kids, and have previously been in the top percentile or decile of program expenditures.
Characteristics of ALL Kids Enrollee Months by “Non–High-Cost” and “High-Cost” Status.
Note. Pooled data for 1999 to 2011 are used. “High-cost status” is based on expenditures in that fiscal year only. Enrollees may have high-cost status in multiple fiscal years.
Table 3 shows the adjusted marginal effects for high-cost utilizers from multivariable logistic models. Note that, when modeling the likelihood of being a persistent 1% user (3 years or more), covariates are measured at the first year of the spell. Chronic disease was a statistically significant predictor of being within the top 10% of expenditures. Although still positive and statistically significant, the magnitude of the effect for being in the top 1% was smaller, and for being a persistent top 1% was even smaller. Similarly, having previously been in the top 10% or top 1% of expenditures, as well as having previous ALL Kids enrollment were associated with being a high-cost utilizer, although again the magnitudes were higher in models examining the likelihood of being in the top 10% as opposed to top 1% or persistent top 1%. Age was also an important predictor, such that all age groups compared with the youngest children were less likely to be among the top 10% of utilizers, with the exception of older children aged 17 to 19 years. Notably, it is better to interpret effects of age from model 1—there are obvious collinearity issues in model 2 as the reference category of infants 0 to 1 years old can neither have previous ALL Kids enrollment nor have previously been in a high-expenditure category. The general pattern appears to be that all children in age groups from 2 to 12 years are less likely to be high-cost compared with infants. However, 17- to 19-year-olds appear to have a somewhat higher likelihood than infants of being in the top 10%, though they are less likely to be in the top 1%. When it comes to persistently being in the top 1%, there is no significant difference between infants and 17- to 19-year-olds (many of whom age out of ALL Kids over the 3 years), and 13- to 16-year-olds are actually more likely than infants to be in that category.
Multivariable Logistic Regression for Characteristics Associated With Being a High-Cost Utilizer.
Note. The 95% confidence intervals are given in brackets. Pooled data for 1999 to 2011 are used, N = 1 021 377 enrollee-months. Top 10% of 1% is based on expenditures in that fiscal year only. Enrollees may have high-cost status in multiple fiscal years. Persistent 1% is based on being in the category for at least 3 subsequent years. Standard errors are clustered to account for repeat observations for the same child.
“Fee group” enrollees were less likely than “low-fee” group enrollees to belong to the top decile, and also slightly less likely to persistently be in the top percentile. No fee group enrollees were more likely to belong to both the top decile, though their results for being in the top percentile are mixed. Expansion group enrollees were more likely to be in the top decile than low-fee group enrollees in model 1—again, there are collinearity issues with this group and model 2 as they only start in ALL Kids from 2010.
We also conducted several supplementary analyses (not shown) to explore persistence. For example, among infants initially in the top decile, 74.1% remained in the program the next year, but only 29.5% remained after 3 years. Of those who remained, 45.6% were still in the top decile the next year, but only 20% of those enrolled still remained in the top decile in later years.
In Appendix B, we present a summary of the inpatient diagnoses among the top 1% by year, and broadly categorize them into conditions that appear to be potentially preventable, conditions that may be preventable but are outside the scope of ALL Kids as they relate to maternal prenatal health and health behaviors, and conditions that are not preventable. We find that the majority of inpatient cases fall in the last category.
Discussion
We analyzed the distribution of individual health expenditures in Alabama’s ALL Kids between 1999 to 2011, a period during which the program underwent changes in cost sharing as well as program eligibility. We found the top 10% of enrollees accounted for 64% to 66% of program expenditures, and the top 1% for 20% to 24.7%, throughout this period. These findings are broadly consistent with previous studies examining the distribution of health expenditures among children. Consistent with previous literature, we find that non-Hispanic whites and those with chronic diseases are more likely to be high utilizers,4,5 and that infants and older adolescents are the age groups most likely to have the highest expenditures.4,23 We additionally find that older adolescents may be most likely to persist in the highest percentile for 3 or more years.
Our results generally support arguments of Kenney et al that policies like nominal increases in cost sharing are likely to have minimal effectiveness in terms of cost containment. 5 Our previous work has suggested cost sharing discouraged re-enrollment 10 and modestly reduced or slowed growth in the utilization of some health services,12,16 but results here suggest that they did little to slow utilization among high utilizers. Our results also support earlier work indicating that average expenditures increased following the expansion in eligibility, 13 but we additionally found that increases among the top 1% were disproportionately higher in the post-expansion period, leading to further cost concentration.
Although we agree that cost-containment approaches should focus on “strategies that target Medicaid/CHIP spending among children in the upper spending decile,” 5 these strategies must be carefully selected. For example, targeting popular “wasteful” services like non-emergent ED use may not play a major role in reducing costs, as ED use that did not result in an inpatient admission accounts for only a small share of spending by high-cost utilizers. Better management of chronic diseases may be useful in limiting costs among the top decile, but is unlikely to help with reducing costs among the top percentile, given its small association with being in, or persisting in, the top 1%.
Finally, we identify the most common reasons for inpatient stays among the top 1%, and broadly categorize them as potentially preventable versus not as a way of providing some guidance to policy makers as to where attempts at cost containment could start. This is presented in Table 4. The large majority of inpatient stays appear to be due to “non-preventable” acute health events. A small fraction of potentially preventable cases are outside the scope of ALL Kids to address—for example, as the program does not cover non-minor pregnant women, the program cannot seek to improve prenatal care in an effort to reduce the risk of low birth weight. However, approximately 14% to 15% of inpatient stays that stem from causes like asthma, chronic disease complications, nutrition deficiency, or mood/anxiety disorders might be prevented with better access to preventive services, outpatient services, or better case management.
Most Common Reasons for Inpatient Stays Among Top Percentile.
Note. Data are based on a single primary diagnosis per child per day or per stay.
The study has several limitations. Because it is based on a free-standing CHIP program, distinct from the state’s Medicaid program, results may not be generalized to all publicly insured children. It focuses on claims-based expenditures from the program’s point of view, and does not inform on out-of-pocket expenditures for children. We have little information on “high-expenditure” status of enrollees who leave ALL Kids for Medicaid or private insurance. We also have no information on prenatal health behaviors of mothers whose infants fall in the high-expenditure category. We present a broad generalization of the categories of inpatient use that may be considered “preventable,” but cannot do an in-depth clinical review of individual records to further verify this. Finally, we cannot comment on the extent to which high-cost utilizers experience improvements in health and quality of life as a result of consuming higher levels of health services.
Overall, this study suggests that to effectively control public insurance program costs, more in-depth attention should be given to health services utilized by high-cost enrollees. Using information on socio-demographic characteristics, health service access and patterns of utilization, targeted interventions should be developed to reduce the use of avoidable care by the highest cost utilizers.
Footnotes
Appendix A
The Gini coefficient (G) is a unit-independent measure of statistical dispersion. It is frequently used in the analysis of inequality of income or expenditures. Mathematically, G is the ratio of the area between the expenditure distribution curve (Lorenz curve) and the line of equal distribution (45° straight line)—denoted as “A” in diagram below—to the area under the equal distribution curve—denoted as (A + B). Each (x, y) point on the curve means that the bottom x% of enrollees account for y% of the wealth. The 45° straight line represents all points where x = y (such that lowest 10% of enrollees account for 10% of expenditure, lowest 20% account for 20% of expenditure, and so forth). G can hypothetically range from 0 to 1; 0 represents the “most equal” situation—each person accounts for an equal share of expenditure and the Lorenz curve is identical to the 45° line, and 1 represents the most unequal distribution, where just one person accounts for all expenditure.
Appendix B
Percentage of Diagnosis for Inpatient Stays Among Highest 1% of Enrollees Due to “Preventable” and “Not Preventable Conditions,” 1999-2011.
| % Preventable by ALL kids a | % Preventable, outside scope of ALL kids b | % Not preventable c | |
|---|---|---|---|
| 1999 n = 118 |
16 | 2 | 82 |
| 2000 n = 152 |
14 | 1 | 85 |
| 2001 n = 163 |
15 | 1 | 83 |
| 2002 n = 168 |
14 | 1 | 85 |
| 2003 n = 180 |
14 | 1 | 84 |
| 2004 n = 189 |
14 | 1 | 85 |
| 2005 n = 184 |
14 | 1 | 85 |
| 2006 n = 176 |
9 | 1 | 90 |
| 2007 n = 171 |
10 | 1 | 89 |
| 2008 n = 186 |
13 | 1 | 86 |
| 2009 n = 185 |
13 | 1 | 86 |
| 2010 n = 197 |
13 | 1 | 86 |
| 2011 n = 194 |
15 | 1 | 84 |
Note. This categorization is based on over 200 types of claims that were the primary reason of inpatient stays among the top 1%. The rows may not add up to 100% due to rounding.
Includes conditions such as mood disorders, anxiety disorders, attention-deficit, chronic disease complications, nutrition deficit, anemia, asthma, influenza, upper respiratory infection, otitis media, tonsillitis.
Includes hypertension during pregnancy, low birth weight.
Includes conditions such as malignancy, organ anomaly/disorder, organ failure, acute medical conditions (acute cardiovascular disease, appendicitis, injury, fractures, etc), infection.
Authors’ Note
The study team had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. The design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, and approval of the manuscript were the sole responsibility of the authors listed and were not influenced by the research’s sponsor.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was funded by the Alabama Department of Public Health.
