Abstract
Introduction
The dramatic rise in obesity prevalence and its well documented adverse effects have become a challenging issue for policy makers and academics over the past 2 decades. An individual is classified as obese when the body mass index (BMI) equals 30 or more (BMI is calculated as weight in kilograms divided by height in meters squared). Over 1 billion individuals worldwide are overweight, with approximately 300 million obese. 1 Obesity is a precursor of many chronic diseases (eg, cardiovascular disease, type 2 diabetes, hypertension, liver disease, as well as certain types of cancer). 2 In addition, obesity may cause psychological disorders through societal prejudice and discrimination against obese individuals. 3 Moreover, the economic cost attributable to overweight and obesity is substantial.4–6 For example, a recent study estimates that the total economic cost of overweight and obesity in the United States is $270 billion yearly and the cost in Canada is $30 billion yearly. 6 There has been remarkable increase in the prevalence of obesity in Canada, the adult obesity prevalence rate almost doubled from 13.8% in 1978 to 23.1% in 2004. 7 This has been confirmed by an increase in the average BMI of Canadian adults from 25.2 in 1994 to 26.5 in 2008.
Several studies have argued that technological innovation may have contributed to increase body weight through a reduction in food prices, as well as the promotion of sedentary behaviors.8–11 The World Health Organization 12 together with empirical studies has linked individual’s diet and nutrition behavior including the consumption of fruits and vegetables (FV) to the global rise in obesity (for a comprehensive review, see Rolls et al and Tohill et al). 13 , 14 The health benefits of adequate consumption of FV daily (5 servings or a minimum of 400 grams) are enormous. 12 , 15 Inadequate consumption of FV has been linked to approximately 2.7 million deaths per year, 19% of gastrointestinal cancers, 31% of ischemic heart diseases, and 11% of strokes. 12 , 16
The rising obesity rate in Canada has been accompanied by increasingly poor eating behavior among Canadians. 17 According to Health Canada, 18 Canadian eating habits do not fully meet Canada’s food guide to healthy eating. A significant proportion of the Canadian population aged 12 and older reported consuming FV less than 5 times per day. 19 For example, during the period 2003–2010, approximately half of Canadian females and more than 60% of males consumed FV less than 5 times per day (see Figure 1). It is assumed that the frequency of FV consumption is equal to serving, hence, consuming FV less than 5 times per day is below the recommended level. 20 Whereas in the United States, more than two-thirds of adults consume fruits less than 2 times per day and three-quarters consume vegetables less than 3 times per day in 2009. 21

Percentage of males and females aged 12 or older reporting that they consumed fruits and vegetables at least 5 times daily in Canada from 2003 to 2010.
High intake of FV may help in reducing the risk of obesity because most FV are high in water and fiber content and low in fat content. 13 Thus, adding FV to the diet enhances satiety, reduces feelings of deprivation and hunger, and reduces energy intake. 13 There is mixed empirical evidence about the association between FV intake and body weight in both clinical 13 and epidemiologic studies. 14 In some studies, body weight is negatively associated with the intake of FV, 22 , 23 while other studies find no significant association.24–26 For example, using a sample of children and adolescents in the United States, Field et al 24 find that the intake of FV or juice is not related to changes in BMI during 3 years of follow-up. In a prospective cohort study among middle-aged women, He et al 22 find that the higher the consumption of FV over time, the lower the risk of obesity and weight gain. Using a sample of participants in the Baltimore Longitudinal Study of Aging, Newby et al 23 find that consuming a diet rich in FV and low in fat, dairy, whole grains, meat, fast food, and soda is associated with smaller gains in BMI and waist circumference.
Recently, several studies have examined the indirect effect of FV on BMI through its prices. For example, using repeated cross-sectional US data, Auld and Powell 10 find that the prices of FV are positively associated with adolescents BMI. They also find that a decrease in the relative price of FV (a proxy for low energy-dense foods) tend to reduce BMI if the price per calorie of less energy-dense foods is lower than those of high energy dense food. Sturm and Datar 27 find that lower real prices for FV predict a significantly lower gain in BMI between kindergarten and third grade. Some studies report that there are gender differences in eating patterns and report how these affect body weight. 28 , 29 Baker and Wardle 30 find that men consume fewer servings of FV daily than women. They attributed this to poorer nutrition knowledge of men relative to women: they find that men are less likely to know the healthy recommendations for FV intake, and the benefits of FV consumption for disease prevention.
The objective of this study is to examine the association between FV intake and body weight along different points of the BMI distribution using data from the Canadian Community Health Survey (CCHS). The key contribution of this study is twofold. First, most of the previous studies report only the bivariate association between the intake of FV and BMI, without controlling for confounding factors like socio-demographic and lifestyle (such as physical activity and smoking status) which have been shown to be important determinants of individual BMI. 14 Second, previous multivariate studies mostly use linear regression methods to examine the correlates of the conditional mean of BMI. This approach may be less informative if the association between the intake of FV and BMI significantly varies across the BMI distribution. Moreover, logistic regression treats observations that exceed a particular cut off level equally. For example, 2 individuals with a BMI of 40 and 30 are equally classified as being obese, notwithstanding the intensity of obesity for the first person is higher. This leads to a statistical loss of information that may be relevant for intervention measures. Individuals may respond differently to the factors causing obesity, depending on their location in the BMI distribution.
Accordingly, this study uses a quantile regression framework to characterize the heterogeneous association across the different quantiles of the BMI distribution. This is relevant to the nutrition and obesity literature where attention is given to certain segments of the BMI distributions. For example, individuals in the upper quantiles of the BMI distribution, both obese and overweight, are of more interest to policies aimed at reducing obesity. Standard linear regressions, like ordinary least squares (OLS), estimate the effect of different covariates on the conditional mean of the BMI. This average effect may over or under estimate the influence of the covariates at different points across the BMI distribution and, hence, may lead to misleading policy inferences.
Data
This study uses data from the Statistics Canada 2004 Canadian Community Health Survey (CCHS) cycle 2.2. CCHS is a nationally representative cross-sectional survey of the Canadian population and it collects vital information on health related behavior, as well as corresponding economic and social-demographic variables. The survey excludes those living on Indian Reserves and Crown Lands, institutional residents, full-time members of the Canadian forces, and residents of certain remote regions. 45 889 households were selected to participate in Cycle 2.2 of the CCHS. However, a national response rate of 76.5% was achieved. Data were collected in person and approximately 7% of respondents had their first 24-hour dietary recall interview completed over the telephone. The nutrition questionnaire of the CCHS consists of 2 components: general health and 24-Hour Dietary Recall. The general health component has information about socio-demographic characteristics of respondents, their height and weight, physical activity, and chronic health conditions. The 24-hour dietary recall component has information about all the food and beverages a respondent consumed during the 24 hours preceding the interview. A second dietary recall interview was conducted 3 to 10 days after the initial interview. Unlike previous cycles of the CCHS, Cycle 2.2 has the merit that the BMI is measured rather than self reported. This was done personally by the interviewer. For more information about the CCHS, see Statistics Canada. 31 We restrict the sample to those aged 14–65 years, and after excluding missing observations, the sample includes 11 818 individuals. Seniors tend to have a low BMI due to ageing rather than dietary choice. The eating behavior of children is largely affected by their parental background. Accordingly, we restricted our sample to those aged 14–65 so as to minimize factors that may bias our results.
The BMI, which is the dependent variable, is derived from the measured anthropometric information (height and weight) available in the CCHS. The BMI is calculated as body weight in kilograms divided by height in meters squared. The merit of using the 2004 CCHS cycle 2.2 is that the BMI is based on a respondent’s actual (measured) weight and height. This study follows the standard in the literature by using a set of covariates that has been shown to be potential determinants of the BMI. The independent variable of interest is an individual’s FV consumption. This variable indicates the total number of times per day the respondent consumes FV. Other individual socio-demographic and lifestyle variables are also included in the analyses. Age is represented by 3 categories: 14–30 (reference group), 31–50, and 51–65. Gender is captured by a dummy variable (male = 0, female = 1). Marital status is represented by 3 dummy variables: married, separated and single (reference group). Four dummy variables represent an individual’s educational attainment: less than secondary, secondary, some post secondary (reference group), and post secondary. Household income is represented by 4 dummy variables: less than $30,000 (reference group), $30,000 to $49,999, $50,000 to $79,999, and $80,000 or more. A dummy variable indicating individual social interaction (sense of belonging to a local community) is included (strong =1, weak = 0). Individual physical activity level is represented by 3 categories: active, moderate, and inactive (reference group). This classification is based on the total daily energy expenditure values (kcal/kg/day) on leisure-time physical activities. The daily energy expenditure for each activity is measured using the frequency, duration per session and the metabolic energy cost of the activity. An individual is classified as physically active if the total daily energy expenditure is greater than 3, as moderately active if the total daily energy expenditure is greater than 1.5 and less than 3 and inactive otherwise. For more information see (statistics Canada, 2005). Smoking status is classified as: never smoker (reference group), current smoker, and former smoker. Immigration status is captured by a dummy variable (immigrant = 1, non-immigrant = 0). Provincial or regional effects are captured in 5 categories: Ontario, British Colombia, Atlantic (comprising New Brunswick, Prince Edward Island, Nova Scotia and Newfoundland and Labrador), Western (Alberta, Saskatchewan and Manitoba) with Quebec as the reference group.
Method
Economists have developed economic models to explain how individuals engage in different consumption behaviors. Individuals maximize their utility subject to income, time and other resource constraints (eg, Lakdawalla and Philipson and Auld and Powell). 9 , 10 One of those behaviors is food consumption. These models are based on the Becker and Murphy 32 rational addiction (RA) framework which has become the canonical model of analysis. Consumers in this model make optimal choices on what to consume. Borrowing from the behavioral economics literature, Ruhm 33 adds to the traditional economic model by allowing for the possibility that individual weight outcomes could also be determined by biological and environmental cues. These cues can subvert the decision part of the brain which may lead to sub-optimal choices. In his model, advances in food engineering by producers may have contributed to the difficulty of resisting food cravings.
To examine how BMI is associated with the frequency of FV consumption across the BMI distribution, we estimate the following quantile regression 34 , 35 :
where i and j denote individual and province of residence. BMI denotes an individual’s Body Mass Index which is derived from measured height and weight. fv denotes the frequency of fruit and vegetable consumption and X is a vector of control variables. τ represents quantile,
Results
Summary statistics of the variables used in the analyses are reported in Table 1. The mean BMI is 26.5, which indicates that, on average, the study population is slightly overweight. The average number of FV servings per day is approximately 4, and this is below the recommended number of 5 times per day. In terms of educational level completed, approximately 45% have completed 1 or more post secondary education, whereas 21% have less than secondary education. A large percentage (54%) of the sample is physically inactive. Approximately 53% are female and 21% are immigrants.
Summary Statistics
The statistics are weighted using the CCHS sampling weights.
The BMI quantile regression and the OLS estimates for the full sample are reported in Table 2 for some selected quantiles between the 10th and 90th BMI distribution. In addition, Figure 2 displays the OLS and quantile regression estimates for the BMI determinants over the entire BMI distribution. The conditional mean estimate of BMI shows a negative relationship between FV and BMI. The quantile regression enables us to examine the heterogeneous responses of individual’s BMI to the model covariates at different tails of the BMI distribution.
OLS and Quantile Regression Results for BMI Determinants at Selected Quantiles for the Whole Sample
Robust standard errors are in parentheses. ***

OLS and quantile regression estimates for BMI determinants: whole sample
Although the results reveal that the frequency of FV intake has a negative and statistically significant association with BMI, the coefficient of FV varies across quantiles of the conditional BMI distribution. In particular, the FV coefficient increases in size for individuals at higher points of the conditional BMI distribution. For example, the coefficient of FV at the 90th quantile is almost 3 times the estimate at the 30th quantile, suggesting that an increase in the intake of FV may be an effective dietary strategy to control weight and reduce the risk of obesity especially for the overweight. The FV estimate at the median (50th quantile) is equal to the OLS estimate.
In terms of other control variables, the results show substantial differences across the quantiles of the BMI distribution (see Figure 2). Age has a positive relationship with BMI; those that are older (51 to 65, 31 to 50 years old) have higher BMI than the reference group (14 to 30 years old). The female coefficient is negative and statistically significant, indicating that females have less BMI compared to males. At the 90th quantile, the female estimate changes sign to positive. This means that at the 90th percentile, females’ BMI are higher than males. In general, the variables: married, separated and former smokers have positive and statistically significant association with BMI. Also, physically active (active and moderate), current smoker, and immigrant variables have a negative association with BMI. The association between socio-economic status (education and income) and BMI is less clear both in terms of sign and size of estimates.
The BMI quantile regression results for males are reported in Table 3 while females are reported in Table 4. The estimates for FV for both males and females reveal a similar pattern to the full population estimates shown in Table 2. The association between the frequency of fruit and vegetable intake and BMI is negative and statistically significant. Other covariates are largely similar to the full sample results with the exception of the income variables. For males, individuals in a high income household have higher BMI than those in low income households. The income variables for females have a negative effect on BMI in most of the quantiles and only a few of the estimates are statistically significant.
OLS and Quantile Regression Results for the BMI Determinants at Selected Quantiles for Males
Robust standard errors are in parentheses. ***
OLS and Quantile Regression Results for the BMI Determinants at Selected Quantiles for Females
Robust standard errors are in parentheses. ***
Discussion
It has been reported that worldwide, 1 in 3 and 1 in 9 adults are overweight and obese respectively. 36 Several studies link obesity and excess weight to the eating behavior of individuals, which includes the consumption of fruits and vegetables (FV). 13 , 14 The health benefits of consuming FV are numerous. 12 , 15
Evidence from the clinical and epidemiological literature on the relationship between the intake of FV and body weight is inconclusive. In this study, we examine the association between the consumption of FV and the body mass index (BMI) using data from the Canadian Community Health Survey. Based on the unconditional estimates, we find that the daily average number of FV servings among individuals in our sample is approximately 4, which is below the recommended number of 5 servings per day. Results, from the OLS baseline model, show that the conditional mean of the BMI is negatively and significantly associated with FV consumption. We use a quantile regression to characterize the effect of FV consumption on the entire BMI distribution. We find that the association between FV intake and BMI is negative and statistically significant. Quantile regression shows that this association varies significantly across the conditional BMI distribution. In particular, the effect of FV increases in size for individuals at higher points of the conditional BMI distribution. The estimates for both males and females reveal similar patterns to the full population estimates that FV intake is negatively and significantly associated with BMI. The OLS model overstates (understates) the effect of FV intake on the BMI at the lower (higher) half of the conditional BMI distribution. This suggests that conclusions from standard models (eg, OLS) that assume uniform response across different quantiles of the BMI distribution may be misleading.
Results for the other BMI determinants are comparable to previous studies. Socioeconomic status (SES), as usually measured by income and education level, largely affects the dietary choices of individuals. 37 The level of income affects the amount of financial resources available for healthy and nutritious food, and also the time devoted to physical activity. 38 Educational attainment affects nutritional knowledge and awareness about the benefits of physical activity. Several studies show that people with higher SES have healthier, nutritionally more balanced diets and are more physically active than those with lower SES. 39 , 40 Existing literature mostly documents a negative association between SES and the BMI among females in developed countries, however, this association is less consistent among males. 41 In line with previous studies (eg, McLaren and Sánchez-Vaznaugh et al), 42 , 43 we find a negative SES gradient in BMI among females, and a relatively strong positive income gradient among males.
Results for both males and females show that smoking status significantly affects the BMI. In particular, we find that smokers have lower BMI, while former smokers have a higher BMI, compared to those who never smoked. This is consistent with the general belief that smoking cessation is usually associated with an increase in the BMI. 44 For example, using a prospective study, Munafò et al 44 find that the BMI of never and former smokers is on average 1.6 kg/m2 higher than the BMI of current smokers. The authors also find an average increase in BMI of 1.6 kg/m2 due to smoking cessation. It has been reported that smoking suppresses the appetite, 45 where smokers may have higher metabolic rates than non-smokers and, hence, smoking may be used to control weight. 46
We find that immigrants have lower BMI than natives. This is in line with the findings of an early study on differences in obesity prevalence among US immigrants and natives. 47 The authors find that immigrants in the United States in general have lower BMI than natives, but these differences decreases overtime due to acculturation and the influence of the US lifestyle. Results also show that the BMI increases with age and this is consistent with a previous study by Baum and Ruhm, 48 who predicted an annual increase in the BMI of 0.12 kg/m 2 . Since physical activity affects the expenditure side of the energy balance equation, it is well established that regular physical activity is an important determinant of bodyweight, people who are physically active being less likely to be obese. 49 Our results are consistent with this evidence.
This study has some strength. First, the BMI used is based on measured height and weight rather than the frequently used self-reported measure. It has been documented that individuals tend to over-report their height and under-report their weight, which may have implications in terms of the consistency of estimated parameters. Second, many of the previous studies report only the bivariate association between the intake of FV and BMI which could lead to misleading conclusion about the true association. 13 Moreover, previous multivariate studies mostly estimate the effect of FV on the conditional mean of BMI/likelihood of being obese using standard linear/binary response regressions. Results from these estimation methods assume that the effect of the explanatory variables is the same at different parts of the BMI distribution. However, nutrition promotion and weight management policies give more attention to individuals at certain segments of the BMI distribution.
The current study has some limitations. First, the cross-sectional design of the analysis limits the ability to infer causality. Second, due to data limitation, the intake of FV is based on the number of times per day an individual consumes FV rather than the quantity consumed. Third, we did not control for the form in which FV are consumed. FV in their natural physical shape are low in energy density and have higher satiety effects, while they become more energy dense when fried, served with high-calorie sauces or dried. 13 Fourth, there may be omitted variable bias due to unobserved individual characteristics like preferences.
Conclusion
From the public policy perspective, the findings of this paper suggest that policies aimed at increasing the intake of FV may help to control weight and mitigate the risk of obesity. The multivariate analyses showed that conclusions from the standard models that assume uniform response across different quantiles of BMI distribution may be misleading. Accordingly, understanding how the association between FV and BMI depends on individuals’ location on the BMI distribution may help in implementing intervention measures that target the most vulnerable groups (overweight and obese).
Footnotes
Acknowledgements
We are thankful to 2 anonymous referees of this journal as well as Gordon Fisher and Ian Irvine for their valuable comments and suggestions.
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
The authors received no financial support for this study.
