Abstract
Although missing data are found in all types of data sets, surveys are particularly prone to produce data sets in which values of some respondent variables are missing (see, e.g., Cochran, 1977; Ericson, 1967; Kalton, 1983; and Hutcheson and Prather, 1977). Survey data collected for end-use energy demand models are no exception; high frequencies of nonresponse occur for many variables. This issue is, however, generally disregarded in the end-use literature, and analysts working with end-use models often discard cases in which values are missing for variables required by their models (see e.g., U.S. Government, 1983; Pacific Gas and Electric, 1983; Hirst and Carney, 1978; and EPRI, 1977). Discarding cases with missing values has important consequences. It implicitly assumes that the missing values occur randomly rather than systematically. If, however, missing values do not occur randomly, discarding cases with missing values will result in misspecified models and biased forecasts. Furthermore, by discarding cases, the detail appropriate for a given end-use model can be lost.
Get full access to this article
View all access options for this article.
