Estimation of Poverty Measures for Small Areas Under a Two-fold Nested Error Linear Regression Model: Comparison of Two Methods

Abstract

Demand for reliable statistics at a local area (small area) level has greatly increased in recent years. Traditional area-specific estimators based on probability samples are not adequate because of small sample size or even zero sample size in a local area. As a result, methods based on models linking the areas are widely used. The World Bank has focused on estimating poverty measures, in particular poverty incidence and poverty gap, which belong to the family of FGT measures, using a simulated census method, called ELL, based on a one-fold nested error model for a suitable transformation of the welfare variable. Modified ELL methods leading to significant gains in efficiency over ELL have also been proposed under the one-fold model. An advantage of the ELL and modified ELL methods is that distributional assumptions on the random effects in the model are not needed. In this article, we extend ELL and modified ELL to two-fold nested error models to estimate poverty indicators for areas (say, a state) and subareas (say, counties within a state). Our simulation results indicate that the modified ELL estimators lead to large efficiency gains over ELL at the area level and subarea level. Further, the modified ELL method retaining both area and subarea estimated effects in the model (called MELL2) performs significantly better in terms of mean squared error (MSE) for sampled subareas than the modified ELL method retaining only the estimated area effect in the model (called MELL1).

AMS Subject Classification: 62D05, 62G09

Keywords

Areas and subareas; ELL and modified ELL methods; poverty incidence and gap; two-fold nested error model

1. Introduction

Data collected from probability samples can provide reliable estimates of parameters of interest for domains (subpopulations) with large enough sample sizes to permit direct, domain-specific estimators of desired precision. We call such domains large areas. On the other hand, sample sizes can be very small or even zero for local areas (called small areas), and direct estimators are then not adequate or not feasible. Demand for reliable statistics at the level of small areas has increased greatly, and it is necessary to use model-based methods that can yield reliable estimates for small areas by integrating information across areas through linking models. Rao and Molina^[13] provide a comprehensive account of model-based small area estimation of means, totals, and more complex parameters like poverty measures.

In this article, we focus on the estimation of FGT poverty measures, proposed by Foster et al.^[6] Poverty incidence, gap and severity belong to the family of FGT measures. The World Bank has widely used a method proposed by Elbers et al.^[5], called the ELL method, to provide FGT poverty measures for specified local areas in many developing countries. The ELL method involves the following steps: (a) Simulate multiple censuses of the welfare variable of interest based on an assumed model relating the welfare variable to auxiliary variables obtained from a recent census; (b) Calculate the FGT measure for specified local areas from each simulated census and then take the average over the censuses as the ELL estimator; (c) Take the variance of the simulated census estimators as the estimator of the mean squared error (MSE) of the ELL estimator. An advantage of the ELL method is that it is free of parametric distributional assumptions and computationally simple. However, Molina and Rao^[11] showed that the ELL method can lead to large MSE compared to an optimal method, called the Empirical Best (EB) method, assuming a one-fold nested error linear regression model with normally distributed random effects. Diallo and Rao^[4] developed a modification to the ELL method that leads to a substantial reduction in MSE and compares favorably to the normality-based EB method. As in the ELL method, the modified ELL method is free of parametric distributional assumptions. The proposed ELL, modified ELL and EB methods are based on a one-fold nested error linear regression model relating a suitable function of the welfare variable to the census variables and a random area effect. Sample survey data observing the welfare variable and the census variables, based on two-stage cluster sampling, are used to fit the one-fold model. In the traditional ELL method, random cluster effects are included in the model and simulated censuses are generated. From a simulated census, a desired poverty measure is calculated for any desired small area. Note that it is not necessary to specify the areas in advance because area effects are not included in the ELL one-fold model. Hossain et al.^[9] used a two-stage sample of districts and households within districts to estimate a food insecurity measure at the district level in Bangladesh. In this case, clusters are areas. In this article, we focus on two-fold random effect models involving area and subarea random effects. For example, an area could refer to a state and a subarea to a county within a state. Marhuenda et al.^[10] studied EB estimation of FGT poverty measures under the two-fold model, assuming that the random effects in the model are normally distributed, as in the case of the one-fold model studied by Molina and Rao^[13]. In their application to Spanish survey data, areas are provinces and subareas are comarcas, and it is of interest to obtain estimates of poverty measures at the domain as well as subdomain level.

Section 2 sets the stage by briefly describing the ELL, modified ELL and EB methods for estimating poverty indicators for areas under a one-fold nested error regression model. Section 3 introduces the two-fold model and the associated FGT poverty measures for domains and subdomains. Section 4 extends the ELL and modified ELL methods to two-fold models with no distributional assumptions on the random effects in the model, as in the case of the one-fold model. Section 5 presents some results of a simulation study on the performance of ELL and modified ELL estimators. Finally, some remarks on the estimation of MSE of the estimators are given in Section 6.

2. One-fold Nested Error Model

In this section, we introduce the ELL method and its modification (MELL), based on a one-fold nested error linear regression model relating a log transformation of a welfare variable of interest $E$ to known census variables $x$ for the finite population of interest. In the ELL method, a two-stage cluster sample $\{(E_{d j}, x_{d j}^{T}), d = 1, \dots, D; j = 1, \dots, n_{d}\}$ is used in conjunction with known population records/census variables $\{x_{d j}, d = 1, \dots, D_{p}; j = 1, \dots, N_{d}\}$, where $(D_{p}, D)$ denote the number of clusters in the population and the sample, and $(N_{d}, n_{d})$ denote the number of population units and sample units in cluster $d$. An intercept term is included in the covariate vector $x_{d j}$. To reduce skewness in the welfare variable, a suitable transformation $y_{d j} = T (E_{d j})$ is used, where $T (\cdot)$ is one-to-one. In the ELL method, $y_{d j} = log (E_{d j})$ is used. Typically, clusters are nested in the areas of interest. For simplicity, we assume here that clusters are the same as the areas.

The FGT poverty indicator for area $d$ is given by

F_{α d} (z) = \frac{1}{N_{d}} \sum_{j = 1}^{N_{d}} F_{α d j},

(2.1)

where

F_{α d j} = {(\frac{z - E_{d j}}{z})}^{α} I (E_{d j} < z) .

(2.2)

In (2.2), $z$ is the known poverty line and $I (E_{d j} < z)$ is the indicator variable taking the value 1 when $E_{d j}$ is smaller than $z$ and 0 otherwise. Poverty incidence, poverty gap and poverty severity correspond to $α = 0$, $α = 1$, and $α = 2$, respectively. We use a one-to-one transformation $y_{d j} = log (E_{d j})$ of the welfare variable $E_{d j}$. Often, a positive constant $c$ is added to $E_{d j}$ to ensure $E_{d j} + c > 0$. Here we assume $E_{d j} > 0$ for all the population units. We can express $F_{α d j}$ in terms of the transformed variable $y_{d j}$ as

F_{α d j} = {(\frac{z - exp (y_{d j})}{z})}^{α} I (exp (y_{d j}) < z) : = h_{α} (y_{d j}) .

We focus here on the additive FGT poverty measures, $\sum_{j} h_{α} (y_{d j})$ , but the ELL method and its modifications are applicable to general measures of the form $h (y_{d 1}, \dots, y_{d N_{d}})$ .
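As a concrete illustration of (2.1) and (2.2), the following minimal sketch evaluates $h_{α} (y_{d j})$ and the area-level FGT measure from log-welfare values. The function names `fgt_unit` and `fgt_area` are hypothetical and chosen here only for illustration; they are not part of any published software.

```python
import math

def fgt_unit(y, z, alpha):
    """h_alpha(y): unit-level FGT contribution for log-welfare y and poverty line z."""
    welfare = math.exp(y)          # back-transform, E = exp(y)
    if welfare >= z:               # indicator I(E < z) is zero
        return 0.0
    return ((z - welfare) / z) ** alpha

def fgt_area(y_values, z, alpha):
    """Area-level FGT measure: average of h_alpha over all units in the area."""
    return sum(fgt_unit(y, z, alpha) for y in y_values) / len(y_values)

# Three units with welfare of roughly 50, 150 and 200 and poverty line z = 100.
y_area = [3.912023, 5.010635, 5.298317]
incidence = fgt_area(y_area, 100.0, 0)   # share of units below the line
gap = fgt_area(y_area, 100.0, 1)         # average relative shortfall
```

With $α = 0$ the function counts the share of units below the poverty line, and with $α = 1$ it averages the relative shortfall $(z - E_{d j}) / z$ over all units, poor or not.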

2.1. ELL Method

We assume a one-fold nested error linear regression model relating $y_{d j}$ to $x_{d j}$ :

y_{d j} = x_{d j}^{T} β + u_{d} + e_{d j}; d = 1, \dots, D_{p}, j = 1, \dots, N_{d}

(2.3)

where $u_{d}$ and $e_{d j}$ denote the random area effect and the residual error, which are independently distributed with means zero and variances $σ_{u}^{2}$ and $σ_{e}^{2}$, respectively. Parametric distributions on the two random effects are not assumed. Sampling is assumed to be non-informative in the sense that the population model holds for the sample data:

y_{d j} = x_{d j}^{T} β + u_{d} + e_{d j}; d = 1, \dots, D, j = 1, \dots, n_{d}

(2.4)

where we denote the first $n_{d}$ units as the sample in area $d$, without loss of generality. The following steps are used in the ELL method: (a) Calculate the sample residuals $r_{d j} = y_{d j} - x_{d j}^{T} \hat{β}$, where $\hat{β}$ is the ordinary least squares (OLS) estimator of the regression coefficient $β$ in the sample model (2.4) with estimated covariance matrix ${\hat{V}}_{β}$; (b) Estimate the random area effect $u_{d}$ as ${\hat{u}}_{d} = n_{d}^{- 1} \sum_{j} r_{d j}$; (c) Estimate the unit level residual $e_{d j}$ as ${\hat{e}}_{d j} = r_{d j} - {\hat{u}}_{d}$. Two referees noted that the above estimators are equivalent to estimators obtained from an analysis of covariance model treating $u_{d}$ as a fixed effect and imposing the condition $\sum_{d} n_{d} u_{d} = 0$; (d) Draw $B$ sets of bootstrap values $\{({\hat{β}}^{* (b)}, u_{d}^{* (b)}, e_{d j}^{* (b)}), d = 1, \dots, D; j = 1, \dots, N_{d}\}, b = 1, \dots, B$, from $N (\hat{β}, {\hat{V}}_{β})$, the empirical distribution of ${\hat{u}}_{d} (d = 1, \dots, D)$, and the empirical distribution of ${\hat{e}}_{d j} (d = 1, \dots, D; j = 1, \dots, n_{d})$, respectively; (e) Construct $B$ sets of simulated census values $y_{d j}^{* (b)}$ as follows: $\{y_{d j}^{* (b)} = x_{d j}^{T} {\hat{β}}^{* (b)} + u_{d}^{* (b)} + e_{d j}^{* (b)}, d = 1, \dots, D; j = 1, \dots, N_{d}\}, b = 1, \dots, B$, using the census values of the covariates $x_{d j}$; (f) Calculate simulated population FGT measures $F_{α d}^{* (b)} = N_{d}^{- 1} \sum_{j = 1}^{N_{d}} F_{α d j}^{* (b)}, b = 1, \dots, B$, where $F_{α d j}^{* (b)} = h_{α} (y_{d j}^{* (b)})$; (g) Calculate the ELL estimator of $F_{α d}$ by taking the average of $F_{α d}^{* (b)}$ over $b$:

{\hat{F}}_{α d}^{ELL} = \frac{1}{B} \sum_{b = 1}^{B} F_{α d}^{* (b)} .

(2.5)

The ELL method is also applicable to non-sampled areas $d$ following steps (d)-(g) above.

2.2. Modified ELL Method

The modified ELL method, proposed by Diallo and Rao,^[4] retains the estimated area effect ${\hat{u}}_{d}$ for a sampled area $d$, instead of using $u_{d}^{* (b)}$ as in the ELL method. The simulated values $y_{d j}^{* (b)}$ are generated from $y_{d j}^{* (b)} = x_{d j}^{T} \hat{β} + {\hat{u}}_{d} + e_{d j}^{* (b)}$ and steps (f) and (g) above are implemented using the simulated values to calculate the modified ELL (MELL) estimator of $F_{α d}$, denoted as ${\hat{F}}_{α d}^{MELL}$. For non-sampled areas $d$ we use the ELL method by generating the simulated values $y_{d j}^{* (b)}$ from $y_{d j}^{* (b)} = x_{d j}^{T} \hat{β} + u_{d}^{* (b)} + e_{d j}^{* (b)}$.
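The contrast between ELL and MELL under the one-fold model can be sketched in a few lines of code. The sketch below is illustrative only and rests on several simplifying assumptions that are not part of the original methods: the OLS fit is assumed to have been done elsewhere, so the inputs are the sample residuals $r_{d j}$ and the census linear predictors $x_{d j}^{T} \hat{β}$; the coefficient vector is held fixed at $\hat{β}$ in both branches (the ELL method itself draws ${\hat{β}}^{* (b)}$ from $N (\hat{β}, {\hat{V}}_{β})$); and the names `fgt` and `simulate_one_fold` are hypothetical.

```python
import math
import random

def fgt(y_star, z, alpha):
    """Average of h_alpha(y) over a list of simulated log-welfare values."""
    total = 0.0
    for y in y_star:
        e = math.exp(y)
        if e < z:
            total += ((z - e) / z) ** alpha
    return total / len(y_star)

def simulate_one_fold(sample_resid, census_xb, z, alpha, retain=False, B=50, seed=0):
    """
    sample_resid: dict area -> list of sample residuals r_dj
    census_xb:    dict area -> list of census linear predictors x_dj' beta_hat
    retain=False: ELL-type simulation, area effect redrawn in every replicate
    retain=True:  MELL-type simulation, u_hat_d kept for sampled areas
    """
    rng = random.Random(seed)
    # Steps (b)-(c): estimated area effects and pooled unit-level residuals.
    u_hat = {d: sum(rs) / len(rs) for d, rs in sample_resid.items()}
    e_pool = [r - u_hat[d] for d, rs in sample_resid.items() for r in rs]
    u_pool = list(u_hat.values())
    estimates = {}
    for d, xb in census_xb.items():
        total = 0.0
        for _ in range(B):
            if retain and d in u_hat:
                u = u_hat[d]                # sampled area: keep estimated effect
            else:
                u = rng.choice(u_pool)      # draw from the empirical distribution
            y_star = [lin + u + rng.choice(e_pool) for lin in xb]
            total += fgt(y_star, z, alpha)  # step (f)
        estimates[d] = total / B            # step (g)
    return estimates
```

Calling `simulate_one_fold(..., retain=True)` for an area that does not appear in `sample_resid` falls back to drawing the area effect, which mirrors the treatment of non-sampled areas described above.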

A simulation study with skew normal error $e_{d j}$ showed a large gain in efficiency for the MELL estimator over the ELL estimator for sampled areas $d$. For non-sampled areas $d$, ELL and MELL are the same. Diallo^[3] studied the performance of MELL when the ELL estimator ${\hat{u}}_{d}$ is replaced by the EBLUP estimator of $u_{d}$, using moment estimators of the variance components in the nested error model (2.4), which are distribution-free. The resulting MELL estimators of the FGT measures were not significantly more efficient than the MELL estimators based on the ELL estimators ${\hat{u}}_{d}$. A plausible reason for this lack of improved efficiency is that the EBLUP estimators of the $u_{d}$ are designed for optimal estimation of the area means of the transformed variables $y_{d j}$ and not for estimating FGT measures for the areas.

In the special case of area mean ${\bar{Y}}_{d}$, its ELL estimator is approximately equal to the synthetic estimator ${\bar{X}}_{d}^{T} \hat{β}$ for all areas, as noted in Molina and Rao^[13]. On the other hand, the MELL estimator of ${\bar{Y}}_{d}$ for a sampled area $d$ is approximately equal to the sample regression estimator ${\bar{y}}_{d} + {({\bar{X}}_{d} - {\bar{x}}_{d})}^{T} \hat{β}$, where ${\bar{y}}_{d}$ and ${\bar{x}}_{d}$ are the sample means for area $d$. For a non-sampled area, the MELL estimator of ${\bar{Y}}_{d}$ reduces to the synthetic estimator.
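A short sketch of why these approximations hold may be helpful. It assumes that $B$ is large, so that averages over the simulated censuses are close to expectations under the bootstrap distributions; the symbol $E_{*}$ for that expectation is introduced here only for illustration. It uses the facts that the pooled residuals ${\hat{e}}_{d j}$ average to zero by construction, that ${\hat{β}}^{* (b)}$ has mean $\hat{β}$, and that ${\hat{u}}_{d} = {\bar{y}}_{d} - {\bar{x}}_{d}^{T} \hat{β}$.

```latex
% ELL: the area effect is redrawn, so only its empirical mean survives averaging
E_{*} \Big( \frac{1}{N_{d}} \sum_{j = 1}^{N_{d}} y_{d j}^{* (b)} \Big)
  = {\bar{X}}_{d}^{T} \hat{β} + \frac{1}{D} \sum_{d' = 1}^{D} {\hat{u}}_{d'} + 0
  \approx {\bar{X}}_{d}^{T} \hat{β} .

% MELL, sampled area d: the estimated area effect is retained
E_{*} \Big( \frac{1}{N_{d}} \sum_{j = 1}^{N_{d}} y_{d j}^{* (b)} \Big)
  = {\bar{X}}_{d}^{T} \hat{β} + {\hat{u}}_{d} + 0
  = {\bar{y}}_{d} + {({\bar{X}}_{d} - {\bar{x}}_{d})}^{T} \hat{β} .
```

In the ELL case the average of the ${\hat{u}}_{d}$ is close to zero because OLS residuals from a model with an intercept sum to zero, so that $\sum_{d} n_{d} {\hat{u}}_{d} = 0$; the average is exactly zero when all $n_{d}$ are equal.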

Both ELL and MELL are applicable to more complex measures, such as the Gini coefficient and the Fuzzy Monetary Index (Neri et al.^[12]). Unlike the FGT measure, those measures are non-linear functions of the welfare variable $E_{d j}$ that are not additive. An advantage of those complex measures over the FGT measure is that knowledge of the poverty line is not required. Further, note that neither ELL nor MELL requires linking the sample file to the population register from which the auxiliary variables $x_{d j}$ are obtained.

2.3. EB Method

Molina and Rao^[13] studied empirical best (EB) estimation of FGT poverty measures, assuming normality of the area effects $u_{d}$ and the unit errors $e_{d j}$ . The best estimator of $h_{α} (y_{d j})$ for a non-sampled unit $j$ in area $d$ is obtained as the expectation with respect to the conditional distribution of $y_{d j}$ given the vector of sampled values in area $d$ . A closed form expression for the best estimator does not exist in general, but it can be approximated by simulating $y_{d j}^{(l)}, l = 1, \dots, L$ from the conditional distribution. Molina and Rao^[13] showed that $y_{d j}^{(l)}$ can be generated from a univariate normal distribution. EB estimator is obtained by replacing the model parameters in the best estimator by suitable estimators.

A limitation of EB estimation is that it requires linking the sample to the population for identifying the non-sampled units in each area, unlike ELL estimation. If linking is not feasible, then we can obtain EB estimator of $h_{α} (y_{d j})$ for all the population units $j$ in area $d$ , leading to Census EB estimator (Guadarrama et al.^[8]). The Census EB estimator is less efficient than the EB estimator when linking is feasible, but the loss in efficiency is small when the area sampling fraction is small. Corral-Rodas et al.^[2] provide refinements and extensions to the EB method of Molina and Rao (MR). A referee noted that the poverty maps produced by the World Bank recently were based on the EB method rather than the ELL method.

Diallo and Rao^[4] studied EB estimation under the one-fold model (2.4) with skew normal (SN) errors $e_{d j}$ and normal area effects $u_{d}$. Their simulation results showed that the normality-based EB of Molina and Rao^[11] can be inefficient due to inflation of MSE caused by substantial bias in the estimator. However, the EB estimator of Diallo and Rao^[4] is very complex, and they proposed a simplified EB method which performed quite well compared to the EB method under SN errors. The modified ELL method is less efficient than the tailor-made EB methods of Diallo and Rao^[4] under the model with SN errors. Note that distributional assumptions on $u_{d}$ and $e_{d j}$ are not needed for the ELL and modified ELL methods. Graf et al.^[7] developed EB estimators by modelling the welfare variable $E_{d j}$ directly using a Generalized Beta distribution of the Second Kind.

3. Two-fold Nested Error Model

As in Marhuenda et al.^[10], the finite population of interest consists of $D$ areas (domains) $d = 1, \dots, D$, and area $d$ is divided into $M_{d}$ subareas (subdomains) $j = 1, \dots, M_{d}$. The subdomain $j$ within the domain $d$ contains $N_{d j}$ elements $k = 1, \dots, N_{d j}$. The population data are denoted as $\{(E_{d j k}, x_{d j k}^{T}), d = 1, \dots, D; j = 1, \dots, M_{d}; k = 1, \dots, N_{d j}\}$, where $E_{d j k}$ is the welfare variable of interest and $x_{d j k}^{T} = (x_{1 d j k}, \dots, x_{p d j k})$ is a $p$-vector of known census variables. If an intercept term is needed, then we set $x_{1 d j k} = 1$ for all the population units. To reduce skewness in the welfare variable, a suitable transformation $y_{d j k} = T (E_{d j k})$ is used. For the FGT poverty measures, we make a log transformation $y_{d j k} = log (E_{d j k})$, assuming $E_{d j k} > 0$ for all the population units, as in Section 2.

A two-fold nested error population model relating the transformed variable $y_{d j k}$ to the census variables $x_{d j k}$ is given by

y_{d j k} = x_{d j k}^{T} β + u_{d} + v_{d j} + e_{d j k}; d = 1, \dots, D, j = 1, \dots, M_{d}, k = 1, \dots, N_{d j},

(3.1)

where $β$ is a $p \times 1$ vector of unknown regression parameters, $u_{d}$ are the area effects, $v_{d j}$ are the subarea effects, and $e_{d j k}$ are the residual errors. The three random errors $u_{d}, v_{d j}$, and $e_{d j k}$ are independent with $E (u_{d}) = E (v_{d j}) = E (e_{d j k}) = 0$. Parametric distributions on the two random effects and the unit errors are not assumed.

We assume two-stage sampling in each area: a sample, $s_{d}$, of $m_{d} (\leq M_{d})$ subareas is selected from area $d$ and, if subarea $j$ is sampled, then a subsample, $s_{d j}$, of $n_{d j}$ elements is selected from subarea $j$. We further assume that the population model (3.1) also holds for the sample data $\{(y_{d j k}, x_{d j k}), d = 1, \dots, D; j = 1, \dots, m_{d}; k = 1, \dots, n_{d j}\}$. Therefore, the model for the sample data is given by

y_{d j k} = x_{d j k}^{T} β + u_{d} + v_{d j} + e_{d j k}; d = 1, \dots, D, j = 1, \dots, m_{d}, k = 1, \dots, n_{d j} .

(3.2)

The FGT population measure for area $d$ is given by

F_{α d} (z) = \frac{1}{N_{d}} \sum_{j = 1}^{M_{d}} \sum_{k = 1}^{N_{d j}} F_{α d j k},

(3.3)

where $N_{d} = \sum_{j} N_{d j}$ and

F_{α d j k} = {(\frac{z - E_{d j k}}{z})}^{α} I (E_{d j k} < z) .

(3.4)

In (3.4), $z$ is the known poverty line and $I (E_{d j k} < z)$ is the indicator variable taking the value 1 when $E_{d j k}$ is smaller than $z$ and 0 otherwise. Also, the FGT measure for subarea $j$ within area $d$ is given by

F_{α d j} (z) = \frac{1}{N_{d j}} \sum_{k = 1}^{N_{d j}} F_{α d j k} .

(3.5)

4. Estimators of FGT Poverty Measures

In this section, we describe how to estimate FGT poverty measures (3.3) and (3.5) for areas and subareas, respectively. Suppose that there is a one-to-one transformation $y_{d j k} = log (E_{d j k})$ of the welfare variables, assuming $E_{d j k} > 0$ for all the population units. Then we can express $F_{α d j k}$ given by (3.4) in terms of $y_{d j k}$ :

F_{α d j k} = {(\frac{z - exp (y_{d j k})}{z})}^{α} I (exp (y_{d j k}) < z) : = h_{α} (y_{d j k}) .

4.1. ELL Method

In this section, we extend the ELL method under the one-fold nested error model to the two-fold nested error model (3.1), which has an area level random effect term, a subarea level random effect term, and a unit level error term. The proposed extension of the ELL method to two-fold models consists of drawing from the estimated area, subarea and unit level residuals to create a simulated census. The steps of the ELL method can be summarized as follows:

Estimate $β$ from the nested error model given by (3.2) and obtain unit level residuals ${\hat{r}}_{d j k} = y_{d j k} - x_{d j k}^{T} \hat{β}$ where $\hat{β}$ denotes the OLS estimator of $β$ .

The area effect $u_{d}$ , the subarea effect $v_{d j}$ , and the unit level errors $e_{d j k}$ are estimated as

{\hat{u}}_{d} = \frac{1}{n_{d}} \sum_{j = 1}^{m_{d}} \sum_{k = 1}^{n_{d j}} {\hat{r}}_{d j k},

{\hat{v}}_{d j} = \frac{1}{n_{d j}} \sum_{k = 1}^{n_{d j}} {\hat{r}}_{d j k} - {\hat{u}}_{d},

and

{\hat{e}}_{d j k} = {\hat{r}}_{d j k} - \frac{1}{n_{d j}} \sum_{k = 1}^{n_{d j}} {\hat{r}}_{d j k} .

Draw ${\hat{β}}^{* (b)}, u_{d}^{* (b)}, v_{d j}^{* (b)}$ , and $e_{d j k}^{* (b)}, b = 1, \dots, B$ from $N (\hat{β}, {\hat{V}}_{β})$ , the empirical distribution of ${\hat{u}}_{d}$ , the empirical distribution of ${\hat{v}}_{d j}$ , and the empirical distribution of ${\hat{e}}_{d j k}$ , respectively. Also, ${\hat{V}}_{β}$ denotes the estimated covariance of $\hat{β}$ .

Construct $B$ simulated census values $\{y_{d j k}^{* (b)}; k = 1, \dots, N_{d j}, j = 1, \dots, M_{d}, d = 1, \dots, D\}$ as follows: $y_{d j k}^{* (b)} = x_{d j k}^{T} {\hat{β}}^{* (b)} + u_{d}^{* (b)} + v_{d j}^{* (b)} + e_{d j k}^{* (b)}$ , using the census values of the covariates.

Population measures $F_{α d}^{* (b)} = \frac{1}{N_{d}} \sum_{j = 1}^{M_{d}} \sum_{k = 1}^{N_{d j}} F_{α d j k}^{* (b)}$ and $F_{α d j}^{* (b)} = \frac{1}{N_{d j}} \sum_{k = 1}^{N_{d j}} F_{α d j k}^{* (b)}$ are calculated from each simulated census $b$ , where $F_{α d j k}^{* (b)} = h_{α} (y_{d j k}^{* (b)}), b = 1, \dots, B$ .

The ELL estimators of $F_{α d}$ and $F_{α d j}$ are calculated by averaging over the $B$ simulated measures as follows:

{\hat{F}}_{α d}^{ELL} = \frac{1}{B} \sum_{b = 1}^{B} F_{α d}^{* (b)}

and

{\hat{F}}_{α d j}^{ELL} = \frac{1}{B} \sum_{b = 1}^{B} F_{α d j}^{* (b)} .

The estimators ${\hat{u}}_{d}$ and ${\hat{v}}_{d j}$ are similar to those obtained under an analysis of covariance model treating $u_{d}$ and $v_{d j}$ as fixed and imposing side conditions on $u_{d}$ and $v_{d j}$ to ensure estimability, as done in the case of the one-fold model studied in section 2.1.
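The residual decomposition used in the second step above can be sketched as follows. The input is assumed to be the sample residuals ${\hat{r}}_{d j k}$ from an OLS fit done elsewhere, stored in a nested dictionary; that data layout and the function name `decompose_two_fold` are illustrative choices, not part of the original method description.

```python
def decompose_two_fold(resid):
    """
    resid: dict area d -> dict subarea j -> list of sample residuals r_djk.
    Returns (u_hat, v_hat, e_hat) with
      u_hat[d]      = mean of r_djk over all n_d sampled units in area d
      v_hat[(d, j)] = subarea mean of r_djk minus u_hat[d]
      e_hat[(d, j)] = list of r_djk minus the subarea mean
    """
    u_hat, v_hat, e_hat = {}, {}, {}
    for d, subareas in resid.items():
        all_r = [r for rs in subareas.values() for r in rs]
        u_hat[d] = sum(all_r) / len(all_r)
        for j, rs in subareas.items():
            sub_mean = sum(rs) / len(rs)
            v_hat[(d, j)] = sub_mean - u_hat[d]
            e_hat[(d, j)] = [r - sub_mean for r in rs]
    return u_hat, v_hat, e_hat

# One area with two sampled subareas of two units each.
u_hat, v_hat, e_hat = decompose_two_fold({1: {1: [1.0, 3.0], 2: [5.0, 7.0]}})
```

By construction each residual is recovered as ${\hat{r}}_{d j k} = {\hat{u}}_{d} + {\hat{v}}_{d j} + {\hat{e}}_{d j k}$, and the ${\hat{e}}_{d j k}$ sum to zero within every sampled subarea.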

4.2. Modified ELL Methods

Method 1. This modification retains ${\hat{u}}_{d}$ in constructing the predictors $y_{d j k}^{* (b)}$ , unlike the use of $u_{d}^{* (b)}$ in the ELL method. We have the following modified ELL method:

From the nested error model given by (3.2), estimate the fixed effects $β$ using OLS.

Estimate $u_{d}, v_{d j}$ , and $e_{d j k}$ as in the traditional ELL method.

Draw $v_{d j}^{* (b)}$ and $e_{d j k}^{* (b)}, b = 1, \dots, B$ from the empirical distributions of ${\hat{v}}_{d j}$ and ${\hat{e}}_{d j k}$ , respectively.

Construct $B$ simulated census values $\{y_{d j k}^{* (b)}; k = 1, \dots, N_{d j}, j = 1, \dots, M_{d}, d = 1, \dots, D\}$ as follows:

y_{d j k}^{* (b)} = x_{d j k}^{T} \hat{β} + {\hat{u}}_{d} + v_{d j}^{* (b)} + e_{d j k}^{* (b)} .

Then, the simulated population measures $F_{α d}^{* (b)}$ and $F_{α d j}^{* (b)}$ are calculated as in the traditional ELL method from each simulated census $b$ . The modified ELL estimators of $F_{α d}$ and $F_{α d j}$ , denoted by ${\hat{F}}_{α d}^{MELL 1}$ and ${\hat{F}}_{α d j}^{MELL 1}$ , respectively, are as follows:

{\hat{F}}_{α d}^{MELL 1} = \frac{1}{B} \sum_{b = 1}^{B} F_{α d}^{* (b)}

and

{\hat{F}}_{α d j}^{MELL 1} = \frac{1}{B} \sum_{b = 1}^{B} F_{α d j}^{* (b)} .

Method 2. This modification retains ${\hat{u}}_{d}$ and ${\hat{v}}_{d j}$ , for $j \in s_{d}$ , and uses $v_{d j}^{* (b)}$ for subarea $j$ not sampled from area $d$ in constructing the predictors $y_{d j k}^{* (b)}$ . Then, the modification is as follows:

From the nested error model given by (3.2), estimate the fixed effects $β$ using OLS.

Estimate $u_{d}, v_{d j}$ , and $e_{d j k}$ as in the traditional ELL method.

Draw $e_{d j k}^{* (b)}, b = 1, \dots, B$ from the empirical distribution of ${\hat{e}}_{d j k}$ .

Construct $B$ simulated census values $y_{d j k}^{* (b)}$ for the units in the sampled subareas as

y_{d j k}^{* (b)} = x_{d j k}^{T} \hat{β} + {\hat{u}}_{d} + {\hat{v}}_{d j} + e_{d j k}^{* (b)}

and for subareas that are not sampled $y_{d j k}^{* (b)}$ are generated from

y_{d j k}^{* (b)} = x_{d j k}^{T} \hat{β} + {\hat{u}}_{d} + v_{d j}^{* (b)} + e_{d j k}^{* (b)},

where $v_{d j}^{* (b)}, b = 1, \dots, B$, are drawn from the empirical distribution of ${\hat{v}}_{d j}$.

Then, the simulated population measures $F_{α d}^{* (b)}$ and $F_{α d j}^{* (b)}$ are calculated as in the traditional ELL method from each simulated census $b$ , and the second modified ELL estimators of $F_{α d}$ and $F_{α d j}$ , denoted by ${\hat{F}}_{α d}^{MELL 2}$ and ${\hat{F}}_{α d j}^{MELL 2}$ , respectively, are as follows:

{\hat{F}}_{α d}^{MELL 2} = \frac{1}{B} \sum_{b = 1}^{B} F_{α d}^{* (b)}

and

{\hat{F}}_{α d j}^{MELL 2} = \frac{1}{B} \sum_{b = 1}^{B} F_{α d j}^{* (b)} .

In the special case of area mean ${\bar{Y}}_{d}$, its ELL estimator is approximately equal to the synthetic estimator ${\bar{X}}_{d}^{T} \hat{β}$, as noted in Marhuenda et al.^[10] On the other hand, both MELL1 and MELL2 estimators for a sampled area are approximately equal to the sample regression estimator ${\bar{y}}_{d} + {({\bar{X}}_{d} - {\bar{x}}_{d})}^{T} \hat{β}$. For a non-sampled area, both reduce to the synthetic estimator.

Turning to a subarea mean ${\bar{Y}}_{d j}$ , MELL1 is approximately equal to ${\bar{X}}_{d j}^{T} \hat{β} + ({\bar{y}}_{d} - {\bar{x}}_{d}^{T} \hat{β})$ for all subareas within a sampled area. On the other hand, MELL2 is approximately equal to the subarea level sample regression estimator ${\bar{y}}_{d j} + {({\bar{X}}_{d j} - {\bar{x}}_{d j})}^{T} \hat{β}$ for a sampled subarea. For a non-sampled subarea within a sampled area, MELL2 is equal to MELL1. For subareas within a non-sampled area, both MELL1 and MELL2 estimators are approximately equal to the synthetic estimator ${\bar{X}}_{d j}^{T} \hat{β}$ . On the other hand, ELL estimator for all subareas is approximately equal to the synthetic estimator ${\bar{X}}_{d j}^{T} \hat{β}$ .
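The only difference between the two modifications is how the subarea effect enters the simulated census, and a minimal sketch for a single subarea within a sampled area makes this explicit. The function name `subarea_fgt` and its argument layout are hypothetical; the inputs are assumed to be the census linear predictors $x_{d j k}^{T} \hat{β}$ for the subarea, the retained ${\hat{u}}_{d}$, and pools of estimated subarea effects and unit residuals computed beforehand.

```python
import math
import random

def subarea_fgt(xb, u_hat_d, e_pool, z, alpha, v_pool=None, v_hat_dj=None, B=50, seed=0):
    """
    xb:        census linear predictors x_djk' beta_hat for one subarea
    u_hat_d:   retained area effect (used by both MELL1 and MELL2)
    v_hat_dj:  retained subarea effect; supply it only for MELL2 in a sampled subarea
    v_pool:    estimated subarea effects to resample from in all other cases
    e_pool:    estimated unit-level residuals to resample from
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(B):
        # One subarea effect per subarea and replicate: fixed (MELL2) or redrawn.
        v = v_hat_dj if v_hat_dj is not None else rng.choice(v_pool)
        f = 0.0
        for lin in xb:
            e = math.exp(lin + u_hat_d + v + rng.choice(e_pool))
            if e < z:
                f += ((z - e) / z) ** alpha
        total += f / len(xb)
    return total / B
```

For MELL1, or for a non-sampled subarea under MELL2, the function is called with `v_pool` only. Area-level estimates then follow by averaging the subarea values with weights $N_{d j} / N_{d}$, in line with (3.3) and (3.5).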

4.3. EB Method

Marhuenda et al.^[10] studied EB estimation of FGT poverty measures for area and subareas under the two-fold nested error model (3.1), assuming the population model holds for the sample. This leads to the sample model (3.2). Unlike the ELL method, they assume normality of the area effects $u_{d}$ , subarea effects $v_{d j}$ and unit errors $e_{d j k}$ . The model on the unit error $e_{d j k}$ permits unequal variance $σ_{e}^{2} / w_{d j k}$ with known heteroscedasticity weight $w_{d j k}$ . Under the above set-up, they extended the EB method of Molina and Rao^[11] to derive EB estimators of FGT measures for areas and both sampled and non-sampled subareas within an area. This EB method is applicable only to additive measures like the FGT measure. They also conducted an extensive simulation study and compared the EB estimators for subareas under the two-fold model with those obtained under models with only subarea effects, when all subareas are sampled or not sampled. Here we consider the special case of $w_{d j k} = 1$ and denote the EB estimator of Marhuenda et al.^[10] as EB2.

5. Simulation Study

We conducted a simulation study to examine the performance of the two modified ELL methods under the two-fold nested error linear regression model (3.1). Marhuenda et al.^[10] conducted a simulation study on the performance of EB estimators of FGT measures for areas and subareas under a two-fold nested error model assuming $u_{d}, v_{d j}$ and $e_{d j k}$ are normally distributed. We follow their simulation set-up but also consider a skew normal scenario with $(u_{d}, v_{d j})$ normal (N) and $e_{d j k}$ skew normal (SN). Marhuenda et al.^[10] made an extensive study of the two-fold model in the normal case, by fixing $σ_{e}$ and considering two marginal scenarios for $(σ_{u}, σ_{v})$: (a) $σ_{u}$ fixed and $σ_{v}$ varied; (b) $σ_{v}$ fixed and $σ_{u}$ varied. The range of values considered included $σ_{u} = 0$, corresponding to a model with subarea effects only, and $σ_{v} = 0$, corresponding to a model with area effects only. Their results suggested that assuming the two-fold model when the correct model involves only an area random effect or only a subarea random effect may not lead to significant loss in efficiency. We also included the case of $(u_{d}, v_{d j}, e_{d j k})$ normal (N) studied by Marhuenda et al.^[10].

We set $σ_{e} = 0.50$ as in Marhuenda et al.^[10], and consider two scenarios for setting the values of $σ_{u}$ and $σ_{v}$ : (a) Between area variation $σ_{u} = 0.5$ is larger than the between subarea variation $σ_{v} = 0.25$ ; (b) $σ_{u} = 0.25$ is smaller than $σ_{v} = 0.5$ . We also considered the case of smaller $σ_{u}$ and $σ_{v}$ to reflect more accurate covariates in the model: (c) $σ_{u} = 0.1$ and $σ_{v} = 0.2$ .

We generated $I = 1{,}000$ populations each of size $N = 20{,}000$ composed of $D = 40$ areas each containing $M_{d} = 10$ subareas each containing $N_{d j} = 50$ units. As in Marhuenda et al.^[10], all the areas are sampled. We first generated the covariate vector $x_{d j k} = {(1, x_{1 d j k}, x_{2 d j k})}^{'}$ for each population unit, based on $x_{1 d j k} \sim B (1, p_{1 d j})$ and $x_{2 d j k} \sim B (1, p_{2 d j})$ with probabilities $p_{1 d j} = 0.2 + \frac{0.4 d}{D} + \frac{0.4 j}{M_{d}}$ and $p_{2 d j} = 0.2, j = 1, \dots, 10, d = 1, \dots, 40$. The generated population covariate values are held fixed and used to generate the dependent variable $y_{d j k}$ from the two-fold model using $β = {(3, 0.03, - 0.04)}^{'}$ and specified distributions for $u_{d}, v_{d j}$, and $e_{d j k}$ with mean zero and standard deviations $σ_{u}, σ_{v}$ and $σ_{e}$, respectively. For the SN case, we took $u_{d} \sim N (0, σ_{u}^{2}), v_{d j} \sim N (0, σ_{v}^{2})$ and $e_{d j k} \sim SN (μ, σ^{2}, λ)$ with $μ$ and $σ$ chosen to make the mean and standard deviation of $e_{d j k}$ equal to zero and $σ_{e}$, and $λ = λ_{e} = 3$, which leads to moderate skewness. As in Marhuenda et al.^[10], we set $E_{d j k} = exp (y_{d j k})$. The above process was repeated to generate $I = 1000$ population values $\{y_{d j k}^{(i)}, i = 1, \dots, 1000\}$.
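The choice of $μ$ and $σ$ for the skew normal errors can be made explicit. The sketch below assumes the standard Azzalini parameterization of $SN (μ, σ^{2}, λ)$, for which, with $δ = λ / \sqrt{1 + λ^{2}}$, the mean is $μ + σ δ \sqrt{2 / π}$ and the variance is $σ^{2} (1 - 2 δ^{2} / π)$; whether the original study used exactly this parameterization is an assumption. The function names are hypothetical.

```python
import math
import random

def skew_normal_params(sigma_e, lam):
    """Location mu and scale sigma so that SN(mu, sigma^2, lam) has mean 0 and s.d. sigma_e."""
    delta = lam / math.sqrt(1.0 + lam * lam)
    sigma = sigma_e / math.sqrt(1.0 - 2.0 * delta * delta / math.pi)
    mu = -sigma * delta * math.sqrt(2.0 / math.pi)
    return mu, sigma, delta

def draw_skew_normal(rng, mu, sigma, delta):
    """One draw via the representation delta*|Z0| + sqrt(1 - delta^2)*Z1 with Z0, Z1 iid N(0, 1)."""
    z0 = abs(rng.gauss(0.0, 1.0))
    z1 = rng.gauss(0.0, 1.0)
    return mu + sigma * (delta * z0 + math.sqrt(1.0 - delta * delta) * z1)

# Settings from the simulation study: sigma_e = 0.5 and lambda = 3.
mu, sigma, delta = skew_normal_params(0.5, 3.0)
rng = random.Random(2024)
errors = [draw_skew_normal(rng, mu, sigma, delta) for _ in range(5)]
```

Under these formulas $σ_{e} = 0.5$ and $λ = 3$ give $σ$ of about 0.765 and $μ$ of about $-0.579$, so the centred errors have mean zero and standard deviation 0.5 while remaining right-skewed.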

We calculated the FGT measures $F_{α d}^{(i)}$ for each area and $F_{α d j}^{(i)}$ for each subarea from each of the simulated populations $i = 1, \dots, 1000$ . We focus on poverty incidence $(α = 0)$ and poverty gap $(α = 1)$ . Following Marhuenda et al.^[10], we took the poverty line as $z = 0.6 med (E_{d j k})$ for a population generated as above.

We considered two cases for generating a sample of units. In case 1, all subareas are sampled $(m_{d} = M_{d} = 10)$ by selecting $n_{d j} = 10$ units from each subarea by simple random sampling. In case 2, a simple random sample of $m_{d} = 5$ subareas is selected from each area and then a simple random sample of $n_{d j} = 20$ units is drawn from each sampled subarea. In both cases, the overall sample size within each area is equal to 100 and the number of bootstrap simulated censuses, $B$, is taken as $B = 50$.

As in Marhuenda et al.^[10], we used a model-based set up by conditioning on the selected sample of units and extracting the corresponding sample data $(y_{d j k}^{(i)}, x_{d j k})$ from each simulated population $i$ . Using the sample data, we then obtained the desired estimates for areas and subareas from the assumed two-fold model. Denoting the estimators for areas and subareas for any given method by ${\hat{F}}_{α d}$ and ${\hat{F}}_{α d j}$ respectively, we computed empirical MSEs of the estimators for areas and subareas as

MSE ({\hat{F}}_{α d}) = I^{- 1} \sum_{i = 1}^{I} {({\hat{F}}_{α d}^{(i)} - F_{α d}^{(i)})}^{2}, MSE ({\hat{F}}_{α d j}) = I^{- 1} \sum_{i = 1}^{I} {({\hat{F}}_{α d j}^{(i)} - F_{α d j}^{(i)})}^{2},

where ${\hat{F}}_{α d}^{(i)}$ and ${\hat{F}}_{α d j}^{(i)}$ denote the estimators for the simulated population $i$ .
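The empirical MSE and its average over areas, which is the quantity reported in the tables, can be computed as in the following sketch. The data layout (one list of $I$ values per area) and the function names are illustrative assumptions.

```python
def empirical_mse(estimates, truths):
    """Empirical MSE for one area: mean squared difference over the I simulated populations."""
    return sum((e - t) ** 2 for e, t in zip(estimates, truths)) / len(truths)

def average_mse(est_by_area, true_by_area):
    """Average of the area-specific empirical MSEs over all areas."""
    mses = [empirical_mse(est_by_area[d], true_by_area[d]) for d in est_by_area]
    return sum(mses) / len(mses)

# Two areas and two simulated populations (toy numbers, not study results).
est = {1: [0.2, 0.4], 2: [0.3, 0.3]}
true = {1: [0.1, 0.5], 2: [0.3, 0.3]}
avg = average_mse(est, true)
```

The same two functions apply to subareas by keying the dictionaries on the pair $(d, j)$ instead of $d$.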

5.1. $e_{d j k}$ Skew Normal

Table 1 reports results on average MSE for the areas, sampled subareas and non-sampled subareas for scenario (a): $σ_{u} = 0.5$ and $σ_{v} = 0.25$. Table 1 shows that ELL leads to very large average MSE in all cases compared to the other methods. For areas, MELL2 and MELL1 are comparable and slightly better than EB2 in terms of average MSE. For subareas when all subareas are sampled (case 1), MELL2 leads to a large reduction in average MSE over MELL1. This is to be expected because MELL1, unlike MELL2, does not retain the subarea-specific estimated effects. Also, EB2 seems to be somewhat better than MELL2 in terms of average MSE: 8.81 for EB2 vs. 11.39 for MELL2 in the case of poverty gap.

Table 1.

Average of MSEs $(\times 10^{4})$ . Case 1: all subareas are sampled; case 2: not all subareas are sampled ( $e_{d j k}$ is SN, $σ_{u} = 0.5$ and $σ_{v} = 0.25$ ).

			Estimation method
		Poverty indicator	EB2	MELL1	MELL2	ELL
Case 1	Area	inc	8.70	7.88	7.28	557.44
	Area	gap	1.28	1.30	1.17	95.33
	Subarea	inc	56.09	187.69	69.35	737.22
	Subarea	gap	8.81	33.52	11.39	127.53
Case 2	Area	inc	24.15	23.69	23.11	562.45
	Area	gap	4.22	4.27	4.17	96.46
	Sampled-subarea	inc	27.55	167.68	34.49	742.50
	Sampled-subarea	gap	3.97	29.81	5.02	128.94
	Non-sampled-subarea	inc	233.85	236.05	236.59	738.95
	Non-sampled-subarea	gap	41.77	42.39	42.43	127.65

Turning to case 2, where not all subareas are sampled, results for sampled subareas are similar to those for case 1, where all subareas are sampled. Note that the average MSE is significantly decreased for sampled subareas because the sample size in those subareas is doubled relative to case 1 ( $n_{d j} = 20$ vs. $n_{d j} = 10$ ). On the other hand, for areas the average MSE is significantly increased in case 2 because the number of sampled subareas is half that in case 1.

For non-sampled subareas (case 2), MELL1, MELL2 and EB2 are comparable in terms of average MSE. This is to be expected because for non-sampled subareas MELL1 and MELL2 are similar. Note that the average MSE is significantly larger for non-sampled subareas than for sampled subareas. Box plots (not presented here) lead to conclusions similar to those drawn from the values of average MSE.
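To make the size of these differences concrete, the Table 1 entries for poverty incidence at the subarea level can be expressed as MSE ratios relative to MELL2. The snippet below only rearranges numbers already reported in Table 1; the dictionary and function names are illustrative.

```python
# Average MSE (x 10^4) for poverty incidence, copied from Table 1.
table1_inc = {
    "case 1, subarea": {"EB2": 56.09, "MELL1": 187.69, "MELL2": 69.35, "ELL": 737.22},
    "case 2, sampled subarea": {"EB2": 27.55, "MELL1": 167.68, "MELL2": 34.49, "ELL": 742.50},
}

def mse_ratios(row, baseline="MELL2"):
    """Ratio of each method's average MSE to that of the baseline method."""
    return {method: value / row[baseline] for method, value in row.items()}

for label, row in table1_inc.items():
    ratios = mse_ratios(row)
    print(label, {method: round(r, 2) for method, r in ratios.items()})
```

A ratio above 1 means the method has a larger average MSE than MELL2; a ratio below 1 means it has a smaller one.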

Table 2.

Average of MSEs $(\times 10^{4})$ . Case 1: all subareas are sampled; case 2: not all subareas are sampled ( $e_{d j k}$ is SN, $σ_{u} = 0.25$ and $σ_{v} = 0.5$ ).

			Estimation method
		Poverty indicator	EB2	MELL1	MELL2	ELL
Case 1	Area	inc	6.75	14.34	6.91	185.96
	Area	gap	1.11	3.33	1.14	29.96
	Subarea	inc	63.19	580.26	69.22	750.43
	Subarea	gap	10.27	104.62	11.55	131.26
Case 2	Area	inc	48.58	67.21	62.39	186.23
	Area	gap	8.32	11.61	10.51	30.30
	Sampled-subarea	inc	28.35	530.11	34.12	756.84
	Sampled-subarea	gap	4.22	97.07	5.13	133.50
	Non-sampled-subarea	inc	679.26	740.64	742.72	752.55
	Non-sampled-subarea	gap	119.75	129.75	129.72	130.87

Table 2 reports the average MSE for scenario 2: $σ_{u} = 0.25$ and $σ_{v} = 0.5$ , corresponding to the results in Table 1 for scenario 1. Comparing the two tables, conclusions for the case where all subareas are sampled are similar for EB2 and MELL2, but the average MSE for MELL1 has significantly increased. For the case where not all subareas are sampled, average MSE values for EB2, MELL1 and MELL2 are substantially increased for areas, although the conclusions are similar. For sampled subareas, EB2 and MELL2 gave similar average MSE values. For non-sampled subareas in case 2, MELL1, MELL2 and ELL are comparable in terms of average MSE, while EB2 exhibits a slightly smaller average MSE.

Results on average MSE for scenario 3 with $σ_{u} = 0.1$ and $σ_{v} = 0.2$ are reported in Table 3. For case 1 with all subareas sampled, MELL1 and MELL2 are similar for areas in terms of average MSE, and slightly larger than the average MSE for EB2. ELL leads to a much larger average MSE for areas in both cases 1 and 2. Turning to subareas, MELL2 leads to a large reduction in average MSE relative to MELL1 and ELL, and a slightly larger average MSE relative to EB2 in both cases 1 and 2. For the non-sampled subareas in case 2, average MSE is comparable across EB2, MELL1, MELL2 and ELL. Note that scenario 3 is favorable to ELL because the area and subarea effects are small.

Table 3.

Average of MSEs $(\times 10^{4})$ . Case 1: all subareas are sampled; case 2: not all subareas are sampled ( $e_{d j k}$ is SN, $σ_{u} = 0.1$ and $σ_{v} = 0.2$ ).

			Estimation method
		Poverty indicator	EB2	MELL1	MELL2	ELL
Case 1	Area	inc	6.96	7.72	7.23	41.38
	Area	gap	0.84	0.55	0.54	2.34
	Subarea	inc	52.68	140.62	72.12	174.22
	Subarea	gap	3.56	8.59	5.05	10.38
Case 2	Area	inc	14.23	18.51	17.98	42.06
	Area	gap	1.28	1.15	1.11	2.36
	Sampled-subarea	inc	26.64	128.38	38.27	175.37
	Sampled-subarea	gap	1.77	7.91	2.48	10.47
	Non-sampled-subarea	inc	159.32	172.61	172.81	173.06
	Non-sampled-subarea	gap	10.23	10.35	10.32	10.99

5.2. $(u_{d}, v_{d j}, e_{d j k})$ Normal

We also considered the case where $u_{d}, v_{d j}$ and $e_{d j k}$ are normally distributed. Marhuenda et al.^[10] studied this case in the context of EB2. Table 4 gives the average MSE under scenario 1. Table 4 indicates that the results on MSE for areas, sampled subareas and non-sampled subareas are similar to the skew normal results in Section 5.1. We also note that EB2 leads to a substantial reduction in average MSE over MELL2 for subareas in case 1, where all subareas are sampled: 49.77 vs. 64.30 for incidence and 8.84 vs. 11.38 for gap. This is to be expected because EB2 is optimal under normality. Table 5 reports average MSE for the normal case under scenario 3. Results are similar to those in Table 4.

Table 4.

Average of MSEs $(\times 10^{4})$ . Case 1: all subareas are sampled; case 2: not all subareas are sampled ( $u_{d}, v_{d j}$ and $e_{d j k}$ all normal, $σ_{u} = 0.5$ and $σ_{v} = 0.25$ ).

			Estimation method
		Poverty indicator	EB2	MELL1	MELL2	ELL
Case 1	Area	inc	5.87	6.82	6.42	508.88
	Area	gap	1.07	1.23	1.14	93.07
	Subarea	inc	49.77	168.60	64.30	670.71
	Subarea	gap	8.84	32.14	11.38	123.98
Case 2	Area	inc	19.92	21.73	21.32	514.96
	Area	gap	3.81	4.11	4.03	94.63
	Sampled-subarea	inc	24.62	151.31	31.31	677.23
	Sampled-subarea	gap	4.11	29.04	4.83	126.31
	Non-sampled-subarea	inc	209.02	214.87	215.08	675.51
	Non-sampled-subarea	gap	40.22	41.28	41.32	125.06

Table 5.

Average of MSEs $(\times 10^{4})$ . Case 1: all subareas are sampled; case 2: not all subareas are sampled ( $u_{d}, v_{d j}$ and $e_{d j k}$ all normal, $σ_{u} = 0.1$ and $σ_{v} = 0.2$ ).

			Estimation method
		Poverty indicator	EB2	MELL1	MELL2	ELL
Case 1	Area	inc	5.49	6.83	6.69	35.06
	Area	gap	0.50	0.64	0.62	2.30
	Subarea	inc	47.18	121.03	66.43	149.24
	Subarea	gap	4.28	10.80	6.14	13.14
Case 2	Area	inc	11.65	16.06	15.83	35.50
	Area	gap	1.02	1.40	1.37	3.02
	Sampled-subarea	inc	25.48	108.81	33.70	149.06
	Sampled-subarea	gap	2.30	9.80	3.02	13.18
	Non-sampled-subarea	inc	135.97	149.43	149.35	148.17
	Non-sampled-subarea	gap	12.02	13.15	13.12	13.03

6. MSE Estimation

In the ELL method for the one-fold model, the variability of the simulated census measures is taken as the estimator of MSE of the ELL estimator. Similarly, under the two-fold model the corresponding MSE estimators of ELL for areas and subareas are given by

MSE ({\hat{F}}_{α d}^{ELL}) = B^{- 1} \sum_{b = 1}^{B} {(F_{α d}^{* (b)} - {\hat{F}}_{α d}^{ELL})}^{2}

(6.1)

and

MSE ({\hat{F}}_{α d j}^{ELL}) = B^{- 1} \sum_{b = 1}^{B} {(F_{α d j}^{* (b)} - {\hat{F}}_{α d j}^{ELL})}^{2} .

(6.2)

MSE estimators similar to (6.1) and (6.2) are applicable to MELL1 and MELL2, using simulated census measures. The proposed MSE estimators are simple, but they can lead to significant underestimation of the true MSE because the model parameters and the random effects in the model are not re-estimated in each replicate from the replicated sample data $(y_{d j k}^{* (b)}, x_{d j k})$ .
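The estimators (6.1) and (6.2) amount to the spread of the $B$ simulated census measures around the point estimate. A minimal sketch, assuming (as in the usual ELL construction) that the point estimate is the average of the simulated census measures; the function name and array layout are illustrative:

```python
import numpy as np

def ell_mse_estimate(census_measures):
    """Naive ELL-type MSE estimate per area from B simulated census measures.

    census_measures: array-like of shape (B, D) holding F*^(b) for each
    bootstrap census b and each area (or subarea).
    """
    F_star = np.asarray(census_measures, dtype=float)
    # Assumption: the ELL point estimate is the mean over the B censuses.
    F_hat = F_star.mean(axis=0)
    # Divisor B, matching the B^{-1} factor in (6.1) and (6.2).
    return np.mean((F_star - F_hat) ** 2, axis=0)

# Toy example with B = 4 censuses and D = 2 areas (made-up numbers).
F_star = [[0.10, 0.30], [0.30, 0.30], [0.10, 0.50], [0.30, 0.50]]
mse_hat = ell_mse_estimate(F_star)
```

Because nothing is re-estimated inside the loop over $b$, this quantity reflects only the variability of the simulated censuses, which is consistent with the underestimation noted above.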

Marhuenda et al.^[10] proposed a proper parametric bootstrap MSE estimator for EB2 estimators, based on re-estimating model parameters and random effects in the two-fold model under normality. A similar procedure may be developed for ELL and MELL using a distribution-free bootstrap, as in the ELL method.

7. Concluding Remarks

We considered the estimation of FGT poverty measures under a two-fold nested error model. We developed extensions of the ELL method and the modified ELL method of Diallo and Rao^[4] to two-fold models. The methods are free of parametric distributional assumptions on the random effects in the two-fold model. Our simulation results indicate that the proposed modified ELL methods lead to large efficiency gains over ELL for both areas and subareas. Further, MELL2 leads to a significant reduction in MSE over MELL1 for sampled subareas, and it is comparable to the EB2 method of Marhuenda et al.^[10] under the normality assumption. Our simulation study is somewhat limited. It would be desirable to conduct a more extensive simulation study with different parameter combinations, as well as design-based simulations, as in Marhuenda et al.^[10]. Bootstrap MSE estimation for MELL methods, along the lines of Marhuenda et al.^[10] but without the normality assumption, needs a detailed investigation.

The EB2 method of Marhuenda et al.^[10] is applicable to additive functions like the FGT measures, and its extensions to more complex parameters remain to be investigated. On the other hand, as noted earlier, MELL2 is readily applicable to general parameters, not necessarily additive in the individual values like the FGT measures.

Footnotes

Acknowledgements

We thank two referees for several constructive comments and suggestions.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: We would like to express our sincere gratitude to Statistics Canada and the Natural Sciences and Engineering Research Council of Canada (NSERC) for their valuable support in funding our research.

References

1. Battese, Harter, Fuller WA. An error-components model for prediction of county crop areas using survey and satellite data. J Amer Stat Assoc 1988; 83: 28–36.

2. Corral-Rodas, Molina, Nguyen. Pull your small area estimates up by the bootstraps. J Stat Comput Simul 2021; 91: 3304–3357.

3. Diallo MS. Small area estimation under Skew-Normal nested error model. PhD Thesis, Carleton University, Ottawa, Canada, 2014.

4. Diallo, Rao JNK. Small area estimation of complex parameters under unit-level models with skew-normal errors. Scandinavian J Stat 2018; 45: 1092–1116.

5. Elbers, Lanjouw. Welfare in villages and towns: micro-level estimation of poverty and inequality. Unpublished manuscript, The World Bank, 2001.

6. Foster, Greer, Thorbecke. A class of decomposable poverty measures. Econometrica 1984; 52: 761–766.

7. Graf, Martin, Molina. A generalized mixed model for skewed distributions applied to small area estimation. Test 2019; 28: 565–597.

8. Guadarrama, Molina, Rao JNK. A comparison of small area estimation methods for poverty mapping. Stat Transit New Ser Surv Meth Joint Issue 2014; 17: 41–66.

9. Hossain, Das, Chandra, Islam MA. Disaggregate level estimates and spatial mapping of food insecurity in Bangladesh by linking survey and census data. PLoS One 2020; 15: 1–16.

10. Marhuenda, Molina, Morales, Rao JNK. Poverty mapping in small areas under a twofold nested error regression model. J Royal Stat Soc: Series A 2017; 180: 1111–1136.

11. Molina, Rao JNK. Small area estimation of poverty indicators. Canadian J Stat 2010; 38: 369–385.

12. Neri, Ballini, Betti. Poverty and inequality mapping in transition countries. Stat in Transit 2005; 7: 135–157.

13. Rao JNK, Molina. Small area estimation (2nd ed.). Hoboken, NJ: Wiley 2015.
