Intrastratum Counterbalancing: A Procedure to Improve the Precision of Stratified Mean-Per-Unit Estimates in Accounting and Auditing

Abstract

In accounting and auditing, critical monetary estimation applications based on sampling often rely on stratified mean-per-unit (SMPU) sampling due to the technique’s unique ability to produce trustworthy confidence intervals when applied to low-error populations. Although reliable, SMPU yields less precise confidence intervals than other estimation techniques. To address this weakness, we present and test a data transformation procedure for use with SMPU: Intrastratum Counterbalancing. After strata boundaries are finalized but prior to sample selection, this procedure creates new sampling units in each stratum by counterbalancing original stratum population elements one or more times. As a result, both the number of strata sampling units and the strata standard deviations are reduced. These changes reduce the SMPU estimator standard deviation, thereby improving precision. In simulations using real accounting populations, counterbalancing produced large improvements in precision. Compared to SMPU without counterbalancing, one application of counterbalancing produced a 52% improvement in mean precision while a second application produced a 73% improvement in mean precision. These improvements were achieved with no change in the number of original population elements examined and suggest that use of Intrastratum Counterbalancing would be beneficial in many SMPU estimation applications. Equations are provided for planning Intrastratum Counterbalancing applications.

Keywords

stratified sampling, mean-per-unit estimation, estimation precision, counterbalancing, audit sampling

Introduction

Statistical sampling has long been recognized as a useful tool for accountants and auditors and is used in both hypothesis testing and monetary estimation applications (American Institute of Certified Public Accountants [AICPA], 2019, §2.38; Arkin, 1963; Backer & Fertig, 1958; Cyert & Davidson, 1962; Hill et al., 1962; Vance & Neter, 1957). Despite technological advances in accounting systems and audit practice (Kogan et al., 2019; Salijeni et al., 2019), and the resulting ability to programmatically perform many accounting and auditing activities, statistical sampling remains a useful tool (Hall et al., 2024; Lowe et al., 2018; Public Company Accounting Oversight Board [PCAOB], 2020).¹ Given this fact, both the Uniform CPA Examination and the Certified Internal Auditor Examination include sampling-related questions (AICPA, 2024; Institute of Internal Auditors [IIA], 2025).

For critical monetary estimation applications based on sampling, accountants and auditors typically rely on one of four statistical techniques: difference, ratio, regression, or stratified mean-per-unit (SMPU). If the population in question is expected to contain significant monetary error, a difference, ratio, or regression estimator will be used because these estimators yield better precision than SMPU (Neter & Loebbecke, 1975). However, these estimators tend to produce untrustworthy confidence intervals when few monetary errors are found (Duke et al., 1985; Loebbecke & Neter, 1975; Neter & Loebbecke, 1975).² In contrast, tests conducted by Neter and Loebbecke (1975) found that SMPU produces reliable confidence intervals over a wide range of error conditions, including low-error conditions. SMPU’s ability to produce reliable confidence intervals for low-error populations and the fact that most accounting and auditing populations exhibit low error (Allen & Elder, 2005; Durney et al., 2014) explain the common use of SMPU in critical monetary estimation applications.

Given the common use of SMPU in critical monetary estimation applications and the technique’s inferior precision relative to other techniques (Kaplan, 1973; Neter & Loebbecke, 1975), a procedure for improving the precision of SMPU estimates is needed. To address this need, we present and test a data transformation procedure for use with SMPU: Intrastratum Counterbalancing. After strata boundaries are finalized but prior to sample selection, this procedure creates new sampling units in each stratum by counterbalancing original stratum population elements one or more times. As a result, both the number of strata sampling units and the strata standard deviations in the transformed data set are reduced relative to the original data set. These changes reduce the SMPU estimator standard deviation, thereby improving precision. In simulations using real accounting populations seeded with low error and a fixed sample size measured in original population elements (hereafter real sample size), counterbalancing produced large improvements in precision. Relative to SMPU without counterbalancing, one application of counterbalancing produced a 52% improvement in mean precision while a second application produced a 73% improvement in mean precision.

Our study contributes to the accounting and auditing literatures by presenting Intrastratum Counterbalancing and documenting the procedure’s ability to produce large improvements in SMPU precision. We also present equations for estimating SMPU precision with, and without, the use of counterbalancing. These equations should be helpful in deciding when to use counterbalancing. Our results should be of interest to accounting and auditing professionals who plan and execute SMPU monetary estimation applications and standard setting authorities responsible for providing guidance on sampling matters.

Our presentation proceeds as follows. The next section provides a definition of critical monetary estimation applications and identifies examples of these applications. This is followed by a review of SMPU mechanics and an explanation of Intrastratum Counterbalancing mechanics and the procedure’s predicted effects on SMPU precision. We next present study hypotheses, details of the study methodology, and findings. The closing section presents a discussion of study results. Equations for projecting SMPU precision with, and without, counterbalancing are presented in the Appendix.

Critical Monetary Estimation Applications Defined

In accounting and auditing, a SMPU procedure may be executed as part of a hypothesis test or a monetary estimation application (AICPA, 2019, §2.38). With a hypothesis test, the SMPU procedure is executed to help determine whether the total recorded value of a population is materially correct.³ Assurance for the hypothesis test is typically derived from multiple sources of evidence including assessments of inherent risk, tests of internal control, analytical procedures, and the sample result (AICPA, 2019, §2.06; PCAOB, 2025, ¶.19). In contrast, a monetary estimation application is executed to determine a population’s total correct value. These applications are performed when: (1) the auditor has decided to reject a hypothesis that the recorded value is materially correct or (2) there is no recorded value and one is needed. When a monetary estimation application is executed, all of the assurance for the test is derived from the sample result.

Monetary estimation applications may be classified as critical or noncritical. When the failure to reach a correct decision creates substantial legal, monetary, regulatory, or reputational risk, the estimation application is considered critical. Critical tests require more evidence and a higher level of assurance than noncritical tests. As a result, sample sizes for critical SMPU applications are substantially larger than sample sizes for noncritical SMPU applications.

This study is concerned with improving the precision of critical SMPU monetary estimation applications. These applications are routinely performed by external auditors, corporate accountants, and governmental auditors. For example, in a financial statement audit, after conducting various tests, an external auditor may decide to reject a hypothesis that the recorded value of a major asset is materially correct. In this circumstance, the external auditor may proceed to estimate the correct total value of the asset via SMPU. Outside of financial statement audits, external auditors conduct critical SMPU applications in agreed-upon procedures engagements and forensic audits. Corporate accountants perform critical SMPU applications in a variety of circumstances including cost segregation studies and estimating the basis of stock acquired in a tax-free stock-for-stock exchange. Governmental auditors perform critical SMPU applications when performing sales tax audits and audits of amounts reported on federal and state tax returns.

SMPU Mechanics

SMPU is commonly used in applications where the total recorded value ( $X = \sum x_{i}$ ) of a population is known, the auditor or accountant wants to estimate the total correct value of the population ( $Y = \sum y_{i}$ ), and the population is expected to exhibit little error. Typically, the application is designed to control sampling risk while yielding a desired magnitude of estimation precision in monetary units. For a given level of sampling risk and number of strata, estimation precision is principally managed by modifying the sample size.⁴ Due to its relative inefficiency, SMPU requires a larger sample size than other estimation techniques to achieve the same precision.

As a first step in the estimation process, individually significant members of the population are identified, placed in a census stratum, and fully examined (AICPA, 2019; Guy et al., 2002; PCAOB, 2025, ¶.21). This practice ensures that individually significant items are always examined and yields several beneficial effects including: (1) improving estimator precision by reducing the estimator variance and (2) enhancing confidence interval reliability by reducing skewness in the sampled population. An individually significant member may be identified based on its recorded value or other risk-related characteristics. Because all members of the census stratum are examined, results for this stratum do not affect SMPU precision. In hypothesis testing applications where overstatement error is the primary concern, auditors may also remove (trim) very small items from the sampling field as these items present no risk of material overstatement. However, in estimation applications where the objective is to estimate the total correct value of a monetary amount, very small items are not typically removed (trimmed) from the sampling field.

After creation of the census stratum, the remaining $N$ members of the population are grouped into sampling strata. Due to their availability and high correlation with correct values in low-error populations, individual recorded values ( $x_{i}$ ) are often used as the basis for stratification (Guy et al., 2002; McGrath, 2003; Yancey, 2002).⁵ Typically, population members subject to sampling are sorted in ascending order based on recorded value { $x_{i} : i = 1, 2, \dots, N$ } and divided into $L$ mutually exclusive sampling strata { $A_{h} : h = 1, 2, \dots, L$ } with corresponding strata population counts { $N_{h} : h = 1, 2, \dots, L$ }, strata recorded values { $x_{h_{i}} : h = 1, 2, \dots, L; i = 1, 2, \dots, N_{h}$ }, and strata correct values { $y_{h_{i}} : h = 1, 2, \dots, L; i = 1, 2, \dots, N_{h}$ }, respectively. Once strata boundaries are finalized, strata recorded value standard deviations { $s_{h_{x}} : h = 1, 2, \dots, L$ } are computed and used as estimates of strata correct value standard deviations { $s_{h_{y}} : h = 1, 2, \dots, L$ } for sample planning purposes. In each sampling stratum, an independent simple random sample of size $n_{h}$ ( $h = 1, 2, \dots, L$ ) is selected and examined. When SMPU is used without counterbalancing (hereafter conventional SMPU), individual population members serve as stratum sampling units.

Specification of a minimum stratum sample size ( $m$ ) is common in practice to help ensure reliable estimates of strata variances (AICPA, 2019; Wilburn, 1984; Yancey, 2002). Allocation of the total sample size ( $n = \sum n_{h}$ ) to the sampling strata is typically based on equation (1) because Neyman (1934) showed that this allocation minimizes the estimator variance for a given total sample size.⁶

$$n_{h} = n \frac{N_{h} s_{h_{x}}}{\sum_{h} N_{h} s_{h_{x}}} \tag{1}$$
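The allocation in equation (1) can be sketched in a few lines of Python. This is a minimal illustration only: the function name, the rounding rule, and the way the minimum stratum sample size $m$ is enforced are assumptions made here, and the stratum figures in the example are hypothetical.

```python
def neyman_allocation(total_n, stratum_sizes, stratum_sds, min_per_stratum=0):
    """Allocate total_n across strata in proportion to N_h * s_hx (equation 1).

    Rounding to the nearest integer and the handling of the minimum stratum
    sample size are assumptions of this sketch, so the allocated sizes may
    not sum exactly to total_n.
    """
    weights = [N * s for N, s in zip(stratum_sizes, stratum_sds)]
    weight_sum = sum(weights)
    allocation = []
    for N, w in zip(stratum_sizes, weights):
        n_h = round(total_n * w / weight_sum)  # proportional share of n
        n_h = max(n_h, min_per_stratum)        # enforce the minimum m
        n_h = min(n_h, N)                      # cannot sample more than N_h
        allocation.append(n_h)
    return allocation


# Hypothetical strata: population counts and recorded-value standard deviations.
sizes = [400, 300, 200]
sds = [10.0, 25.0, 60.0]
print(neyman_allocation(100, sizes, sds))
```

In this hypothetical case the stratum with the largest product $N_{h} s_{h_{x}}$ receives the largest share of the sample even though it contains the fewest population members.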

Upon completion of the examination, an estimate of the total correct value is computed as:

$$\hat{Y} = C + \sum_{h = 1}^{L} N_{h} M_{h_{y}} \tag{2}$$

where $C$ is the total correct value of population items in the census stratum, $N_{h}$ is the number of population members in sampling stratum h, and $M_{h_{y}}$ is the mean correct value of sampling units examined in sampling stratum h. The estimator variance $V (\hat{Y})$ and standard deviation $S (\hat{Y})$ depend only on results from the sampling strata and are computed as:

$$V (\hat{Y}) = \sum_{h = 1}^{L} N_{h} (N_{h} - n_{h}) s_{h_{y}}^{2} / n_{h} \tag{3}$$

$$S (\hat{Y}) = \sqrt{\sum_{h = 1}^{L} N_{h} (N_{h} - n_{h}) s_{h_{y}}^{2} / n_{h}} \tag{4}$$

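where $s_{h_{y}}^{2}$ is the estimated variance of correct values ( $y_{h_{j}}$ ) in sampling stratum h, ${\bar{y}}_{h}$ is the mean of the correct values examined in that stratum (the quantity $M_{h_{y}}$ in equation (2)), and the variance is computed as:

$$s_{h_{y}}^{2} = \frac{1}{n_{h} - 1} \sum_{j = 1}^{n_{h}} {(y_{h_{j}} - {\bar{y}}_{h})}^{2} \tag{5}$$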

The precision ( $P$ ) of the estimate is computed as:

$$P = t S (\hat{Y}) \tag{6}$$

In equation (6), the multiplier $t$ is based on the desired confidence level and is either: (1) a standard normal deviate (z-value) when the application degrees of freedom is sufficiently large⁷ or (2) a Student’s t-value when use of a standard normal deviate is not appropriate. When a Student’s t-value is used, determining the exact degrees of freedom (df) is impractical due to mathematical complexity (Cochran, 1977). As a practical workaround, the degrees of freedom may be approximated as $d f = n - L$ (Lohr, 1999) or by use of the Satterthwaite approximation (Cochran, 1977). For the confidence levels and sample sizes commonly used in accounting and auditing, these approximating techniques yield similar df and resulting Student’s t-values. Finally, a two-sided confidence interval of width 2 $P$ is computed as:

$$\hat{Y} \pm P \tag{7}$$
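Equations (2) through (7) can be combined into a short Python sketch. The function name and input layout are assumptions made for illustration, the sample data are hypothetical, and the multiplier is taken as a standard normal deviate (a Student's t-value would be substituted when the degrees of freedom are small).

```python
import math
from statistics import NormalDist, mean, variance


def smpu_estimate(census_total, strata, confidence=0.95):
    """Compute the SMPU point estimate, precision, and confidence interval.

    strata is a list of (N_h, sampled_correct_values) pairs, one per
    sampling stratum. A z-value is used as the multiplier t (assumption).
    """
    y_hat = census_total
    estimator_variance = 0.0
    for N_h, sample in strata:
        n_h = len(sample)
        y_hat += N_h * mean(sample)                # equation (2)
        s2_hy = variance(sample)                   # equation (5), divisor n_h - 1
        estimator_variance += N_h * (N_h - n_h) * s2_hy / n_h  # equation (3)
    estimator_sd = math.sqrt(estimator_variance)   # equation (4)
    multiplier = NormalDist().inv_cdf(0.5 + confidence / 2)
    precision = multiplier * estimator_sd          # equation (6)
    interval = (y_hat - precision, y_hat + precision)  # equation (7)
    return y_hat, precision, interval


# Hypothetical data: census stratum total of 1,000 and two small sampling strata.
estimate, precision, interval = smpu_estimate(
    1000, [(10, [100, 110, 120]), (5, [200, 220])]
)
print(estimate, round(precision, 2), interval)
```

Because the census stratum is fully examined, it enters only the point estimate and contributes nothing to the estimator variance.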

Intrastratum Counterbalancing

Overview

Intrastratum Counterbalancing is a data transformation procedure applied after strata boundaries are finalized but prior to sample selection. The process of counterbalancing may be performed multiple times. While any basis for stratification may be used, a reliable proxy for correct values must be known to apply Intrastratum Counterbalancing. In accounting and auditing with low-error populations, recorded values should perform well as a proxy.

For expositional clarity, the remaining sections adopt two naming conventions. First, the subscripts cb0, cb1, cb2, cba, and cbv are used to distinguish between sampling applications using: (1) SMPU without counterbalancing (cb0 or zero-order counterbalancing), (2) SMPU with counterbalancing applied once (cb1 or first-order counterbalancing), (3) SMPU with counterbalancing applied twice (cb2 or second-order counterbalancing), (4) SMPU with counterbalancing applied $a$ times (cba or a-order counterbalancing), or (5) SMPU with counterbalancing where the level of counterbalancing may vary (cbv). Second, when the sample size ( $n$ ) is measured in original population elements, we describe it as the real sample size because this metric provides an accurate indication of the audit effort involved. Alternatively, when the sample size ( $n$ ) is measured in sampling units, we describe it as the nominal sample size because this metric may not provide an accurate indication of the audit effort involved when counterbalancing is employed. With conventional SMPU, the real sample size and the nominal sample size are identical. When SMPU is used with counterbalancing, the real sample size will be larger than the nominal sample size because each sampling unit generally comprises more than one original population element.

With conventional SMPU, the set of stratum sampling units in stratum A_h ( ${S S U}_{h^{c b 0}}$ ) consists of population elements assigned to that stratum. With first-order counterbalancing, a new set of stratum sampling units ( ${S S U}_{h^{c b 1}}$ ) is created by sorting set ${S S U}_{h^{c b 0}}$ in ascending order based on recorded value and creating two-unit clusters of counterbalanced ${S S U}_{h^{c b 0}}$ sampling units. The first sampling unit in set ${S S U}_{h^{c b 1}}$ consists of the two ${S S U}_{h^{c b 0}}$ sampling units with the largest and smallest recorded values. The second sampling unit in set ${S S U}_{h^{c b 1}}$ consists of the two ${S S U}_{h^{c b 0}}$ sampling units with the second largest and second smallest recorded values. This process continues until all members of set ${S S U}_{h^{c b 0}}$ have been placed in a two-unit cluster to form a member of ${S S U}_{h^{c b 1}}$ . When counterbalancing is applied twice, a new set of stratum sampling units ( ${S S U}_{h^{c b 2}}$ ) is created by sorting set ${S S U}_{h^{c b 1}}$ in ascending order based on recorded value and creating two-unit clusters of counterbalanced ${S S U}_{h^{c b 1}}$ sampling units. This process continues until the desired order of counterbalancing is reached.
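The pairing rule described above can be sketched in Python as follows. This is an illustrative sketch rather than a reference implementation: the function names and the tuple layout are assumptions, the recorded values are hypothetical, and the stratum is assumed to contain an even number of sampling units at every stage.

```python
def counterbalance_once(units):
    """Pair the i-th smallest unit with the i-th largest unit by recorded value.

    Each unit is a (member_ids, recorded_value) tuple, where member_ids lists
    the original population elements contained in the unit.
    """
    if len(units) % 2 != 0:
        raise ValueError("this sketch assumes an even number of sampling units")
    ordered = sorted(units, key=lambda unit: unit[1])  # ascending recorded value
    half = len(ordered) // 2
    clusters = []
    for low, high in zip(ordered[:half], reversed(ordered[half:])):
        # The new unit holds the members of both units and their summed value.
        clusters.append((low[0] + high[0], low[1] + high[1]))
    return clusters


def counterbalance(units, order):
    """Apply counterbalancing `order` times (order = 0 returns cb0 units)."""
    for _ in range(order):
        units = counterbalance_once(units)
    return units


# Hypothetical stratum of eight elements, identified 1..8.
recorded = [10, 14, 19, 23, 30, 36, 41, 47]
cb0 = [((i,), x) for i, x in enumerate(recorded, start=1)]
cb1 = counterbalance(cb0, 1)
cb2 = counterbalance(cb0, 2)
print(cb1)
print(cb2)
```

In this hypothetical stratum, each cb2 sampling unit contains four original elements, so selecting one cb2 unit adds one to the nominal sample size and four to the real sample size.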

Effect of Counterbalancing on Stratum Characteristics

Number of Stratum Sampling Units

Under ideal circumstances, prior to counterbalancing, the number of stratum sampling units in stratum A_h (i.e., $N_{h^{c b 0}}$ ) is an even number.⁸ If counterbalancing is applied once (a = 1) and $N_{h^{c b 0}}$ is an even number, each of the resulting sampling units in set ${S S U}_{h^{c b 1}}$ consists of two sampling units from ${S S U}_{h^{c b 0}}$ . After first-order counterbalancing is completed, the number of sampling units in set ${S S U}_{h^{c b 1}}$ is $N_{h^{c b 1}}$ = $N_{h^{c b 0}} / 2$ . Within stratum A_h, the set of original population elements comprising set ${S S U}_{h^{c b 0}}$ remains unchanged because stratum boundaries have not changed.

If counterbalancing is applied twice (a = 2), each of the resulting sampling units in set ${S S U}_{h^{c b 2}}$ consists of two sampling units from set ${S S U}_{h^{c b 1}}$ . After second-order counterbalancing is completed, the number of stratum sampling units in set ${S S U}_{h^{c b 2}}$ is $N_{h^{c b 2}}$ = $N_{h^{c b 1}} / 2$ = $N_{h^{c b 0}} / 4$ . Within stratum A_h, the set of original population elements comprising set ${S S U}_{h^{c b 0}}$ remains unchanged because stratum boundaries have not changed.

In general, for a-order counterbalancing ( $a \geq 1$ ), the number of a-order sampling units in set ${S S U}_{h^{c b a}}$ will be $N_{h^{c b a}}$ = $N_{h^{c b 0}} / 2^{a}$ . Within stratum A_h, the set of original population elements comprising set ${S S U}_{h^{c b 0}}$ remains unchanged because stratum boundaries have not changed.

Stratum Sampling Fraction

In SMPU applications where the stratum sample size is determined using Neyman Allocation (Neyman, 1934), the stratum sampling fraction ( $f_{h} = n_{h} / N_{h}$ ) for first-order, second-order, and higher orders of counterbalancing will be approximately the same as with zero-order counterbalancing. That is, $f_{h^{c b 0}}$ ≈ $f_{h^{c b 1}}$ ≈ $f_{h^{c b 2}}$ and so forth. This result occurs because the sample size for stratum A_h is based on the product $N_{h} s_{h_{x}}$ relative to the sum of this product across all sampling strata. When counterbalancing is used, reductions in $N_{h}$ and $s_{h_{x}}$ tend to be proportionally similar across sampling strata. The result is that a similar proportion of the total sample size ( $n_{c b 0}, n_{c b 1}, n_{c b 2}$ ) is allocated to stratum A_h resulting in an approximately stable sampling fraction for the stratum.

Stratum Standard Deviation

Excluding circumstances where stratum sampling unit correct values ( $y_{h_{i}}$ ) do not vary, counterbalancing reduces the range of stratum sampling unit correct values. In most circumstances, a reduction in the range of sampling unit values induces a similar reduction in the standard deviation of those values. This relationship was documented in the statistics literature over 80 years ago by Pearson and Haines (1935), and appears to have been first introduced in the auditing literature by Hill et al. (1962).⁹ Given this relationship, in most circumstances, counterbalancing will reduce the standard deviation of stratum sampling unit correct values ( $s_{h_{y}}$ ).¹⁰ In turn, this should permit a reduction in the minimum stratum sample size needed to reliably estimate strata standard deviations.

Screening and Construction of Sampling Strata

Ideally, once the number of counterbalancing applications ( $a$ ) is determined and sampling strata boundaries are set, the value $N_{h^{c b 0}} / 2^{a}$ will be an even number in all sampling strata. When this outcome exists, each sampling stratum will have an even number of sampling units at each stage $i$ of the counterbalancing process ( $c b i : i = 0, 1, \dots, a)$ . In practice, this condition is not likely to exist in all sampling strata. In most applications, some sampling strata at a particular stage $i$ of the counterbalancing process ( $c b i : i = 0, 1, \dots, a)$ will contain an odd number of sampling units. The presence of an odd number of strata sampling units precludes a straightforward application of the counterbalancing procedures described earlier.

When a sampling stratum at a particular stage $i$ of the counterbalancing process ( $c b i : i = 0, 1, \dots, a)$ contains an odd number of sampling units, counterbalancing may proceed via use of the following modified counterbalancing procedures.

(1) Sort ${S S U}_{h^{c b a}}$ in ascending order based on recorded value.

(2) Designate one sampling unit in ${S S U}_{h^{c b a}}$ to be a sampling unit in the next set of stratum sampling units ( ${S S U}_{h^{c b (a + 1)}}$ ). This forces the number of remaining sampling units in ${S S U}_{h^{c b a}}$ to be an even number. To help ensure there is no increase in the range of ${S S U}_{h^{c b (a + 1)}}$ sampling units, the recorded value of the designated sampling unit from ${S S U}_{h^{c b a}}$ should be as close as possible to twice the mean recorded value of sampling units in ${S S U}_{h^{c b a}}$ .

(3) Apply counterbalancing to the remaining sampling units in ${S S U}_{h^{c b a}}$ to create sampling units for the next set of stratum sampling units ( ${S S U}_{h^{c b (a + 1)}}$ ).

(4) The resulting next set of stratum sampling units ( ${S S U}_{h^{c b (a + 1)}}$ ) will consist of the designated sampling unit from step #2 above and the counterbalanced sampling units from step #3 above.
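The four steps above can be sketched in Python. This is a minimal sketch under stated assumptions: the function name is hypothetical, units are represented by recorded values only, ties in step 2 are broken in favor of the smaller recorded value, and the example figures are invented for illustration.

```python
def counterbalance_with_odd_count(recorded_values):
    """Apply one counterbalancing pass, tolerating an odd number of units.

    Returns the recorded values of the next set of stratum sampling units.
    """
    ordered = sorted(recorded_values)                    # step 1
    designated = []
    if len(ordered) % 2 != 0:
        # Step 2: carry forward, unpaired, the unit whose recorded value is
        # closest to twice the mean recorded value of the current set.
        target = 2 * sum(ordered) / len(ordered)
        index = min(range(len(ordered)), key=lambda i: abs(ordered[i] - target))
        designated = [ordered.pop(index)]
    half = len(ordered) // 2
    # Step 3: pair the smallest remaining unit with the largest, and so on.
    paired = [low + high for low, high in zip(ordered[:half], reversed(ordered[half:]))]
    return designated + paired                           # step 4


# Hypothetical stratum with five sampling units.
print(counterbalance_with_odd_count([100, 120, 150, 180, 300]))
```

With these hypothetical values the mean is 170, so the unit recorded at 300 is the closest to twice the mean and is carried forward alone, while the remaining four units form two counterbalanced pairs.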

These modified counterbalancing procedures introduce additional complexity into the counterbalancing process. To avoid this complexity and related work effort, if the value $N_{h^{c b 0}} / 2^{a}$ is not an even number in all sampling strata after preliminary stratum boundaries are set, the sampling plan can be modified to force an even number in all sampling strata using one or more of the following procedures.

(1) Alter stratum boundaries until $N_{h^{c b 0}} / 2^{a}$ is an even number in all sampling strata.

(2) Move one or more sampling units from the top sampling stratum to the census stratum to force an even number of sampling units in the top stratum.

(3) Add a dummy sampling unit with a correct value of zero to each sampling stratum containing an odd number of sampling units. The recorded value of this dummy sampling unit should be set so that it is the smallest recorded value in the stratum.

In many applications, using a combination of the first two procedures will be sufficient.

Predicted Effect of Counterbalancing on SMPU Estimator Variance

Recall from equation (3), the contribution ( $Δ_{h_{y}^{c b 0}}$ ) of sampling stratum A_h to the conventional SMPU estimator variance is:

$$Δ_{h_{y}^{c b 0}} = N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) s_{h_{y}^{c b 0}}^{2} / n_{h^{c b 0}} \tag{8}$$

Under an assumption that counterbalancing maintains a constant sampling fraction, the variable $n_{h^{c b 0}}$ can be eliminated from equation (8) by substituting the product: $f_{h} N_{h^{c b 0}}$ . The result is:

$$Δ_{h_{y}^{c b 0}} = (1 - f_{h}) N_{h^{c b 0}} s_{h_{y}^{c b 0}}^{2} / f_{h} \tag{9}$$

In equation (9), the contribution of sampling stratum A_h to the estimator variance is a function of the stratum sampling fraction, number of stratum sampling units, and stratum variance. Given the assumption of a constant sampling fraction, and a recognition that the number of stratum sampling units and stratum variance vary with the number of times that counterbalancing is applied, equation (9) can be restated as:

$$Δ_{h_{y}^{c b v}} = (1 - f_{h}) N_{h^{c b v}} s_{h_{y}^{c b v}}^{2} / f_{h} \tag{10}$$

where the subscript cbv identifies model terms that vary with successive counterbalancing applications.

To determine how successive counterbalancing applications affect the contribution of sampling stratum A_h to the estimator variance, we take the total differential of equation (10). The result is:

$$d [Δ_{h_{y}^{c b v}}] = \left( \frac{(1 - f_{h}) s_{h_{y}^{c b v}}^{2}}{f_{h}} d [N_{h^{c b v}}] \right) + \left( \frac{(1 - f_{h}) N_{h^{c b v}}}{f_{h}} 2 s_{h_{y}^{c b v}} d [s_{h_{y}^{c b v}}] \right) \tag{11}$$

With each successive counterbalancing application, the increment $d [N_{h^{c b v}}]$ will be negative in all sampling strata while the increment $d [s_{h_{y}^{c b v}}]$ will be negative in most, if not all, sampling strata. All other factors in equation (11) are positive. Hence, $d [Δ_{h_{y}^{c b v}}]$ will likely be negative with each successive application of counterbalancing, indicating a reduced contribution to the SMPU estimator variance. Smaller contributions to the estimator variance across sampling strata lead to a reduced estimator variance and, in turn, a reduced estimator standard deviation.
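As a complement to the differential argument, the size of the effect for a single application can be worked directly from equation (10). The sketch below assumes a constant sampling fraction and an even number of cb0 sampling units, so that the number of units is exactly halved; the 0.7 standard deviation ratio is a purely hypothetical figure chosen for illustration.

```latex
% Ratio of stratum contributions after and before first-order counterbalancing,
% using equation (10) with constant f_h and N_{h^{cb1}} = N_{h^{cb0}} / 2:
\frac{\Delta_{h_{y}^{cb1}}}{\Delta_{h_{y}^{cb0}}}
  = \frac{N_{h^{cb1}} \, s_{h_{y}^{cb1}}^{2}}{N_{h^{cb0}} \, s_{h_{y}^{cb0}}^{2}}
  = \frac{1}{2} \left( \frac{s_{h_{y}^{cb1}}}{s_{h_{y}^{cb0}}} \right)^{2}

% Hypothetical case: the stratum standard deviation falls to 70% of its cb0 value.
\frac{1}{2} (0.7)^{2} = 0.245, \qquad \sqrt{0.245} \approx 0.495
```

Under these assumptions, the stratum's contribution to the estimator variance would fall to roughly one quarter of its original value. If every sampling stratum behaved the same way, the estimator standard deviation would fall to roughly half of its original value. Even with no change in the stratum standard deviation, halving the number of sampling units alone halves the contribution.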

Numeric Example: Effect of Counterbalancing on Stratum Standard Deviation

To illustrate the process of counterbalancing, Table 1 presents information for a hypothetical stratum consisting of 20 population members ( $N_{h^{c b 0}}$ = 20). These population members are the cb0 sampling units comprising set ${S S U}_{h^{c b 0}}$ . For purposes of exposition, these cb0 sampling units have been sorted in ascending order on recorded value ( $x_{i}$ ) and assigned an identification number on the range of 1 to 20, inclusive. Details for the stratum include individual recorded values ( $x_{i}$ ), correct values ( $y_{i}$ ), and differences ( $d_{i}$ ). Consistent with a low-error population, there are only two differences between recorded values and correct values, with the two differences totaling 112. Ten percent of stratum sampling units contain error and the total monetary error represents approximately two percent of the total recorded value. Table 1 reports the range and standard deviation of sampling unit correct values as 362 and 113.99, respectively.

Table 1.

Stratum A_h Sampling Units Prior to Counterbalancing (Sorted in Ascending Order on Recorded Value)

Sampling unit identification number	Recorded value	Correct value	Difference in values
$i$ ^a	$x_{i}$	$y_{i}$	$d_{i}$
1	116	116	0
2	126	126	0
3	139	139	0
4	161	127	34
5	193	193	0
6	204	204	0
7	213	213	0
8	228	228	0
9	229	229	0
10	254	254	0
11	276	276	0
12	277	277	0
13	286	286	0
14	310	310	0
15	363	285	78
16	428	428	0
17	431	431	0
18	437	437	0
19	462	462	0
20	478	478	0
Total =	5,611	5,499	112
Number of sampling units =	20	20	20
Range of sampling units =	362	362	78
Standard deviation of sampling units =	113.58	113.99	18.18

^aThe sampling units presented in this table are cb0 sampling units, comprise the membership of ${S S U}_{h^{c b 0}}$ , and would be used in performing first-order counterbalancing.

Table 2 presents information about the hypothetical stratum after application of first-order counterbalancing to set ${S S U}_{h^{c b 0}}$ listed in Table 1. The result is set ${S S U}_{h^{c b 1}}$ which consists of 10 cb1 sampling units. Each member of set ${S S U}_{h^{c b 1}}$ listed in Table 2 consists of two cb0 sampling units from set ${S S U}_{h^{c b 0}}$ listed in Table 1. For purposes of exposition, these cb1 sampling units have been sorted in ascending order on recorded value and assigned an identification number on the range of 1 to 10, inclusive. Compared to conventional SMPU, first-order counterbalancing reduced the number of stratum sampling units by 50% (20 cb0 sampling units versus 10 cb1 sampling units). After first-order counterbalancing, the range and standard deviation of stratum sampling unit correct values are 132 and 41.52, respectively. Hence, compared to conventional SMPU as depicted in Table 1, first-order counterbalancing reduced the range and standard deviation of sampling unit correct values by 63.5% and 63.6%, respectively.

Table 2.

Stratum A_h Sampling Units after First-Order Counterbalancing (Sorted in Ascending Order on Recorded Value)

Sampling unit identification number	Composition of j^th sampling unit	Recorded value	Correct value	Difference in values
$j$ ^a	Mapped to Table 1 ^b	$x_{j}$	$y_{j}$	$d_{j}$
1	$i$ = 9 & 12	506	506	0
2	$i$ = 8 & 13	514	514	0
3	$i$ = 7 & 14	523	523	0
4	$i$ = 10 & 11	530	530	0
5	$i$ = 6 & 15	567	489	78
6	$i$ = 3 & 18	576	576	0
7	$i$ = 2 & 19	588	588	0
8	$i$ = 4 & 17	592	558	34
9	$i$ = 1 & 20	594	594	0
10	$i$ = 5 & 16	621	621	0
Total =		5,611	5,499	112
Number of sampling units =		10	10	10
Range of sampling units =		115	132	78
Standard deviation of sampling units =		37.79	41.52	24.47

^aThe sampling units presented in this table are cb1 sampling units, comprise the membership of ${S S U}_{h^{c b 1}}$ , and would be used in performing second-order counterbalancing.

^bThe sampling units presented in Table 1 are cb0 sampling units, comprise the membership of ${S S U}_{h^{c b 0}}$ , and would be used in performing first-order counterbalancing.

Table 3 presents information about the hypothetical stratum after application of second-order counterbalancing to set ${S S U}_{h^{c b 1}}$ listed in Table 2. The result is set ${S S U}_{h^{c b 2}}$ which consists of five cb2 sampling units. Each member of set ${S S U}_{h^{c b 2}}$ listed in Table 3 consists of two cb1 sampling units from set ${S S U}_{h^{c b 1}}$ listed in Table 2. Each member of set ${S S U}_{h^{c b 2}}$ listed in Table 3 also consists of four cb0 sampling units from set ${S S U}_{h^{c b 0}}$ listed in Table 1. If desired, members of set ${S S U}_{h^{c b 2}}$ listed in Table 3 would be used in a third application of counterbalancing (not presented here). For purposes of exposition, these cb2 sampling units have been sorted in ascending order on recorded value and assigned an identification number on the range of 1 to 5, inclusive. Compared to conventional SMPU, second-order counterbalancing reduced the number of stratum sampling units by 75% (20 cb0 sampling units versus 5 cb2 sampling units). After second-order counterbalancing, the range and standard deviation of sampling unit correct values are 62 and 23.25, respectively. Hence, compared to conventional SMPU as depicted in Table 1, second-order counterbalancing reduced the range and standard deviation of sampling unit correct values by 82.9% and 79.6%, respectively.

Table 3.

Stratum A_h Sampling Units after Second-Order Counterbalancing (Sorted in Ascending Order on Recorded Value)

Sampling unit identification number	Composition of k^th sampling unit mapped to		Recorded value	Correct value	Difference in values
$k$ ^a	Table 2 ^b	Table 1 ^c	$x_{k}$	$y_{k}$	$d_{k}$
1	$j$ = 2 & 9	$i$ = (8 & 13) & (1 & 20)	1,108	1,108	0
2	$j$ = 3 & 8	$i$ = (7 & 14) & (4 & 17)	1,115	1,081	34
3	$j$ = 4 & 7	$i$ = (10 & 11) & (2 & 19)	1,118	1,118	0
4	$j$ = 1 & 10	$i$ = (9 & 12) & (5 & 16)	1,127	1,127	0
5	$j$ = 5 & 6	$i$ = (6 & 15) & (3 & 18)	1,143	1,065	78
Total =			5,611	5,499	112
Number of sampling units =			5	5	5
Range of sampling units =			35	62	78
Standard deviation of sampling units =			12.06	23.25	30.76

^aThe sampling units presented in this table are cb2 sampling units, comprise the membership of ${S S U}_{h^{c b 2}}$ , and would be used in performing third-order counterbalancing.

^bThe sampling units presented in Table 2 are cb1 sampling units, comprise the membership of ${S S U}_{h^{c b 1}}$ , and would be used in performing second-order counterbalancing.

^cThe sampling units presented in Table 1 are cb0 sampling units, comprise the membership of ${S S U}_{h^{c b 0}}$ , and would be used in performing first-order counterbalancing.
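As a quick check, the summary rows of Table 3 can be recomputed from the five listed sampling units. The sketch below is illustrative only. The variable and function names are hypothetical, and it assumes the tabulated standard deviations use the population form (divisor equal to the number of sampling units), because that form reproduces the reported figures.

```python
from statistics import pstdev

# Recorded (x_k) and correct (y_k) values of the five cb2 sampling units in Table 3.
recorded = [1108, 1115, 1118, 1127, 1143]
correct = [1108, 1081, 1118, 1127, 1065]
difference = [x - y for x, y in zip(recorded, correct)]


def summarize(values):
    """Return total, count, range, and population standard deviation (divisor N)."""
    return {
        "total": sum(values),
        "count": len(values),
        "range": max(values) - min(values),
        "sd": round(pstdev(values), 2),  # ASSUMPTION: population form, not sample form
    }


table3_summary = {
    "recorded": summarize(recorded),
    "correct": summarize(correct),
    "difference": summarize(difference),
}

for column, stats in table3_summary.items():
    print(column, stats)
```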

Numeric Example: Effect of Counterbalancing on SMPU Estimator Precision

This section develops formulas to estimate the change in SMPU estimator precision (P) arising from use of Intrastratum Counterbalancing. We begin by considering how counterbalancing affects the multiplier t in equation (6). This is followed by development of numeric estimates of changes in precision when first-order and second-order counterbalancing are used.

Effect of Counterbalancing on the Multiplier t

In applications where a standard normal deviate (i.e., z-value) is used in computing precision (P), counterbalancing will have no effect on the multiplier t. If a Student’s t-value is used in computing precision (P), given the typical characteristics of accounting and auditing estimation applications (n = 100 to 300, L = 3 to 6, and planned confidence levels of 85 to 95%), the change in t will be minor.¹¹ For example, with $n_{c b 0}$ = 100, L = 3, and assuming a two-tailed desired confidence of 85%, $t_{c b 0}$ (df = 97) = 1.45. Given the same facts but using first-order counterbalancing and assuming a constant sampling fraction, the result is $n_{c b 1}$ = 50, L = 3, and $t_{c b 1}$ (df = 47) = 1.46. In this example, first-order counterbalancing increased the multiplier t by approximately 0.7%.

Alternatively, with $n_{c b 0}$ = 300, L = 6, and assuming a two-tailed desired confidence of 95%, $t_{c b 0}$ (df = 294) = 1.97. Given the same facts but using first-order counterbalancing and assuming a constant sampling fraction, the result is $n_{c b 1}$ = 150, L = 6, and $t_{c b 1}$ (df = 144) = 1.98. Again, the smaller degrees of freedom associated with first-order counterbalancing caused the multiplier t to increase, here by approximately 0.5%.

In both of the preceding examples, first-order counterbalancing increased the multiplier t by a small magnitude. Because counterbalancing always reduces the number of sampling units examined, each successive application of counterbalancing reduces the application degrees of freedom and increases the multiplier t (assuming use of a Student’s t-value). Absent other counterbalancing effects, this degrades precision. However, as demonstrated in the next two sections, other effects induced by counterbalancing lead to improved precision.
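The two examples above can be reproduced without statistical tables. The sketch below is a minimal standard-library illustration, not the procedure used in the study: it integrates the Student's t density numerically and inverts it by bisection. The function names are hypothetical, and degrees of freedom are taken as n minus L, as in the examples above.

```python
import math


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_c = math.lgamma((df + 1) / 2) - math.lgamma(df / 2) - 0.5 * math.log(df * math.pi)
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))


def central_prob(x, df, steps=2000):
    """P(-x < T < x), using Simpson's rule on [0, x] (steps must be even)."""
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 2 * total * h / 3


def t_multiplier(confidence, df):
    """Two-tailed t-value: the x with P(-x < T < x) = confidence, found by bisection."""
    lo, hi = 0.0, 20.0
    for _ in range(50):
        mid = (lo + hi) / 2
        if central_prob(mid, df) < confidence:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# (n_cb0, number of strata L, two-tailed confidence) for the two examples in the text.
examples = [(100, 3, 0.85), (300, 6, 0.95)]
for n_cb0, n_strata, confidence in examples:
    t_cb0 = t_multiplier(confidence, n_cb0 - n_strata)
    t_cb1 = t_multiplier(confidence, n_cb0 // 2 - n_strata)  # constant sampling fraction
    print(n_cb0, n_strata, confidence, round(t_cb0, 2), round(t_cb1, 2))
```

The percentage increases quoted in the text are based on t-values rounded to two decimals, so ratios computed from unrounded values may differ slightly.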

Effect of First-Order Counterbalancing on Precision

Based on equations (4) and (6), the estimator precision values for conventional SMPU (cb0) and for SMPU with first-order counterbalancing (cb1) are:

P_{c b 0} = t_{c b 0} \sqrt{\sum_{h = 1}^{L} N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) s_{h_{y}^{c b 0}}^{2} / n_{h^{c b 0}}}

(12)

P_{c b 1} = t_{c b 1} \sqrt{\sum_{h = 1}^{L} N_{h^{c b 1}} (N_{h^{c b 1}} - n_{h^{c b 1}}) s_{h_{y}^{c b 1}}^{2} / n_{h^{c b 1}}}

(13)

To determine how first-order counterbalancing affects estimator precision, we compute the ratio of SMPU precision with first-order counterbalancing divided by SMPU precision without counterbalancing. The result is:

\frac{P_{c b 1}}{P_{c b 0}} = \frac{t_{c b 1} \sqrt{\sum_{h = 1}^{L} N_{h^{c b 1}} (N_{h^{c b 1}} - n_{h^{c b 1}}) \frac{s_{h_{y}^{c b 1}}^{2}}{n_{h^{c b 1}}}}}{t_{c b 0} \sqrt{\sum_{h = 1}^{L} N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) \frac{s_{h_{y}^{c b 0}}^{2}}{n_{h^{c b 0}}}}} = \frac{t_{c b 1}}{t_{c b 0}} \sqrt{\frac{\sum_{h = 1}^{L} N_{h^{c b 1}} (N_{h^{c b 1}} - n_{h^{c b 1}}) \frac{s_{h_{y}^{c b 1}}^{2}}{n_{h^{c b 1}}}}{\sum_{h = 1}^{L} N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) \frac{s_{h_{y}^{c b 0}}^{2}}{n_{h^{c b 0}}}}}

(14)

A ratio less than one indicates that precision with first-order counterbalancing is better (smaller) than precision with conventional SMPU, which corresponds to a narrower confidence interval.

For hypothesis-development purposes, we model strata variances with first-order counterbalancing as the product of variances with zero-order counterbalancing (i.e., conventional SMPU) and a constant multiplier $F_{1}$ . That is:

F_{1} s_{h_{y}^{c b 0}}^{2} = s_{h_{y}^{c b 1}}^{2}

(15)

We also model the multiplier $t_{c b 1}$ with first-order counterbalancing as the product of the multiplier $t_{c b 0}$ with zero-order counterbalancing (i.e., conventional SMPU) and a constant multiplier $G_{1}$ . That is:

G_{1} t_{c b 0} = t_{c b 1}

(16)

Assuming strata sampling fractions are constant, substituting $\frac{N_{h^{c b 0}}}{2}$ for $N_{h^{c b 1}}$ , and substituting $\frac{n_{h^{c b 0}}}{2}$ for $n_{h^{c b 1}}$ in equation (14), the revised equation becomes:

\frac{P_{c b 1}}{P_{c b 0}} = \frac{G_{1} t_{c b 0}}{t_{c b 0}} \sqrt{\frac{\sum_{h = 1}^{L} \frac{N_{h^{c b 0}}}{2} \left(\frac{N_{h^{c b 0}}}{2} - \frac{n_{h^{c b 0}}}{2}\right) \frac{F_{1} s_{h_{y}^{c b 0}}^{2}}{\frac{n_{h^{c b 0}}}{2}}}{\sum_{h = 1}^{L} N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) \frac{s_{h_{y}^{c b 0}}^{2}}{n_{h^{c b 0}}}}}

(17)

After cancellations, the result for first-order counterbalancing is:

\frac{P_{c b 1}}{P_{c b 0}} = G_{1} \sqrt{\frac{F_{1}}{2}}

(18)
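For readers who want the intermediate step, the cancellation leading from equation (17) to equation (18) can be written out for a single stratum term of the numerator. The notation is the same as above and no further assumptions are introduced:

\frac{N_{h^{c b 0}}}{2} \left(\frac{N_{h^{c b 0}}}{2} - \frac{n_{h^{c b 0}}}{2}\right) \frac{F_{1} s_{h_{y}^{c b 0}}^{2}}{\frac{n_{h^{c b 0}}}{2}} = \frac{N_{h^{c b 0}}}{2} \cdot \frac{N_{h^{c b 0}} - n_{h^{c b 0}}}{2} \cdot \frac{2 F_{1} s_{h_{y}^{c b 0}}^{2}}{n_{h^{c b 0}}} = \frac{F_{1}}{2} N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) \frac{s_{h_{y}^{c b 0}}^{2}}{n_{h^{c b 0}}}

Because the factor $\frac{F_{1}}{2}$ is the same in every stratum, it can be moved outside the numerator sum. The remaining sum is identical to the denominator sum and cancels, the square root leaves $\sqrt{\frac{F_{1}}{2}}$, and the $t_{c b 0}$ terms cancel to leave $G_{1}$.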

Based on equation (18), if first-order counterbalancing reduces conventional SMPU strata variances by 20% ($F_{1} = 0.80$) and increases the multiplier t by 1% ($G_{1} = 1.01$), the magnitude of SMPU estimator precision with first-order counterbalancing (cb1) will be approximately 64% of precision with conventional SMPU. This equates to a 36% improvement in precision. Alternatively, if first-order counterbalancing reduces conventional SMPU strata variances by 40% ($F_{1} = 0.60$) and increases the multiplier t by 2% ($G_{1} = 1.02$), equation (18) predicts the magnitude of SMPU estimator precision with first-order counterbalancing will be approximately 56% of precision with conventional SMPU. This represents a 44% improvement in precision.

Effect of Second-Order Counterbalancing on Precision

To determine how second-order counterbalancing affects estimator precision, we use the same logic as before but compare precision with second-order counterbalancing to precision with no counterbalancing. The result is:

\frac{P_{c b 2}}{P_{c b 0}} = \frac{t_{c b 2} \sqrt{\sum_{h = 1}^{L} N_{h^{c b 2}} (N_{h^{c b 2}} - n_{h^{c b 2}}) \frac{s_{h_{y}^{c b 2}}^{2}}{n_{h^{c b 2}}}}}{t_{c b 0} \sqrt{\sum_{h = 1}^{L} N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) \frac{s_{h_{y}^{c b 0}}^{2}}{n_{h^{c b 0}}}}} = \frac{t_{c b 2}}{t_{c b 0}} \sqrt{\frac{\sum_{h = 1}^{L} N_{h^{c b 2}} (N_{h^{c b 2}} - n_{h^{c b 2}}) \frac{s_{h_{y}^{c b 2}}^{2}}{n_{h^{c b 2}}}}{\sum_{h = 1}^{L} N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) \frac{s_{h_{y}^{c b 0}}^{2}}{n_{h^{c b 0}}}}}

(19)

To simplify equation (19) we model strata variances with second-order counterbalancing as the product of variances with conventional SMPU and a constant $F_{2}$ . That is:

F_{2} s_{h_{y}^{c b 0}}^{2} = s_{h_{y}^{c b 2}}^{2}

(20)

We also model the multiplier $t_{c b 2}$ with second-order counterbalancing as the product of the multiplier $t_{c b 0}$ with conventional SMPU and a constant $G_{2}$ . That is:

G_{2} t_{c b 0} = t_{c b 2}

(21)

Finally, we substitute $\frac{N_{h^{c b 0}}}{4}$ for $N_{h^{c b 2}}$ and, assuming a constant sampling fraction, $\frac{n_{h^{c b 0}}}{4}$ is substituted for $n_{h^{c b 2}}$ . The revised equation is:

\frac{P_{c b 2}}{P_{c b 0}} = \frac{G_{2} t_{c b 0}}{t_{c b 0}} \sqrt{\frac{\sum_{h = 1}^{L} \frac{N_{h^{c b 0}}}{4} \left(\frac{N_{h^{c b 0}}}{4} - \frac{n_{h^{c b 0}}}{4}\right) \frac{F_{2} s_{h_{y}^{c b 0}}^{2}}{\frac{n_{h^{c b 0}}}{4}}}{\sum_{h = 1}^{L} N_{h^{c b 0}} (N_{h^{c b 0}} - n_{h^{c b 0}}) \frac{s_{h_{y}^{c b 0}}^{2}}{n_{h^{c b 0}}}}}

(22)

After cancellations, the result for second-order counterbalancing is:

\frac{P_{c b 2}}{P_{c b 0}} = G_{2} \sqrt{\frac{F_{2}}{4}}

(23)

If second-order counterbalancing reduces conventional SMPU strata variances by 20% ($F_{2} = 0.80$) and increases the multiplier t by 1% ($G_{2} = 1.01$), equation (23) predicts the magnitude of SMPU estimator precision with second-order counterbalancing will be approximately 45% of precision with conventional SMPU. The result is a 55% improvement in precision. Alternatively, if second-order counterbalancing reduces conventional SMPU strata variances by 40% ($F_{2} = 0.60$) and increases the multiplier t by 2% ($G_{2} = 1.02$), equation (23) predicts the magnitude of SMPU estimator precision with second-order counterbalancing will be approximately 40% of precision with conventional SMPU. The result is a 60% improvement in precision.

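The preceding results can be generalized as follows. With $F_{a}$ and $G_{a}$ defined analogously to equations (15) and (16), and assuming constant sampling fractions, the precision of SMPU after $a$ applications of counterbalancing ($a$th-order counterbalancing) relative to precision with conventional SMPU can be estimated as: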

\frac{P_{c b a}}{P_{c b 0}} = G_{a} \sqrt{\frac{F_{a}}{2^{a}}}

(24)
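Equation (24) is simple to evaluate directly. The following sketch recomputes the four worked cases from the preceding paragraphs and, for one hypothetical three-stratum application, compares the closed form with the full sums in equations (12) and (13). The strata sizes, sample sizes, variances, and function names are invented for illustration and are not taken from the study.

```python
import math


def relative_precision(F, G, a):
    """Equation (24): P_cba / P_cb0 = G_a * sqrt(F_a / 2**a)."""
    return G * math.sqrt(F / 2 ** a)


def precision(t, N, n, s2):
    """Form of equations (12) and (13): t * sqrt(sum of N_h (N_h - n_h) s_h^2 / n_h)."""
    return t * math.sqrt(sum(Nh * (Nh - nh) * s / nh for Nh, nh, s in zip(N, n, s2)))


# Worked cases from the text, as (F, G, order a).
cases = [(0.80, 1.01, 1), (0.60, 1.02, 1), (0.80, 1.01, 2), (0.60, 1.02, 2)]
ratios = [round(relative_precision(F, G, a), 2) for F, G, a in cases]
print(ratios)  # [0.64, 0.56, 0.45, 0.4]

# HYPOTHETICAL application: three strata, constant sampling fractions under counterbalancing.
N0, n0, s2_0, t0 = [4000, 3000, 1000], [40, 60, 100], [25.0, 400.0, 9000.0], 1.97
F1, G1 = 0.80, 1.01
p_cb0 = precision(t0, N0, n0, s2_0)
p_cb1 = precision(G1 * t0, [N / 2 for N in N0], [n / 2 for n in n0], [F1 * s for s in s2_0])
full_sum_ratio = p_cb1 / p_cb0
print(full_sum_ratio, relative_precision(F1, G1, 1))
```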

Planning Intrastratum Counterbalancing

In most cases, the application of counterbalancing in all sampling strata will improve SMPU estimation precision. In rare cases, counterbalancing in all strata will improve SMPU estimation precision overall, but some sampling strata will not benefit from the application of counterbalancing. To help plan the use of counterbalancing, the Appendix provides equations for: (1) assessing whether counterbalancing in all sampling strata will improve SMPU estimation precision and (2) identifying those sampling strata where the use of counterbalancing will not be beneficial.

Hypotheses

Given the predictions derived from equations (18) and (23), we expect first-order and second-order counterbalancing will yield better (smaller) estimator precision than conventional SMPU. We also expect that second-order counterbalancing will yield better (smaller) estimator precision than first-order counterbalancing. Hence, our hypotheses are:

Hypothesis 1

For low-error populations, SMPU estimation with first-order counterbalancing yields better (smaller) mean precision values than SMPU estimation without counterbalancing (i.e., ${\bar{P}}_{c b 1}$ < ${\bar{P}}_{c b 0}$).

Hypothesis 2

For low-error populations, SMPU estimation with second-order counterbalancing yields better (smaller) mean precision values than SMPU estimation without counterbalancing (i.e., ${\bar{P}}_{c b 2}$ < ${\bar{P}}_{c b 0}$).

Hypothesis 3

For low-error populations, SMPU estimation with second-order counterbalancing yields better (smaller) mean precision values than SMPU estimation with first-order counterbalancing (i.e., ${\bar{P}}_{c b 2}$ < ${\bar{P}}_{c b 1}$).

The preceding hypotheses are tested via simulation using real accounting populations, realistic numbers of sampling strata, and realistic population misstatement rates.¹² Although our hypotheses do not involve the number of sampling strata or population misstatement rates, these factors are manipulated in our simulations to enhance the external validity of our tests. These manipulations also permit testing for important interactions between: (1) counterbalancing and the number of sampling strata, and (2) counterbalancing and the population misstatement rate.

Methodology

Overview

The study used a 3 (Counterbalancing Applications) × 3 (Number of Strata) × 4 (Misstatement Rate) factorial repeated-measures design with one observation for each of 10 populations in each of the 36 design cells.¹³ A repeated-measures design was employed because relatively few real accounting populations were available, making the use of unique populations in each design cell infeasible. Within each cell of the experimental design, the response measure for each population was the mean SMPU estimation precision derived from 10,000 stratified random samples selected from that specific population. This large number of samples was used to eliminate the possibility that an unrepresentative sample would bias study results. In total, our simulations generated 3,600,000 stratified random samples (36 design cells × 10 populations × 10,000 samples for each population) that were averaged to create a total of 360 measures of mean SMPU estimation precision. These observations were analyzed for main effects and interaction effects.
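The counts in this overview follow directly from the factor levels described in the manipulation sections. A minimal sketch of the enumeration is shown below; the variable names are illustrative.

```python
from itertools import product

counterbalancing = ["cb0", "cb1", "cb2"]          # Manipulation #1
strata_levels = [3, 5, 7]                         # Manipulation #2
misstatement_rates = [0.0, 0.006, 0.012, 0.018]   # Manipulation #3
n_populations = 10
samples_per_observation = 10_000

# One design cell per combination of factor levels.
cells = list(product(counterbalancing, strata_levels, misstatement_rates))
observations = len(cells) * n_populations
total_samples = observations * samples_per_observation

print(len(cells), observations, total_samples)  # 36 360 3600000
```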

Manipulation #1: Counterbalancing Applications

Counterbalancing Applications was manipulated on three treatment levels: stratified random selection with no counterbalancing (cb0), stratified random selection with first-order counterbalancing (cb1), and stratified random selection with second-order counterbalancing (cb2). This manipulation was used to permit testing of study hypotheses. In our cb1 and cb2 treatment levels, counterbalancing was applied in all sampling strata.

Manipulation #2: Number of Sampling Strata

Number of Strata was manipulated on three treatment levels: 3, 5, and 7 sampling strata. These values fall within the range commonly suggested in the accounting and auditing literatures (AICPA, 2019; Guy et al., 2002; Roberts, 1978). The range of sampling strata used in the study does not include larger values that are sometimes used in practice. Higher values were excluded to avoid empty cells in the study design.

Manipulation #3: Misstatement Rate

Misstatement Rate was manipulated on four treatment levels: 0.0%, 0.6%, 1.2%, and 1.8%. To achieve these misstatement rates, populations were seeded with four different error rates: 0.0%, 3.0%, 6.0%, and 9.0%.¹⁴ Error magnitudes were fixed at 20% of recorded value (i.e., a 20% tainting).

Based on a study by Durney et al. (2014), study misstatement rates are consistent with rates commonly found in accounting populations. Durney et al. (2014) examined sampling data for 160 sampling applications performed by a large audit firm. Their analysis disclosed low misstatement rates in accounting populations. Fifty-eight percent of applications exhibited no error, 90% of applications exhibited a misstatement rate of 0.5% or less, 96% of applications exhibited a misstatement rate of 1.5% or less, and 100% of applications exhibited a misstatement rate of 5.0% or less.

Study Populations

Actual inventory listings for 10 business firms were obtained from a commercial inventory valuation firm. These inventory listings constitute the original populations used to create study populations. Each entry in these listings included a description of an inventory item and its recorded value. For some business firms, a single inventory listing was received (typically finished goods). For other business firms, three separate inventory listings were received (raw materials, work-in-progress, and finished goods). We exercised no control over the selection of these listings but did request they reflect the full range of characteristics in populations known to the inventory valuation firm.

Equations (4) and (6) indicate that population size is a causal factor in determining SMPU precision. As population size increases, the magnitude of precision ( $P$ ) for an estimated total value increases (estimates become less precise assuming no other changes). To ensure that variations in population size did not affect study results, the size of study populations was fixed at 8,000 items. While other values are plausible and reasonable, this value is realistic, sufficient to justify the use of stratified selection, and not so great as to create a computational burden for study simulation programs. Given the decision to fix population size at 8,000 items, original populations with fewer than 8,000 items were eliminated from use in the study.

From among the remaining original populations, the 10 populations with the largest number of items were selected for use in our study. To fix study population sizes at 8,000 items, the following procedures were used. First, within each original population, individual items with recorded values in excess of one percent of total recorded value were identified as individually significant, classified as belonging in a census stratum, and eliminated from inventory listings.¹⁵ Second, within each original population, the largest 8,000 items were selected for use in study simulations.¹⁶ The result of these procedures was creation of 10 study populations with 8,000 items each whose key characteristics closely mirrored the original inventory populations. Summary statistics for the recorded values of the study populations are provided in Table 4.

Table 4.

Summary Statistics for Study Populations

Population	N ^a	Mean^b	Standard deviation^b	Skewness	Minimum value^b	Maximum value^b
1	8,000	953	2,637	8.84	46	63,666
2	8,000	10,963	43,212	9.89	5	868,509
3	8,000	98	59	0.58	20	233
4	8,000	3,994	15,015	11.04	324	294,140
5	8,000	16	3	0.19	11	23
6	8,000	38	7	0.21	27	51
7	8,000	89	19	0.32	60	128
8	8,000	1,629	3,270	9.81	343	74,124
9	8,000	2,580	7,914	9.75	94	163,161
10	8,000	2,665	6,390	6.94	28	126,350

^aNumber of population members.

^bValues are rounded to the nearest whole number.

Once the selection of study populations was completed, correct values for population items were created by randomly seeding four different error rates into each population (0.0, 3.0, 6.0, and 9.0%). Error magnitudes were fixed at 20% of recorded value (i.e., a 20% tainting). The result was four different versions of each population. The number of population items seeded with error was a function of the population error rate and the fixed population size ($N = 8{,}000$). With 10 populations and four misstatement rates (0.0, 0.6, 1.2, and 1.8%), a total of 40 unique populations were created for use in study simulations.
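The seeding step can be sketched as follows. This is a hedged illustration rather than the study's simulation code: the population is synthetic, the function name `seed_errors` is hypothetical, and the sketch assumes that seeded errors are overstatements (correct value below recorded value), which the text does not state explicitly.

```python
import random

TAINT = 0.20  # error magnitude as a fraction of recorded value (from the text)


def seed_errors(recorded, error_rate, taint=TAINT, seed=0):
    """Return correct values after tainting a random error_rate share of items."""
    rng = random.Random(seed)
    n_errors = round(error_rate * len(recorded))
    hit = set(rng.sample(range(len(recorded)), n_errors))
    # ASSUMPTION: errors are overstatements, so correct = recorded * (1 - taint).
    return [x * (1 - taint) if i in hit else x for i, x in enumerate(recorded)]


# HYPOTHETICAL population of 8,000 positive recorded values.
pop_rng = random.Random(42)
recorded = [round(pop_rng.lognormvariate(5, 1), 2) + 1 for _ in range(8000)]

versions = {rate: seed_errors(recorded, rate) for rate in (0.0, 0.03, 0.06, 0.09)}
counts = {
    rate: sum(c != x for c, x in zip(correct, recorded))
    for rate, correct in versions.items()
}
realized = {rate: 1 - sum(correct) / sum(recorded) for rate, correct in versions.items()}

for rate in versions:
    print(rate, counts[rate], realized[rate])
```

With errors seeded at random across items, the realized monetary misstatement rate equals the error rate times the tainting (for example, 3.0% × 20% = 0.6%) only in expectation; any single seeded version can deviate from that target.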

Sample Size

Equations (4) and (6) indicate that sample size is a causal factor in determining SMPU precision when estimating a population total value. As sample size increases, the magnitude of precision ( $P$ ) decreases (estimates become more precise assuming no other changes). To ensure that variations in sample size did not affect study results, the real sample size of each application, measured in original population elements, was fixed at $n_{c b 0} = 200$ in all simulations. While other values are plausible, this value is realistic for a conventional SMPU monetary estimation application and consistent with the sample sizes used in prior audit sampling studies (Duke et al., 1985; Neter & Loebbecke, 1975) and an SMPU sampling case study presented in the AICPA’s Audit Sampling Guide (AICPA, 2019).¹⁷ In our simulations, the fixed real sample size of $n_{c b 0} = 200$ yielded an average achieved precision of approximately 4.7%.

Our sample size of $n_{c b 0} = 200$ is substantially larger than sample sizes reported in Durney et al. (2014) and Christensen et al. (2015). Although these studies do not report the objectives of the sampling applications reviewed, they are likely hypothesis tests as these tests are more common in financial statement audits than monetary estimation applications. Our use of a larger sample size may well be attributable to the fact that our study is focused on critical monetary estimation applications rather than audit hypothesis tests. In critical monetary estimation applications the objective is to independently estimate the total value of a population (AICPA, 2019, §7.06). These applications draw assurance from only the sample result and are typically based on classical variables sampling (AICPA, 2019, §7.06). In contrast, the objective of an audit hypothesis test is to determine if the account is materially misstated. Hypothesis tests typically draw assurance from multiple sources of evidence (AICPA, 2019, §2.06) and rely on highly efficient monetary unit sampling techniques (AICPA, 2019, §6.04). Both of these factors lead to smaller sample sizes for hypothesis tests. These differences in application purpose, sources of assurance, and sampling technique may account for the larger sample size used in our study.

Sample Selection Procedures

Sample selection procedures began by stratifying each study population based on recorded values using the Dalenius-Hodges method of stratification and equal-width class intervals (Cochran, 1977; Dalenius & Hodges, 1959). Once strata boundaries were determined, strata recorded values were used to estimate strata standard deviations and the total real sample size ( $n_{c b 0} = 200$ ) was allocated to the strata using Neyman (1934) allocation. In each sampling application, a minimum stratum sample size of five sampling units was imposed. For applications not using counterbalancing, the minimum stratum sample size was measured in original population elements. For applications using counterbalancing, the minimum stratum sample size was measured in counterbalanced units. Within each stratum, a simple random sample was selected and correct values of sample items were determined. For each sample selected, correct values of sample items were used along with equations (4) and (5) to compute SMPU estimator standard deviations.
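The stratification and allocation steps can be sketched as follows. This is a simplified illustration of the cumulative square-root-of-frequency rule and of Neyman allocation with a minimum stratum sample size, not the study's implementation. The number of classes, the function names, and the synthetic population are assumptions, and the rounding used here means the allocated sizes may not sum exactly to the target.

```python
import bisect
import math
from statistics import pstdev


def cum_sqrt_f_boundaries(values, n_strata, n_classes=100):
    """Stratum boundaries from equal-width classes and cumulative sqrt(frequency)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_classes
    freq = [0] * n_classes
    for v in values:
        freq[min(int((v - lo) / width), n_classes - 1)] += 1
    step = sum(math.sqrt(f) for f in freq) / n_strata
    boundaries, running, target = [], 0.0, step
    for k, f in enumerate(freq):
        running += math.sqrt(f)
        if f > 0 and running >= target and len(boundaries) < n_strata - 1:
            boundaries.append(lo + (k + 1) * width)  # upper edge of class k
            target += step
    return boundaries


def stratify(values, boundaries):
    """Assign each value to a stratum; a value equal to a boundary goes to the upper stratum."""
    strata = [[] for _ in range(len(boundaries) + 1)]
    for v in values:
        strata[bisect.bisect_right(boundaries, v)].append(v)
    return strata


def neyman_allocation(strata, n_total, n_min=5):
    """Allocate n_total in proportion to N_h * S_h, with a minimum of n_min per stratum."""
    weights = [len(s) * pstdev(s) for s in strata]
    total_weight = sum(weights)
    sizes = [max(n_min, round(n_total * w / total_weight)) for w in weights]
    return [min(size, len(s)) for size, s in zip(sizes, strata)]


# HYPOTHETICAL right-skewed population of recorded values.
recorded = [float(i * i) for i in range(1, 1001)]
boundaries = cum_sqrt_f_boundaries(recorded, n_strata=3)
strata = stratify(recorded, boundaries)
allocation = neyman_allocation(strata, n_total=200)
print(boundaries, [len(s) for s in strata], allocation)
```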

Response Measure

Within each of the 36 design cells, 10,000 stratified random samples were selected for each population. For each of these 10,000 samples, SMPU precision was computed using estimator standard deviations derived from sample results and equation (6). A Student’s t-value was used for the multiplier $t$ in equation (6) with degrees of freedom determined using the Satterthwaite approximation (Cochran, 1977). SMPU precision for these 10,000 samples was averaged and used as the study response measure for the particular population subject to sampling. With 10 populations in each of the 36 design cells, this resulted in 10 SMPU mean precision values in each design cell for a total of 360 observations in the data set.
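A minimal sketch of the response-measure computation for a single sample is given below. The standard error follows the form shown in equations (12) and (13). The effective degrees of freedom use the Satterthwaite-type approximation for stratified samples as we understand it from Cochran (1977); that formula is not reproduced in this paper, so it should be read as an assumption. Function names and input values are hypothetical, and the t-value is supplied by the caller.

```python
import math


def smpu_standard_error(N, n, s2):
    """sqrt(sum of N_h (N_h - n_h) s_h^2 / n_h) for an estimated population total."""
    return math.sqrt(sum(Nh * (Nh - nh) * s / nh for Nh, nh, s in zip(N, n, s2)))


def satterthwaite_df(N, n, s2):
    """ASSUMED form: (sum g_h s_h^2)^2 / sum(g_h^2 s_h^4 / (n_h - 1)), g_h = N_h (N_h - n_h) / n_h."""
    terms = [Nh * (Nh - nh) / nh * s for Nh, nh, s in zip(N, n, s2)]
    numerator = sum(terms) ** 2
    denominator = sum(term ** 2 / (nh - 1) for term, nh in zip(terms, n))
    return numerator / denominator


def smpu_precision(t_value, N, n, s2):
    """Precision P = t * standard error, with t chosen by the caller for the desired confidence."""
    return t_value * smpu_standard_error(N, n, s2)


# HYPOTHETICAL two-stratum sample with equal strata, where the effective df reduces to n - L.
N, n, s2 = [100, 100], [10, 10], [4.0, 4.0]
print(smpu_standard_error(N, n, s2), satterthwaite_df(N, n, s2))
```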

Primary Statistical Analysis

Response measures were analyzed as a 3 (Counterbalancing Applications) × 3 (Number of Strata) × 4 (Misstatement Rate) factorial repeated-measures ANOVA. All factors were treated as fixed effects and testing errors were determined by reference to Gamst et al. (2008) and Keppel (1991). A statistical test was considered significant when the resulting p-value was 0.05 or less. Orthogonal contrasts were used to test study hypotheses. To judge the practical significance of statistically significant model effects, we followed the suggestions of Keppel (1991) and Keren and Lewis (1979), and report partial omega squared values ( $ω^{2}$ ) for each model term. This measure of effect size is a function of the treatment effect and the error component, and is unaffected by other experimental manipulations. Formulas for the computation of $ω^{2}$ in a three-way repeated-measures design were obtained from Kroes and Finley (2023).

Supplemental Statistical Analysis

Because the study utilized a small number of actual inventory populations, the ANOVA statistical analysis was expected to have limited power. To address this limitation, supplemental analyses using Wilcoxon Rank-Sum nonparametric tests were executed for each combination of population, seeded Misstatement Rate, and number of strata used. For each of the resulting 120 sampling applications, a Wilcoxon test was performed for each of the three pair-wise combinations of Counterbalancing Applications (cb0 vs. cb1, cb0 vs. cb2, and cb1 vs. cb2) comparing SMPU precision values.¹⁸ Each individual Wilcoxon test utilized 10,000 SMPU precision values for each of the two levels of Counterbalancing Applications (20,000 observations in total). In total, 360 Wilcoxon tests were executed. Given the large number of tests, to limit experiment-wise sampling risk, a Wilcoxon statistical test was considered significant when the resulting p-value was 0.00014 (0.05 ÷ 360) or less.
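The test counts and the adjusted significance threshold described above can be reproduced with a few lines of arithmetic. The sketch below is illustrative and uses hypothetical variable names.

```python
from itertools import combinations

n_populations, n_misstatement_rates, n_strata_levels = 10, 4, 3
treatments = ["cb0", "cb1", "cb2"]

applications = n_populations * n_misstatement_rates * n_strata_levels
pairs = list(combinations(treatments, 2))   # cb0 vs. cb1, cb0 vs. cb2, cb1 vs. cb2
n_tests = applications * len(pairs)
threshold = 0.05 / n_tests                  # experiment-wise adjustment

print(applications, n_tests, round(threshold, 5))  # 120 360 0.00014
```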

Results

Mean SMPU Precision Values

Panel A of Table 5 reports mean precision values ( $\bar{P}$ ) for each level of counterbalancing while Panel B reports the percentage change in these mean values for the comparisons referenced in study hypotheses. Information presented in Panel A indicates that, on average, SMPU with second-order counterbalancing (cb2) provided the best (smallest) mean precision (231,347), SMPU with first-order counterbalancing (cb1) provided the second-best mean precision (418,316), and SMPU without counterbalancing (cb0) provided the worst (largest) mean precision (868,313). Consistent with the directional predictions of study hypotheses, Panel B reports that:

• SMPU with first-order counterbalancing yielded a 52% reduction (improvement) in mean precision over SMPU without counterbalancing (Hypothesis H1).¹⁹

• SMPU with second-order counterbalancing yielded a 73% reduction (improvement) in mean precision over SMPU without counterbalancing (Hypothesis H2).

• SMPU with second-order counterbalancing yielded a 45% reduction (improvement) in mean precision over SMPU with first-order counterbalancing (Hypothesis H3).

Table 5.

SMPU Mean Precision Values ( $\bar{P}$ ) and Percentage Change in Mean Precision Values

Computational procedure	Panel A: SMPU mean precision ( $\bar{P}$ )			Panel B: Percentage change in mean precision ( $\bar{P}$ )
	cb0 ^a	cb1 ^b	cb2 ^c	H1: ${\bar{P}}_{c b 1}$ < ${\bar{P}}_{c b 0}$	H2: ${\bar{P}}_{c b 2}$ < ${\bar{P}}_{c b 0}$	H3: ${\bar{P}}_{c b 2}$ < ${\bar{P}}_{c b 1}$
Overall	868,313	418,316	231,347	−52%	−73%	−45%
By number of strata (L)
L = 3	1,367,204	776,561	455,335	−43%	−67%	−41%
L = 5	728,840	321,647	155,881	−56%	−79%	−52%
L = 7	508,894	156,740	82,826	−69%	−84%	−47%
By misstatement rate (R)
R = 0.0%	869,073	403,691	238,317	−54%	−73%	−41%
R = 0.6%	869,268	411,480	231,375	−53%	−73%	−44%
R = 1.2%	868,470	424,310	229,886	−51%	−74%	−46%
R = 1.8%	866,440	433,783	225,811	−50%	−74%	−48%

^aSMPU with no counterbalancing (i.e., conventional SMPU).

^bSMPU with first-order counterbalancing.

^cSMPU with second-order counterbalancing.
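The Panel B percentages can be recomputed from the Panel A means. The short sketch below does so for every row of Table 5; the dictionary layout and function name are illustrative.

```python
# Panel A of Table 5: mean precision for (cb0, cb1, cb2) in each row.
panel_a = {
    "Overall": (868_313, 418_316, 231_347),
    "L = 3": (1_367_204, 776_561, 455_335),
    "L = 5": (728_840, 321_647, 155_881),
    "L = 7": (508_894, 156_740, 82_826),
    "R = 0.0%": (869_073, 403_691, 238_317),
    "R = 0.6%": (869_268, 411_480, 231_375),
    "R = 1.2%": (868_470, 424_310, 229_886),
    "R = 1.8%": (866_440, 433_783, 225_811),
}


def pct_change(new, old):
    """Percentage change in mean precision, rounded to the nearest whole percent."""
    return round(100 * (new / old - 1))


# Panel B columns: H1 (cb1 vs. cb0), H2 (cb2 vs. cb0), H3 (cb2 vs. cb1).
panel_b = {
    row: (pct_change(cb1, cb0), pct_change(cb2, cb0), pct_change(cb2, cb1))
    for row, (cb0, cb1, cb2) in panel_a.items()
}

for row, changes in panel_b.items():
    print(row, changes)
```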

Panel A of Table 5 also reports mean precision values for each level of counterbalancing by (1) Number of Strata and (2) Misstatement Rate. This information indicates that, although the patterns of improvement varied (suggesting the presence of interactions), in all cases second-order counterbalancing (cb2) provided the best mean precision, first-order counterbalancing (cb1) provided the second-best mean precision, and conventional SMPU (cb0) provided the worst mean precision. Information presented in Panel B of Table 5 indicates that precision improvements induced by counterbalancing increased as the number of sampling strata increased. However, information presented in Panel B indicates that precision improvements induced by counterbalancing were relatively stable regardless of the population misstatement rate.

Results presented in Table 5 are consistent with all three study hypotheses. Regardless of Number of Strata or Misstatement Rate, SMPU with first-order counterbalancing yielded better precision than conventional SMPU (Hypothesis H1), SMPU with second-order counterbalancing yielded better precision than conventional SMPU (Hypothesis H2), and SMPU with second-order counterbalancing yielded better precision than SMPU with first-order counterbalancing (Hypothesis H3). The next section presents results of tests performed to determine if differences in mean precision values reported in Table 5 are statistically significant.

Primary Statistical Analysis Results

Table 6 reports statistical test results comparing mean precision values across levels of each study main effect: Counterbalancing Applications (C), Number of Strata (L), and Misstatement Rate (R). Information reported in Table 6 confirms the presence of statistically significant differences with medium effect sizes²⁰ for Counterbalancing Applications (p-value = .0264; ω² = .07) and Number of Strata (p-value = .0260; ω² = .07). However, the test on Misstatement Rate was not statistically significant and the estimated effect size was below the threshold for a small effect size (p-value = .3351; ω² < .01).

Table 6.

ANOVA Test Results and Effect Size Estimates

Model source	df	F-test p-value^a	Partial ω²
Within experimental units
Main effects:
Effect: Counterbalancing applications (C)	2	.0264	.07^b
Testing error: C x A	18
Effect: Number of strata (L)	2	.0260	.07^b
Testing error: L x A	18
Effect: Misstatement rate (R)	3	.3351	<.01
Testing error: R x A	27
Interaction effects:
Effect: C x L	4	.0137	.01^c
Testing error: C x L x A	36
Effect: C x R	6	.0039	<.01
Testing error: C x R x A	54
Effect: L x R	6	.0002	<.01
Testing error: L x R x A	54
Effect: C x L x R	12	.0096	<.01
Testing error: C x L x R x A	108
Between experimental units
Accounting population (A)	9
Total	359

^aA p-value of .05 or less is considered statistically significant.

^bMedium effect size = .06 to .13 (see Cohen, 1988, pages 286-287).

^cSmall effect size = .01 to .05 (see Cohen, 1988, pages 286-287).
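The degrees of freedom in Table 6 follow from the number of levels of each factor and the number of populations. The sketch below reproduces them under the usual repeated-measures layout, in which each effect is tested against its interaction with the accounting population (A); the variable names are illustrative.

```python
from itertools import combinations
from math import prod

levels = {"C": 3, "L": 3, "R": 4}   # factor levels
n_populations = 10                   # experimental units (A)

df = {}
for order in (1, 2, 3):
    for combo in combinations("CLR", order):
        effect_df = prod(levels[f] - 1 for f in combo)
        name = " x ".join(combo)
        df[name] = effect_df                                  # effect
        df[name + " x A"] = effect_df * (n_populations - 1)   # its testing error
df["A"] = n_populations - 1                                   # between experimental units

total_df = sum(df.values())
print(df)
print(total_df)  # 359
```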

The statistically significant result for Counterbalancing Applications in Table 6 indicates the presence of at least one significant difference among the manipulations, but does not directly address the specific comparisons identified in study hypotheses. To test study hypotheses, orthogonal contrasts were performed comparing mean precision for: (1) SMPU with no counterbalancing (cb0) versus SMPU with first-order counterbalancing (cb1), (2) SMPU with no counterbalancing (cb0) versus SMPU with second-order counterbalancing (cb2), and (3) SMPU with first-order counterbalancing (cb1) versus SMPU with second-order counterbalancing (cb2). The resulting p-values for these contrasts were .0545, .0093, and .4040, respectively. These test results confirm that differences in mean precision for conventional SMPU with no counterbalancing versus SMPU with second-order counterbalancing were statistically significant. However, no statistically significant differences were detected in the tests comparing mean SMPU precision for: (1) SMPU with no counterbalancing versus SMPU with first-order counterbalancing and (2) SMPU with first-order counterbalancing versus SMPU with second-order counterbalancing.

Table 6 also reports that all four tests for interaction between study main effects were statistically significant, with p-values less than 0.05. However, all four effect sizes were small (partial ω² = .01 or less). These test results indicate the patterns of improvement in precision attributable to counterbalancing exhibited minor variations depending on the number of sampling strata and the population misstatement rate. However, the magnitudes of these variations appear to be inconsequential for the specific manipulations tested in this study.

Supplemental Statistical Analysis Results

The fact that SMPU mean precision values in Table 5 varied dramatically with the extent of counterbalancing but two of the three orthogonal contrasts used to test study hypotheses were not statistically significant suggests a lack of statistical power. To investigate this issue, a supplemental statistical analysis consisting of 360 individual Wilcoxon Rank-Sum tests was performed. Each test included a total of 20,000 individual SMPU precision values. The null hypothesis in each of these tests is that SMPU precision did not vary. Of the 120 tests comparing SMPU precision without counterbalancing (cb0) versus SMPU precision with first-order counterbalancing (cb1), all were statistically significant with p-values ≤ 0.00001. For the 120 tests comparing SMPU precision without counterbalancing (cb0) versus SMPU precision with second-order counterbalancing (cb2), all were statistically significant with p-values ≤ 0.00001. Finally, for the 120 tests comparing SMPU precision with first-order counterbalancing (cb1) versus SMPU precision using second-order counterbalancing (cb2), 116 of these tests were statistically significant with p-values ≤ 0.00001 but four of these tests did not meet the threshold for statistical significance (p-value ≤ 0.00014).

Support for Study Hypotheses

Given the theoretical predictions provided by equations (18) and (23), the pattern of mean precision values reported in Table 5, and the combined results of the primary and supplementary statistical analyses, Hypothesis H1 and Hypothesis H2 are clearly supported. However, while results mostly supported Hypothesis H3, in approximately 3% of supplementary tests second-order counterbalancing provided no incremental benefit over first-order counterbalancing.

Discussion

Study Findings

Our study confirms that counterbalancing can materially improve the precision of SMPU monetary estimates in low-error populations. In study simulations, compared to SMPU without counterbalancing, one application of counterbalancing produced a 52% improvement in mean precision while a second application of counterbalancing produced a 73% improvement in mean precision. These improvements increased in magnitude as the number of sampling strata increased, but were relatively stable over the population misstatement rates tested. Statistical tests showed that both first-order and second-order counterbalancing improved precision relative to SMPU without counterbalancing. However, in a small number of cases, SMPU with second-order counterbalancing produced no improvement in precision when compared to SMPU with first-order counterbalancing. These results suggest a declining marginal benefit for counterbalancing, with first-order counterbalancing providing the highest likelihood of improved precision, followed by second-order counterbalancing, followed by higher-order applications of counterbalancing.

Implications for Accounting and Auditing Practice

When a critical SMPU monetary estimation application is planned, the accountant or auditor specifies both desired precision and desired reliability (Arkin, 1963; Roberts, 1978) because these factors directly influence the quality and decision usefulness of the resulting monetary estimate. As indicated by equation (6), achieved estimation precision is a function of the multiplier t and the standard deviation of the monetary estimate, while achieved reliability is determined by the multiplier t used in computing achieved precision. In most strata, assuming no change in real stratum sample size, Intrastratum Counterbalancing yields a smaller stratum standard deviation than conventional SMPU. In all strata, Intrastratum Counterbalancing reduces the number of strata sampling units compared to SMPU without counterbalancing. These two effects produce a smaller standard deviation for the estimated total correct value of the population. As a result, counterbalancing improves application precision, reliability, or both, and thereby improves one or both of the quality metrics for the critical monetary estimate.
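
The relationship between the multiplier t, the estimator standard deviation, and achieved precision can be illustrated with a minimal sketch. It assumes the textbook stratified variance of an estimated total with finite population correction (Cochran, 1977) as a stand-in for the study's equations. It also assumes that first-order counterbalancing halves both the number of stratum sampling units and the nominal stratum sample size. The function name and all stratum figures are hypothetical, and t is held fixed for simplicity.

```python
import math

def smpu_precision(strata, t):
    """Achieved precision = t * standard deviation of the estimated total.

    strata: list of (N_h, n_h, s_h) tuples, where N_h is the number of
    sampling units, n_h the nominal sample size, and s_h the sample
    standard deviation of the sampling units in stratum h.
    """
    variance = sum(
        N * N * (1 - n / N) * s * s / n   # textbook stratum contribution
        for N, n, s in strata
    )
    return t * math.sqrt(variance)

# Hypothetical two-stratum application; t held fixed for simplicity.
t = 1.96
conventional = [(1000, 40, 50.0), (400, 40, 300.0)]
# Assumed effect of first-order counterbalancing: half the sampling units,
# half the nominal sample (same original elements examined), smaller s_h.
counterbalanced = [(500, 20, 30.0), (200, 20, 180.0)]

print(smpu_precision(conventional, t))
print(smpu_precision(counterbalanced, t))
```

With these hypothetical inputs the counterbalanced design yields the smaller (tighter) precision value, because both the unit counts and the stratum standard deviations decline.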

For external auditors, improved precision and/or reliability for critical monetary estimates should improve financial reporting quality and lead to improved audit quality (DeFond & Zhang, 2014; Francis, 2024). Outside of external auditing, the use of counterbalancing by industry accountants, governmental auditors, and internal auditors in SMPU estimation applications should also produce quality improvements. Alternatively, Intrastratum Counterbalancing might be used to reduce the real sample size while maintaining a specified precision and reliability. In these applications, the use of counterbalancing will improve efficiency rather than quality.

Although counterbalancing can significantly improve SMPU estimation precision, when statistical sampling is used in hypothesis testing, auditors will likely continue to rely on monetary unit sampling (MUS) techniques. For populations with very low error rates where few, if any, errors are expected in sample results, MUS sample sizes are likely to be smaller than those required by SMPU even with the use of counterbalancing. The advantage of MUS for hypothesis testing arises because MUS sample sizes are insensitive to the population size and variance. However, MUS does not perform well when the auditor is testing for understatements, population recorded values include both positive and negative amounts, the population contains both understatements and overstatements, some errors in the population exceed 100% of recorded value, or more than a few errors are expected in the sample result (AICPA, 2019, §6.07). For example, MUS would not likely be used when auditing inventory in a financial statement audit where the inventory system is poorly controlled and has historically yielded multiple overstatement and understatement errors. Similarly, in a forensic audit investigating loan fraud where a significant number of fraudulent loans are suspected, MUS would not likely be used.

When conditions indicate use of MUS is not appropriate, one of the classical variables sampling techniques should be considered. Classical techniques include SMPU, difference, ratio, and regression estimation. Research by Neter and Loebbecke (1975) indicates that SMPU provides reliable estimates regardless of population error conditions, but generally yields inferior estimation precision compared to other classical variables sampling techniques. However, other classical variables sampling techniques require error rates approaching 30% or higher to yield reliable estimates (Neter & Loebbecke, 1975). In circumstances where MUS techniques are not appropriate and the auditor is uncertain about population characteristics, SMPU is a safe choice and SMPU with counterbalancing is likely to yield improved estimation precision.

Planning SMPU Applications With Counterbalancing

Because Intrastratum Counterbalancing is a data transformation procedure rather than a modification of conventional SMPU, existing recommendations for the design of SMPU applications remain unchanged but must be applied in light of the characteristics of the counterbalanced strata. As reported in Hall et al. (2024), SMPU estimation efficiency is directly related to the number of sampling strata, inversely related to the minimum stratum sample size, and likely to be optimized when stratum boundaries are selected using an iterative search-based method or the well-known Dalenius-Hodges method. When Intrastratum Counterbalancing is used with SMPU, the application should be designed using the characteristics of the transformed population. Those characteristics include a reduced number of strata sampling units and, typically, reduced variation in strata sampling units.

When SMPU is used, the interplay of minimum stratum sample size and total sample size may limit the feasible number of sampling strata. While this constraint may affect any SMPU application (with or without counterbalancing), it is more likely to exist when counterbalancing is used given the smaller nominal sample size. Plausibly, there will be applications where counterbalancing does not affect how many sampling strata can be used. But it is also plausible that, for some applications, the use of counterbalancing will necessitate a reduction in the number of sampling strata. These circumstances will depend on the specific characteristics of the population and sample planning parameters.
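
The feasibility constraint can be sketched as follows. The sketch assumes that each application of counterbalancing pairs existing sampling units, so that $a$-order counterbalancing divides the nominal sample size by 2 raised to the power $a$. It also holds the minimum stratum sample size constant across orders. Both are simplifying assumptions, and the function name and figures are hypothetical.

```python
def max_feasible_strata(real_sample_size, min_stratum_sample, order=0):
    """Largest number of strata that the nominal sample size can support.

    Assumes each counterbalanced sampling unit contains 2**order original
    elements, so the nominal sample size is real_sample_size // 2**order.
    """
    nominal_sample_size = real_sample_size // (2 ** order)
    return nominal_sample_size // min_stratum_sample

# Hypothetical plan: 240 original elements examined, minimum of 20 units per stratum.
for order in (0, 1, 2):
    print(order, max_feasible_strata(240, 20, order))
```

Under these assumptions the ceiling on the number of strata falls from 12 to 6 to 3 as the order rises, which only binds when the planned number of strata exceeds the ceiling.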

In light of the reduced variation in strata sampling units, the minimum stratum sample size needed to reliably estimate the variation in strata sampling units will likely decline. However, in the absence of research on this issue, there is no evidential basis for specifying the degree of reduction. For low-error rate populations where $a$-order counterbalancing is applied, and assuming an approximately linear relationship between stratum variation and the minimum stratum sample size needed to reliably estimate stratum variation, we suggest the minimum stratum sample size be proportionally reduced to match the proportionate reduction in the stratum standard deviation created by counterbalancing. This procedure should be applied on a stratum-by-stratum basis with the revised minimum stratum sample size computed as:

$$m_{h^{cba}} = m_{h^{cb0}} \left( \frac{s_{h_{x}^{cba}}}{s_{h_{x}^{cb0}}} \right) \tag{25}$$

In circumstances where the accountant or auditor prefers to use a constant minimum stratum sample size across all sampling strata, the largest value of $m_{h^{c b a}}$ should be used.
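
A minimal sketch of equation (25) follows. The stratum values are hypothetical, and rounding each result up to a whole sampling unit is an assumption added here rather than part of the equation.

```python
import math

def revised_min_stratum_sample(m_cb0, s_cb0, s_cba):
    """Equation (25): scale the original minimum stratum sample size by the
    ratio of the counterbalanced to the original stratum standard deviation."""
    return m_cb0 * s_cba / s_cb0

# Hypothetical strata: (original minimum, s without cb, s with a-order cb)
strata = [(20, 50.0, 32.0), (20, 300.0, 180.0), (20, 900.0, 720.0)]

# Assumption: round up so each minimum is a whole number of sampling units.
revised = [
    math.ceil(revised_min_stratum_sample(m, s0, sa)) for m, s0, sa in strata
]

# A single constant minimum across strata uses the largest revised value.
constant_minimum = max(revised)

print(revised, constant_minimum)
```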

When planning an SMPU application, the accountant or auditor can assess whether counterbalancing will likely improve estimation precision by comparing results for Appendix equation (A8) (SMPU without counterbalancing) with results for equation (A16) or (A22) (SMPU with counterbalancing). When equation (A16) (or (A22)) yields a smaller value than equation (A8), counterbalancing will likely improve SMPU estimation precision. In some counterbalancing applications, the number of sampling strata may need to be reduced to ensure a feasible application or to ensure the minimum stratum sample size summed over all sampling strata (i.e., $\sum_{h} m_{h^{cba}}$) does not degrade efficiency of the sample allocation process.²¹ When these circumstances arise, equations (A8) and (A16) (or (A22)) may still be used to compare projected estimation precision, but the variable values must be based on the number of planned sampling strata. In equation (A8), this will be the original number of sampling strata. In equation (A16) (or (A22)), this will be the reduced number of sampling strata.

In some applications, counterbalancing will increase a stratum’s contribution to the estimator variance and adversely affect precision. This outcome was observed in about 3% of strata in our simulations. Identification of strata where counterbalancing will degrade precision can be made via use of Appendix equations (A6) and (A14) (or A20) on a stratum-by-stratum basis. When equation (A14) (or A20) yields a larger value than equation (A6), counterbalancing the stratum in question will likely increase the stratum’s contribution to the estimator variance and adversely affect estimator precision. For those sampling strata where counterbalancing will likely increase the estimator variance, counterbalancing should not be used.
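
The stratum-by-stratum screening described above might be sketched as follows. The Appendix equations are not reproduced here, so the sketch substitutes the textbook stratum contribution to the variance of an estimated total (Cochran, 1977), which is an assumption. The function names and stratum figures are hypothetical.

```python
def variance_contribution(N, n, s):
    """Textbook stratum contribution to the variance of the estimated total
    (stand-in for the Appendix equations, which are not reproduced here)."""
    return N * N * (1 - n / N) * s * s / n

def strata_to_counterbalance(strata):
    """Return labels of strata where counterbalancing lowers the contribution.

    strata maps a label to a pair of (N, n, s) tuples: the first without
    counterbalancing, the second with counterbalancing.
    """
    return [
        label
        for label, (plain, cb) in strata.items()
        if variance_contribution(*cb) < variance_contribution(*plain)
    ]

# Hypothetical strata; in stratum 2 counterbalancing inflates s enough to hurt.
strata = {
    "stratum 1": ((1000, 40, 50.0), (500, 20, 30.0)),
    "stratum 2": ((400, 40, 300.0), (200, 20, 450.0)),
}

selected = strata_to_counterbalance(strata)
print(selected)
```

In this hypothetical case only stratum 1 would be counterbalanced, and stratum 2 would retain its original sampling units.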

When counterbalancing is used in some sampling strata, and not used in other sampling strata, the application’s sampling units will be a mixture of counterbalanced units and noncounterbalanced units. This circumstance presents no practical or theoretical problem so long as, within each sampling stratum, every sampling unit has the same likelihood of inclusion in the stratum sample.

Implementation Issues and Limitations

Prior to the use of SMPU with counterbalancing, the population data set must be reviewed for problems and, as necessary, cleaned. This process includes addressing missing values, removing duplicates, correcting data format errors, and investigating outlier observations. In our simulations, although not an absolute requirement, we made manual adjustments to strata boundaries to ensure each stratum had an even number of sampling units. This simplified the process of counterbalancing.
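
To show why an even number of sampling units simplifies the process, the sketch below applies one plausible pairing rule: sort the stratum, pair the smallest remaining value with the largest, and sum each pair into a new sampling unit. This rule is an assumption for illustration only and should not be read as the exact procedure used in the study. The function name and data are hypothetical.

```python
import statistics

def counterbalance_once(values):
    """Pair the smallest remaining value with the largest and sum each pair.

    Hypothetical pairing rule for illustration; requires an even count so
    that no sampling unit is left unpaired.
    """
    if len(values) % 2 != 0:
        raise ValueError("stratum must contain an even number of sampling units")
    ordered = sorted(values)
    half = len(ordered) // 2
    return [ordered[i] + ordered[-1 - i] for i in range(half)]

# Hypothetical recorded values for one stratum (eight elements).
stratum = [100.0, 120.0, 150.0, 155.0, 170.0, 190.0, 210.0, 240.0]
first_order = counterbalance_once(stratum)        # four sampling units
second_order = counterbalance_once(first_order)   # two sampling units

print(first_order, statistics.stdev(first_order))
print(second_order)
```

Under this rule each application halves the number of sampling units, and a stratum with an odd count would need a boundary adjustment or special handling for the unpaired element.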

At present, due to the absence of software incorporating counterbalancing, application of Intrastratum Counterbalancing is a labor-intensive process.²² Accountants and auditors who consider using SMPU with counterbalancing will need to evaluate whether the improved estimation efficiency and/or reduced sampling cost justifies the cost of implementation. However, once Intrastratum Counterbalancing is incorporated into existing audit software, the process of creating efficient strata, identifying strata where counterbalancing is warranted, selecting strata samples, and projecting results should involve minimal cost and effort.

While counterbalancing improved SMPU estimation precision in this study, and the magnitudes of improvement were noteworthy, it is important to recognize that changes in estimation precision produced by counterbalancing will likely vary in practice. SMPU estimation precision is affected by multiple factors including number of sampling strata, sample size, population size, sampling fraction, desired confidence, sampling risk, and within-strata population variation. When counterbalancing is used, additional factors affecting precision include the degree to which counterbalancing affects: (1) within-strata population variation, (2) minimum stratum sample size, (3) number of feasible sampling strata, and (4) the multiplier $t$ in equation (6). Our study reports results for one set of circumstances and may not be representative of the results produced by a different set of circumstances.

Acknowledgment

We are grateful for the detailed and helpful comments of the anonymous reviewer.

ORCID iD

Thomas W. Hall

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors’ time on this work was supported by the University of Texas at Arlington and New Mexico State University.

Declaration of Conflicting Interests

The authors declared the following potential conflict of interest with respect to the research, authorship, and/or publication of this article: The University of Texas at Arlington filed a patent application establishing property rights for Intrastratum Counterbalancing.

Author Biographies

Thomas W. Hall is a Certified Public Accountant and serves as the Public Accounting Professor at the University of Texas at Arlington. He has been a faculty member at the University of Texas at Arlington since earning his PhD degree in accounting at Oklahoma State University in 1980. His research focuses on accounting measurement, auditing, and decision making. He has published in a variety of academic journals including The Accounting Review; Auditing: A Journal of Practice & Theory; Behavioral Research In Accounting; Journal of Accounting Research; Journal of Accounting, Auditing & Finance; Journal of Applied Psychology; and Organizational Behavior and Human Decision Processes.

Bethane Jo Pierce is a Certified Public Accountant and an accounting professor at the University of Texas at Arlington. She has been a faculty member at the University of Texas at Arlington since earning her PhD degree at the University of North Texas in 1987. Her research focuses on accounting measurement, auditing, and decision making. She has published in a variety of academic journals including Advances In Accounting; Auditing: A Journal of Practice & Theory; Behavioral Research In Accounting; Journal of Accounting, Auditing & Finance; and Journal of Management Accounting Research.

David M. Rosser is a Certified Public Accountant and has been an assistant professor of accounting at New Mexico State University since 2024. He previously served as an assistant professor of accounting at the University of Texas at Arlington and earned his PhD degree at the University of Arkansas in 2017. His research focuses on accounting measurement, audit risk assessment, and audit quality. He has published in a variety of academic journals including Auditing: A Journal of Practice & Theory; Accounting Horizons; and Journal of Accounting and Public Policy.

Appendix

References

Allen & Elder (2005). A longitudinal investigation of auditor error projection decisions. Auditing: A Journal of Practice & Theory, 24(2), 69–84. https://doi.org/10.2308/aud.2005.24.2.69

American Institute of Certified Public Accountants. (2019). Audit sampling: AICPA audit guide. AICPA.

American Institute of Certified Public Accountants. (2024). Uniform CPA examination blueprints. Ewing, NJ: AICPA. https://www.aicpa-cima.com/resources/article/learn-what-is-tested-on-the-cpa-exam

Arkin (1963). Handbook of sampling for auditing and accounting. McGraw-Hill.

Backer & Fertig (1958). Statistical sampling and the accounting curriculum. The Accounting Review, 33(3), 415–418. https://doi.org/10.2308/tar-7059513

Christensen, Elder, & Glover (2015). Behind the numbers: Insights into large audit firm sampling policies. Accounting Horizons, 29(1), 61–81. https://doi.org/10.2308/acch-50921

Cochran (1977). Sampling techniques (Third edition). John Wiley & Sons.

Cohen (1988). Statistical power analysis for the behavioral sciences (Second edition). Lawrence Erlbaum Associates.

Collyer & Enns (1987). Analysis of variance: The basic designs. Nelson-Hall, Inc.

Cyert & Davidson (1962). Statistical sampling for accounting information. Prentice-Hall Inc.

Dalenius & Hodges (1959). Minimum variance stratification. Journal of the American Statistical Association, 54(285), 88–101. https://doi.org/10.1080/01621459.1959.10501501

DeFond & Zhang (2014). A review of archival auditing research. Journal of Accounting and Economics, 58(2-3), 275–326. https://doi.org/10.1016/j.jacceco.2014.09.002

Duke, Leitch, & Neter (1985). Behavior of test statistics in the auditing environment: An empirical study. Studies in accounting research #23. American Accounting Association.

Durney, Elder, & Glover (2014). Field data on accounting error rates and audit sampling. Auditing: A Journal of Practice & Theory, 33(2), 79–110. https://doi.org/10.2308/ajpt-50669

Francis (2024). What exactly do we mean by audit quality? Accounting in Europe, 21(2), 123–133. https://doi.org/10.1080/17449480.2023.2247410

Gamst, Meyers, & Guarino (2008). Analysis of variance designs: A conceptual and computational approach with SPSS and SAS. Cambridge University Press.

Guy, Carmichael, & Whittington (2002). Audit sampling: An introduction (Fifth edition). John Wiley & Sons.

Hall, Hoogduin, Pierce, & Tsay (2024). Designing efficient stratified mean-per-unit sampling applications in accounting and auditing. Journal of Accounting, Auditing & Finance, 39(2), 434–455. https://doi.org/10.1177/0148558x211064205

Hill, Roth, & Arkin (1962). Sampling in auditing. The Ronald Press Company.

Hoel, Port, & Stone (1971). Introduction to probability theory. Houghton Mifflin Company.

Institute of Internal Auditors. (2025). CIA exam syllabus, part 2 – Practice of internal auditing. IIA. https://www.theiia.org/fr-ca/certifications/cia/exam-prep-resources/exam-syllabus/exam-syllabus-part-2/

Kaplan (1973). Statistical sampling in auditing with auxiliary information estimators. Journal of Accounting Research, 11(2), 238–258. https://doi.org/10.2307/2490191

Keppel (1991). Design and analysis: A researcher’s handbook. Prentice-Hall.

Keren & Lewis (1979). Partial omega squared for ANOVA designs. Educational and Psychological Measurement, 39(1), 119–128. https://doi.org/10.1177/001316447903900116

Kogan, Mayhew, & Vasarhelyi (2019). Audit data analytics research - An application of design science methodology. Accounting Horizons, 33(3), 69–73. https://doi.org/10.2308/acch-52459

Kroes & Finley (2023). Demystifying omega squared: Practical guidance for effect size in common analysis of variance designs. Psychological Methods, 30(July), 866–887. https://doi.org/10.1037/met0000581

Loebbecke & Neter (1975). Considerations in choosing statistical sampling procedures in auditing. Journal of Accounting Research, 13(2), 38–69. https://doi.org/10.2307/2490476

Lohr (1999). Sampling: Design and analysis. Duxbury Press.

Lowe, Bierstaker, Janvrin, & Jenkins (2018). Information technology in an audit context: Have the big 4 lost their advantage? Journal of Information Systems, 32(1), 87–107. https://doi.org/10.2308/isys-51794

Mahmoud, Henderson, Epprecht, & Woodall (2010). Estimating the standard deviation in quality-control applications. Journal of Quality Technology, 42(4), 348–357. https://doi.org/10.1080/00224065.2010.11917832

McGrath (2003). Empirical tests of stratum boundary methods in tax populations. In Proceedings of the American Statistical Association joint statistical meetings (pp. 2775–2781). Section on Survey Research Methods.

Neter, Kutner, Nachtsheim, & Wasserman (1996). Applied linear statistical models. WCB McGraw-Hill.

Neter & Loebbecke (1975). Behavior of major statistical estimators in sampling accounting populations. Auditing research monograph #2. American Institute of Certified Public Accountants.

Neyman (1934). On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection. Journal of the Royal Statistical Society, 97(4), 558–606. https://doi.org/10.2307/2342192

Pearson & Haines (1935). The use of range in place of standard deviation in small samples. Supplement to the Journal of the Royal Statistical Society, 2(1), 83–98. https://doi.org/10.2307/2983588

Public Company Accounting Oversight Board. (2020). Spotlight: Data and technology research update. PCAOB. https://pcaobus.org/Documents/Data-Technology-Project-Spotlight.pdf

Public Company Accounting Oversight Board. (2025). AS 2315: Audit sampling. PCAOB. https://pcaobus.org/oversight/standards/auditing-standards/details/AS2315

Roberts (1978). Statistical auditing. American Institute of Certified Public Accountants.

Roshwalb & Wright (1991). Using information in addition to book value in sample designs for inventory cost estimation. The Accounting Review, 66(2), 348–360. https://doi.org/10.2308/tar-9605070385

Rudmin (2010). Calculating the exact pooled variance. https://arxiv.org/pdf/1007.1012

Salijeni, Samsonova-Taddei, & Turley (2019). Big data and changes in audit technology: Contemplating a research agenda. Accounting and Business Research, 49(1), 95–119. https://doi.org/10.1080/00014788.2018.1459458

Simon, Bruce, & Blank (2019). Resampling stats for excel. Statistics.com, LLC.

Vance & Neter (1957). Statistical sampling for auditors and accountants. John Wiley & Sons.

Wilburn (1984). Practical statistical sampling for auditors. Marcel Dekker.

Yancey (2002). Statistical sampling in sales and use tax audits. Commerce Clearing House.