Optimal designs for asymmetric sigmoidal response curves in bioassays and immunoassays

Abstract

The 5-parameter logistic (5PL) model is frequently used to model and analyze responses from bioassays and immunoassays which can be skewed. Various types of optimal experimental designs for 2, 3 and 4-parameter logistic models have been reported but not for the more complicated 5PL model. We construct different types of optimal designs for studying various features of the 5PL model and show that commonly used designs in bioassays and immunoassays are generally inefficient for statistical inference. To facilitate use of such designs in practice, we create a user-friendly software package to generate various tailor-made optimal designs for the 5PL model and evaluate robustness properties of a design under a variation of criteria, model forms and misspecification in the nominal values of the model parameters.

Keywords

Approximate design, asymmetric calibration curves, D-optimal design, robust optimal design, toxicology

1 Introduction

The 3-parameter logistic (3PL) and 4-parameter logistic (4PL) models are widely used to capture a symmetric sigmoidal relationship between the response and dose concentration. Recent studies show that asymmetrical response curves are often observed in various bioassays and immunoassays.^1–4 For such asymmetrical sigmoidal response curves, the 3PL and 4PL models are inappropriate. Gottschalk and Dunn⁵ showed that the 5-parameter logistic (5PL) model is able to capture the asymmetric relationships adequately and produce more accurate inference for the assays compared to results using the 3PL or 4PL models.

Statistical inference for bioassays and immunoassays based on the 5PL model is not new. For example, Findlay and Dillard⁶ applied the 5PL model to fit the data for ligand binding assays and Feng et al.⁷ presented a Bayesian approach to fit the 5PL model using data from an enzyme-linked immunosorbent assay (ELISA). Dawn et al.^8,9 used a modified 5PL model (5PL-1P) to capture the asymmetry in mixture toxicity assessment and Cumberland¹⁰ discussed the choice between the 4PL and 5PL models for estimation purposes; see also model fitting issues using biological data for these models in Davis et al.¹¹ Another application of the 5PL model is Gottschalk and Dunn,¹² who applied the model for measuring parallelism and relative potency in biological applications.

The design of a scientific study plays a crucial role in the accuracy of the inference that follows. Many of the above studies for the 5PL model used between 5 and 10 design points evenly spaced on the log scale, with equal replication at each dose. This seems to be the current practice even though there is very little research in the literature to support such design choices. Manukyan and Rosenberger¹³ appear to be the first to find locally D-optimal designs for the 5PL model when the response is binary.

The 3, 4 and 5PL models are frequently used to describe sigmoidal response curves and use of a wrong model can produce inaccurate or wrong inference. For example, optimal designs for the 3PL model cannot estimate all parameters in the 5PL model and an optimal design for the 5PL model can perform poorly when the 3PL or 4PL model holds. The implemented design should therefore provide good efficiencies when there are misspecifications in the nominal values, mean response assumptions and under a variation of criteria. Designs that provide relatively high efficiencies under model misspecifications are robust.

In practice, a study often has several objectives, and some are more important than others. This calls for a multiple-objective optimal design that can deliver user-specified efficiencies commensurate with the importance of each objective. Such an optimal design is also appropriate when some parameters in the model are more interpretable than others. For instance, in the two-parameter Michaelis-Menten model widely used in the biological sciences, the Michaelis-Menten parameter is more interesting than the saturation parameter because it governs the velocity of the enzyme-substrate reaction. It follows that the user should devote more resources to estimating the more interesting parameter or parameters so that they are more accurately estimated. Research to date shows that a multiple-objective optimal design generally has higher overall efficiencies across all objectives than any of the single-objective optimal designs can provide; see, for example, Cook and Wong¹⁴ and Hyun and Wong.¹⁵ The former proposed a graphical approach to find dual-objective optimal designs using efficiency plots, and the latter gave a step-by-step approach to find 3-objective optimal designs for a nonlinear model. Of course, if the efficiencies sought under all the objectives are too high, a multiple-objective optimal design may not exist.

This paper has two aims. First, we focus on bioassay and immunoassay applications and find a variety of optimal designs for the 5PL model to accurately estimate (1) the model parameters or (2) a target dose such as the EC₅₀, the concentration at which the expected response is halfway between its minimum and maximum. Since model uncertainty is always an issue, we assess efficiencies of the optimal designs when the true model or the nominal values are misspecified. Additionally, we find robust D-optimal designs when the study has multiple objectives and the true model may be the 3PL, 4PL or 5PL model. Our second aim is to help practitioners implement optimal designs and evaluate other designs for the 5PL model using an R package. Because the 5PL model is an extension of the 3PL and 4PL models, our functions can also readily find various optimal designs for the latter models.

Section 2 describes the response curve, interprets each parameter in the 5PL model and derives the Fisher information matrix. Section 3 presents several types of locally optimal designs for the 5PL model and a robust D-optimal design that performs well for the 3PL, 4PL, and 5PL models for estimating model parameters. In Section 4, we propose an algorithm with R functions to search for all the optimal designs in this paper. Section 5 studies sensitivities of the locally D-optimal design for the 5PL model when there are various misspecifications in the model assumptions. In Section 6, we recommend an efficient design and show it outperforms currently used designs in immunoassays and bioassays. Section 7 concludes with a summary of our work and future directions.

2 Background

Let X be the user-selected compact design space from which the design points are chosen. Let $Y_{ij}$ be the continuous response from the jth replicate at the ith concentration level $x_{i} \in X$, where $j = 1, \dots, n_{i}$ and $i = 1, \dots, K$. Assume that we have resources to take a predetermined number of observations N so that $n_{1} + \dots + n_{K} = N$. Given a design criterion, the design questions are the optimal number of design points K, the optimal number of replicates $n_{i}$ at each point, and the location of each design point $x_{i}, i = 1, \dots, K$.

Let Θ be the vector of nominal values for the model parameters. Our statistical models have the form

Y_{ij} = f (x_{i}, Θ) + ɛ_{ij}, \quad j = 1, 2, \dots, n_{i}, \quad i = 1, 2, \dots, K, \quad n_{1} + \dots + n_{K} = N

(1)

where $f (x_{i}, Θ)$ is the mean response at $x_{i}$. The errors $ɛ_{ij}$ are independent and normally distributed with mean 0 and unknown variance $σ^{2}$.

We focus on approximate designs, or large sample designs, where we replace each proportion $n_{i} / N$ by a weight $w_{i}$ that need not be a multiple of $1 / N$. We denote such a design by $ξ = \{(x_{i}, w_{i})\}_{i = 1}^{K}$ where each $x_{i} \in X$, $w_{i} \in (0, 1)$ and $w_{1} + \dots + w_{K} = 1$. In dose–response experiments, optimal design issues concern the total number of concentrations to be used (K), the locations of these K concentration levels or design points $x_{i}, i = 1, 2, \dots, K$, and the proportions $w_{i}, i = 1, \dots, K$ of subjects to be allocated to each of these concentrations. Approximate designs can be studied under a unified framework and there are algorithms for finding many types of optimal designs. Formulas for such designs are available for many models and they facilitate studying properties of the optimal approximate designs. In addition, there are theoretical tools for verifying whether an approximate design is optimal among all designs, and the optimal approximate designs do not depend on the value of N by definition.

We measure the worth of a design by its Fisher information matrix. For the approximate design ξ, the normalized Fisher information matrix is

I (ξ; Θ) = \frac{1}{σ^{2}} \sum_{i = 1}^{K} w_{i} g (x_{i})^{⊤} g (x_{i})

(2)

where

g (x) = (\begin{matrix} \frac{\partial f (x, Θ)}{\partial θ_{1}}, & \frac{\partial f (x, Θ)}{\partial θ_{2}}, & \dots, & \frac{\partial f (x, Θ)}{\partial θ_{v}} \end{matrix})

and v is the number of model parameters. Since a ‘large’ information matrix is desirable for statistical inference, many optimality design criteria seek a design that makes this matrix as large as possible in different ways.

For the 5-parameter logistic (5PL) model, the mean response $f (x, Θ)$ is

f (x, Θ) = \frac{θ_{1} - θ_{4}}{[1 + {(\frac{θ_{3}}{x})}^{θ_{2}}]^{θ_{5}}} + θ_{4}

(3)

where θ₁ and θ₄ are the maximum and the minimum expected responses, respectively, θ₂ controls the stiffness of the response curve, θ₃ is the position of the transition region in concentration, and θ₅ is the asymmetric factor and takes a value greater than 0. The parameters θ₂ and θ₅ jointly control the slope of the response curve. Clearly, the 5PL model becomes the 4-parameter logistic (4PL) model when θ₅ takes the value of 1, and it becomes the 3-parameter logistic (3PL) model when θ₄ and θ₅ take the values of 0 and 1, respectively.
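As a quick numerical illustration of model (3), the following minimal Python sketch evaluates the 5PL mean response and checks that it coincides with the 4PL curve when θ₅ = 1. The function names `five_pl` and `four_pl` and the parameter values are illustrative choices of ours and are not part of the Opt5PL package.

```python
def five_pl(x, theta):
    """Mean response of the 5PL model (3) at concentration x > 0."""
    t1, t2, t3, t4, t5 = theta
    D = (t3 / x) ** t2
    return (t1 - t4) / (1.0 + D) ** t5 + t4


def four_pl(x, theta4):
    """4PL mean response, i.e. model (3) with theta5 fixed at 1."""
    t1, t2, t3, t4 = theta4
    return (t1 - t4) / (1.0 + (t3 / x) ** t2) + t4


# Illustrative nominal values; theta5 = 1 reproduces the 4PL curve.
theta = (1.0, 1.0, 1.0, 0.0, 1.0)
print(five_pl(2.0, theta), four_pl(2.0, theta[:4]))

# With theta5 != 1 the curve is no longer symmetric about x = theta3 on the
# log scale: for a symmetric curve the two responses below would add up to
# theta1 + theta4 = 1.
skewed = (1.0, 1.0, 1.0, 0.0, 2.0)
print(five_pl(0.5, skewed) + five_pl(2.0, skewed))
```

The second print statement makes the role of θ₅ concrete: the responses at two concentrations equally far from θ₃ on the log scale no longer mirror each other.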

For the 5PL model, the vector g(x) has components given by

\begin{matrix} \frac{\partial f (x, Θ)}{\partial θ_{1}} = (1 + D)^{- θ_{5}}; \\ \frac{\partial f (x, Θ)}{\partial θ_{2}} = - \frac{(θ_{1} - θ_{4}) θ_{5}}{θ_{2}} D (1 + D)^{- 1 - θ_{5}} \log (D); \\ \frac{\partial f (x, Θ)}{\partial θ_{3}} = - \frac{(θ_{1} - θ_{4}) θ_{2} θ_{5}}{θ_{3}} D (1 + D)^{- 1 - θ_{5}}; \\ \frac{\partial f (x, Θ)}{\partial θ_{4}} = 1 - (1 + D)^{- θ_{5}}; \\ \frac{\partial f (x, Θ)}{\partial θ_{5}} = - (θ_{1} - θ_{4}) (1 + D)^{- θ_{5}} \log (1 + D) \end{matrix}

where $D = (θ_{3} / x)^{θ_{2}}$. A direct calculation shows that the normalized Fisher information matrix (2) is $I (ξ; Θ) = A M (ξ; Θ) A^{⊤}$, where

A = \begin{matrix} (\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0 & \frac{- (θ_{1} - θ_{4}) θ_{5}}{θ_{2}} & 0 & 0 & 0 \\ 0 & 0 & \frac{- (θ_{1} - θ_{4}) θ_{2} θ_{5}}{θ_{3}} & 0 & 0 \\ - 1 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & - θ_{1} + θ_{4} \end{matrix}) \end{matrix}

M (ξ; Θ) = \frac{1}{σ^{2}} \sum_{i = 1}^{K} w_{i} g^{*} (D_{i})^{⊤} g^{*} (D_{i}), \quad ϕ (D_{i}) = (1 + D_{i})^{- θ_{5}}, \quad D_{i} = (θ_{3} / x_{i})^{θ_{2}}

and

g^{*} (D_{i}) = ϕ (D_{i}) (\begin{matrix} 1, & \frac{D_{i} log (D_{i})}{1 + D_{i}}, & \frac{D_{i}}{1 + D_{i}}, & ϕ {(D_{i})}^{- 1}, & log (1 + D_{i}) \end{matrix}) .

The matrix A does not contain any concentration level or weight, and this implies that maximizing some function of $M (ξ; Θ)$ is equivalent to maximizing the same function of $I (ξ; Θ)$. We observe that $M (ξ; Θ)$ contains only the three parameters θ₂, θ₃, and θ₅, and so any classical optimal design such as a D-, A-, c-, or D_s-optimal design for model (3) does not depend on the parameters θ₁ and θ₄.
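The gradient and the factorization through the matrix A can be checked numerically. The sketch below is ours and only illustrates the formulas of this section: it compares the analytic gradient g(x) with central finite differences and verifies that $g (x)^{⊤} = A g^{*} (D)^{⊤}$ at one arbitrary parameter vector. All function names and numerical values are assumptions made for illustration.

```python
import math


def five_pl(x, th):
    t1, t2, t3, t4, t5 = th
    return (t1 - t4) * (1.0 + (t3 / x) ** t2) ** (-t5) + t4


def grad_5pl(x, th):
    """Analytic gradient g(x) of the 5PL mean with respect to theta1..theta5."""
    t1, t2, t3, t4, t5 = th
    D = (t3 / x) ** t2
    phi = (1.0 + D) ** (-t5)
    return [
        phi,
        -(t1 - t4) * t5 / t2 * D * math.log(D) * phi / (1.0 + D),
        -(t1 - t4) * t2 * t5 / t3 * D * phi / (1.0 + D),
        1.0 - phi,
        -(t1 - t4) * phi * math.log(1.0 + D),
    ]


def g_star(x, th):
    """Vector g*(D); it involves theta2, theta3 and theta5 only."""
    _, t2, t3, _, t5 = th
    D = (t3 / x) ** t2
    phi = (1.0 + D) ** (-t5)
    return [phi, phi * D * math.log(D) / (1.0 + D), phi * D / (1.0 + D),
            1.0, phi * math.log(1.0 + D)]


def a_matrix(th):
    """Matrix A, which is free of concentrations and weights."""
    t1, t2, t3, t4, t5 = th
    return [
        [1.0, 0.0, 0.0, 0.0, 0.0],
        [0.0, -(t1 - t4) * t5 / t2, 0.0, 0.0, 0.0],
        [0.0, 0.0, -(t1 - t4) * t2 * t5 / t3, 0.0, 0.0],
        [-1.0, 0.0, 0.0, 1.0, 0.0],
        [0.0, 0.0, 0.0, 0.0, -(t1 - t4)],
    ]


def numeric_grad(x, th, h=1e-6):
    """Central finite differences, used only as an independent check."""
    out = []
    for k in range(5):
        up = list(th)
        dn = list(th)
        up[k] += h
        dn[k] -= h
        out.append((five_pl(x, up) - five_pl(x, dn)) / (2.0 * h))
    return out


theta = (2.0, 1.3, 0.8, 0.2, 1.7)   # arbitrary illustrative values
x = 0.5
g = grad_5pl(x, theta)
gs = g_star(x, theta)
A = a_matrix(theta)
g_from_A = [sum(A[i][j] * gs[j] for j in range(5)) for i in range(5)]
print(max(abs(a - b) for a, b in zip(g, numeric_grad(x, theta))))
print(max(abs(a - b) for a, b in zip(g, g_from_A)))
```

Both printed discrepancies should be at the level of rounding error. Changing θ₁ or θ₄ leaves `g_star` unchanged, which is the reason the optimal designs do not depend on these two parameters.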

3 Optimal designs

The optimal design depends on the objective or objectives of the study. They can vary from estimating all or some model parameters to predicting mean response at a location in the design space or minimizing the sum of elements in the covariance matrix. Frequently, the criterion is formulated as a convex function of the information matrix so that we can use an equivalence theorem to check if the design is optimal among all designs. The equivalence theorem is derived from the directional derivative consideration and is unique for each convex functional; see design monographs, such as Fedorov¹⁶ and Atkinson et al.¹⁷ However, equivalence theorems all have a similar form as an inequality with 0 on the right-hand side of the inequality. The function on the left-hand side of the inequality is frequently called the sensitivity function in the literature.

The information matrix for a nonlinear model depends on the model parameters and so our optimal design depends on the unknown parameters. Such designs are termed locally optimal.¹⁸ These optimal designs can be sensitive to the nominal values and so the nominal values must be selected carefully. However, locally optimal designs are the easiest to find and are important because they typically represent a first step to finding more complex designs.¹⁹ Two approaches that do not require a single best guess for the parameter values are the Bayesian approach and the minimax or maximin approach. The former requires a prior distribution for the parameters and the latter requires the user to specify a plausible region of possible values for all the parameters. Both methods generalize the concept of locally optimal designs, and both Bayesian and minimax types of optimal designs are much more difficult to find, theoretically or computationally, than locally optimal designs. For example, minimax optimal designs are found under a non-differentiable criterion and there are no effective algorithms for finding them for a general regression model. Chen et al.²⁰ provide examples of minimax and standardized minimax optimal designs, including showing how locally optimal designs are first determined before a standardized minimax optimal design is found. Recent work on maximin optimal designs, which are equivalent to minimax optimal designs, and on Bayesian optimal designs includes Coffey²¹ and McCallum and Bornkamp,²² respectively, among others. For reasons of space, these two approaches will not be discussed further here.

We now review several commonly used locally optimal designs in practice. In what follows, we use the terms locally optimal designs and optimal designs interchangeably when there is no room for confusion.

3.1 D-optimal designs for estimating Θ

D-optimal designs are the most appropriate when the interest in the study is to estimate the vector of model parameters Θ as accurately as possible. The D-optimal design ξ_D maximizes the determinant of the Fisher information matrix $| I (ξ; Θ) |$ over all designs on X. Equivalently, for fixed Θ, we want a design that minimizes the convex function $- \log | I (ξ; Θ) |$. The directional derivative of the D-optimality criterion leads to the sensitivity function $d_{D} (x, ξ) = g (x) I^{- 1} (ξ; Θ) g (x)^{⊤} - 5$, where 5 is the number of parameters in the 5PL model. The Equivalence Theorem states that a design ξ_D is D-optimal for the 5PL model if and only if

d_{D} (x, ξ_{D}) \leq 0

for all x in X with equality at the design points of ξ_D.
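The equivalence theorem suggests a simple numerical check: evaluate the sensitivity function on a fine grid and confirm that it never exceeds zero. The sketch below does this for an equally weighted five-point design at the nominal values Θ = (1, 1, 1, 0, 1). It is a minimal illustration under stated assumptions, not the Opt5PL implementation: σ² is set to 1 (it does not affect the design), the design points are read as natural logarithms of concentration, and all function names are ours.

```python
import math


def grad_5pl(x, th):
    """Gradient g(x) of the 5PL mean with respect to theta1..theta5."""
    t1, t2, t3, t4, t5 = th
    D = (t3 / x) ** t2
    phi = (1.0 + D) ** (-t5)
    return [
        phi,
        -(t1 - t4) * t5 / t2 * D * math.log(D) * phi / (1.0 + D),
        -(t1 - t4) * t2 * t5 / t3 * D * phi / (1.0 + D),
        1.0 - phi,
        -(t1 - t4) * phi * math.log(1.0 + D),
    ]


def invert(a):
    """Matrix inverse by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    m = [list(row) + [float(i == j) for j in range(n)] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        piv = m[c][c]
        m[c] = [v / piv for v in m[c]]
        for r in range(n):
            if r != c:
                f = m[r][c]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[c])]
    return [row[n:] for row in m]


def info_matrix(points, weights, th):
    """Normalized information matrix (2) with sigma^2 set to 1."""
    m = [[0.0] * 5 for _ in range(5)]
    for x, w in zip(points, weights):
        g = grad_5pl(x, th)
        for i in range(5):
            for j in range(5):
                m[i][j] += w * g[i] * g[j]
    return m


def d_sensitivity(x, inv_info, th):
    """d_D(x, xi) = g(x) I^{-1} g(x)^T - 5."""
    g = grad_5pl(x, th)
    return sum(g[i] * inv_info[i][j] * g[j] for i in range(5) for j in range(5)) - 5.0


theta = (1.0, 1.0, 1.0, 0.0, 1.0)
log_conc = [-5.00, -1.96, -0.15, 1.65, 5.00]     # illustrative design points
points = [math.exp(t) for t in log_conc]         # assumption: natural logarithms
weights = [0.2] * 5
inv_info = invert(info_matrix(points, weights, theta))

at_support = [d_sensitivity(x, inv_info, theta) for x in points]
grid = [math.exp(-5.0 + 0.05 * k) for k in range(201)]
worst = max(d_sensitivity(x, inv_info, theta) for x in grid)
print(at_support)
print(worst)
```

For any nonsingular design with exactly five support points, $g (x_{i}) I^{- 1} g (x_{i})^{⊤} = 1 / w_{i}$, so equal weights always give zero sensitivity at the support. The informative part of the check is therefore the grid maximum `worst`, which should not exceed zero (up to rounding of the design points) if the design is D-optimal under the stated assumptions.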

3.2 c-Optimal designs for estimating the EC₅₀

A c-optimal design is used to estimate a function of model parameters as accurately as possible by minimizing the asymptotic variance of its estimate. The EC₅₀ is the concentration producing a response that is half way between the expected maximum and minimum responses. A direct calculation shows

EC_{50} = \arg_{x} \{ f (x, Θ) = \frac{1}{2} (θ_{1} + θ_{4}) \} = θ_{3} (2^{1 / θ_{5}} - 1)^{- 1 / θ_{2}}

If $\hat{EC}_{50}$ is the maximum likelihood estimate of EC₅₀, the c-optimal design for estimating the EC₅₀, ξ_c, minimizes $Var (\hat{EC}_{50}) = EC'_{50} I^{- 1} (ξ; Θ) [EC'_{50}]^{⊤}$, where $EC'_{50}$ is the derivative of the EC₅₀ with respect to Θ, i.e.

EC'_{50} = (\begin{matrix} 0, & \frac{1}{θ_{2}^{2}} EC_{50} \log (2^{1 / θ_{5}} - 1), & \frac{1}{θ_{3}} EC_{50}, & 0, & \frac{1}{θ_{2} θ_{3}^{θ_{2}} θ_{5}^{2}} 2^{1 / θ_{5}} EC_{50}^{θ_{2} + 1} \log (2) \end{matrix})

Consideration of the directional derivative of the c-optimality criterion leads to the sensitivity function

d_{c} (x, ξ) = \frac{{(g (x) I^{- 1} (ξ; Θ) {[E C'_{50}]}^{⊤})}^{2}}{E C'_{50} I^{- 1} (ξ; Θ) {[E C'_{50}]}^{⊤}} - 1

and the Equivalence Theorem states that design ξ_c is c-optimal for the 5PL model if and only if

d_{c} (x, ξ_{c}) \leq 0

for all x in X with equality at the design points of ξ_c.
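The closed form for the EC₅₀ and its gradient can be verified directly. The following sketch, with function names and parameter values chosen by us for illustration, computes the EC₅₀ from θ₂, θ₃ and θ₅, confirms that the mean response there is halfway between θ₄ and θ₁, and evaluates the gradient used in the c-optimality criterion.

```python
import math


def five_pl(x, th):
    t1, t2, t3, t4, t5 = th
    return (t1 - t4) * (1.0 + (t3 / x) ** t2) ** (-t5) + t4


def ec50(th):
    """EC50 = theta3 * (2^(1/theta5) - 1)^(-1/theta2)."""
    _, t2, t3, _, t5 = th
    return t3 * (2.0 ** (1.0 / t5) - 1.0) ** (-1.0 / t2)


def ec50_grad(th):
    """Gradient of EC50 with respect to (theta1, ..., theta5)."""
    _, t2, t3, _, t5 = th
    e = ec50(th)
    two = 2.0 ** (1.0 / t5)
    return [
        0.0,
        e * math.log(two - 1.0) / t2 ** 2,
        e / t3,
        0.0,
        two * math.log(2.0) * e ** (t2 + 1.0) / (t2 * t3 ** t2 * t5 ** 2),
    ]


theta = (2.0, 1.3, 0.8, 0.2, 1.7)   # arbitrary illustrative values
e = ec50(theta)
# The response at the EC50 should be halfway between theta4 and theta1.
print(five_pl(e, theta), 0.5 * (theta[0] + theta[3]))
print(ec50_grad(theta))
```

When θ₅ = 1 the expression reduces to EC₅₀ = θ₃, which matches the usual interpretation of θ₃ in the 4PL model.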

3.3 D_s-optimal designs for estimating θ₅

A D_s-optimal design is used to estimate a subset of s model parameters accurately, treating the remaining parameters as nuisance parameters. If one assumes the 5PL model, one may wish to estimate θ₅ accurately since the 5PL model becomes the 4PL model when $θ_{5} = 1$. Clearly, if there is a single parameter of interest, D_s-optimality reduces to c-optimality. When there are multiple parameters of interest, the D_s-optimal design minimizes the generalized variance of the estimated parameters of interest. One proceeds by first partitioning the Fisher information matrix (2) suitably:

I (ξ; Θ) = \frac{1}{σ^{2}} (\begin{matrix} I_{11} (ξ; Θ) & I_{12} (ξ; Θ) \\ I_{21} (ξ; Θ) & I_{22} (ξ; Θ) \end{matrix})

(4)

where $I_{uv} (ξ; Θ) = \sum_{i = 1}^{K} w_{i} g_{u} (x_{i})^{⊤} g_{v} (x_{i})$ for $u, v = 1, 2$. For example, if estimating θ₅ in the 5PL model is the key objective, we let

g_{1} (x) = (\begin{matrix} \frac{\partial f (x, Θ)}{\partial θ_{1}}, & \frac{\partial f (x, Θ)}{\partial θ_{2}}, & \frac{\partial f (x, Θ)}{\partial θ_{3}}, & \frac{\partial f (x, Θ)}{\partial θ_{4}} \end{matrix}) and g_{2} (x) = (\begin{matrix} \frac{\partial f (x, Θ)}{\partial θ_{5}} \end{matrix})

It follows that the variance of the estimated θ₅ is proportional to

I^{22} (ξ; Θ) = \{ I_{22} (ξ; Θ) - I_{21} (ξ; Θ) I_{11}^{- 1} (ξ; Θ) I_{12} (ξ; Θ) \}^{- 1}

and the D_s-optimal design $ξ_{D_{s}}$ maximizes the determinant

| I_{22} (ξ; Θ) - I_{21} (ξ; Θ) I_{11}^{- 1} (ξ; Θ) I_{12} (ξ; Θ) | = \frac{| I (ξ; Θ) |}{| I_{11} (ξ; Θ) |}

In particular, the directional derivative of the D_s-optimality criterion for estimating a subset of s of the parameters leads to the sensitivity function

d_{D_{s}} (x, ξ) = {g (x) I^{- 1} (ξ; Θ) g {(x)}^{⊤} - g_{1} (x) I_{11}^{- 1} (ξ; Θ) g_{1} {(x)}^{⊤}} - s

In our case, since we have only one parameter of interest, we set s = 1. The Equivalence Theorem states that design $ξ_{D_{s}}$ is D_s-optimal if and only if

d_{D_{s}} (x, ξ_{D_{s}}) \leq 0

for all x in X with equality at the design points of $ξ_{D_{s}}$.
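The determinant identity behind the D_s-criterion is easy to confirm on a small example. The sketch below is illustrative only: it uses a hypothetical 3 × 3 positive definite matrix in place of a real 5PL information matrix and treats the last parameter as the one of interest (s = 1). The helper names are ours.

```python
def det(m):
    """Determinant by cofactor expansion (adequate for the small matrices here)."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def ds_criterion(info, s=1):
    """|I22 - I21 I11^{-1} I12|, computed as |I| / |I11| for the last s parameters."""
    k = len(info) - s
    i11 = [row[:k] for row in info[:k]]
    return det(info) / det(i11)


def ds_efficiency(info_design, info_optimal, s=1):
    """D_s-efficiency of a design relative to the D_s-optimal design."""
    return (ds_criterion(info_design, s) / ds_criterion(info_optimal, s)) ** (1.0 / s)


# Hypothetical information matrix; the last row and column play the role of theta5.
example = [[4.0, 1.0, 2.0],
           [1.0, 3.0, 0.0],
           [2.0, 0.0, 5.0]]

# Direct Schur complement: 5 - (2, 0) I11^{-1} (2, 0)^T = 5 - 12/11 = 43/11.
print(ds_criterion(example), 43.0 / 11.0)
```

Working with the ratio $| I | / | I_{11} |$ avoids forming $I_{11}^{- 1}$ explicitly, which is convenient when only the criterion value or the D_s-efficiency is needed.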

3.4 Design efficiency

We use design efficiency to measure the worth of a design relative to the optimum. This is a value between 0 and 1 and frequently it is simply the ratio of the criterion values of the two designs or some simple function thereof. The interpretation is that a design with efficiency r requires $100 (1 / r - 1) \%$ more observations to do as well as the optimal design. For example, when $e_{D} (ξ) = 0.5$ for a given design ξ, this design requires 100% more observations, that is, twice as many observations, to provide the same D-optimality criterion value as the D-optimal design. The performance of a design ξ for estimating Θ is given by its D-efficiency

e_{D} (ξ) = (\frac{| I (ξ; Θ) |}{| I (ξ_{D}; Θ) |})^{\frac{1}{5}}

Likewise, for estimating a given function of the model parameters, say EC₅₀ using design ξ, its c-efficiency is

e_{c} (ξ) = \frac{EC'_{50} I^{- 1} (ξ_{c}; Θ) {[EC'_{50}]}^{⊤}}{E C'_{50} I^{- 1} (ξ; Θ) {[EC'_{50}]}^{⊤}}

where ξ_c is the c-optimal design for estimating the EC₅₀. Similarly, the D_s-efficiency of a design ξ for estimating s parameters in the model is

e_{D_{s}} (ξ) = (\frac{| I (ξ; Θ) | / | I_{11} (ξ; Θ) |}{| I (ξ_{D_{s}}; Θ) | / | I_{11} (ξ_{D_{s}}; Θ) |})^{\frac{1}{s}}

In practice, we want the implemented design to have high efficiency under the user-specified criterion and ideally a design with relatively high efficiencies across criteria and under a variety of model violations.
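The sample-size interpretation of efficiency is a one-line calculation. The helper functions below are a small sketch of ours, not part of the Opt5PL package: one computes the D-efficiency from two information determinants, and the other converts an efficiency r into the percentage of additional observations implied by $100 (1 / r - 1) \%$.

```python
def d_efficiency(det_design, det_optimal, p=5):
    """D-efficiency: p-th root of the ratio of information determinants."""
    return (det_design / det_optimal) ** (1.0 / p)


def extra_observations_percent(r):
    """Percentage of additional observations needed by a design with efficiency r."""
    if not 0.0 < r <= 1.0:
        raise ValueError("efficiency must lie in (0, 1]")
    return 100.0 * (1.0 / r - 1.0)


print(extra_observations_percent(0.5))              # 100.0, i.e. twice as many observations
print(round(extra_observations_percent(0.71), 1))   # 40.8
```

For instance, a design with D-efficiency 0.71 needs roughly 41% more observations than the D-optimal design to achieve the same criterion value.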

3.5 A robust D-optimal design to model misspecification

Locally optimal designs can be sensitive to misspecifications in a nonlinear model, including nominal values which we need to construct a locally optimal design. Here we propose a robust locally D-optimal design that has relatively high efficiencies for estimating parameters in the 3PL, 4PL and 5PL models.

Let $Θ_{1}, Θ_{2}, Θ_{3}$ be the vectors of nominal values for the model parameters in the 3PL, 4PL and 5PL models, respectively, and let $g^{1} (x), g^{2} (x), g^{3} (x)$ be the gradients of the mean functions for the three models, respectively. The normalized Fisher information matrices for the three models are

I_{t} (ξ; Θ_{t}) = \frac{1}{σ^{2}} \sum_{i = 1}^{K} w_{i} g^{t} (x_{i})^{⊤} g^{t} (x_{i}), t = 1, 2, 3

Following Cook and Wong¹⁴ and Atkinson et al.,¹⁷ we use a compound design criterion to construct an efficient design to estimate model parameters accurately regardless of which one of the three models holds. Given nominal values $Θ_{1}, Θ_{2}$ and $Θ_{3}$ for the 3PL, 4PL and 5PL models, respectively, the sought locally optimal design maximizes a weighted average of the three D-optimality criteria for the three models, i.e.

\sum_{t = 1}^{3} \frac{λ_{t}}{p_{t}} \log (| I_{t} (ξ; Θ_{t}) |)

(5)

Here, p_t is the number of model parameters in the tth model and λ_t is a user-selected prior probability that the tth model holds, with $\sum_{t = 1}^{3} λ_{t} = 1$. By taking the directional derivative of the above criterion, one can show that the sensitivity function is

d_{RoD} (x, ξ) = {\sum_{t = 1}^{3} \frac{λ_{t}}{p_{t}} d_{t} (x, ξ)} - 1

(6)

where $d_{t} (x, ξ) = g^{t} (x) I_{t}^{- 1} (ξ; Θ_{t}) g^{t} (x)^{⊤}$. By the Equivalence Theorem, the design ξ_RoD is a robust D-optimal design if and only if

d_{RoD} (x, ξ_{RoD}) \leq 0

for all x in X with equality at the design points of the design ξ_RoD.
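The constant −1 in the sensitivity function (6) can be traced with a short calculation. The sketch below is our own outline of the standard argument, under the assumption that σ² = 1 (the optimal design does not depend on σ²). The notation $\xi_\alpha = (1 - \alpha) \xi + \alpha \delta_x$, where $\delta_x$ is the one-point design at x, is introduced here for illustration only.

```latex
\begin{aligned}
I_t(\xi_\alpha;\Theta_t)
  &= (1-\alpha)\, I_t(\xi;\Theta_t) + \alpha\, g^{t}(x)^{\top} g^{t}(x), \\
\left.\frac{\partial}{\partial\alpha} \log \left| I_t(\xi_\alpha;\Theta_t) \right| \right|_{\alpha=0}
  &= \operatorname{tr}\!\left[ I_t^{-1}(\xi;\Theta_t)
     \left\{ g^{t}(x)^{\top} g^{t}(x) - I_t(\xi;\Theta_t) \right\} \right]
   = d_t(x,\xi) - p_t, \\
\left.\frac{\partial}{\partial\alpha} \sum_{t=1}^{3} \frac{\lambda_t}{p_t}
     \log \left| I_t(\xi_\alpha;\Theta_t) \right| \right|_{\alpha=0}
  &= \sum_{t=1}^{3} \frac{\lambda_t}{p_t}\, d_t(x,\xi) - \sum_{t=1}^{3} \lambda_t
   = d_{RoD}(x,\xi).
\end{aligned}
```

Dividing each log-determinant by p_t puts the three criteria on a common per-parameter scale, and the constraint that the λ_t sum to one produces the single constant −1.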

4 An algorithm and R-package

Yang et al.²³ introduced an efficient algorithm to search for several types of optimal designs for nonlinear models and showed that it outperforms other well-known standard design algorithms. Hyun et al.²⁴ modified their algorithm to search for optimal designs more efficiently, and this modified algorithm is used to find all the optimal designs in this paper. Given a differentiable criterion Ψ, we first compute the first and second derivatives of the optimality criterion with respect to the weights, $\frac{\partial Ψ}{\partial w_{i}}$ and $\frac{\partial^{2} Ψ}{\partial w_{i} \partial w_{j}}$. The algorithm selects good initial design points via Fedorov's algorithm,¹⁶ and at each iteration it selects the point that maximizes the sensitivity function $d (x, ξ)$ and adds it to the support of the current design. The optimal weights for the selected design points are then obtained by the Newton-Raphson method using these derivatives. Upon convergence, the optimality of the design is verified by a General Equivalence Theorem.²⁵ To help practitioners use our optimal designs, we have developed an R package, Opt5PL, based on this algorithm to search for all optimal designs in this paper; the package is available at the Comprehensive R Archive Network (https://CRAN.R-project.org/package=Opt5PL). The supplementary material for this paper contains illustrative examples showing how to use the package to obtain several optimal designs and their efficiencies reported in this paper.
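To give a feel for how a sensitivity function drives a numerical search, the sketch below runs a much simpler scheme than the one described above: a classical multiplicative weight update for D-optimality on a fixed candidate grid. It is not the algorithm of Yang et al. or Hyun et al. and does not reproduce the Opt5PL code. The grid, the number of iterations, the use of natural logarithms and all function names are our assumptions. The update works with $g^{*}$ and $M (ξ; Θ)$, which give the same variance function as g and $I (ξ; Θ)$ because A is invertible.

```python
import math


def g_star(x, th):
    """g*(D): the part of the 5PL gradient that depends on the design."""
    _, t2, t3, _, t5 = th
    D = (t3 / x) ** t2
    phi = (1.0 + D) ** (-t5)
    return [phi, phi * D * math.log(D) / (1.0 + D), phi * D / (1.0 + D),
            1.0, phi * math.log(1.0 + D)]


def invert_with_det(a):
    """Return (inverse, determinant) by Gauss-Jordan elimination with pivoting."""
    n = len(a)
    m = [list(row) + [float(i == j) for j in range(n)] for i, row in enumerate(a)]
    det = 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        if p != c:
            m[c], m[p] = m[p], m[c]
            det = -det
        piv = m[c][c]
        det *= piv
        m[c] = [v / piv for v in m[c]]
        for r in range(n):
            if r != c:
                f = m[r][c]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[c])]
    return [row[n:] for row in m], det


def info(points, weights, th):
    """M(xi; Theta) with sigma^2 set to 1."""
    m = [[0.0] * 5 for _ in range(5)]
    for x, w in zip(points, weights):
        g = g_star(x, th)
        for i in range(5):
            for j in range(5):
                m[i][j] += w * g[i] * g[j]
    return m


def log_det(points, weights, th):
    return math.log(invert_with_det(info(points, weights, th))[1])


def multiplicative_update(points, weights, th):
    """One step w_i <- w_i * (g* M^{-1} g*^T) / 5, renormalized to sum to 1."""
    inv, _ = invert_with_det(info(points, weights, th))
    new = []
    for x, w in zip(points, weights):
        g = g_star(x, th)
        variance = sum(g[i] * inv[i][j] * g[j] for i in range(5) for j in range(5))
        new.append(w * variance / 5.0)
    total = sum(new)
    return [w / total for w in new]


theta = (1.0, 1.0, 1.0, 0.0, 1.0)
log_grid = [-5.0 + 0.25 * k for k in range(41)]   # candidate log concentrations
points = [math.exp(t) for t in log_grid]          # assumption: natural logarithms
start = [1.0 / len(points)] * len(points)
weights = start
for _ in range(200):
    weights = multiplicative_update(points, weights, theta)

# Candidate points that retain non-negligible weight after the iterations.
print([(t, round(w, 3)) for t, w in zip(log_grid, weights) if w > 0.01])
```

The multiplicative update is slow but easy to reason about: points whose sensitivity is positive gain weight, and points whose sensitivity is negative lose it. Approaches that add support points and optimize weights by Newton-type steps, as described above, typically converge much faster.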

The R package contains several functions (ROPT, EDpOPT, DsOPT, Deff, EDpeff, Dseff) useful for finding and evaluating optimal designs for the 5PL models, including functions for studying optimal designs for the 3PL or 4PL models. We describe a few here:

The ROPT function generates the robust D-optimal design when there is uncertainty among the 3PL, 4PL, and 5PL models. It produces the D-optimal design for each model, maximizes the compound optimality criterion (5), and verifies the optimality of the generated design using a General Equivalence Theorem by producing a plot of the sensitivity function (6).

Here EC_p is the concentration level that achieves $100 p \%$ of the difference between the maximum and the minimum responses, and the EDpOPT function finds the c-optimal design to estimate EC_p. When p = 0.5, this function finds the c-optimal design for estimating EC₅₀ in the 5PL model. Another function is the DsOPT function for finding the D_s-optimal design for estimating θ₅ in the 5PL model. Both functions verify optimality of the generated designs using an appropriate General Equivalence Theorem.

Given a user-supplied design, the Deff, EDpeff, Dseff functions compute, respectively, its D-efficiency for estimating Θ, c-efficiency for estimating the EC_p, and D_s-efficiency for estimating θ₅ in the 5PL model. The function Deff can also be used to compute D-efficiencies of any design in the 3PL and 4PL models. Together, these functions generate different types of optimal designs for the 5PL model and evaluate their performances under various misspecifications in the model assumptions.

5 Robustness of the D-optimal design for the 5PL model

5.1 Robustness to the model parameter values and to the 4PL model

The 5PL model captures asymmetry in the response curve and so is more flexible than the 3PL or 4PL model. Are optimal designs for the 5PL model robust to misspecification in nominal values or when the true model is the 4PL model? We provide some insights into the robustness properties of locally D-optimal designs for the 5PL model and also compare their performance with cD-optimal designs that combine c- and D-optimality in different ways to meet user-specified efficiencies for estimating both the model parameters and the EC₅₀. These cD-optimal designs were proposed by Holland²⁶ and shown to perform well under the 4PL model for fitting symmetrical sigmoidal response curves only. Specifically, we consider the three types of cD-optimal designs labeled 2, 3b, and 4b in Table 1 of that paper²⁶ and denote them by $ξ_{i}^{4 PL}$, i = 1, 2, 3. They are, respectively, obtained by (i) maximizing a weighted geometric average of the c- and D-criteria, (ii) a two-stage procedure using the idea of design augmentation in Padmanabhan,²⁷ and (iii) maximizing a weighted geometric average of the c- and D-criteria subject to a user-specified constraint. In what follows, we compare their performance with that of the D-optimal design for the 5PL model, $ξ_{D}^{5 PL}$, and evaluate the robustness of the four designs to misspecified parameter values and to whether the true model is the 4PL or 5PL model.

Table 1.

D-efficiencies, $e_{D}^{5 PL}$ , and c-efficiencies, $e_{c}^{5 PL}$ , of four designs under the 5PL model with various values of θ₅.

		Nominal values of θ₅
Design	Efficiency	0.5	0.8	1.0	1.2	1.5	1.8	2.0
$ξ_{1}^{4 PL}$	$e_{D}^{5 PL}$	0.23	0.27	0.29	0.30	0.31	0.31	0.31
	$e_{c}^{5 PL}$	0.00	0.01	0.13	0.46	0.07	0.01	0.01
$ξ_{2}^{4 PL}$	$e_{D}^{5 PL}$	0.58	0.67	0.71	0.74	0.75	0.74	0.72
	$e_{c}^{5 PL}$	0.05	0.52	0.97	0.90	0.69	0.50	0.40
$ξ_{3}^{4 PL}$	$e_{D}^{5 PL}$	0.22	0.25	0.27	0.28	0.29	0.29	0.29
	$e_{c}^{5 PL}$	0.00	0.01	0.11	0.45	0.06	0.01	0.01
$ξ_{D}^{5 PL}$	$e_{D}^{5 PL}$	0.86	0.95	1.00	0.97	0.92	0.83	0.76
	$e_{c}^{5 PL}$	0.37	0.71	0.69	0.64	0.61	0.56	0.53

Note: The designs $ξ_{1}^{4 PL}, ξ_{2}^{4 PL}, ξ_{3}^{4 PL}$ in the first three rows are the locally cD-optimal designs found from the three ways assuming the 4PL model holds. The fourth design $ξ_{D}^{5 PL}$ is the locally D-optimal design for the 5PL model.

We use the same setup as in Holland²⁶: the vector of nominal values for the 4PL model is $Θ_{2} = (θ_{1}, θ_{2}, θ_{3}, θ_{4}) = (1, 1, 1, 0)$ and the log concentration range is between −5 and 5. For the same concentration range, the vector of nominal values for the 5PL model parameters is $Θ_{3} = (θ_{1}, θ_{2}, θ_{3}, θ_{4}, θ_{5}) = (1, 1, 1, 0, 1)$. We denote the three cD-optimal designs for the 4PL model and the D-optimal design for the 5PL model, respectively, by $ξ_{1}^{4 PL}, ξ_{2}^{4 PL}, ξ_{3}^{4 PL}$, and $ξ_{D}^{5 PL}$, and they are given by

ξ_{1}^{4 PL} = (\begin{matrix} - 5.00 & - 0.60 & - 0.50 & 0.50 & 0.60 & 5.00 \\ 0.25 & 0.19 & 0.06 & 0.06 & 0.19 & 0.25 \end{matrix}),

ξ_{2}^{4 PL} = (\begin{matrix} - 5.00 & - 1.00 & 0.00 & 1.00 & 5.00 \\ 0.25 & 0.12 & 0.25 & 0.12 & 0.25 \end{matrix}),

ξ_{3}^{4 PL} = (\begin{matrix} - 5.00 & - 0.60 & - 0.50 & 0.50 & 0.60 & 5.00 \\ 0.25 & 0.21 & 0.04 & 0.04 & 0.21 & 0.25 \end{matrix}),

ξ_{D}^{5 PL} = (\begin{matrix} - 5.00 & - 1.96 & - 0.15 & 1.65 & 5.00 \\ 0.20 & 0.20 & 0.20 & 0.20 & 0.20 \end{matrix})

In each design, the first row lists the design points on the log concentration scale and the second row lists the corresponding weights.

Tables 1 to 3 display the D-efficiencies, $e_{D}^{5 PL}$, of the four designs $ξ_{1}^{4 PL}, ξ_{2}^{4 PL}, ξ_{3}^{4 PL}$, and $ξ_{D}^{5 PL}$ for estimating the model parameters and their c-efficiencies, $e_{c}^{5 PL}$, for estimating the EC₅₀ when the 5PL model holds and one of the three parameters θ₅, θ₂ and θ₃, respectively, is misspecified. These cD-optimal designs have more than four design points, so it is appropriate to ascertain whether they remain efficient for estimating all the parameters or the EC₅₀ when the 5PL model holds. We noted at the end of Section 2 that the design construction does not depend on the parameters θ₁ and θ₄, and so we do not investigate the drop in design efficiencies when their nominal values are misspecified. In these and all tables to follow, zero efficiencies mean that the actual efficiencies are smaller than 0.01.

It is clear from Tables 1 to 3 that the three cD-optimal designs for the 4PL model do not perform well under the 5PL model when the values of θ₂, θ₃ or θ₅ are misspecified. The design $ξ_{2}^{4 PL}$ performs better than the other two cD-optimal designs, but the overall efficiencies of the three cD-optimal designs for estimating either the parameters or the EC₅₀ are poor compared with those of $ξ_{D}^{5 PL}$, even when $θ_{5} = 1$ and the two models coincide. There are two interesting exceptions: (i) Table 2 shows that both the D- and c-efficiencies of the design $ξ_{2}^{4 PL}$ begin to outperform those of $ξ_{D}^{5 PL}$ when θ₂ is misspecified and larger than the true value, and (ii) Table 3 shows that the c-efficiencies of the design $ξ_{2}^{4 PL}$ exceed those of $ξ_{D}^{5 PL}$ when the nominal value of θ₃ is misspecified and smaller than the true value. This reinforces the importance of considering the 5PL model when constructing optimal designs and shows how inefficient designs obtained from the 4PL model can be when they are used for the 5PL model. In contrast, the D-optimal design $ξ_{D}^{5 PL}$ works well for estimating the model parameters for the different values of θ₂, θ₃, and θ₅. The c-efficiencies of $ξ_{D}^{5 PL}$ are lower than its D-efficiencies, but they are mostly higher than those obtained from the cD-optimal designs. Additionally, Table 3 shows that the D- and c-efficiencies of the D-optimal design $ξ_{D}^{5 PL}$ are much more consistent across different values of θ₃ than across the different values of θ₅ and θ₂ in Tables 1 and 2. This suggests that the D-optimal design for the 5PL model is more resistant to changes in the value of θ₃ than to changes in θ₂ and θ₅.

Table 2.

D-efficiencies $e_{D}^{5 PL}$ and c-efficiencies $e_{c}^{5 PL}$ of the locally optimal designs in column 1 when they are used for the 5PL model with various nominal values of θ₂.

		Nominal values of θ₂
Design	Efficiency	0.5	0.8	1.0	1.2	1.5	1.8	2.0
$ξ_{1}^{4 PL}$	$e_{D}^{5 PL}$	0.18	0.24	0.29	0.33	0.40	0.45	0.48
	$e_{c}^{5 PL}$	0.00	0.00	0.13	0.12	0.03	0.03	0.02
$ξ_{2}^{4 PL}$	$e_{D}^{5 PL}$	0.47	0.61	0.71	0.80	0.89	0.93	0.93
	$e_{c}^{5 PL}$	0.04	0.30	0.97	0.85	0.69	0.67	0.67
$ξ_{3}^{4 PL}$	$e_{D}^{5 PL}$	0.17	0.23	0.27	0.31	0.38	0.42	0.45
	$e_{c}^{5 PL}$	0.00	0.00	0.11	0.08	0.02	0.02	0.02
$ξ_{D}^{5 PL}$	$e_{D}^{5 PL}$	0.79	0.93	1.00	0.96	0.86	0.71	0.61
	$e_{c}^{5 PL}$	0.29	0.75	0.69	0.58	0.50	0.46	0.40

Note: The designs $ξ_{1}^{4 PL}, ξ_{2}^{4 PL}, ξ_{3}^{4 PL}$ in the first three rows are the locally cD-optimal designs for the 4PL model, obtained in the three ways considered, and the fourth design $ξ_{D}^{5 PL}$ is the locally D-optimal design for the 5PL model.

Table 3.

D-efficiencies $e_{D}^{5 PL}$ and c-efficiencies $e_{c}^{5 PL}$ of the locally optimal designs in column 1 when they are used for the 5PL model with various nominal values of θ₃.

		Nominal values of θ₃
Design	Efficiency	0.5	0.8	1.0	1.2	1.5	1.8	2.0
$ξ_{1}^{4 PL}$	$e_{D}^{5 PL}$	0.23	0.28	0.29	0.39	0.28	0.27	0.27
	$e_{c}^{5 PL}$	0.39	0.85	0.07	0.02	0.00	0.00	0.01
$ξ_{2}^{4 PL}$	$e_{D}^{5 PL}$	0.58	0.68	0.71	0.72	0.70	0.68	0.66
	$e_{c}^{5 PL}$	0.89	0.83	0.69	0.53	0.31	0.16	0.48
$ξ_{3}^{4 PL}$	$e_{D}^{5 PL}$	0.22	0.26	0.27	0.27	0.27	0.26	0.25
	$e_{c}^{5 PL}$	0.28	0.85	0.06	0.01	0.00	0.00	0.01
$ξ_{D}^{5 PL}$	$e_{D}^{5 PL}$	0.86	0.96	1.00	0.96	0.94	0.90	0.88
	$e_{c}^{5 PL}$	0.61	0.61	0.61	0.60	0.61	0.65	0.70

Tables 4 and 5 assume the 4PL model holds and compare the D-efficiencies, $e_{D}^{4 PL}$, and c-efficiencies, $e_{c}^{4 PL}$, of the four designs when one of the two parameters θ₃ and θ₂ is misspecified. Under the 4PL model, θ₃ represents the EC₅₀ and θ₂ represents the slope at the EC₅₀. The D-optimal design $ξ_{D}^{5 PL}$ for the 5PL model performs well for the 4PL model when θ₃ is misspecified. However, its c-efficiencies are consistently smaller than those of the three cD-optimal designs, except when $θ_{3} = 0.5$ or 2.0. Table 5 shows a similar pattern, with the three cD-optimal designs having higher c-efficiencies than the design $ξ_{D}^{5 PL}$ when the 4PL model holds. Interestingly, Table 5 also shows that the design $ξ_{D}^{5 PL}$ provides D-efficiencies that are competitive with those of the cD-optimal designs but begins to lose ground when θ₂ is misspecified with a value larger than the true value, which is assumed to be unity. The design $ξ_{D}^{5 PL}$ does not take into account the c-optimality criterion for estimating the EC₅₀, so it is unsurprising that its c-efficiencies are smaller than its D-efficiencies for both the 4PL and the 5PL models, and also smaller than the c-efficiencies of the cD-optimal designs, which incorporate c-optimality.

Table 4.

D- and c-efficiencies, $e_{D}^{4 PL}$ and $e_{c}^{4 PL}$ of the four designs under the 4PL model with various values of θ₃.

		Nominal values of θ₃
Design	Efficiency	0.5	0.8	1.0	1.2	1.5	1.8	2.0
$ξ_{1}^{4 PL}$	$e_{D}^{4 PL}$	0.81	0.89	0.90	0.89	0.86	0.83	0.81
	$e_{c}^{4 PL}$	0.54	0.81	0.84	0.82	0.73	0.62	0.54
$ξ_{2}^{4 PL}$	$e_{D}^{4 PL}$	0.84	0.90	0.90	0.90	0.88	0.86	0.85
	$e_{c}^{4 PL}$	0.56	0.78	0.81	0.80	0.71	0.62	0.56
$ξ_{3}^{4 PL}$	$e_{D}^{4 PL}$	0.81	0.89	0.90	0.89	0.87	0.84	0.81
	$e_{c}^{4 PL}$	0.54	0.81	0.84	0.82	0.74	0.62	0.54
$ξ_{D}^{5 PL}$	$e_{D}^{4 PL}$	0.89	0.89	0.90	0.91	0.91	0.91	0.91
	$e_{c}^{4 PL}$	0.54	0.55	0.55	0.55	0.54	0.55	0.55

Note: $ξ_{1}^{4 PL}, ξ_{2}^{4 PL}, ξ_{3}^{4 PL}$ are the cD-optimal designs for the 4PL model with $Θ_{2} = (1, 1, 1, 0)$ and $ξ_{D}^{5 PL}$ is the D-optimal design for the 5PL model with $Θ_{3} = (1, 1, 1, 0, 1)$ .

Table 5.

D-efficiencies $e_{D}^{4 PL}$ and c-efficiencies $e_{c}^{4 PL}$ of locally optimal designs in column 1 when they are used for the 4PL model with various nominal values of the slope parameter θ₂.

		Nominal values of θ₂
Design	Efficiency	0.5	0.8	1.0	1.2	1.5	1.8	2.0
$ξ_{1}^{4 PL}$	$e_{D}^{4 PL}$	0.73	0.84	0.90	0.94	0.99	1.00	0.99
	$e_{c}^{4 PL}$	0.94	0.89	0.84	0.79	0.70	0.60	0.53
$ξ_{2}^{4 PL}$	$e_{D}^{4 PL}$	0.77	0.85	0.89	0.91	0.92	0.91	0.88
	$e_{c}^{4 PL}$	0.91	0.86	0.81	0.77	0.72	0.69	0.68
$ξ_{3}^{4 PL}$	$e_{D}^{4 PL}$	0.74	0.84	0.90	0.95	0.99	1.00	0.99
	$e_{c}^{4 PL}$	0.94	0.89	0.84	0.78	0.69	0.59	0.52
$ξ_{D}^{5 PL}$	$e_{D}^{4 PL}$	0.88	0.91	0.89	0.86	0.78	0.69	0.62
	$e_{c}^{4 PL}$	0.70	0.59	0.55	0.53	0.51	0.47	0.41

Note: The designs $ξ_{1}^{4 PL}, ξ_{2}^{4 PL}, ξ_{3}^{4 PL}$ in the first three rows are the locally cD-optimal designs for the 4PL model, obtained in the three ways considered, and the fourth design $ξ_{D}^{5 PL}$ is the locally D-optimal design for the 5PL model.

6 Applications

In this section, we apply the R package Opt5PL to find locally D-optimal designs ξ_D, locally c-optimal designs ξ_c, and locally D_s-optimal designs $ξ_{D_{s}}$ for two bioassay studies and use them to evaluate efficiencies of the implemented designs. Both studies assume the 5PL model or a slightly modified version of it and come with nominal values. To study robustness properties of the various designs to misspecification in the nominal values of the model parameters, we consider six vectors of possible nominal values for the five model parameters and denote them by $Θ_{31}$–$Θ_{36}$ in Table 6. More details about how we arrived at the six different sets of nominal values for the parameters are given in Section 6.1.

Table 6.

The six sets of parameter values of the 5PL model for Study 1 and Study 2.

$Θ_{3}$	Study 1	Study 2
$Θ_{31}$	(30000, 0.5, 800, 0.5, 2.0)	(100, 0.81, 40.14, 0, 1.63)
$Θ_{32}$	(30000, 0.5, 800, 0.5, 5.0)	(100, 0.93, 49.82, 0, 1.06)
$Θ_{33}$	(30000, 1.0, 800, 0.5, 1.0)	(100, 1.11, 69.26, 0, 0.59)
$Θ_{34}$	(30000, 1.0, 800, 0.5, 1.5)	(100, 0.80, 10.58, 0, 2.33)
$Θ_{35}$	(30000, 2.0, 800, 0.5, 2.0)	(100, 0.80, 12.12, 0, 2.33)
$Θ_{36}$	(30000, 2.0, 800, 0.5, 5.0)	(100, 0.83, 16.93, 0, 1.90)

Θ = vector of model parameter values $= (θ_{1}, θ_{2}, θ_{3}, θ_{4}, θ_{5})$ .

We now briefly describe the two studies, one at a time, before we provide an assessment of the designs used in the two studies. In Section 6.1, we assess how well the implemented designs perform when parameters are misspecified or under different criteria. In Section 6.2, we report how the implemented designs perform when the model is the 3PL, 4PL or 5PL model. We note that for each study, the implemented design has equal weight at each design point.

Study 1: Bio-Plex cytokine assays are described extensively on www.bio-rad.com, www.biocompare.com and several other web sites. We used the setup described in the technical report¹¹ and considered Bio-Plex cytokine assays that are bead-based multiplex sandwich immunoassays. The models of interest are the 4PL and the 5PL models, which have been shown to be appropriate for fitting data from such assays. There are two recommended setups for the assays to achieve efficient performance: one uses high-sensitivity range standards (0.2–3200 pg/ml) and the other uses broad range standards (1.95–32,000 pg/ml). Typically at least five standards (concentrations) are recommended for the 4PL model and at least six standards for the 5PL model, along with a further recommendation that there be a total of eight evenly distributed standards in the range for an accurate fit. Under a four-fold dilution series, the broad range standard has eight design points at $1.95, 7.8, 31.25, 125, 500, 2000, 8000$ and 32,000.

Study 2: Dawson et al.⁸ assessed the toxicity of four chemical agents alone and in mixture using the 5PL-1P model, which is a modified 5PL model obtained by removing the minimum response parameter. They fitted the concentration–response curves for each single chemical and their mixture using three different exposure durations of 15, 30 and 45 min. The experimental design in the study prepared test concentrations by serial dilution using 1.867 as the dilution factor. Among the four agents, we focus on two agents with the same concentration range (7–300 mg/L) and compare the performances of the implemented designs relative to those of the optimal designs. One agent is ethyl chloroacetate (ECAC) and the other is 3-methyl-2-butanone (3M2B). Based on the dilution factor, both designs have seven design points at $7.09, 13.24, 24.71, 46.12, 86.10, 160.70$ and 300.00 with two replications at each design point.
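The design points of both implemented designs follow from their dilution schemes. The following is a minimal sketch, assuming each series starts at the top concentration and is diluted repeatedly by a constant factor; the helper name `dilution_series` is hypothetical and is not part of Opt5PL.

```python
def dilution_series(top, factor, n_points):
    """Return n_points concentrations obtained by repeatedly diluting
    `top` by `factor`, sorted from lowest to highest."""
    if top <= 0 or factor <= 1 or n_points < 1:
        raise ValueError("need top > 0, factor > 1 and n_points >= 1")
    return sorted(top / factor**k for k in range(n_points))


# Study 1: four-fold dilutions of the 32,000 pg/ml broad range standard.
study1 = dilution_series(32000, 4, 8)
print([round(x, 2) for x in study1])

# Study 2: dilution factor 1.867, starting from 300 mg/L.
study2 = dilution_series(300, 1.867, 7)
print([round(x, 2) for x in study2])
```

For Study 1 the sketch reproduces the eight standards listed above. For Study 2 the computed points agree with the reported ones only up to small differences in the second decimal place, presumably because the reported dilution factor 1.867 is itself rounded.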

6.1 Efficiencies of the implemented designs under nominal values misspecification

In Study 1, the investigators studied cytokine assays over a pre-specified range of concentrations between 1.95 and 32,000, assuming the vector of nominal values for the model parameters is $Θ_{3} = (30000, 1, 800, 0.5, 1)$ . To simulate various response curves over the same range, we created six different sets of possible values of (θ₂, θ₅) commensurate with the values of θ₁, θ₃, and θ₄ in their paper. We vary these two parameters because results from the previous section suggest that the locally D-optimal design for the 5PL model is more sensitive to θ₂ and θ₅ than to θ₃.

In Study 2, each single chemical has three different response curves based on three different exposure times. Estimated parameter values of the EC₅₀, the slope, and the asymmetric factor were made available when the 5PL-1P model was fitted to each of the agents.⁸ We used the six different sets of parameter values for the two agents ECAC and 3M2B to create six additional response curves of the 5PL model. As noted before, the optimal designs for the 5PL model do not depend on the maximal and minimal responses. To fix ideas, we assume that their values are 100 and 0, respectively, since the response is a toxicity effect (0–100%) in the study.

We use the R package Opt5PL and, for each vector of nominal values, generate the locally D-optimal designs ξ_D, locally c-optimal designs ξ_c, and locally D_s-optimal designs $ξ_{D_{s}}$ for the 5PL models. The suggested guideline for fitting the 5PL model requires at least six design points, and so one may add two evenly spaced design points between the second and the fourth design points of ξ_D on the log scale, and call this an extended D-optimal design ξ_ExD. The extended D-optimal design has equal weight across its seven design points. Table 7 displays the four types of locally optimal designs, including the extended D-optimal designs ξ_ExD, found for the six sets of nominal values. To save space, we provide the obtained optimal designs for Study 2 only. In both studies, the three types of locally optimal designs always contain the endpoints of the design range, and the interior points vary with the optimality criterion and the nominal parameter values.
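The construction of the extended design can be sketched as follows. The sketch reads "evenly spaced on the log scale" as dividing the log-interval between the second and fourth points of ξ_D into three equal parts, an interpretation that reproduces the $Θ_{31}$ row of Table 7; the function name `extend_design` is hypothetical and is not taken from Opt5PL.

```python
import math


def extend_design(points):
    """Add two points that split the interval between the second and
    fourth design points into three equal parts on the log scale."""
    if len(points) != 5:
        raise ValueError("expected a five-point design")
    x = sorted(points)
    lo, hi = math.log(x[1]), math.log(x[3])
    step = (hi - lo) / 3.0
    added = [math.exp(lo + step), math.exp(lo + 2.0 * step)]
    return sorted(x + added)


# Locally D-optimal design for the parameter set Theta_31 (Table 7, Study 2).
xi_D = [7.00, 15.58, 47.75, 143.44, 297.65]
xi_ExD = extend_design(xi_D)
print([round(v, 2) for v in xi_ExD])
```

The two added points fall on either side of the third point of ξ_D, so the extended design keeps all five original points and has seven points in total.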

Table 7.

Four different types of locally optimal designs, ξ_D, ξ_ExD, ξ_c, and $ξ_{D_{s}}$ for the 5PL model for the six sets of parameter values ( $Θ_{31} \sim Θ_{36}$ ) for Study 2.

$Θ_{3}$	ξ_D	ξ_ExD
$Θ_{31}$	(7.00, 15.58, 47.75, 143.44, 297.65)	(7.00, 15.58, 32.65, 47.75, 68.44, 143.44, 297.65)
$Θ_{32}$	(7.00, 14.97, 44.97, 136.44, 297.65)	(7.00, 14.97, 31.27, 44.97, 65.32, 136.44, 297.65)
$Θ_{33}$	(7.00, 14.97, 44.97, 133.74, 297.65)	(7.00, 14.97, 31.06, 44.97, 64.45, 133.74, 297.65)
$Θ_{34}$	(7.00, 12.63, 34.67, 115.11, 297.65)	(7.00, 12.63, 26.38, 34.67, 55.11, 115.11, 297.65)
$Θ_{35}$	(7.00, 12.88, 35.73, 117.44, 297.65)	(7.00, 12.88, 26.91, 35.73, 56.21, 117.44, 297.65)
$Θ_{36}$	(7.00, 13.28, 37.19, 121.01, 297.65)	(7.00, 13.28, 27.74, 37.19, 57.94, 121.01, 297.65)
$Θ_{3}$	ξ _c	$ξ_{D_{s}}$
$Θ_{31}$	$(\begin{matrix} 7.00 & 13.82 & 47.27 & 158.52 & 297.65 \\ 0.132 & 0.213 & 0.19 & 0.287 & 0.178 \end{matrix})$	$(\begin{matrix} 7.00 & 13.82 & 48.71 & 161.73 & 297.65 \\ 0.151 & 0.269 & 0.230 & 0.231 & 0.119 \end{matrix})$
$Θ_{32}$	$(\begin{matrix} 7.00 & 13.54 & 44.97 & 152.31 & 297.65 \\ 0.162 & 0.250 & 0.179 & 0.250 & 0.159 \end{matrix})$	$(\begin{matrix} 7.00 & 13.41 & 45.87 & 153.84 & 297.65 \\ 0.148 & 0.266 & 0.231 & 0.234 & 0.121 \end{matrix})$
$Θ_{33}$	$(\begin{matrix} 7.00 & 13.41 & 13.54 & 44.52 & 149.29 & 297.65 \\ 0.191 & 0.163 & 0.136 & 0.185 & 0.202 & 0.125 \end{matrix})$	$(\begin{matrix} 7.00 & 13.41 & 45.42 & 152.31 & 297.65 \\ 0.146 & 0.265 & 0.229 & 0.235 & 0.125 \end{matrix})$
$Θ_{34}$	$(\begin{matrix} 7.00 & 11.54 & 35.02 & 129.79 & 297.65 \\ 0.176 & 0.269 & 0.183 & 0.231 & 0.142 \end{matrix})$	$(\begin{matrix} 7.00 & 11.43 & 34.33 & 129.79 & 297.65 \\ 0.150 & 0.273 & 0.233 & 0.227 & 0.116 \end{matrix})$
$Θ_{35}$	$(\begin{matrix} 7.00 & 11.66 & 36.45 & 132.41 & 297.65 \\ 0.173 & 0.262 & 0.180 & 0.238 & 0.147 \end{matrix})$	$(\begin{matrix} 7.00 & 11.66 & 35.37 & 133.74 & 297.65 \\ 0.149 & 0.272 & 0.233 & 0.228 & 0.118 \end{matrix})$
$Θ_{36}$	$(\begin{matrix} 7.00 & 12.01 & 37.19 & 137.8 & 297.65 \\ 0.172 & 0.260 & 0.174 & 0.240 & 0.153 \end{matrix})$	$(\begin{matrix} 7.00 & 12.01 & 36.82 & 136.44 & 297.65 \\ 0.150 & 0.273 & 0.233 & 0.227 & 0.117 \end{matrix})$

The D-optimal design ξ_D and the extended D-optimal design ξ_ExD have equal weights over the obtained design points, so their weights are not given in the table. Each row shows the obtained optimal design for each given parameter set. For example, the column ξ_D for the row $Θ_{31}$ shows the obtained D-optimal design for the parameter set $Θ_{31}$ , and the column ξ_c for the row $Θ_{31}$ shows the obtained c-optimal design for the parameter set $Θ_{31}$ . For both the c-optimal design ξ_c and D_s-optimal designs $ξ_{D_{s}}$ , the first row displays the design points and the second row displays their corresponding weights. $Θ =$ vector of model parameter values $= (θ_{1}, θ_{2}, θ_{3}, θ_{4}, θ_{5})$ .

Table 8 shows the performances of the designs ξ_D, ξ_ExD, the implemented design $ξ_{S 1}$ for Study 1 and the implemented design $ξ_{S 2}$ for Study 2 in terms of three objectives: estimating the five parameters in the 5PL model, estimating the EC₅₀ and estimating θ₅. By definition, the D-optimal designs have 100% D-efficiencies, as shown in the first row for each study. For the other two objectives, the locally D-optimal designs for both studies in most cases still have much higher efficiencies than those provided by the extended designs and the implemented designs. The table also shows that the extended designs clearly do better than the implemented designs for estimating model parameters. The rows for the two implemented designs $ξ_{S 1}$ and $ξ_{S 2}$ in each of the three efficiency categories show that they clearly and substantially underperform, and in some cases their D_s-efficiencies for estimating the parameter θ₅ are near 0. For both studies, the extended designs outperform the implemented designs, and more so in Study 1. We also observe that there is less variation in the efficiencies in Study 2 than in Study 1 across the six sets of nominal values.

Table 8.

Efficiencies of various designs for the 5PL model with different objectives and 6 sets of nominal values for Study 1 and Study 2.

	Efficiency	ξ	$Θ_{31}$	$Θ_{32}$	$Θ_{33}$	$Θ_{34}$	$Θ_{35}$	$Θ_{36}$
Study 1	e _D	ξ _D	1.00	1.00	1.00	1.00	1.00	1.00
		ξ _ExD	0.91	0.91	0.91	0.91	0.91	0.91
		$ξ_{S 1}$	0.88	0.74	0.86	0.83	0.45	0.32
	e _c	ξ _D	0.82	0.85	0.71	0.66	0.55	0.57
		ξ _ExD	0.62	0.67	0.67	0.60	0.63	0.64
		$ξ_{S 1}$	0.55	0.30	0.55	0.47	0.03	0.35
	$e_{D_{s}}$	ξ _D	0.84	0.83	0.86	0.85	0.86	0.85
		ξ _ExD	0.67	0.66	0.65	0.65	0.65	0.65
		$ξ_{S 1}$	0.59	0.35	0.56	0.48	0.05	0.04
Study 2
	e _D	ξ _D	1.00	1.00	1.00	1.00	1.00	1.00
		ξ _ExD	0.91	0.91	0.91	0.91	0.91	0.91
		$ξ_{S 2}$	0.92	0.92	0.92	0.90	0.91	0.91
	e _c	ξ _D	0.88	0.90	0.88	0.89	0.90	0.90
		ξ _ExD	0.67	0.68	0.67	0.69	0.69	0.69
		$ξ_{S 2}$	0.68	0.68	0.66	0.63	0.64	0.66
	$e_{D_{s}}$	ξ _D	0.84	0.85	0.85	0.85	0.84	0.85
		ξ _ExD	0.68	0.68	0.68	0.68	0.68	0.68
		$ξ_{S 2}$	0.66	0.66	0.66	0.61	0.62	0.63

Note: $Θ =$ vector of model parameter values $= (θ_{1}, θ_{2}, θ_{3}, θ_{4}, θ_{5})$ . The designs are the locally D-optimal design, ξ_D, the extended D-optimal design, ξ_ExD, and the implemented designs $ξ_{S 1}$ and $ξ_{S 2}$ for the two studies.

The efficiencies of the implemented design for estimating the EC₅₀ range from 3% to 65% in Study 1 and from 63% to 68% in Study 2. The corresponding efficiencies of the extended designs are remarkably stable, averaging around 66% across Study 1 and Study 2. In the two studies, the locally D-optimal design does well for estimating the EC₅₀ and θ₅, with efficiencies averaging in the mid-eighties for Study 1 and in the high-eighties for Study 2. The overall message from the table is that the extended D-optimal designs appear practically useful, whereas the implemented designs $ξ_{S 1}$ and $ξ_{S 2}$ do not, when nominal values for the model parameters are misspecified.
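A common way to read these efficiencies is in terms of sample size: under the usual definitions, a design with efficiency e needs roughly 1/e times as many observations as the optimal design to achieve the same precision. The sketch below assumes that interpretation applies to the efficiencies defined in Section 3.4; the function name `equivalent_sample_size` is hypothetical.

```python
import math


def equivalent_sample_size(n_optimal, efficiency):
    """Observations a design with the given efficiency needs in order to
    match an optimal design run with n_optimal observations (assumes the
    usual 1/efficiency sample-size interpretation)."""
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must lie in (0, 1]")
    return math.ceil(n_optimal / efficiency)


# D-efficiency of the implemented Study 1 design under Theta_36 (Table 8).
print(equivalent_sample_size(100, 0.32))
# D-efficiency of the extended D-optimal design (Table 8).
print(equivalent_sample_size(100, 0.91))
```

On this reading, the gap between an efficiency of 0.32 and one of 0.91 corresponds to roughly a threefold difference in the number of observations required.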

6.2 Efficiencies of the implemented designs under mean function misspecification

Many statistical models in immunoassays and bioassays revolve around the 4PL and 5PL models and sometimes the 3PL model. All three models describe a sigmoidal curve for the mean response. Frequently, it is not clear which one of these models is the most appropriate, and it is desirable to have a design that works relatively well regardless of which one of them holds. We propose a robust locally D-optimal design that can provide well-balanced efficiencies for estimating model parameters in the three models.

We assume the same six sets of possible nominal values in Table 6 for the 5PL model parameters in Study 1 and, for each set, simulate data from the 5PL model at the design points of the implemented design $ξ_{S 1}$ and use them to estimate the model parameters for the 3PL and the 4PL models, respectively. These estimates are obtained using our nlm R program and they then serve as nominal values for the 3PL and the 4PL models. The same procedure is repeated using $ξ_{S 2}$ for Study 2 to obtain nominal values for the 3PL and 4PL models for Study 2.

To tackle the model uncertainty issue, we elicit prior probabilities $λ_{1}, λ_{2}, λ_{3}$ for the three models, with a higher probability for the more likely model, subject to the constraint that the probabilities sum to unity. For example, if we believe that the three models are equally plausible, we set $λ_{1} = λ_{2} = λ_{3} = 1 / 3$ . We then apply our R package and optimize the criterion (5) to obtain a robust D-optimal design ξ_RoD for each study. To save space, we do not display them and note only that the robust D-optimal designs always include the two endpoints of the design range.
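Criterion (5) is defined in Section 3.5 and is not reproduced here. As a rough illustration of how the prior probabilities weight the three models, the sketch below summarises a design by the weighted geometric mean of its D-efficiencies under the 3PL, 4PL and 5PL models. This summary is an assumption made for illustration and is not claimed to be criterion (5); the function name is hypothetical, and the efficiencies are taken from the $Θ_{31}$ rows of Table 9 for Study 1.

```python
import math


def weighted_geometric_efficiency(effs, weights):
    """Weighted geometric mean of per-model D-efficiencies, with the
    prior model probabilities as weights."""
    if len(effs) != len(weights):
        raise ValueError("one weight per model is required")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to one")
    if any(e <= 0 for e in effs):
        raise ValueError("efficiencies must be positive")
    return math.exp(sum(w * math.log(e) for e, w in zip(effs, weights)))


lam = [1 / 3, 1 / 3, 1 / 3]  # equally plausible 3PL, 4PL and 5PL models

# D-efficiencies (3PL, 4PL, 5PL) from Table 9, Study 1, parameter set Theta_31.
score_D = weighted_geometric_efficiency([0.69, 0.91, 1.00], lam)
score_RoD = weighted_geometric_efficiency([0.80, 0.94, 0.94], lam)
print(round(score_D, 3), round(score_RoD, 3))
```

With equal weights, the robust design scores higher than the locally D-optimal design for the 5PL model because it gives up a little efficiency under the 5PL model in exchange for a larger gain under the 3PL model.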

Table 9 shows the D-efficiencies of the designs $ξ_{S 1}$, $ξ_{S 2}$, ξ_ExD, ξ_D and ξ_RoD under the 3PL, 4PL and 5PL models for each of the six sets of parameter values for Study 1 and Study 2. The D-efficiencies under the 3PL and 4PL models are calculated using the definition in Section 3.4, assuming in turn that each of the models is the true model, with the estimated parameters for the 3PL and the 4PL models serving as nominal values; the same nominal values are used to obtain the robust D-optimal designs. For Study 1, the implemented design $ξ_{S 1}$ consistently underperforms relative to the extended design and the robust design by a wide margin in terms of estimating model parameters. For Study 2, the D-efficiencies have a similar pattern but the differences are less dramatic than in Study 1. For both studies, the extended designs generally perform satisfactorily for estimating parameters in the three models, and their D-efficiencies are not too different from those provided by the D-optimal designs. The D-optimal design for the 5PL model performs well even when the 3PL or 4PL model is the true model in Study 2, since its lowest D-efficiency across the six sets of nominal values is about 85%; this number drops to about 66% in Study 1. A clear finding is that the D-efficiencies of the robust D-optimal designs are uniformly high for both studies across the six sets of nominal values for the parameters in the 3PL, 4PL and 5PL models. The minimum D-efficiency in Study 1 is about 79% and that in Study 2 is 90%.

Table 9.

D-efficiencies of the four designs, $ξ_{S 1}$ and $ξ_{S 2}$ , ξ_ExD, ξ_D, and ξ_RoD for the 3PL, 4PL and 5PL models under the six sets of nominal values for Study 1 and Study 2.

Study 1
$Θ_{3}$	ξ	$e_{D}^{3 PL}$	$e_{D}^{4 PL}$	$e_{D}^{5 PL}$	$Θ_{3}$	ξ	$e_{D}^{3 PL}$	$e_{D}^{4 PL}$	$e_{D}^{5 PL}$
$Θ_{31}$	$ξ_{S 1}$	0.60	0.83	0.88	$Θ_{34}$	$ξ_{S 1}$	0.51	0.81	0.83
	ξ _ExD	0.69	0.85	0.91		ξ _ExD	0.70	0.86	0.91
	ξ _D	0.69	0.91	1.00		ξ _D	0.68	0.91	1.00
	ξ _RoD	0.80	0.94	0.94		ξ _RoD	0.79	0.93	0.95
$Θ_{32}$	$ξ_{S 1}$	0.40	0.73	0.74	$Θ_{35}$	$ξ_{S 1}$	0.33	0.66	0.45
	ξ _ExD	0.67	0.84	0.91		ξ _ExD	0.70	0.87	0.92
	ξ _D	0.67	0.91	1.00		ξ _D	0.66	0.90	1.00
	ξ _RoD	0.80	0.94	0.94		ξ _RoD	0.80	0.94	0.93
$Θ_{33}$	$ξ_{S 1}$	0.58	0.83	0.86	$Θ_{36}$	$ξ_{S 1}$	0.20	0.46	0.32
	ξ _ExD	0.71	0.86	0.91		ξ _ExD	0.71	0.87	0.92
	ξ _D	0.69	0.90	1.00		ξ _D	0.66	0.90	1.00
	ξ _RoD	0.80	0.93	0.95		ξ _RoD	0.80	0.94	0.94
Study 2
$Θ_{3}$	ξ	$e_{D}^{3 PL}$	$e_{D}^{4 PL}$	$e_{D}^{5 PL}$	$Θ_{3}$	ξ	$e_{D}^{3 PL}$	$e_{D}^{4 PL}$	$e_{D}^{5 PL}$
$Θ_{31}$	$ξ_{S 2}$	0.84	0.86	0.92	$Θ_{34}$	$ξ_{S 2}$	0.86	0.86	0.90
	ξ _ExD	0.82	0.84	0.91		ξ _ExD	0.84	0.84	0.91
	ξ _D	0.85	0.90	1.00		ξ _D	0.88	0.90	1.00
	ξ _RoD	0.90	0.94	0.97		ξ _RoD	0.92	0.94	0.97
$Θ_{32}$	$ξ_{S 2}$	0.85	0.86	0.92	$Θ_{35}$	$ξ_{S 2}$	0.86	0.86	0.91
	ξ _ExD	0.83	0.84	0.91		ξ _ExD	0.84	0.84	0.91
	ξ _D	0.88	0.90	1.00		ξ _D	0.87	0.90	1.00
	ξ _RoD	0.92	0.93	0.97		ξ _RoD	0.92	0.94	0.97
$Θ_{33}$	$ξ_{S 2}$	0.83	0.86	0.92	$Θ_{36}$	$ξ_{S 2}$	0.85	0.86	0.91
	ξ _ExD	0.82	0.84	0.91		ξ _ExD	0.84	0.84	0.91
	ξ _D	0.87	0.90	1.00		ξ _D	0.87	0.90	1.00
	ξ _RoD	0.91	0.94	0.97		ξ _RoD	0.92	0.94	0.97

$Θ =$ vector of model parameter values $= (θ_{1}, θ_{2}, θ_{3}, θ_{4}, θ_{5})$ . ξ_RoD is the robust D-optimal design with $λ_{1} = λ_{2} = λ_{3} = 1 / 3$ and $e_{D}^{3 PL}, e_{D}^{4 PL}, e_{D}^{5 PL}$ are, respectively, the efficiencies of the design for estimating the model parameters in the 3PL, 4PL, and 5PL models.
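The worst-case D-efficiencies quoted in the text can be recomputed directly from Table 9. The sketch below transcribes the rows for the implemented, locally D-optimal and robust D-optimal designs and takes the minimum over the three models and the six parameter sets; the dictionary keys and the helper `worst_case` are illustrative names only.

```python
# D-efficiencies (3PL, 4PL, 5PL) transcribed from Table 9, rows Theta_31..Theta_36.
table9 = {
    "Study 1": {
        "implemented": [(0.60, 0.83, 0.88), (0.40, 0.73, 0.74), (0.58, 0.83, 0.86),
                        (0.51, 0.81, 0.83), (0.33, 0.66, 0.45), (0.20, 0.46, 0.32)],
        "xi_D": [(0.69, 0.91, 1.00), (0.67, 0.91, 1.00), (0.69, 0.90, 1.00),
                 (0.68, 0.91, 1.00), (0.66, 0.90, 1.00), (0.66, 0.90, 1.00)],
        "xi_RoD": [(0.80, 0.94, 0.94), (0.80, 0.94, 0.94), (0.80, 0.93, 0.95),
                   (0.79, 0.93, 0.95), (0.80, 0.94, 0.93), (0.80, 0.94, 0.94)],
    },
    "Study 2": {
        "implemented": [(0.84, 0.86, 0.92), (0.85, 0.86, 0.92), (0.83, 0.86, 0.92),
                        (0.86, 0.86, 0.90), (0.86, 0.86, 0.91), (0.85, 0.86, 0.91)],
        "xi_D": [(0.85, 0.90, 1.00), (0.88, 0.90, 1.00), (0.87, 0.90, 1.00),
                 (0.88, 0.90, 1.00), (0.87, 0.90, 1.00), (0.87, 0.90, 1.00)],
        "xi_RoD": [(0.90, 0.94, 0.97), (0.92, 0.93, 0.97), (0.91, 0.94, 0.97),
                   (0.92, 0.94, 0.97), (0.92, 0.94, 0.97), (0.92, 0.94, 0.97)],
    },
}


def worst_case(rows):
    """Smallest D-efficiency over all models and parameter sets."""
    return min(min(row) for row in rows)


summary = {
    study: {design: worst_case(rows) for design, rows in designs.items()}
    for study, designs in table9.items()
}
for study, designs in summary.items():
    for design, value in designs.items():
        print(f"{study:8s} {design:12s} min e_D = {value:.2f}")
```

In both studies the minimum is attained under the 3PL model, which is where the robust design gains most over the locally D-optimal design for the 5PL model.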

7 Conclusions

Our work is the first to address a variety of design issues for the 5PL model. We present optimal designs for estimating model parameters and studying meaningful features of the 5PL model, which can provide a better fit to asymmetric data from bioassays than the 3PL or 4PL models. We compare the performance of some of the designs that are recommended for immunoassays and bioassays and show that they may be far from optimal. We suggest designs that are robust to misspecification of the mean function or of the nominal values of the model parameters, and so provide more accurate statistical inference for the model parameters.

To help users implement optimal designs for the 5PL model, we provide an R package, Opt5PL, to assess the performance of user-specified designs relative to the optimum and to study robustness properties of a design to various model assumptions. In particular, we show that the locally D-optimal design for estimating the model parameters in the 5PL model is relatively robust to misspecified parameter values for θ₂, θ₃ and θ₅ and also to the form of the mean response. Additionally, we show that robust D-optimal designs consistently have high D-efficiencies for estimating model parameters regardless of which of the three models, 3PL, 4PL or 5PL, holds.

When there are several objectives in the study, the design strategy in Section 3.5 can be used to construct a multiple-objective optimal design that incorporates the relative importance of the objectives. To do so, we formulate a compound criterion by taking a convex combination of the convex criteria, with the weights chosen to reflect the relative importance of the criteria. The sought multiple-objective optimal design minimizes the compound criterion. Because the compound criterion is still convex, an equivalence theorem can be derived to confirm optimality. Details are in Cook and Wong¹⁴ and Hyun and Wong,¹⁵ where a graphical approach is also described to find a multiple-objective optimal design.

We focus on constructing locally optimal designs; future directions for research include finding other types of optimal designs for the 5PL model using the maximin, Bayesian and multistage approaches. Another interesting design issue not discussed here is finding optimal designs for the 5PL model when the data have heterogeneous variances, as bioassay data sometimes have variances that change with the concentration. It is not known whether the optimal designs discussed here are robust to heteroscedastic errors in the model or whether optimal designs based on other efficient estimators, such as the maximum quasi-likelihood estimator (MqLE) or the extended quasi-likelihood estimator (EQL), provide a better option.

We close with the note that a main role of optimal designs is calibration so that we know what the optimal design is in an ideal situation. In practice, designs should be amended to reflect reality and the needs of the user but not stray too far from the optimum; otherwise the quality of the statistical inference from the study may suffer.

Supplemental Material

Supplemental Material for Optimal designs for asymmetric sigmoidal response curves in bioassays and immunoassays by Seung Won Hyun, Weng Kee Wong and Yarong Yang in Statistical Methods in Medical Research

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Wong was partially supported by a grant from the National Institute of General Medical Sciences of the National Institutes of Health under Award Number R01GM107639. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Supplementary Material

Supplemental material is available online for this article.

References

1. Law. Immunoassay: a practical guide, Boca Raton, FL: CRC Press, 2002.

2. DeSilva, Smith, Weiner, et al. Recommendations for the bioanalytical method validation of ligand-binding assays to support pharmacokinetic assessments of macromolecules. Pharm Res 2003; 20: 1885–1900.

3. Liao JJZ, Duan, Meng, et al. Selecting an appropriate dose-response curve in bioassay development. Frontiers Drug Des Discov 2010; 5: 67–96.

4. Leschik, Diana, Olivo, et al. Analytical performance and clinical utility of a bioassay for thyroid-stimulating immunoglobulins. Am J Clin Pathol 2013; 139: 192–200.

5. Gottschalk, Dunn. The five-parameter logistic: a characterization and comparison with the four-parameter logistic. Anal Biochem 2005; 343: 54–65.

6. Findlay, Dillard. Appropriate calibration curve fitting in ligand binding assays. AAPS J 2007; 9: 260–267.

7. Feng, Sales, Kepler. A Bayesian approach for estimating calibration curves and unknown concentrations in immunoassays. Bioinformatics 2011; 27: 707–712.

8. Dawson, Mooneyham, Jeyaratnam, et al. Mixture toxicity of S_N2-reactive soft electrophiles: 2–evaluation of mixtures containing ethyl α-halogenated acetates. Arch Environ Contamination Toxicol 2011; 61: 547–557.

9. Dawson, Genco, Bensinger, et al. Evaluation of an asymmetry parameter for curve-fitting in single chemical and mixture toxicity assessment. Toxicology 2012; 26: 156–161.

10. Cumberland, Fong, et al. Nonlinear calibration model choice between the four and five parameter logistic models. J Biopharm Stat 2015; 25: 972–983.

11. Davis D, Zhang A, Torrence J, et al. Selection of standards for bio-plex cytokine assays. Bio-Rad 2009, http://www.bio-rad.com/LifeScience/pdf/Bulletin_2900.pdf (accessed 20 September 2017).

12. Gottschalk, Dunn. Measuring parallelism, linearity, and relative potency in bioassay and immunoassay data. J Biopharm Stat 2005b; 15: 437–463.

13. Manukyan Z and Rosenberger WF. D-optimal design for a five-parameter logistic model. In: Giovagnoli A, Atkinson A, Torsney B and May C (eds) mODa9-Advances in Model-Oriented Design and Analysis, Contributions to Statistics. Physica-Verlag HD, 2010, pp. 113–120.

14. Cook, Wong. On the equivalence of constrained and compound optimal designs. J Am Stat Assoc 1994; 89: 687–692.

15. Hyun, Wong. Multiple objective optimal designs to study the interesting features in a dose-response relationship. Int J Biostat 2015; 11: 253–271.

16. Fedorov, Studden, Klimko. Theory of optimal experiments, New York, NY: Academic, 1972.

17. Atkinson, Donev, Tobias. Optimum experimental designs with SAS, Oxford: Oxford University Press, 2007.

18. Chernoff. Locally optimal designs for estimating parameters. Ann Math Stat 1953; 24: 586–602.

19. Ford, Kitsos, Titterington. Recent advances in nonlinear experimental design. Technometrics 1989; 31: 49–60.

20. Chen, Chang, Wang, et al. Minimax optimal designs via particle swarm optimization methods. Stat Comput 2015; 25: 975–988.

21. Coffey. Bioassay case study applying the maximin D-optimal design algorithm to the four-parameter logistic model. Pharm Stat 2015; 14: 427–432.

22. McCallum, Bornkamp. Accounting for parameter uncertainty in two-stage designs for Phase II dose-response studies. In: Oleksandr Sverdlov (ed.) Modern adaptive randomized clinical trials, Chapman and Hall: CRC Press, 2015, pp. 427–450.

23. Yang, Biedermann, Tang. On optimal designs for nonlinear models: a general and efficient algorithm. J Am Stat Assoc 2013; 108: 1411–1420.

24. Hyun, Wong, Yang. VNM: an R package for finding multiple-objective optimal designs for the 4-parameter logistic model. J Stat Softw 2018; 83(5): 1–19. DOI: 10.18637/jss.v083.i05.

25. Kiefer. Jack Carl Kiefer collected papers III, design of experiments, New York, NY: Springer-Verlag, 2014.

26. Holland-Letz. On the combination of c- and D-optimal designs: general approaches and applications in dose–response studies. Biometrics 2017; 73: 206–213.

27. Padmanabhan, Dragalin. Adaptive Dc-optimal designs for dose finding based on a continuous efficacy endpoint. Biom J 2010; 52: 836–852.
