Abstract
This article accentuates the estimation of a two-parameter generalized Topp-Leone distribution using dual generalized order statistics (dgos). In the part of estimation, we obtain maximum likelihood (ML) estimates and approximate confidence intervals of the model parameters using dgos, in particular, based on order statistics and lower record values. The Bayes estimate is derived with respect to a squared error loss function using gamma priors. The highest posterior density credible interval is computed based on the MH algorithm. Furthermore, the explicit expressions for single and product moments of dgos from this distribution are also derived. Based on order statistics and lower records, a simulation study is carried out to check the efficiency of these estimators. Two real life data sets, one is for order statistics and another is for lower record values have been analyzed to demonstrate how the proposed methods may work in practice.
Keywords
Introduction
Shekhawat and Sharma (2020) introduced a new extension of Topp-Leone (TL) distribution on the unit interval by adding a skewness parameter in the TL distribution using the power transformation called the generalized Topp-Leone (GTL) distribution. The probability density Eq. (1) of GTL distribution is given by
and the corresponding cumulative distribution Eq. (2) and hazard function are given by
It can be seen that
Topp-Lone distribution is a member of GTL distribution if
Bounded distributions are increasingly gaining grounds in literature owing to their significance in several areas like psychology, economics, biology, engineering and many others. For instance, in psychology, proportions and percentages play a vital role in evaluating the probability of judgments (Smithson & Shou, 2017). Similarly in economics, one may come across several instances where data are bound on the unit interval. For example, proportion of income spent on non-durable consumption, pension plan participation rates, market shares, fractional repayment on debts and capital structures (Ghosh et al., 2019; Papke & Wooldridge, 1996; Smithson & Shou, 2017). Besides, for measuring reliability, it is imperative to have models defined on the unit interval in order to have plausible results (Genç, 2013).
Distributions based on unit interval are known to have desirable failure (hazard) rate characteristics such as increasing, decreasing and bathtub shapes. However, one may encounter situations where only increasing and bathtub failure rates are used or observed. These failure rate characteristics are vital when modeling datasets. For instance, Rajarshi and Rajarshi (1998), and Lawless (2003) in their studies observed that distributions with bathtub hazard rates are needed to model lifetime of electronic, electrochemical and mechanical products; while Lai (2013) observed that the optimum number of minimal repairs for systems have increasing failure rates. Also, it has been observed that during clinical development drugs have increasing failure rate (see Woosley and Cossman (2007)).
The concept of generalized order statistics (gos) has held the attention of statisticians for a long time. It was first proposed by Kamps (1995) and includes ordered random variables arranged in increasing order of magnitude such as order statistics, sequential order statistics, progressive type II censored order statistics, records and Pfeifers records. However, the ordered random variables which are arranged in decreasing order of magnitude can not be studied in this framework. Owing to this, statisticians felt the need for the ordered random variables which can also be arranged in decreasing order of magnitude. For example, the life length of an electric bulb arranged from highest to lowest. The study of distributional properties of such random variables by using the inverse image of gos is popularly known as dual generalized order statistics. Pawlas and Szynal (2001) first proposed the concept of dual generalized order statistics (dgos) wherein the order random variables can be studied in both increasing and decreasing order of magnitude. This concept was further studied in a systematic manner by Burkschat et al. (2003). Dual gos includes order statistics (reversed ordered order statistics), lower k-records and lower Pfeifer records. For better understanding, dgos can be used when
In the last two decades or so, several studies have been carried out on the statistical properties of continuous distributions based on dgos. In this regard, readers may refer to the works of Pawlas and Szynal (2001), Ahsanullah (2004, 2005), Mbah and Ahsanullah (2007), Anwar and Athar (2008), Barakat and El-Adll (2009), Khan et al. (2010), Khan and Kumar (2010, 2011), Jaheen and Al Harbi (2011), Athar and Faizan (2011), Kumar (2013a, b), Khan and Khan (2015), Kumar (2016), Li (2016), Kumar and Dey (2017), Khan and Iqrar (2019), Kumar et al. (2020) among others.
However, to the best of our knowledge, there are no reports on GTL distribution based on dgos. The motivation of the paper is two fold: first is to derive the explicit expressions for the single and the product moments based on dgos of GTL distribution. Second is to estimate the parameters of the model from both frequentist and Bayesian view points based on order statistics and lower record values.
The paper is organized as follows. In Section 2, we present preliminaries of dgos. In Section 3, we present explicit expressions for the single moments and product moments of order statistics of dgos of GTL distribution. In the same Section, we reported the mean and variances of order statistics and lower record values. Two methods of estimation namely, maximum likelihood method of estimation and Bayesian method of estimation are discussed in Section 4. To obtain the Bayes estimates, independent gamma priors of the unknown model parameters are used under squared error loss (SEL) function. In Section 5, simluation study is carried out to evaluate the performance of the ML and the Bayes estimates based on root mean squared error (RMSE) and relative absolute bias (RAB). In addition, average length (AL) and coverage percentages (CPs) for the 95% approximate confidence interval (ACI) and highest posterior density (HPD) credible intervals of the parameters under order statistics and lower record values is provided in the same Section. We illustrate the methodology developed in this manuscript and the usefulness of the GTL distribution based on two real-life data sets, one is for order statistics and another is for lower record values in Section 6. Finally, concluding remarks are provided in Section 7.
Some lemmas useful for the derivation of the explicit expressions are provided in APPENDIX. The lemmas make use of the Gauss hypergeometric function and Kampé de Feriet’s Function defined by
and
where
The random variables
for
for
It follows also that the joint pdf of the
for
In this section, we derive explicit expressions and recurrence relations for single and product moments of dgos given a random sample
Relations for single moments of dgos
Here, first we present the explicit expressions and recurrence relations for rth dgos,
where
Proof From Eq. (4), we get
The result follows by using Lemma 1. The proof is complete.
where
That is
as obtained by Zghoul (2011) for
Theorem 2 establishes a recurrence relation for single moments
Throughout, we follow the conventions that
Proof From Eqs (3) and (4), we have
Upon integrating by parts by treating
Replacing
which verify the result of Zghoul (2010) for
and hence for lower records
as obtained by Zghoul (2011) for
Here, first we present the explicit expressions and recurrence relations for rth and sth dgos,
where
Proof From Eq. (5), we have
The result follows by using Lemma 2. The proof is complete.
Proof From Eqs (3) and (5), we have
Integrating by parts and treating
Which verifies the result of Kumar (2012) for
and hence for lower records
Maximum likelihood estimation
Let
The natural logarithm of the likelihood function
By differentiating Eq. (15) partially with respect
and
The maximum likelihood estimator (MLE) of
where
The MLE of
where
To construct the
Practically, by dropping the expectation operator
From Eq. (18), the Fisher’s elements will be
and
Under some regularity conditions, the asymptotic normality of MLEs
where
In this subsection, we focus on obtaining the Bayes estimates of the unknown model parameters
Now, we can write the joint posterior distribution of
where
From Eq. (18) we can observe that it is not possible to obtain the Bayes estimators of
and
It is observed that the conditional posterior distributions of the unknown parameters
To construct the HPD credible intervals of
where
Here
In this section, a Monte Carlo simulation is conducted to examine and compare the performance of the proposed maximum likelihood and Bayes estimators of the unknown parameters
Set two arbitrarily true values of Set hyper-parameters values of For Two special cases of dgos are considered, the first is the order statistics (OS) by taking Using Newton-Raphson iterative method, the MLEs Using Metropolis-Hastings within Gibbs algorithm described in Subsection 4.2, the MCMC Bayes estimates Repeat Steps 3–6 for 5,000 times and obtain the average estimates (AEs) for any parameteric function of
where
Extensive computations were performed using
The AEs of
The AEs of
The ALs of
Lower record data from data 1
From Tables 1–3, we are able to make the following observations. The performances of the proposed estimates of
Also, the ALs of ACI/HPD credible intervals narrow down as
Further, the RMSEs and RABs associated with the BEs of
For interval estimates, the 95% HPD credible intervals are better than ACIs in respect of their ALs and CPs. Moreover, the ACIs of
In this section, we analyze two real data sets to illustrate our established results. The first one based on lower record (LR) and the second one based on order statistics (OS). It is known that OR and LR can be obtained from the dgos as a special case, therefore, the estimators and confidence intervals of the GTL distribution based on LR and OS can be obtained directly from Section 4.
Example I: Analysis of Boeing 720 jet airplanes based on LR
The first data set consists of number of successive failure for the air conditioning system reported of each member in a fleet of 13 Boeing 720 jet airplanes studied by Tahir et al. (2015). Since the maximum number of successive failure is 603, the original data were transformed to be in the interval
The MLEs, Bayes estimates, the corresponding SE (within parentheses) and the confidence/credible interval estimates based on LR data
The MLEs, Bayes estimates, the corresponding SE (within parentheses) and the confidence/credible interval estimates based on LR data
Recovery rate of COVID-19 in Spain
Estimated SF and P-P plots for real data 1.
Example II: Analysis of recovery rate of COVID-19 in Spain based on OS
The second data set presents the daily recovery rate of COVID-19 in Spain from March 3 to May 7. The data consists of 66 daily recovery rate and available in
The MLEs, Bayes estimates, the corresponding SE (within parentheses) and the confidence/ credible interval estimates based OS data
Estimated SF and P-P plots for real data 2.
In this paper, first we have obtained the explicit expression for the single and product moments of dgos from GTL distribution. The results obtained in this paper are more generalized in the sense that it includes the moment of order statistics and lower records from GTL distribution. Further, ML and Bayes methods of estimation are used for estimation of the parameters of the GTL distribution based on order statistics and lower record values. A simulation study is carried out to compare the proposed estimators in terms of RMSE and RAB. In addition, ACIs and HPD credible intervals are compared in terms of their AL and CPs. From simulation and real data analysis, we observe that Bayesian approach is quite satisfactory as compared to non-Bayesian procedure for both OS and LR values. Although many properties of GTL distribution have been discussed recently, it seems that BLUEs/BLUPS of the parameters and prediction of future observations based on ordered data for this distribution have not been investigated yet. The work is in progress and it will be reported later.
Footnotes
Acknowledgments
The authors are grateful for the comments and suggestions by the referees and the associate editor. Their comments and suggestions have greatly improved the article.
Appendix
Then
where
where
Proof We have
where
Then
where
Proof We have
where
Now by using Eq. (28), we obtain
In view of Eq. (25),
where
