In this work, a new family of distribution, which generalizes the Beta Weibull-G family by the introduction of a shape parameter to enhance better fit and flexibility, called the Modified Beta Weibull-G family of distributions is obtained. The mixture representation of the derived family of distributions was discussed, with the results effective in studying moments, moment generating functions, order statistics. Parameters of the family of distributions were estimated using the maximum likelihood estimation method. By utilizing this modified class of distributions, we build a new distribution called the modified beta Weibull Weibull and applied it to engineering datasets. Application revealed a better performance in model fit, compared to some other distributions.
Of recent, many distributions are developed by adding parameter(s) to the baseline distribution, which are more flexible and give A better fit to real-time data. These distributions have gained application to real life situations in engineering, health, finance, education, environmental sciences, economics, e.t.c. Examples of families of distribution include Beta-G ([14]), Weibull-G ([9]), Poisson-G ([1]), Exponentiated-G ([16]),Transmuted-G ([26]), Cubic Transmuted-G ([18]), Exponentiated Chen-G ([7]) to mention a few. With these generated families of distributions, researchers have been enabled to develop new distributions. The generated families have attracted many researchers due to the availability of computational and analytical facilities in most symbolic computation software platforms. Furthermore, with the aid of the mixture representation of the Exponentiated-G distributions, several properties of the derived distribution have been extensively studied.
The Weibull-H family of distributions has been extensively studied and applied to real life datasets due to its flexibility. Some of the studies had considered using the family of distributions in deriving extensions of some baseline distributions. Some of the extended distributions include Weibull-Gamma [5], Weibull-Burr ([6]), Weibull-sigma distribution ([20]), Weibull Exponential distribution ([30]) e.t.c. Given a baseline cumulative distribution function (c.d.f) G(x;σ) with p.d.f g(x;σ), the c.d.f of the Weibull-H distribution ([9]) is
and
where
and x>0, α,φ>0. In a bid to obtain a family of distributions that will further extend and have the Weibull-H family of distribution as subfamily in order to achieve more flexibility and better fitting to the real life data, we considered the Modified Beta-H family of distributions.
The Beta- G family of distributions has also been explored by researchers after it was derived by Eugene et.al(2002) ([14]) and applied to the normal distribution. It has been used as generator to develop new distributions in order to achieve better fit for data and flexibility. Examples of these include Beta-Weibull distribution ([15]), Beta Half logistic distribution ([19]), Beta Type I generalized half logistic distribution ([8]), Beta Exponential distribution ([23]) e.t.c. Using the Beta-G family, some researchers were able to bring new families of distributions which include Beta Transmuted-G ([2]), Beta Weibull G ([21]), Beta-Odd Log-Logistic G ([10]) e.t.c. Nadarajah et.al(2014) ([24]) obtained a modified form of the beta G distribution called the Modified beta distribution. It has the c.d.f as
which equivalently gives
and the corresponding p.d.f as
where , and B (r ; ζ, φ) is an incomplete beta function.ζ, θ, and φ are shape parameters, is the incomplete beta function ratio. The additional parameters θ, ζ, and φ govern the skewness and tail weight of the generated distribution. An attractive feature of this family, just like the Beta G family, is that ζ and φ can afford greater control over the weights in both tails and in the center of the distribution. If ζ=φ=θ =1, we have the h(x;σ) and H(x;σ) of the baseline distribution.
The Modified Beta G has not been extensively used in obtaining generalized distributions. Some of its submodels in literature are Modified Beta Gompertz ([12]), Modified Beta Modified Weibull ([25]) e.t.c. Therefore, in this research, we are combining both the modified beta-G family of distributions and the Weibull-H family of distributions to obtain a new family of distributions that is more flexible and has the potentiality to fit data better than the modified beta-G and the Weibull-G family of distributions.
The plan of the paper is as follows. The Modified Beta Weibull G family of distributions is derived and defined in Section 2. Section 3 discusses the mixture representation of the p.d.f and c.d.f of the family of distributions. In Section 4, some statistical properties of this family of distributions were studied and discussed. Maximum Likelihood Estimation for the parameters of the family of distributions is discussed in Section 5. In Section 6,the family of distributions was applied to two real data sets.Section 7 has the concluding remarks given.
Derivation of the modified beta Weibull-G family of distribution
In this section, the p.d.f and c.d.f of the Modified Beta Weibull- G (MBWG) family of distributions are discussed. Inserting Equations 1 and 2 in Equation 5, we obtained the p.d.f of the MBWG as
The corresponding c.d.f of MBWG is derived by inserting equation 1 in 3 to have
Alternatively, equation 7 can be written as
where
From Equation 8, the survival function of the MBWG is
where v is written as in equation 9. The hazard function hz(x;σ) given as
and the reverse hazard function rhz(x;σ) as
Sub-Family of the MBWG family of distributions
Some of the sub-family of the MBWG family of distributions includes:
1. When θ=ζ=φ=1, we obtain the Weibull-G distribution of Bourguignon et.al(2014)
2. When ζ=φ=1, we have the Modified Weibull-G family of distribution(NEW) as
3. When θ=1, φ=1, we have the Exponentiated Weibull-G family of distributions by Cordeiro et.al(2013)
4. When φ = 1, we have the modified exponentiated Weibull G family of distributions (NEW) as
5. When θ=ζ=φ=ρ=1, we have the Exponentiated-G family of distributions of Bareto-Souzza and Simas(2013)
6. When φ=ρ=1, we have the Modified Beta G family of distributions of Nadarajah et.al(2014)
Submodels of the MBW-G family of distributions
In this section, three(3) special models of the MBWG family of distributions are presented. These models generalize some models in literatures. The models have baselines of Exponential(Ex), Weibull(W) and Frechet(F) distributions.
Modified Beta Weibull Exponential (MBWEx) distribution
The pdf and cdf of exponential distribution are given as
Therefore, the pdf (fMBWEx) and hazard function (hMBWEx) of the MBWEx distribution is given as
and
respectively and x>0,θ,ζ,α,ρ,φ, β > 0.
The MBWEx distribution includes the Weibull Exponential(WE) when θ=ζ=φ=1. For θ=α=ρ=1, the MBWEx becomes Beta Exponential(BE) distribution. For θ=ζ=1, MBEx reduces to Exponentiated Weibull Exponential(EWE) distribution. Plots of the density function and the hazard function of the MBWEx with various assigned parameter values are shown in Fig. 1.
Plots of the Modified Beta Weibull exponential distribution.
Modified Beta Weibull Weibull (MBWW) distribution
The pdf and cdf of Weibull distribution are given as
Now, the pdf fMBWW and hazard function hMBWW of the MBWW distribution is given as
and
respectively, where x>0,θ,ζ,α,ρ,φ, β,λ > 0.
For θ=α=ρ=1, the MBWW becomes Beta Weibull(BW) distribution. For θ=ζ=1, MBWW reduces to Exponentiated Weibull Weibull(EWE) distribution. Plots of the density function and the hazard function of the MBWW with various assigned parameter values are shown in Fig. 2.
Plots of the Modified Beta Weibull Weibull distribution.
Modified Beta Weibull Frechet (MBWF) distribution
The pdf and cdf of Frechet distribution are given as
Now, the pdf of the MBWF fMBWF distribution is given as
with the hazard function hMBWF
respectively where x > 0, θ, ζ, α, ρ, φ, β, λ > 0
The MBWF distribution includes the Weibull Frechet(WF) when θ=ζ=φ=1. For θ=α=ρ=1, the MBWF becomes Beta Frechet(BF) distribution. For θ=ζ=1. Plots of the density function and the hazard function of the MBWF with various assigned parameter values are shown in Fig. 3.
Plot of pdf of the Modified Beta Weibull Frechet distribution.
Linear representation
In this section, a useful mixture representation for the density function of the MBWG distribution is derived. The derived representation is crucial for the derivation of statistical properties of the MBWG distribution such as moments, generating functions, order statistic properties. Using the Binomial Expression given as
where |z|<1 and b>0, Therefore,
and
Inserting the expressions in Equations 12 and 13 in the p.d.f of the MBWG family as in Equation 5, we obtain
Writing expression in Equation 14 in terms of Exponentiated Weibull-G, we have the p.d.f as
where
and
Πw+s+φ is the p.d.f of the Exponentiated Weibull-G family of distributions with power parameters(w+s+φ). Further simplifying equation 14,
Inserting Equation 16 in Equation 14 we have
Re-writing
Therefore,inserting Equation 18 in 17 we have the expression for the p.d.f of the MBWG distribution as
Note that
Using the binomial expression in 20, we have the expression as
Inserting Equation 21 in 19, we have
Expressing Equation 22 in terms of Exp-G distribution, we have the p.d.f as
where
and
Θw+s+φ+d+k is the p.d.f of the Exponentiated-G family of distributions. Expressions obtained in this section find its use when studying properties of the distribution such as moments, order statistics e.t.c.
Furthermore, integrating expression in 23, we have the linear mixture c.d.f of the MBWG family in terms of the Exp-G distribution as
where Qw+s+φ+d+k is the c.d.f of the Exponentiated-G family of distribution
Statistical properties
In this section, the statistical properties of the MBWG distribution are extensively studied. The properties considered are moments, monet generating function, quantile function, and order statistics
Quantile function
The quantile function of the MBWG distribution is obtained by solving
where u is uniform variate on the interval [0,1]. Therefore,
where
Π is the inverse of the incomplete beta function ratio.
Moments
The rth moment of a distribution is given as
Alternatively, using the mixture expression as in equation 23, the rth moment of the distribution is
where Πw+s+φ+d+k is the Exp-G with power parameters w+s+φ+d+k.
For the nth central moment of X (Mn), the expression is obtained as
Inserting the mixture expression of 23 in 27, the expression for the rth moment becomes
The cumulants, ηn, of Xn follow recursively from
where η1= , η2=, η3=-3+. From these expressions, the skewness and kurtosis properties of the family of distribution can be measured from the ordinary moments.
Moment generating function
The moment generating function (Mf (t))= E[etX] of X.In this section, two formula in computing the moment generating function(m.g.f) of the family is discussed.
Firstly, we derived the m.g.f form equation 23 as
where Mw+s+φ+d+k is the m.g.f of the Exp-G with power parameter w+s+φ+d+k.
Secondly, the m.g.f was also derived from the baseline of the quantile function as
where Dw+s+φ+d+k= etqf(u)uw+s+φ+d+k-1∂u and qf (u) is the quantile function corresponding to f(x;σ).
Order statistics
Let X1, . . . , Xn be a random sample from the MBWG distribution, The pdf of ith order statistic, say Xi:n, can be written as
Therefore inserting the p.d.f as in equation 23 and c.d.f as in equation 24 in 30, we have the order statistics of MBWG as
Using an equation given in page 17 of Gradshteyn and Ryzhik ([27]) for a power series raised to a positive integer n, then equation 31 becomes
where r= w+s+φ+d+k and Θr is the pdf of the Exp-G with power parameter r
Parameter estimation
In this section, we derived the expressions for the estimates of the parameters of the MBWG distribution. We employed the use of the maximum likelihood estimation (M.L.E) method. Let δ=(ζ,φ,θ,α,ρ,σ) be the parameter vector and x = (x1, . . . . . , xn) be the sample from the MBWG distribution, then the log-likelihood function for δ can be written as
The elements of the score vector, that is, the partial derivatives of l(δ) with respect to the parameters are
Setting the set of equations in 33–38, to be equals to zero and solving them simultaneously yields the MLE = (,,,,) of δ=(ζ,φ,θ,α,ρ,σ). Solving these equations cannot be done analytically. This can be achieved by the aid of statistical software using iterative methods such as Newton-Raphson type algorithms to solve numerically.
For interval estimation of the model parameters, we require the observed information matrix for interval estimation and test of hypothesis on the parameters (ζ, φ, θ, α, ρ, θ), we obtain a 6x6 unit information matrix
The corresponding elements are derived by the second derivatives of l with respect to the parameters.
Under the conditions that are fulfilled for parameters, the asymptotic distribution of is distribution of δ can be used to construct approximate confidence intervals and confidence regions for the parameters and for the hazard and survival functions. The asymptotic normality is also useful for testing the goodness of fit of the beta type I generalized half logistic distribution and for comparing this distribution with some of its special sub-models using one of these two well-known asymptotically equivalent test statistics namely, the likelihood ratio statistic and Wald statistic. An asymptotic confidence interval with significance level τ, for each parameter δi is given by
where is the ith diagonal element of for i = 1, 2, 3, 4, 5, 6 and zτ/2 is the quantile of the standard normal distribution.
Application to real life data sets
In this section, application of the MBWG distribution was done on two real datasets using the Weibull distribution as the baseline model to illustrate the importance and fit of the MBWG distribution. The maximum likelihood estimates (M.L.E) of the distribution and that of the competitive distributions will be obtained. We assessed the goodness of fit of the distributions using the log-likelihood, Akaike’s information criterion (AIC), Bayesian information criterion (BIC), and corrected Akaike’s information criterion (CAIC). The R statistical software is employed for data analysis. Estimation of model parameter estimates was done using the optim() function in stats packages in R version 4.1.1 [32]. The fit of the MBWW distribution is compared with other competitive distributions which are Weibull exponential (WEx) ([30]), Kumaraswamy Weibull (KW) [31]), Beta Weibull (BW) ([28]) and Exponentiated Weibull Weibull (ExWW) ([29]) distributions. The p.d.fs of these distributions are as follows:
Weibull exponential (WEx) distribution.
Kumaraswamy Weibull (KW) distribution
Beta Weibull (BW) distribution
Exponentiated Weibull Weibull (ExWW) distribution
Data set 1: The first data set represents the breaking strength of 100 yarn as reported by Gomes-Silva et al (2017). The data-set consists of 63 measurements of the strengths of 1.5 cm glass fibres, which were initially collected by United Kingdom National Physical Laboratory staff. The data is presented below:
The second data set represents the breaking stress of carbon fibers of 50 mm length (GPa) which was reported by Nicholas and Padgett (2006). This data was used by Yousof et al. (2017). The data set is:
After analysis of the two datasets, Fig. 6 describes the shape of the hazard plots of the data. It shows that the hazard curve os non-decreasing for the two datasets. Furthermore Fig. 4 is the histogram which further reveals the fit of the model to both datasets. Figure 5 shows how close the fitted pdf is to the empirical distribution which further established the fact that the model fits the data well. In Tables 1 and 2, we have observed that the modified beta weibull weibull distribution gives the best fit when compared to its submodels, therefore making it the preferred model to consider for this data on the basis of the selection criterion considered.
Estimated pdf plots for Data I and Data II.
Estimated cdf plot for the MBWW
TTT plot of MBWW
The MLEs and Information Criteria of the models based on data set 1
Model
-LL
AIC
BIC
CAIC
MBWW
1.5549
0.5758
0.7992
0.1928
2.2280
1.4243
1.5953
11.1390
36.2780
51.2799
38.9446
WEx
5.3583
1.4233
0.8726
-
-
-
-
15.5250
37.0500
43.4794
37.45684
KW
0.5040
0.1729
6.5245
1.2858
-
-
-
15.0980
38.1959
46.7685
38.8856
BW
0.6243
6.4974
7.1824
2.3007
-
-
-
14.6656
37.3311
45.9036
38.3837
EKW
1.8536
5.5418
0.4837
5.0944
2.0662
-
-
14.8422
39.6844
50.4001
40.7371
The MLEs and Information Criteria of the models based on data set 2
Model
-LL
AIC
BIC
CAIC
MBWW
0.6931
0.0250
1.4581
1.2725
0.7170
0.0823
1.5192
67.2027
148.4056
153.345
149.708
WEx
3.2187
1.4841
0.4835
-
-
-
-
86.3347
178.6695
185.2385
179.0566
KW
0.5380
0.1338
3.6983
1.8507
-
-
-
86.3365
180.6732
189.4318
181.3289
BW
0.7280
4.5270
3.9104
4.9847
-
-
-
86.1975
180.3951
189.1537
181.0508
EKW
0.0673
0.1641
2.9947
2.8437
1.4215
-
-
85.0870
180.1732
191.1214
181.1732
Conclusion
In applications to real life situations, there has always been a clear need for extended forms of existing distributions which are more flexible and gives better fit to model real data which has high degree of kurtosis and skewness. In this work, we proposed a new family of distribution called the Modified Beta Weibull Family which generalizes the beta weibull G family by the addition of a shape parameter. Some well known family of distributions are special cases of the Modified Beta Weibull Family. Some mathematical properties of the new class including linear expression of the distribution,,moments,quantile, moment generating functions and order statistics are provided. The model parameters are estimated by the maximum likelihood estimation method and the observed information matrix is determined. We prove empirically by means of an application to a real data set that special cases of the proposed family can give better fits than other models generated from well-known families.
References
1.
AbouelmagdT., HamedM., EbraheimH., The Poisson-G Family of Distributions with Applications.?Pakistan Journal of Statistics and Operation Research13 (2017), 313. doi: 10.18187/pjsor.v13i2.1740.
2.
AfifyA., YousofH.M., NadarajahS., The beta transmuted-H family for lifetime data.?Statistics and Its Interface10 (2017), 505–520. doi: 10.4310/SII.2017.v10.n3.a13.
3.
AfifyA., YousofH.M., CordeiroG., OrtegaE., NofalZ., The Weibull Fréchet Distribution and its Applications, Journal of Applied Statistics43 (2016), 2608–2626. 10.1080/02664763.2016.1142945.
4.
OguntundeP.E., BalogunO.S., OkagbueH.I., BishopS.A., The Weibull-Exponential Distribution: Its Properties and Applications, Journal of Applied Sciences15 (2015), 1305–1311.
5.
KlakattawiH.S., The Weibull-Gamma Distribution: Properties and Applications, Entropy (2019), 21.
6.
IbrahimN.A., KhaleelM.A., MerovciF., KílíçmanA. and
ShitanN., Weibull Burr X distribution properties and application, (2017).
7.
AwodutireP., Statistical Properties and Applications of the Exponentiated Chen-G Family of Distributions: Exponential Distribution as a Baseline Distribution, Austrian Journal of Statistics51(2) (2022), 57–90.
8.
AwodutireP.O., NdukaE.C., IjomahM.A., The Beta Type I Generalized Half Logistic Distribution: Properties and Application, Asian Journal of Probability and Statistics6(2) (2020), 27–41. https://doi.org/10.9734/ajpas/2020/v6i230156.
9.
BourguignonM., SilvaR., GMC., The Weibull-G family of probability distributions, J. Data Sci.12(2014), 53.
10.
CordeiroG., AlizadehM., TahirM., MansoorM., BourguignonM., HamedaniG., The Beta Odd Log-Logistic Generalized Family of Distributions, Hacettepe University Bulletin of Natural Sciences and Engineering Series B: Mathematics and Statistics45 (2015). 10.15672/HJMS.20157311545.
ElbatalI., JamalF., ChesneauC., ElgarhyM., The Modified Beta Gompertz Distribution:Theory and Applications.?Mathematics (2018), 7. doi: 10.3390/math7010003.
13.
ElgarhyM., ShakilM., KibriaB.M., Exponentiated Weibull-Exponential Distribution with Applications, Applications and Applied Mathematics: An International Journal (AAM)12 (2017), 710–725.
14.
EugeneN., LeeC., FamoyeF., Beta-normal distribution and its applications, Commun. Stat.-Theory Methods31 (2002), 497.
15.
FamoyeF., LeeC., OlumoladeO., The beta-Weibull distribution, Journal of Statistical Theory and Applications4 (2005), 121–136.
16.
GuptaR., GuptaP., GuptaR., Modeling failure time data by Lehmann alternatives, Commun. Stat. Theory Methods27 (1998), 887.
17.
HassanA., ElgarhyM., Exponentiated Weibull-Weibull Distribution: Statistical Properties and Applications, Gazi University Journal of Science (2018).
18.
HussainZ., AsgharZ., Cubic Transmuted-G Family of Distributions and Its Properties, Stochastics and Quality Control 33 (2018). doi: 10.1515/eqc-2017-0027.
19.
JoseJ., ManoharanM., Beta half logistic distribution: A new probability model for lifetime data, Journal of Statistics and Management Systems19 (2016), 587–604. 10.1080/09720510.2015.1103457.
20.
KlakattawiH.S., The Weibull-sigma Distribution: Properties and Applications, Entropy19.
21.
MakubateB., OluyedeB., MotobetsoG., HuangS., FagbamigbeA., The Beta Weibull-G Family of Distributions: Model, Properties and Application, International Journal of Statistics and Probability7 (2018), 12. 10.5539/ijsp.v7n2p12.
22.
NadarajahS., GuptaA., The beta Fréchet distribution, Far East Journal of Theoretical Statistics14 (2004).
23.
NadarajahS., KotzS., The beta exponential distribution, Reliability Engineering and System Safety91(6) (2006), 689–697.
24.
NadarajahS., TeimouriM., ShihS., Modified Beta Distributions. Sankhya:, The Indian Journal of Statistics76 (2014), 19–48. doi: 10.1007/s13571-013-0077-0.
ShawW.T., BuckleyI.R., The alchemy of probability distributions: Beyond GramCharlier expansions, and a skew-kurtotic-normal distribution from a rank transmutation map, (2009).
27.
GradshteynI.S., RyzhikI.M., Table of Integrals, Series, and Products, seventh edition, Academic Press, New York. (2007) MR2360010 map.
28.
FamoyeF., LeeC., OlumoladeO., The beta-Weibull distribution, Journal of Statistical Theory and Applications4 (2005), 121–136.
29.
HassanA., ElgarhyM., Exponentiated Weibull-Weibull Distribution: Statistical Properties and Applications, Gazi University Journal of Science32(2) (2018), 616–635.
30.
OguntundeP., BalogunO., OkagbueH., AminaB., The Weibull-Exponential Distribution: Its Properties and Applications, Journal of Applied Fire Science15 (2015), 1305–1311.
31.
CordeiroGauss M., EdwinM.M. and NadarajahS., The Kumaraswamy Weibull distribution with application to failure data, Journal of the Franklin Institute347(8) (2010), 1399–1429.
32.
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. (2020) URL https://www.R-project.org/.