The Ewens sampling formula is a distribution related to the random partition of a positive integer. In this study, we investigate the issue of non-existent solutions in parameter estimation under the distribution. We derive the first and second moments matching estimators to the uniformly minimum variance unbiased estimator (UMVUE) in the asymptotic sense (hereafter referred to as Asymptotic UMUVE) using the Ewens sampling formula. A Monte Carlo simulation study is performed to evaluate the efficiency of the resulting estimators.
AbramowitzM and StegunI.Handbook of mathematical functions with formulas, graphs, and mathematical tables. New York:Dover, (1972).
2.
AntoniakCE.Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. The Ann Stat1974, 2: 1152–1174.
3.
BethlehemJG, KellerWJ, and PannekoekJ.Disclosure control of microdata. J Am Stat Assoc1990, 85: 38–45.
4.
CraneH.The ubiquitous Ewens sampling formula. Stat Sci2016, 31: 1–19.
5.
DasK, JiangJ, and RaoJNK.Mean squared error of empirical predictor. Ann Stat2004, 32: 818–840.
6.
EwensWJ.The sampling theory of selectively neutral alleles. Theor Popul Biol1972, 3: 87–112.
7.
FirthD.Bias reduction of maximum likelihood estimates. Biometrika1993, 80: 27–38.
8.
GoodIJ.The population frequencies of species and the estimation of population parameters. Biometrika1953, 40: 237–264.
9.
HiroseMY, and LahiriP.Estimating variance of random effects to solve multiple problems simultaneously. Ann Stat2018, 46: 1721–1741.
10.
HiroseMY, and ManoS.Asymptotic bias reduction of maximum likelihood estimates via penalized likelihoods with differential geometry, arXiv:2011. 14747.
11.
HoshinoN.Applying Pitman’s sampling formula to microdata disclosure risk assessment. J Off Stat2001, 17: 499.
12.
HoshinoN and TakemuraA.Relationship between logarithmic series model and other superpopulation models useful for microdata disclosure risk assessment. J Japan Stat Soc1998, 28: 125–134.
13.
KingmanJF.The representation of partition structures. J London Math Soc1978, 2: 374–380.
14.
LahiriP and LiH.Generalized maximum likelihood method in linear mixed models with an application in small-area estimation. In: Tenth Islamic Countries Conference on Statistical Sciences Statistics for Development and Good Governance (p. 158). Islamic Countries Society of Statistical Sciences (ISOSS), 2010.
15.
LiH and LahiriP.An adjusted maximum likelihood method for solving small area estimation problems. J Multivar Anal2010, 101: 882–892.
16.
ManoS.Partitions, hypergeometric systems, and Dirichlet processes in statistics. Tokyo:Springer, 2018.
17.
PitmanJ.Exchangeable and partially exchangeable random partitions. Probab Theory Relat Fields1995, 102: 145–158.
SibuyaM.A random clustering process. Ann Inst Stat Math1993, 45: 459–465.
20.
TavareS and EwensWJ.The multivariate Ewens distribution. In:JohnsonNL, KotzS and BalakrishnanN(eds)Multivariate discrete distributions. New York:Wiley, 1997.
21.
TsukudaK.Estimating the large mutation parameter of the Ewens sampling formula. J Appl Probab2017, 54: 42–54.
22.
WattersonGA.The sampling theory of selectively neutral alleles. Adv Appl Prob1974, 6: 463–488.
23.
YoshimoriM and LahiriP.A new adjusted maximum likelihood method for the Fay–Herriot small area model. J Multivar Anal2014, 124: 281–294.