A Comprehensive Method for Ranking Mutual Fund Performance

Abstract

The growing need for guidance in choosing mutual funds has contributed to an abundant literature on approaches to ranking fund performance. However, all the commonly used single index and multi-indexes ranking methodologies have their own drawbacks and are sometimes unsuitable for a reasonable and practical comprehensive ranking of mutual funds. More specifically, single index measures are incomprehensive, controversial, or sometimes even ineffective, while the most widely used multi-indexes methods do not make full use of the evaluation information and are usually indirect and inflexible in practice. This paper proposes a paired competition based multi-indexes comprehensive performance ranking method for mutual funds, which avoids most of the shortcomings of existing approaches and has six good characteristics: it is a comprehensive ranking method that reflects different performance aspects; it provides both cardinal and ordinal information for various practical applications; it integrates both individual evaluation information and joint comparison information to obtain a more accurate and robust ranking scheme; it has a flexible framework for processing complicated data situations; it reveals the true strength of a fund without distortion; and it is convenient to operate in practice. In addition, a more reasonable objective weighting method is proposed to deal with the problem of correlation among indexes.

Keywords

mutual fund performance comprehensive ranking multi-indexes paired competition

Introduction

The scale of the fund industry around the world has made the choice of fund analysis and ranking methods a matter of key importance (Almeida et al., 2020; Fulkerson & Hong, 2021; Kutan, 2018). For example, mutual funds are of key importance to investors in the USA, where three-fourths of families reportedly hold mutual funds (Elton & Gruber, 2020). However, investors face difficulties in choosing a mutual fund among hundreds or thousands of equity funds or bond funds. The need for guidance in making this important decision has led to an abundant body of work on how to measure and rank mutual fund performance (Durán Santomil et al., 2022; Grau-Carles et al., 2019; Mateus et al., 2019; Parida & Teo, 2018; Venkataraman & Rao, 2021; Yu et al., 2022).

The Sharpe ratio, the best-known performance measure in the mutual fund industry, relates the mean excess return of a fund to the standard deviation of its returns under the assumption of normally distributed returns (Sharpe, 1966). It has several drawbacks, as pointed out by Ornelas et al. (2012). The effectiveness of some other classical measures, such as the Treynor ratio and Jensen’s alpha, depends heavily on the validity of the CAPM. However, a large number of subsequent studies show that the CAPM is not a reasonable description of real asset prices. For example, the Fama-French 3-factor model and the Carhart 4-factor model are more effective than the CAPM at describing asset prices (Carhart, 1997; Fama & French, 1993). There is a growing body of metrics that go beyond the mean-variance and CAPM framework and try to overcome the problems of the Sharpe ratio or to reflect other abilities of a mutual fund, such as market timing and performance persistence (Adcock et al., 2020; Cogneau & Hübner, 2009; Cuthbertson et al., 2022; Hassouni & Pirotte, 2022). In addition to these classic indicators, scholars have in recent years also studied mutual fund performance from new angles, including optionable stocks (Chung et al., 2018), portfolio concentration (Fulkerson & Riley, 2019), gross profitability (Kenchington et al., 2019), recession managers (Chen et al., 2021), beta anomaly (Irvine et al., 2022), fund size (Farid & Wahba, 2022), benchmark discrepancies (Cremers et al., 2022), offshore concentration (Bai et al., 2022), and so on.

All the above-mentioned methods are single index evaluation methods, which can evaluate and thus rank funds reasonably well from different performance aspects. However, these single index measures share three drawbacks: (a) Incomprehensive. These indexes reflect either return, timing ability, or performance persistence, so each index captures only one aspect of mutual fund performance. If only one measure is chosen, the resulting ranks are arbitrary because the choice of measure is arbitrary. (b) Controversial. For example, the Sharpe ratio is an appropriate performance measure only when investors believe that risk can be properly measured by the standard deviation, or in a world where returns have a symmetrical distribution (e.g., the normal distribution). In reality, however, many categories of returns have non-normal shapes. As another example, Jensen’s alpha requires the assumptions of the CAPM, which is controversial in the academic literature because the real market is too complex for it to describe well. (c) Ineffective. For instance, the Sharpe ratio is difficult to interpret when it is negative: if risk increases, the Sharpe ratio also increases.
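The last point can be checked with a small numerical sketch. The return and risk figures below are hypothetical and serve only to illustrate the arithmetic of a negative Sharpe ratio.

```python
def sharpe_ratio(mean_excess_return, std_dev):
    """Sharpe ratio: mean excess return divided by the standard deviation of returns."""
    return mean_excess_return / std_dev

# Two hypothetical funds with the same negative mean excess return of -2%.
low_risk = sharpe_ratio(-0.02, 0.10)   # standard deviation of 10%
high_risk = sharpe_ratio(-0.02, 0.20)  # same return, twice the risk

# The riskier fund obtains the higher (less negative) ratio.
print(low_risk, high_risk, high_risk > low_risk)
```

With a negative numerator, doubling the risk halves the magnitude of the ratio, so the riskier fund would be ranked above the safer one.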

Due to these inherent shortcomings of single index measures, it is meaningful to consider multi-indexes comprehensive evaluation or ranking methods that integrate different performance aspects. Compared with the large number of single index methods, only a few studies use multi-indexes approaches to evaluate or rank mutual funds. This small literature can be divided into four types: operational research methods such as data envelopment analysis (Basso & Funari, 2001; Charnes et al., 1985; Gouveia et al., 2018); those based on principal component analysis (Pearson, 1901); those based on intelligent analysis such as neural networks (Indro, 1999; Li & Qu, 2022) or genetic algorithms (Wang & Li, 2002); and those based on multi-criteria decision making (MCDM), such as the simple scoring method, the ideal solution TOPSIS method (Alptekin, 2009; Chang et al., 2010), and other MCDM methods (Alimi et al., 2012; Lee et al., 2009). One advantage of these comprehensive methods is that various aspects of fund performance are considered, although many of them are not easy to operate in practice. However, the existing multi-index ranking methods share the following three shortcomings: (a) Only individual information is used. To develop a good comprehensive ranking method, we should consider both marginal (individual) information and joint (comparison) information. Individual information is provided by the measurement of each single index and is thus a kind of marginal information. Comparison information reflects the mutual relationships among those single index measures and is thus a kind of joint information. One way of using the joint information is to let the funds compete with each other under the different single evaluation indexes. Unfortunately, all the above-mentioned multi-index ranking methods use only the individual evaluation information and ignore the joint comparison information. In short, the existing multi-index approaches depend only on the marginal distribution of each fund, while it is more reasonable to use the information contained in the joint distribution of a pair of funds. (b) Indirect scheme: an evaluation method is borrowed for ranking. The existing multi-index ranking methods in effect use a single ruler (integrating many indexes) to measure mutual fund performance and then rank the funds, which can be viewed as borrowing a metric from an evaluation approach for performance ranking. Evaluation and ranking, however, differ significantly. First, performance evaluation concerns an individual fund, while performance ranking concerns two or more funds. Second, the results of evaluation are cardinal, while the results of ranking are ordinal (and sometimes cardinal). For a ranking problem, cardinal information deserves less attention because it is less important than ordinal information. Third, performance evaluation is absolute and reflects how good a fund is, while performance ranking is relative and reveals which fund is better. In addition, borrowing an evaluation method to rank naturally wastes the comparison information, since evaluation has nothing to do with comparison. (c) Inflexible framework. The existing multi-index methods cannot handle differing data types and missing data well in practice. When the index values have different data types, the common practice is to unify them into one type, which may distort the original data. When some index values of some funds are missing, they are usually filled with typical values (e.g., the mean or median), which is arbitrary and in fact introduces untrue data.

Based on the preceding literature review and analysis, we believe that a scientific mutual fund ranking method should have the following six good characteristics: (a) it should be a comprehensive ranking method that reflects different aspects of mutual fund performance; (b) it should provide both cardinal and ordinal information for various practical applications; (c) it should use both individual information and comparison information in order to obtain a more accurate and effective ranking scheme; (d) it should have a flexible framework that processes complicated data situations reasonably; (e) it should reflect the true strength of a fund without distortion (the ranking score should be proportional to the strength of the fund); and (f) it should be convenient to operate in practice. Fortunately, a paired competition based ranking scheme, first introduced by the mathematician Keener (1993) for the football team ranking problem, has all six characteristics and has been applied to many other sports, university ranking, web page ranking, estimator ranking, etc. (Yin et al., 2018). However, it has not been applied to mutual fund ranking so far. Therefore, this paper combines the paired competition based ranking scheme with the mutual fund ranking problem to obtain a more scientific comprehensive ranking approach.

Paired Competition Based Comprehensive Ranking Method

Motivation

As discussed in the introduction, fund evaluation and fund ranking are closely related but are essentially two different scientific problems. In this paper, we focus on fund ranking. Fund ranking methods can likewise be classified into single index ranking and multi-indexes ranking. In the existing literature and in practice, single index mutual fund ranking sorts the funds directly by their single index measure values, which is natural and reasonable because only a scalar measurement is available in the single index case. However, when multiple indexes are used for fund ranking, the existing literature still ranks the funds directly by a scalar comprehensive evaluation result for each fund, which is unreasonable. This approach actually borrows an evaluation method to deal with a ranking problem; in other words, in dealing with the mutual fund ranking problem, it introduces the sub-problem of mutual fund performance evaluation. This “evaluation and then ranking” scheme has three obvious drawbacks in multi-indexes fund ranking: (a) evaluating first and then ranking uses only the (marginal) individual evaluation information of each index and never uses the (joint) comparison information among the evaluation results (measurements) of different measures, which wastes a great deal of accessible information; (b) we only need to solve the fund ranking problem, but introducing the sub-problem of performance evaluation may bring many new difficulties. For example, data standardization must be solved first in multi-indexes comprehensive fund evaluation, and the rationality of the standardization method directly affects the effectiveness of the evaluation result; and (c) a comprehensive evaluation result without clear economic meaning is used directly as the ranking score of each fund, which makes it difficult to see intuitively how the ranking score relates to the real strength of the fund.

In this part, we propose to rank mutual funds with a paired competition based comprehensive ranking approach, which was first introduced by Keener (1993) for ranking football teams. It uses not only the individual evaluation information of each index but also the comparison information obtained through a paired competition process. It gets around the data normalization problem and reflects the true strength of the funds without distortion. In addition, it is flexible and convenient to operate in practice.

Framework of the Method

Generalized Evaluation Matrix

Suppose that we have $n$ mutual funds to be ranked with $m$ single index evaluation methods. These single index methods could be any of the widely used measures reviewed previously, any other less commonly used objective approaches, or even subjective non-quantitative descriptive indicators (e.g., descriptions like “excellent,” “good,” and “bad”). These different types of single index evaluation results for each fund can then be incorporated in the following generalized evaluation matrix (GEM).

\mathrm{GEM} = [y_{ij}]_{n, m} = \begin{bmatrix} y_{11} & y_{12} & \cdots & y_{1m} \\ y_{21} & y_{22} & \cdots & y_{2m} \\ \vdots & \vdots & \ddots & \vdots \\ y_{n1} & y_{n2} & \cdots & y_{nm} \end{bmatrix}

(1)

where $y_{ij}$, the $(i, j)$th element of the GEM, denotes the measurement of the $i$th fund evaluated under the $j$th index. It should be noted that $y_{ij}$ can be of any data type and can even be empty when a measurement is nonexistent or missing. The GEM is the original measurement (evaluation value) “matrix” with complete evaluation information.

C-Score and C-Matrix

To obtain a final rank vector (RV) for the mutual funds that carries both ordinal and cardinal information and exploits the joint comparison information, we define a C-matrix $C = [c_{ij}]_{n, n}$, where $c_{ij}$, called the C-score here, quantifies the relative performance score that favors the $i$th fund over the $j$th fund under predetermined pairwise comparison functions. Here, “C” stands for contrast, competition, comparison, etc. A C-matrix contains the results of all pairwise comparisons of the funds and can be regarded as a “bridge” between the final rank vector (RV) and the generalized evaluation matrix (GEM). The C-matrix is the critical quantity for an RV. However, the C-score can be set in various ways, depending on practical needs, and the existing methods give no unified mathematical description of it. Therefore, we first propose the following unified formulation of the C-score.

c_{ij} = \sum_{\substack{k = 1 \\ I_{k} \in B_{ij}}}^{m} w_{k} r_{k}(y_{ik}, y_{jk}) + \sum_{\substack{k = 1 \\ I_{k} \in L_{ij}}}^{m} w_{k} p_{k}(y_{ik}, y_{jk}) + \sum_{\substack{k = 1 \\ I_{k} \in T_{ij}}}^{m} w_{k} t_{k}(y_{ik}, y_{jk})

(2)

where $I_{k}$ stands for index $k$, and $B_{ij}$, $L_{ij}$, and $T_{ij}$ denote the sets of indexes under which fund $i$ beats fund $j$, loses to fund $j$, and ties with fund $j$, respectively. The weight $w_{k}$ of evaluation index $k$ will be discussed later. The function $r_{k}(y_{ik}, y_{jk})$ is the reward for fund $i$ beating fund $j$ under index $k$, $p_{k}(y_{ik}, y_{jk})$ is the penalty for fund $i$ losing to fund $j$ under index $k$, and $t_{k}(y_{ik}, y_{jk})$ is the score for fund $i$ tying with fund $j$, which is usually set to 0.

These three types of functions depend on practical requirements and need to be predetermined. In addition, if the measurement under index $k$ for fund $i$ or fund $j$ is nonexistent or missing, then

r_{k}(y_{ik}, y_{jk}) = p_{k}(y_{ik}, y_{jk}) = t_{k}(y_{ik}, y_{jk}) = 0

(3)

for that pair of funds $i$ and $j$, which means that no competition takes place between them under index $k$.

In particular, if we let $p_{k}(y_{ik}, y_{jk}) = 0$ and $t_{k}(y_{ik}, y_{jk}) = 0$ for all $k = 1, 2, \ldots, m$, then the C-score takes the following form:

c_{ij} = g_{ij} = \sum_{\substack{k = 1 \\ I_{k} \in B_{ij}}}^{m} w_{k} r_{k}(y_{ik}, y_{jk})

(4)

where $g_{ij}$, called the relative gain score here, reflects the degree to which the $i$th fund is superior to the $j$th one after the pairwise competition process. In particular, $g_{ij}$ reduces to the winning rate when $r_{k}(y_{ik}, y_{jk})$ is set to one for each $k$ and all indexes are given equal weights.
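As an illustration of equation (4), the following minimal sketch builds the relative gain C-matrix from a small numerical GEM. It rests on several assumptions that are not part of the paper: every index is numerical with larger values being better, the reward is fixed at one, a missing measurement is encoded as NaN, and the function name `gain_c_matrix` and the data are hypothetical.

```python
import numpy as np

def gain_c_matrix(gem, weights):
    """Relative gain C-matrix: c[i, j] is the total weight of the indexes
    under which fund i beats fund j (reward fixed at 1, assumption)."""
    gem = np.asarray(gem, dtype=float)      # shape (n funds, m indexes)
    w = np.asarray(weights, dtype=float)    # shape (m,)
    n = gem.shape[0]
    observed = ~np.isnan(gem)               # False where a measurement is missing
    C = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            # A pair competes on index k only if both measurements exist,
            # which mirrors equation (3).
            both = observed[i] & observed[j]
            with np.errstate(invalid="ignore"):
                wins = both & (gem[i] > gem[j])
            C[i, j] = w[wins].sum()
    return C

# Hypothetical GEM: 3 funds, 3 indexes, one missing measurement.
gem = np.array([
    [0.8, 0.5, 1.2],
    [0.6, 0.7, np.nan],
    [0.4, 0.3, 0.9],
])
C = gain_c_matrix(gem, weights=[1 / 3, 1 / 3, 1 / 3])
print(C)
```

With equal weights each entry is a winning rate. In this toy data the third fund never wins, so its row is entirely zero, which is the kind of situation that calls for care before an eigenvector-based ranking is applied to the C-matrix.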

Similarly, if we let $r_{k}(y_{ik}, y_{jk}) = 0$ and $t_{k}(y_{ik}, y_{jk}) = 0$ for all $k = 1, 2, \ldots, m$, then the C-score becomes:

c_{ij} = l_{ij} = \sum_{\substack{k = 1 \\ I_{k} \in L_{ij}}}^{m} w_{k} p_{k}(y_{ik}, y_{jk})

(5)

where $l_{ij}$, called the relative loss score here, reflects the degree to which the $i$th fund is inferior to the $j$th one after the paired competition process.

After the paired competition process, we can finally obtain the C-matrix:

C = [c_{ij}]_{n, n} = \begin{bmatrix} c_{11} & c_{12} & \cdots & c_{1n} \\ c_{21} & c_{22} & \cdots & c_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ c_{n1} & c_{n2} & \cdots & c_{nn} \end{bmatrix}

(6)

The paired competition process exploits the comparison information as much as possible, and the C-matrix contains the cumulative results of all pairwise comparisons of the funds based on the C-score. The C-matrix therefore contains both the individual evaluation information provided by the single indexes and the comparison information obtained through the paired competition process. The proposed unified formulation of the C-score has four advantages in practice: (a) it avoids the data standardization step, which may be unreasonable in some data situations; (b) it tolerates the absence (nonexistent or missing) of some measurements, so no artificial supplementary data are needed; (c) it allows measurements of different data types (e.g., numerical values, fuzzy linguistic terms, or preferences), thanks to the flexible setting of the three functions for each index $k$; and (d) it introduces index weight parameters, which are crucial in multi-index mutual fund ranking.
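Advantages (b) and (c) can be made concrete with a sketch of equation (2) for a single ordered pair of funds. Everything named here is an illustrative assumption: the helper `c_score`, the grade scale `GRADE`, the use of `None` for a missing measurement, and the choice of reward 1 and penalty 0 (which keeps every C-score non-negative). The tie term is taken as 0.

```python
def c_score(row_i, row_j, weights, compare, reward, penalty):
    """C-score of fund i against fund j following equation (2), tie term = 0.

    compare[k](a, b) > 0 means a beats b on index k, < 0 means a loses,
    and 0 means a tie. None marks a missing measurement.
    """
    total = 0.0
    for k, (a, b) in enumerate(zip(row_i, row_j)):
        if a is None or b is None:
            continue  # equation (3): no competition on this index
        outcome = compare[k](a, b)
        if outcome > 0:
            total += weights[k] * reward[k](a, b)
        elif outcome < 0:
            total += weights[k] * penalty[k](a, b)
    return total

# Index 0 is numerical (higher is better); index 1 is a linguistic grade.
GRADE = {"bad": 0, "good": 1, "excellent": 2}
compare = [lambda a, b: a - b, lambda a, b: GRADE[a] - GRADE[b]]
reward = [lambda a, b: 1.0, lambda a, b: 1.0]
penalty = [lambda a, b: 0.0, lambda a, b: 0.0]
weights = [0.6, 0.4]

fund_a = [0.8, "good"]
fund_b = [0.5, "excellent"]
fund_c = [None, "bad"]   # numerical index missing for this fund

c_ab = c_score(fund_a, fund_b, weights, compare, reward, penalty)
c_ba = c_score(fund_b, fund_a, weights, compare, reward, penalty)
c_ac = c_score(fund_a, fund_c, weights, compare, reward, penalty)
print(c_ab, c_ba, c_ac)
```

No standardization or imputation is needed: the two data types are compared by their own comparison functions, and the missing value simply removes one contest from the pair involving `fund_c`.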

Ranking Vector

To obtain the final ranking vector (RV) for all funds, we first define a performance vector (PV), which describes the unknown inherent true performance of all funds, as:

p = (p_{1}, p_{2}, \ldots, p_{n})'

(7)

where $p_{i}$, $i = 1, 2, \ldots, n$, represents the inherent true performance of the $i$th fund, and $p_{i} > 0$ for all $i$. To obtain the RV, we impose the following requirements: (a) the RV should be related to the PV through the C-matrix, which contains both the individual and the comparison information extracted from the GEM; (b) the elements of the RV and the PV should be of the same order of magnitude. Both requirements are reasonable because the RV and the PV are related and both reveal how good the corresponding funds are. It should be emphasized that the RV is in fact an estimate of the true PV.

Taking the above two requirements into consideration, we assume that the RV and the PV have the following relationship

r = Cp

(8)

where $r$ represents the RV. Then, the following equation for $i$ th fund is obtained:

r_{i} = c_{i1} p_{1} + \cdots + c_{in} p_{n}

(9)

This relationship makes good sense: beating an excellent/strong competitor is more valuable and deserves a higher score. The elements of $p$ represent how excellent/strong the corresponding funds are, and $c_{ij}$ represents by how much the $i$th fund beats the $j$th fund. As mentioned earlier, we require that $r$ and $p$ be of the same order of magnitude. This can be guaranteed if we set

r = λ p

(10)

where $λ$ is a real positive number. So $r$ is actually a scaled version of $p$ . Thus, obtaining RV is equivalent to obtaining PV.

By equations (8) and (10), we can obtain

Cp = λ p

(11)

Finding a PV $p$ thus amounts to finding an eigenvector of $C$, for which we can use the following Perron theorem:

For any $C_{n \times n} \gg 0$ (with all positive elements) or irreducible $C_{n \times n} \geq 0$ (with all non-negative elements), the following two conclusions hold: (a) $C$ has a unique normalized eigenvector $p \gg 0$ (with all positive elements and $\| p \| = 1$); (b) the eigenvalue $λ$ associated with the eigenvector $p$ is positive and has the largest magnitude among the eigenvalues of $C$ (Meyer, 2000). A matrix is called irreducible if it cannot be transformed into a block upper triangular matrix by a permutation similarity. Checking whether a matrix is irreducible is demanding and complicated, so we propose an alternative for practical use: by replacing the zero elements with extremely small positive numbers, we obtain a positive matrix to which the Perron theorem applies directly. This approach is easy to apply and avoids the situation in which no solution exists.
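The zero-replacement device can be sketched as follows. This is an illustrative sketch rather than the authors' implementation: the function name `perron_vector`, the value of `eps`, and the normalization by the sum of the entries (the 1-norm) are assumptions, since the text does not fix a particular norm.

```python
import numpy as np

def perron_vector(C, eps=1e-9):
    """Dominant eigenvalue and positive eigenvector of C (entries sum to 1)
    after zero entries are replaced by a small positive number eps."""
    C = np.asarray(C, dtype=float).copy()
    C[C == 0] = eps                      # make the matrix strictly positive
    eigvals, eigvecs = np.linalg.eig(C)
    k = np.argmax(eigvals.real)          # the Perron root is real and largest
    p = np.abs(eigvecs[:, k].real)       # remove the arbitrary sign of the eigenvector
    return eigvals[k].real, p / p.sum()

# Hypothetical two-fund C-matrix: fund 1 beats fund 2 on 75% of the index weight.
C = np.array([[0.00, 0.75],
              [0.25, 0.00]])
lam, p = perron_vector(C)
print(lam, p)
```

For this small matrix the result can be checked by hand: solving $Cp = λp$ gives $λ = \sqrt{3}/4$ up to the tiny shift caused by `eps`, and $p$ proportional to $(\sqrt{3}, 1)'$, so fund 1 receives the larger ranking score.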

In fact, the unique normalized positive eigenvector PV $p$ can also be determined by the following limit method (Varga, 1962):

p = \lim_{i \to \infty} \frac{C^{i} p^{0}}{\| C^{i} p^{0} \|}

(12)

where $p^{0}$ represents the prior goodness, of which the PV $p$ is actually independent. That is, $p^{0}$ has no effect on the ranking result. Equivalently, let $p^{i + 1} = C p^{i} / \| C p^{i} \|$, $i = 0, 1, \ldots$; then $p = \lim_{i \to \infty} p^{i}$. Therefore, the PV $p$ completely eliminates the influence of the arbitrary prior goodness, and the ranking result given by the PV depends only on the C-matrix. Thus, the PV $p$ makes good use of the comparison information and exploits the C-matrix well.
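The iteration $p^{i + 1} = C p^{i} / \| C p^{i} \|$ is commonly known as power iteration and can be sketched as below. The sketch assumes a strictly positive C-matrix (for instance after the zero replacement described earlier), for which the limit is guaranteed to exist, and it normalizes by the sum of the entries; the matrix and the two starting vectors are hypothetical.

```python
import numpy as np

def power_iteration(C, p0, tol=1e-12, max_iter=10_000):
    """Iterate p <- C p / ||C p||_1 until successive iterates agree within tol."""
    C = np.asarray(C, dtype=float)
    p = np.asarray(p0, dtype=float)
    p = p / p.sum()
    for _ in range(max_iter):
        q = C @ p
        q = q / q.sum()                  # entries stay positive, so sum = 1-norm
        if np.abs(q - p).max() < tol:
            return q
        p = q
    return p

# Hypothetical strictly positive C-matrix for three funds.
C = np.array([[0.1, 0.6, 0.9],
              [0.4, 0.1, 0.7],
              [0.1, 0.3, 0.1]])

p_a = power_iteration(C, p0=[1.0, 1.0, 1.0])     # uniform prior goodness
p_b = power_iteration(C, p0=[10.0, 1.0, 0.1])    # very different prior goodness
print(p_a, p_b)
```

Both starting vectors lead to the same limit, which illustrates that the prior goodness does not influence the ranking. For a non-negative matrix that is irreducible but periodic, this plain iteration may oscillate instead of converging, which is one more practical reason to work with a strictly positive matrix.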

Finally, the ranking score vector $r$ for all funds is naturally obtained since it is only a scaled version of the unique normalized eigenvector $p$ (i.e., the PV). Simply speaking, we can choose the unique normalized eigenvector $p$ as a RV $r$ to rank all funds.

Properties and Advantages of the Proposed Method

We now summarize the properties and advantages of the proposed paired competition based mutual fund multi-indexes comprehensive ranking method.

Properties

A property is a qualitative statement that captures what all quantitative results have in common. Good properties are therefore very important for an excellent performance metric. This is especially true for a mutual fund ranking method, because it cannot be validated by some other, more basic approach: that approach would itself need to be validated. This is why any ranking method must have good properties. Our proposed approach has four attractive properties:

(a) Homogeneity: If all performance metrics indicate $f_{1} \overset{\text{beat}}{\to} f_{2}$ (i.e., fund 1 beats fund 2), then the proposed method also yields $f_{1} \overset{\text{beat}}{\to} f_{2}$.

(b) Invariance: The old rank would not be changed by adding a new metric that matches the old rank.

(c) Monotonicity: Assume that $f_{i} \overset{\text{beat}}{\to} f_{j}$ (i.e., fund $i$ beats fund $j$) is obtained by the proposed method. Then $f_{i} \overset{\text{beat}}{\to} f_{j}$ still holds if $f_{j}$ is measured worse or $f_{i}$ is measured better than before on one or more of the indexes.

(d) Decisiveness: A unique RV is always guaranteed by the proposed method.

Proofs can be found in Yin et al. (2018).

Advantages

Here, we summarize the advantages of our proposed mutual fund multi-indexes comprehensive approach.

(a) The proposed method is a multi-indexes comprehensive ranking method that reflects different aspects of mutual fund performance. Compared with an arbitrarily chosen single index, our method increases the stability and rationality of the final rank.

(b) The proposed method makes good use of both individual evaluation information and joint comparison information, and thus reveals insights into fund performance that cannot be revealed by multi-indexes methods based only on individual information. Moreover, the proposed method also helps in deciding between funds that are indistinguishable by approaches based only on individual information.

(c) The final ranking vector gives ordinal information that determines the rank and also provides cardinal information that shows how much better one fund is than another. Ordinal information is more useful and is guaranteed through a linear mapping.

(d) The ranking vector is proportional or identical to the true performance vector of the mutual funds and is thus effective and reasonable.

(e) The ranking result given by the PV (or equivalently RV) only depends on the C-matrix and is independent of the prior goodness. Thus, PV makes full use of the comparison information and exploits the C-matrix well.

(f) The proposed unified formulation of the C-score tolerates the absence (nonexistent or missing) of some evaluation data, allows different types of data to coexist, and avoids data standardization, which may be unreasonable in some situations. These features make the proposed ranking method more flexible and practicable.

(g) The proposed method is very convenient for practical application because it only needs to find the eigenvector corresponding to the maximum eigenvalue of the C-matrix.

Objective Indexes Weights

Now, we start to deal with the important problem of deciding objective indexes weights which are incorporated in the formulation of the C-score in equation (2).

Three Widely Used Objective Weights

Mean Weight Method

The mean weight method (MW) gives equal weight to each index to ensure the objectivity of the evaluation process to some extent.

w_{j} = \frac{1}{m}, j = 1, 2, . . ., m

(13)

Entropy Weight Method

The entropy weight (EW) method, based on the entropy concept of Shannon and Weaver (1949), reveals the relative importance of each criterion and reflects the contrast intensity of the criteria. It is defined in equation (14).

w_{j} = \frac{d_{j}}{\sum_{k = 1}^{m} d_{k}}, j = 1, 2, . . ., m

(14)

where $d_{j} = 1 - e_{j}$, $e_{j} = - \sum_{i = 1}^{n} p_{ij} \ln (p_{ij}) / \ln (n)$, and $p_{ij} = y_{ij} / \sum_{i = 1}^{n} y_{ij}$.
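Equation (14) can be sketched in a few lines. The sketch assumes that all measurements $y_{ij}$ are strictly positive, since $\ln (p_{ij})$ is undefined otherwise; the function name `entropy_weights` and the data are hypothetical.

```python
import numpy as np

def entropy_weights(Y):
    """Entropy weights of equation (14) for an (n funds, m indexes) array
    of strictly positive measurements."""
    Y = np.asarray(Y, dtype=float)
    n = Y.shape[0]
    P = Y / Y.sum(axis=0)                          # p_ij: share of fund i in index j
    e = -(P * np.log(P)).sum(axis=0) / np.log(n)   # normalized entropy e_j
    d = 1.0 - e                                    # contrast intensity d_j
    return d / d.sum()

# Hypothetical data: index 0 is identical for all funds, index 1 varies.
Y = np.array([[1.0, 1.0],
              [1.0, 2.0],
              [1.0, 3.0]])
w_entropy = entropy_weights(Y)
print(w_entropy)
```

An index on which all funds score the same has maximal entropy and therefore receives essentially zero weight, while all the weight goes to the index that discriminates between the funds.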

CRITIC Method

The CRITIC (CR) method proposed by Diakoulaki et al. (1995) determines objective weights that incorporate both the conflict and the contrast intensity of the criteria. The CR weight is defined in equation (15).

w_{j} = \frac{c_{j}}{\sum_{k = 1}^{m} c_{k}}, j = 1, 2, . . ., m

(15)

where $c_{j} = s_{j} \sum_{k = 1}^{m} (1 - r_{jk})$, $s_{j}$ is the standard deviation of the evaluation values under index $j$, and $r_{jk}$ is the linear correlation coefficient between the measure value vectors $x_{j} = (x_{j1}, x_{j2}, \ldots, x_{jn})'$ and $x_{k} = (x_{k1}, x_{k2}, \ldots, x_{kn})'$ of indexes $j$ and $k$ over the $n$ funds.
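A minimal sketch of equation (15) follows. It assumes that the measurements are numerical and already on comparable scales, uses the sample standard deviation, and takes the correlations from the same sample of funds, as the CR method does; the function name `critic_weights` and the data are hypothetical.

```python
import numpy as np

def critic_weights(Y):
    """CRITIC weights of equation (15) for an (n funds, m indexes) array."""
    Y = np.asarray(Y, dtype=float)
    s = Y.std(axis=0, ddof=1)            # s_j: sample standard deviation per index
    R = np.corrcoef(Y, rowvar=False)     # r_jk: sample correlation between indexes
    c = s * (1.0 - R).sum(axis=1)        # c_j = s_j * sum_k (1 - r_jk)
    return c / c.sum()

# Hypothetical data: index 1 is exactly twice index 0 (correlation 1),
# index 2 is uncorrelated with both.
Y = np.array([[1.0, 2.0,  1.0],
              [2.0, 4.0, -1.0],
              [3.0, 6.0, -1.0],
              [4.0, 8.0,  1.0]])
w_critic = critic_weights(Y)
print(w_critic)
```

The two perfectly correlated indexes contribute nothing to each other's conflict term, so their weights differ only through their standard deviations, while the uncorrelated index is rewarded for carrying independent information.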

As discussed at the beginning of this section, handling index correlation is crucial when determining the index weights, especially in the multi-index fund ranking problem, since measures of similar types may be very strongly correlated. The MW and EW approaches above do not consider index correlation and are therefore inappropriate for our problem. The widely used CR method does account for index correlation, but the way it does so is also unreasonable here. We therefore propose the modified CRITIC (MCR) method below.

Modified CRITIC Method

Motivation

The term $1 - r_{jk}$ in equation (15) means that the stronger the correlation between two measures, the less each contributes to the weight of the other. However, this “weight weakened by correlation” scheme uses the linear correlation between the evaluation results (measurements) of different indexes, which is in fact inappropriate. The flaw in the CR method is that it does not correctly distinguish between the statistical correlation and the sample correlation of two criteria. The two differ as follows.

Statistical correlation: The overall relevance of the two indexes in mathematical sense (representing the intrinsic linear relevance of criteria)

Sample (specific evaluation results) correlation: The relevance degree of the specific evaluation results provided by two indexes for a specific evaluation object (representing the linear correlation of (sampling) evaluation results)

The Proposed Weight

Following the previous discussion, we now present the modified CRITIC (MCR) method in detail. Let $ρ_{jk}$ denote the statistical correlation between the $j$th and $k$th criteria. According to viewpoints 1 and 2, the term $1 - r_{jk}$ in equation (15) should be replaced by $1 - ρ_{jk}$, and the final index weights based on MCR are calculated as follows.

w_{j} = \frac{c_{j}}{\sum_{k = 1}^{m} c_{k}}, j = 1, 2, . . ., m

(16)

where $c_{j} = w_{j}^{p} \sum_{k = 1}^{m} (1 - ρ_{jk})$. Here, $w_{j}^{p}$ can be any prior index weights, chosen according to specific practical requirements.
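Equation (16) can be sketched as follows, taking the statistical correlation matrix as a given input. The function name `mcr_weights`, the default of equal prior weights, and the correlation values are assumptions made for illustration.

```python
import numpy as np

def mcr_weights(rho, prior=None):
    """MCR weights of equation (16) from an (m, m) statistical correlation
    matrix rho and optional prior index weights (equal by default)."""
    rho = np.asarray(rho, dtype=float)
    m = rho.shape[0]
    if prior is None:
        prior = np.full(m, 1.0 / m)      # equal prior weights (assumption)
    prior = np.asarray(prior, dtype=float)
    c = prior * (1.0 - rho).sum(axis=1)  # c_j = w_j^p * sum_k (1 - rho_jk)
    return c / c.sum()

# Hypothetical correlations: indexes 0 and 1 are strongly related,
# index 2 is nearly independent of both.
rho = np.array([[1.0, 0.9, 0.1],
                [0.9, 1.0, 0.1],
                [0.1, 0.1, 1.0]])
w_mcr = mcr_weights(rho)
print(w_mcr)
```

The two strongly correlated indexes share their influence, and the nearly independent index receives the largest weight.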

However, the practical difficulty in using the proposed MCR method lies in obtaining the statistical correlation $ρ_{jk}$ among the indexes. In most cases, the statistical correlation among the evaluation measures cannot be obtained analytically, so estimating it is the key difficulty in applying our proposed weighting method. This paper suggests replacing the statistical correlation with the large-sample correlation, which is justified by the law of large numbers. The large samples can be generated by Monte Carlo sampling or, if available, taken from a larger set of actual evaluation objects. This suggestion is reasonable and practicable and will be verified later.

Validation for Modified CRITIC Method

In this part, a simulation experiment is applied to verify the rationality of our proposed MCR method compared with the original CR method, which ultimately is capable of revealing that the proposal of using large samples correlation instead of statistical correlation is reasonable in reality.

In this simulation, the experiment is done by the following steps.

Step 1: Determine the investment range of the mock mutual funds to be evaluated: such as all Chinese A-shares, SSE 50 Component shares, HS 300 component shares, etc. Here, we select the HS 300 component shares as the basic investment set and denote it as $Ω$ .

Step 2: On the basis of the investment set $Ω$ , $N$ portfolios (i.e., mock mutual funds) are randomly constructed using Monte Carlo method by changing the weights of different shares.

Step 3: All the mock mutual funds are assumed to have been in operation from 2019.1.1 to 2021.12.31. Then, we can get the daily net value of these mock funds for the 3 years.

Step 4: We select Jensen’s alpha, Sharpe ratio, information ratio, and Treynor ratio as the performance metrics to evaluate the generated $N$ mock mutual funds.

Step 5: Calculate the six Pearson correlation coefficients among the selected four indexes: the correlation coefficient of Treynor ratio and Sharpe ratio (TS); the correlation coefficient of Treynor ratio and Jensen’s alpha (TZ); the correlation coefficient of Treynor ratio and information ratio (TI); the correlation coefficient of Sharpe ratio and Jensen’s alpha (SZ); the correlation coefficient of Sharpe ratio and information ratio (SI); and the correlation coefficient of Jensen’s alpha and information ratio (ZI).

Step 6: Repeat step 2 to step 5 1,000 times, and calculate the mean and standard deviation of the six Pearson correlation coefficients.

Step 7: Vary the number of mock mutual funds $N$ from 10 to 200 in steps of 10, and repeat step 2 to step 6 for each value.
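The steps above can be sketched in code. The sketch departs from the experiment in several labeled ways: synthetic one-factor daily returns stand in for the HS 300 constituent data, the benchmark is the equal-weight average of the synthetic stocks, the risk-free rate is zero, portfolio weights are drawn from a Dirichlet distribution, and the grid of $N$ and the number of runs are reduced to keep the example fast. All names and parameter values are hypothetical.

```python
import numpy as np

def fund_metrics(fund_ret, mkt_ret, rf=0.0):
    """Columns: Jensen's alpha, Sharpe ratio, information ratio, Treynor ratio.
    fund_ret has shape (T days, N funds); mkt_ret has shape (T,)."""
    ex_f = fund_ret - rf
    ex_m = mkt_ret - rf
    mkt_c = ex_m - ex_m.mean()
    beta = (ex_f - ex_f.mean(axis=0)).T @ mkt_c / (mkt_c @ mkt_c)
    alpha = ex_f.mean(axis=0) - beta * ex_m.mean()
    sharpe = ex_f.mean(axis=0) / ex_f.std(axis=0, ddof=1)
    active = fund_ret - mkt_ret[:, None]
    info = active.mean(axis=0) / active.std(axis=0, ddof=1)
    treynor = ex_f.mean(axis=0) / beta
    return np.column_stack([alpha, sharpe, info, treynor])

def correlation_spread(stock_ret, n_funds, n_runs, rng):
    """Mean and standard deviation, over n_runs, of the 4x4 Pearson
    correlation matrix of the four metrics (steps 2 to 6)."""
    n_stocks = stock_ret.shape[1]
    mkt = stock_ret.mean(axis=1)                   # equal-weight benchmark (assumption)
    corrs = np.empty((n_runs, 4, 4))
    for run in range(n_runs):
        w = rng.dirichlet(np.ones(n_stocks), size=n_funds)  # random long-only weights
        metrics = fund_metrics(stock_ret @ w.T, mkt)
        corrs[run] = np.corrcoef(metrics, rowvar=False)
    return corrs.mean(axis=0), corrs.std(axis=0)

# Synthetic stand-in for three years of daily returns of 50 stocks.
rng = np.random.default_rng(0)
T, S = 750, 50
market = rng.normal(0.0004, 0.01, size=T)
betas = rng.uniform(0.5, 1.5, size=S)
alphas = rng.normal(0.0, 0.0002, size=S)
stock_ret = alphas + market[:, None] * betas + rng.normal(0.0, 0.015, size=(T, S))

spread = {}
for n_funds in (10, 50, 200):                      # reduced grid for step 7
    mean_c, std_c = correlation_spread(stock_ret, n_funds, n_runs=200, rng=rng)
    spread[n_funds] = std_c
    print(n_funds, std_c[3, 1])                    # std of the Treynor-Sharpe correlation (TS)
```

The printed standard deviations should shrink as the number of funds grows, which is the behavior the experiment is designed to examine. The numerical values depend entirely on the synthetic data and are not comparable with those reported for the HS 300 experiment.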

The simulation results are shown in Figures 1 and 2. From these two figures, we can draw the following two main conclusions:

(a) when the number of funds to be evaluated is small, the standard deviation of the Pearson correlation coefficients among the four indexes is very large. For example, with 10 funds, the standard deviation of the correlation coefficient of Treynor ratio and information ratio (TI) reaches 0.14, which means that the correlation coefficient calculated from a small sample may deviate substantially from the true value. Therefore, it is unreasonable for the CR method to use the correlation coefficients computed from the evaluation results of the particular funds being ranked, especially when the number of funds is small (the small-sample case), because these coefficients carry a large random error;

(b) as the number of mock mutual funds to be evaluated increases, the standard deviation of the Pearson correlation coefficients among all indexes decreases. For example, when the number of mock funds is only 10, the standard deviation of the correlation coefficient TS between Sharpe ratio and Treynor ratio under 1,000 Monte Carlo simulations is about 0.047. When the number of funds reaches 50, the corresponding standard deviation falls to 0.015, and when the number of funds is about 200, it falls to 0.005. This means that using the large-sample correlation in place of the statistical correlation is reasonable in practice: when the number of funds evaluated is large enough, the low standard deviation indicates that the random error of approximating the statistical correlation by the large-sample correlation is very small. It also shows that the random error of the CR method is small, and the method is therefore feasible, when the number of funds to be evaluated in practice is large (e.g., 100 funds).

Figure 1.

Mean of Pearson correlation coefficients among Jensen’s alpha, Sharpe ratio, information ratio, and Treynor ratio for 10 to 200 mock funds with 1,000 Monte Carlo runs.

Figure 2.

Std. of Pearson correlation coefficients among Jensen’s alpha, Sharpe ratio, information ratio, and Treynor ratio for 10 to 200 mock funds with 1,000 Monte Carlo runs.

Empirical Results

Data and Alternative Methods

Data on mutual funds are from the Wind database and comprise net value data for 637 active equity mutual funds in China from January 1, 2019 to December 31, 2021. The resulting data set may have survivorship bias, since the fund sample did not change during the study period. However, we focus on how ranks change across performance metrics rather than across mutual funds, so the choice of data set should not affect the final ranks. Indeed, selecting mutual funds that are alive over the whole period is common practice.

The benchmarks considered in this experiment are the Shanghai Composite Index (SCI), the Shanghai Stock Exchange 50 Index (SSE50), and the China Securities Index 300 (CSI300). The risk-free rate is set to 3%, which approximates the Chinese 10-year Treasury bond rate.

A total of 17 indexes that are widely used in the literature are selected for comparison with our proposed method: 11 risk-adjusted return based performance measures (SR, DSR, ASR, TR, IR, JA, MJA, OSR, SoR, CR, and BR; see Ornelas et al., 2012 for details), 2 indicators reflecting timing ability (the HM and TM coefficients; see Elton & Gruber, 2020 for details), SCT to test performance persistence (Cogneau & Hübner, 2009), and 3 commonly used multi-index ranking methods (SM, PCA, and TOPSIS; see Chang et al., 2010 for details). In addition, the proposed paired competition based mutual fund comprehensive ranking method is abbreviated as PC here. The four multi-index approaches are all based on the measurements of the 14 single index measures.

To examine the influence of different weights in the TOPSIS and PC methods, we compare the ranking results obtained with the MW, CR, and proposed MCR weighting approaches.

To test whether different metrics give different evaluation results/ranks, we calculate the percentage change in ranking and two correlation coefficients among the ranks obtained by different metrics. The two most popular measures for comparing ranks are Kendall’s tau and the Pearson correlation coefficient.
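For concreteness, the two rank-comparison coefficients can be computed as in the following minimal sketch. It assumes that there are no ties among the scores, in which case Kendall’s tau reduces to the simple concordant-minus-discordant form, and it relies on the fact that the Pearson coefficient applied to ranks coincides with Spearman’s rank correlation. The function names and the example scores are hypothetical.

```python
import math


def ranks(scores):
    """Rank 1 goes to the highest score; assumes no tied scores."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    result = [0] * len(scores)
    for position, index in enumerate(order, start=1):
        result[index] = position
    return result


def kendall_tau(r1, r2):
    """Kendall's tau for two rankings without ties."""
    n = len(r1)
    s = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (r1[i] - r1[j]) * (r2[i] - r2[j])
            s += (prod > 0) - (prod < 0)   # +1 concordant, -1 discordant
    return s / (n * (n - 1) / 2)


def pearson(x, y):
    """Sample Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical scores of four funds under two performance metrics.
    metric_a = [0.9, 0.5, 0.7, 0.1]
    metric_b = [0.8, 0.6, 0.2, 0.4]
    ra, rb = ranks(metric_a), ranks(metric_b)
    print(ra, rb, kendall_tau(ra, rb), pearson(ra, rb))
```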

Table 1 shows the mean return, standard deviation, skewness, and excess kurtosis of the funds for daily, weekly, and monthly data. These moments will be used in the calculation of some measures. The data reveal that, on average, skewness was negative and excess kurtosis was positive. This is consistent with the hypothesis that returns may follow a non-normal distribution and thus emphasizes the need for metrics that go beyond the Sharpe ratio. Tables 2 to 4 show that, whether daily, weekly, or monthly data are used, very few of the 637 mutual funds have skewness above 0.5 and most have excess kurtosis above 5.

Table 1.

Mean Return, Standard Deviation, Skewness, and Kurtosis of the Distribution of Returns for 637 Funds.

	Daily data		Weekly data		Monthly data
	M	SD	M	SD	M	SD
Mean	0.0003	0.0006	0.0003	0.0004	0.0003	0.0003
Standard deviation	0.0079	0.0028	0.0052	0.0021	0.0031	0.0015
Skewness	−0.3300	0.3526	−0.2924	0.3013	−0.2647	0.2887
Excess kurtosis	9.1966	8.34875	8.1098	7.5423	7.0893	6.9812
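As a reference for how the moments in Table 1 may be obtained for a single fund, the sketch below uses the population (1/n) central-moment definitions of skewness and excess kurtosis. Which estimator was actually used is not stated here, so this choice is an assumption, and the function name is hypothetical.

```python
import math


def return_moments(returns):
    """Mean, standard deviation, skewness, and excess kurtosis of a series.

    Uses population central moments m_k = (1/n) * sum((r - mean)^k):
    skewness = m3 / m2^1.5 and excess kurtosis = m4 / m2^2 - 3.
    """
    n = len(returns)
    mean = sum(returns) / n
    m2 = sum((r - mean) ** 2 for r in returns) / n
    m3 = sum((r - mean) ** 3 for r in returns) / n
    m4 = sum((r - mean) ** 4 for r in returns) / n
    return {
        "mean": mean,
        "sd": math.sqrt(m2),
        "skewness": m3 / m2 ** 1.5,
        "excess_kurtosis": m4 / m2 ** 2 - 3.0,
    }


if __name__ == "__main__":
    # Toy symmetric series: zero skewness, thinner tails than a normal.
    print(return_moments([-2.0, -1.0, 0.0, 1.0, 2.0]))
```

A normal distribution has zero skewness and zero excess kurtosis under these definitions, so large deviations from zero in either moment indicate non-normal returns.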

Table 2.

Number of Funds by Skewness and Excess Kurtosis of Daily Returns.

Excess kurtosis	Skewness				Total
Excess kurtosis	<−1.5	(−1.5, −0.5)	(−0.5, 0.5)	>0.5	Total
<5	1	6	81	2	90
(5, 10)	3	89	217	1	310
>10	5	149	77	6	237
Total	9	244	375	9	637

Table 3.

Number of Funds by Skewness and Excess Kurtosis of Weekly Returns.

Excess kurtosis	Skewness				Total
Excess kurtosis	<−1.5	(−1.5, −0.5)	(−0.5, 0.5)	>0.5	Total
<5	1	4	75	1	81
(5, 10)	2	73	283	1	359
>10	3	126	65	3	197
Total	6	203	423	5	637

Table 4.

Number of Funds by Skewness and Excess Kurtosis of Monthly Returns.

Excess kurtosis	Skewness				Total
Excess kurtosis	<−1.5	(−1.5, −0.5)	(−0.5, 0.5)	>0.5	Total
<5	0	2	64	0	66
(5, 10)	1	60	354	0	415
>10	1	101	53	1	156
Total	2	163	471	1	637

Ranking Comparison

To obtain the correlation coefficients of the ranks produced by different ranking methods, we randomly select 20 and 100 mutual funds and rank each sample. For each sample size, we perform 1,000 Monte Carlo runs. Tables 5 and 6 report, for 20 and 100 funds respectively, the average correlation coefficients among the ranks given by the 18 methods, together with the standard deviations of these coefficients over the 1,000 Monte Carlo runs. Note that the mean weight (MW) method is used here wherever index weights are required.

Table 5.

Average Pearson Correlation Coefficient and Corresponding Std. Among the Results of 18 Methods for Randomly Selected 20 Funds.

	SR	DSR	ASR	TR	IR	JA	MJA	OSR	SoR	CR	BR	HM	TM	SCT	SM	PCA	TOP	PC
SR	1 (0.00)	.84 (0.07)	.91 (0.02)	.95 (0.02)	.84 (0.07)	.94 (0.02)	.89 (0.02)	.74 (0.06)	.77 (0.05)	.67 (0.08)	.71 (0.06)	.40 (0.03)	.45 (0.02)	.30 (0.03)	.71 (0.03)	.64 (0.04)	.68 (0.04)	.62 (0.06)
DSR	.84 (0.07)	1 (0.00)	.83 (0.07)	.89 (0.04)	.79 (0.04)	.82 (0.04)	.75 (0.04)	.66 (0.07)	.71 (0.08)	.63 (0.04)	.70 (0.07)	.30 (0.04)	.39 (0.05)	.22 (0.05)	.65 (0.04)	.58 (0.03)	.62 (0.06)	.57 (0.05)
ASR	.91 (0.02)	.83 (0.07)	1 (0.00)	.86 (0.06)	.76 (0.08)	.77 (0.07)	.74 (0.03)	.61 (0.07)	.68 (0.06)	.59 (0.02)	.67 (0.04)	.34 (0.06)	.30 (0.03)	.20 (0.05)	.63 (0.07)	.59 (0.04)	.61 (0.07)	.56 (0.09)
TR	.95 (0.02)	.89 (0.04)	.86 (0.06)	1 (0.00)	.83 (0.07)	.97 (0.02)	.83 (0.08)	.75 (0.08)	.78 (0.04)	.70 (0.04)	.74 (0.04)	.44 (0.04)	.38 (0.03)	.23 (0.03)	.73 (0.03)	.63 (0.04)	.67 (0.05)	.62 (0.07)
IR	.84 (0.07)	.79 (0.04)	.76 (0.08)	.83 (0.07)	1 (0.00)	.84 (0.05)	.71 (0.04)	.68 (0.05)	.71 (0.07)	.67 (0.07)	.70 (0.05)	.40 (0.04)	.36 (0.04)	.28 (0.03)	.67 (0.07)	.59 (0.06)	.65 (0.06)	.59 (0.03)
JA	.94 (0.02)	.82 (0.04)	.77 (0.07)	.97 (0.02)	.84 (0.08)	1 (0.00)	.88 (0.06)	.72 (0.05)	.75 (0.08)	.62 (0.04)	.63 (0.03)	.28 (0.03)	.25 (0.04)	.21 (0.04)	.71 (0.07)	.61 (0.04)	.68 (0.09)	.61 (0.07)
MJA	.89 (0.02)	.75 (0.04)	.74 (0.03)	.83 (0.08)	.71 (0.04)	.88 (0.06)	1 (0.00)	.72 (0.07)	.75 (0.07)	.63 (0.07)	.64 (0.06)	.26 (0.03)	.30 (0.04)	.19 (0.04)	.69 (0.06)	.60 (0.03)	.63 (0.07)	.62 (0.03)
OSR	.74 (0.06)	.66 (0.07)	.61 (0.07)	.75 (0.08)	.68 (0.05)	.72 (0.05)	.72 (0.07)	1 (0.00)	.90 (0.04)	.70 (0.04)	.80 (0.08)	.31 (0.05)	.35 (0.04)	.24 (0.04)	.65 (0.03)	.58 (0.04)	.61 (0.04)	.59 (0.09)
SoR	.77 (0.05)	.71 (0.08)	.68 (0.06)	.78 (0.04)	.71 (0.07)	.75 (0.08)	.75 (0.07)	.90 (0.04)	1 (0.00)	.72 (0.09)	.77 (0.07)	.35 (0.04)	.34 (0.05)	.28 (0.04)	.66 (0.05)	.59 (0.07)	.63 (0.03)	.59 (0.05)
CR	.67 (0.08)	.63 (0.04)	.59 (0.02)	.70 (0.04)	.67 (0.07)	.62 (0.04)	.63 (0.07)	.70 (0.04)	.72 (0.09)	1 (0.00)	.67 (0.09)	.20 (0.03)	.23 (0.03)	.31 (0.03)	.55 (0.05)	.54 (0.05)	.52 (0.07)	.53 (0.04)
BR	.71 (0.06)	.70 (0.07)	.67 (0.04)	.74 (0.04)	.70 (0.05)	.63 (0.03)	.64 (0.06)	.80 (0.08)	.77 (0.07)	.67 (0.09)	1 (0.00)	.25 (0.04)	.28 (0.05)	.27 (0.03)	.60 (0.03)	.53 (0.07)	.53 (0.07)	.54 (0.05)
HM	.40 (0.03)	.30 (0.04)	.34 (0.06)	.44 (0.04)	.40 (0.04)	.28 (0.03)	.26 (0.03)	.31 (0.05)	.35 (0.04)	.20 (0.03)	.25 (0.04)	1 (0.00)	.71 (0.05)	.18 (0.04)	.31 (0.09)	.44 (0.06)	.27 (0.07)	.37 (0.03)
TM	.45 (0.02)	.39 (0.05)	.30 (0.03)	.38 (0.03)	.36 (0.04)	.25 (0.04)	.30 (0.04)	.35 (0.04)	.34 (0.05)	.23 (0.03)	.28 (0.05)	.71 (0.05)	1 (0.00)	.20 (0.06)	.33 (0.07)	.45 (0.09)	.32 (0.03)	.39 (0.05)
SCT	.30 (0.03)	.22 (0.05)	.20 (0.05)	.23 (0.03)	.28 (0.03)	.21 (0.04)	.19 (0.04)	.24 (0.04)	.28 (0.04)	.31 (0.03)	.27 (0.03)	.18 (0.04)	.20 (0.06)	1 (0.00)	.24 (0.07)	.40 (0.06)	.21 (0.09)	.33 (0.04)
SM	.71 (0.03)	.65 (0.04)	.63 (0.07)	.73 (0.03)	.67 (0.07)	.71 (0.07)	.69 (0.06)	.65 (0.03)	.66 (0.05)	.55 (0.05)	.60 (0.03)	.31 (0.09)	.33 (0.07)	.24 (0.07)	1 (0.00)	.57 (0.07)	.82 (0.03)	.67 (0.04)
PCA	.64 (0.04)	.58 (0.03)	.59 (0.04)	.63 (0.04)	.59 (0.06)	.61 (0.04)	.60 (0.03)	.58 (0.04)	.59 (0.07)	.54 (0.05)	.53 (0.07)	.44 (0.06)	.45 (0.09)	.40 (0.06)	.57 (0.07)	1 (0.00)	.61 (0.04)	.54 (0.08)
TOP	.68 (0.04)	.62 (0.06)	.61 (0.07)	.67 (0.05)	.65 (0.06)	.68 (0.09)	.63 (0.07)	.61 (0.04)	.63 (0.03)	.52 (0.07)	.53 (0.07)	.27 (0.07)	.32 (0.03)	.21 (0.09)	.82 (0.03)	.61 (0.04)	1 (0.00)	.65 (0.05)
PC	.62 (0.06)	.57 (0.05)	.56 (0.09)	.62 (0.07)	.59 (0.03)	.61 (0.07)	.62 (0.03)	.59 (0.09)	.59 (0.05)	.53 (0.04)	.54 (0.05)	.37 (0.03)	.39 (0.05)	.33 (0.04)	.67 (0.04)	.54 (0.08)	.65 (0.05)	1 (0.00)

Table 6.

Average Pearson Correlation Coefficient and Corresponding Std. Among the Results of 18 Methods for Randomly Selected 100 Funds.

	SR	DSR	ASR	TR	IR	JA	MJA	OSR	SoR	CR	BR	HM	TM	SCT	SM	PCA	TOP	PC
SR	1 (0.00)	.86 (0.02)	.93 (0.01)	.97 (0.01)	.87 (0.02)	.96 (0.01)	.91 (0.01)	.77 (0.02)	.78 (0.02)	.69 (0.03)	.74 (0.02)	.42 (0.01)	.47 (0.01)	.32 (0.01)	.72 (0.01)	.67 (0.01)	.70 (0.01)	.64 (0.02)
DSR	.86 (0.02)	1 (0.00)	.85 (0.02)	.90 (0.01)	.81 (0.02)	.83 (0.01)	.76 (0.02)	.68 (0.03)	.73 (0.03)	.65 (0.02)	.73 (0.03)	.33 (0.01)	.41 (0.02)	.25 (0.02)	.68 (0.01)	.59 (0.01)	.65 (0.02)	.59 (0.02)
ASR	.93 (0.01)	.85 (0.02)	1 (0.00)	.89 (0.02)	.78 (0.03)	.78 (0.02)	.75 (0.01)	.63 (0.03)	.70 (0.02)	.60 (0.01)	.69 (0.01)	.35 (0.02)	.32 (0.01)	.22 (0.02)	.66 (0.03)	.62 (0.01)	.62 (0.03)	.58 (0.03)
TR	.97 (0.01)	.90 (0.01)	.89 (0.02)	1 (0.00)	.84 (0.02)	.98 (0.01)	.85 (0.03)	.77 (0.03)	.79 (0.01)	.71 (0.02)	.75 (0.02)	.46 (0.02)	.40 (0.01)	.26 (0.01)	.74 (0.01)	.66 (0.02)	.70 (0.02)	.64 (0.03)
IR	.87 (0.02)	.81 (0.02)	.78 (0.03)	.84 (0.02)	1 (0.00)	.86 (0.04)	.72 (0.01)	.71 (0.02)	.72 (0.03)	.68 (0.03)	.73 (0.02)	.43 (0.02)	.38 (0.02)	.30 (0.01)	.70 (0.03)	.60 (0.02)	.68 (0.02)	.62 (0.01)
JA	.96 (0.01)	.83 (0.01)	.78 (0.02)	.98 (0.01)	.86 (0.04)	1 (0.00)	.89 (0.02)	.74 (0.02)	.77 (0.02)	.63 (0.01)	.65 (0.01)	.31 (0.01)	.28 (0.01)	.24 (0.01)	.73 (0.02)	.63 (0.02)	.71 (0.03)	.64 (0.02)
MJA	.91 (0.01)	.76 (0.02)	.75 (0.01)	.85 (0.03)	.72 (0.01)	.89 (0.02)	1 (0.00)	.73 (0.03)	.77 (0.03)	.65 (0.03)	.66 (0.02)	.29 (0.01)	.33 (0.01)	.22 (0.01)	.71 (0.02)	.62 (0.01)	.64 (0.02)	.64 (0.01)
OSR	.77 (0.02)	.68 (0.03)	.63 (0.03)	.77 (0.03)	.71 (0.02)	.74 (0.02)	.73 (0.03)	1 (0.00)	.93 (0.01)	.71 (0.02)	.83 (0.03)	.34 (0.02)	.38 (0.01)	.26 (0.01)	.67 (0.01)	.61 (0.02)	.63 (0.01)	.61 (0.04)
SoR	.78 (0.02)	.73 (0.03)	.70 (0.02)	.79 (0.01)	.72 (0.03)	.77 (0.02)	.77 (0.03)	.93 (0.01)	1 (0.00)	.73 (0.03)	.80 (0.03)	.36 (0.02)	.37 (0.02)	.31 (0.02)	.67 (0.02)	.62 (0.02)	.66 (0.01)	.62 (0.02)
CR	.69 (0.03)	.65 (0.02)	.60 (0.01)	.71 (0.02)	.68 (0.03)	.63 (0.01)	.65 (0.03)	.71 (0.02)	.73 (0.03)	1 (0.00)	.68 (0.03)	.22 (0.01)	.25 (0.01)	.33 (0.01)	.57 (0.02)	.56 (0.02)	.53 (0.02)	.56 (0.01)
BR	.74 (0.02)	.73 (0.03)	.69 (0.01)	.75 (0.02)	.73 (0.02)	.65 (0.01)	.66 (0.02)	.83 (0.03)	.80 (0.03)	.68 (0.03)	1 (0.00)	.28 (0.01)	.31 (0.02)	.28 (0.01)	.63 (0.01)	.54 (0.03)	.55 (0.03)	.56 (0.02)
HM	.42 (0.01)	.33 (0.01)	.35 (0.02)	.46 (0.02)	.43 (0.02)	.31 (0.01)	.29 (0.01)	.34 (0.02)	.36 (0.02)	.22 (0.01)	.28 (0.01)	1 (0.00)	.72 (0.02)	.21 (0.02)	.33 (0.04)	.47 (0.03)	.30 (0.03)	.39 (0.01)
TM	.47 (0.01)	.41 (0.02)	.32 (0.01)	.40 (0.01)	.38 (0.02)	.28 (0.01)	.33 (0.01)	.38 (0.01)	.37 (0.02)	.25 (0.01)	.31 (0.02)	.72 (0.02)	1 (0.00)	.21 (0.02)	.34 (0.02)	.46 (0.03)	.35 (0.01)	.42 (0.02)
SCT	.32 (0.01)	.25 (0.02)	.22 (0.02)	.26 (0.01)	.30 (0.01)	.24 (0.01)	.22 (0.01)	.26 (0.01)	.31 (0.02)	.33 (0.01)	.28 (0.01)	.21 (0.02)	.21 (0.02)	1 (0.00)	.25 (0.02)	.41 (0.02)	.22 (0.03)	.36 (0.01)
SM	.72 (0.01)	.68 (0.01)	.66 (0.03)	.74 (0.01)	.70 (0.03)	.73 (0.02)	.71 (0.02)	.67 (0.01)	.67 (0.02)	.57 (0.02)	.63 (0.01)	.33 (0.04)	.34 (0.02)	.25 (0.02)	1 (0.00)	.59 (0.03)	.85 (0.01)	.70 (0.01)
PCA	.67 (0.01)	.59 (0.01)	.62 (0.01)	.66 (0.02)	.60 (0.02)	.63 (0.02)	.62 (0.01)	.61 (0.02)	.62 (0.02)	.56 (0.02)	.54 (0.03)	.47 (0.03)	.46 (0.03)	.41 (0.02)	.59 (0.03)	1 (0.00)	.64 (0.02)	.55 (0.03)
TOP	.70 (0.01)	.65 (0.02)	.62 (0.03)	.70 (0.02)	.68 (0.02)	.71 (0.03)	.64 (0.02)	.63 (0.01)	.66 (0.01)	.53 (0.02)	.55 (0.03)	.30 (0.03)	.35 (0.01)	.22 (0.03)	.85 (0.01)	.64 (0.02)	1 (0.00)	.67 (0.02)
PC	.64 (0.02)	.59 (0.02)	.58 (0.03)	.64 (0.03)	.62 (0.01)	.64 (0.02)	.64 (0.01)	.61 (0.04)	.62 (0.02)	.56 (0.01)	.56 (0.02)	.39 (0.01)	.42 (0.02)	.36 (0.01)	.70 (0.01)	.55 (0.03)	.67 (0.02)	1 (0.00)

From Tables 5 and 6, we find the following.

(a) Different evaluation/ranking methods produce different evaluation results/ranks for mutual funds. The Pearson correlation coefficient between methods of the same kind is higher than that between methods of different classes. For example, the average correlation coefficient between the results of Sharpe ratio and Treynor ratio is more than 0.9, while that between Sharpe ratio and the HM coefficient is only about 0.4.

(b) The correlation coefficient between the four multi-indexes ranking methods and the 14 single index methods is relatively low, generally less than 0.7. This is because a multi-indexes method integrates the characteristics of multiple single index methods and can reflect the comprehensive performance of the funds in different aspects.

(c) The PCA method naturally takes the correlation of indicators into account. Therefore, compared with the other multi-indexes approaches using the mean weight method, the ranking correlation of the PCA method differs little from one single index to another.

(d) As the number of funds to be evaluated increases, the standard deviation of the correlation among the indicators decreases significantly. This indicates that replacing the statistical correlation coefficient among indexes with the correlation coefficient computed from a small sample of funds introduces a large error. Conversely, when the number of funds to be evaluated is large, for example 100 funds, the standard deviation of the correlation coefficients becomes very low, which means that it is reasonable to use the large-sample correlation coefficient instead of the statistical correlation coefficient. This is the original motivation of our MCR method.

(e) The ranking correlation between our proposed method and the other multi-indexes methods (SM, PCA, and TOPSIS), which also integrate multiple single index results, is fairly low (no more than 0.7). The reason is that, compared with the other multi-indexes methods, our method uses not only the individual evaluation information given by the 14 single indicators but also the paired comparison information of these measures.

The results in Tables 5 and 6 still do not make clear whether different methods give different ranks. To address this, we measure the change in the position of each fund under every approach, taking the proposed method as the benchmark. We calculate the mean absolute change, the maximum downward movement, and the maximum upward movement in the ranks, together with their standard deviations. Results appear in Tables 7 to 9.
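The three rank-change statistics can be illustrated with the short sketch below. Their exact definitions are not spelled out here, so the sketch makes two assumptions: an upward movement means that a fund receives a better (numerically smaller) rank under the compared method than under the benchmark, and each change is expressed as a fraction of the number of funds. The function name and the toy rankings are hypothetical.

```python
def rank_changes(benchmark_ranks, method_ranks):
    """Max upward, max downward, and mean absolute rank change.

    All three values are fractions of the number of funds n, so 0.25
    corresponds to 25%. A positive move means the fund is ranked better
    (smaller rank number) by the method than by the benchmark.
    """
    n = len(benchmark_ranks)
    moves = [b - m for b, m in zip(benchmark_ranks, method_ranks)]
    return {
        "max_upward": max(0, max(moves)) / n,
        "max_downward": max(0, -min(moves)) / n,
        "mean_abs": sum(abs(d) for d in moves) / n / n,
    }


if __name__ == "__main__":
    benchmark = [1, 2, 3, 4, 5]   # ranks under the benchmark method
    other = [5, 1, 2, 3, 4]       # ranks of the same funds under another method
    print(rank_changes(benchmark, other))
```

In a Monte Carlo setting, these statistics would be computed for each random sample of funds and then averaged, with the standard deviation taken across runs.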

Table 7.

Changes With Respect to the Proposed PC Method for Randomly Selected 50 Funds.

	Average max upward	Average max downward	Average mean absolute change
SR	63% (0.07)	51% (0.08)	38% (0.06)
DSR	63% (0.07)	60% (0.06)	45% (0.04)
ASR	62% (0.11)	67% (0.11)	48% (0.08)
TR	58% (0.09)	57% (0.08)	41% (0.07)
IR	53% (0.05)	56% (0.04)	36% (0.03)
JA	63% (0.09)	61% (0.09)	44% (0.06)
MJA	62% (0.04)	65% (0.04)	42% (0.04)
OSR	65% (0.10)	63% (0.10)	42% (0.09)
SoR	54% (0.07)	52% (0.08)	36% (0.06)
CR	78% (0.06)	66% (0.06)	52% (0.03)
BR	62% (0.07)	58% (0.07)	45% (0.06)
HM	83% (0.04)	76% (0.06)	60% (0.03)
TM	85% (0.07)	79% (0.06)	59% (0.06)
SCT	87% (0.06)	88% (0.06)	66% (0.04)
SM	64% (0.06)	53% (0.06)	38% (0.04)
PCA	64% (0.07)	65% (0.08)	50% (0.06)
TOP	56% (0.07)	52% (0.06)	35% (0.04)

Note. The mean weight (MW) method is used for TOPSIS and PC.

Table 8.

Changes With Respect to the Proposed PC Method for Randomly Selected 50 Funds.

	Average max upward	Average max downward	Average mean absolute change
SR	66% (0.11)	71% (0.11)	48% (0.09)
DSR	60% (0.11)	73% (0.11)	46% (0.08)
ASR	79% (0.15)	74% (0.15)	53% (0.12)
TR	62% (0.12)	68% (0.10)	48% (0.10)
IR	59% (0.09)	65% (0.08)	45% (0.07)
JA	67% (0.12)	67% (0.13)	46% (0.10)
MJA	59% (0.09)	64% (0.07)	40% (0.07)
OSR	63% (0.12)	71% (0.14)	50% (0.13)
SoR	64% (0.12)	73% (0.11)	50% (0.10)
CR	68% (0.10)	71% (0.09)	52% (0.08)
BR	68% (0.09)	75% (0.09)	53% (0.09)
HM	81% (0.09)	74% (0.09)	59% (0.08)
TM	75% (0.12)	80% (0.08)	55% (0.09)
SCT	73% (0.10)	75% (0.09)	58% (0.06)
SM	66% (0.10)	65% (0.10)	47% (0.09)
PCA	66% (0.13)	57% (0.13)	40% (0.11)
TOP	63% (0.10)	69% (0.10)	42% (0.08)

Note. The CRITIC (CR) method is used for TOPSIS and PC.

Table 9.

Changes With Respect to the Proposed PC Method for Randomly Selected 50 Funds.

	Average max upward	Average max downward	Average mean absolute change
SR	58% (0.07)	65% (0.08)	40% (0.07)
DSR	58% (0.06)	65% (0.06)	40% (0.04)
ASR	71% (0.11)	66% (0.10)	50% (0.08)
TR	65% (0.10)	61% (0.09)	41% (0.07)
IR	58% (0.05)	57% (0.03)	43% (0.03)
JA	67% (0.10)	63% (0.10)	47% (0.06)
MJA	65% (0.05)	69% (0.03)	47% (0.04)
OSR	83% (0.10)	71% (0.11)	56% (0.09)
SoR	77% (0.07)	73% (0.08)	51% (0.08)
CR	81% (0.07)	81% (0.06)	56% (0.02)
BR	72% (0.08)	73% (0.07)	53% (0.08)
HM	85% (0.03)	78% (0.05)	61% (0.04)
TM	86% (0.06)	84% (0.07)	60% (0.06)
SCT	73% (0.06)	81% (0.07)	55% (0.03)
SM	67% (0.06)	74% (0.07)	52% (0.05)
PCA	64% (0.09)	64% (0.10)	44% (0.07)
TOP	60% (0.07)	61% (0.06)	42% (0.06)

Note. The modified CRITIC (MCR) method is used for TOPSIS and PC.

It can be seen from Tables 7 to 9 that (a) compared with the MW method (illustrated in Table 7), the CRITIC and MCR based PC methods take the index correlation into account, which makes the difference in percentage rank changes between PC and the single measures smaller. This is because a weighting method that considers the correlation of indicators raises the weight of “out of group” indicators; (b) compared with the multi-indexes methods based on MW and MCR, the standard deviation of the percentage rank changes between PC (with CRITIC) and the single measures becomes larger. This is because the index weights change greatly in each Monte Carlo run when the number of funds to be evaluated is small, which brings additional random error. In addition, Tables 7 to 9 also show that the ranks given by our proposed PC method differ significantly from those of the single index measures: the average max upward and the average max downward values are all above 50%, and some even exceed 80%. Moreover, relative to the other multi-indexes ranking methods, the average max upward and the average max downward values are also more than 50%. These data show that, compared with the single index methods, the proposed PC method, which combines multiple aspects, produces different ranking results, and that, compared with the other multi-indexes methods, the PC method also produces significantly different ranking results because comparison information is used.

Table 10.

Mean and Std. of Pearson and Kendall’s Tau Correlation Coefficients Between 17 Performance Evaluation Results in Sample (3 or 6 Months Samples) and Sharpe Ratios Out of Sample (3 or 6 Months Samples) With Respect to All the 637 Funds.

	Pearson correlation coefficient		Kendall’s tau
	3 months	6 months	3 months	6 months
SR	0.30 (0.06)	0.24 (0.05)	0.34 (0.07)	0.27 (0.07)
DSR	0.28 (0.05)	0.23 (0.06)	0.31 (0.07)	0.25 (0.06)
ASR	0.29 (0.06)	0.23 (0.07)	0.32 (0.06)	0.25 (0.05)
TR	0.27 (0.05)	0.20 (0.05)	0.30 (0.07)	0.23 (0.05)
IR	0.21 (0.06)	0.16 (0.05)	0.25 (0.05)	0.19 (0.06)
JA	0.25 (0.06)	0.19 (0.06)	0.30 (0.06)	0.24 (0.05)
MJA	0.26 (0.06)	0.22 (0.07)	0.29 (0.07)	0.23 (0.05)
OSR	0.22 (0.07)	0.15 (0.06)	0.26 (0.06)	0.20 (0.06)
SoR	0.24 (0.06)	0.19 (0.05)	0.27 (0.06)	0.22 (0.05)
CR	0.19 (0.05)	0.15 (0.06)	0.22 (0.05)	0.17 (0.05)
BR	0.21 (0.06)	0.16 (0.05)	0.25 (0.06)	0.18 (0.06)
HM	0.17 (0.07)	0.12 (0.07)	0.23 (0.05)	0.15 (0.05)
TM	0.15 (0.07)	0.11 (0.06)	0.22 (0.06)	0.14 (0.05)
SM	0.24 (0.04)	0.19 (0.04)	0.29 (0.05)	0.21 (0.04)
PCA	0.27 (0.04)	0.21 (0.03)	0.33 (0.04)	0.22 (0.04)
TOP	0.28 (0.03)	0.22 (0.03)	0.32 (0.04)	0.24 (0.03)
PC	0.30 (0.03)	0.22 (0.02)	0.35 (0.04)	0.25 (0.03)

Note. The modified CRITIC (MCR) method is used for TOPSIS and PC.

To further demonstrate the difference between the CRITIC method and the MCR method, we examine how the correlation coefficients between the results given by the two methods change with the number of funds to be evaluated. The results are shown in Figure 3. As the number of funds to be evaluated/ranked increases, the Pearson and Kendall correlation coefficients between the results of the CR based and MCR based PC methods steadily increase. This suggests that if the number of funds is large in practice, it is also appropriate to use the original CR method.

Figure 3.

Mean and std. of Pearson and Kendall correlation coefficients between the results of the CR based PC method and the MCR based PC method as the number of funds increases from 5 to 200.

Out of Sample Test

To verify the effectiveness and illustrate the out-of-sample predictive ability of our PC-RV method, we first use 16 existing methods and our PC-RV method to rank the performance of all funds in sample, and then use the Sharpe ratio to rank the performance of all funds out of sample. We use the sliding window method for out-of-sample testing: the sample period is set to 3 or 6 months and the window width is set to 1 month. Finally, the mean and standard deviation of the Pearson and Kendall’s tau correlation coefficients between the 17 performance evaluation results in sample and the Sharpe ratios out of sample, with respect to all 637 funds, are calculated and shown in Table 10. The average ranking changes and corresponding std. between the 17 performance evaluation results in sample and the Sharpe ratios out of sample, with respect to all 637 funds, are calculated and shown in Table 11.
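The sliding-window procedure can be outlined as in the sketch below. It is a simplified illustration under several assumptions: windows are measured in numbers of return observations rather than calendar months, the 1-month window width is interpreted as the step by which the window slides, the in-sample and out-of-sample windows have equal length, the risk-free rate is zero, and the in-sample scorer defaults to the Sharpe ratio, although any ranking method could be passed in. All names are hypothetical.

```python
import math
import random
import statistics


def pearson(x, y):
    """Sample Pearson correlation coefficient of two equal-length lists."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def sharpe(returns):
    """Sharpe ratio with a zero risk-free rate (assumption)."""
    return statistics.fmean(returns) / statistics.stdev(returns)


def sliding_window_oos(fund_returns, window, step, scorer=sharpe):
    """Correlate in-sample scores with out-of-sample Sharpe ratios.

    fund_returns: one return series per fund, all of equal length.
    window:       length of both the in-sample and out-of-sample periods.
    step:         number of observations by which the window slides.
    """
    n_obs = len(fund_returns[0])
    correlations = []
    start = 0
    while start + 2 * window <= n_obs:
        in_sample = [scorer(r[start:start + window]) for r in fund_returns]
        out_sample = [sharpe(r[start + window:start + 2 * window])
                      for r in fund_returns]
        correlations.append(pearson(in_sample, out_sample))
        start += step
    return {
        "n_windows": len(correlations),
        "mean": statistics.fmean(correlations),
        "std": statistics.stdev(correlations),
    }


if __name__ == "__main__":
    rng = random.Random(1)
    # Synthetic returns for 6 hypothetical funds over 60 periods.
    funds = [[rng.gauss(0.001, 0.02) for _ in range(60)] for _ in range(6)]
    print(sliding_window_oos(funds, window=20, step=5))
```

Replacing `pearson` with a rank correlation on the scores would give the Kendall-type counterpart of the same statistic.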

Table 11.

Average Ranking Changes and Corresponding Std. Between 17 Performance Evaluation Results in Sample (3 or 6 Months Samples) and Sharpe Ratios Out of Sample (3 or 6 Months Samples) With Respect to All the 637 Funds.

	Average max upward		Average max downward		Average mean absolute change
	3 months	6 months	3 months	6 months	3 months	6 months
SR	76% (0.12)	80% (0.13)	79% (0.12)	82% (0.13)	60% (0.09)	62% (0.09)
DSR	78% (0.11)	83% (0.13)	81% (0.12)	86% (0.13)	61% (0.08)	64% (0.08)
ASR	81% (0.12)	82% (0.12)	85% (0.13)	87% (0.14)	71% (0.09)	72% (0.09)
TR	82% (0.11)	85% (0.13)	84% (0.12)	86% (0.13)	63% (0.09)	66% (0.09)
IR	88% (0.13)	91% (0.14)	90% (0.12)	92% (0.14)	65% (0.08)	67% (0.08)
JA	77% (0.11)	82% (0.12)	79% (0.12)	82% (0.13)	70% (0.09)	74% (0.09)
MJA	78% (0.10)	84% (0.12)	79% (0.11)	85% (0.14)	66% (0.09)	67% (0.09)
OSR	83% (0.13)	86% (0.13)	86% (0.13)	89% (0.14)	67% (0.08)	70% (0.08)
SoR	79% (0.10)	82% (0.11)	81% (0.12)	85% (0.12)	62% (0.08)	65% (0.08)
CR	82% (0.12)	87% (0.12)	85% (0.13)	88% (0.12)	72% (0.10)	73% (0.10)
BR	83% (0.13)	86% (0.13)	85% (0.13)	87% (0.13)	75% (0.10)	77% (0.10)
HM	87% (0.12)	90% (0.14)	91% (0.14)	94% (0.15)	80% (0.11)	84% (0.11)
TM	90% (0.13)	92% (0.14)	92% (0.14)	93% (0.14)	83% (0.12)	85% (0.12)
SM	81% (0.09)	86% (0.10)	85% (0.10)	87% (0.11)	67% (0.08)	68% (0.08)
PCA	79% (0.08)	83% (0.10)	82% (0.09)	85% (0.10)	64% (0.07)	67% (0.07)
TOP	77% (0.08)	84% (0.09)	81% (0.08)	83% (0.09)	62% (0.07)	65% (0.07)
PC	77% (0.07)	82% (0.08)	80% (0.08)	81% (0.09)	62% (0.06)	64% (0.06)

Note. The modified CRITIC (MCR) method is used for TOPSIS and PC.

It can be seen from Table 10 that (a) compared with the 13 single-indicator based methods, the means of the Pearson and Kendall’s tau correlation coefficients between the in-sample performance evaluation results of the four multi-indicator based methods (SM/PCA/TOP/PC) and the out-of-sample Sharpe ratios are generally higher, which shows that the multi-indicator based comprehensive ranking methods have better prediction ability; (b) compared with the other three multi-indicator based methods (SM/PCA/TOP), the corresponding means for our PC-RV method (PC) are generally higher, which shows that our PC-RV method has the best prediction ability of all multi-indicator based methods; (c) compared with the single-indicator based methods, the standard deviations of these correlation coefficients for the four multi-indicator based methods (SM/PCA/TOP/PC) are smaller, which shows that the multi-indicator based comprehensive ranking methods are more stable; (d) compared with the other 16 methods, the standard deviations for our PC-RV method (PC) are the smallest, which shows that our PC-RV method is the most stable of all methods. In addition, Table 11 shows almost the same pattern. These observations indicate that our PC-RV method is more predictive and more stable.

Here, we summarize some significant advantages of our proposed method over previous studies based on the empirical results.

(a) The proposed method is a multi-indexes comprehensive ranking method that can reflect different aspects of mutual fund performance. In previous studies, by contrast, different single-index based evaluation methods each assessed mutual funds from only one particular aspect and therefore produced different results. In addition, the correlation coefficient between our method and the 14 single index methods is relatively low, generally less than 0.7. This is because our method integrates the characteristics of multiple single index methods and can reflect the comprehensive performance of the funds in different aspects.

(b) The proposed method makes good use of both individual evaluation information and joint comparison information, and thus reveals insights about fund performance that cannot be revealed by other multi-indexes methods based only on individual information. The ranking correlation between our proposed method and the previous multi-indexes methods (SM, PCA, and TOPSIS), which also integrate multiple single index results, is fairly low (no more than 0.7). The reason is that, compared with the other multi-indexes methods, our method uses not only the individual evaluation information given by the 14 single indicators but also the paired comparison information of these measures.

(c) The proposed unified formulation of the C-score allows some evaluation data to be absent (non-existent or missing), allows different types of data to coexist, and avoids data standardization, which may be unreasonable in some situations. These features make the proposed ranking method more flexible and practicable. Previous studies need to handle such data in special ways, and the process may be arbitrary and controversial.

(d) The proposed method is very convenient for practical application because it only needs to find the eigenvector corresponding to the maximum eigenvalue of the C-matrix. In previous studies that use, for example, the PCA and TOPSIS methods, the calculation process is relatively complex.
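To illustrate point (d), the eigenvector associated with the largest eigenvalue of a positive matrix can be found by simple power iteration, as in the sketch below. The matrix used is a made-up pairwise comparison matrix, not a C-matrix from this study, and the choice of the right eigenvector normalized to sum to one is an assumption made for the example.

```python
def principal_eigenvector(matrix, tol=1e-12, max_iter=10_000):
    """Dominant right eigenvector of a positive square matrix.

    Power iteration: repeatedly multiply by the matrix and renormalize
    so that the entries sum to one. For a matrix with strictly positive
    entries the iteration converges to a positive vector.
    """
    n = len(matrix)
    v = [1.0 / n] * n
    for _ in range(max_iter):
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        total = sum(w)
        w = [x / total for x in w]
        if max(abs(a - b) for a, b in zip(w, v)) < tol:
            return w
        v = w
    return v


if __name__ == "__main__":
    # Hypothetical 3 x 3 comparison matrix: entry (i, j) expresses how
    # strongly fund i beats fund j.
    compete = [
        [1.0, 2.0, 3.0],
        [0.5, 1.0, 2.0],
        [1.0 / 3.0, 0.5, 1.0],
    ]
    scores = principal_eigenvector(compete)
    ranking = sorted(range(len(scores)), key=lambda i: -scores[i])
    print(scores, ranking)
```

The entries of the resulting vector give cardinal scores, and sorting them gives the ordinal ranking.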

Conclusions

This paper develops a paired competition based ranking scheme for the comprehensive ranking of mutual funds. Compared with any single index, our method increases the stability and reliability of the final rank. The proposed method is a multi-index comprehensive ranking method that can reflect different aspects of mutual fund performance. It makes good use of both individual evaluation information and joint comparison information, and thus reveals insights about fund performance that cannot be revealed by other multi-indexes methods based only on individual information. The ranking result given by the ranking vector depends only on the Compete matrix and is independent of the prior goodness. Thus, the final ranking vector makes full use of the comparison information and exploits the Compete matrix well. Moreover, the proposed unified formulation of the Compete score allows some evaluation data to be absent, allows different types of data to coexist, and avoids data standardization, which may be unreasonable in some situations. These features make the proposed ranking method more flexible and practicable. In addition, this paper proposes a modified CRITIC method to determine the weight of each single index when a multi-indexes ranking approach is used. Our work has demonstrated that the proposed MCR method is more reasonable than the original CRITIC method even when only a few funds need to be ranked. Note that our approach is very convenient for practical application because it only needs to find the eigenvector corresponding to the maximum eigenvalue of the Compete matrix. Finally, this paper provides abundant empirical results to verify the rationality and robustness of the proposed methods.

The work in this paper still has some limitations that need further study. First, the difference between our method and the existing methods in the degree of information utilization has not been clearly quantified theoretically, and we have not been able to quantify the performance superiority of the proposed method over the existing multi-indexes based methods. To address this, we will consider designing a reasonable metric to describe the degree of information utilization and developing an effective method to quantify the performance superiority of our method over the existing methods. In addition, because of copyright restrictions on mutual fund data, the rationality and effectiveness of our method have been verified only in the Chinese market and not in other markets. We will therefore seek more evidence from multiple mainstream financial markets to demonstrate the effectiveness of our method. Moreover, this paper has examined the ability of our method to predict future fund performance only through a limited out-of-sample test against the Sharpe ratio. In future work, we will test the performance prediction ability of our method more thoroughly with out-of-sample mutual fund data. Finally, our method is in fact a general multi-attribute decision-making method, which can be used not only to evaluate the performance of mutual funds but also to evaluate other financial assets with multiple indicators, such as the comprehensive evaluation of stock fundamentals. In future research, we will consider extending this method to the comprehensive performance evaluation of various financial assets.

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Jin Yuan

Xianghui Yuan
