Abstract
Sustainable investment is typically fulfilled by screening of environmental, social, and governance (ESG); the screening strategies are practical and expedite sustainable-investment development. However, the strategies typically build portfolios by a list of good stocks and ignore portfolio completeness. Moreover, there has been limited literature to study the portfolio weights of sustainable investment in the weight space. In such an area, this article contributes to the literature as follows: We extend a conventional portfolio-selection model and impose ESG constraints. We analytically solve our model by computing the efficient frontier and prove that the frontier’s portfolio weights all lie on a ray (half line). By the ray structure, we prove that portfolio selection for sustainable investment and conventional portfolio selection fundamentally possess highly different portfolio weights. Overall, our aim is comparing the portfolio weights of sustainable portfolio selection and of conventional portfolio selection; the comparison result has been unknown until now. The result is important for sustainable investment because portfolio weights are the foundation of portfolio selection and investments. We sample the component stocks of Dow Jones Industrial Average Index from 2004 to 2013 and find that our efficient frontier and the conventional efficient frontier are quite similar. Therefore, in plain financial language, investors can still obtain risk-return performance similar to conventional portfolio selection after imposing strong ESG requirements, although the portfolio weights can be totally different. The result is both an endorsement and a reminder for sustainable investment.
Keywords
Introduction
Sustainable Investment
The Forum of Sustainable and Responsible Investment defines sustainable investment as “an investment discipline that considers environmental, social, and governance (ESG) criteria to generate long-term competitive financial returns and positive societal impact.” 1 Sustainable investment has been rapidly growing since 2000. The Global Sustainable Investment Review reports that global sustainable-investment assets reached $30.7 trillion in 2018 with a 34% increase from 2016. 2
Sustainable investment is typically fulfilled by ESG screening as J. P. Morgan Asset Management reports that “Investing sustainably can be achieved by avoiding companies with poor practices in this area, actively selecting leaders in each field.” 3 The Forum of Sustainable and Responsible Investment and the Principles for Responsible Investment Association report similar strategies. 4
Literature for Sustainable Investment
A large body of sustainable-investment research focuses on the following:
Effect of the screening strategies,
Comparison between the returns of sustainable-investment mutual funds and conventional mutual funds, and
Relationship between ESG performance and financial performance.
For Research Line 1, Auer (2016) systematically utilizes the strategies, compares sustainable investments against passive investments, and finds mixed results. Trinks and Scholtens (2017) and Bertrand and Lapointe (2015) utilize the strategies and analyze the strategies and risk-based allocation strategies for opportunity and report some positive risk-adjusted performance. In contrast, Amel-Zadeh and Serafeim (2018) survey mainstream investment organizations, find the strategies relatively ineffective, and recommend ESG-integration strategies.
For Research Line 2, the scholars typically compare the returns of sustainable-investment mutual funds and the returns of conventional mutual funds. For example, Capelle-Blancard and Monjon (2014) report that “As of 2011, more than fifty academic papers have examined this issue, all using similar methodology. They almost unanimously show that the financial performance of SRI funds does not differ significantly from their conventional peers” (p. 495).
For Research Line 3, for example, Verheyden et al. (2016) document that “one recent review by Arabesque and Oxford University of over 200 studies reports that 90% of those studies found a positive link between ESG and the cost of capital” (p. 47).
Strength and Weakness of the Literature
Because this article is related to Research Lines 1 and 2, we focus on their strength and weakness. On one side, the screening strategies are highly practical and thus substantially expedite sustainable-investment development. However, on the other side, the strategies typically ignore the sustainability of the portfolios as Lydenberg (2016) emphasizes measuring portfolio ESG and integrating system risk in portfolio selection. Moreover, the screening strategies typically build portfolios by a list of good stocks and ignore portfolio completeness because Nobel laureate Markowitz (1959) argues that “we speak of ‘portfolio selection’ rather than ‘security selection.’ A good portfolio is more than a long list of good stocks and bonds. It is a balanced whole” (p. 3).
Research Line 2 basically focuses on only

Traditional research methodology and ours.
Originality of This Article
Up until now, there has been limited literature to explicitly impose ESG as constraints of portfolio selection for sustainable investment, analyze the (whole) efficient frontiers, and study the portfolio weights in the weight space. Therefore, it has not been answered whether portfolio selection for sustainable investment and conventional portfolio selection fundamentally possess different portfolio weights; that is, are the building blocks for sustainable investment different from those of conventional investment?
In such an area, this article contributes to the literature as follows:
We propose our portfolio-selection model for sustainable investment by explicitly imposing ESG as constraints.
We then compute the optimal solutions of our model as a thick black curve in the upper right part of Figure 1. We also depict the optimal solutions of the conventional model as a thick gray curve in the upper right part. Therefore, we compare the two curves instead of basically focusing on only
Fundamentally, we compute the thick black curve’s portfolio weights as a black ray in the weight space in the lower left part of Figure 1. We also compute the thick gray curve’s portfolio weights as a gray ray. Therefore, we compare the two rays. Specifically, we quantitatively compare the rays’ apexes in the lower right part (i.e.,
We sample the component stocks of Dow Jones Industrial Average Index from 2004 to 2013. We find that the thick black curve and thick gray curve are similar but the black ray and gray ray are totally different. The practical implication is that sustainable investors can still obtain risk-return performance similar to that of conventional portfolio selection after imposing strong ESG requirements although the portfolio weights can be totally different from those of conventional portfolio selection. The result is both an endorsement and a reminder for sustainable investment. Endorsement means that sustainable investors can obtain risk-return performance similar to that of conventional investors even after imposing ESG constraints. Reminder means that sustainable investors should mind the highly different portfolio weights.
Overall, our aim is comparing the portfolio weights of sustainable portfolio selection and of conventional portfolio selection; the comparison result has been unknown until now. The result is important for sustainable investment because portfolio weights are the foundation of portfolio selection and investments. Our comparison methodology is portfolio-selection models of Markowitz (1959). No other scholars have conducted the comparison before. Therefore, our methodology is exploratory and based on modeling instead of testing hypotheses.
The rest of this article is organized as follows: We review portfolio selection and portfolio optimization in the next section. We briefly compare major methods of portfolio optimization in the next section. We impose ESG constraints and construct portfolio-selection models for sustainable investment in the next section. Computationally, we calculate the minimum-variance frontier in the next section. We then calculate the efficient frontiers of our model and conventional models in the weight space in the next section. We illustrate the analysis by an example in the next section. We conclude this article in the next section. We put proofs in the appendix.
Portfolio Selection and Portfolio Optimization
Portfolio Selection
Markowitz (1959) emphasizes that “Two objectives, however, are common to all investors . . . 1. They want ‘return’ to be high . . . 2. They want this return to be dependable” (p. 6). Then, for “a balanced whole,” he formulates portfolio selection by two-objective optimization as follows:
where for n stocks,
Markowitz (1959) calls the optimal solutions of Equation 1 as an efficient frontier. For example, the efficient frontier is depicted as a thick gray curve in the upper right part of Figure 1.
Markowitz (1959) then suggests “first, separates efficient from inefficient portfolios; second, portrays . . . efficient portfolios; third, . . . select the combination . . . that best suits his needs; and fourth, determines the portfolio” (p. 7). He computes (whole) efficient frontiers and then asks investors to pinpoint their preferred portfolios from the frontiers, so investors never worry about missing some parts of the frontiers. Therefore, computing (whole) efficient frontiers is important for his suggestions.
Portfolio Optimization
Sharpe (1970) and Merton (1972) study the following model:
where
Merton (1972) applies an ϵ—constraint method (as described by Steuer, 1986) to Equation 2 as follows:
where ϵ is the parameter. The
where symbols a, c, and f are introduced as follows:
The minimum-variance frontier of Equation 4 is depicted as a gray curve in the upper right part of Figure 1. The efficient frontier is the minimum-variance frontier’s upper portion and depicted as a thick gray curve. Merton (1972) computes the minimum-variance portfolio of Equation 2 (i.e., the optimal solution of
Merton (1972) computes the
where
The set is a ray; that is, the ray is generated by
Comparing Major Portfolio-Optimization Methods
We briefly compare major portfolio-optimization methods and depict the result, advantage, and disadvantage in Figure 2.

The result, advantage, and disadvantage of major portfolio-optimization methods.
Analytical Methods
Analytical methods are based on calculus and linear algebra and thus understandable. Merton (1972) analytically solves Equation 2 and obtains complete optimal solution information. For example, the minimum-variance frontier
However, analytical methods are for models with equality constraints only. Unpractically unbounded portfolio weights are thus allowed. Nevertheless, with almost all results in formulae, analytical methods bypass the need of mathematical programming and bring convenience in research and instruction. Huang and Litzenberger (1988) emphasize Equation 2 as “popular because of its analytical tractability and its rich empirical implications” (p. 60).
Parametric Quadratic Programming
Markowitz and Todd (2000) deploy parametric quadratic programming and propose a critical-line algorithm to solve Equation 1. Markowitz and Todd (2000) prove that the efficient frontier is piece-wisely made up by connected parabolic segments. For example, the frontier consists of two segments in the second part of Figure 2. The segment between
Repetitive Quadratic Programming
Repetitive quadratic programming can be formulated as Equation 3. A group of ϵ is preset. Equation 2 is then repetitively solved for each ϵ; the optimal solutions are obtained for approximating the efficient frontier.
Repetitive quadratic programming is easier to understand than parametric quadratic programming and thus described in classic textbooks (e.g., those of Bodie et al., 2018; Elton et al., 2014; and Best, 2010). However, repetitive quadratic programming obtains only incomplete information of efficient frontiers by discrete approximations and thus cannot reveal the frontier structure. In the third part of Figure 2, the gray curve is copied from the second part as a comparison; repetitive quadratic programming obtains three points:
Heuristic Methods
Some models with additional nonlinear formulation are difficult to solve; heuristic methods thus become important provisional tools. Evolutionary algorithms, tabu search, and simulated annealing can be common approaches (as reported by Woodside-Oriakhi et al., 2011).
However, heuristic methods inherently provide suboptimal solutions and only incomplete information of efficient frontiers and cannot reveal the frontier structure. In the fourth part of Figure 2, the methods obtain three points
Imposing ESG Constraints and Constructing Portfolio-Selection Models for Sustainable Investment
We extend Equation 2, explicitly impose ESG as constraints, and obtain the following general model:
where
We introduce the following symbols:
We then rewrite Equation 9 in matrix format as follows:
We inherit the assumptions of Merton (1972) and extend them as follows:
As a covariance matrix, Σ is invertible and thus positive definite.
Assumption 2.
Vector μ and all row vectors of A (altogether
The assumptions are not strict prerequisite because of the following:
By index models in finance (e.g., Bodie et al., 2018, Chapter 8), we obtain
If the
With the vectors for Equation 9 as n-vectors, we know
Analytically Deriving the Minimum-Variance Frontier of Equation 9
We follow Merton (1972) and also apply an ϵ-constraint method to Equation 11 and obtain
where ϵ is the parameter. The application’s financial interpretation is taking ϵ as a target expected portfolio return and minimizing portfolio variance with
We then rewrite all constraints of Equation 12 as
where
We premultiply the first partial-derivative equation by
We then substitute
We propose the following lemma about the invertibility of
Matrix
Then, we premultiply
We substitute
We use symbols
The minimum-variance frontier of Equation 14 is more complicated than Equation 2. Moreover, we prove the frontier also as a parabola in the following theorem:
The minimum-variance frontier of Equation 14 is a parabola in (z1, z2) space of Equation 9.
By Theorem 1 and Equation 12, the parabola of Equation 14 is the boundary of the feasible region Z of Equation 9. Moreover, the efficient frontier of Equation 9 is the parabola’s upper portion. We have already depicted the parabola as a black curve in the upper right part of Figure 1.
Analytically Deriving the Efficient Frontier of Equation 9
We follow Sharpe (1970) and also apply a weighted-sums method (as described by Steuer, 1986) to Equation 11 and obtain:
where λ is the parameter. The application’s financial interpretation is taking λ as a risk-aversion parameter, computing the expected utility (i.e.,
where
We premultiply the first partial-derivative equation by
We substitute
and rearrange it as
We know
and substitute
where
To demonstrate the
where subscript (9) is for Equation 9. We then rewrite the optimal solution as:
where
The
The set is a ray; that is, the ray is generated by
We have already depicted the ray in the lower left part of Figure 1.
Comparing the Efficient Frontiers of Equations 2 and 9 in the Weight Space
We have already computed the
Comparing
We compare
where
If
If
Comparing the x Sets
Although
For the relationship between the
Complete-overlap case requires
We then compare the two
We rearrange Equation 21 into the following linear-equation-system format:
where
We propose the following theorem for the intersection possibility under three exhaustive and exclusive situations:
If
If
If
An Illustration
We utilize the following model on the basis of Equation 9:
Measuring ESG
Carroll (2000) argues for the reason and method to theoretically measure corporate social responsibility (CSR) which typically subsumes ESG. In practice, there are several professional organizations to measure ESG or more broadly measure CSR, for example, RobecoSAM AG, Sustainalytics, the KLD Research & Analytics (KLD), ARESE, and Thomson Reuters ESG data.
On one side, Hopkins (2005) criticizes the measurement systems by demonstrating several indicators as subjectively chosen. Entine (2003) draws similar conclusions. On the other side, Waddock and Graves (1997) review CSR-measure problems and conclude KLD as relatively reliable. Kang (2015), Hart and Sharfman (2015), and Waddock (2003) draw similar conclusions. Overall, Mattingly and Berman (2006), Chatterji et al. (2009), and Hart and Sharfman (2015) relatively comprehensively evaluate KLD ratings. Turker (2009) realizes the measurement methods’ limitation and proposes a new method on the basis of stakeholders’ responsibility. Graafland et al. (2004) utilize similar methods.
KLD adopts the following categories: alcohol, community, corporate governance, diversity, employee relations, environment, firearms, gambling, human rights, military, nuclear power, product, and tobacco. KLD typically divides a category into two subcategories: strengths and concerns, but some categories (e.g., alcohol) have only concerns. KLD then constructs binary variables under all subcategories. Therefore, a category measurement can be computed by adding all the strengths variables under the category and then subtracting all the concerns variables under the category. Specifically,
For environmental of ESG, the category environment can be utilized.
For social of ESG, the summation of categories community, diversity, employee relations, human rights, and product can be utilized.
For governance of ESG, the category corporate governance can be utilized.
Dyck et al. (2019), Halbritter and Dorfleitner (2015), and Galema et al. (2009) utilize similar methods.
Sampling
For Equation 9, we select the following 27 component stocks of Dow Jones Industrial Average Index because they have complete observations whereas the other component stocks do not: Apple (AAPL), American Express (AXP), Boeing (BA), Caterpillar (CAT), Cisco (CSCO), Chevron (CVX), Disney (DIS), Dow Chemical (DOW), Goldman Sachs (GS), Home Depot (HD), IBM (IBM), Intel (INTC), Johnson & Johnson (JNJ), JPMorgan Chase (JPM), Coca-Cola (KO), McDonald’s (MCD), 3M (MMM), Merck (MRK), Microsoft (MSFT), Nike (NKE), Pfizer (PFE), Procter & Gamble (PG), United Health (UNH), United Technologies (UTX), Verizon (VZ), Wal-Mart (WMT), and Exxon Mobil (XOM). (data source: https://us.spindices.com/indices/equity/dow-jones-industrial-average, S&P Dow Jones Indices LLC., October 22, 2019). To save space, we list just brief company names with ticker symbols above.
The sample is made up of the 27 stocks’ monthly return and annual ESG data from January 1, 2004, to December 31, 2013 (data source: database “CRSP” and database “MSCI ESG KLD STATS” via Wharton Research Data Services, https://wrds-web.wharton.upenn.edu/wrds/, October 24, 2019). When we wrote this article, the latest data of “MSCI ESG KLD STATS” was for 2016. However, the ESG data from 2014 to 2016 have many missing values. Therefore, we sample from 2004 to 2013.
We compute the sample covariance matrix and sample mean vector of the stocks’ return and then assume the sample statistics as the population parameters (i.e., Σ and μ, respectively) because Elton et al. (2014) contend that “It is common to use historical risk, return, and correlation as a starting point in obtaining inputs for calculating the efficient frontier” (p. 87). We annualize Σ and μ by the method of Bodie et al. (2018) and report them in Table 1.
The Σ and μ of the 27 Component Stocks of Equation 9.
Note. AAPL = Apple; AXP = American Express; BA = Boeing; CAT = Caterpillar; CSCO = Cisco; CVX = Chevron; DIS = Disney; DOW = Dow Chemical; GS = Goldman Sachs; HD = Home Depot; IBM = IBM Company; INTC = Intel; JNJ = Johnson & Johnson; JPM = JPMorgan Chase; KO = Coca-Cola; MCD = McDonald’s; MMM = 3M; MRK = Merck; MSFT = Microsoft; NKE = Nike; PFE = Pfizer; PG = Procter & Gamble; UNH = United Health; UTX = United Technologies; VZ = Verizon; WMT = Wal-Mart; XOM = Exxon Mobil.
For example, the covariance between AAPL and AXP is .051 in the second row. The expected return of AAPL is .474 in the last row.
We adopt the ESG measurement for KLD, compute ESG from 2004 to 2013, and report them in Table 2. For example, the ESG of AAPL in 2004 is 0, 1, and −1, respectively. Moreover, we compute the sample mean vectors of the ESG and then assume the sample statistics as the population parameters (i.e.,
The
Note. AAPL = Apple; AXP = American Express; BA = Boeing; CAT = Caterpillar; CSCO = Cisco; CVX = Chevron; DIS = Disney; DOW = Dow Chemical; GS = Goldman Sachs; HD = Home Depot; IBM = IBM Company; INTC = Intel; JNJ = Johnson & Johnson; JPM = JPMorgan Chase; KO = Coca-Cola; MCD = McDonald’s; MMM = 3M; MRK = Merck; MSFT = Microsoft; NKE = Nike; PFE = Pfizer; PG = Procter & Gamble; UNH = United Health; UTX = United Technologies; VZ = Verizon; WMT = Wal-Mart; XOM = Exxon Mobil.
For Equation 9, we take:
Solving Equation 9
By Equation 14, we compute the minimum-variance frontier of Equation 9 as follows:
We have already plotted the frontier as a black parabola in Figure 1. The frontier’s upper half is the efficient frontier of (29) and is plotted as a thick curve.
We compute
Note. AAPL = Apple; AXP = American Express; BA = Boeing; CAT = Caterpillar; CSCO = Cisco; CVX = Chevron; DIS = Disney; DOW = Dow Chemical; GS = Goldman Sachs; HD = Home Depot; IBM = IBM Company; INTC = Intel; JNJ = Johnson & Johnson; JPM = JPMorgan Chase; KO = Coca-Cola; MCD = McDonald’s; MMM = 3M; MRK = Merck; MSFT = Microsoft; NKE = Nike; PFE = Pfizer; PG = Procter & Gamble; UNH = United Health; UTX = United Technologies; VZ = Verizon; WMT = Wal-Mart; XOM = Exxon Mobil.
We then compute
Solving Equation 2
As a comparison, we also solve Equation 2 for the 27 component stocks. By Equation 2, we compute the minimum-variance frontier of Equation 2 as follows:
We compute
Note. AAPL = Apple; AXP = American Express; BA = Boeing; CAT = Caterpillar; CSCO = Cisco; CVX = Chevron; DIS = Disney; DOW = Dow Chemical; GS = Goldman Sachs; HD = Home Depot; IBM = IBM Company; INTC = Intel; JNJ = Johnson & Johnson; JPM = JPMorgan Chase; KO = Coca-Cola; MCD = McDonald’s; MMM = 3M; MRK = Merck; MSFT = Microsoft; NKE = Nike; PFE = Pfizer; PG = Procter & Gamble; UNH = United Health; UTX = United Technologies; VZ = Verizon; WMT = Wal-Mart; XOM = Exxon Mobil.
We then compute
Comparing the x Sets of the Efficient Frontiers of Equations 9 and 2
By Equation 20, we compute
Changing e , s , and g to the 50th Percentiles and 90th Percentiles
In addition to the 75th percentile setting for e, s, and g, we also reset them as the 50th percentiles and then reset them as the 90th percentiles. We depict the minimum-variance frontiers (including the efficient frontiers) for the 50th percentiles, 75th percentiles, and 90th percentiles in the upper left, upper right, and lower parts of Figure 3, respectively. Overall, the results of the 50th and 90th percentiles are similar to those of the 75th percentiles.

The results of setting e, s, and g as the 50th, 75th, and 90th percentiles for Equation 9.
Conclusion and Future Direction
Researching sustainable investment is challenging; difficulties in extending conventional portfolio selection are encountered in this article. However, the researching is worthwhile because sustainable investment will no longer be practical experience and will become a theoretically and quantitatively robust tool for investment on the basis of portfolio selection.
Footnotes
Appendix
Acknowledgements
The authors greatly appreciate four anonymous reviewers’ highly constructive comments.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the National Social Science Fund of China 2018 (grant number 18BGL063).
