Abstract
Modeling the implied volatility has received extensive attention, as the implied volatility is an important parameter in option pricing. Usually the implied volatility can be approximated by fitting a polynomial about the strike and the maturity or by stochastic methods. In this article, a Gaussian semi-parametric model is proposed based on the quadratic polynomial semi-parametric model suggested by Borovkova. In the new model, the Gaussian function is used to construct a smooth term substituting the quadratic term in the polynomial model, and the arbitrage-free constraints are used to calibrate the model. The empirical tests show that the Gaussian semi-parametric model has a better performance in fitting and forecasting.
Introduction
In finance, an option is a contract which gives the owner of the option the right, but not the obligation, to buy or sell an underlying asset or instrument at a specified strike price on a specified date, depending on the form of the option. An option that conveys to the owner the right to buy at a specific price is referred to as a call; an option that conveys the right of the owner to sell at a specific price is referred to as a put.
Option pricing is one of the hot issues in the field of financial engineering in which the most famous and common used model is the Black–Scholes (B-S or Black–Scholes–Merton) model. 1 The model assumes that the implied volatility is a constant across all strikes and maturities. But, in practice, the values of implied volatilities by inverting option prices with different strikes and maturities are varied. That is, in a three-dimensional coordinate system, the implied volatility is plotted as a function of the strike price and the time-to-maturity, which forms a non-flat surface called the implied volatility surface. This surface contains a lot of information of the option market, which is an important tool for option pricing and financial risk management. Thus, modeling the implied volatility becomes very important in the field of financial engineering.
There are two main construction methodologies to model implied volatility: the deterministic implied volatility model and the stochastic implied volatility model. The deterministic implied volatility model considers there is a deterministic functional relationship among the implied volatility, underlying asset price, the time-to-maturity and the strike price, and this relationship doesn’t change with time. There are three common used deterministic models: the sticky strike rule model and the sticky delta rule model proposed by Derman, 2 and the stationary square root of time rule model proposed by Daglish et al. 3 Such model does not introduce the new risk source and is simple to model.
The stochastic implied volatility model is first proposed by Schonbucher, 4 which considers the change of implied volatility is driven by several risk sources, and assumes that these risk sources are different from the risk sources of underlying asset price. Ledoit and Santa-Clara, 5 Cont and Da Fonseca, 6 and some other researchers followed Schonbucher’s idea and put forward other new stochastic models. Because the implied volatility surface contains different strike price and maturity, whose change process is complex and multidimensional, building the model of implied volatility surface need to consider not only the random process of each of the implied volatility but also the relationship between every different random process, which is hard to practice in reality.
In recent years, with the option market data becoming abundant, researchers focus on the deterministic modeling methodologies.7–10 The deterministic modeling include the parameter model, non-parameter model, and semi-parameter model.
Ncube 11 analyzed the market data of FTSE100 index option and concluded that there is a correlation among the implied volatility, the strike price and the time-to-maturity. Dumas et al. 12 used the market data of S&P500 index option to illustrate there is a linear relationship among them, and then put forward a set of parametric models in which the implied volatility is considered as some functions about the strike price and the time-to-maturity. Cassese and Guidolin 13 followed Dumas’s idea, replaced the strike price with the moneyness in above parametric models, and concluded that the new models have better fitting effects. However, the parameter model has bid error and low stability.
Fitting the implied volatility surface can also be tackled by a non-parametric approach. Bourke 14 built the implied volatility surface model in a non-parametric way by the nearest neighbor-weighted interpolation. The implied volatility surface obtained by a non-parametric method can have any complex shape. This is because non-parametric methods lack generalization power and attempt to fit observed implied volatilities exactly. However, this feature is a big disadvantage of non-parametric methods, and the reason why they are not widely used in practice.
To combine attractive features of the parametric method with the flexibility of the non-parametric one, in 2009, Borovkova and Permana 15 proposed a semi-parametric model which considered that, when fixing the time-to-maturity, the implied volatility is a quadratic polynomial of the strike price or the moneyness. Different from the parametric models in which the parameters are independent of the time-to-maturity, the parameters of the semi-parametric models vary with the time-to-maturities. In the other hand, the stochastic volatility inspired (SVI) model proposed by Gatheral, 16 fits the implied volatility by a function of the moneyness separately for each time-to-maturity. That means SVI is also a semi-parametric model.
The semi-parametric model proposed by Borovkova using a quadratic polynomial of the moneyness to fit the implied volatility curve can exhibit the feature of “volatility smile,” but in some cases, the curvature of the fitted implied volatility curve can’t display the extent of “volatility smile.”
In this article, we have done the following main contributions: first, the quadratic term in the Borovkova’s model is replaced by a new smooth term and a Gaussian semi-parametric model is proposed; second, arbitrage-free are used to calibrate the Gaussian semi-parametric model; third, implied volatility is interpolated based on the Gaussian semi-parametric model to obtain the implied volatility surface, in order to observe the tendency of the implied volatility; fourth, the Gaussian semi-parametric model is used to forecast implied volatility and the option price in the next trading day. In empirical analysis, the market dataset of AAPL stock options (from Yahoo finance website: http://finance.yahoo.com/quote/AAPL/options) is used to verify the effectiveness of the Gaussian semi-parametric model.
The article is organized as follows: The next section documents B-S model and implied volatility surface. In “Gaussian semi-parametric model” section, we provide a brief overview of parametric models and semi-parametric models, then propose a Gaussian semi-parametric implied volatility model. In “The semi-parametric model with no-arbitrage” section, we describe the arbitrage-free conditions of the implied volatility surface and propose a calibrated Gaussian semi-parametric model. “Empirical analysis” section outlines our empirical procedure. We show the new model’s fitting and forecasting performance. The final section concludes with a summary of the main results.
B-S model and implied volatility surface
The B-S model widely used in option pricing was first proposed by Fischer Black and Myron Scholes in 1973
1
under the following assumptions:
The market is frictionless and all participants at the markets are able to borrow and lend at the constant risk-free interest rate; There is no transaction cost and tax; There is no arbitrage in the market; The underlying can be bought in any amount, including fractional units; Finally and most importantly, the underlying follows a geometric Brownian motion with constant drift and constant volatility.
For a given underlying asset, the B-S model can be specified in a partial differential equation:
If the implied volatility σ is specified, then the option price
With the equation (1) and conditions (2), the pricing formulae of European call option can be expressed as follows:
The B-S model assumes the volatility is a constant, that is, the implied volatilities for different options writing on an underlying asset have the same values. The implied volatilities, the strike price and the time-to-maturities may make up a 3D flat surface in a three-dimensional coordinate system. However, empirical studies show that the implied volatilities tend to differ across the strikes and the time-to-maturities. For example, Figure 1 shows the implied volatilities of the AAPL (Apple stock options) call options on 1 March 2016, which exhibits the systematic dependence of implied volatilities with the strike price and the time-to-maturities. Thus, how to reveal this dependence by modeling the implied volatility is important. This article mainly studies the semi-parametric models of the implied volatility surface.
The implied volatilities of AAPL call options on 1 March 2016.
Gaussian semi-parametric model
Parametric models and semi-parametric models
Dumas et al.
12
analyzed the data of S&P500 index options and developed a quadratic volatility function of the strike price and the time-to-maturity:
As the implied volatility is sensitive to the strike price and underlying price movement, Cassese and Guidolin
13
suggested replacing the strike price with the moneyness and the implied volatility model is rewritten as following:
Models 1 and 2 use market data to construct a volatility surface fitting the implied volatility. Once the parameters are located, the approximated implied volatility can be calculated for each pair of
Another important semi-parametric model is the SVI model proposed by Gatheral,
16
which could fit the “volatility smile” quite well. When fixing the time-to-maturity, SVI model is a function of the moneyness
Semi-parametric model with the exponential parameter
In the semi-parametric models modeling the implied volatility with a polynomial or fractional order polynomial, the highest degree with the moneyness is an important factor to influence the curve curvature. Wu et al.
17
proposed a semi-parametric model with the exponential parameter, which obtain a better result when using The bent effect of 
In Wu et al.,17
The parameters of this model include original parameters
Gaussian semi-parametric model with exponential parameter
The drawback of the Model 5 is that it is not smooth at the point
As we know, the shape of the Gaussian function (11) has the form of Figure 3, where μ is mathematical expectation of Gaussian function with

The function image of
Figure 4 shows the characteristics of the implied volatility “smile” (options whose strike price differs substantially from the underlying asset’s price command implied volatilities), besides, the amplitude of “smile” can be determined by the parameters of Gaussian function. Based on this, the Gaussian function can be used to construct a smooth term The function image of 
After using
Solution to the model parameters
The implied volatility calculated by using a good implied volatility model would be close to the market data. Suppose there are
Using least squares technique, the minimization of the function (14) can be converted to solve the following system of nonlinear equations:
The most common used method to solve the system of nonlinear equations is Newton method, but it requires the equation system have reversible Jacobi matrix. The system (15) may not satisfy this request due to the varying market data. In this article, the modified Levenberg Marquardt (L-M) algorithm17–19 is used to solve the system of the nonlinear equations (15), which does not require the Jacobi matrix reversible, and has global convergence.
For convenience, the system is rewritten as a vector form. Let
Here we give the algorithm for solution to the model parameters by the modified L-M algorithm. The details of the modified L-M algorithms can be found in Fan. 19
Algorithm 1: Solution to the model parameters by using the modified L-M algorithm
Step 1: For the given time-to-maturity τ
Prepare the tolerance
Prepare the initial values:
Step 2: Do while (
{calculate
Step 3: Calculate
Step 4:
Step 5: Adjust parameter
Step 6:
Step 7: Output
End-Algorithm
The model parameters solve by the modified L-M algorithm, then is taken into the implied volatility model, such as Model 6, to calculate the estimated implied volatility
The semi-parametric model with no-arbitrage
There should be no arbitrage in a complete options market. The arbitrage opportunities in the options market include the strike arbitrage and the calendar arbitrage. The strike arbitrage means obtaining profits from buying and selling at the same time two kinds of options writing on the same underlying asset with the same maturity date but different strike prices. The calendar arbitrage means obtaining profits from trading two kinds of option writing on the same underlying asset with the same strike price but different maturity date at the same time.
It is an important assumption of B-S option pricing model that there is no arbitrage opportunity in the trading market. However, the option implied volatilities and the option prices of the options market are subject to market supply and demand, so the market does exist the arbitrage. That indicates the implied volatility model obtained from fitting the market data cannot get an implied volatility predictor with no-arbitrage. In this section, we consider to calibrate the Model 6 with the arbitrage-free conditions and supply an implied volatility predictor with no-arbitrage for the options market.
Arbitrage-free conditions of the implied volatility surface
In 2009, Fengler
20
developed the arbitrage-free conditions for option prices to judge whether there is a strike arbitrage opportunity when the time-to-maturity τ is fixed:
Roper
21
extended above arbitrage-free conditions from option price to option implied volatilities. Denoting the total variance
As examples, Figures 5 and 6 show the arbitrage opportunities of market data. Figure 5 shows the option prices of AAPL call options with several different maturities. The red circle points in Figure 5 indicate that the data did not satisfy the conditions of the equations (20) and there have strike arbitrage opportunities. Figure 6 shows the total variance of AAPL call options on 1 March 2016 for different time-to-maturities. For some moneynesses, the total variance does not decreasing along the maturity, which breaks (iv) of the equations (21) and there have calendar arbitrage opportunities.
Price plot for AAPL call option data with several maturities. Total variance plot for AAPL call option data on 1 March 2016 (matu. = time-to-maturity).

Calibrate the Gaussian semi-parametric model with arbitrage-free conditions
In this section, the Gaussian semi-parametric model proposed in “Gaussian semi-parametric model with exponential parameter” section is combined with equations (21) to predict no-arbitrage implied volatilities. The new model can be expressed as following:
Model 7:
Similar to Model 6, the Model 7 can be expressed as a vector form. Let
In this article, the sequential quadratic programming algorithm
22
is used to solve above nonlinear constrained optimization problems. For every
Empirical analysis
In this section, the options market data are used to verify the effectiveness of the Gaussian semi-parametric model. The effectiveness includes fitting accuracy and the forecasting ability. The market dataset for testing comes from AAPL stock options on 1 March 2016 and 2 March 2016.
The fitting accuracy and the forecasting ability are measured by the fitting error and the forecasting error, respectively. The forecasting error means when using the curves fitted by one day market data to calculate the implied volatilities on the next day, the forecast error will occur compared with the market data of the day. In this article, the root mean-squared error and the mean absolute estimation error are used:
The root mean-squared estimation error:
The mean absolute estimation error:
where
The approximate implied volatilities calculated by the implied volatility models can be used to calculate the approximate option prices by the B-S model. In the following tests, implied volatilities and option prices are compared between the approximations calculated by the models and the market data, and their corresponding error indicators are as follows:
Fitting and forecasting results by the Gaussian semi-parametric model and comparison with the other models
In testing, the dataset of AAPL stock options on 1 March 2016 is used to fit the implied volatility curves by using the Models 3, 5, 6, and the SVI model (Model 4). The market implied volatilities with τ(time-to-maturity) = 45 days, 108 days, and 689 days are marked in Figures 7–10, which show the fitting curves of the Models 3, 5, 6, and the SVI model, respectively.
The fitted implied volatility curves with Model 3 (matu. = time-to-maturity). The fitted implied volatility curves with Model 5 (matu. = time-to-maturity). The fitted implied volatility curves with Model 6 (matu. = time-to-maturity). The fitted implied volatility curves with SVI (matu. = time-to-maturity).



It can be found that the fitting results of the Model 6 and the SVI model are better than the Model 3. The Model 5 though has a similar effect but the curve for τ = 45 days is not smooth on
The fitting results of semi-parametric models.
The forecasting results of semi-parametric models.
Figures 11 and 12 show the fitting errors of different models on 1 March 2016 with different maturities and the forecasting errors on 2 March 2016 when using the curves fitted by the data of 1 March 2016 to calculate the implied volatilities on 2 March 2016. The experimental results show that the Gaussian semi-parametric model (the Model 6) has a similar good accuracy with the SVI model.
The fitting errors of semi-parametric models. The forecasting errors of semi-parametric models.

Figure 13 shows the implied volatility surface by using the cubic spline interpolation method with the approximate implied volatilities for different maturities calculated by the Model 6, the market data comes from the AAPL stocks options on 1 March 2016. It can be found that the implied volatility functions have distinctly different shapes with different maturities, and the Gaussian semi-parametric method captures this feature remarkably well.
The fitted implied volatility surface with Model 6 on 1 March 2016.
Fitting and forecasting results by the Gaussian semi-parametric model with arbitrage-free conditions
In this test, the data of AAPL stock options on 1 March 2016 is used to fit implied volatility curves by using arbitrage-free implied volatility Model 7. Figure 14 shows some arbitrage-free calibrated option prices with different maturities, and examined by the equation (20), there is absence of the strike arbitrage. Figure 15 shows the fitted total variance by the Model 7, in which there is no intersection among different maturities (i.e. non-decreasing in moneyness), that is, there is absence of the calendar arbitrage.
The arbitrage-free price plot for AAPL call option data with several maturities on 1 March 2016. The arbitrage-free total variance plot for AAPL call option data on 1 March 2016 (matu. = time-to-maturity).

The arbitrage-free fitting and forecasting results of Model 7 and SVI.
Conclusion
In this article, a Gaussian semi-parametric implied volatility model is proposed in which the Gaussian function is used to construct a smooth term substituting the quadratic term in the Borovkova’s model. And then the arbitrage-free constraints are used to calibrate the model. The empirical analysis shows that the Gaussian semi-parametric model can obtain a good performance in fitting and forecast, which is similar to the stochastic volatility inspired model that is based on the stochastic implied volatility model.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by Natural Science Foundation of China (NSFC) under Grant No.61401098 and Natural Science Foundation of Fujian (China) under Grant No. 2015J01013.
