SciELO - Scientific Electronic Library Online

 
vol.67 número1Modelo de gestión del cambio organizacional con pensamiento lean en servicios turísticosIs there really synergy creation after M&A?Evidence from Brazil under the life cycle approach índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Contaduría y administración

versión impresa ISSN 0186-1042

Contad. Adm vol.67 no.1 Ciudad de México ene./mar. 2022  Epub 10-Sep-2024

https://doi.org/10.22201/fca.24488410e.2022.2789 

Articles

Parameter calibration of stochastic volatility Heston’s model: Constrained optimization vs. differential evolution

Ambrosio Ortiz-Ramíreza  * 

Francisco Venegas-Martíneza  * 

María Teresa Verónica Martínez-Palaciosa  * 

a Instituto Politécnico Nacional, México


Abstract

This paper calibrates through loss functions the parameters of Heston’s stochastic volatility model by using two different methods: minimizing a nonlinear objective function (a loss function) with constraints on the values of the parameter and using a differential evolution algorithm. Both methods are applied to implied volatilities on the Mexican Stock Exchange Index with four maturities and twenty-eight strike prices. The selection criterion for the parameters is minimizing the value of the mean square error of the implied volatility. The first method has a better performance with less error and time. However, empirical results show that for both methods the adjustment of implied volatilities is better for options with longterm maturities than for short-term maturities.

JEL Code: G13; B16; C21

Keywords: Contingent pricing; Stochastic volatility; Implied volatility; Differential evolution

Introduction

In the last decades, many stochastic volatility models (SVMs) for option pricing have arisen as an alternative (or generalization) to the standard Black-Scholes-Merton (BSM) model (Black and Scholes, 1973; and Merton, 1973). Among the main limitations of the BSM model is the assumption of constant volatility. Stochastic volatility models undertake this problem under the assumption that prices and their volatility are stochastic processes affected by several risk factors. In this regard, SVMs handle more consistent prices. Unlike the BSM model, the SVMs considers stylized facts, such as implied volatility smile and skewness, as observed in derivative markets. The latter handle the leverage effect, i.e., volatility rises after price declines. That is, the asset price and its volatility are negatively correlated. The best known SVMs are those from Hull and White (1987), Stein and Stein (1991), Heston (1993), and Schöbel and Zhu (1999). In particular, Hull and White’s (1987) approach expand the asset price with a Taylor series function up to terms of third order, and Heston’s (1993) model provides a closed formula that requires numeric methods.1

A main issue of option pricing models is their capability of generating volatility structures that are consistent with the expiration date and the strike price in order to capture the behavior of the underlying asset return distribution. There are three important features to be considered once a model is proposed: tail risk, option pricing, and implied volatility. Tail risk is empirically defined as the probability that the underlying asset return will move more than three standard deviations from its mean. The BSM model underestimates tail risk since the forecasted probability for extreme variations is low. This is because in the BSM model the tails of the asset return distribution are driven by a normal distribution. Thus, in this context the models of Hull and White (1987), Stein and Stein (1991), and Heston (1993) exhibit higher performance than the BSM model allowing estimating the probabilities for unexpected falls in the asset price (Gulisashvili and Stein, 2010).

Volatility is an unobservable variable that measures the relative changes in an asset price. Often the volatility is also seen as the instantaneous standard deviation of the asset returns. It important to point out that through periods of low volatility, asset prices do not change very much; however, in periods of high volatility, significant variations are shown for the asset prices. In the BSMmodel, volatility is constant and positive until the expiration date. In contrast, a SVM, volatility is described by a non-negative stochastic process. For example, in Hull and White’s (1987) model, volatility dynamics follows a geometric Brownian motion. In Stein and Stein’s (1991) model, volatility follows an Ornstein-Uhlenbeck process or the absolute value. While in Heston’s (1993) model, volatility is driven by the square root of the Cox-Ingersoll-Ross’ (1985) process.

The term “implied volatility” appeared for the first time in the work of Latané and Rendleman (1976), called “implied standard deviation.” These authors studied the standard deviations of returns on assets implied in observed call options prices when an investor values options prices under the standard BSM model. When implied volatility is obtained with options price data, and the strike price and expiration date are plot, the implied volatility surface is not flat, and changes along with the strike price and expiration date. This is what is known as implied volatility smile.

In relation to the estimation of the parameters of stochastic volatility, there are three main statistical techniques: maximum likelihood (ML), the generalized method of moments (GMM), and the market-based (MB) method. The latter minimizes the spread or market volatilities obtained from the theoretical model and the data market, e.g., using a loss function (Bakshi, Cao, and Chen, 1997; and Bams, Lehnert, and Wolff, 2009). In this context, innovations in the computational science field offer considerable advantages to the financial industry, e.g., evolutionary computation, which has developed optimization methods based on evolutionary algorithms, representing an alternative to traditional optimization methods (either deterministic or stochastic) whenever the latter fail or the computational cost is high. Genetic algorithms are computational algorithms based on Darwinian evolution, which use operators such as reproduction, mutation, and selection to evolve a set of candidates or possible solutions to a given optimization problem. Gilli and Schumann (2011) provided a method using differential evolution to calibrate the Heston model and proposed only one loss function with artificially generated data. A notable finding is the utilization of alternative methods to decrease the risk model and the selection criteria of parameters. On the other hand, Cui, del Baño and Germano (2017) propose an efficient algorithm to calibrate the Heston model with nonlinear least squares using the Levenberg (1944) method by using the alternative representation of the characteristic function and avoiding discontinuities.

Soler and Kizys (2017) review some literature related to the use of evolutionary algorithms to solve classic and emerging problems in the field of finance. A non-exhaustive list of examples includes portfolio optimization, index tracking, enhanced indexation, credit risk, stock investments, financial project scheduling, option pricing, feature selection, bankruptcy and financial distress prediction, and credit risk assessment, they conclude that the use evolutionary algorithms in finance is becoming increasingly more popular among researchers from different communities in order to make efficient decisions. In Liu et al (2019) a method to calibrate stochastic volatility models using artificial neural networks is proposed that includes three phases: training, prediction and calibration. Some relevant results are the efficiency and precision of the calculations when compared with other methods such as differential evolution that avoids getting stuck in a local minimum. In Palmer (2019) it is addressed a number of problems in computational finance, characterized by their high demand for computation time and resources. One of them is the calibration of the Heston model with variants of evolutionary algorithms such as particle swarm optimization (PSO) and differential evolution (DE). State-of-the-art DE algorithms were found to provide excellent calibration performance, and the prior use of rudimentary DE in the literature underestimates the use of these methods.

This paper is aimed at calibrating the parameters of Heston’s SVM with several loss functions by using two different methods: the first one minimizes a nonlinear function of several variables with constraints, and the second one employs the differential evolution algorithm. Both methods are applied to a set of implied volatilities on the Mexican Stock Exchange Index (IPC Spanish acronym for Indice de Precios y Cotizaciones) with four maturities and twenty-eight strike prices. The selection criterion for the parameters is minimizing the value of the mean square error of the implied volatility. It important to point out that the mean squared loss function weighted by the square of the vega reaches the minimum value by the first method with less error and time. Moreover, the empirical results show that for both methods the adjustment of implied volatilities is better for options with long-term maturities than those for short-term maturities, and the value of rho is negative, which means that the slope of the implied volatility curve is negative and is closely related to the asymmetry of the underlying asset returns distribution. The main conclusion is that a change in the slope of implied volatility reflects the supply and demand effects of options. Thus, high premium will move in the direction that market participants expect the underlying to move.

The structure of the paper is as follows: the next section provides a short review of SVMs; section 3 describes briefly the Heston model; section 4 describes the calibration of parameters using several loss functions; through section 5, the differential evolution algorithm to calibrate Heston’s model parameters is defined; in section 6, the Heston model parameters are calibrated by using loss functions according to the two methods of the previous sections, both methods are implemented for a set of implied volatilities of the IPC. Finally, in section 7, conclusions are given.

A Short Review of Stochastic volatility models

According to Garman’s (1976) theoretical proposal, an option price with stochastic volatility satisfy a second-order partial differential equation (PDE) with two fundamental variables: strike price and volatility. Volatility is not a trading asset, so arbitrage cannot eliminate volatility risk. Therefore, the market price is included in the PDE. Usually, the risk premium is zero or a constant volatility proportion. Final and boundary conditions for the option contracts state the premium. In these option pricing models; underlying volatility is explicit and its parameters are obtained by statistical methods. The PDE solution from which the option price is obtained engender additional problems, since most of the time closed formulas are not available, and the least it can be asked for is an analytical solution as a prerequisite for the empirical work.

Wiggins’ (1987) work solves a PDE with a finite difference method for the logarithmic transformation of a state variable. Hull and White (1987) worked with a simple power series approximation technique based on the conditional underlying distribution of the average value of the stochastic variance. Stein and Stein (1991) and Heston (1993) proposed an analytic approach using the inverse Fourier transform. Of these three methods, the PDE numerical solution is the best known, but also the most computationally expensive. Hull and White’s (1987) and Stein and Stein’s (1991) approaches depend on the average stochastic variance distribution and assume instant correlation of state variables equal to zero, and Hull and White’s (1987) approach is more practical; however, Heston (1993) claimed that the Stein and Stein (1991) model works with the inverse Fourier transform even if the correlation is different from zero. On the other hand, Schöbel and Zhu (1999) developed a model with two correlated Brownian motions and estimated the characteristic function of the logarithmic price through a system of stochastic differential equations. Also, Zhu (2010) proposed a double root stochastic volatility model akin to Longstaff´s (1989) model. Date and Islyaev (2015) proposed an SVM based on a deterministic time structure modified by an escalated random variable. In this case, a closed approximation is obtained for a European-type option premium using higher order Greeks from its volatility.

Revisiting Heston’s SVM

An important feature in the Heston model is that the characteristic functions of the risk neutral probabilities are the solution to a second-order PDE (Venegas-Martínez, 2008). Using these probabilities, a formula is obtained to value a European call option under the assumption that volatility is stochastic. In this case, there is a correlation between the volatility and the underlying price, while the strike price is stated by the put-call parity. There are some complications with Heston’s model because it includes complex variables. Therefore, the option price is obtained by calculating the probability that the call option expires in the money. This probability is obtained by calculating the inverse of the characteristic function for the logarithm of the underlying price. The formalization of the Heston model is as follows. The underlying price Χt is driven by the following stochastic differential equation:

dXtXt=αdt+vtdB1t

where α denotes the annual average return and B1t is a Brownian motion or Wiener process (normal independent increments with zero mean and variance linearly depending of time). It is important noticing that Heston model uses the variance vt instead of the volatility vt Indeed, the variance follows the Uhlenbeck and Ornstein (1930) process for volatility, denoted by gt=vt. Thus, dgt=-βgtdt+δdB2t, where B2t is a Wiener process correlated to B1t, i.e., CovdB1t,dB1t=ρ. Using Itô´s lemma with vt=gt2, the variance dynamics is obtained, and in this case, according to Cox, Ingersoll, and Ross (1985), the mean reversion process for the variance is:

dvt=kθ-vtdt+σvtdB2t, (1)

where κ=2β,θ=δ2/2β, and σ=2δ. If and κ,θ, and σ satisfy the conditions 2κθ>σ2 and v0>0. The above stochastic process is always positive, and the distribution of (1) is a non-central chi-squared. Moreover, the variance always has a mean reversion structure, where κ is the speed of adjustment toward the mean, θ>0, which is the long-term level of vt and σ the instantaneous volatility of the variance. Intuitively, the parameter θ, in (1), is such that if the variance is become greater, then the variance is attracted toward θ. Regarding the estimation of the parameters, we have v0 as an unobservable initial state variable (Bakshi et al., 1997). In this paper, v0 is considered as a parameter to be estimated. To estimate the price of a European-type call option it is used:

CTK=e-rtEXT-K,0=eYTΠ1y,v,τ-e-rtKΠ2y,v,τ. (2)

Here Πjy,v,τ, stands for the probability that the call option expires in the money XT>K. In this case, yt=lnSt (natural log of the asset price), vt is variance at time t, τ=T-1 is the term, and Πjy,v,τ=PlnXT>lnK,j=1,2. If the characteristic functions φ1ϕ;y,v and φ2ϕ;y,v related to Πj are known, it is possible to obtain each Πj from its characteristic function by using the inversion theorem (see Waller, Turnbull, and Hardin, 1995):

Πj=PlnXT>lnK=12+1π0Ree-iϕlnKφjϕ;y,viϕdϕ (3)

At maturity, when t=T, i.e., τ=0, the probabilities are constraints on the terminal condition Πjy,v,0=1y>lnK, which implies that at maturity, when XT>K, the probability that the call option is in the money is equal to 1. In summary, according to Heston (1993), the call option price is:

CTK=eytΠ1y,v,τ-e-rtKΠ2y,v,τ (4)

where Π1 and Π2 are defined in (2), and the characteristic functions are given by:

φjϕ;y,v=exp(Cjτ,ϕ+Djτ,ϕv+iϕx) (5)

Cj=rϕiτ+kθσ2bj-ρσϕi+djτ-2ln1-gjedjt1-gj,Dj=bj-ρσϕi+djσ21-edjt1-gjedjt,

gj=bj-ρσϕi+djbj-ρσϕi-dj,dj=ρσϕi+bj2-σ22ujϕi-ϕ2.

where i=-1 is the imaginary unity, u 1=1/2, u2 =-1/2, b1= κ+λ-ρσ, and b2= κ+λ. Here, λ is the parameter that represents the volatility premium as a function of the asset price, time and volatility. To obtain the option price, it is necessary to estimate the integrals for Πj in (2), and this leads to a numerical integration method. To estimate the put option price, the put-call parity is used:

VTK=CTK+Ke-rt-Xt (6)

Another relevant property of this model is that implied volatilities are estimated by option prices generated according to the model, and these have volatility smile skew. Stock options regularly have a negative slope on the implied volatilities. It is remarkable that when calibrating the parameters of Heston’s model by using market data, the correlation parameter value is negative most of the time.

The correlation parameter ρ determines the skew trend. If ρ>0, there is a positive slope, otherwise there is a negative slope. This effect is shown in Figure 1 by generating implied volatilities using the following values: X =100, r= 0.05, τ = T-t = 0.25, κ=2, θ=0.01, λ=0 and v0=0.01, with a strike range oscillating from 90 to 110. If the volatility values increase the variance, the smile skew increases, i.e. a higher σ parameter increases the implied volatility curvature, this can be seen in Figure 2 for the same values but using ρ = 0.

Source: own elaboration.

Figure 1 Effect of the parameters on implied volatility with different ρ values. 

Source: own elaboration.

Figure 2 Effect of the parameters on implied volatility using different values of σ. 

The parameters κ,θ and v0 manage the smile level. The adjustment speed of the mean reversion, κ, also controls the curve up to some point, and using bigger values of κ, the implied volatility curve flattens, as seen in Figure 3.

Source: own elaboration.

Figure 3 Effect of the parameters on implied volatility using different values of κ. 

When the values of θ and v0 either increase or decrease, a parallel movement can be observed in the volatility smile, as seen in Figures 4 and 5. To graph the plots, the bisection algorithm is used with the BSM model to calculate the set of implied volatilities provided that prices are estimated using the Heston model.

Source: own elaboration.

Figure 4 Effect of the parameters on implied volatility with different values of θ. 

Source: own elaboration.

Figure 5 Effect of the parameters on implied volatility using different values of v0

Loss functions

One alternative method for calibrating the parameters of the Heston model is to use loss functions. This method minimizes the difference amongst market prices and estimated prices, or between the marketimplied volatilities and the Heston model. The set of the estimated parameters Θ corresponds to the values that minimize the value of the corresponding loss function in such a way that the model prices or the implied volatilities are close enough to the potential market prices. To do this, it is necessary to use a minimization algorithm with the following constraints: κ>0,θ>0,σ>0,v0>0,ρ-0.9,0.9. In this regard, the loss functions include trading asset prices at a point in time (or its implied volatility) leading to risk neutral parameters in the Heston model. Let us assume that there is a set of stock maturities Mt with τt(t=1,,T) and a set of strike prices Kk(k=1,,MK). For each combination maturity-strike (τt,Kk), there is a market price Pτt,Kk=Ptk and the corresponding price obtained by Heston’s model Pτt,Kk;Θ=PtkΘ. Moreover, it is possible to link each option to a weight ωtk. There are several methods of defining a loss function, generally categorized as base prices and base-implied volatilities. The former minimizes the error between market quotes and the estimated parameters by a given model. The error is defined as the squared difference between market quotes and the model estimations, or the difference absolute value. It is also possible to use the relative error, i.e., estimating the parameters by using the loss function of the mean squared error (MSE) and minimizing:

L1=1Mt,kωtkPtk-PtkΘ2, (7)

With respect to Θ, where M is the number of quotes. The other commonly used loss function is the so-called relative loss of the mean squared error (RLMSE) given by:

L2=1Mt,kωtkPtk-PtkΘ2Ptk. (8)

Another way of defining the error is by taking the absolute value in (7) or (8).

A shortcoming of the loss function L1 is that options with short maturity, out of the money, and with low value contribute little to the sum in (7); the optimization process adjusts options in the money to the long term, impacting other options. A solution to this issue is to use only options in the money in such way that in (7) call options are used for strike prices smaller than the underlying price and put options for strike prices greater than the underlying price. Another solution is the RLMSE in (8), but the problem with L2 is the opposite. In fact, provided that Ptk is in the denominator, options on the low-price market saturate the sum in (8), although this excess is repaired by allocating the weights ωtk to the terms in the objective function. It turns out that the selection weight process is arbitrary.

The second category of loss functions is associated with minimizing the error between the implied volatilities and the parameters estimated by a theoretical model. This can be also specified as the relative difference, absolute difference or squared difference between the implied volatilities and those given by a theoretical model. This kind of loss function is sensitive because in the derivatives market, options quotes are made in terms of the implied volatility. The adjustment precision of the model is obtained by comparing the implied volatilities and the model volatility, i.e., the calibration parameters, using the loss function of the MSE of the implied volatility:

L3=1Mt,kωtkσ^tk-σ^tkΘ2, (9)

where σ^tk=σ^(τt,Kk) and σ^tkΘ=σ^(τt,Kk;Θ) are respectively the market-implied volatilities and the model volatilities. It is also possible to use the relative and absolute versions of L3. The principal drawback of (9) is that it needs exhaustive numeric calculation, i.e., for each iteration, the optimization process estimates all the prices with the Heston model PtkΘ, and then a root algorithm is used as a bisection algorithm to calculate the implied volatility σ^tkΘ in PtkΘ and thus finally is obtained σ^tk,σ^tkΘ2. A solution to this problem is to use the loss function of the weighted MSE times squared vega (Christoffersen, Heston, and Jacobs, 2009), which is an approximation to the function in (9). In this case:

L4=1Mt,kPtk-PtkΘ2vtk2, (10)

where vtk2 is the option price sensitivity in the BSM model in relation to the market-implied volatility vtk evaluated at maturity τt at strike Kt, defined as:

vtk=XeqτtΦztkτt,andztk=lnXKk+r-q+12vtk2τtvtkτt,

Here, r is the continuously compounded interest rate, q is the continuously compounded dividend yield, and Φ is the cumulative distribution function of a standard normal random variable.

The main improvement of the loss function L4 is the time reduction in its execution, although there is a loss of precision in the calibration of parameters. Another advantage is that for each evaluation of the loss function, it is necessary to recalculate the option price. The analytic expression of the characteristic functions guarantees that these calculations are done efficiently.

Differential evolution: an algorithm based on evolutionary computation

While traditional optimization methods (i.e., mathematical programming) are limited tools for finding optimal solutions, heuristic algorithms, most of the time, offer a better alternative. In recent years, the blend of computer science and biology has given rise to a new paradigm for solving highly complex problems, the so-called evolutionary computation. There are three main paradigms that make up evolutionary computation: evolutionary strategies, evolutionary programing, and genetic algorithms. It is worth mentioning that genetic algorithms are the most common method of evolutionary computation used in finance Ponsich, López and Coello (2013).2

Genetic algorithms were developed by Holland (1975) and they use the process of natural selection to find the optimal solution through a series of populations that combine and mutate solutions to reach the optimum. Different optimization problems require different methods of mutation and combination of solutions (Bradshaw, Walshaw, Ierotheou, and Parrott, 2009, p. 28). Genetic algorithms are also known as evolutionary algorithms because they are inspired by the theory of evolution of Charles Darwin, and generally work with operators such as reproduction, crossover, and mutation.

The differential evolution (DE) paradigm was developed by Storn and Price (1995). Like many other optimization algorithms, the DE was motivated by the determination of coefficients using the polynomials of Tchebyshev3 and the optimization of the coefficients in the digital filter. The first publication of DE dates back to Storn and Price (1997). The DE algorithm is described in four distinct stages (Vollrath and Wendland, 2009): initial selection of the population, mutation, recombination, and selection. These stages are typical in most evolutionary algorithms; however, the DE is unique in the way the parameter sets are mutated. The stages of DE applied to calibration of Heston’s model are listed as follows:

a) Initial population. Generate an n×MP array of randomly chosen parameter values. In this case, n is the number of parameters and MP is the number of members of the population. The rows of the matrix must satisfy the restrictions of the parameters. Lower and upper limits must be chosen in such a way that the random values for the parameters are feasible, v.g., -1<ρ<1,σ<3 and 0<κ<10,θ>0,v0>0. For Step 2, it is necessary that MP4. This implies that with n= 5:

P(5×Mp)=z1z1zMp, (11)

and every member of P is a parameter vector zi=κi,θi,σi,v0i,ρi'.

b) Mutation. For each member zi of the population i=1,,Mp, randomly select another three members zε1,zε2, and zε3, different from each other and from zi. This requires to select indexes ε1,ε2, and ε3 such that iε1ε2ε3. For every member, create a new giver member:

xi=zε1+Fzε2-zε3, (12)

where F is a constant mutation factor that controls the size of the mutations and takes values in [0,2]. The giver members and components, xi, are used to produce a candidate member that either replaces zi or not, depending on the outcomes of next steps.

c) Recombination. The candidate element ci is generated from the xij elements of the giver and components of the member zij(j=1,,5) according to the following: for each element j, a random number of a uniform variable Uij is generated and the candidate element is defined as follows:

cij=xijifUijR,orj=Ezijotherwise (13)

where E is a randomly selected integer and R is the denominated crossover ratio. E ensures that the candidate is not the same as the member, i.e., cizi. This is because even if all Uij are small and the inequality UijR is not fulfilled at all, the corresponding element xij will turn into the element cij of the candidate ci. This step is continued for each candidate parameter.

d) Candidate selection. The member zi and the candidate ci are both introduced into the objective function f(z). If the candidate’s value is less than that of the member, the candidate replaces zi as the ith term member of the population P in (11). Thus, the instruction is:

P=z1cizMpiffci<fzi,z1zizMpotherwise. (14)

When all the elements of the population have admitted the procedure, the population is renovated, and the next generation starts over again in step 2. Therefore, the algorithm is executed for all generations, G . In this way, the set of parameters is chosen for the member of the population in the final generation with the smallest value of f(z).

To further explain the above procedure, it is important to note that when evaluating a loss function given by equations (7)-(10) prices corresponding to a set of different strikes and maturities are calculated. In this case, the characteristic function φ only depends on the time to maturity, not on the strike price. Therefore, we can previously process the variables that are constant for a given maturity and then calculate the prices for all strike prices for this maturity. The following algorithm summarizes this.

Algorithm 5.1 Computing option prices for a given surface.

1. Set parameters, set C= maturities, set K strikes

2. for τC do

3. compute characteristic function φ

4. for XK do

5. compute price for strike X, maturity τ

6. end for

7. end for

8. compute objective function L

Finally, the following algorithm summarizes the proposed methodology.

Algorithm 5.2 Differential Evolution.

1. set parameters Mp, Ng, F, R

2. initialize population Pj,i(z),j=1,,p,i=1,,Mp

P=KLo+KU-KLo×rand1,Mp;θLo+θU-θLo×rand1,Mp;σLo+σU-σLo×rand1,Mp;v0Lo+v0U-v0Lo×rand1,Mp;ρ0Lo+ρ0U-ρ0Lo×rand1,Mp;

3. for k=1 to NG do

4. P(x)=P(z)

5. for i=1 to MP do

6. generate ε1,ε2,ε31,,MP,ε1ε2ε3i

7. Compute Pi(x)=Pε1(z)+F×Pε2(z)-Pε3(z)

8. for j = 1 to p do

9. If U<R then Pj,i(c)=Pj,i(c) else Pj,i(c)=Pj,i(x)

10. end for

11. If fPi(c)<fPi(x) then Pi(z)=Pi(c) else Pi(z)=Pi(x)

12. end for

13. end for

14. find member with lowest value h=findf=minif(Pi(z))

15. solution = Ph(z)

Empirical Results

This section considers a set of option prices for the Mexican Stock Exchange (BMV). The information was obtained from the Bulletin of the Summary of the Options Market since 06/26/2015, published by the Mexican Derivatives Market (MexDer). Four deadlines are considered: 84, 175, 266, and 357 days. The implied volatilities of the options traded were obtained from the price vendor, and the risk-free rate was acquired from the Mexican Central Bank (Banxico). Table 1 shows the relevant data for the calibration of the parameters.

Table 1 Relevant data for calibration 

Date Expiration Date τ Risk-free rate IPC
26/06/2015 18/09/2015 0.2333 3.2950 45,566.33
26/06/2015 18/12/2015 0.4861 3.2950 45,566.33
26/06/2015 18/03/2016 0.7389 3.2950 45,566.33
26/06/2015 17/06/2016 0.9917 3.2950 45,566.33

Source: own elaboration with data from BMV, MexDer and Banxico.

Table 2 shows the expiration dates, strike prices (K), and implied volatilities of the IPC call options obtained from the price vector. There are 31 strike prices, of which 22 are options that are in the money and the remainder is out of the money. This is important since the loss functions previously presented give different results in the calibration under this assumption.

Table 2 Implied volatility of the IPC call options 

K 84/360 175/360 266/360 357/360
35000 0.2465 0.2441 0.2324 0.2225
35500 0.2427 0.2386 0.2281 0.2192
36000 0.2395 0.2332 0.2239 0.2159
36500 0.2367 0.2280 0.2197 0.2126
37000 0.2340 0.2229 0.2154 0.2093
37500 0.2315 0.2178 0.2112 0.2060
38000 0.2290 0.2130 0.2070 0.2027
38500 0.2260 0.2081 0.2028 0.1994
39000 0.2225 0.2034 0.1985 0.1960
39500 0.2185 0.1987 0.1943 0.1926
40000 0.2138 0.1941 0.1901 0.1892
40500 0.2082 0.1897 0.1859 0.1857
41000 0.2019 0.1852 0.1817 0.1822
41500 0.1948 0.1808 0.1775 0.1786
42000 0.1871 0.1764 0.1734 0.1750
42500 0.1789 0.1720 0.1693 0.1713
43000 0.1703 0.1678 0.1653 0.1675
43500 0.1616 0.1636 0.1614 0.1637
44000 0.1530 0.1595 0.1576 0.1598
44500 0.1447 0.1554 0.1540 0.1559
45000 0.1370 0.1515 0.1506 0.1520
45500 0.1300 0.1478 0.1476 0.1481
46000 0.1239 0.1444 0.1451 0.1442
46500 0.1187 0.1415 0.1434 0.1406
47000 0.1145 0.1396 0.1428 0.1374
47500 0.1112 0.1399 0.1437 0.1351
48000 0.1088 0.1446 0.1469 0.1346
48500 0.1071 0.1541 0.1528 0.1365
49000 0.1060 0.1652 0.1599 0.1401
49500 0.1053 0.1758 0.1667 0.1444
50000 0.1051 0.1860 0.1731 0.1486

Source: own elaboration with data from BMV and MexDer.

Figure 6 shows the implied market volatility surface and the market prices of IPC options,computed from the information in the bulletin. Notice the reduction in the differences in prices, excluding options with little premium value; for the calibration the choice of the options is a settlement price greater than $100, i.e. prices from K = 49000 to K= 50000 are excluded. Therefore, there are 28 strike prices and 4 expiration dates, a total of 124 implied volatilities. An implied volatility bias is observed for each maturity level. Implied volatility is greater for small strike prices corresponding to call options in the money and put options out of the money, and implied volatility decreases as the strike price increases. Another feature is that this is more common for options with shorter maturities, and is flattened for options with greater expiration. The monotonic relationship between price and volatility shows that put options out of the money have a higher premium than the call option out of the money.

Source: own elaboration with data from BMV, MexDer and Banxico.

Figure 6 Surface of the implied market volatility and IPC market quotes. 

Table 3 shows the parameters of the Heston model calibrated with several loss functions (MSE, LMSE, and MSE of implicit volatility), the time in seconds that the calibration takes, and the value of each objective function. The initial values of the parameters are as follows: κ=9,θ=0.05,σ=0.3,v0=0.05,ρ=-0.8. In this case, the minimum of a nonlinear function of several variables with constraints4 is determined, results of table 3 are obtained with the algorithm developed in Nelder and Mead (1965). The set of parameters that is chosen is the one that has the lowest value of MSE of implicit volatility, that is, by using the function L4. In all three cases, it is observed that the value of ρ is negative, which means that the slope of the implied volatility curve is negative, as it can be seen in Figure 7.

Table 3 Calibrated parameters with loss functions 

κ θ σ v(0) ρ ECM-VI Time
L1 2.4240 0.0374 1.0590 0.0239 -0.3313 5.06E-05 32.66
L2 15.0580 0.0274 1.9992 0.0140 -0.5583 7.67E-05 40.60
L3 3.5429 0.0303 0.5518 0.0185 -0.7241 8.88E-05 4.574
L4 2.6922 0.0351 0.8059 0.0197 -0.5215 3.65E-05 28.137

Source: own elaboration with data from BMV, MexDer and Banxico.

Source: own elaboration with data from BMV, MexDer and Banxico.

Figure 7 Implied market and Heston volatilities estimated by function L4

The above results show that the adjustment of implied volatilities is better for options with long-

term maturities than for short-term maturities, and the time it takes to adjust is much lower compared to DE, as confirmed in Table 4.

Table 4 Calibrated parameters with loss functions and differential evolution 

κ θ σ v(0) ρ ECM-VI Time
L1 2.6498 0.0355 1.0073 0.0232 -0.3910 4.66E-05 210.6515
L2 15.7152 0.0277 2.0000 0.0088 -0.5979 7.38E-05 203.7326
L3 5.9857 0.0311 1.1976 0.0172 -0.5836 4.62E-05 28461.3763
L4 4.6792 0.0322 1.0751 0.0185 -0.5487 4.15E-05 266.0566

Source: own elaboration with data from BMV, MexDer and Banxico.

Figure 8 shows the surface of implied market volatility and model volatility generated with the parameters obtained from L4. In this case, 31 maturity terms are considered, varying from 84 to 357 days, and there are 271 strike prices, which range from every 50 points from K=35000 up to K=48500. The importance of the surface lies in the fact that while bonds are characterized by maturity, options are distinguished by their term at maturity and their strike price, for this reason requiring a surface instead of a curve. In addition, the yield curve on any given day is a concise description of bond market prices. Another usefulness of the surface is that it is feasible to value an exotic option with the information generated by the volatility surface; in the case that there are deadlines at maturity and strike prices that do not match the deadlines of the surface, an interpolation bilinear process is carried out.

Source: own elaboration with data from BMV, MexDer and Banxico.

Figure 8 Surface of implied market volatility and model volatility using the parameters obtained from function L4

Table 4 shows the parameters of the Heston model calibrated with the three previous loss functions but using the DE algorithm. Unlike Vollrath and Wendland (2009), we chose the number of population members equal to 45 times the number of parameters for calibration, in such a way that M P = 225, the probability of the crossover ratio R = 0.5, the mutation factor F = 0.8, and the number of generations G = 600. The initial values of the parameters for the calibration are as follows: κ=9,θ=0.05,σ=0.3,v0=0.05,ρ=-0.8 The set of parameters that is chosen is the one with the lowest value of the quadratic error of implied volatility which is the function L4. In all three cases, it can be observed that the value is negative, which means that the slope of the implied volatility curve is negative, as it can be seen in Figure 9.

Source: own elaboration with data from BMV, MexDer and MexDer.

Figure 9 Implied market and Heston volatilities estimated by function L4 using differential evolution. 

The above results indicate that the adjustment of implied volatilities is somewhat better for

options with long-term maturities than for short-term maturities, and the time it takes to adjust is much greater in comparison with loss functions.5

Finally, Figure 10 shows the adjustment for the area of implied market volatilities and the surface implied market volatility, generated with the parameters of L4 using DE. As it can be seen, the implied volatilities for short maturities are more sensitive than those obtained by the first method, which greatly affects options prices within the money as their premiums are greater than options out of the money.

Source: own elaboration with data from BMV, MexDer and Banxico.

Figure 10 Surface of implied market volatility obtained by the parameters of function using differential evolution. 

Conclusions

A relevant feature of the differential evolution algorithm is that it does not require initial parameters since it generates many tests of a population in parallel. Differential evolution performs sampling over the entire parameter space, and if it is carried out enough, it practically guarantees convergence to a global minimum. However, the disadvantage is that it requires a considerable number of repetitions, involving a high cost in time and computational resources.

Another possibility for calibrating the parameters of a stochastic volatility model is by using loss functions. This approach minimizes the distance that measures the error between market prices and model prices, or between implicit market volatilities and those form the theoretical model. In this framework, the Heston model generates consistent market volatility smiles for relatively distant maturities (six months and more). However, for short maturities, it presents problems as it does not generate consistent volatility smiles with the market; this depends to a large extent on the dynamics of the underlying asset, which affects the market quotations at a given time.

The effect of the variation of the parameters of the Heston model on the implied volatility curve has been examined in this research, and it is observed that the correlation parameter ρ determines the direction of skew; ρ>0 corresponds to a positive slope and ρ<0 to a negative slope; while the volatility of the variance σ determines the curvature of the smile. The parameters κ, θ, and v0 control the level of the smile.

The contribution of this study can be summarized as follows, the calibration of parameters of the Heston model with loss functions and differential evolution algorithm, in particular with a loss function of the weighted MSE times squared vega which employs option price sensitivity in the Black and Scholes model in relation to the market-implied volatility. An advantage of this procedure is that it is able to produce precise estimates, but at the expense of increased computation time.

In summary, in this investigation, the parameters of the Heston stochastic volatility model with loss functions have been calibrated using two different methods: the first determines the minimum of a nonlinear function of several variables with constraints; the second employs the algorithm of differential evolution. The two methods are applied to a set of IPC call options with four maturities and twenty-eight strike prices. The criterion for choosing the parameters is the one that has the minimum value of the MSE of implied volatility. The results for the two proposed methods indicate that the function of loss of the MSE weighted by the square of the vega is the one that has the minimum value, but with a minor error and time cost using the first method. In addition, in both methods the adjustment of implied volatilities is better for options with long-term maturities than for short-term maturities, and the ρ value is negative, which means that the slope of the implied volatility curve is negative which reflects a skewed distribution of the underlying asset's return. This implies that the volatility modeled as a stochastic function of the underlying asset price is in line with assumption of stochastic volatility.

References

Bakshi, G., Cao, C. and Chen, Z. (1997). Empirical performance of alternative option pricing models. Journal of Finance, 52(5), 2003-2049. https://doi.org/10.1111/j.1540-6261.1997.tb02749.x. [ Links ]

Bams, D., Lehnert, T., and Wolff, C.C.P. (2009). Loss functions in option valuation: A framework for selection. Management Science, 55(5), 853-862. https://doi.org/10.1287/mnsc.1080.0976. [ Links ]

Black, F., and Scholes, M. (1973). The pricing of options and corporate liabilities. The Journal of Political Economy, 81(3), 637-654. https://doi.org/10.1086/260062. [ Links ]

Bradshaw, N.-A., Walshaw, C., Ierotheou, C. and Parrott, A. K. (2009). A multi-objective evolutionary algorithm for portfolio optimization. Proceedings from Artificial Intelligence and Simulation of Behavior Symposium 2009 on Evolutionary Systems. The Society for the Study of Artificial Intelligence and Simulation of Behavior (pp. 27-32), London. [ Links ]

Christoffersen, P., Heston, S., and Jacobs, K. (2009). The shape and term structure of the index option smirk: Why multifactor stochastic volatility models work so well. Management Science , 55(12), 1914-1942. https://doi.org/10.1287/mnsc.1090.1065. [ Links ]

Cox, J. C., Ingersoll, J. E., and Ross, S. A. (1985). A theory of the term structure of interest rates. Econometrica, 53(2), 385-407. https://doi.org/10.2307/1911242. [ Links ]

Cui, Y., del Baño Rollin S. and G. Germano (2017). Full and fast calibration of the Heston stochastic volatility model, European Journal of Operational Research, 263(2), 625-638. https://doi.org/10.1016/j.ejor.2017.05.018. [ Links ]

Date, P. and S. Islyaev (2015). A fast calibrating volatility model for option pricing, European Journal of Operational Research , 243(2), 599-606. https://doi.org/10.1016/j.ejor.2014.12.031. [ Links ]

Garman, M. (1976). A general theory of asset valuation under diffusion state processes. Working Paper, University of California, Berkeley, CA. Retrieved from: https://econpapers.repec.org/RePEc:ucb:calbrf:50. [ Links ]

Gilli, M., and Schumann, E. (2011). Calibrating option pricing models with heuristics. In A. Brabazon, M. O’Neill, and D. Maringer (Eds.), Natural computing in computational finance (Vol. 4), pp. 9-37. Berlin: Springer-Verlag. https://doi.org/10.1007/978-3-642-23336-4_2. [ Links ]

Gulisashvili, A., and Stein, E. M. (2010). Asymptotic behavior of distribution densities in models with stochastic volatility. Mathematical Finance, 20(3), 447-477. https://doi.org/10.1111/j.1467-9965.2010.00406.x. [ Links ]

Heston, S. L. (1993). A closed-form solution for options with stochastic volatility with applications to bond and currency options. Review of Financial Studies, 6(2), 327-343. https://doi.org/10.1093/rfs/6.2.327. [ Links ]

Holland, J. H. (1975). Adaptation in natural and artificial systems. East Lansing, MI: University of Michigan Press. [ Links ]

Hull, J., and White, A. (1987). The pricing of options on assets with stochastic volatility. Journal of Finance , 42(2), 281-300. https://doi.org/10.1111/j.1540-6261.1987.tb02568.x. [ Links ]

Latané, H. A., and Rendleman, Jr., R. J. (1976). Standard deviations of stock price ratios implied in option prices. Journal of Finance , 31(2), 369-381. https://doi.org/10.1111/j.1540-6261.1976.tb01892.x. [ Links ]

Levenberg, K. (1944). A method for the solution of certain non-linear problems in least squares. Quarterly of Applied Mathematics. 2(2), 164-168. [ Links ]

Liu, S., Borovykh, A., Grzelak, L. A. and Oosterlee, C. W. (2019). A neural network-based framework for financial model calibration. Journal of Mathematics In Industry, 9(9), 2-28. https://doi.org/10.1186/s13362-019-0066-7. [ Links ]

Longstaff, F. (1989). A nonlinear general equilibrium model of the term structure of interest rates. Journal of Financial Economics, 23(2), 195-224. https://doi.org/10.1016/0304-405X(89)90056-1. [ Links ]

Merton, C. R. (1973). Theory of rational option pricing. Bell Journal of Economics and Management Science , 4(1), 141-183. https://doi.org/10.2307/3003143. [ Links ]

Nelder, J.A. and R. Mead. (1965). A Simplex Method for Function Minimization, The Computer Journal, 7(4), 308-313. https://doi.org/10.1093/comjnl/7.4.308. [ Links ]

Ortiz-Ramírez, A., F. Venegas-Martínez y F. López-Herrera (2011). Una nota sobre la sensibilidad de los parámetros del modelo de volatilidad estocástica de Heston. Quantitativa, Revista de Economía, 1(1), 83-103. Retrieved from http://quantitativa.ucol.mx/index.php/Quantitativa/article/view/8. [ Links ]

Ortiz-Ramírez, A. , F. Venegas-Martínez y M. Durán-Bustamante (2014). Valuación de opciones europeas sobre AMX-L, WALMEX-V y GMEXICO-B: calibración de parámetros de volatilidad estocástica con funciones cuadráticas de pérdida. El Trimestre Económico, 81(4), 943-988. https://doi.org/10.20430/ete.v81i324.135. [ Links ]

Palmer, S. (2019). Evolutionary algorithms and computational methods for derivatives pricing. Retrieved from https://core.ac.uk/download/pdf/196652947.pdf. [ Links ]

Ponsich, A., López, A. J. and Coello, C. A. (2013). A Survey on Multiobjective Evolutionary Algorithms for the Solution of the Portfolio Optimization Problem and Other Finance and Economics Applications, IEEE Transactions on evolutionary computation, 17(3), 321-344. https://doi.org/10.1109/TEVC.2012.2196800. [ Links ]

Schöbel, R., and Zhu, J. W. (1999). Stochastic volatility with an Ornstein-Uhlenbeck process: An extension. European Finance Review, 3(1), 23-46. https://doi.org/10.1023/A:1009803506170. [ Links ]

Simon, D. (2013). Evolutionary optimization algorithms: Biologically-inspired and population-based approaches to computer intelligence. Hoboken, New Jersey, USA, John Wiley and Sons. [ Links ]

Soler-Domínguez, A., Juan, A. A., and Kizys, R. (2017). A Survey on Financial Applications of Metaheuristics. ACM Computing Surveys (CSUR), 50(1), 1-23. https://doi.org/10.1145/3054133. [ Links ]

Stein, E. M., and Stein, J. C. (1991). Stock price distributions with stochastic volatility: An analytic approach. Review of Financial Studies , 4(4), 727-752. https://doi.org/10.1093/rfs/4.4.727. [ Links ]

Storn, R., and Price, K. (1997). Differential evolution-A simple and efficient heuristic for global optimization over continuous spaces. Journal of Global Optimization, 11(4), 341-359. https://doi.org/10.1023/A:1008202821328. [ Links ]

Uhlenbeck, G. E. and Ornstein, L. S. (1930). On the theory of Brownian motion. Physical Review, 36: 823-841. https://link.aps.org/doi/10.1103/PhysRev.36.823. [ Links ]

Venegas-Martínez, F. (2005). Maximización de utilidad y valuación de opciones con volatilidad estocástica. Revista Mexicana de Economía y Finanzas, 4(2), 175-184. https://doi.org/10.21919/remef.v4i2.201. [ Links ]

Venegas-Martínez, F. (2008). Riesgos financieros y económicos, productos derivados y decisiones económicas bajo incertidumbre (2da. edición). Mexico: Cengage. [ Links ]

Venegas-Martínez, F. (2010). Planes no creíbles de estabilización de precios, riesgo cambiario y opciones reales para posponer consumo: un análisis con volatilidad estocástica. El Trimestre Económico , 77(4), 899-936. https://doi.org/10.20430/ete.v77i308.459. [ Links ]

Vollrath, I., and Wendland, J. (2009). Calibration of interest rate and option models using differential evolution. Working Paper, FINCAD Corporation. SSRN: https://ssrn.com/abstract=1367502. [ Links ]

Waller, L. A., Turnbull, B. W., and Hardin, J. M. (1995). Obtaining distribution functions by numerical inversion of characteristic functions with applications. The American Statistician, 49(4), 346-350. https://doi.org/10.1080/00031305.1995.10476180. [ Links ]

Wiggins, J. B. (1987). Option values under stochastic volatility: Theory and empirical estimates. Journal of Financial Economics , 19(2), 351-372. https://doi.org/10.1016/0304-405X(87)90009-2. [ Links ]

Zhu, J. (2010). Applications of Fourier transform to smile modeling: Theory and implementation (2nd edition). Berlin, Heidelberg: Springer-Verlag. https://doi.org/10.1007/978-3-642-01808-4_6. [ Links ]

2 An introduction to evolutionary computing is given in Simon (2013), in which several approaches to solving optimization problems are presented.

3 A fast review of the polynomials of Tchebyshev can be seen in Venegas-Martínez (2008).

4 The non-linear problem is solved using the fmincon function in Matlab.

5 The non-linear problem is solved using the “fmincon” function in Matlab.

Received: November 20, 2019; Accepted: May 13, 2021; Published: May 19, 2021

* Corresponding author. E-mail address: amortiz@ipn.mx (A. Ortiz-Ramírez), fvenegas1111@yahoo.com.mx (Francisco Venegas-Martínez), mmartinezpa@ipn.mx (M. T. V. Martínez-Palacios).

Peer Review under the responsibility of Universidad Nacional Autónoma de México.

Creative Commons License This is an open-access article distributed under the terms of the Creative Commons Attribution License