guide:0875016693: Difference between revisions

From Stochiki
No edit summary
 
No edit summary
Line 1: Line 1:
<div class="d-none"><math>
\newcommand{\indicator}[1]{\mathbbm{1}_{\left[ {#1} \right] }}
\newcommand{\Real}{\hbox{Re}}
\newcommand{\HRule}{\rule{\linewidth}{0.5mm}}
\newcommand{\mathds}{\mathbb}</math></div>
\label{chapt:stochVol}
Stochastic volatility is another popular way to fit the implied volatility smile. It has the advantage of assuming volatility brings another source of randomness, and this randomness can be hedge if it is also a Brownian motion. Furthermore, calibration of stochastic volatility models does have some robustness to the market's daily changes. However, whereas local volatility models are generally calibrated to fit the implied volatility surface (i.e. to fit options of all maturities), stochastic volatility models have trouble fitting more than one maturity at a time.
Consider the following stochastic volatility model,


<span id{{=}}"eq:SVM"/><span id{{=}}"eq:dX"/>
<math display="block">
\begin{eqnarray}
\label{eq:SVM}
dS_t&=&\mu S_tdt+\sigma(X_t)S_tdW_t\\
\label{eq:dX}
dX_t &=&\alpha(X_t)dt+\beta(X_t)dB_t
\end{eqnarray}
</math>
where <math>W_t</math> and <math>B_t</math> are Brownian motions with correlation <math>\rho\in(-1,1)</math> such that <math>\mathbb E[dW_tdB_t]=\rho dt</math>, <math>\sigma(x) > 0</math> is the volatility function, and <math>X_t</math> is the volatility process and is usually mean-reverting. An example of a mean-revering process is the square-root process,
<math display="block">dX_t = \kappa(\bar X-X_t)dt+\gamma\sqrt{X_t}dB_t\ .</math>
The square-root process is used in the Heston model (along with <math>\sigma(x) =\sqrt{x}</math>), and usually relies on what is known as the Feller condition (<math>\gamma^2\leq 2\bar X\kappa</math>) so that <math>X_t</math> never touches zero.
==Hedging and Pricing==
Assuming that <math>\alpha</math> and <math>\beta</math> are ‘well-behaved’ functions, we define the differential operator
<math display="block">\mathcal L \doteq\frac{\partial}{\partial t}+\frac{\sigma^2(x)s^2}{2}\frac{\partial^2}{\partial s^2}+\mu s \frac{\partial}{\partial s}+\frac{\beta^2(x)}{2}\frac{\partial^2}{\partial x^2}+\alpha(x)\frac{\partial}{\partial x}+\rho\beta(x)\sigma(x)s\frac{\partial^2}{\partial x\partial s}\ ; </math>
the interpretation of <math>\mathcal L</math> is the following: <math>\frac{\sigma^2(X_t)S_t^2}{2}\frac{\partial^2}{\partial s^2}+\mu S_t \frac{\partial}{\partial s}</math> are the generator of <math>S_t</math>; <math>\frac{\beta^2(X_t)}{2}\frac{\partial^2}{\partial x^2}+\alpha(X_t)\frac{\partial}{\partial x}</math> are the generator of <math>X_t</math>; <math>\rho\beta(X_t)\sigma(X_t)S_t\frac{\partial^2}{\partial x\partial s}</math> is the cross-term; and the remaining are the separate stochastic terms. We use when applying bi-variate It\^o Lemma for the process in \eqref{eq:SVM} and \eqref{eq:dX}:\\
For any twice differentiable function <math>f(t,s,x)</math>, the It\^o differential is
<span id{{=}}"eq:2Dito"/>
<math display="block">
\begin{align}
\label{eq:2Dito}
df(t,S_t,X_t) &= \mathcal Lf(t,S_t,X_t)dt+\sigma(X_t)S_t\frac{\partial}{\partial s}f(t,S_t,X_t)dW_t+\beta(X_t)\frac{\partial}{\partial x}f(t,S_t,X_t)dB_t\ .
\end{align}
</math>
The price <math>C(t,s,x)</math> is the price of claim <math>\psi(s,x)</math> paid at time <math>T</math> given <math>S_t=s</math> and <math>X_t=x</math>. We hedge with a self-financing portfolio <math>V_t</math> consisting of risky-asset, the risk-free bank account, and a 2nd derivative <math>C'</math> that is settled at time <math>T' > T</math>. This method for hedging stochastic volatility is called the Hull-White method.
The self-financing condition is
<math display="block">dV_t = a_tdS_t+b_tdC'(t,S_t,X_t)+r(V_t - a_tS_t-b_tC'(t,S_t,X_t))dt\ .</math>
Now, taking a long position in <math>V_t</math> and a short position in <math>C(t,S_t,X_t)</math>, we apply the bivariate It\^o lemma of \eqref{eq:2Dito} to get the dynamics of this new portfolio:
<math display="block">d(V_t-C(t,S_t,X_t))</math>
<math display="block">=a_t(\mu S_tdt+\sigma(X_t)S_tdW_t)</math>
<math display="block">+ b_t\mathcal LC'(t,S_t,X_t)dt+b_t\sigma(X_t)S_t\frac{\partial}{\partial s}C'(t,S_t,X_t)dW_t+b_t\beta(X_t)\frac{\partial}{\partial x}C'(t,S_t,X_t)dB_t</math>
<math display="block">+r(V_t - a_tS_t-b_tC'(t,S_t,X_t))dt</math>
<math display="block">-\mathcal LC(t,S_t,X_t)dt-\sigma(X_t)S_t\frac{\partial}{\partial s}C(t,S_t,X_t)dW_t-\beta(X_t)\frac{\partial}{\partial x}C(t,S_t,X_t)dB_t\ .</math>
We now choose <math>a_t</math> and <math>b_t</math> so that the <math>dW_t</math> and the <math>dB_t</math> terms vanish, which means that
<math display="block">
\begin{align*}
&\sigma(X_t)S_t\frac{\partial}{\partial s}C(t,S_t,X_t) = a_t\sigma(X_t)S_t+ b_t\sigma(X_t)S_t\frac{\partial}{\partial s}C'(t,S_t,X_t)\\
&\\
&\beta(X_t)\frac{\partial}{\partial x}C(t,S_t,X_t)=b_t\beta(X_t)\frac{\partial}{\partial x}C'(t,S_t,X_t)
\end{align*}
</math>
or
<span id{{=}}"eq:a_t"/><span id{{=}}"eq:b_t"/>
<math display="block">
\begin{align}
\label{eq:a_t}
&b_t = \frac{\frac{\partial}{\partial x}C(t,S_t,X_t)}{\frac{\partial}{\partial x}C'(t,S_t,X_t)}\\
\label{eq:b_t}
&a_t = \frac{\partial}{\partial s}C(t,S_t,X_t)-b_t\frac{\partial}{\partial s}C'(t,S_t,X_t)\ .
\end{align}
</math>
Plugging these solutions into the differential, the <math>\alpha(X_t)\frac{\partial}{\partial x}</math> terms cancel, the <math>\mu S_t\frac{\partial}{\partial s}</math> terms cancel, and by the same arbitrage argument as Black-Scholes, it follows that <math>V_t-C(t,S_t,X_t)</math> must grow at the risk-free rate:
<math display="block">
\begin{align*}
&d(V_t-C_t(t,S_t,X_t))\\
&= a_t\mu S_t dt+b_t\mathcal LC'(t,S_t,X_t)dt+r(V_t - a_tS_t-b_tC'(t,S_t,X_t))dt  -\mathcal LC(t,S_t,X_t)dt\\
&=r(V_t-C(t,S_t,X_t))dt\ .
\end{align*}
</math>
Inserting <math>a_t</math> and <math>b_t</math> from equations \eqref{eq:a_t} and \eqref{eq:b_t}, and after some rearranging of terms (and dropping the <math>dt</math>'s), we get
<math display="block">\left(\mathcal L-(\mu-r)S_t\frac{\partial}{\partial s}-r\right)C(t,S_t,X_t)= b_t\left(\mathcal L-(\mu-r)S_t\frac{\partial}{\partial s}-r\right)C'(t,S_t,X_t)</math>
and dividing both sides by <math>\frac{\partial}{\partial x}C(t,S_t,X_t)</math>, we get
<math display="block">\frac{\left(\mathcal L-(\mu-r)S_t\frac{\partial}{\partial s}-r\right)C(t,S_t,X_t)}{\frac{\partial}{\partial x}C(t,S_t,X_t)}= \frac{\left(\mathcal L-(\mu-r)S_t\frac{\partial}{\partial s}-r\right)C'(t,S_t,X_t)}{\frac{\partial}{\partial x}C'(t,S_t,X_t)}\doteq R(t,s,x)\ .</math>
The left-hand side of this equation does not depend on specifics of <math>C'</math> (e.g. maturity and strikes), and the right-hand side does not depend on specifics of <math>C</math>. Therefore, there is a function <math>R(t,s,x)</math> that does not depend on parameters such as strike price and maturity, and it equates the two sides of the expression. We let <math>R</math> take the form
<math display="block">R(t,s,x) = -\alpha(x)+\beta(x)\Lambda(t,s,x)</math>
where
<math display="block">\Lambda(t,s,x) = \rho\frac{\mu-r}{\sigma(x)}+g(t,s,x)\sqrt{1-\rho^2}\ ,</math>
where <math>g</math> is an arbitrary function. Hence, we arrive at the PDE for the price <math>C(t,s,x)</math> of a European Derivative with stochastic vol:
{{proofcard|Proposition|proposition-1|'''(Pricing PDE for Stochastic Volatility).'''
\label{prop:stochasticVolPDE}
Given the stochastic volatility model of equations \eqref{eq:SVM} and \eqref{eq:dX}, the price of a European claim with payoff <math>\psi(S_T)</math> satisfies
<span id{{=}}"eq:SVpde"/>
<math display="block">
\begin{align}
\nonumber
\Bigg(\frac{\partial}{\partial t}+\frac{\sigma^2(x)s^2}{2}\frac{\partial^2}{\partial s^2}+\rho\beta(x)\sigma(x)s\frac{\partial^2}{\partial x\partial s}+\alpha(x)\frac{\partial}{\partial x}+\frac{\beta^2(x)}{2}\frac{\partial^2}{\partial x^2}+rs\frac{\partial}{\partial s}-r\Bigg)C&\\
\label{eq:SVpde}
=\beta(x)\Lambda(t,s,x)\frac{\partial}{\partial x}C&
\end{align}
</math>
with <math>C\Big|_T=\psi(s,x)</math>.|}}
The PDE in \eqref{eq:SVpde} can be solved with Feynman-Kac,
<math display="block">C(t,s,x) = e^{-r(T-t)}\mathbb E^Q\{\psi(S_T,X_T)|S_t=s,X_t=x\}\ .</math>
which yields a description of an EMM, <math>\mathbb Q</math>, under which the stochastic volatility model is
<math display="block">
\begin{eqnarray*}
\frac{dS_t}{S_t}&=&r dt+\sigma(X_t)dW_t^Q\\
dX_t&=&\left(\alpha(X_t)-\beta(X_t)\Lambda(t,S_t,X_t)\right)dt+\beta(X_t)dB_t^Q
\end{eqnarray*}
</math>
where <math>W_t^Q</math> and <math>B_t^Q</math> are <math>\mathbb Q</math>-Brownian motions with the same correlation <math>\rho</math> as before. The interpretation of <math>\Lambda(t,s,x)</math> is that it is '''the market price of volatility risk.'''
One can think of a market as complete when there are as many non-redundant assets as there are sources of randomness. In this sense we have ‘completed’ this market with stochastic volatility by including the derivative <math>C'</math> among the traded assets. This has allowed us to replicate the European claim and obtain a unique no-arbitrage price that must be the solution to \eqref{eq:SVpde}.\\
'''Example'''
<span id{{=}}"eq:dSheston"/><span id{{=}}"eq:dXheston"/>
<math display="block">
\begin{align}
\label{eq:dSheston}
&dS_t=r S_tdt+\sqrt{X_t}dW_t^Q\\
\label{eq:dXheston}
&dX_t=\kappa(\bar X-X_t)dt+\lambda X_tdt+\gamma\sqrt{X_t}dB_t^Q
\end{align}
</math>
where <math>\lambda X_t</math> is the assumed form of the volatility price of risk. This model can be re-written,
<math display="block">dX_t = \tilde\kappa(\tilde{\bar X}-X_t)dt+\gamma\sqrt{X_t}dB_t^Q</math>
where <math>\tilde\kappa =\kappa-\lambda</math> and <math>\tilde{\bar X} = \frac{\kappa\bar X}{\kappa-\lambda}</math>. It is important to have the Feller condition
<math display="block">\gamma^2\leq 2\tilde\kappa\tilde{\bar X}</math>
otherwise the PDE requires boundaries for the event <math>X_t = 0</math>. The process <math>X_t</math> is degenerate, but it is well-known that the Feller condition is the critical assumption for the application of Feynman-Kac. The fit of the Heston model to the implied volatility of market data is shown in [[#fig:hestonFit|Figure]].
<div id="fig:hestonFit" class="d-flex justify-content-center">
[[File:guide_2ee3d_hestonFit.jpg | 400px | thumb | The fit of the Heston model to implied volatility of market data on July 27 of 2012, with different maturities. Notice that the Heston model doesn't fit very well to the options with shortest time to maturity. ]]
</div>
==Monte Carlo Methods==
A slow but often reliable method for pricing derivatives is to approximate the risk-neutral expectation with independent samples. Essentially, sample paths of Brownian motion are obtained using a pseudo-random number generated, the realized derivative payoff is computed for each sample, and then the average of these sample payoffs is an approximation of the true average. For example, let <math>\ell=1,2,\dots,N</math>, let <math>S_T^{(\ell)}</math> be the index for an independent sample from <math>S_T</math>'s probability distribution. If <math>\mathbb E^QS_T^2 < \infty</math>, then by the law of large numbers we have
<math display="block">\frac 1N\sum_{\ell=1}^N(S_T^{(\ell)}-K)^+\stackrel{N\rightarrow\infty}{\longrightarrow}\mathbb E^Q(S_T-K)^+\ .</math>
For stochastic differential equations, the typical method is to generate samples that are approximately from the same distribution as the price. The basic idea is to use an '''Euler-Maruyama scheme,'''
<span id{{=}}"eq:EMscheme"/>
<math display="block">
\begin{equation}
\label{eq:EMscheme}
\log \tilde S_{t+\Delta t}^{(\ell)} = \log \tilde S_t^{(\ell)}+\left(\mu-\frac 12\sigma^2\right)\Delta t+\sigma(W_{t+\Delta t}^{(\ell)}-W_t^{(\ell)})\ ,
\end{equation}
</math>
for some small <math>\Delta t > 0</math>, with
<math display="block">W_{t+\Delta t}^{(\ell)}-W_t^{(\ell)}\sim iid \mathcal N(0,\Delta t)\ .</math>
This scheme is basically the discrete backward Riemann sum from which the It\^o integral was derived (see [[guide:6d878fef61#chapt:brownianMotion |Chapter]]). In the limit, the Monte-Carlo average will be with little-o of the true expectation,
<math display="block">\lim_{N\rightarrow\infty} \frac 1N\sum_{\ell=1}^N(\tilde S_T^{(\ell)}-K)^+ = \mathbb E^Q(S_T-K)^++o(\Delta t)\ .</math>
The scheme in \eqref{eq:EMscheme} can easily be adapted for a local volatility model, and can also be adapted to stochastic volatility models, but the latter will also need a scheme for generating the volatility process <math>X_t</math>.
'''Example'''
<math display="block">
\begin{align*}
&\frac{dS_t}{S_t} = rdt+e^{X_t}dW_t^Q\\
&dX_t = \kappa(\bar X-X_t)dt+\gamma dB_t^Q
\end{align*}
</math>
with correlation <math>\rho</math> between <math>W_t^Q</math> and <math>B_t^Q</math>. The Euler-Maruyama scheme is
<math display="block">
\begin{align*}
&\log \tilde S_{t+\Delta t} = \log \tilde S_t+\left(r-\frac 12e^{2\tilde X_t}\right)\Delta t+e^{\tilde X_t}\left(\rho (B_{t+\Delta t}-B_t)+\sqrt{1-\rho^2}(W_{t+\Delta t}-W_t)\right)\\
&\tilde X_{t+\Delta t} = \tilde X_t+\kappa\left(\bar X-\tilde X_t \right)\Delta t+\gamma(B_{t+\Delta t}-B_t)
\end{align*}
</math>
where <math>B_{t+\Delta t}^{(\ell)}-B_t^{(\ell)}</math> and <math>W_{t+\Delta t}^{(\ell)}-W_t^{(\ell)}</math> are increments of independent Brownian motions.
Sometimes, there more clever methods of sampling need to be employed. For instance, the Heston model:
'''Example'''
<math display="block">\tilde X_{t+\Delta t}=\tilde X_t+\kappa(\bar X-\tilde X_t)\Delta t+\gamma\sqrt{\tilde X_t}\left(B_{t+\Delta t}-B_t\right)</math>
will allow the volatility process to become negative. An alternative is to consider the Stratonovich-type differential,<ref group="Notes" >This refers to the Stratonovich-type integral that is the limit of forward Riemann sums, <math>\int f_s\circ dB_s = \lim_N\sum_{n=1}^Nf_{t_n}(B_{t_n}-B_{t_{n-1}})</math>. For the square-root process, the It\^o lemma on <math>\sqrt{X_t}</math> is used to show that <math>\gamma\int \sqrt{X_s}dB_s = \gamma\lim_n\sum_n\sqrt{X_{t_{n-1}}}\Delta B_{t_{n-1}} = -\gamma\lim_n\sum_n\left(\sqrt{X_{t_n}}-\sqrt{X_{t_{n-1}}}\right)(B_{t_n}-B_{t_{n-1}}) +\gamma\lim_n\sum_n\sqrt{X_{t_n}}(B_{t_n}-B_{t_{n-1}})=-\frac{\gamma^2}{2}\int ds+\gamma\int \sqrt{X_s}\circ dB_s</math>.</ref>
<math display="block">dX_t = \left(\kappa(\bar X-X_t)-\frac{\gamma^2}{2}\right)dt+\gamma\sqrt{X_t}\circ dB_t</math>
and then use an implicit Euler-Maruyama scheme
<math display="block">\tilde X_{t+\Delta t}-\tilde X_t =\left( \kappa(\bar X-\tilde X_{t+\Delta t})-\frac{\gamma^2}{2}\right)\Delta t+\gamma \sqrt{\tilde X_{t+\Delta t}} \left(B_{t+\Delta t}-B_t\right)\ .</math>
which can be solved for <math>\sqrt{\tilde X_{t+\Delta t}}</math> using the quadratic equation. Notice that a condition for the discriminant to be real is the Feller condition that <math>\gamma^2\leq 2\kappa\bar X</math>.
In general, Monte Carlo methods are good because they can be used to price pretty much any type of derivative instrument, so long as the EMM is given. For instance, it is very easy to price an Asian option with Monte Carlo. Another advantages of Monte Carlo is that there is no need for an explicit solution to any kind of PDE; the methodology for pricing is to simply simulate and average. However, as mentioned above, this can be very slow, and so there is a need (particularly among practitioners) for faster ways to compute prices.
Another important topic to remark on is '''importance sampling,''' which refers to simulations generated under an equivalent measure for which certain rare events are more common. For instance, suppose one wants to use Monte Carlo to price a call option with extreme strike. It will be a waste of time to generate samples from the EMM's model if the overwhelming majority of them will finish out of the money. Instead, one can sample from another model that has more volatility, so that more samples finish in the money. Then, assign importance weights based on ratio of densities and take the average. See <ref name="glasserman">Glasserman, P. (2004).''Monte Carlo Methods in Financial Engineering''.Springer.</ref> for further reading on Monte Carlo methods in finance.
==Fourier Transform Methods==
The most acclaimed way to compute a European option price is with the so-called ''Fourier transform methods.'' In particular, the class of affine models in <ref name="dps2000">Duffie, D., Pan, J., and Singleton, K. (2000).Transform analysis and asset pricing for affine jump-diffusions.''Econometrica'', 68:1343--1376.</ref> have tractable pricing formulae because they have explicit expressions for their Fourier transforms. The essential ingredient to these methods is the characteristic function,
<math display="block">\phi_T(u)\doteq\mathbb E^Q\exp(iu\log(S_T))\qquad\forall u\in\mathbb C,~i=\sqrt{-1}\ ,</math>
where <math>\mathbb E^Q</math> now denotes expectation under an EMM. The function <math>\phi_T(u)</math> is the Fourier transform of the <math>\log S_T</math>'s distribution function,
<math display="block">\phi_T(u) = \int_{-\infty}^\infty  e^{iu s}f(s)ds</math>
where <math>f(s)</math> is the density function for <math>\log S_T</math>. The inverse Fourier transform is <math>f(s) = \tfrac{1}{2\pi}\int_{-\infty}^\infty e^{-iu s}\phi_T(u)du</math>, which leads to the cumulative distribution function
<math display="block">F(s)-F(0)=\int_0^sf(v)dv = \frac{1}{2\pi}\int_{-\infty}^\infty  \frac{1-e^{-ius}}{iu}\phi_T(u)du\ .</math>
The crucial piece of information ''is'' the characteristic function, as any European claim can be priced provided there is also a Fourier transform for the payoff function. Let <math>p_T</math> denote the density function on the risk-neutral distribution of <math>\log(S_T)</math> and let <math>\widehat\psi</math> denote the Fourier transform of the payoff function. It is shown in <ref name="lewis">Lewis, A.L. (2000).''Option Valuation under Stochastic Volatility with Mathematica  Code''.Finance Press.</ref> that
<math display="block">\mathbb E^Q\psi(\log(S_T)) = \int_{-\infty}^\infty\psi(s)f(s)ds=\int_{-\infty}^\infty\frac{1}{2\pi} \int_{iz-\infty}^{iz+\infty}e^{-ius}\widehat \psi(u) du f(s)ds</math>
<math display="block">=\frac{1}{2\pi}\int_{iz-\infty}^{iz+\infty}\widehat \psi(u)\int_{-\infty}^\infty e^{-ius}  p_T(s)ds du=\frac{1}{2\pi}\int_{iz-\infty}^{iz+\infty}\widehat \psi(u)\phi_T(-u) du</math>
where the constant <math>z\in\mathbb R</math> in the limit of integration is the appropriate strip of regularity for non-smooth payoff functions (see [[#tab:payoffs |Table]]). This usage of characteristic functions is far reaching, as the class of L\'evy jump models are defined in terms of their characteristic function, or equivalently with the L\'evy-Khintchine representation (for more on financial models with jumps see <ref name="contTankov">Cont., R. and Tankov, P. (2003).''Financial modelling with Jump Processes''.</ref>).
\begin{table}[htb]
\center
\caption{Fourier Transforms of Various Payoffs, <math>\widehat\psi(u)=\int_{-\infty}^\infty e^{ius}\psi(s)ds</math> and <math>\psi(s)=\frac{1}{2\pi} \int_{iz-\infty}^{iz+\infty}e^{-ius}\widehat \psi(u) du</math>, <math>z=\Im(u)</math>.}
\label{tab:payoffs}
\begin{tabular}{|p{4cm}|c|c|c|}
\hline
asset&<math>\psi(s)</math>&<math>\widehat\psi(u)</math>&regularity strip\\
\hline
call &<math>(e^s-K)^+</math>&<math>-\frac{K^{iu+1}}{u^2-iu}</math>&<math>z > 1</math>\\
put&<math>(K-e^s)^+</math>&<math>-\frac{K^{iu+1}}{u^2-iu}</math>&<math>z < 0</math>\\
covered call; cash secured put&<math>\min(e^s,K)</math>&<math>\frac{K^{iu+1}}{u^2-iu}</math>&<math>0 < z < 1</math>\\
cash or nothing call &<math>\indicator{e^s\geq K}</math>&<math>-\frac{K^{iu}}{iu}</math>&<math>z > 0</math>\\
cash or nothing put &<math>\indicator{e^s\leq K}</math>&<math>\frac{K^{iu}}{iu}</math>&<math>z < 0</math>\\
asset or nothing call&<math>e^s\indicator{e^s\geq K}</math>&<math>-\frac{K^{iu+1}}{iu+1}</math>&<math>z > 1</math>\\
asset or nothing put&<math>e^s\indicator{e^s\leq K}</math>&<math>\frac{K^{iu+1}}{iu+1}</math>&<math>z < 0</math>\\
Arrow-Debreu&<math>\delta(s-\log(K))</math>&<math>K^{iu}</math>&<math>z\in\mathbb R</math>\\
\hline
\end{tabular}
\end{table}
==Call-Option Pricing==
An important tool is the formula of Gil-Pelaez:
{{proofcard|Proposition (Gil-Pelaez Inversion Theorem)|proposition-2|The distribution function <math>F(s)</math> is given by
<span id{{=}}"eq:gilPelaezInversion"/>
<math display="block">
\begin{equation}
\label{eq:gilPelaezInversion}
F(s) = \frac12+\frac{1}{2\pi}\int_0^\infty \frac{e^{ius}\phi_T(-u)-e^{-ius}\phi_T(u)}{iu}du\ ,
\end{equation}
</math>
for all <math>s\in(-\infty,\infty)</math>.
\begin{proof}
First let's prove an identity,
<math display="block">\mbox{sign}(v-s) = \frac2\pi\int_0^\infty \frac{\sin(u(v-s))}{u}du = \left\{\begin{array}{cc}-1&\mbox{if }v < s\\0&\mbox{if }v=s\\1&\mbox{if }v > s\end{array}\right. \ .</math>
This integral is shown using residue calculus:
<math display="block">
\begin{align*}
\int_0^\infty \frac{\sin(u)}{u}du &= \frac12\int_{-\infty}^\infty \frac{\sin(u)}{u}du=\frac12 \lim_{\epsilon\rightarrow 0}\lim_{R\rightarrow 0}\mbox{Im}\left(\int_{-R}^{-\epsilon}\frac{e^{iz}}{z}dz+\int_\epsilon^R\frac{e^{iz}}{z}dz\right)\ .
\end{align*}
</math>
A closed path can be constructed from <math>-R</math> to <math>R</math>, making a semi-circle of radius <math>\epsilon</math> around zero, and then from <math>R</math> back to <math>-R</math> on a semi-circle of radius <math>R</math>. Along this path the integral of <math>e^{iz}/z</math> is zero,
<math display="block">\int_{-R}^{-\epsilon}\frac{e^{iz}}{z}dz+\int_\epsilon^R\frac{e^{iz}}{z}dz=-\int_{C_\epsilon}\frac{e^{iz}}{z}dz-\int_{C_R}\frac{e^{iz}}{z}dz=0\ ,</math>
where <math>C_\epsilon =\{\epsilon e^{i(\theta-\pi)}|0\leq \theta\leq \pi\}</math> and <math>C_R=\{Re^{i\theta}|0\leq \theta\leq \pi\}</math>. Now,
<math display="block">
\begin{align*}
\left|\int_{C_R}\frac{e^{iz}}{z}dz \right|&=\frac1R\left|\int_0^\pi\frac{e^{iR\cos(\theta)-R\sin(\theta)}}{e^{i\theta}}d\theta\right|\\
&\leq \frac{1}{R}\int_0^\pi\left|\frac{e^{iR\cos(\theta)-R\sin(\theta)}}{e^{i\theta}}\right|d\theta \\
&= \frac{1}{R}\int_0^\pi e^{-R\sin(\theta)}d\theta \\
& < \frac{\pi}{R}\rightarrow 0\qquad\hbox{as }R\rightarrow\infty \ ,
\end{align*}
</math>
and
<math display="block">\int_{C_\epsilon}\frac{e^{iz}}{z}dz = \int_\pi^0 \frac{1+O(\epsilon)}{\epsilon e^{i\theta}}i\epsilon e^{i\theta}d\theta = -i\pi +O(\epsilon)\ .</math>
Hence,
<math display="block">\int_0^\infty \frac{\sin(u)}{u}du = \frac12 \lim_{\epsilon\rightarrow 0}\lim_{R\rightarrow 0}\mbox{Im}\left(\int_{-R}^{-\epsilon}\frac{e^{iz}}{z}dz+\int_\epsilon^R\frac{e^{iz}}{z}dz\right)=\frac\pi2\ .</math>
Now notice that
<math display="block">
\begin{align*}
\frac1\pi\int_0^\infty \frac{e^{ius}\phi_T(-u)-e^{-ius}\phi_T(u)}{iu}du&=\frac1\pi\int_0^\infty\int_{-\infty}^\infty \frac{e^{-iu(v-s)}-e^{iu(v-s)}}{iu}dF(v)du\\
&=-\frac2\pi\int_0^\infty\int_{-\infty}^\infty \frac{\sin(u(v-s))}{u}dF(v)du\\
&=-\int_{-\infty}^\infty d F(v)\frac2\pi\int_0^\infty \frac{\sin(u(v-s))}{u}du\\
&=\int_{-\infty}^s dF(v)-\int_s^\infty dF(v)\\
&=2F(s)-1\ ,
\end{align*}
</math>
which proves \eqref{eq:gilPelaezInversion}.
\end{proof}|}}
Notice that \eqref{eq:gilPelaezInversion} can be further simplified to have
<math display="block">F(s) = \frac12-\frac{1}{\pi}\int_0^\infty \Real\left[\frac{e^{-ius}\phi_T(u)}{iu}\right]du\ ,</math>
which is used in pricing a European call option, as the risk-neutral probability of finishing in the money is
<math display="block">\Pi_2=\mathbb Q(S_T\geq K)=\frac 12+\frac{1}{\pi}\int_0^\infty\Real \left[\frac{e^{-iu\log(K)}\phi_T(u)}{iu}\right]du</math>
where <math>\mathbb Q</math> denotes probability under the EMM. The ‘delta’ of the option is
<math display="block">\Pi_1=\frac{e^{-rT}}{S_0}\mathbb E^QS_T\indicator{S_T\geq K}=\frac{e^{-rT}}{S_0}\int_{\log K}^\infty e^sf(s)ds\ ,</math>
but we can define a change of measure,
<math display="block">\frac{d\widetilde{\mathbb Q}}{d\mathbb Q} = \frac{e^{s_T}}{\mathbb E^Qe^{s_T}}=\frac{e^{s_T}}{S_0e^{rT}}\ ,</math>
where <math>s_T=\log S_T</math>. Then,
<math display="block">\mathbb E^{\widetilde Q}e^{ius_T} = \frac{\mathbb E^ Qe^{(1+iu)s_T} }{\mathbb E^Qe^{s_T}}= \frac{\phi_T(u-i)}{\phi_T(-i)}\ ,</math>
and therefore we have the formula
<math display="block">\Pi_1=\widetilde{\mathbb Q}(S_T\geq K)=\frac 12+\frac{1}{\pi}\int_0^\infty \Real \left[\frac{e^{-iu\log(K)}\phi_T(u-i)}{iu\phi_T(-i)}\right]du\ .</math>
Assuming no dividends, the value of a European call option with strike <math>K</math>, at time <math>t=0</math>, is
<span id{{=}}"eq:fourierPrice"/>
<math display="block">
\begin{equation}
\label{eq:fourierPrice}C= S_0\Pi_1-Ke^{-rT}\Pi_2\ .
\end{equation}
</math>
The pricing formula of equation \eqref{eq:fourierPrice} is (in one way or another) interpreted as an inverse Fourier transform. Indeed, the method of Carr and Madan <ref name="carrMadan1999">Carr, P., Madan, D., and Smith, R. (1999).Option valuation using the fast Fourier transform.''Journal of Computational Finance'', 2:61--73.</ref> shows that \eqref{eq:fourierPrice} can be well-approximated as a fast Fourier transform, which has the advantage of taking less time to compute than numerical integration, but it is also less accurate. Equation \eqref{eq:fourierPrice} has become an important piece of machinery in quantitative finance because it is (or is considered to be) an explicit solution to the pricing equation. In general the pricing formula of \eqref{eq:fourierPrice} applies to just about any situation where the characteristic function is given.
==Heston Explicit Formula==
Finally, an important application of the Fourier pricing formula in \eqref{eq:fourierPrice} is to the Heston model. The call option formula for the Heston model from equations \eqref{eq:dSheston} and \eqref{eq:dXheston} has a well-known inverse Fourier transform (see <ref name="gatheralBook">Gatheral, J. (2006).''The Volatility Surface: A Practitioner's Guide''.Wiley.</ref>) with
<math display="block">
\begin{align*}
\Pi_j^{heston}&=\frac 12+\frac{1}{\pi}\int_0^\infty\Real \left[\frac{e^{-iu\log(K)}\phi_T^j(u)}{iu}\right]du\qquad\hbox{for }j=1,2,\\
\phi_T^j(u)&=\exp\left(A_j(u)+B_j(u)X_0+iu(\log(S_0)+rT)\right)\\
A_j(u)&=\frac{\kappa\bar X}{\gamma^2}\left((b_j-\rho \gamma ui-d_j)T-2\log\left(\frac{1-g_je^{-d_jr}}{1-g_j}\right)\right)\\
B_j(u) &= \frac{b_j-\rho \gamma ui-d_j}{\gamma^2}\left(\frac{1-e^{-d_jr}}{1-g_je^{-d_jr}}\right)\\
g_j&= \frac{b_j-\rho \gamma ui-d_j}{b_j-\rho \gamma ui+d_j}\\
d_j&= \sqrt{(\rho\gamma u i-b_j)^2-\gamma^2(2c_jui-u^2)}\\
c_1&=\frac 12,~~c_2=-\frac 12,~~b_1 = \kappa-\rho\gamma,~~b_2 = \kappa\ ,
\end{align*}
</math>
which is applied for the Heston model the volatility price of risk absorbed into parameters <math>\kappa</math> and <math>\bar X</math>. '''Warning:''' Use a well-made Gaussian quadrature function for numerical computation of the integrals for the Heston price; do not attempt to integrate with an evenly-spaced grid.
==The Merton Jump Diffusion==
Short time-to-maturity options are known to be well fit by the class of models where log-asset prices are L\'evy processes. For instance, the Merton jump diffusion,
<math display="block">\frac{dS_t}{S_t} = \left(r-\nu\right)dt+\sigma dW_t+\left(e^{J_t}-1\right)dN_t</math>
where <math>\sigma > 0</math>, <math>N_t</math> is an independent Poisson process with parameter <math>\lambda\in(0,\infty)</math>, <math>J_t</math> is a sequence of i.i.d. normal random variables with <math>J_{t_i}\sim N(\mu_J,\sigma_J^2)</math> when <math>N_{t_i}-N_{t_i-} = 1</math>, and <math>\nu = \lambda\left(e^{\mu_J+\frac{\sigma_J^2}{2}}-1\right)</math> is a compensator to ensure the process is a discounted martingale. The log-price is
<math display="block">
\begin{align*}
\log(S_t/S_0) &= \int_0^t\left(r-\nu-\frac{\sigma^2}{2}\right)du+\sigma W_t+\int_0^tJ_udN_u\\
&=\int_0^t\left(r-\nu-\frac{\sigma^2}{2}\right)du+\sigma W_t+\sum_{n=1}^{N_t}J_n\ .
\end{align*}
</math>
For <math>S_0=1</math>, the characteristic function is derived as follows,
<math display="block">
\begin{align*}
\phi_T(u)&= \mathbb E\exp\left( iu\left(r-\nu-\frac{\sigma^2}{2}\right)T+iu\sigma W_T+iu\int_0^tJ_tdN_t\right)\\
&=\exp\left( iu\left(r-\nu-\frac{\sigma^2}{2}\right)T-\frac{\sigma^2u^2}{2}T+\lambda T\left(e^{iu\mu_J-\frac{\sigma_J^2u^2}{2}}-1\right)\right)\ .
\end{align*}
</math>
This is an example of a characteristic function that is known explicitly. In addition, it is particular example of the general L\'evy-Khintchine representation,
<math display="block">
\begin{align*}
&\mathbb E\exp\left(iu\log(S_T)\right)\\
& = \exp\left(  iu\left(r-\nu-\frac{\sigma^2}{2}\right)T-\frac{\sigma^2u^2}{2}T +T\int_{\mathbb R\setminus\{0\}}\left(e^{iux}-1-iux\mathbf 1_{|x| < 1}\right)\eta(dx)\right)\ ,
\end{align*}
</math>
where <math>\eta</math> is the intensity measure, satisfying <math>\int_{\mathbb R\setminus\{0\}}1\wedge x^2\eta(dx) < \infty</math>. For more on the L\'evy-Khintchine representation, see <ref name="contTankov">Cont., R. and Tankov, P. (2003).''Financial modelling with Jump Processes''.</ref>.
==General references==
{{cite arXiv|last1=Papanicolaou|first1=Andrew|year=2015|title=Introduction to Stochastic Differential Equations (SDEs) for Finance|eprint=1504.05309|class=q-fin.MF}}
==References==
{{reflist}}
==Notes==
{{Reflist|group=Notes}}

Revision as of 01:30, 4 June 2024

[math] \newcommand{\indicator}[1]{\mathbbm{1}_{\left[ {#1} \right] }} \newcommand{\Real}{\hbox{Re}} \newcommand{\HRule}{\rule{\linewidth}{0.5mm}} \newcommand{\mathds}{\mathbb}[/math]

\label{chapt:stochVol} Stochastic volatility is another popular way to fit the implied volatility smile. It has the advantage of assuming volatility brings another source of randomness, and this randomness can be hedge if it is also a Brownian motion. Furthermore, calibration of stochastic volatility models does have some robustness to the market's daily changes. However, whereas local volatility models are generally calibrated to fit the implied volatility surface (i.e. to fit options of all maturities), stochastic volatility models have trouble fitting more than one maturity at a time. Consider the following stochastic volatility model,

[[math]] \begin{eqnarray} \label{eq:SVM} dS_t&=&\mu S_tdt+\sigma(X_t)S_tdW_t\\ \label{eq:dX} dX_t &=&\alpha(X_t)dt+\beta(X_t)dB_t \end{eqnarray} [[/math]]

where [math]W_t[/math] and [math]B_t[/math] are Brownian motions with correlation [math]\rho\in(-1,1)[/math] such that [math]\mathbb E[dW_tdB_t]=\rho dt[/math], [math]\sigma(x) \gt 0[/math] is the volatility function, and [math]X_t[/math] is the volatility process and is usually mean-reverting. An example of a mean-revering process is the square-root process,

[[math]]dX_t = \kappa(\bar X-X_t)dt+\gamma\sqrt{X_t}dB_t\ .[[/math]]

The square-root process is used in the Heston model (along with [math]\sigma(x) =\sqrt{x}[/math]), and usually relies on what is known as the Feller condition ([math]\gamma^2\leq 2\bar X\kappa[/math]) so that [math]X_t[/math] never touches zero.

Hedging and Pricing

Assuming that [math]\alpha[/math] and [math]\beta[/math] are ‘well-behaved’ functions, we define the differential operator

[[math]]\mathcal L \doteq\frac{\partial}{\partial t}+\frac{\sigma^2(x)s^2}{2}\frac{\partial^2}{\partial s^2}+\mu s \frac{\partial}{\partial s}+\frac{\beta^2(x)}{2}\frac{\partial^2}{\partial x^2}+\alpha(x)\frac{\partial}{\partial x}+\rho\beta(x)\sigma(x)s\frac{\partial^2}{\partial x\partial s}\ ; [[/math]]

the interpretation of [math]\mathcal L[/math] is the following: [math]\frac{\sigma^2(X_t)S_t^2}{2}\frac{\partial^2}{\partial s^2}+\mu S_t \frac{\partial}{\partial s}[/math] are the generator of [math]S_t[/math]; [math]\frac{\beta^2(X_t)}{2}\frac{\partial^2}{\partial x^2}+\alpha(X_t)\frac{\partial}{\partial x}[/math] are the generator of [math]X_t[/math]; [math]\rho\beta(X_t)\sigma(X_t)S_t\frac{\partial^2}{\partial x\partial s}[/math] is the cross-term; and the remaining are the separate stochastic terms. We use when applying bi-variate It\^o Lemma for the process in \eqref{eq:SVM} and \eqref{eq:dX}:\\ For any twice differentiable function [math]f(t,s,x)[/math], the It\^o differential is

[[math]] \begin{align} \label{eq:2Dito} df(t,S_t,X_t) &= \mathcal Lf(t,S_t,X_t)dt+\sigma(X_t)S_t\frac{\partial}{\partial s}f(t,S_t,X_t)dW_t+\beta(X_t)\frac{\partial}{\partial x}f(t,S_t,X_t)dB_t\ . \end{align} [[/math]]


The price [math]C(t,s,x)[/math] is the price of claim [math]\psi(s,x)[/math] paid at time [math]T[/math] given [math]S_t=s[/math] and [math]X_t=x[/math]. We hedge with a self-financing portfolio [math]V_t[/math] consisting of risky-asset, the risk-free bank account, and a 2nd derivative [math]C'[/math] that is settled at time [math]T' \gt T[/math]. This method for hedging stochastic volatility is called the Hull-White method. The self-financing condition is

[[math]]dV_t = a_tdS_t+b_tdC'(t,S_t,X_t)+r(V_t - a_tS_t-b_tC'(t,S_t,X_t))dt\ .[[/math]]

Now, taking a long position in [math]V_t[/math] and a short position in [math]C(t,S_t,X_t)[/math], we apply the bivariate It\^o lemma of \eqref{eq:2Dito} to get the dynamics of this new portfolio:


[[math]]d(V_t-C(t,S_t,X_t))[[/math]]

[[math]]=a_t(\mu S_tdt+\sigma(X_t)S_tdW_t)[[/math]]

[[math]]+ b_t\mathcal LC'(t,S_t,X_t)dt+b_t\sigma(X_t)S_t\frac{\partial}{\partial s}C'(t,S_t,X_t)dW_t+b_t\beta(X_t)\frac{\partial}{\partial x}C'(t,S_t,X_t)dB_t[[/math]]

[[math]]+r(V_t - a_tS_t-b_tC'(t,S_t,X_t))dt[[/math]]

[[math]]-\mathcal LC(t,S_t,X_t)dt-\sigma(X_t)S_t\frac{\partial}{\partial s}C(t,S_t,X_t)dW_t-\beta(X_t)\frac{\partial}{\partial x}C(t,S_t,X_t)dB_t\ .[[/math]]

We now choose [math]a_t[/math] and [math]b_t[/math] so that the [math]dW_t[/math] and the [math]dB_t[/math] terms vanish, which means that

[[math]] \begin{align*} &\sigma(X_t)S_t\frac{\partial}{\partial s}C(t,S_t,X_t) = a_t\sigma(X_t)S_t+ b_t\sigma(X_t)S_t\frac{\partial}{\partial s}C'(t,S_t,X_t)\\ &\\ &\beta(X_t)\frac{\partial}{\partial x}C(t,S_t,X_t)=b_t\beta(X_t)\frac{\partial}{\partial x}C'(t,S_t,X_t) \end{align*} [[/math]]

or

[[math]] \begin{align} \label{eq:a_t} &b_t = \frac{\frac{\partial}{\partial x}C(t,S_t,X_t)}{\frac{\partial}{\partial x}C'(t,S_t,X_t)}\\ \label{eq:b_t} &a_t = \frac{\partial}{\partial s}C(t,S_t,X_t)-b_t\frac{\partial}{\partial s}C'(t,S_t,X_t)\ . \end{align} [[/math]]

Plugging these solutions into the differential, the [math]\alpha(X_t)\frac{\partial}{\partial x}[/math] terms cancel, the [math]\mu S_t\frac{\partial}{\partial s}[/math] terms cancel, and by the same arbitrage argument as Black-Scholes, it follows that [math]V_t-C(t,S_t,X_t)[/math] must grow at the risk-free rate:

[[math]] \begin{align*} &d(V_t-C_t(t,S_t,X_t))\\ &= a_t\mu S_t dt+b_t\mathcal LC'(t,S_t,X_t)dt+r(V_t - a_tS_t-b_tC'(t,S_t,X_t))dt -\mathcal LC(t,S_t,X_t)dt\\ &=r(V_t-C(t,S_t,X_t))dt\ . \end{align*} [[/math]]

Inserting [math]a_t[/math] and [math]b_t[/math] from equations \eqref{eq:a_t} and \eqref{eq:b_t}, and after some rearranging of terms (and dropping the [math]dt[/math]'s), we get

[[math]]\left(\mathcal L-(\mu-r)S_t\frac{\partial}{\partial s}-r\right)C(t,S_t,X_t)= b_t\left(\mathcal L-(\mu-r)S_t\frac{\partial}{\partial s}-r\right)C'(t,S_t,X_t)[[/math]]

and dividing both sides by [math]\frac{\partial}{\partial x}C(t,S_t,X_t)[/math], we get

[[math]]\frac{\left(\mathcal L-(\mu-r)S_t\frac{\partial}{\partial s}-r\right)C(t,S_t,X_t)}{\frac{\partial}{\partial x}C(t,S_t,X_t)}= \frac{\left(\mathcal L-(\mu-r)S_t\frac{\partial}{\partial s}-r\right)C'(t,S_t,X_t)}{\frac{\partial}{\partial x}C'(t,S_t,X_t)}\doteq R(t,s,x)\ .[[/math]]

The left-hand side of this equation does not depend on specifics of [math]C'[/math] (e.g. maturity and strikes), and the right-hand side does not depend on specifics of [math]C[/math]. Therefore, there is a function [math]R(t,s,x)[/math] that does not depend on parameters such as strike price and maturity, and it equates the two sides of the expression. We let [math]R[/math] take the form

[[math]]R(t,s,x) = -\alpha(x)+\beta(x)\Lambda(t,s,x)[[/math]]

where

[[math]]\Lambda(t,s,x) = \rho\frac{\mu-r}{\sigma(x)}+g(t,s,x)\sqrt{1-\rho^2}\ ,[[/math]]

where [math]g[/math] is an arbitrary function. Hence, we arrive at the PDE for the price [math]C(t,s,x)[/math] of a European Derivative with stochastic vol:

Proposition

(Pricing PDE for Stochastic Volatility). \label{prop:stochasticVolPDE} Given the stochastic volatility model of equations \eqref{eq:SVM} and \eqref{eq:dX}, the price of a European claim with payoff [math]\psi(S_T)[/math] satisfies

[[math]] \begin{align} \nonumber \Bigg(\frac{\partial}{\partial t}+\frac{\sigma^2(x)s^2}{2}\frac{\partial^2}{\partial s^2}+\rho\beta(x)\sigma(x)s\frac{\partial^2}{\partial x\partial s}+\alpha(x)\frac{\partial}{\partial x}+\frac{\beta^2(x)}{2}\frac{\partial^2}{\partial x^2}+rs\frac{\partial}{\partial s}-r\Bigg)C&\\ \label{eq:SVpde} =\beta(x)\Lambda(t,s,x)\frac{\partial}{\partial x}C& \end{align} [[/math]]
with [math]C\Big|_T=\psi(s,x)[/math].

The PDE in \eqref{eq:SVpde} can be solved with Feynman-Kac,

[[math]]C(t,s,x) = e^{-r(T-t)}\mathbb E^Q\{\psi(S_T,X_T)|S_t=s,X_t=x\}\ .[[/math]]

which yields a description of an EMM, [math]\mathbb Q[/math], under which the stochastic volatility model is

[[math]] \begin{eqnarray*} \frac{dS_t}{S_t}&=&r dt+\sigma(X_t)dW_t^Q\\ dX_t&=&\left(\alpha(X_t)-\beta(X_t)\Lambda(t,S_t,X_t)\right)dt+\beta(X_t)dB_t^Q \end{eqnarray*} [[/math]]

where [math]W_t^Q[/math] and [math]B_t^Q[/math] are [math]\mathbb Q[/math]-Brownian motions with the same correlation [math]\rho[/math] as before. The interpretation of [math]\Lambda(t,s,x)[/math] is that it is the market price of volatility risk. One can think of a market as complete when there are as many non-redundant assets as there are sources of randomness. In this sense we have ‘completed’ this market with stochastic volatility by including the derivative [math]C'[/math] among the traded assets. This has allowed us to replicate the European claim and obtain a unique no-arbitrage price that must be the solution to \eqref{eq:SVpde}.\\

Example

[[math]] \begin{align} \label{eq:dSheston} &dS_t=r S_tdt+\sqrt{X_t}dW_t^Q\\ \label{eq:dXheston} &dX_t=\kappa(\bar X-X_t)dt+\lambda X_tdt+\gamma\sqrt{X_t}dB_t^Q \end{align} [[/math]]

where [math]\lambda X_t[/math] is the assumed form of the volatility price of risk. This model can be re-written,

[[math]]dX_t = \tilde\kappa(\tilde{\bar X}-X_t)dt+\gamma\sqrt{X_t}dB_t^Q[[/math]]

where [math]\tilde\kappa =\kappa-\lambda[/math] and [math]\tilde{\bar X} = \frac{\kappa\bar X}{\kappa-\lambda}[/math]. It is important to have the Feller condition

[[math]]\gamma^2\leq 2\tilde\kappa\tilde{\bar X}[[/math]]

otherwise the PDE requires boundaries for the event [math]X_t = 0[/math]. The process [math]X_t[/math] is degenerate, but it is well-known that the Feller condition is the critical assumption for the application of Feynman-Kac. The fit of the Heston model to the implied volatility of market data is shown in Figure.

The fit of the Heston model to implied volatility of market data on July 27 of 2012, with different maturities. Notice that the Heston model doesn't fit very well to the options with shortest time to maturity.

Monte Carlo Methods

A slow but often reliable method for pricing derivatives is to approximate the risk-neutral expectation with independent samples. Essentially, sample paths of Brownian motion are obtained using a pseudo-random number generated, the realized derivative payoff is computed for each sample, and then the average of these sample payoffs is an approximation of the true average. For example, let [math]\ell=1,2,\dots,N[/math], let [math]S_T^{(\ell)}[/math] be the index for an independent sample from [math]S_T[/math]'s probability distribution. If [math]\mathbb E^QS_T^2 \lt \infty[/math], then by the law of large numbers we have

[[math]]\frac 1N\sum_{\ell=1}^N(S_T^{(\ell)}-K)^+\stackrel{N\rightarrow\infty}{\longrightarrow}\mathbb E^Q(S_T-K)^+\ .[[/math]]

For stochastic differential equations, the typical method is to generate samples that are approximately from the same distribution as the price. The basic idea is to use an Euler-Maruyama scheme,

[[math]] \begin{equation} \label{eq:EMscheme} \log \tilde S_{t+\Delta t}^{(\ell)} = \log \tilde S_t^{(\ell)}+\left(\mu-\frac 12\sigma^2\right)\Delta t+\sigma(W_{t+\Delta t}^{(\ell)}-W_t^{(\ell)})\ , \end{equation} [[/math]]

for some small [math]\Delta t \gt 0[/math], with

[[math]]W_{t+\Delta t}^{(\ell)}-W_t^{(\ell)}\sim iid \mathcal N(0,\Delta t)\ .[[/math]]

This scheme is basically the discrete backward Riemann sum from which the It\^o integral was derived (see Chapter). In the limit, the Monte-Carlo average will be with little-o of the true expectation,

[[math]]\lim_{N\rightarrow\infty} \frac 1N\sum_{\ell=1}^N(\tilde S_T^{(\ell)}-K)^+ = \mathbb E^Q(S_T-K)^++o(\Delta t)\ .[[/math]]

The scheme in \eqref{eq:EMscheme} can easily be adapted for a local volatility model, and can also be adapted to stochastic volatility models, but the latter will also need a scheme for generating the volatility process [math]X_t[/math]. Example

[[math]] \begin{align*} &\frac{dS_t}{S_t} = rdt+e^{X_t}dW_t^Q\\ &dX_t = \kappa(\bar X-X_t)dt+\gamma dB_t^Q \end{align*} [[/math]]

with correlation [math]\rho[/math] between [math]W_t^Q[/math] and [math]B_t^Q[/math]. The Euler-Maruyama scheme is

[[math]] \begin{align*} &\log \tilde S_{t+\Delta t} = \log \tilde S_t+\left(r-\frac 12e^{2\tilde X_t}\right)\Delta t+e^{\tilde X_t}\left(\rho (B_{t+\Delta t}-B_t)+\sqrt{1-\rho^2}(W_{t+\Delta t}-W_t)\right)\\ &\tilde X_{t+\Delta t} = \tilde X_t+\kappa\left(\bar X-\tilde X_t \right)\Delta t+\gamma(B_{t+\Delta t}-B_t) \end{align*} [[/math]]

where [math]B_{t+\Delta t}^{(\ell)}-B_t^{(\ell)}[/math] and [math]W_{t+\Delta t}^{(\ell)}-W_t^{(\ell)}[/math] are increments of independent Brownian motions.

Sometimes, there more clever methods of sampling need to be employed. For instance, the Heston model: Example

[[math]]\tilde X_{t+\Delta t}=\tilde X_t+\kappa(\bar X-\tilde X_t)\Delta t+\gamma\sqrt{\tilde X_t}\left(B_{t+\Delta t}-B_t\right)[[/math]]

will allow the volatility process to become negative. An alternative is to consider the Stratonovich-type differential,[Notes 1]

[[math]]dX_t = \left(\kappa(\bar X-X_t)-\frac{\gamma^2}{2}\right)dt+\gamma\sqrt{X_t}\circ dB_t[[/math]]

and then use an implicit Euler-Maruyama scheme

[[math]]\tilde X_{t+\Delta t}-\tilde X_t =\left( \kappa(\bar X-\tilde X_{t+\Delta t})-\frac{\gamma^2}{2}\right)\Delta t+\gamma \sqrt{\tilde X_{t+\Delta t}} \left(B_{t+\Delta t}-B_t\right)\ .[[/math]]

which can be solved for [math]\sqrt{\tilde X_{t+\Delta t}}[/math] using the quadratic equation. Notice that a condition for the discriminant to be real is the Feller condition that [math]\gamma^2\leq 2\kappa\bar X[/math].

In general, Monte Carlo methods are good because they can be used to price pretty much any type of derivative instrument, so long as the EMM is given. For instance, it is very easy to price an Asian option with Monte Carlo. Another advantages of Monte Carlo is that there is no need for an explicit solution to any kind of PDE; the methodology for pricing is to simply simulate and average. However, as mentioned above, this can be very slow, and so there is a need (particularly among practitioners) for faster ways to compute prices. Another important topic to remark on is importance sampling, which refers to simulations generated under an equivalent measure for which certain rare events are more common. For instance, suppose one wants to use Monte Carlo to price a call option with extreme strike. It will be a waste of time to generate samples from the EMM's model if the overwhelming majority of them will finish out of the money. Instead, one can sample from another model that has more volatility, so that more samples finish in the money. Then, assign importance weights based on ratio of densities and take the average. See [1] for further reading on Monte Carlo methods in finance.

Fourier Transform Methods

The most acclaimed way to compute a European option price is with the so-called Fourier transform methods. In particular, the class of affine models in [2] have tractable pricing formulae because they have explicit expressions for their Fourier transforms. The essential ingredient to these methods is the characteristic function,

[[math]]\phi_T(u)\doteq\mathbb E^Q\exp(iu\log(S_T))\qquad\forall u\in\mathbb C,~i=\sqrt{-1}\ ,[[/math]]

where [math]\mathbb E^Q[/math] now denotes expectation under an EMM. The function [math]\phi_T(u)[/math] is the Fourier transform of the [math]\log S_T[/math]'s distribution function,

[[math]]\phi_T(u) = \int_{-\infty}^\infty e^{iu s}f(s)ds[[/math]]

where [math]f(s)[/math] is the density function for [math]\log S_T[/math]. The inverse Fourier transform is [math]f(s) = \tfrac{1}{2\pi}\int_{-\infty}^\infty e^{-iu s}\phi_T(u)du[/math], which leads to the cumulative distribution function

[[math]]F(s)-F(0)=\int_0^sf(v)dv = \frac{1}{2\pi}\int_{-\infty}^\infty \frac{1-e^{-ius}}{iu}\phi_T(u)du\ .[[/math]]

The crucial piece of information is the characteristic function, as any European claim can be priced provided there is also a Fourier transform for the payoff function. Let [math]p_T[/math] denote the density function on the risk-neutral distribution of [math]\log(S_T)[/math] and let [math]\widehat\psi[/math] denote the Fourier transform of the payoff function. It is shown in [3] that

[[math]]\mathbb E^Q\psi(\log(S_T)) = \int_{-\infty}^\infty\psi(s)f(s)ds=\int_{-\infty}^\infty\frac{1}{2\pi} \int_{iz-\infty}^{iz+\infty}e^{-ius}\widehat \psi(u) du f(s)ds[[/math]]

[[math]]=\frac{1}{2\pi}\int_{iz-\infty}^{iz+\infty}\widehat \psi(u)\int_{-\infty}^\infty e^{-ius} p_T(s)ds du=\frac{1}{2\pi}\int_{iz-\infty}^{iz+\infty}\widehat \psi(u)\phi_T(-u) du[[/math]]

where the constant [math]z\in\mathbb R[/math] in the limit of integration is the appropriate strip of regularity for non-smooth payoff functions (see Table). This usage of characteristic functions is far reaching, as the class of L\'evy jump models are defined in terms of their characteristic function, or equivalently with the L\'evy-Khintchine representation (for more on financial models with jumps see [4]). \begin{table}[htb] \center \caption{Fourier Transforms of Various Payoffs, [math]\widehat\psi(u)=\int_{-\infty}^\infty e^{ius}\psi(s)ds[/math] and [math]\psi(s)=\frac{1}{2\pi} \int_{iz-\infty}^{iz+\infty}e^{-ius}\widehat \psi(u) du[/math], [math]z=\Im(u)[/math].} \label{tab:payoffs} \begin{tabular}{|p{4cm}|c|c|c|} \hline asset&[math]\psi(s)[/math]&[math]\widehat\psi(u)[/math]&regularity strip\\ \hline call &[math](e^s-K)^+[/math]&[math]-\frac{K^{iu+1}}{u^2-iu}[/math]&[math]z \gt 1[/math]\\ put&[math](K-e^s)^+[/math]&[math]-\frac{K^{iu+1}}{u^2-iu}[/math]&[math]z \lt 0[/math]\\ covered call; cash secured put&[math]\min(e^s,K)[/math]&[math]\frac{K^{iu+1}}{u^2-iu}[/math]&[math]0 \lt z \lt 1[/math]\\ cash or nothing call &[math]\indicator{e^s\geq K}[/math]&[math]-\frac{K^{iu}}{iu}[/math]&[math]z \gt 0[/math]\\ cash or nothing put &[math]\indicator{e^s\leq K}[/math]&[math]\frac{K^{iu}}{iu}[/math]&[math]z \lt 0[/math]\\ asset or nothing call&[math]e^s\indicator{e^s\geq K}[/math]&[math]-\frac{K^{iu+1}}{iu+1}[/math]&[math]z \gt 1[/math]\\ asset or nothing put&[math]e^s\indicator{e^s\leq K}[/math]&[math]\frac{K^{iu+1}}{iu+1}[/math]&[math]z \lt 0[/math]\\ Arrow-Debreu&[math]\delta(s-\log(K))[/math]&[math]K^{iu}[/math]&[math]z\in\mathbb R[/math]\\ \hline \end{tabular} \end{table}

Call-Option Pricing

An important tool is the formula of Gil-Pelaez:

Proposition (Gil-Pelaez Inversion Theorem)

The distribution function [math]F(s)[/math] is given by

[[math]] \begin{equation} \label{eq:gilPelaezInversion} F(s) = \frac12+\frac{1}{2\pi}\int_0^\infty \frac{e^{ius}\phi_T(-u)-e^{-ius}\phi_T(u)}{iu}du\ , \end{equation} [[/math]]
for all [math]s\in(-\infty,\infty)[/math]. \begin{proof} First let's prove an identity,

[[math]]\mbox{sign}(v-s) = \frac2\pi\int_0^\infty \frac{\sin(u(v-s))}{u}du = \left\{\begin{array}{cc}-1&\mbox{if }v \lt s\\0&\mbox{if }v=s\\1&\mbox{if }v \gt s\end{array}\right. \ .[[/math]]
This integral is shown using residue calculus:

[[math]] \begin{align*} \int_0^\infty \frac{\sin(u)}{u}du &= \frac12\int_{-\infty}^\infty \frac{\sin(u)}{u}du=\frac12 \lim_{\epsilon\rightarrow 0}\lim_{R\rightarrow 0}\mbox{Im}\left(\int_{-R}^{-\epsilon}\frac{e^{iz}}{z}dz+\int_\epsilon^R\frac{e^{iz}}{z}dz\right)\ . \end{align*} [[/math]]
A closed path can be constructed from [math]-R[/math] to [math]R[/math], making a semi-circle of radius [math]\epsilon[/math] around zero, and then from [math]R[/math] back to [math]-R[/math] on a semi-circle of radius [math]R[/math]. Along this path the integral of [math]e^{iz}/z[/math] is zero,

[[math]]\int_{-R}^{-\epsilon}\frac{e^{iz}}{z}dz+\int_\epsilon^R\frac{e^{iz}}{z}dz=-\int_{C_\epsilon}\frac{e^{iz}}{z}dz-\int_{C_R}\frac{e^{iz}}{z}dz=0\ ,[[/math]]
where [math]C_\epsilon =\{\epsilon e^{i(\theta-\pi)}|0\leq \theta\leq \pi\}[/math] and [math]C_R=\{Re^{i\theta}|0\leq \theta\leq \pi\}[/math]. Now,

[[math]] \begin{align*} \left|\int_{C_R}\frac{e^{iz}}{z}dz \right|&=\frac1R\left|\int_0^\pi\frac{e^{iR\cos(\theta)-R\sin(\theta)}}{e^{i\theta}}d\theta\right|\\ &\leq \frac{1}{R}\int_0^\pi\left|\frac{e^{iR\cos(\theta)-R\sin(\theta)}}{e^{i\theta}}\right|d\theta \\ &= \frac{1}{R}\int_0^\pi e^{-R\sin(\theta)}d\theta \\ & \lt \frac{\pi}{R}\rightarrow 0\qquad\hbox{as }R\rightarrow\infty \ , \end{align*} [[/math]]
and

[[math]]\int_{C_\epsilon}\frac{e^{iz}}{z}dz = \int_\pi^0 \frac{1+O(\epsilon)}{\epsilon e^{i\theta}}i\epsilon e^{i\theta}d\theta = -i\pi +O(\epsilon)\ .[[/math]]
Hence,

[[math]]\int_0^\infty \frac{\sin(u)}{u}du = \frac12 \lim_{\epsilon\rightarrow 0}\lim_{R\rightarrow 0}\mbox{Im}\left(\int_{-R}^{-\epsilon}\frac{e^{iz}}{z}dz+\int_\epsilon^R\frac{e^{iz}}{z}dz\right)=\frac\pi2\ .[[/math]]
Now notice that

[[math]] \begin{align*} \frac1\pi\int_0^\infty \frac{e^{ius}\phi_T(-u)-e^{-ius}\phi_T(u)}{iu}du&=\frac1\pi\int_0^\infty\int_{-\infty}^\infty \frac{e^{-iu(v-s)}-e^{iu(v-s)}}{iu}dF(v)du\\ &=-\frac2\pi\int_0^\infty\int_{-\infty}^\infty \frac{\sin(u(v-s))}{u}dF(v)du\\ &=-\int_{-\infty}^\infty d F(v)\frac2\pi\int_0^\infty \frac{\sin(u(v-s))}{u}du\\ &=\int_{-\infty}^s dF(v)-\int_s^\infty dF(v)\\ &=2F(s)-1\ , \end{align*} [[/math]]
which proves \eqref{eq:gilPelaezInversion}. \end{proof}

Notice that \eqref{eq:gilPelaezInversion} can be further simplified to have

[[math]]F(s) = \frac12-\frac{1}{\pi}\int_0^\infty \Real\left[\frac{e^{-ius}\phi_T(u)}{iu}\right]du\ ,[[/math]]

which is used in pricing a European call option, as the risk-neutral probability of finishing in the money is

[[math]]\Pi_2=\mathbb Q(S_T\geq K)=\frac 12+\frac{1}{\pi}\int_0^\infty\Real \left[\frac{e^{-iu\log(K)}\phi_T(u)}{iu}\right]du[[/math]]

where [math]\mathbb Q[/math] denotes probability under the EMM. The ‘delta’ of the option is

[[math]]\Pi_1=\frac{e^{-rT}}{S_0}\mathbb E^QS_T\indicator{S_T\geq K}=\frac{e^{-rT}}{S_0}\int_{\log K}^\infty e^sf(s)ds\ ,[[/math]]

but we can define a change of measure,

[[math]]\frac{d\widetilde{\mathbb Q}}{d\mathbb Q} = \frac{e^{s_T}}{\mathbb E^Qe^{s_T}}=\frac{e^{s_T}}{S_0e^{rT}}\ ,[[/math]]

where [math]s_T=\log S_T[/math]. Then,

[[math]]\mathbb E^{\widetilde Q}e^{ius_T} = \frac{\mathbb E^ Qe^{(1+iu)s_T} }{\mathbb E^Qe^{s_T}}= \frac{\phi_T(u-i)}{\phi_T(-i)}\ ,[[/math]]

and therefore we have the formula

[[math]]\Pi_1=\widetilde{\mathbb Q}(S_T\geq K)=\frac 12+\frac{1}{\pi}\int_0^\infty \Real \left[\frac{e^{-iu\log(K)}\phi_T(u-i)}{iu\phi_T(-i)}\right]du\ .[[/math]]

Assuming no dividends, the value of a European call option with strike [math]K[/math], at time [math]t=0[/math], is

[[math]] \begin{equation} \label{eq:fourierPrice}C= S_0\Pi_1-Ke^{-rT}\Pi_2\ . \end{equation} [[/math]]

The pricing formula of equation \eqref{eq:fourierPrice} is (in one way or another) interpreted as an inverse Fourier transform. Indeed, the method of Carr and Madan [5] shows that \eqref{eq:fourierPrice} can be well-approximated as a fast Fourier transform, which has the advantage of taking less time to compute than numerical integration, but it is also less accurate. Equation \eqref{eq:fourierPrice} has become an important piece of machinery in quantitative finance because it is (or is considered to be) an explicit solution to the pricing equation. In general the pricing formula of \eqref{eq:fourierPrice} applies to just about any situation where the characteristic function is given.

Heston Explicit Formula

Finally, an important application of the Fourier pricing formula in \eqref{eq:fourierPrice} is to the Heston model. The call option formula for the Heston model from equations \eqref{eq:dSheston} and \eqref{eq:dXheston} has a well-known inverse Fourier transform (see [6]) with

[[math]] \begin{align*} \Pi_j^{heston}&=\frac 12+\frac{1}{\pi}\int_0^\infty\Real \left[\frac{e^{-iu\log(K)}\phi_T^j(u)}{iu}\right]du\qquad\hbox{for }j=1,2,\\ \phi_T^j(u)&=\exp\left(A_j(u)+B_j(u)X_0+iu(\log(S_0)+rT)\right)\\ A_j(u)&=\frac{\kappa\bar X}{\gamma^2}\left((b_j-\rho \gamma ui-d_j)T-2\log\left(\frac{1-g_je^{-d_jr}}{1-g_j}\right)\right)\\ B_j(u) &= \frac{b_j-\rho \gamma ui-d_j}{\gamma^2}\left(\frac{1-e^{-d_jr}}{1-g_je^{-d_jr}}\right)\\ g_j&= \frac{b_j-\rho \gamma ui-d_j}{b_j-\rho \gamma ui+d_j}\\ d_j&= \sqrt{(\rho\gamma u i-b_j)^2-\gamma^2(2c_jui-u^2)}\\ c_1&=\frac 12,~~c_2=-\frac 12,~~b_1 = \kappa-\rho\gamma,~~b_2 = \kappa\ , \end{align*} [[/math]]

which is applied for the Heston model the volatility price of risk absorbed into parameters [math]\kappa[/math] and [math]\bar X[/math]. Warning: Use a well-made Gaussian quadrature function for numerical computation of the integrals for the Heston price; do not attempt to integrate with an evenly-spaced grid.

The Merton Jump Diffusion

Short time-to-maturity options are known to be well fit by the class of models where log-asset prices are L\'evy processes. For instance, the Merton jump diffusion,

[[math]]\frac{dS_t}{S_t} = \left(r-\nu\right)dt+\sigma dW_t+\left(e^{J_t}-1\right)dN_t[[/math]]

where [math]\sigma \gt 0[/math], [math]N_t[/math] is an independent Poisson process with parameter [math]\lambda\in(0,\infty)[/math], [math]J_t[/math] is a sequence of i.i.d. normal random variables with [math]J_{t_i}\sim N(\mu_J,\sigma_J^2)[/math] when [math]N_{t_i}-N_{t_i-} = 1[/math], and [math]\nu = \lambda\left(e^{\mu_J+\frac{\sigma_J^2}{2}}-1\right)[/math] is a compensator to ensure the process is a discounted martingale. The log-price is

[[math]] \begin{align*} \log(S_t/S_0) &= \int_0^t\left(r-\nu-\frac{\sigma^2}{2}\right)du+\sigma W_t+\int_0^tJ_udN_u\\ &=\int_0^t\left(r-\nu-\frac{\sigma^2}{2}\right)du+\sigma W_t+\sum_{n=1}^{N_t}J_n\ . \end{align*} [[/math]]


For [math]S_0=1[/math], the characteristic function is derived as follows,

[[math]] \begin{align*} \phi_T(u)&= \mathbb E\exp\left( iu\left(r-\nu-\frac{\sigma^2}{2}\right)T+iu\sigma W_T+iu\int_0^tJ_tdN_t\right)\\ &=\exp\left( iu\left(r-\nu-\frac{\sigma^2}{2}\right)T-\frac{\sigma^2u^2}{2}T+\lambda T\left(e^{iu\mu_J-\frac{\sigma_J^2u^2}{2}}-1\right)\right)\ . \end{align*} [[/math]]

This is an example of a characteristic function that is known explicitly. In addition, it is particular example of the general L\'evy-Khintchine representation,

[[math]] \begin{align*} &\mathbb E\exp\left(iu\log(S_T)\right)\\ & = \exp\left( iu\left(r-\nu-\frac{\sigma^2}{2}\right)T-\frac{\sigma^2u^2}{2}T +T\int_{\mathbb R\setminus\{0\}}\left(e^{iux}-1-iux\mathbf 1_{|x| \lt 1}\right)\eta(dx)\right)\ , \end{align*} [[/math]]

where [math]\eta[/math] is the intensity measure, satisfying [math]\int_{\mathbb R\setminus\{0\}}1\wedge x^2\eta(dx) \lt \infty[/math]. For more on the L\'evy-Khintchine representation, see [4].

General references

Papanicolaou, Andrew (2015). "Introduction to Stochastic Differential Equations (SDEs) for Finance". arXiv:1504.05309 [q-fin.MF].

References

  1. Glasserman, P. (2004).Monte Carlo Methods in Financial Engineering.Springer.
  2. Duffie, D., Pan, J., and Singleton, K. (2000).Transform analysis and asset pricing for affine jump-diffusions.Econometrica, 68:1343--1376.
  3. Lewis, A.L. (2000).Option Valuation under Stochastic Volatility with Mathematica Code.Finance Press.
  4. 4.0 4.1 Cont., R. and Tankov, P. (2003).Financial modelling with Jump Processes.
  5. Carr, P., Madan, D., and Smith, R. (1999).Option valuation using the fast Fourier transform.Journal of Computational Finance, 2:61--73.
  6. Gatheral, J. (2006).The Volatility Surface: A Practitioner's Guide.Wiley.

Notes

  1. This refers to the Stratonovich-type integral that is the limit of forward Riemann sums, [math]\int f_s\circ dB_s = \lim_N\sum_{n=1}^Nf_{t_n}(B_{t_n}-B_{t_{n-1}})[/math]. For the square-root process, the It\^o lemma on [math]\sqrt{X_t}[/math] is used to show that [math]\gamma\int \sqrt{X_s}dB_s = \gamma\lim_n\sum_n\sqrt{X_{t_{n-1}}}\Delta B_{t_{n-1}} = -\gamma\lim_n\sum_n\left(\sqrt{X_{t_n}}-\sqrt{X_{t_{n-1}}}\right)(B_{t_n}-B_{t_{n-1}}) +\gamma\lim_n\sum_n\sqrt{X_{t_n}}(B_{t_n}-B_{t_{n-1}})=-\frac{\gamma^2}{2}\int ds+\gamma\int \sqrt{X_s}\circ dB_s[/math].