guide:99d739dbc8: Difference between revisions
No edit summary |
mNo edit summary |
||
Line 4: | Line 4: | ||
\newcommand{\HRule}{\rule{\linewidth}{0.5mm}} | \newcommand{\HRule}{\rule{\linewidth}{0.5mm}} | ||
\newcommand{\mathds}{\mathbb}</math></div> | \newcommand{\mathds}{\mathbb}</math></div> | ||
This chapter takes techniques from stochastic control and applies them to portfolio management. The portfolio can be of varying type, two possibilites are a portfolio for investment of (personal) wealth, or a hedging portfolio with a short position in a derivative contract. The basic problem involves an investor with a self-financing wealth process and a concave utility function to quantify their risk aversion, from which their goal is to maximize their expected utility of terminal wealth and/or consumption. To exemplify the need for hedging obtained from optimal control, recall the price of volatility risk <math>\Lambda(t,s,x)</math> from [[guide:0875016693#prop:stochasticVolPDE |Proposition]] of [[guide:0875016693#chapt:stochVol |Chapter]]. The pricing PDE for stochastic volatility depends on <math>\Lambda</math>, but incompleteness of the market means that <math>\Lambda</math> may not be uniquely specified. However, an expression can be obtained from the solution to an optimal control, hence writing <math>\Lambda</math> as a function of the investor's risk aversion. This chapter will start by considering the basic problem of optimization of (personal) wealth, and later on will show how optimal control is used in hedging derivatives. | This chapter takes techniques from stochastic control and applies them to portfolio management. The portfolio can be of varying type, two possibilites are a portfolio for investment of (personal) wealth, or a hedging portfolio with a short position in a derivative contract. The basic problem involves an investor with a self-financing wealth process and a concave utility function to quantify their risk aversion, from which their goal is to maximize their expected utility of terminal wealth and/or consumption. To exemplify the need for hedging obtained from optimal control, recall the price of volatility risk <math>\Lambda(t,s,x)</math> from [[guide:0875016693#prop:stochasticVolPDE |Proposition]] of [[guide:0875016693#chapt:stochVol |Chapter]]. The pricing PDE for stochastic volatility depends on <math>\Lambda</math>, but incompleteness of the market means that <math>\Lambda</math> may not be uniquely specified. However, an expression can be obtained from the solution to an optimal control, hence writing <math>\Lambda</math> as a function of the investor's risk aversion. This chapter will start by considering the basic problem of optimization of (personal) wealth, and later on will show how optimal control is used in hedging derivatives. | ||
Line 64: | Line 65: | ||
'''Example''' | '''Example''' | ||
<span id="ex:logOptimalConsumption"/> | |||
Suppose that <math>F(t,c_t,X_t) = e^{-\beta t}\log(c_tX_t)</math>, <math>U(x) = 0</math>, and <math>T=\infty</math>. The optimization problem is | Suppose that <math>F(t,c_t,X_t) = e^{-\beta t}\log(c_tX_t)</math>, <math>U(x) = 0</math>, and <math>T=\infty</math>. The optimization problem is | ||
Line 433: | Line 435: | ||
<math display="block">V^0(t,x,y) = \max_\pi\mathbb E\left[U(X_T)\Big|X_t=x,Y_t=y\right]\ ,</math> | <math display="block">V^0(t,x,y) = \max_\pi\mathbb E\left[U(X_T)\Big|X_t=x,Y_t=y\right]\ ,</math> | ||
we look for the amount of cash <math> | we look for the amount of cash <math>$p</math> such that the | ||
<math display="block">V^{h}(t,x+p,y,s) = V^{0}(t,x,y)\ .</math> | <math display="block">V^{h}(t,x+p,y,s) = V^{0}(t,x,y)\ .</math> | ||
Line 445: | Line 447: | ||
\end{align*} | \end{align*} | ||
</math> | </math> | ||
where we've used the differential < | where we've used the differential <math>d\left(e^{r(T-t)}X_t\right) = e^{r(T-t)}\pi_t\left(\frac{dS_t}{S_t}-rdt\right)</math>. Hence we find a price <math>$p</math> such that <math>U\left(pe^{r(T-t)}\right) = g(t,y)/g^h(t,y,s)</math>, where <math>g(t,y)</math> is the solution from \eqref{eq:HJBstochVol_g}. | ||
Depending on the risk-aversion coefficient < | Depending on the risk-aversion coefficient <math>\gamma</math>, there will be different prices <math>\$p</math>. This brings us back to the price of volatility risk <math>\Lambda(t,s,x)</math> from [[guide:0875016693#prop:stochasticVolPDE |Proposition]] of [[guide:0875016693#chapt:stochVol |Chapter]]. Namely, investors with different risk aversion will have a different <math>\Lambda</math> for their martingale evaluation of the call option. | ||
If an indifference price is obtained then there is a solution to both optimization problems, and hence there is no-arbitrage and the range of prices for < | If an indifference price is obtained then there is a solution to both optimization problems, and hence there is no-arbitrage and the range of prices for <math>$c</math> will be a no-arbitrage interval. For complete markets there will be a single price <math>$c</math> for all levels of risk aversion. | ||
==General references== | ==General references== | ||
{{cite arXiv|last1=Papanicolaou|first1=Andrew|year=2015|title=Introduction to Stochastic Differential Equations (SDEs) for Finance|eprint=1504.05309|class=q-fin.MF}} | {{cite arXiv|last1=Papanicolaou|first1=Andrew|year=2015|title=Introduction to Stochastic Differential Equations (SDEs) for Finance|eprint=1504.05309|class=q-fin.MF}} |
Revision as of 00:46, 4 June 2024
This chapter takes techniques from stochastic control and applies them to portfolio management. The portfolio can be of varying type, two possibilites are a portfolio for investment of (personal) wealth, or a hedging portfolio with a short position in a derivative contract. The basic problem involves an investor with a self-financing wealth process and a concave utility function to quantify their risk aversion, from which their goal is to maximize their expected utility of terminal wealth and/or consumption. To exemplify the need for hedging obtained from optimal control, recall the price of volatility risk [math]\Lambda(t,s,x)[/math] from Proposition of Chapter. The pricing PDE for stochastic volatility depends on [math]\Lambda[/math], but incompleteness of the market means that [math]\Lambda[/math] may not be uniquely specified. However, an expression can be obtained from the solution to an optimal control, hence writing [math]\Lambda[/math] as a function of the investor's risk aversion. This chapter will start by considering the basic problem of optimization of (personal) wealth, and later on will show how optimal control is used in hedging derivatives.
The Optimal Investment Problem
Consider a standard geometric Brownian motion for the price of a risky asset,
where [math]\mu\in\mathbb R[/math], [math]\sigma \gt 0[/math], and [math]W[/math] is a Brownian motion under the statistical measure. There is also the risk-free bank account that pays interest at a rate [math]r\geq 0[/math]. At time [math]t\geq 0[/math] the investors has a portfolio value [math]X_t[/math] with an allocation [math]\pi_t[/math] in the risky asset and a consumption stream [math]c_t[/math]. The dynamics of the portfolio are self-financing,
where
A natural constraint on consumption is [math]c_t\geq 0[/math] for all [math]t\geq 0[/math], and based on \eqref{eq:dX_wealth} a constraint of [math]X_t\geq 0[/math] is enforced automatically. Here we have taken [math]X_t\geq 0[/math] almost surely, but in general any finite lower bound on [math]X_t[/math] is necessary to ensure no-arbitrage (i.e. [math]X_t\geq -M \gt -\infty [/math] almost with constant [math]M[/math] finite), otherwise there could be doubling strateguies. The optimization problem is then formulated as
where [math]F[/math] is a concave utility on consumption and wealth, [math]U[/math] is a concave utility on terminal wealth, and the admissible pairs [math](\pi_t,c_t)_{t\geq0}[/math] are non-anticipating, adapted to [math]W[/math], with [math]\int_0^T|\pi_tX_t|^2dt \lt \infty[/math] almost surely. We refer to [math]\mathbb E\left[\int_t^TF(u,c_u,X_u)du+U(X_T)\Big|X_t=x\right][/math] as the objective function, and refer to $V$ as the optimal value function.
Example \label{ex:optimalLog} Suppose that [math]F=0[/math] and [math]U(x) = \log(x)[/math]. There is no utility of consumption, so the optimal is [math]c_t=0[/math] for all [math]t\geq 0[/math]. Now apply It\^o's lemma,
and then taking expectations,
where the right-hand side is concave in [math]\pi_t[/math]. Hence, the optimal strategy is
which is the Sharpe ratio divided by the volatility. The optimal value function is
and using [math]U^{-1}(v) = e^v[/math], we find the certainty equivalent,
which is the risk-free rate plus [math]\tfrac12[/math] times the Sharpe-ratio squared.
Example
Suppose that [math]F(t,c_t,X_t) = e^{-\beta t}\log(c_tX_t)[/math], [math]U(x) = 0[/math], and [math]T=\infty[/math]. The optimization problem is
which is constant in [math]t[/math]. Now notice for any admissible [math](\pi,c)[/math] on [math][t,t+\Delta t][/math] we have the dynamic programming principle,
with equality if and only if [math](\pi,c)[/math] is chosen optimally over [math][t,t+\Delta t][/math], and hence
as [math]\Delta t\rightarrow 0[/math]. On the other hand, from It\^o's lemma we have
and assuming the Brownian term vanishes under expectations, we have
as [math]\Delta t\rightarrow 0[/math]. Hence, for all admissible pairs [math](\pi,c)[/math] the value function [math]V(x)[/math] satisifies
with equality if and only [math]\pi[/math] and [math]c[/math] are optimal, which leads to the equation
Let's assume the ansatz
Then through first-order optimality conditions (i.e. by differentiating with respect to [math]\pi[/math] and setting equal to zero) we find the optimal
Similarly, first-order optimality conditions for [math]c[/math] yield
Putting optimal [math]\pi_t[/math] and [math]c_t[/math] back into the equation for [math]V[/math] along with the ansatz, we find
and comparing [math]\log(x)[/math] terms and non-[math]x[/math]-dependent terms we find,
The Hamilton-Jacobi-Bellman (HJB) Equation
Example is useful to get started and to get a sense for how an optimal control should look. Example is more instructive because it shows us how (i) the function [math]V[/math] inherits concavity from [math]F[/math] and [math]U[/math], and (ii) how it also shows how to derive the PDE that [math]V[/math] should satisfy. The derivation starts with the dynamic programming principle,
where [math]\max_{\pi,c}[/math] is taken over the interval [math][t,t+\Delta t][/math]. Applying It\^o's lemma to [math]V(t,X_t)[/math], we find
for any admissible [math](\pi,c)[/math] over [math][t,t+\Delta t][/math]. Hence, for any [math](\pi,c)[/math] on [math][t,t+\Delta t][/math] we have
with equality iff and only if an optimal [math](\pi,c)[/math] is chosen. Hence, dividing by [math]\Delta t[/math] and taking the limt to zero, we obtain the so-called Hamilton-Jacobi-Bellman (HJB) equation:
Merton's Optimal Investment Problem
Let [math]F=0[/math] and consider a power utility function,
where [math]\gamma \gt 0[/math], [math]\gamma\neq1[/math] is the risk aversion. The problem is to solve
The HJB equation for this problem is
for which we find the optimal [math]\pi[/math],
Inserting the optimal [math]\pi_t[/math] into \eqref{eq:HJBmerton} we obtain the nonlinear equation,
Then using the ansatz [math]V(t,x) = g(t)U(x)[/math], we find
and inserting in \eqref{eq:HJBmerton_nonlinear} we find an ODE for [math]g[/math],
with terminal condition [math]g(T)=1[/math]. The solution is
and the optimal value function is
and the certainty equivalent is
Stochastic Returns
Consider the model
with [math]dW_tdB_t=\rho dt[/math] where [math]\rho\in(-1,1)[/math]. The interpretation of [math]Y_t[/math] could be any of the following: [math]Y_t[/math] is a dividend yield with uncertainty (although somewhat of strange model because it can be negative), or [math]Y_t[/math] is the return rate on a commodities or bond portfolio where there is a role yield due to contango or backwardation.
Let's assume the simple case [math]\mu=r=0[/math], for which the value function is
and has HJB equation
The first-order condition for [math]\pi[/math] yields the optimal
For the power utility
we have the ansatz [math]V(t,x,y) = U(x)g(t,y)[/math] with
all of which are inserted into equation \eqref{eq:HJBstochReturns} to get an equation for [math]g[/math]:
We now apply another ansatz [math]g(t,y) = e^{a(t)y^2+b(t)}[/math], which when inserted into the equation for [math]g(t,y)[/math] yields the following system:
with terminal conditions [math]a(T)=b(T)=0[/math]. The solution [math]a(t)[/math] can be written as a ratio,
where [math]v(t)[/math] is the solution to a 2nd-order ODE,
The roots of this equation are
which gives the general solution
It is not necessary to fully determine constants [math]C_1[/math] and [math]C_2[/math] because we are mainly interested in the ratio [math]v'(t)/v(t)[/math].\\ Finite-Time Blowup. Complex valued [math]m_\pm[/math] leads to finite-time blowup for the optimization problem. If the roots are complex then let [math]c = -\left(\frac{\rho\beta(1-\gamma)}{\sigma\gamma}-\kappa\right)[/math] and [math]d = \frac{\beta(1-\gamma)}{\sigma\gamma}\left(2\kappa\rho+\frac{\beta}{\sigma}\right)-\kappa^2[/math] so that the general solution is
and with [math]v'(T)=-cC_1-dC_2=0[/math] to satisfy the terminal condition [math]a(T)=0[/math], so that
The solution [math]a(t)[/math] will blow at time [math]t^*[/math] such that [math]\cos(d(T-t^*))-\frac{c}{d}\sin(d(T-t^*))=0[/math], that is [math]\tan(d(T-t^*))=\frac{d}{c}[/math] or
Stochastic Volatility
Now let's consider the same optimal terminal wealth problem as the Merton problem, with exponential utility
and in the incomplete market of stochastic volatility,
where [math]dW_t\cdot dB_t = \rho dt[/math]. From \eqref{eq:SVM_stochControl} and \eqref{eq:dY_stochControl}, we have the wealth process,
no longer enforcing the non-negativity constraint. The optimization problem is
but the technique used in Example does not apply because there is some local martingale behavior in the stochastic integrals. Instead, we arrive at the optimal solution using the HJB equation. The HJB equation is
Using the ansatz [math]V(t,x,y) = U(xe^{r(T-t)})g(t,y)[/math], we have
which we insert into \eqref{eq:HJBstochVol} to find a PDE for [math]g[/math],
and the optimal strategy is
Inserting this optimal [math]\pi_t[/math] into \eqref{eq:HJBstochVol_g} yields the nonlinear equation
This equation can be reduced to a linear PDE if we look for a function [math]\psi(t,y)[/math] such that
where [math]q[/math] is a parameter. Differentiating yields,
and then plugging into \eqref{eq:stochVol_Merton_g} with chosen parameter [math]q=1/(1+\rho^2)[/math] yields a linear equation:
Example Consider a futures contract [math]F_{t,T}[/math] with settlement date [math]T[/math] and stochastic volatility and returns,
where [math]\beta^2\leq 2\kappa\bar Y[/math] and [math]dW_tdB_t=\rho dt[/math]. The wealth process for futures trading is
For [math]U(x) = -\frac1\gamma e^{-\gamma x}[/math] the optimal terminal expected utility is
where [math]\psi(t,y)[/math] is similar to a solution to equation \eqref{eq:linearPDEstochVolControl}, except the equation has no [math]r[/math],
It can be further shown that the solution to this equation is of the form
with [math]a(T)=b(T)=0[/math], and where [math]a(t)[/math] and [math]b(t)[/math] satisfy ODEs,
both of which can be solved explicitly.
Indifference Pricing
Stochastic control for terminal wealth can be implemented to find the the price of a call option under stochastic volatility,
where the investor now hedges a short position in a call option with strike [math]K[/math]. Compared to the same investor's value function that is not short the call
we look for the amount of cash [math]$p[/math] such that the
The extra cash makes the hedger utility indifferent to the short position. With exponential utility there is a separation of variables,
where we've used the differential [math]d\left(e^{r(T-t)}X_t\right) = e^{r(T-t)}\pi_t\left(\frac{dS_t}{S_t}-rdt\right)[/math]. Hence we find a price [math]$p[/math] such that [math]U\left(pe^{r(T-t)}\right) = g(t,y)/g^h(t,y,s)[/math], where [math]g(t,y)[/math] is the solution from \eqref{eq:HJBstochVol_g}. Depending on the risk-aversion coefficient [math]\gamma[/math], there will be different prices [math]\$p[/math]. This brings us back to the price of volatility risk [math]\Lambda(t,s,x)[/math] from Proposition of Chapter. Namely, investors with different risk aversion will have a different [math]\Lambda[/math] for their martingale evaluation of the call option. If an indifference price is obtained then there is a solution to both optimization problems, and hence there is no-arbitrage and the range of prices for [math]$c[/math] will be a no-arbitrage interval. For complete markets there will be a single price [math]$c[/math] for all levels of risk aversion.
General references
Papanicolaou, Andrew (2015). "Introduction to Stochastic Differential Equations (SDEs) for Finance". arXiv:1504.05309 [q-fin.MF].