<div class="d-none"><math>
\newcommand*{\rom}[1]{\expandafter\@slowromancap\romannumeral #1@}
\newcommand{\vertiii}[1]{{\left\vert\kern-0.25ex\left\vert\kern-0.25ex\left\vert #1
    \right\vert\kern-0.25ex\right\vert\kern-0.25ex\right\vert}}
\DeclareMathOperator*{\dprime}{\prime \prime}
\DeclareMathOperator{\Tr}{Tr}
\DeclareMathOperator{\E}{\mathbb{E}}
\DeclareMathOperator{\N}{\mathbb{N}}
\DeclareMathOperator{\R}{\mathbb{R}}
\DeclareMathOperator{\Sc}{\mathcal{S}}
\DeclareMathOperator{\Ac}{\mathcal{A}}
\DeclareMathOperator{\Pc}{\mathcal{P}}
\DeclareMathOperator*{\argmin}{arg\,min}
\DeclareMathOperator*{\argmax}{arg\,max}
\DeclareMathOperator{\sx}{\underline{\sigma}_{\pmb{X}}}
\DeclareMathOperator{\sqmin}{\underline{\sigma}_{\pmb{Q}}}
\DeclareMathOperator{\sqmax}{\overline{\sigma}_{\pmb{Q}}}
\DeclareMathOperator{\sqi}{\underline{\sigma}_{Q,\textit{i}}}
\DeclareMathOperator{\sqnoti}{\underline{\sigma}_{\pmb{Q},-\textit{i}}}
\DeclareMathOperator{\sqfir}{\underline{\sigma}_{\pmb{Q},1}}
\DeclareMathOperator{\sqsec}{\underline{\sigma}_{\pmb{Q},2}}
\DeclareMathOperator{\sru}{\underline{\sigma}_{\pmb{R}}^{u}}
\DeclareMathOperator{\srv}{\underline{\sigma}_{\pmb{R}}^v}
\DeclareMathOperator{\sri}{\underline{\sigma}_{R,\textit{i}}}
\DeclareMathOperator{\srnoti}{\underline{\sigma}_{\pmb{R},\textit{-i}}}
\DeclareMathOperator{\srfir}{\underline{\sigma}_{\pmb{R},1}}
\DeclareMathOperator{\srsec}{\underline{\sigma}_{\pmb{R},2}}
\DeclareMathOperator{\srmin}{\underline{\sigma}_{\pmb{R}}}
\DeclareMathOperator{\srmax}{\overline{\sigma}_{\pmb{R}}}
\DeclareMathOperator{\HH}{\mathcal{H}}
\DeclareMathOperator{\HE}{\mathcal{H}(1/\varepsilon)}
\DeclareMathOperator{\HD}{\mathcal{H}(1/\varepsilon)}
\DeclareMathOperator{\HCKI}{\mathcal{H}(C(\pmb{K}^0))}
\DeclareMathOperator{\HECK}{\mathcal{H}(1/\varepsilon,C(\pmb{K}))}
\DeclareMathOperator{\HECKI}{\mathcal{H}(1/\varepsilon,C(\pmb{K}^0))}
\DeclareMathOperator{\HC}{\mathcal{H}(1/\varepsilon,C(\pmb{K}))}
\DeclareMathOperator{\HCK}{\mathcal{H}(C(\pmb{K}))}
\DeclareMathOperator{\HCKR}{\mathcal{H}(1/\varepsilon,C(\pmb{K}))}
\DeclareMathOperator{\HCKIR}{\mathcal{H}(1/\varepsilon,C(\pmb{K}^0))}
\newcommand{\mathds}{\mathbb}</math></div>
The mathematical approach to many financial decision-making problems has traditionally been through modelling with stochastic processes and using techniques from stochastic control. The choice of models is often dictated by the need to balance tractability with applicability. Simple models lead to tractable and implementable strategies, either in closed form or computable through traditional numerical methods. However, these models sometimes oversimplify the mechanisms and behaviour of financial markets, which may result in strategies that are sub-optimal in practice and can potentially lead to financial losses. On the other hand, models that try to capture realistic features of financial markets are much more complex and are often mathematically and computationally intractable using the classical tools of stochastic optimal control. There are a number of surveys of reinforcement learning (RL) methods and of deep RL methods; see, for example, [1].

Our survey will begin by discussing Markov decision processes (MDPs), the framework for many reinforcement learning ideas in finance. We will then consider different approaches to learning within this framework, with the main focus being on value-based and policy-based methods. In order to implement these approaches, we will introduce deep reinforcement learning methods, which incorporate deep learning ideas in this context. For financial applications we will consider a range of topics, and for each we will introduce the basic underlying models before considering the RL approach to tackling them. We will discuss a range of papers in each application area and give an indication of their contributions. We conclude with some thoughts about the direction of development of reinforcement learning in finance.
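
To fix ideas before the detailed discussion, the following is the standard MDP formulation that such surveys build on (standard textbook notation, not tied to any particular paper cited here): an MDP consists of a state space <math>\Sc</math>, an action space <math>\Ac</math>, a transition kernel <math>\Pc</math> and a reward function <math>r</math>, and a policy <math>\pi</math> is assessed through its expected discounted reward,

<math display="block">
V^{\pi}(s) = \E\left[\sum_{t=0}^{\infty} \gamma^{t}\, r(s_t, a_t) \,\middle|\, s_0 = s\right], \qquad a_t \sim \pi(\cdot \mid s_t), \quad s_{t+1} \sim \Pc(\cdot \mid s_t, a_t),
</math>

with discount factor <math>\gamma \in (0,1)</math>. Value-based methods estimate the optimal value function <math>V^{*}(s) = \max_{\pi} V^{\pi}(s)</math> (or the corresponding Q-function) and recover a policy by acting greedily with respect to it, while policy-based methods parametrise <math>\pi</math> and optimise over it directly.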

General references

Hambly, Ben; Xu, Renyuan; Yang, Huining (2023). "Recent Advances in Reinforcement Learning in Finance". arXiv:2112.04553 [q-fin.MF].

References

  1. A. Mosavi, Y. Faghan, P. Ghamisi, P. Duan, S. F. Ardabili, E. Salwana, and S. S. Band, Comprehensive review of deep reinforcement learning methods and applications in economics, Mathematics, 8 (2020), p. 1640.