The Radon-Nikodym approach for the conditional expectation
Before stating the Radon-Nikodym theorem, we recall some definitions from measure theory. Let [math](\Omega,\B)[/math] be a measurable space. A measure [math]\nu[/math] is [math]absolutely[/math] [math]continuous[/math] with respect to another measure [math]\mu[/math], written [math]\nu\ll\mu[/math] if there exists some measurable [math]f\geq 0[/math] with [math]d\nu=fd\mu[/math], that is if there is a finite measurable [math]f\geq 0[/math] with
Let [math]\mu[/math] and [math]\nu[/math] be two [math]\sigma[/math]-finite measures on a measurable space [math](\Omega,\B)[/math]. Then [math]\nu[/math] can be decomposed as
The theorem implies that there exists another, more practical way of checking whether a given [math]\sigma[/math]-finite measure [math]\nu[/math] is absolutely continuous with respect to another [math]\sigma[/math]-finite measure [math]\mu[/math]. If [math]\mu(N)=0[/math] implies that [math]\nu(N)=0[/math] for every measurable [math]N\subset \Omega[/math], then [math]\nu=\nu_{abs}[/math] is absolutely continuous. We also note that the density function [math]f[/math] with [math]fd\mu=d\nu[/math] is called the [math]Radon[/math]-[math]Nikodym[/math] [math]derivative[/math] and is often written [math]f=\frac{d\nu}{d\mu}[/math].
To prove this theorem, we need a theorem which gives us a nice relationship between a Hilbert space and its dual space. Actually we can identify a Hilbert space [math]\mathcal{H}[/math] with its dual space [math]\mathcal{H}^*[/math].
For a Hilbert space [math]\mathcal{H}[/math], the map sending [math]h\in \mathcal{H}[/math] to [math]\phi(h)\in\mathcal{H}^*[/math] defined by
[Proof of Theorem] Suppose that [math]\mu[/math] and [math]\nu[/math] are both finite measures (the general case can be reduced to this case by using the assumption that [math]\mu[/math] and [math]\nu[/math] are both [math]\sigma[/math]-finite). We define a new measure [math]m=\mu+\nu[/math] and will work with the real Hilbert space [math]\mathcal{H}=L^2(\Omega,m)[/math]. On this Hilbert space we define a linear functional [math]\phi[/math] by
We claim that [math]k[/math] takes values in [math][0,1][/math] almost surely with respect to [math]m[/math]. Indeed, for any [math]B\in\B[/math] we have
This holds by construction for all simple functions [math]g[/math], and hence for all nonnegative measurable functions by monotone convergence. Now define [math]\nu_{sing}[/math] to be [math]\nu\mid_{A}[/math], where
Let [math](\Omega,\F,\p)[/math] be a probability space. Let [math]\mathcal{G}\subset \F[/math] be a sub [math]\sigma[/math]-Algebra of [math]\F[/math] and let [math]X\in L^1(\Omega,\F,\p)[/math] be a r.v. Then there exists a unique r.v. in [math]L^1(\Omega,\mathcal{G},\p)[/math], denoted by [math]\E[X\mid\mathcal{G}][/math], such that for all [math]B\in\mathcal{G}[/math]
The uniqueness part was already done. To show existence, assume first that [math]X[/math] is positive. Define a new measure [math]\Q[/math] on [math](\Omega,\mathcal{G})[/math] by
General references
Moshayedi, Nima (2020). "Lectures on Probability Theory". arXiv:2010.16280 [math.PR].