exercise:F4045af163

[math] \newcommand{\DS}{\displaystyle} \newcommand{\intbeta}{\lfloor \beta \rfloor} \newcommand{\cA}{\mathcal{A}} \newcommand{\cB}{\mathcal{B}} \newcommand{\cC}{\mathcal{C}} \newcommand{\cD}{\mathcal{D}} \newcommand{\cE}{\mathcal{E}} \newcommand{\cF}{\mathcal{F}} \newcommand{\cG}{\mathcal{G}} \newcommand{\cH}{\mathcal{H}} \newcommand{\cI}{\mathcal{I}} \newcommand{\cJ}{\mathcal{J}} \newcommand{\cK}{\mathcal{K}} \newcommand{\cL}{\mathcal{L}} \newcommand{\cM}{\mathcal{M}} \newcommand{\cN}{\mathcal{N}} \newcommand{\cO}{\mathcal{O}} \newcommand{\cP}{\mathcal{P}} \newcommand{\cQ}{\mathcal{Q}} \newcommand{\cS}{\mathcal{S}} \newcommand{\cT}{\mathcal{T}} \newcommand{\cU}{\mathcal{U}} \newcommand{\cV}{\mathcal{V}} \newcommand{\cX}{\mathcal{X}} \newcommand{\cY}{\mathcal{Y}} \newcommand{\cZ}{\mathcal{Z}} \newcommand{\sg}{\mathsf{subG}} \newcommand{\subE}{\mathsf{subE}} \newcommand{\bA}{\mathbf{A}} \newcommand{\bB}{\mathbf{B}} \newcommand{\bC}{\mathbf{C}} \newcommand{\bD}{\mathbf{D}} \newcommand{\bE}{\mathbf{E}} \newcommand{\bF}{\mathbf{F}} \newcommand{\bG}{\mathbf{G}} \newcommand{\bH}{\mathbf{H}} \newcommand{\bI}{\mathbf{I}} \newcommand{\bJ}{\mathbf{J}} \newcommand{\bK}{\mathbf{K}} \newcommand{\bM}{\mathbf{M}} \newcommand{\bN}{\mathbf{N}} \newcommand{\bO}{\mathbf{O}} \newcommand{\bP}{\mathbf{P}} \newcommand{\bp}{\mathbf{p}} \newcommand{\bQ}{\mathbf{Q}} \newcommand{\bS}{\mathbf{S}} \newcommand{\bT}{\mathbf{T}} \newcommand{\bU}{\mathbf{U}} \newcommand{\bV}{\mathbf{V}} \newcommand{\bX}{\mathbf{X}} \newcommand{\bY}{\mathbf{Y}} \newcommand{\bZ}{\mathbf{Z}} \newcommand{\bflambda}{\boldsymbol{\lambda}} \newcommand{\bftheta}{\boldsymbol{\theta}} \newcommand{\bfg}{\boldsymbol{g}} \newcommand{\bfy}{\boldsymbol{y}} \def\thetaphat{\hat{\bftheta}_\bp} \def\bflam{\boldsymbol{\lambda}} \def\Lam{\Lambda} \def\lam{\lambda} \def\bfpi{\boldsymbol{\pi}} \def\bfz{\boldsymbol{z}} \def\bfw{\boldsymbol{w}} \def\bfeta{\boldsymbol{\eta}} \newcommand{\R}{\mathrm{ I}\kern-0.18em\mathrm{ R}} \newcommand{\h}{\mathrm{ I}\kern-0.18em\mathrm{ H}} \newcommand{\K}{\mathrm{ I}\kern-0.18em\mathrm{ K}} \newcommand{\p}{\mathrm{ I}\kern-0.18em\mathrm{ P}} \newcommand{\E}{\mathrm{ I}\kern-0.18em\mathrm{ E}} %\newcommand{\Z}{\mathrm{ Z}\kern-0.26em\mathrm{ Z}} \newcommand{\1}{\mathrm{ 1}\kern-0.24em\mathrm{ I}} \newcommand{\N}{\mathrm{ I}\kern-0.18em\mathrm{ N}} \newcommand{\field}[1]{\mathbb{#1}} \newcommand{\q}{\field{Q}} \newcommand{\Z}{\field{Z}} \newcommand{\X}{\field{X}} \newcommand{\Y}{\field{Y}} \newcommand{\bbS}{\field{S}} \newcommand{\n}{\mathcal{N}} \newcommand{\x}{\mathcal{X}} \newcommand{\pp}{{\sf p}} \newcommand{\hh}{{\sf h}} \newcommand{\ff}{{\sf f}} \newcommand{\Bern}{\mathsf{Ber}} \newcommand{\Bin}{\mathsf{Bin}} \newcommand{\Lap}{\mathsf{Lap}} \newcommand{\tr}{\mathsf{Tr}} \newcommand{\phin}{\varphi_n} \newcommand{\phinb}{\overline \varphi_n(t)} \newcommand{\pn}{\p_{\kern-0.25em n}} \newcommand{\pnm}{\p_{\kern-0.25em n,m}} \newcommand{\psubm}{\p_{\kern-0.25em m}} \newcommand{\psubp}{\p_{\kern-0.25em p}} \newcommand{\cfi}{\cF_{\kern-0.25em \infty}} \newcommand{\e}{\mathrm{e}} \newcommand{\ic}{\mathrm{i}} \newcommand{\Leb}{\mathrm{Leb}_d} \newcommand{\Var}{\mathrm{Var}} \newcommand{\ddelta}{d_{\symdiffsmall}} \newcommand{\dsubh}{d_H} \newcommand{\indep}{\perp\kern-0.95em{\perp}} \newcommand{\supp}{\mathop{\mathrm{supp}}} \newcommand{\rank}{\mathop{\mathrm{rank}}} \newcommand{\vol}{\mathop{\mathrm{vol}}} \newcommand{\conv}{\mathop{\mathrm{conv}}} \newcommand{\card}{\mathop{\mathrm{card}}} 
\newcommand{\argmin}{\mathop{\mathrm{argmin}}} \newcommand{\argmax}{\mathop{\mathrm{argmax}}} \newcommand{\ud}{\mathrm{d}} \newcommand{\var}{\mathrm{var}} \newcommand{\re}{\mathrm{Re}} \newcommand{\MSE}{\mathsf{MSE}} \newcommand{\im}{\mathrm{Im}} \newcommand{\epr}{\hfill\hbox{\hskip 4pt\vrule width 5pt height 6pt depth 1.5pt}\vspace{0.5cm}\par} \newcommand{\bi}[1]{^{(#1)}} \newcommand{\eps}{\varepsilon} \newcommand{\Deq}{\stackrel{\mathcal{D}}{=}} \newcommand{\ubar}{\underbar} \newcommand{\Kbeta}{K_{\hspace{-0.3mm} \beta}} \newcommand{\crzero}[1]{\overset{r_0}{\underset{#1}{\longleftrightarrow}}} \newcommand{\hint}[1]{\texttt{[Hint:#1]}} \newcommand{\vp}{\vspace{.25cm}} \newcommand{\vm}{\vspace{.5cm}} \newcommand{\vg}{\vspace{1cm}} \newcommand{\vgneg}{\vspace{-1cm}} \newcommand{\vneg}{\vspace{-.5cm}} \newcommand{\vpneg}{\vspace{-.25cm}} \newcommand{\tp}{\ptsize{10}} \newcommand{\douzp}{\ptsize{12}} \newcommand{\np}{\ptsize{9}} \newcommand{\hp}{\ptsize{8}} \newcommand{\red}{\color{red}} \newcommand{\ndpr}[1]{{\textsf{\red[#1]}}} \newcommand\iid{i.i.d\@ifnextchar.{}{.\@\xspace} } \newcommand\MoveEqLeft[1][2]{\kern #1em & \kern -#1em} \newcommand{\leadeq}[2][4]{\MoveEqLeft[#1] #2 \nonumber} \newcommand{\leadeqnum}[2][4]{\MoveEqLeft[#1] #2} \newcommand\independent{\protect\mathpalette{\protect\independenT}{\perp}} \def\independenT#1#2{\mathrel{\rlap{$#1#2$}\mkern2mu{#1#2}}} \newcommand{\MIT}[1]{{\color{MITred} #1}} \newcommand{\dHyp}{\{-1,1\}^d} \newcommand{\thetahard}{\hat \theta^{hrd}} \newcommand{\thetasoft}{\hat \theta^{sft}} \newcommand{\thetabic}{\hat \theta^{bic}} \newcommand{\thetalasso}{\hat \theta^{\cL}} \newcommand{\thetaslope}{\hat \theta^{\cS}} \newcommand{\thetahard}{\hat \theta^{hrd}} \newcommand{\thetasoft}{\hat \theta^{sft}} \newcommand{\thetabic}{\hat \theta^{bic}} \newcommand{\thetalasso}{\hat \theta^{\cL}} \newcommand{\thetaslope}{\hat \theta^{\cS}} \newcommand{\thetals}{\hat \theta^{ls}} \newcommand{\thetalsm}{\tilde \theta^{ls_X}} \newcommand{\thetaridge}{\hat\theta^{\mathrm{ridge}}_\tau} \newcommand{\thetalsK}{\hat \theta^{ls}_K} \newcommand{\muls}{\hat \mu^{ls}} [/math]

The goal of this problem is to prove the following theorem:

Theorem (Johnson-Lindenstrauss Lemma)

Given [math] n [/math] points denoted by [math] X = \{x_1, \dots, x_n\} [/math] in [math] \R^d [/math], let [math] Q \in \R^{k \times d} [/math] be a random projection operator and set [math] P := \sqrt{\frac{d}{k}} Q [/math]. There is a constant [math] C \gt 0 [/math] such that if

[[math]] \begin{equation*} k \geq \frac{C}{\varepsilon^2} \log n, \end{equation*} [[/math]]
then [math] P [/math] is an [math] \varepsilon [/math]-isometry for [math] X [/math], i.e.,

[[math]] \begin{equation*} (1 - \varepsilon) \| x_i - x_j \|_2^2 \leq \| P x_i - P x_j \|_2^2 \leq (1 + \varepsilon) \|x_i - x_j\|_2^2, \quad \text{for all } i, j \end{equation*} [[/math]]
with probability at least [math] 1 - 2 \exp(-c \varepsilon^2 k) [/math].
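
As a quick numerical illustration of the theorem (not part of the exercise), the following sketch assumes NumPy and uses, as a stand-in for the random projection operator [math] Q [/math], the projection onto the column span of an orthonormalized Gaussian matrix; all sizes and variable names are illustrative.

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(0)
n, d, k, eps = 50, 1000, 400, 0.3   # illustrative sizes, chosen arbitrarily

# n points in R^d
X = rng.standard_normal((n, d))

# Projection onto a (roughly) uniformly random k-dimensional subspace:
# orthonormalize a Gaussian d x k matrix and use its transpose as Q.
G = rng.standard_normal((d, k))
U, _ = np.linalg.qr(G)        # d x k with orthonormal columns
Q = U.T                       # k x d projection (in coordinates)
P = np.sqrt(d / k) * Q        # rescaled projection from the theorem

# Squared pairwise distances before and after projecting.
diff = X[:, None, :] - X[None, :, :]        # shape (n, n, d)
orig = np.sum(diff ** 2, axis=-1)           # ||x_i - x_j||^2
proj = np.sum((diff @ P.T) ** 2, axis=-1)   # ||P x_i - P x_j||^2

mask = ~np.eye(n, dtype=bool)
ratio = proj[mask] / orig[mask]
print("min/max distortion:", ratio.min(), ratio.max())
# For these sizes the ratios typically fall inside [1 - eps, 1 + eps].
</syntaxhighlight>

Note that the required [math] k [/math] in the theorem depends only on [math] n [/math] and [math] \varepsilon [/math], not on the ambient dimension [math] d [/math].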

  1. Convince yourself that if [math] d \gt n [/math], there is a projection [math] P \in \R^{n \times d} [/math] onto an [math] n [/math]-dimensional subspace such that [math] \| P x_i - P x_j \|_2 = \| x_i - x_j \|_2 [/math], i.e., pairwise distances are exactly preserved.

Let [math] k \leq d [/math] be two integers, let [math] Y = (y_1, \dots, y_d) \sim \cN(0, I_{d \times d} ) [/math] be a vector of independent and identically distributed standard Gaussians, and let [math] Q \in \R^{k \times d} [/math] be the projection onto the first [math] k [/math] coordinates, i.e., [math] Qy = (y_1, \dots, y_k) [/math]. Define [math] Z = \frac{1}{\|Y\|}QY = \frac{1}{\|Y\|} (y_1, \dots, y_k) [/math] and [math] L = \|Z\|^2 [/math].

  2. Show that [math] \E[L] = \frac{k}{d} [/math].
  3. Show that for all [math] \beta \gt 0 [/math] and all [math] t \gt 0 [/math] such that [math] 1 - 2t(k \beta - d) \gt 0 [/math] and [math] 1 - 2t \beta k \gt 0 [/math],
    [[math]] \begin{equation*} \label{eq:a} \p \left( \sum_{i = 1}^{k} y_i^2 \leq \beta \frac{k}{d} \sum_{i = 1}^{d} y_i^2 \right) \leq (1 - 2t(k \beta - d))^{-k/2} (1 - 2t \beta k)^{-(d-k)/2} \end{equation*} [[/math]]
    (Hint: Show that [math] \E[\e^{\lambda X^2}] = \frac{1}{\sqrt{1 - 2 \lambda}} [/math] for [math] \lambda \lt \frac{1}{2} [/math] if [math] X \sim \cN(0,1) [/math]; a short derivation of this identity is sketched after this list.)
  4. Conclude that for [math] \beta \lt 1 [/math] (a suitable choice of [math] t [/math] is sketched after this list),
    [[math]] \begin{equation*} \label{eq:b} \p\left(L \leq \beta \frac{k}{d}\right) \leq \exp \left( \frac{k}{2} (1 - \beta + \log \beta) \right). \end{equation*} [[/math]]
  5. Similarly, show that for [math] \beta \gt 1 [/math],
    [[math]] \begin{equation*} \label{eq:c} \p\left(L \geq \beta \frac{k}{d}\right) \leq \exp \left( \frac{k}{2} (1 - \beta + \log \beta) \right). \end{equation*} [[/math]]
  6. Show that for a random projection operator [math] Q \in \R^{k \times d} [/math] and a fixed vector [math] x \in \R^d [/math] (see also the remark after this list),
    a. [math] \E[\|Qx\|^2] = \frac{k}{d} \|x\|^2 [/math].
    b. For [math] \varepsilon \in (0, 1) [/math], there is a constant [math] c \gt 0 [/math] such that with probability at least [math] 1 - 2 \exp(-c k \varepsilon^2) [/math],
      [[math]] \begin{equation*} \label{eq:d} (1 - \varepsilon) \frac{k}{d} \| x \|^2 \leq \| Q x \|_2^2 \leq (1 + \varepsilon) \frac{k}{d} \|x\|_2^2. \end{equation*} [[/math]]
      (Hint: Think about how to apply the previous results in this case and use the inequalities [math] \log (1-\varepsilon) \leq -\varepsilon - \varepsilon^2/2 [/math] and [math] \log (1+\varepsilon) \leq \varepsilon - \varepsilon^2/2 + \varepsilon^3/3 [/math].)
    c. Prove the theorem above.
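
For reference, the identity in the hint of step 3 can be checked by completing the square in the Gaussian integral (this sketch derives only the hint, not the bound itself): for [math] X \sim \cN(0,1) [/math] and [math] \lambda \lt \frac{1}{2} [/math],

[[math]] \begin{equation*} \E[\e^{\lambda X^2}] = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \e^{\lambda x^2} \e^{-x^2/2} \, \ud x = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \e^{- \frac{(1 - 2\lambda) x^2}{2}} \, \ud x = \frac{1}{\sqrt{1 - 2 \lambda}}, \end{equation*} [[/math]]

since the last integral equals [math] \sqrt{2\pi / (1 - 2\lambda)} [/math], the normalizing constant of [math] \cN(0, (1 - 2\lambda)^{-1}) [/math].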
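
For step 4, one admissible choice of [math] t [/math] (a sketch, assuming [math] k \lt d [/math]; the case [math] k = d [/math] is immediate since then [math] L = 1 [/math]) is [math] t = \frac{1 - \beta}{2 \beta (d - k \beta)} \gt 0 [/math], which satisfies both positivity conditions of step 3. Substituting it into the bound of step 3 gives

[[math]] \begin{equation*} \p\left(L \leq \beta \frac{k}{d}\right) \leq \beta^{k/2} \left( \frac{d - k \beta}{d - k} \right)^{(d-k)/2} = \beta^{k/2} \left( 1 + \frac{k (1 - \beta)}{d - k} \right)^{(d-k)/2} \leq \exp \left( \frac{k}{2} \left( 1 - \beta + \log \beta \right) \right), \end{equation*} [[/math]]

where the last step uses [math] 1 + u \leq \e^{u} [/math].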
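
A remark that may help with step 6 (a sketch, under the reading that the random projection operator [math] Q [/math] projects onto a uniformly random [math] k [/math]-dimensional subspace of [math] \R^d [/math]): by rotational invariance, projecting a fixed vector onto a random subspace has the same distribution as projecting a uniformly random direction onto a fixed subspace, so for any fixed [math] x \neq 0 [/math],

[[math]] \begin{equation*} \frac{\|Qx\|^2}{\|x\|^2} \Deq \left\| Q_0 \frac{Y}{\|Y\|} \right\|^2 = L, \qquad Y \sim \cN(0, I_{d \times d}), \end{equation*} [[/math]]

where [math] Q_0 [/math] denotes the deterministic projection onto the first [math] k [/math] coordinates; steps 2, 4 and 5 can then be applied to [math] L [/math].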