\documentclass{article} \usepackage{amsmath,amssymb} \setlength{\oddsidemargin}{0.25 in} \setlength{\evensidemargin}{-0.25 in} \setlength{\topmargin}{-0.6 in} \setlength{\textwidth}{6.5 in} \setlength{\textheight}{8.5 in} \setlength{\headsep}{0.75 in} \setlength{\parindent}{0 in} \setlength{\parskip}{0.1 in} \newcounter{lecnum} % % The following macro is used to generate the header. % \newcommand{\lecture}[4]{ \pagestyle{myheadings} \thispagestyle{plain} \newpage \setcounter{lecnum}{#1} \setcounter{page}{1} \noindent \begin{center} \framebox{ \vbox{\vspace{2mm} \hbox to 6.28in { {{\bf 18.317~Combinatorics, Probability, and Computations on Groups} \hfill #2} } \vspace{4mm} \hbox to 6.28in { {\Large \hfill Lecture #1 \hfill} } \vspace{2mm} \hbox to 6.28in { {\it Lecturer: #3 \hfill Scribe: #4} } \vspace{2mm}}} \end{center} \markboth{Lecture #1: #2}{Lecture #1: #2} \vspace*{4mm} } \input{epsf} %usage: \fig{LABEL}{FIGURE-HEIGHT}{CAPTION}{FILENAME} \newcommand{\fig}[4]{ \begin{figure} \setlength{\epsfysize}{#2} \centerline{\epsfbox{#4}} \caption{#3} \label{#1} \end{figure} } \newtheorem{theorem}{Theorem} \newtheorem{lemma}[theorem]{Lemma} \newtheorem{proposition}[theorem]{Proposition} \newtheorem{claim}[theorem]{Claim} \newtheorem{corollary}[theorem]{Corollary} \newtheorem{definition}[theorem]{Definition} \newenvironment{proof}{{\bf Proof:}}{\hfill\rule{2mm}{2mm}} \newenvironment{mproof}[1]{{\bf #1}}{\hfill\rule{2mm}{2mm}} \def\E#1{{\rm E}[#1]} \def\i{\hspace*{5mm}} \newcounter{lineno} \setcounter{lineno}{0} \def\n{\addtocounter{lineno}{1} \hbox to 7mm{\hfill \arabic{lineno}:}\hspace{2mm}} \def\resetn{\setcounter{lineno}{0}} \def\tqbfk{\tqbf_k^\exists} \def\ep{\epsilon} \def\erd{Erd\H{o}s} \begin{document} \lecture{9}{28 September 2001}{Igor Pak}{Christopher Malon} \newcommand{\erdren}{Erd\H{o}s--R\'{e}nyi } \section*{Generalized Random Subproducts} Today, we start to upgrade the \erdren machine to show that ``most'' generating sets of size $O(\log \vert G \vert)$ have a mixing time of $O(\log \vert G \vert)$. We'll be more precise about this in due course. First, we want to show that \erdren is robust when we insert junk in the middle of our strings. Fix $\overline{g} = (g_1, g_2, \ldots, g_k) \in G^k$. On Monday, we defined a probability distribution over $G$ $$Q_{\overline{g}} (h) = {\mathrm Pr}_{\overline{\epsilon}} \left[ g_1^{{\epsilon}_1} \cdots g_k^{{\epsilon}_k} = h \right]$$ where $\overline{\epsilon} = ({\epsilon}_1, \ldots, {\epsilon}_k)$ is picked randomly, uniformly from ${\{0, 1\}}^k$. Now fix group elements $x_1, \ldots, x_l$ and integers ${\gamma}_1, \ldots, {\gamma}_l$. We insert the $x_i$ at intervals ${\gamma}_i$: Let $$R_{\overline{g}} (h) = {\mathrm Pr}_{\overline{\epsilon}} \left[ g_1^{\epsilon_1} \cdots g_{{\gamma}_1}^{\epsilon_{{\gamma}_1}} x_1 g_{{\gamma}_1 + 1}^{\epsilon_{{\gamma}_1 + 1}} \cdots g_{{\gamma}_1 + {\gamma}_2}^{\epsilon_{{\gamma_1 + \gamma_2}}} x_2 \cdots x_l \cdots g_k^{\epsilon_k} = h \right]$$ Let's look at this in the case where $l = 1$. In this case, we are considering products of the form $$g_1^{\epsilon_1} \cdots g_i^{\epsilon_i} x g_{i+1}^{\epsilon_{i+1}} \cdots g_k^{\epsilon_k} = g_1^{\epsilon_1} \cdots g_i^{\epsilon_i} {\left( g_{i+1}^x \right)}^{\epsilon_{i+1}} \cdots {\left( g_k^x \right)}^{\epsilon_k} x$$ where $i = \gamma_1$, $x = x_1$, and $g^x$ denotes $x g x^{-1}$. Declare $\overline{z} (x, \gamma) = (1, \ldots, 1, x, \ldots, x)$, where 1's appear in the first $i$ positions, and write $$(g_1, \ldots, g_k)^{(z_1, \ldots, z_k)} = (g_1^{z_1}, \ldots, g_k^{z_k})$$ Evidently, $$R_{\overline{g}} (h) = Q_{\overline{g}^{\overline{z}(x, \gamma)}} (h x^{-1})$$ If $l > 1$, we just have to repeat these maneuvers to define a string $\overline{z} (\overline{x}, \overline{\gamma})$ and a function $f(\overline{x})$ so that $$g_1^{\epsilon_1} \cdots g_{{\gamma}_1}^{\epsilon_{{\gamma}_1}} x_1 g_{{\gamma}_1 + 1}^{\epsilon_{{\gamma}_1 + 1}} \cdots g_{{\gamma}_1 + {\gamma}_2}^{\epsilon_{{\gamma_1 + \gamma_2}}} x_2 \cdots x_l \cdots g_k^{\epsilon_k} = {(\overline{g}^{\overline{z} (\overline{x}, \overline{\gamma})})}^{\overline{\epsilon}} \cdot {(f(\overline{x}))}^{-1}$$ Then we have \begin{equation} R_{\overline{g}} (h) = Q_{\overline{g}^{\overline{z}(\overline{x}, \overline{\gamma})}} (h \cdot f(\overline{x})) \label{eq:shift} \end{equation} We want to show $R_{\overline{g}}$ is usually close to uniform. \begin{definition} A probability distribution $Q$ on a finite group $G$ is $\epsilon$--{\em uniform} if $$Q(h) > \frac{1 - \epsilon}{\vert G \vert}$$ for all $h \in G$. \end{definition} Two lectures ago, we proved: \begin{theorem} (\erdren) For all $\epsilon, \delta > 0$, $Q_{\overline{g}}$ is $\epsilon$--uniform for more than $1 - \delta$ proportion of $\overline{g} \in G^k$, given $k > 2 \log_2 \vert G \vert + 2 \log_2 \frac{1}{\epsilon} + \log_2 \frac{1}{\delta}$. \end{theorem} Multiplication by $f(\overline{x})$ is a bijection on the elements of $G$, so $\epsilon$--uniformity of $Q_{\overline{g}} (h)$ over $h$ implies $\epsilon$--uniformity of $Q_{\overline{g}} (h \cdot f(\overline{x}))$ over $h$. Because conjugation by $\overline{z} (\overline{x}, \overline{\gamma})$ is a bijection on $G^k$, the $R_{\overline{g}}$ will be $\epsilon$--uniform for the same proportion of $\overline{g}$ as $Q_{\overline{g}}$, applying equation (\ref{eq:shift}). Thus, we have \erdren for $R_{\overline{g}}$: \begin{theorem} For all $\epsilon, \delta > 0$, $R_{\overline{g}}$ is $\epsilon$--uniform for more than $1 - \delta$ proportion of $\overline{g} \in G^k$, given $k > 2 \log_2 \vert G \vert + 2 \log_2 \frac{1}{\epsilon} + \log_2 \frac{1}{\delta}$. \label{thm:newer} \end{theorem} Now, we look at a probability distribution of group elements over a larger sample space, describing what happens when the $x_i$ may or may not be inserted. Define $$Q_{\overline{g}, \overline{x}} (h) = \mathrm{Pr}_{\overline{\epsilon}, \overline{\alpha}} \left[ g_1^{\epsilon_1} \cdots g_{{\gamma}_1}^{\epsilon_{{\gamma}_1}} x_1^{\alpha_1} g_{{\gamma}_1 + 1}^{\epsilon_{{\gamma}_1 + 1}} \cdots g_{{\gamma}_1 + {\gamma}_2}^{\epsilon_{{\gamma_1 + \gamma_2}}} x_2^{\alpha_2} \cdots x_l^{\alpha_l} \cdots g_k^{\epsilon_k} = h \right]$$ where $\overline{\alpha}$ is picked uniformly from ${\{0, 1\}}^l$. For fixed $\overline{\alpha}$, let $$R_{\overline{g}, \overline{x}, \overline{\alpha}} (h) = \mathrm{Pr}_{\overline{\epsilon}} \left[ g_1^{\epsilon_1} \cdots g_{{\gamma}_1}^{\epsilon_{{\gamma}_1}} x_1^{\alpha_1} g_{{\gamma}_1 + 1}^{\epsilon_{{\gamma}_1 + 1}} \cdots g_{{\gamma}_1 + {\gamma}_2}^{\epsilon_{{\gamma_1 + \gamma_2}}} x_2^{\alpha_2} \cdots x_l^{\alpha_l} \cdots g_k^{\epsilon_k} = h \right]$$ Then we have \begin{equation} Q_{\overline{g}, \overline{x}} = \frac{1}{2^l} \sum_{\overline{\alpha}} R_{\overline{g}, \overline{x}, \overline{\alpha}} \label{eq:sumit} \end{equation} Suppose $k > 2 \log_2 \vert G \vert + 2 \log_2 \frac{1}{\epsilon} + \log_2 \frac{1}{\delta}$ is fixed. Draw a grid whose rows represent the choices of $\overline{g}$ from $G^k$, and whose columns represent the choices of $\overline{\alpha}$. Keep $l$, $\overline{\gamma}$, and $\overline{x}$ fixed. Mark the $(\overline{g}, \overline{\alpha})$ position in this grid if $R_{\overline{g}, \overline{x}, \overline{\alpha}}$ is $\epsilon$--uniform. Theorem \ref{thm:newer} applies to $R_{\overline{g}, \overline{x}, \overline{\alpha}}$, saying that in every column ({\em i.e.} for any fixed $\overline{\alpha}$), the proportion of unmarked squares is less than $\delta$. Consequently, less than $\sqrt{\delta}$ of the rows have more than $\sqrt{\delta}$ of their positions unmarked. By equation (\ref{eq:sumit}), for more than $1 - \sqrt{\delta}$ of the $\overline{g}$, $$Q_{\overline{g}, \overline{x}} (h) > \frac{1 - \sqrt{\delta}}{\vert G \vert} (1 - \epsilon)$$ This proves: \begin{theorem} For every $\epsilon, \delta > 0$, $Q_{\overline{g}, \overline{x}}$ is $(\epsilon + \delta)$--uniform for more than $1 - \sqrt{\delta}$ proportion of $\overline{g}$, given $k > 2 \log_2 \vert G \vert + 2 \log_2 \frac{1}{\epsilon} + \log_2 \frac{1}{\delta}$. \label{thm:laster} \end{theorem} \section*{Lazy Random Walks} Now we return to our question about mixing time. \begin{definition} Fix $\overline{g} = (g_1, \ldots, g_k)$, not necessarily generating $G$. A {\em lazy random walk} through $\overline{g}$ is a sequence of group elements $X_t$ such that $X_0 = 1$ and $X_{t+1} = X_t \cdot g_{i_{t+1}}^{\epsilon_{t+1}}$, where each $i_{t+1}$ is picked randomly, uniformly from $\{1, \ldots, k\}$, and $\epsilon_{t+1}$ is picked randomly, uniformly from $\{0, 1\}$. Thus $$X_t = g_{i_1}^{\epsilon_1} \cdots g_{i_t}^{\epsilon_t}$$ \end{definition} Let $P_{\overline{g}}^t$ be the probability distribution of the lazy random walk through $\overline{g}$ after $t$ steps. Today's work has shown us: If, after $t$ steps, we have selected $k^{\prime}$ distinct $i$'s (where $k^{\prime}$ is at least as big as in theorem \ref{thm:laster}), then $P_{\overline{g}}^t$ will be close to uniform for most $\overline{g}$; the redundant generators chosen in the lazy random walk will take the place of $\overline{x}$. From the Coupon Collector's Problem, we can compute how big $t$ must be in order to provide enough generators. If we need to collect all the $g_i$ and $\overline{g} = (g_1, \ldots, g_k)$, the expected waiting time is $$1 + \frac{k}{k-1} + \frac{k}{k-2} + \ldots + k = k \log k + O(k)$$ We'll fill in more details in the next lecture. \end{document}