<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>mathco &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/mathco/</link>
	<description>Feed of posts on WordPress.com tagged "mathco"</description>
	<pubDate>Thu, 23 May 2013 01:09:41 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[Recent progress on the Kakeya conjecture]]></title>
<link>http://terrytao.wordpress.com/2009/05/11/recent-progress-on-the-kakeya-conjecture/</link>
<pubDate>Mon, 11 May 2009 18:47:20 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/05/11/recent-progress-on-the-kakeya-conjecture/</guid>
<description><![CDATA[Below the fold is a version of my talk &#8220;Recent progress on the Kakeya conjecture&#8221; that I]]></description>
<content:encoded><![CDATA[<p>
Below the fold is a version of my talk &#8220;Recent progress on the Kakeya conjecture&#8221; that I gave at the Fefferman conference.
</p>
<p>
<!--more-->
</p>
<p>
One of my favourite problems in mathematics is the <a href="http://en.wikipedia.org/wiki/Kakeya_conjecture#Kakeya_conjecture">Kakeya family of conjectures</a>. There are many versions of these conjectures, but one of the simplest to state is the following:
</p>
<p>
 Let <img src='http://s0.wp.com/latex.php?latex=%7BE+%5Csubset+%7B%5Cmathbb+R%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E &#92;subset {&#92;mathbb R}^n}&amp;fg=000000' title='{E &#92;subset {&#92;mathbb R}^n}&amp;fg=000000' class='latex' /> be a compact subset of <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+R%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb R}^n}&amp;fg=000000' title='{{&#92;mathbb R}^n}&amp;fg=000000' class='latex' /> which contains a unit line segment in every direction. Then <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> has <a href="http://en.wikipedia.org/wiki/Hausdorff_dimension">Hausdorff</a> and <a href="http://en.wikipedia.org/wiki/Minkowski_dimension">Minkowski dimension</a> <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />.
</p>
<p>
Sets <img src='http://s0.wp.com/latex.php?latex=%7BE+%5Csubset+%7B%5Cmathbb+R%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E &#92;subset {&#92;mathbb R}^n}&amp;fg=000000' title='{E &#92;subset {&#92;mathbb R}^n}&amp;fg=000000' class='latex' /> which contain a unit line segment in every direction are known as <em>Kakeya sets</em>. It was observed by Besicovitch that for <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Cgeq+2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n &#92;geq 2}&amp;fg=000000' title='{n &#92;geq 2}&amp;fg=000000' class='latex' />, Kakeya sets can have arbitrarily small Lebesgue measure; in fact they can have Lebesgue measure zero. This in turn implies that the solution to the Kakeya needle problem (what is the least amount of area in the plane needed to rotate a unit line segment by <img src='http://s0.wp.com/latex.php?latex=%7B360%5E%5Ccirc%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{360^&#92;circ}&amp;fg=000000' title='{360^&#92;circ}&amp;fg=000000' class='latex' />?) is that a unit needle can be rotated in arbitrarily small area (see this <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/12/31/a-remark-on-the-kakeya-needle-problem/">previous blog post of mine</a> for further discussion).
</p>
<p>
The conjecture is trivial in one dimension, and also proven in two dimensions (a result <a href="http://www.ams.org/mathscinet-getitem?mr=272988">of Davies</a>), but remains open in three and higher dimensions. Nevertheless, there are a number of partial results, typically of the form &#8220;Kakeya sets in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+R%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb R}^n}&amp;fg=000000' title='{{&#92;mathbb R}^n}&amp;fg=000000' class='latex' /> have Hausdorff or Minkowski dimension at least <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' />&#8221; for various values of <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' /> (with the objective being to get <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' /> all the way up to <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />&#8220;. One can also phrase such results in a largely equivalent discrete fashion, as follows. Let <img src='http://s0.wp.com/latex.php?latex=%7B0+%26%2360%3B+%5Cdelta+%26%2360%3B+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &lt; &#92;delta &lt; 1}&amp;fg=000000' title='{0 &lt; &#92;delta &lt; 1}&amp;fg=000000' class='latex' /> be a small number, and let <img src='http://s0.wp.com/latex.php?latex=%7BT_1%2C%5Cldots%2CT_N%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{T_1,&#92;ldots,T_N}&amp;fg=000000' title='{T_1,&#92;ldots,T_N}&amp;fg=000000' class='latex' /> be a collection of <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Ctimes+%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;times &#92;delta}&amp;fg=000000' title='{1 &#92;times &#92;delta}&amp;fg=000000' class='latex' /> tubes which are oriented in a <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />-separated set of directions (thus if <img src='http://s0.wp.com/latex.php?latex=%7BT_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{T_i}&amp;fg=000000' title='{T_i}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BT_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{T_j}&amp;fg=000000' title='{T_j}&amp;fg=000000' class='latex' /> are oriented in the direction <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;omega_i}&amp;fg=000000' title='{&#92;omega_i}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;omega_j}&amp;fg=000000' title='{&#92;omega_j}&amp;fg=000000' class='latex' /> respectively for some <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+i+%26%2360%3Bj+%5Cleq+N%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq i &lt;j &#92;leq N}&amp;fg=000000' title='{1 &#92;leq i &lt;j &#92;leq N}&amp;fg=000000' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%7B%5Cangle+%5Comega_i%2C+%5Comega_j+%5Cgeq+%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;angle &#92;omega_i, &#92;omega_j &#92;geq &#92;delta}&amp;fg=000000' title='{&#92;angle &#92;omega_i, &#92;omega_j &#92;geq &#92;delta}&amp;fg=000000' class='latex' />). This <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />-separation hypothesis forces <img src='http://s0.wp.com/latex.php?latex=%7BN%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{N}&amp;fg=000000' title='{N}&amp;fg=000000' class='latex' /> to be <img src='http://s0.wp.com/latex.php?latex=%7BO%28+%5Cdelta%5E%7B1-n%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O( &#92;delta^{1-n} )}&amp;fg=000000' title='{O( &#92;delta^{1-n} )}&amp;fg=000000' class='latex' /> (we allow implied constants to depend on <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />); let us now assume that <img src='http://s0.wp.com/latex.php?latex=%7BN%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{N}&amp;fg=000000' title='{N}&amp;fg=000000' class='latex' /> is close to its maximal value, thus <img src='http://s0.wp.com/latex.php?latex=%7BN+%5Cgg+%5Cdelta%5E%7B1-n%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{N &#92;gg &#92;delta^{1-n}}&amp;fg=000000' title='{N &#92;gg &#92;delta^{1-n}}&amp;fg=000000' class='latex' />. The Kakeya set conjecture (for the Minkowski dimension) is then equivalent to the assertion that </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3B%5Cbigcup_%7Bi%3D1%7D%5EN+T_i%26%23124%3B+%5Cgg_%5Cepsilon+%5Cdelta%5E%5Cepsilon%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;&#92;bigcup_{i=1}^N T_i&#124; &#92;gg_&#92;epsilon &#92;delta^&#92;epsilon&amp;fg=000000' title='&#92;displaystyle  &#124;&#92;bigcup_{i=1}^N T_i&#124; &#92;gg_&#92;epsilon &#92;delta^&#92;epsilon&amp;fg=000000' class='latex' /></p>
<p> for all <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BE%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;E&#124;}&amp;fg=000000' title='{&#124;E&#124;}&amp;fg=000000' class='latex' /> denotes the volume of the set <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Cgg_%5Cepsilon+B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;gg_&#92;epsilon B}&amp;fg=000000' title='{A &#92;gg_&#92;epsilon B}&amp;fg=000000' class='latex' /> denotes the estimate <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Cgeq+c_%5Cepsilon+B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;geq c_&#92;epsilon B}&amp;fg=000000' title='{A &#92;geq c_&#92;epsilon B}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7Bc_%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c_&#92;epsilon &gt; 0}&amp;fg=000000' title='{c_&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' /> depending only on <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon}&amp;fg=000000' title='{&#92;epsilon}&amp;fg=000000' class='latex' />. Similarly, a partial result of the form <a name="tnd">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3B%5Cbigcup_%7Bi%3D1%7D%5EN+T_i%26%23124%3B+%5Cgg_%5Cepsilon+%5Cdelta%5E%7Bn-d%2B%5Cepsilon%7D+%5C+%5C+%5C+%5C+%5C+%281%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;&#92;bigcup_{i=1}^N T_i&#124; &#92;gg_&#92;epsilon &#92;delta^{n-d+&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' title='&#92;displaystyle  &#124;&#92;bigcup_{i=1}^N T_i&#124; &#92;gg_&#92;epsilon &#92;delta^{n-d+&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' class='latex' /></p>
<p></a> for all <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' /> and some <img src='http://s0.wp.com/latex.php?latex=%7B0+%5Cleq+d+%5Cleq+n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &#92;leq d &#92;leq n}&amp;fg=000000' title='{0 &#92;leq d &#92;leq n}&amp;fg=000000' class='latex' /> would imply (and is basically equivalent to) the assertion that Kakeya sets have (lower) Minkowski dimension at least <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' />.</p>
<p>
There is also a somewhat stronger <em>Kakeya maximal function conjecture</em> which is also of interest; with the same hypotheses as above, the conjecture asserts that <a name="max">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%26%23124%3B+%5Csum_%7Bi%3D1%7D%5EN+1_%7BT_i%7D+%5C%26%23124%3B_%7BL%5E%7Bd%2F%28d-1%29%7D%28%7B%5Cmathbb+R%7D%5En%29%7D+%5Cll_%5Cepsilon+%28%5Cfrac%7B1%7D%7B%5Cdelta%7D%29%5E%7B%5Cfrac%7Bn%7D%7Bd%7D-1%2B%5Cepsilon%7D+%5C+%5C+%5C+%5C+%5C+%282%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;&#124; &#92;sum_{i=1}^N 1_{T_i} &#92;&#124;_{L^{d/(d-1)}({&#92;mathbb R}^n)} &#92;ll_&#92;epsilon (&#92;frac{1}{&#92;delta})^{&#92;frac{n}{d}-1+&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' title='&#92;displaystyle  &#92;&#124; &#92;sum_{i=1}^N 1_{T_i} &#92;&#124;_{L^{d/(d-1)}({&#92;mathbb R}^n)} &#92;ll_&#92;epsilon (&#92;frac{1}{&#92;delta})^{&#92;frac{n}{d}-1+&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' class='latex' /></p>
<p></a> for all <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+d+%5Cleq+n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq d &#92;leq n}&amp;fg=000000' title='{1 &#92;leq d &#92;leq n}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' />. This conjecture is trivial for <img src='http://s0.wp.com/latex.php?latex=%7Bd%3D1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d=1}&amp;fg=000000' title='{d=1}&amp;fg=000000' class='latex' />, but the difficulty increases as <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' /> approaches <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />; for any fixed <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' />, the estimate <a href="#max">(2)</a> easily implies <a href="#tnd">(1)</a>, and hence that Kakeya sets in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+R%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb R}^n}&amp;fg=000000' title='{{&#92;mathbb R}^n}&amp;fg=000000' class='latex' /> have Minkowski dimension at least <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' />; it also can be used to imply that such sets have Hausdorff dimension at least <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' /> as well. (The terminology &#8220;Kakeya maximal function conjecture&#8221; comes from the fact that <a href="#max">(2)</a> is dual to a certain <img src='http://s0.wp.com/latex.php?latex=%7BL%5Ep%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L^p}&amp;fg=000000' title='{L^p}&amp;fg=000000' class='latex' /> type bound on the <em>Kakeya maximal function</em> </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f%5E%2A_%5Cdelta%28%5Comega%29+%3A%3D+%5Csup_%7BT+%2F%2F+%5Comega%7D+%5Cfrac%7B1%7D%7B%26%23124%3BT%26%23124%3B%7D+%5Cint_T+%26%23124%3Bf%26%23124%3B%2C%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f^*_&#92;delta(&#92;omega) := &#92;sup_{T // &#92;omega} &#92;frac{1}{&#124;T&#124;} &#92;int_T &#124;f&#124;,&amp;fg=000000' title='&#92;displaystyle  f^*_&#92;delta(&#92;omega) := &#92;sup_{T // &#92;omega} &#92;frac{1}{&#124;T&#124;} &#92;int_T &#124;f&#124;,&amp;fg=000000' class='latex' /></p>
<p> a variant of the Hardy-Littlewood maximal function which averages over <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Ctimes+%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;times &#92;delta}&amp;fg=000000' title='{1 &#92;times &#92;delta}&amp;fg=000000' class='latex' /> tubes oriented in a direction <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;omega}&amp;fg=000000' title='{&#92;omega}&amp;fg=000000' class='latex' />, rather than on balls, but we will not discuss the maximal function further here.)</p>
<p>
The Kakeya conjecture is known to have applications to other fields of mathematics, for instance to Fourier analysis (as observed <a href="http://www.ams.org/mathscinet-getitem?mr=296602">by Fefferman</a>), to wave equations (as observed <a href="http://www.ams.org/mathscinet-getitem?mr=1660476">by Wolff</a>), to analytic number theory (as observed <a href="http://www.ams.org/mathscinet-getitem?mr=1863693">by Bourgain</a>), to cryptography (as observed also <a href="http://www.ams.org/mathscinet-getitem?mr=2206765">by Bourgain</a>) and to random number generation in computer science (as observed <a href="http://www.math.ias.edu/~dvir/papers/DvirWigderson08.pdf">by Dvir and Wigderson</a>). However, the focus of this talk is not on the fields of mathematics impacted by the Kakeya conjecture, but rather on the fields of mathematics that are used to make <em>progress</em> on this conjecture, as there is a striking diversity of mathematical techniques which have been usefully applied to produce a number of non-trivial partial results on the problem. In particular, I wish to discuss how the following areas of mathematics have been used to attack the Kakeya problem:
</p>
<p><ul>
<li> Incidence geometry; </li>
<li> Additive combinatorics; </li>
<li> Multiscale analysis; </li>
<li> Heat flows; </li>
<li> Algebraic geometry; </li>
<li> Algebraic topology.
</li>
</ul>
</p>
<p align="center"><b> &#8212;  1. Incidence geometry  &#8212; </b></p>
<p>
The earliest positive results on the Kakeya problem, when interpreted from a modern perspective, were based primarily on exploiting elementary quantitative results in incidence geometry &#8211; the study of how points, lines, planes, and the like (or more precisely, <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />-thickened versions of such concepts, such as balls, tubes, and slabs) intersect each other. The incidence geometry approach was greatly clarified by the introduction by Wolff in 1995 of the <em>finite field model</em> of the Kakeya conjectures, in which the underlying field <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+R%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb R}}&amp;fg=000000' title='{{&#92;mathbb R}}&amp;fg=000000' class='latex' /> is replaced by a field <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+F%7D_q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb F}_q}&amp;fg=000000' title='{{&#92;Bbb F}_q}&amp;fg=000000' class='latex' /> of some large but finite order <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' /> (which of course must be a prime, or a power of a prime). The analogue of the Kakeya set bound <a href="#tnd">(1)</a> is then an estimate of the form <a name="tnd-f">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%23+%5Cbigcup_%7B%5Cell+%5Cin+L%7D+%5Cell+%5Cgg_%5Cepsilon+q%5E%7Bd-%5Cepsilon%7D+%5C+%5C+%5C+%5C+%5C+%283%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;# &#92;bigcup_{&#92;ell &#92;in L} &#92;ell &#92;gg_&#92;epsilon q^{d-&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' title='&#92;displaystyle  &#92;# &#92;bigcup_{&#92;ell &#92;in L} &#92;ell &#92;gg_&#92;epsilon q^{d-&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' class='latex' /></p>
<p></a> for all <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7BL%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L}&amp;fg=000000' title='{L}&amp;fg=000000' class='latex' /> is a family of lines in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+F%7D_q%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb F}_q^n}&amp;fg=000000' title='{{&#92;Bbb F}_q^n}&amp;fg=000000' class='latex' /> that contains one line in every direction (thus, <img src='http://s0.wp.com/latex.php?latex=%7BL%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L}&amp;fg=000000' title='{L}&amp;fg=000000' class='latex' /> will have cardinality comparable to <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{n-1}}&amp;fg=000000' title='{q^{n-1}}&amp;fg=000000' class='latex' />), and <img src='http://s0.wp.com/latex.php?latex=%7B%5C%23+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;# E}&amp;fg=000000' title='{&#92;# E}&amp;fg=000000' class='latex' /> denotes the cardinality of a set <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' />. In a similar vein, the analogue of <a href="#max">(2)</a> is the assertion that <a name="max-f">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%26%23124%3B+%5Csum_%7B%5Cell+%5Cin+L%7D+1_%7B%5Cell%7D+%5C%26%23124%3B_%7B%5Cell%5E%7Bd%2F%28d-1%29%7D%28%7B%5CBbb+F%7D_q%5En%29%7D+%5Cll_%5Cepsilon+q%5E%7Bn-1%2B%5Cepsilon%7D+%5C+%5C+%5C+%5C+%5C+%284%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;&#124; &#92;sum_{&#92;ell &#92;in L} 1_{&#92;ell} &#92;&#124;_{&#92;ell^{d/(d-1)}({&#92;Bbb F}_q^n)} &#92;ll_&#92;epsilon q^{n-1+&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (4)&amp;fg=000000' title='&#92;displaystyle  &#92;&#124; &#92;sum_{&#92;ell &#92;in L} 1_{&#92;ell} &#92;&#124;_{&#92;ell^{d/(d-1)}({&#92;Bbb F}_q^n)} &#92;ll_&#92;epsilon q^{n-1+&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (4)&amp;fg=000000' class='latex' /></p>
<p></a> under the same hypothesis on <img src='http://s0.wp.com/latex.php?latex=%7BL%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L}&amp;fg=000000' title='{L}&amp;fg=000000' class='latex' />. The fact that <a href="#max-f">(4)</a> implies <a href="#tnd-f">(3)</a> can be easily seen from the identity </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%26%23124%3B+%5Csum_%7B%5Cell+%5Cin+L%7D+1_%7B%5Cell%7D+%5C%26%23124%3B_%7B%5Cell%5E%7B1%7D%28%7B%5CBbb+F%7D_q%5En%29%7D+%3D+q+%5C%23+L+%5Csim+q%5Ed%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;&#124; &#92;sum_{&#92;ell &#92;in L} 1_{&#92;ell} &#92;&#124;_{&#92;ell^{1}({&#92;Bbb F}_q^n)} = q &#92;# L &#92;sim q^d&amp;fg=000000' title='&#92;displaystyle  &#92;&#124; &#92;sum_{&#92;ell &#92;in L} 1_{&#92;ell} &#92;&#124;_{&#92;ell^{1}({&#92;Bbb F}_q^n)} = q &#92;# L &#92;sim q^d&amp;fg=000000' class='latex' /></p>
<p> and H&#246;lder&#8217;s inequality, using the fact that the multiplicity function <img src='http://s0.wp.com/latex.php?latex=%7B%5Csum_%7B%5Cell+%5Cin+L%7D+1_%7B%5Cell%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sum_{&#92;ell &#92;in L} 1_{&#92;ell}}&amp;fg=000000' title='{&#92;sum_{&#92;ell &#92;in L} 1_{&#92;ell}}&amp;fg=000000' class='latex' /> is supported in <img src='http://s0.wp.com/latex.php?latex=%7B%5Cbigcup_%7B%5Cell+%5Cin+L%7D+%5Cell%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;bigcup_{&#92;ell &#92;in L} &#92;ell}&amp;fg=000000' title='{&#92;bigcup_{&#92;ell &#92;in L} &#92;ell}&amp;fg=000000' class='latex' />. </p>
<p>
There is a close analogy between the finite field Kakeya problems and the Euclidean ones; arguments that make progress in one setting can often be adapted to make progress in the other. However, there is no formal correspondence, and there are some arguments that seem to be specific to the finite field setting, and other arguments that are specific to the Euclidean one. Nevertheless, the finite field setting has proven to be an extremely useful toy model to gain intuition and insight into the (more complicated) Euclidean problem.
</p>
<p>
Very roughly speaking, every incidence geometry fact in classical Euclidean geometry can be converted (at least in principle) via elementary combinatorial arguments into a lower bound on Kakeya sets. Consider for instance the Euclidean axiom that any two lines intersect in at most one point. This can be converted to the bound <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq 2}&amp;fg=000000' title='{d &#92;geq 2}&amp;fg=000000' class='latex' /> for any dimension <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Cgeq+2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n &#92;geq 2}&amp;fg=000000' title='{n &#92;geq 2}&amp;fg=000000' class='latex' />, for any one of the versions of the Kakeya problem mentioned above; this was established <a href="http://www.ams.org/mathscinet-getitem?mr=272988">by Davies</a> for the Euclidean dimension problem and <a href="http://www.ams.org/mathscinet-getitem?mr=447949">by C&#243;rdoba</a> for the Euclidean maximal function problem. We sketch a heuristic proof for the finite field set problem <a href="#tnd-f">(3)</a>. Suppose for simplicity that <img src='http://s0.wp.com/latex.php?latex=%7B%5C%23+%5Cbigcup_%7B%5Cell+%5Cin+L%7D+%5Cell%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;# &#92;bigcup_{&#92;ell &#92;in L} &#92;ell}&amp;fg=000000' title='{&#92;# &#92;bigcup_{&#92;ell &#92;in L} &#92;ell}&amp;fg=000000' class='latex' /> has size exactly <img src='http://s0.wp.com/latex.php?latex=%7Bq%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^d}&amp;fg=000000' title='{q^d}&amp;fg=000000' class='latex' />. There are about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{n-1}}&amp;fg=000000' title='{q^{n-1}}&amp;fg=000000' class='latex' /> lines <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell}&amp;fg=000000' title='{&#92;ell}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BL%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L}&amp;fg=000000' title='{L}&amp;fg=000000' class='latex' />, each containing <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' /> points; if we adopt the heuristic that the <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bn-1%7D+%5Ctimes+q+%3D+q%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{n-1} &#92;times q = q^n}&amp;fg=000000' title='{q^{n-1} &#92;times q = q^n}&amp;fg=000000' class='latex' /> points described this way are spread out uniformly in the set <img src='http://s0.wp.com/latex.php?latex=%7B%5Csum_%7B%5Cell+%5Cin+L%7D+1_%7B%5Cell%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sum_{&#92;ell &#92;in L} 1_{&#92;ell}}&amp;fg=000000' title='{&#92;sum_{&#92;ell &#92;in L} 1_{&#92;ell}}&amp;fg=000000' class='latex' />, then each point in that set should be incident to about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bn-d%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{n-d}}&amp;fg=000000' title='{q^{n-d}}&amp;fg=000000' class='latex' /> lines. Now, let us count the number of configurations consisting of two distinct lines in <img src='http://s0.wp.com/latex.php?latex=%7BL%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L}&amp;fg=000000' title='{L}&amp;fg=000000' class='latex' /> intersecting at a single point. On the one hand, since <img src='http://s0.wp.com/latex.php?latex=%7BL%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L}&amp;fg=000000' title='{L}&amp;fg=000000' class='latex' /> has about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{n-1}}&amp;fg=000000' title='{q^{n-1}}&amp;fg=000000' class='latex' /> lines, and any two lines intersect in at most one point, the number of such configurations is at most <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7B2%28n-1%29%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{2(n-1)}}&amp;fg=000000' title='{q^{2(n-1)}}&amp;fg=000000' class='latex' />; on the other hand, since there are about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^d}&amp;fg=000000' title='{q^d}&amp;fg=000000' class='latex' /> points, and each point is incident to about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bn-d%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{n-d}}&amp;fg=000000' title='{q^{n-d}}&amp;fg=000000' class='latex' /> lines, then the number of configurations is at least <img src='http://s0.wp.com/latex.php?latex=%7Bq%5Ed+%5Ctimes+q%5E%7Bn-d%7D+%5Ctimes+q%5E%7Bn-d%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^d &#92;times q^{n-d} &#92;times q^{n-d}}&amp;fg=000000' title='{q^d &#92;times q^{n-d} &#92;times q^{n-d}}&amp;fg=000000' class='latex' />. Comparing the lower and upper bounds gives the desired bound <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq 2}&amp;fg=000000' title='{d &#92;geq 2}&amp;fg=000000' class='latex' />. (This argument assumed uniform distribution of the multiplicity function, but the general case can be handled similarly by applying the Cauchy-Schwarz inequality.)
</p>
<p>
There are several other instances of this incidence geometry strategy in action:
</p>
<p><ul>
<li> The axiom that two distinct points determine a line can be used to give the bound <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+%5Cfrac%7Bn%2B1%7D%7B2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq &#92;frac{n+1}{2}}&amp;fg=000000' title='{d &#92;geq &#92;frac{n+1}{2}}&amp;fg=000000' class='latex' /> in any dimension <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />. This was done (implicitly) <a href="http://www.ams.org/mathscinet-getitem?mr=684547">by Drury</a> for the Euclidean set and maximal problems, and explicitly by <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=835805">by Christ, Duandikoextea, and Rubio de Francia</a> with a refined endpoint estimate at <img src='http://s0.wp.com/latex.php?latex=%7Bd%3D%5Cfrac%7Bn%2B1%7D%7B2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d=&#92;frac{n+1}{2}}&amp;fg=000000' title='{d=&#92;frac{n+1}{2}}&amp;fg=000000' class='latex' />. </li>
<li> The fact that any three lines in general position determine a regulus (ruled surface), which contains a one-dimensional family of lines incident to all three of the original lines, can be used to give the bound <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+%5Cfrac%7B7%7D%7B3%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq &#92;frac{7}{3}}&amp;fg=000000' title='{d &#92;geq &#92;frac{7}{3}}&amp;fg=000000' class='latex' /> in the three-dimensional case <img src='http://s0.wp.com/latex.php?latex=%7Bn%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n=3}&amp;fg=000000' title='{n=3}&amp;fg=000000' class='latex' />; this was achieved <a href="http://www.ams.org/mathscinet-getitem?mr=1631271">by Schlag</a> (and the bound was also obtained by a slightly different argument earlier <a href="http://www.ams.org/mathscinet-getitem?mr=1097257">by Bourgain</a>). </li>
<li> The axiom that any two intersecting lines determine a plane, which contains a one-dimensional family of possible <em>directions</em> of lines, was used <a href="http://www.ams.org/mathscinet-getitem?mr=1363209">by Wolff</a> to give the bound <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+%5Cfrac%7Bn%2B2%7D%7B2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq &#92;frac{n+2}{2}}&amp;fg=000000' title='{d &#92;geq &#92;frac{n+2}{2}}&amp;fg=000000' class='latex' /> in all dimensions <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Cgeq+2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n &#92;geq 2}&amp;fg=000000' title='{n &#92;geq 2}&amp;fg=000000' class='latex' />. This argument relied cruically on the fact that the lines pointed in different directions; note that if one allows lines to be parallel, then a plane contains a <em>two</em>-dimensional family of lines rather than a one-dimensional one. </li>
<li> In four dimensions (and for the finite field set problem), <a href="http://www.ams.org/mathscinet-getitem?mr=2225076">I managed</a> to improve the Wolff bound of <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq 3}&amp;fg=000000' title='{d &#92;geq 3}&amp;fg=000000' class='latex' /> in four dimensions <img src='http://s0.wp.com/latex.php?latex=%7Bn%3D4%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n=4}&amp;fg=000000' title='{n=4}&amp;fg=000000' class='latex' /> slightly to <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+3+%2B+%5Cfrac%7B1%7D%7B16%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq 3 + &#92;frac{1}{16}}&amp;fg=000000' title='{d &#92;geq 3 + &#92;frac{1}{16}}&amp;fg=000000' class='latex' />, by a more complicated axiom from incidence geometry, namely that any three reguli in general position determine an algebraic hypersurface, the lines in which only point in a two-dimensional family of directions.
</li>
</ul>
<p>
From this sequence, it seems that one needs to imply increasingly &#8220;high-degree&#8221; facts from incidence geometry to make deeper progress on the Kakeya conjecture, in particular one begins to transition from incidence geometry to algebraic geometry. Indeed, one can view the polynomial method of Dvir (introduced later in this post) as the natural continuation of these methods. However, there is some evidence that the incidence geometry approach, by itself, is not sufficient to establish the full conjecture. For instance, if one heuristically inserts an arbitrary incidence geometry fact involving bounded-degree algebraic varieties into the above method, it appears that one cannot obtain a lower bound on <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' /> of better than <img src='http://s0.wp.com/latex.php?latex=%7Bn%2F2+%2B+O%281%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n/2 + O(1)}&amp;fg=000000' title='{n/2 + O(1)}&amp;fg=000000' class='latex' />. Also, there are near-counterexamples to the Kakeya conjecture (such as the Heisenberg group <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+%28z_1%2Cz_2%2Cz_3%29%3A+%5Chbox%7BIm%7D%28z_3%29+%3D+%5Chbox%7BIm%7D%28z_1+%5Coverline%7Bz_2%7D%29+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ (z_1,z_2,z_3): &#92;hbox{Im}(z_3) = &#92;hbox{Im}(z_1 &#92;overline{z_2}) &#92;}}&amp;fg=000000' title='{&#92;{ (z_1,z_2,z_3): &#92;hbox{Im}(z_3) = &#92;hbox{Im}(z_1 &#92;overline{z_2}) &#92;}}&amp;fg=000000' class='latex' /> when the field <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+F%7D_q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb F}_q}&amp;fg=000000' title='{{&#92;Bbb F}_q}&amp;fg=000000' class='latex' /> admits a non-trivial conjugation <img src='http://s0.wp.com/latex.php?latex=%7Bz+%5Cmapsto+%5Coverline%7Bz%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{z &#92;mapsto &#92;overline{z}}&amp;fg=000000' title='{z &#92;mapsto &#92;overline{z}}&amp;fg=000000' class='latex' />) which only fail to contradict that conjecture due to the parallel nature of some of the lines, so any argument establishing the conjecture must make essential use of the fact that lines point in different directions.
</p>
</p>
<p align="center"><b> &#8212;  2. Additive combinatorics  &#8212; </b></p>
<p>
In 1998, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1692486">Bourgain introduced</a> a somewhat different approach to the Kakeya problem, which relied on elementary facts of <em>arithmetic</em> (and in particular, addition and subtraction), rather than that of <em>geometry</em>, to obtain new bounds on the Kakeya problem. This additive-combinatorics method fared better than the geometric method in higher dimensions, basically because the additive structure of high-dimensional spaces was much the same as for low-dimensional spaces, even if the geometric structure becomes much different (and less intuitive).
</p>
<p>
It is convenient to illustrate the method using the finite field model problem <a href="#tnd-f">(3)</a>, using the method of &#8220;slices&#8221; introduced by Bourgain (though it is also possible to perform the additive combinatorial method without slicing the set), though the method can also be adapted to the other variants of the Kakeya problem. As before, we argue heuristically in order to simplify the discussion. Let <img src='http://s0.wp.com/latex.php?latex=%7BE+%5Csubset+%7B%5CBbb+F%7D_q%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E &#92;subset {&#92;Bbb F}_q^n}&amp;fg=000000' title='{E &#92;subset {&#92;Bbb F}_q^n}&amp;fg=000000' class='latex' /> be a finite field Kakeya set, and suppose it has cardinality about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^d}&amp;fg=000000' title='{q^d}&amp;fg=000000' class='latex' />. We then write <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+F%7D_q%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb F}_q^n}&amp;fg=000000' title='{{&#92;Bbb F}_q^n}&amp;fg=000000' class='latex' /> as <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+F%7D+%5Ctimes+%7B%5CBbb+F%7D_q%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb F} &#92;times {&#92;Bbb F}_q^{n-1}}&amp;fg=000000' title='{{&#92;Bbb F} &#92;times {&#92;Bbb F}_q^{n-1}}&amp;fg=000000' class='latex' /> and consider the three &#8220;slices&#8221; <img src='http://s0.wp.com/latex.php?latex=%7BA%2C+B%2C+C%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A, B, C}&amp;fg=000000' title='{A, B, C}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' />, defined as the intersection of <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> with the horizontal hyperplanes <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B0%5C%7D+%5Ctimes+%7B%5CBbb+F%7D_q%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{0&#92;} &#92;times {&#92;Bbb F}_q^{n-1}}&amp;fg=000000' title='{&#92;{0&#92;} &#92;times {&#92;Bbb F}_q^{n-1}}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B1%5C%7D+%5Ctimes+%7B%5CBbb+F%7D_q%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{1&#92;} &#92;times {&#92;Bbb F}_q^{n-1}}&amp;fg=000000' title='{&#92;{1&#92;} &#92;times {&#92;Bbb F}_q^{n-1}}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B1%2F2%5C%7D+%5Ctimes+%7B%5CBbb+F%7D_q%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{1/2&#92;} &#92;times {&#92;Bbb F}_q^{n-1}}&amp;fg=000000' title='{&#92;{1/2&#92;} &#92;times {&#92;Bbb F}_q^{n-1}}&amp;fg=000000' class='latex' /> respectively (let us assume <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' /> is odd for the sake of discussion, so that <img src='http://s0.wp.com/latex.php?latex=%7B1%2F2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1/2}&amp;fg=000000' title='{1/2}&amp;fg=000000' class='latex' /> is well-defined). Then we expect <img src='http://s0.wp.com/latex.php?latex=%7BA%2C+B%2C+C%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A, B, C}&amp;fg=000000' title='{A, B, C}&amp;fg=000000' class='latex' /> to all have cardinality about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bd-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{d-1}}&amp;fg=000000' title='{q^{d-1}}&amp;fg=000000' class='latex' />. On the other hand, <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> contains about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{n-1}}&amp;fg=000000' title='{q^{n-1}}&amp;fg=000000' class='latex' /> lines, each of which connect a point <img src='http://s0.wp.com/latex.php?latex=%7Ba+%5Cin+A%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a &#92;in A}&amp;fg=000000' title='{a &#92;in A}&amp;fg=000000' class='latex' /> to a point <img src='http://s0.wp.com/latex.php?latex=%7Bb+%5Cin+B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{b &#92;in B}&amp;fg=000000' title='{b &#92;in B}&amp;fg=000000' class='latex' />, with the midpoint <img src='http://s0.wp.com/latex.php?latex=%7B%5Cfrac%7Ba%2Bb%7D%7B2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;frac{a+b}{2}}&amp;fg=000000' title='{&#92;frac{a+b}{2}}&amp;fg=000000' class='latex' /> lying in <img src='http://s0.wp.com/latex.php?latex=%7BC%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C}&amp;fg=000000' title='{C}&amp;fg=000000' class='latex' />. Since two points uniquely determine a line, we thus have about <img src='http://s0.wp.com/latex.php?latex=%7Bq%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q^{n-1}}&amp;fg=000000' title='{q^{n-1}}&amp;fg=000000' class='latex' /> many pairs <img src='http://s0.wp.com/latex.php?latex=%7B%28a%2Cb%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(a,b)}&amp;fg=000000' title='{(a,b)}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Ctimes+B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;times B}&amp;fg=000000' title='{A &#92;times B}&amp;fg=000000' class='latex' /> whose sums <img src='http://s0.wp.com/latex.php?latex=%7Ba%2Bb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a+b}&amp;fg=000000' title='{a+b}&amp;fg=000000' class='latex' /> are contained in a small set, namely <img src='http://s0.wp.com/latex.php?latex=%7B2+%5Ccdot+C%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2 &#92;cdot C}&amp;fg=000000' title='{2 &#92;cdot C}&amp;fg=000000' class='latex' />. (This fact alone already leads to the lower bound <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+%5Cfrac%7Bn%2B1%7D%7B2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq &#92;frac{n+1}{2}}&amp;fg=000000' title='{d &#92;geq &#92;frac{n+1}{2}}&amp;fg=000000' class='latex' /> mentioned earlier.) On the other hand, since the lines all point in different directions, the <em>differences</em> <img src='http://s0.wp.com/latex.php?latex=%7Ba-b%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a-b}&amp;fg=000000' title='{a-b}&amp;fg=000000' class='latex' /> of all these pairs <img src='http://s0.wp.com/latex.php?latex=%7B%28a%2Cb%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(a,b)}&amp;fg=000000' title='{(a,b)}&amp;fg=000000' class='latex' /> are distinct. The additive combinatorial strategy is then to play off the compressed nature of the sums on one hand, and the dispersed nature of the differences on the other. One of the main ingredients here are elementary additive identities, such as </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++a%2Bb+%3D+a%27%2Bb%27+%5Cimplies+a-b%27+%3D+a%27-b%2C%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  a+b = a&#039;+b&#039; &#92;implies a-b&#039; = a&#039;-b,&amp;fg=000000' title='&#92;displaystyle  a+b = a&#039;+b&#039; &#92;implies a-b&#039; = a&#039;-b,&amp;fg=000000' class='latex' /></p>
<p> which suggests that collisions of sums should imply collisions of differences. This identity, by itself, is insufficient to obtain any new bound on the Kakeya problem, because even if the pairs <img src='http://s0.wp.com/latex.php?latex=%7B%28a%2Cb%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(a,b)}&amp;fg=000000' title='{(a,b)}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%28a%27%2Cb%27%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(a&#039;,b&#039;)}&amp;fg=000000' title='{(a&#039;,b&#039;)}&amp;fg=000000' class='latex' /> come from lines in <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' />, there is no reason why the pairs <img src='http://s0.wp.com/latex.php?latex=%7B%28a%2Cb%27%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(a,b&#039;)}&amp;fg=000000' title='{(a,b&#039;)}&amp;fg=000000' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%7B%28a%27%2Cb%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(a&#039;,b)}&amp;fg=000000' title='{(a&#039;,b)}&amp;fg=000000' class='latex' /> should also. But one can then combine the above identity with further identities, such as
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++a-b+%3D+%28a-b%27%29+-+%28a%27-b%27%29+%2B+%28a%27-b%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  a-b = (a-b&#039;) - (a&#039;-b&#039;) + (a&#039;-b)&amp;fg=000000' title='&#92;displaystyle  a-b = (a-b&#039;) - (a&#039;-b&#039;) + (a&#039;-b)&amp;fg=000000' class='latex' /></p>
<p> which can be used to convert collisions of some differences to collisions of other differences. Implementing this strategy rigorously using a combinatorial tool now known as the <em>Balog-Szemer&#233;di-Gowers lemma</em>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1692486">Bourgain improved</a> the bound <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+%5Cfrac%7Bn%2B1%7D%7B2%7D+%3D+%5Cfrac%7Bn-1%7D%7B2%7D%2B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq &#92;frac{n+1}{2} = &#92;frac{n-1}{2}+1}&amp;fg=000000' title='{d &#92;geq &#92;frac{n+1}{2} = &#92;frac{n-1}{2}+1}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+%5Cfrac%7Bn-1%7D%7B2-1%2F13%7D%2B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq &#92;frac{n-1}{2-1/13}+1}&amp;fg=000000' title='{d &#92;geq &#92;frac{n-1}{2-1/13}+1}&amp;fg=000000' class='latex' /> (for the Minkowski dimension in both Euclidean and finite field settings), which was superior to the bounds obtained by incidence geometry methods in high dimensions. By inserting more and more such additive identities into this framework (and taking more and more slices), the constants improved somewhat; for instance, in high dimensions, the best result so far (on the Minkowski Euclidean problem) is <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+%5Cfrac%7Bn-1%7D%7B%5Calpha%7D%2B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq &#92;frac{n-1}{&#92;alpha}+1}&amp;fg=000000' title='{d &#92;geq &#92;frac{n-1}{&#92;alpha}+1}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7B%5Calpha+%3D+1.675%5Cldots%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;alpha = 1.675&#92;ldots}&amp;fg=000000' title='{&#92;alpha = 1.675&#92;ldots}&amp;fg=000000' class='latex' /> is the largest root of <img src='http://s0.wp.com/latex.php?latex=%7B%5Calpha%5E3-4%5Calpha%2B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;alpha^3-4&#92;alpha+2}&amp;fg=000000' title='{&#92;alpha^3-4&#92;alpha+2}&amp;fg=000000' class='latex' />, a <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1945284">result of Nets Katz and myself</a>; see <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1964819">this survey article of Nets and myself</a> for further discussion. </p>
<p>
It may well be that the arithmetic approach could eventually settle the entire Kakeya conjecture (this would correpsond to lowering the exponent <img src='http://s0.wp.com/latex.php?latex=%7B%5Calpha%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;alpha}&amp;fg=000000' title='{&#92;alpha}&amp;fg=000000' class='latex' /> appearing in the above results all the way to <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />); there is a formalisation of this assertion, known as the &#8220;arithmetic Kakeya conjecture&#8221;, which has some amusing <a href="http://www.ams.org/mathscinet-getitem?mr=2157367">relationships with group theory</a>. However, we have not been able to find additive combinatorial arguments that are so efficient that they do not lose anything in the exponents, and it is not clear whether this conjecture is within reach of known technology. One possible direction to pursue is to move from additive combinatorics (the combinatorics of addition and subtraction) to <em>arithmetic</em> combinatorics (the combinatorics of addition, subtraction, multiplication, and division). For instance, the sum-product phenomenon in finite fields was used <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2053599">by Bourgain, Katz, and myself</a> to improve the bound on three-dimensional Kakeya sets slightly from <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+5%2F2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq 5/2}&amp;fg=000000' title='{d &#92;geq 5/2}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+5%2F2%2B%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq 5/2+&#92;epsilon}&amp;fg=000000' title='{d &#92;geq 5/2+&#92;epsilon}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%26%2362%3B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon&gt;0}&amp;fg=000000' title='{&#92;epsilon&gt;0}&amp;fg=000000' class='latex' />. Nets Katz and I are exploring this direction further, and I hope to report on our results at some point in the future.
</p>
</p>
<p align="center"><b> &#8212;  3. Multiscale analysis  &#8212; </b></p>
<p>
The finite field model is considered simpler than the Euclidean model for a number of reasons, but one of the main ones is that the finite field model does not have the infinite number of scales that are present in the Euclidean setting. But one can reverse this viewpoint, and instead look for ways to <em>exploit</em> the multitude of scales available in the Euclidean case. One promising strategy in this direction is the <em>induction on scales</em> strategy, introduced by Bourgain for the closely related restriction and Bochner-Riesz problems, and developed further by Wolff. The basic idea here is to deduce a Kakeya estimate at scale <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> from the corresponding the Kakeya estimate at a coarser scale, such as <img src='http://s0.wp.com/latex.php?latex=%7B%5Csqrt%7B%5Cdelta%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sqrt{&#92;delta}}&amp;fg=000000' title='{&#92;sqrt{&#92;delta}}&amp;fg=000000' class='latex' />. We sketch the basic idea as follows. Suppose that the Kakeya conjecture was already established at scale <img src='http://s0.wp.com/latex.php?latex=%7B%5Csqrt%7B%5Cdelta%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sqrt{&#92;delta}}&amp;fg=000000' title='{&#92;sqrt{&#92;delta}}&amp;fg=000000' class='latex' />; roughly speaking, this means that any connection of <img src='http://s0.wp.com/latex.php?latex=%7B%5Csqrt%7B%5Cdelta%7D%5Ctimes+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sqrt{&#92;delta}&#92;times 1}&amp;fg=000000' title='{&#92;sqrt{&#92;delta}&#92;times 1}&amp;fg=000000' class='latex' /> tubes that point in different directions (i.e. <img src='http://s0.wp.com/latex.php?latex=%7B%5Csqrt%7B%5Cdelta%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sqrt{&#92;delta}}&amp;fg=000000' title='{&#92;sqrt{&#92;delta}}&amp;fg=000000' class='latex' />-separated directions) must be essentially disjoint. One can then argue that that this should implies the same assertion for <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta+%5Ctimes+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta &#92;times 1}&amp;fg=000000' title='{&#92;delta &#92;times 1}&amp;fg=000000' class='latex' /> tubes. To justify this, we assume that these &#8220;thin&#8221; <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta+%5Ctimes+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta &#92;times 1}&amp;fg=000000' title='{&#92;delta &#92;times 1}&amp;fg=000000' class='latex' /> tubes are arranged in a sufficiently &#8220;self-similar&#8221; (or &#8220;sticky&#8221;) fashion that they can be organised into &#8220;fat&#8221; <img src='http://s0.wp.com/latex.php?latex=%7B%5Csqrt%7B%5Cdelta%7D+%5Ctimes+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sqrt{&#92;delta} &#92;times 1}&amp;fg=000000' title='{&#92;sqrt{&#92;delta} &#92;times 1}&amp;fg=000000' class='latex' /> tubes, which themselves point in different (i.e. <img src='http://s0.wp.com/latex.php?latex=%7B%5Csqrt%7B%5Cdelta%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sqrt{&#92;delta}}&amp;fg=000000' title='{&#92;sqrt{&#92;delta}}&amp;fg=000000' class='latex' />-separated) directions. By the Kakeya hypotheses, these fat tubes are essentially disjoint. What about the thin tubes inside any given fat tube? Well, if one rescales a fat tube about its axis, dilating its lateral dimensions by <img src='http://s0.wp.com/latex.php?latex=%7B1%2F%5Csqrt%7B%5Cdelta%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1/&#92;sqrt{&#92;delta}}&amp;fg=000000' title='{1/&#92;sqrt{&#92;delta}}&amp;fg=000000' class='latex' /> so that it becomes a <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Ctimes+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;times 1}&amp;fg=000000' title='{1 &#92;times 1}&amp;fg=000000' class='latex' /> cylinder, then the thin tubes inside that fat tube essentially expand into fat tubes, which by hypothesis are again essentially disjoint. Since the thin tubes inside each fat tube are essentially disjoint, and the fat tubes are themselves essentially disjoint, the entire collection of thin tubes should be essentially disjoint as well.
</p>
<p>
This argument is remarkably difficult to make rigorous, in part because it is not obvious at all why the tubes should stick together in a self-similar way, but also because the fat tubes can intersect each other in a coplanar fashion, allowing the thin tubes in each fat tube to align up with an unusually high multiplicity. In <a href="http://www.ams.org/mathscinet-getitem?mr=1804528">a 63-page paper</a>, Nets Katz, Izabella Laba, and I managed to implement this idea, in conjunction with much of the incidence geometry and additive combinatorics arguments discussed in previous sections, but only managed to improve the (Euclidean Minkowski dimension) bound for three-dimensional Kakeya sets from <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+5%2F2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq 5/2}&amp;fg=000000' title='{d &#92;geq 5/2}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7Bd+%5Cgeq+5%2F2%2B10%5E%7B-10%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d &#92;geq 5/2+10^{-10}}&amp;fg=000000' title='{d &#92;geq 5/2+10^{-10}}&amp;fg=000000' class='latex' /> (with <a href="http://www.ams.org/mathscinet-getitem?mr=1866801">similar small improvements</a> in higher dimensions). Nevertheless, the multiscale method did lead to another method which gave further results, namely the heat flow method which we now discuss.
</p>
</p>
<p align="center"><b> &#8212;  4. Heat flow  &#8212; </b></p>
<p>
It is possible to eliminate the coplanarity and stickiness issues arising in the multiscale approach by replacing a linear Kakeya problem such as <a href="#max">(2)</a> with a multilinear variant, namely <a name="multimax">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5C%26%23124%3B+%5Cprod_%7Bj%3D1%7D%5En+%5Csum_%7Bi%3D1%7D%5EN+1_%7BT%5E%7B%28j%29%7D_i%7D+%5C%26%23124%3B_%7BL%5E%7B1%2F%28n-1%29%7D%28%7B%5Cmathbb+R%7D%5En%29%7D+%5Cll_%5Cepsilon+%5Cdelta%5E%7B-%5Cepsilon%7D+%5C+%5C+%5C+%5C+%5C+%285%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;&#124; &#92;prod_{j=1}^n &#92;sum_{i=1}^N 1_{T^{(j)}_i} &#92;&#124;_{L^{1/(n-1)}({&#92;mathbb R}^n)} &#92;ll_&#92;epsilon &#92;delta^{-&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (5)&amp;fg=000000' title='&#92;displaystyle  &#92;&#124; &#92;prod_{j=1}^n &#92;sum_{i=1}^N 1_{T^{(j)}_i} &#92;&#124;_{L^{1/(n-1)}({&#92;mathbb R}^n)} &#92;ll_&#92;epsilon &#92;delta^{-&#92;epsilon} &#92; &#92; &#92; &#92; &#92; (5)&amp;fg=000000' class='latex' /></p>
<p></a> whenever <img src='http://s0.wp.com/latex.php?latex=%7BT%5E%7B%28j%29%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{T^{(j)}_i}&amp;fg=000000' title='{T^{(j)}_i}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+j+%5Cleq+n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq j &#92;leq n}&amp;fg=000000' title='{1 &#92;leq j &#92;leq n}&amp;fg=000000' class='latex' /> are families of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta+%5Ctimes+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta &#92;times 1}&amp;fg=000000' title='{&#92;delta &#92;times 1}&amp;fg=000000' class='latex' />-tubes with <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />-separated directions, pointing close to the <img src='http://s0.wp.com/latex.php?latex=%7Bj%5E%7Bth%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{j^{th}}&amp;fg=000000' title='{j^{th}}&amp;fg=000000' class='latex' /> cardinal direction <img src='http://s0.wp.com/latex.php?latex=%7Be_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{e_j}&amp;fg=000000' title='{e_j}&amp;fg=000000' class='latex' />. Roughly speaking, the difference between <a href="#multimax">(5)</a> and <a href="#max">(2)</a> is that &#8220;coplanar&#8221; interactions have been abolished from <a href="#multimax">(5)</a> by fiat; the expression inside the norm consists only of products of tubes whose directions are in general position. With these difficulties eliminated, the induction on scales argument comes much closer to working properly. However, the degradation of constants is too poor; when passing from scale <img src='http://s0.wp.com/latex.php?latex=%7B%5Csqrt%7B%5Cdelta%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sqrt{&#92;delta}}&amp;fg=000000' title='{&#92;sqrt{&#92;delta}}&amp;fg=000000' class='latex' /> to scale <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />, the losses get squared, leading to a net loss of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%5E%7B-C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta^{-C}}&amp;fg=000000' title='{&#92;delta^{-C}}&amp;fg=000000' class='latex' /> for some constant <img src='http://s0.wp.com/latex.php?latex=%7BC%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C}&amp;fg=000000' title='{C}&amp;fg=000000' class='latex' />, which is unacceptable. However, it turns out (as shown <a href="http://www.ams.org/mathscinet-getitem?mr=2275834">by Bennett, Carbery, and myself</a>) that one can avoid all such losses by performing a continuous analogue of the induction on scales procedure, in which the tubes are replaced by distorted Gaussian functions, which are then continuously dilated about their major axis by heat flow, effectively increasing their thickness from <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> towards <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />. One then shows that (a suitable modification of) the expression in <a href="#multimax">(5)</a> increases along this heat flow, allowing one to deduce <a href="#multimax">(5)</a> at scale <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> from <a href="#multimax">(5)</a> at scale <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />. (One still incurs a <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%5E%7B-%5Cepsilon%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta^{-&#92;epsilon}}&amp;fg=000000' title='{&#92;delta^{-&#92;epsilon}}&amp;fg=000000' class='latex' /> loss because, for technical reasons, one can only obtain a monotonicity formula if one works with <img src='http://s0.wp.com/latex.php?latex=%7BL%5E%7B1%2F%28n-1%29-%5Cepsilon%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L^{1/(n-1)-&#92;epsilon}}&amp;fg=000000' title='{L^{1/(n-1)-&#92;epsilon}}&amp;fg=000000' class='latex' /> norms rather than <img src='http://s0.wp.com/latex.php?latex=%7BL%5E%7B1%2F%28n-1%29%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L^{1/(n-1)}}&amp;fg=000000' title='{L^{1/(n-1)}}&amp;fg=000000' class='latex' /> norms, requiring an interpolation to finish the job.)
</p>
<p>
Another way to view the heat flow argument is to take the (gaussian-smoothed) tubes <img src='http://s0.wp.com/latex.php?latex=%7B1_%7BT%5E%7B%28j%29%7D_i%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1_{T^{(j)}_i}}&amp;fg=000000' title='{1_{T^{(j)}_i}}&amp;fg=000000' class='latex' /> and slide them continuously toward the origin, so that at the final stage one obtains a &#8220;bush&#8221; of (smoothed) tubes through the origin, for which <a href="#multimax">(5)</a> is easy to establish. One can use essentially the same monotonicity computations as before to show that the <img src='http://s0.wp.com/latex.php?latex=%7BL%5E%7B1%2F%28n-1%29%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L^{1/(n-1)}}&amp;fg=000000' title='{L^{1/(n-1)}}&amp;fg=000000' class='latex' /> norm (or more precisely, a slightly weighted <img src='http://s0.wp.com/latex.php?latex=%7BL%5E%7B1%2F%28n-1%29-%5Cepsilon%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L^{1/(n-1)-&#92;epsilon}}&amp;fg=000000' title='{L^{1/(n-1)-&#92;epsilon}}&amp;fg=000000' class='latex' /> norm) in <a href="#multimax">(5)</a> increases along this sliding procedure, thus providing a sort of variational proof of <a href="#multimax">(5)</a>.
</p>
<p>
The multilinear Kakeya conjecture <a href="#multimax">(5)</a> implies its usual counterpart <a href="#max">(2)</a> in two dimensions by a standard angular rescaling argument, but unfortunately it seems to be strictly weaker than the linear Kakeya conjecture in three and higher dimensions, because it says nothing about the coplanar intersections which seem to be a major feature in the higher-dimensional Kakeya problem. (Indeed, <a href="#multimax">(5)</a> suggests, roughly speaking, that if there is a counterexample to the Kakeya conjecture, it will come from &#8220;plany&#8221; Kakeya sets, in which the lines that pass through a typical point will essentially all lie on a hyperplane.) Thus far we have been unable to find a monotonicity formula that can handle the coplanar case (and perhaps one should not expect to find one, given that the extremisers for the linear maximal function are likely to resemble Besicovitch sets and thus not be amenable to a simple variational argument). Nevertheless <a href="#multimax">(5)</a> is one of the few Kakeya-type estimates we have in higher dimensions whose exponents are sharp (up to epsilons).
</p>
</p>
<p align="center"><b> &#8212;  5. Algebraic geometry  &#8212; </b></p>
<p>
A striking breakthrough on the finite field side of the Kakeya problem was achieved recently <a href="http://arxiv.org/abs/0803.2336">by Dvir</a>, by introducing a high-degree analogue of the low-degree algebraic geometry used in the incidence geometry approaches (see also <a href="http://terrytao.wordpress.com/2008/03/24/dvirs-proof-of-the-finite-field-kakeya-conjecture/">this blog post</a> for more discussion). Indeed, one can view Dvir&#8217;s argument in the incidence geometry framework, with the two key incidence geometry inputs involving high-degree hypersurfaces. The first is the following (a high-degree analogue of Wolff&#8217;s observation that a plane contains only a one-dimensional family of lines; see also <a href="http://www.ams.org/mathscinet-getitem?mr=2031165">my paper with Mockenhaupt</a>, or <a href="http://www.ams.org/mathscinet-getitem?mr=2225076">another paper of mine</a>, for similar lemmas):
</p>
<blockquote><p><b>Lemma 1</b>  If <img src='http://s0.wp.com/latex.php?latex=%7BS%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S}&amp;fg=000000' title='{S}&amp;fg=000000' class='latex' /> is a degree <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> hypersurface in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+F%7D_q%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb F}_q^d}&amp;fg=000000' title='{{&#92;Bbb F}_q^d}&amp;fg=000000' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%7Bk+%26%2360%3B+q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k &lt; q}&amp;fg=000000' title='{k &lt; q}&amp;fg=000000' class='latex' />, then the number of possible directions of lines in <img src='http://s0.wp.com/latex.php?latex=%7BS%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S}&amp;fg=000000' title='{S}&amp;fg=000000' class='latex' /> is at most <img src='http://s0.wp.com/latex.php?latex=%7BO%28+k+q%5E%7Bd-2%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O( k q^{d-2} )}&amp;fg=000000' title='{O( k q^{d-2} )}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
Indeed, if one extends the degree <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> hypersurface <img src='http://s0.wp.com/latex.php?latex=%7BS%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S}&amp;fg=000000' title='{S}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+F%7D_q%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb F}_q^d}&amp;fg=000000' title='{{&#92;Bbb F}_q^d}&amp;fg=000000' class='latex' /> to a degree <img src='http://s0.wp.com/latex.php?latex=%7B%5Cleq+k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;leq k}&amp;fg=000000' title='{&#92;leq k}&amp;fg=000000' class='latex' /> hypersurface <img src='http://s0.wp.com/latex.php?latex=%7BS_%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S_&#92;infty}&amp;fg=000000' title='{S_&#92;infty}&amp;fg=000000' class='latex' /> at the hyperplane at infinity, then every line in <img src='http://s0.wp.com/latex.php?latex=%7BS%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S}&amp;fg=000000' title='{S}&amp;fg=000000' class='latex' /> will extend to a point in <img src='http://s0.wp.com/latex.php?latex=%7BS_%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S_&#92;infty}&amp;fg=000000' title='{S_&#92;infty}&amp;fg=000000' class='latex' />, representing the direction of <img src='http://s0.wp.com/latex.php?latex=%7BS%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S}&amp;fg=000000' title='{S}&amp;fg=000000' class='latex' />. (Here we use the fact that a polynomial of degree <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> cannot vanish at every point of a line unless it is identically zero (in the algebraic sense) on that line (and in particular, on the extension of that line to the plane at infinity.) So the number of possible directions is bounded by the cardinality of <img src='http://s0.wp.com/latex.php?latex=%7BS_%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S_&#92;infty}&amp;fg=000000' title='{S_&#92;infty}&amp;fg=000000' class='latex' />, which is <img src='http://s0.wp.com/latex.php?latex=%7BO%28k+q%5E%7Bd-2%7D%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(k q^{d-2})}&amp;fg=000000' title='{O(k q^{d-2})}&amp;fg=000000' class='latex' /> by the <a href="http://en.wikipedia.org/wiki/Schwartz&#37;E2&#37;80&#37;93Zippel_lemma_and_testing_polynomial_identities">Schwartz-Zippel lemma</a>.
</p>
<p>
The second ingredient can be viewed as a high-degree analogue of such facts as &#8220;two points determine a line&#8221; or &#8220;three points determine a plane&#8221;, and is as follows:
</p>
<blockquote><p><b>Lemma 2</b> <a name="inc"></a> If <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> is a set of points in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+F%7D_q%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb F}_q^d}&amp;fg=000000' title='{{&#92;Bbb F}_q^d}&amp;fg=000000' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> is contained inside a hypersurface of degree <img src='http://s0.wp.com/latex.php?latex=%7BO%28+%26%23124%3BE%26%23124%3B%5E%7B1%2Fd%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O( &#124;E&#124;^{1/d} )}&amp;fg=000000' title='{O( &#124;E&#124;^{1/d} )}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
Indeed, to prove this lemma one needs to find a non-trivial polynomial <img src='http://s0.wp.com/latex.php?latex=%7BP%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P}&amp;fg=000000' title='{P}&amp;fg=000000' class='latex' /> of degree at most <img src='http://s0.wp.com/latex.php?latex=%7BD%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{D}&amp;fg=000000' title='{D}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7BD+%3D+O%28%26%23124%3BE%26%23124%3B%5E%7B1%2Fd%7D%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{D = O(&#124;E&#124;^{1/d})}&amp;fg=000000' title='{D = O(&#124;E&#124;^{1/d})}&amp;fg=000000' class='latex' /> which vanishes at every point in <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' />. But the vector space of polynomials of degree at most <img src='http://s0.wp.com/latex.php?latex=%7BD%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{D}&amp;fg=000000' title='{D}&amp;fg=000000' class='latex' /> has dimension about <img src='http://s0.wp.com/latex.php?latex=%7BD%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{D^d}&amp;fg=000000' title='{D^d}&amp;fg=000000' class='latex' />, and the requirement that <img src='http://s0.wp.com/latex.php?latex=%7BP%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P}&amp;fg=000000' title='{P}&amp;fg=000000' class='latex' /> vanish at every point in <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> imposes <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BE%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;E&#124;}&amp;fg=000000' title='{&#124;E&#124;}&amp;fg=000000' class='latex' /> linear constraints, so by linear algebra one can find such a polynomial as soon as the <img src='http://s0.wp.com/latex.php?latex=%7BD%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{D^d}&amp;fg=000000' title='{D^d}&amp;fg=000000' class='latex' /> is much larger than <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' />, and the claim follows.
</p>
<p>
Putting the two lemmas together, we see that any Kakeya set <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> (which, by definition, contains lines in <img src='http://s0.wp.com/latex.php?latex=%7B%5Cgg+q%5E%7Bd-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;gg q^{d-1}}&amp;fg=000000' title='{&#92;gg q^{d-1}}&amp;fg=000000' class='latex' /> directions) cannot be contained in any hypersurface of degree much less than <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' />, and thus must have cardinality <img src='http://s0.wp.com/latex.php?latex=%7B%5Cgg+q%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;gg q^n}&amp;fg=000000' title='{&#92;gg q^n}&amp;fg=000000' class='latex' />. (In fact, the optimal size of a Kakeya set is now known to be within a factor of <img src='http://s0.wp.com/latex.php?latex=%7B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2}&amp;fg=000000' title='{2}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7B2%5E%7B-n%7D+q%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^{-n} q^n}&amp;fg=000000' title='{2^{-n} q^n}&amp;fg=000000' class='latex' />, a result <a href="http://arxiv.org/abs/0901.2529">of Dvir, Kopparty, Saraf, and Sudan</a>.) The argument can also be adapted to establish the maximal function estimate in finite field vector spaces, and also generalises to other families of curves in algebraic varieties, as was done recently <a href="http://terrytao.wordpress.com/2009/03/12/the-kakeya-set-and-maximal-conjectures-for-algebraic-varieties-over-finite-fields/">by Ellenberg, Oberlin, and myself</a> (see this blog post for further discussion). There are even some tentative indications that these algebraic geometry methods will adapt well to more abstract algebraic settings, such as that of schemes.
</p>
</p>
<p align="center"><b> &#8212;  6. Algebraic topology  &#8212; </b></p>
<p>
It was thought that the high-degree algebraic geometry methods of Dvir and others were strongly dependent on the discrete nature of the finite field setting, and would not extend to continuous settings such as that of Euclidean space. Very recently, however, <a href="http://arxiv.org/abs/0811.2251">Larry Guth</a> managed to partially extend Dvir&#8217;s results to this case. One key observation is to replace Lemma <a href="#inc">2</a> by a topological counterpart, namely
</p>
<blockquote><p><b>Lemma 3 (Polynomial Ham Sandwich theorem)</b>  If <img src='http://s0.wp.com/latex.php?latex=%7BB_1%2C%5Cldots%2CB_N%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B_1,&#92;ldots,B_N}&amp;fg=000000' title='{B_1,&#92;ldots,B_N}&amp;fg=000000' class='latex' /> are a collection of bounded open sets in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+R%7D%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb R}^d}&amp;fg=000000' title='{{&#92;mathbb R}^d}&amp;fg=000000' class='latex' />, then there exists a hypersurface <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+x+%5Cin+%7B%5Cmathbb+R%7D%5Ed%3A+P%28x%29%3D0%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ x &#92;in {&#92;mathbb R}^d: P(x)=0&#92;}}&amp;fg=000000' title='{&#92;{ x &#92;in {&#92;mathbb R}^d: P(x)=0&#92;}}&amp;fg=000000' class='latex' /> of degree <img src='http://s0.wp.com/latex.php?latex=%7BO%28N%5E%7B1%2Fd%7D%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(N^{1/d})}&amp;fg=000000' title='{O(N^{1/d})}&amp;fg=000000' class='latex' /> which bisects each of these open sets (thus the sets <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+x+%5Cin+B_i%3A+P%28x%29+%26%2362%3B+0+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ x &#92;in B_i: P(x) &gt; 0 &#92;}}&amp;fg=000000' title='{&#92;{ x &#92;in B_i: P(x) &gt; 0 &#92;}}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+x+%5Cin+B_i%3A+P%28x%29+%26%2360%3B+0+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ x &#92;in B_i: P(x) &lt; 0 &#92;}}&amp;fg=000000' title='{&#92;{ x &#92;in B_i: P(x) &lt; 0 &#92;}}&amp;fg=000000' class='latex' /> have equal volume). </p></blockquote>
</p>
<p>
Note that if one shrinks these bounded open sets <img src='http://s0.wp.com/latex.php?latex=%7BB_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B_i}&amp;fg=000000' title='{B_i}&amp;fg=000000' class='latex' /> to a tiny neighbourhood of points <img src='http://s0.wp.com/latex.php?latex=%7Bx_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x_i}&amp;fg=000000' title='{x_i}&amp;fg=000000' class='latex' />, then in the limit one recovers the analogue of Lemma <a href="#inc">2</a> over the reals <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+R%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb R}}&amp;fg=000000' title='{{&#92;mathbb R}}&amp;fg=000000' class='latex' /> (note that the proof of Lemma <a href="#inc">2</a> is valid over any field). This allows one to adapt Dvir&#8217;s method to continuous settings, basically by replacing points by balls. (There is another ingredient needed, which is an isoperimetric-type inequality that asserts that if a hypersurface bisects a ball, then the intersection of that hypersurface with the ball has a large area; see <a href="http://terrytao.wordpress.com/2008/11/27/the-kakeya-conjecture-and-the-ham-sandwich-theorem/">this blog post</a> for further discussion.)
</p>
<p>
Unfortunately, the arguments, like the heat flow arguments, so far are restricted to the non-coplanar case, as otherwise certain Jacobian factors begin to enter in an unfavorable fashion in the estimates, so the Euclidean linear Kakeya problem has so far not been in impacted by these results. However, the arguments of Guth are able to recover the multilinear estimate <a href="#multimax">(5)</a>, and in fact can also remove the <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%5E%7B-%5Cepsilon%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta^{-&#92;epsilon}}&amp;fg=000000' title='{&#92;delta^{-&#92;epsilon}}&amp;fg=000000' class='latex' /> loss, thus obtaining a very sharp estimate. Even more recently, <a href="http://arxiv.org/abs/0812.1043">Guth and Katz have managed</a> to solve a discrete analogue of the Euclidean Kakeya problem in which coplanarity has been eliminated by fiat, namely the <em>joints problem</em> of Sharir. Define a <em>joint</em> in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+R%7D%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb R}^d}&amp;fg=000000' title='{{&#92;mathbb R}^d}&amp;fg=000000' class='latex' /> to be a configuration of <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' /> concurrent lines, which do not all lie in a hyperplane. <a href="http://www.ams.org/mathscinet-getitem?mr=1280600">Sharir conjectured</a> that a collection of <img src='http://s0.wp.com/latex.php?latex=%7BN%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{N}&amp;fg=000000' title='{N}&amp;fg=000000' class='latex' /> lines in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+R%7D%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb R}^d}&amp;fg=000000' title='{{&#92;mathbb R}^d}&amp;fg=000000' class='latex' /> could form at most <img src='http://s0.wp.com/latex.php?latex=%7BO%28+N%5E%7Bd%2F%28d-1%29%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O( N^{d/(d-1)} )}&amp;fg=000000' title='{O( N^{d/(d-1)} )}&amp;fg=000000' class='latex' /> joints (this is optimal, as can be seen by looking at lines parallel to the coordinate axes passing through a discrete cube of sidelength <img src='http://s0.wp.com/latex.php?latex=%7BO%28N%5E%7B1%2F%28d-1%29%7D%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(N^{1/(d-1)})}&amp;fg=000000' title='{O(N^{1/(d-1)})}&amp;fg=000000' class='latex' /> and unit spacing); this conjecture has now been verified by Guth and Katz, in the three-dimensional case at least. (The estimate <a href="#multimax">(5)</a> was already observed to be closely related to the joints problem <a href="http://www.ams.org/mathscinet-getitem?mr=2275834">by Bennett, Carbery, and myself</a>, although it only gives a strong result in the case when one has a quantitative lower bound on the non-coplanarity of the joints.)
</p>
<p>
In summary, there has been an influx of techniques from many different areas of mathematics that have each contributed significant progress on the Kakeya conjecture. One may still need a couple more key ideas before the problem is finally solved in full, and such ideas may come from a quite unexpected source, but with the current rate of progress I am now optimistic that we will continue to see significant advances in this area in the near future.
</p></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Szemeredi's regularity lemma via the correspondence principle]]></title>
<link>http://terrytao.wordpress.com/2009/05/08/szemeredis-regularity-lemma-via-the-correspondence-principle/</link>
<pubDate>Sat, 09 May 2009 04:12:53 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/05/08/szemeredis-regularity-lemma-via-the-correspondence-principle/</guid>
<description><![CDATA[In a previous post, we discussed the Szemer&eacute;di regularity lemma, and how a given graph could]]></description>
<content:encoded><![CDATA[<p>
 In a <a class="snap_noshots" href="http://terrytao.wordpress.com/2009/04/26/szemeredis-regularity-lemma-via-random-partitions/">previous post</a>, we discussed the <a href="http://en.wikipedia.org/wiki/Szemer&#37;C3&#37;A9di_regularity_lemma">Szemer&#233;di regularity lemma</a>, and how a given graph could be regularised by partitioning the vertex set into random neighbourhoods. More precisely, we gave a proof of
</p>
<blockquote><p><b>Lemma 1 (Regularity lemma via random neighbourhoods)</b> <a name="regularity3"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon &gt; 0}&amp;fg=000000' title='{&#92;varepsilon &gt; 0}&amp;fg=000000' class='latex' />. Then there exists integers <img src='http://s0.wp.com/latex.php?latex=%7BM_1%2C%5Cldots%2CM_m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_1,&#92;ldots,M_m}&amp;fg=000000' title='{M_1,&#92;ldots,M_m}&amp;fg=000000' class='latex' /> with the following property: whenever <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' /> be a graph on finitely many vertices, if one selects one of the integers <img src='http://s0.wp.com/latex.php?latex=%7BM_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_r}&amp;fg=000000' title='{M_r}&amp;fg=000000' class='latex' /> at random from <img src='http://s0.wp.com/latex.php?latex=%7BM_1%2C%5Cldots%2CM_m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_1,&#92;ldots,M_m}&amp;fg=000000' title='{M_1,&#92;ldots,M_m}&amp;fg=000000' class='latex' />, then selects <img src='http://s0.wp.com/latex.php?latex=%7BM_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_r}&amp;fg=000000' title='{M_r}&amp;fg=000000' class='latex' /> vertices <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_%7BM_r%7D+%5Cin+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_{M_r} &#92;in V}&amp;fg=000000' title='{v_1,&#92;ldots,v_{M_r} &#92;in V}&amp;fg=000000' class='latex' /> uniformly from <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> at random, then the <img src='http://s0.wp.com/latex.php?latex=%7B2%5E%7BM_r%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^{M_r}}&amp;fg=000000' title='{2^{M_r}}&amp;fg=000000' class='latex' /> vertex cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM_r%7D_1%2C%5Cldots%2CV%5E%7BM_r%7D_%7B2%5E%7BM_r%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M_r}_1,&#92;ldots,V^{M_r}_{2^{M_r}}}&amp;fg=000000' title='{V^{M_r}_1,&#92;ldots,V^{M_r}_{2^{M_r}}}&amp;fg=000000' class='latex' /> (some of which can be empty) generated by the vertex neighbourhoods <img src='http://s0.wp.com/latex.php?latex=%7BA_t+%3A%3D+%5C%7B+v+%5Cin+V%3A+%28v%2Cv_t%29+%5Cin+E+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t := &#92;{ v &#92;in V: (v,v_t) &#92;in E &#92;}}&amp;fg=000000' title='{A_t := &#92;{ v &#92;in V: (v,v_t) &#92;in E &#92;}}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+t+%5Cleq+M_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq t &#92;leq M_r}&amp;fg=000000' title='{1 &#92;leq t &#92;leq M_r}&amp;fg=000000' class='latex' />, will obey the regularity property <a name="regular">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Csum_%7B%28V_i%2CV_j%29+%5Chbox%7B+not+%7D+%5Cvarepsilon-%5Chbox%7Bregular%7D%7D+%26%23124%3BV_i%26%23124%3B+%26%23124%3BV_j%26%23124%3B+%5Cleq+%5Cvarepsilon+%26%23124%3BV%26%23124%3B%5E2+%5C+%5C+%5C+%5C+%5C+%281%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;sum_{(V_i,V_j) &#92;hbox{ not } &#92;varepsilon-&#92;hbox{regular}} &#124;V_i&#124; &#124;V_j&#124; &#92;leq &#92;varepsilon &#124;V&#124;^2 &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' title='&#92;displaystyle  &#92;sum_{(V_i,V_j) &#92;hbox{ not } &#92;varepsilon-&#92;hbox{regular}} &#124;V_i&#124; &#124;V_j&#124; &#92;leq &#92;varepsilon &#124;V&#124;^2 &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' class='latex' /></p>
<p></a> with probability at least <img src='http://s0.wp.com/latex.php?latex=%7B1-O%28%5Cvarepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-O(&#92;varepsilon)}&amp;fg=000000' title='{1-O(&#92;varepsilon)}&amp;fg=000000' class='latex' />, where the sum is over all pairs <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+i+%5Cleq+j+%5Cleq+k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq i &#92;leq j &#92;leq k}&amp;fg=000000' title='{1 &#92;leq i &#92;leq j &#92;leq k}&amp;fg=000000' class='latex' /> for which <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> is not <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon}&amp;fg=000000' title='{&#92;varepsilon}&amp;fg=000000' class='latex' />-regular between <img src='http://s0.wp.com/latex.php?latex=%7BV_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i}&amp;fg=000000' title='{V_i}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BV_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_j}&amp;fg=000000' title='{V_j}&amp;fg=000000' class='latex' />. [Recall that a pair <img src='http://s0.wp.com/latex.php?latex=%7B%28V_i%2CV_j%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(V_i,V_j)}&amp;fg=000000' title='{(V_i,V_j)}&amp;fg=000000' class='latex' /> is <em><img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon}&amp;fg=000000' title='{&#92;varepsilon}&amp;fg=000000' class='latex' />-regular</em> for <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> if one has
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3Bd%28+A%2C+B+%29+-+d%28+V_i%2C+V_j+%29%26%23124%3B+%5Cleq+%5Cvarepsilon%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;d( A, B ) - d( V_i, V_j )&#124; &#92;leq &#92;varepsilon&amp;fg=000000' title='&#92;displaystyle  &#124;d( A, B ) - d( V_i, V_j )&#124; &#92;leq &#92;varepsilon&amp;fg=000000' class='latex' /></p>
<p> for any <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Csubset+V_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;subset V_i}&amp;fg=000000' title='{A &#92;subset V_i}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BB+%5Csubset+V_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B &#92;subset V_j}&amp;fg=000000' title='{B &#92;subset V_j}&amp;fg=000000' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BA%26%23124%3B+%5Cgeq+%5Cvarepsilon+%26%23124%3BV_i%26%23124%3B%2C+%26%23124%3BB%26%23124%3B+%5Cgeq+%5Cvarepsilon+%26%23124%3BV_j%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;A&#124; &#92;geq &#92;varepsilon &#124;V_i&#124;, &#124;B&#124; &#92;geq &#92;varepsilon &#124;V_j&#124;}&amp;fg=000000' title='{&#124;A&#124; &#92;geq &#92;varepsilon &#124;V_i&#124;, &#124;B&#124; &#92;geq &#92;varepsilon &#124;V_j&#124;}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7Bd%28A%2CB%29+%3A%3D+%26%23124%3BE+%5Ccap+%28A+%5Ctimes+B%29%26%23124%3B%2F%26%23124%3BA%26%23124%3B+%26%23124%3BB%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d(A,B) := &#124;E &#92;cap (A &#92;times B)&#124;/&#124;A&#124; &#124;B&#124;}&amp;fg=000000' title='{d(A,B) := &#124;E &#92;cap (A &#92;times B)&#124;/&#124;A&#124; &#124;B&#124;}&amp;fg=000000' class='latex' /> is the density of edges between <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BB%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B}&amp;fg=000000' title='{B}&amp;fg=000000' class='latex' />.] </p></blockquote>
</p>
<p>
The proof was a combinatorial one, based on the standard energy increment argument.
</p>
<p>
In this post I would like to discuss an alternate approach to the regularity lemma, which is an infinitary approach passing through a graph-theoretic version of the Furstenberg correspondence principle (mentioned briefly in <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/08/30/the-correspondence-principle-and-finitary-ergodic-theory/">this earlier post of mine</a>). While this approach superficially looks quite different from the combinatorial approach, it in fact uses many of the same ingredients, most notably a reliance on random neighbourhoods to regularise the graph. This approach was introduced <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.CO/0602037">by myself</a> back in 2006, and used <a class="snap_noshots" href="http://arxiv.org/abs/0801.1698">by Austin</a> and <a class="snap_noshots" href="http://arxiv.org/abs/0801.2179">by Austin and myself</a> to establish some property testing results for hypergraphs; more recently, a closely related infinitary hypergraph removal lemma developed in the 2006 paper was also used by Austin to give <a class="snap_noshots" href="http://arxiv.org/abs/0808.2267">new proofs of the multidimensional Szemeredi theorem</a> and <a class="snap_noshots" href="http://arxiv.org/abs/0903.1633">of the density Hales-Jewett theorem</a> (the latter being a spinoff of the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1 project</a>).
</p>
<p>
For various technical reasons we will not be able to use the correspondence principle to recover Lemma <a href="#regularity3">1</a> in its full strength; instead, we will establish the following slightly weaker variant.
</p>
<blockquote><p><b>Lemma 2 (Regularity lemma via random neighbourhoods, weak version)</b> <a name="regularity4"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon &gt; 0}&amp;fg=000000' title='{&#92;varepsilon &gt; 0}&amp;fg=000000' class='latex' />. Then there exist an integer <img src='http://s0.wp.com/latex.php?latex=%7BM_%2A%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_*}&amp;fg=000000' title='{M_*}&amp;fg=000000' class='latex' /> with the following property: whenever <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' /> be a graph on finitely many vertices, there exists <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+M+%5Cleq+M_%2A%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq M &#92;leq M_*}&amp;fg=000000' title='{1 &#92;leq M &#92;leq M_*}&amp;fg=000000' class='latex' /> such that if one selects <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' /> vertices <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_%7BM%7D+%5Cin+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_{M} &#92;in V}&amp;fg=000000' title='{v_1,&#92;ldots,v_{M} &#92;in V}&amp;fg=000000' class='latex' /> uniformly from <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> at random, then the <img src='http://s0.wp.com/latex.php?latex=%7B2%5E%7BM%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^{M}}&amp;fg=000000' title='{2^{M}}&amp;fg=000000' class='latex' /> vertex cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM%7D_1%2C%5Cldots%2CV%5E%7BM%7D_%7B2%5E%7BM%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M}_1,&#92;ldots,V^{M}_{2^{M}}}&amp;fg=000000' title='{V^{M}_1,&#92;ldots,V^{M}_{2^{M}}}&amp;fg=000000' class='latex' /> generated by the vertex neighbourhoods <img src='http://s0.wp.com/latex.php?latex=%7BA_t+%3A%3D+%5C%7B+v+%5Cin+V%3A+%28v%2Cv_t%29+%5Cin+E+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t := &#92;{ v &#92;in V: (v,v_t) &#92;in E &#92;}}&amp;fg=000000' title='{A_t := &#92;{ v &#92;in V: (v,v_t) &#92;in E &#92;}}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+t+%5Cleq+M%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq t &#92;leq M}&amp;fg=000000' title='{1 &#92;leq t &#92;leq M}&amp;fg=000000' class='latex' />, will obey the regularity property <a href="#regular">(1)</a> with probability at least <img src='http://s0.wp.com/latex.php?latex=%7B1-%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-&#92;varepsilon}&amp;fg=000000' title='{1-&#92;varepsilon}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
Roughly speaking, Lemma <a href="#regularity3">1</a> asserts that one can regularise a large graph <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> with high probability by using <img src='http://s0.wp.com/latex.php?latex=%7BM_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_r}&amp;fg=000000' title='{M_r}&amp;fg=000000' class='latex' /> random neighbourhoods, where <img src='http://s0.wp.com/latex.php?latex=%7BM_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_r}&amp;fg=000000' title='{M_r}&amp;fg=000000' class='latex' /> is chosen at random from one of a number of choices <img src='http://s0.wp.com/latex.php?latex=%7BM_1%2C%5Cldots%2CM_m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_1,&#92;ldots,M_m}&amp;fg=000000' title='{M_1,&#92;ldots,M_m}&amp;fg=000000' class='latex' />; in contrast, the weaker Lemma <a href="#regularity4">2</a> asserts that one can regularise a large graph <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> with high probability by using <em>some</em> integer <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' /> from <img src='http://s0.wp.com/latex.php?latex=%7B1%2C%5Cldots%2CM_%2A%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1,&#92;ldots,M_*}&amp;fg=000000' title='{1,&#92;ldots,M_*}&amp;fg=000000' class='latex' />, but the exact choice of <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' /> depends on <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' />, and it is not guaranteed that a randomly chosen <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' /> will be likely to work. While Lemma <a href="#regularity4">2</a> is strictly weaker than Lemma <a href="#regularity3">1</a>, it still implies the (weighted) Szemer&#233;di regularity lemma (Lemma 2 from the <a class="snap_noshots" href="http://terrytao.wordpress.com/2009/04/26/szemeredis-regularity-lemma-via-random-partitions/">previous post</a>).
</p>
<p>
<!--more-->
</p>
</p>
<p align="center"><b> &#8212;  1. The graph correspondence principle  &#8212; </b></p>
<p>
The first key tool in this argument is the <em>graph correspondence principle</em>, which takes a sequence of (increasingly large) graphs and uses random sampling to extract an infinitary limit object, which will turn out to be an infinite but random (and, crucially, <em>exchangeable</em>) graph. This concept of a graph limit is related to (though slightly different from) the &#8220;graphons&#8221; used as graph limits in the <a class="snap_noshots" href="http://arxiv.org/abs/math/0408173">work of Lovasz and Szegedy</a>, or the ultraproducts used in the work of <a class="snap_noshots" href="http://arxiv.org/abs/0810.4062">Elek and Szegedy</a>. It also seems to be related to the concept of an elementary limit that I discussed in <a class="snap_noshots" href="http://terrytao.wordpress.com/2009/04/10/the-completeness-and-compactness-theorems-of-first-order-logic/">this post</a>, though this connection is still rather tentative.
</p>
<p>
The correspondence works as follows. We start with a finite, deterministic graph <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' />. We can then form an infinite, random graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' class='latex' /> from this graph by the following recipe:
</p>
<p><ul>
<li> The vertex set of <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> will be the integers <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+Z%7D+%3D+%5C%7B+-2%2C-1%2C0%2C1%2C2%2C%5Cldots%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb Z} = &#92;{ -2,-1,0,1,2,&#92;ldots&#92;}}&amp;fg=000000' title='{{&#92;mathbb Z} = &#92;{ -2,-1,0,1,2,&#92;ldots&#92;}}&amp;fg=000000' class='latex' />. </li>
<li> For every integer <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />, we randomly select a vertex <img src='http://s0.wp.com/latex.php?latex=%7Bv_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_n}&amp;fg=000000' title='{v_n}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' />, uniformly and independently at random. (Note that there will be many collisions, i.e. integers <img src='http://s0.wp.com/latex.php?latex=%7Bn%2Cm%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n,m}&amp;fg=000000' title='{n,m}&amp;fg=000000' class='latex' /> for which <img src='http://s0.wp.com/latex.php?latex=%7Bv_n%3Dv_m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_n=v_m}&amp;fg=000000' title='{v_n=v_m}&amp;fg=000000' class='latex' />, but these collisions will become asymptotically negligible in the limit <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV%26%23124%3B+%5Crightarrow+%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' title='{&#124;V&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' class='latex' />.) </li>
<li> We then define the edge set <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat E}&amp;fg=000000' title='{&#92;hat E}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> by declaring <img src='http://s0.wp.com/latex.php?latex=%7B%28n%2Cm%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(n,m)}&amp;fg=000000' title='{(n,m)}&amp;fg=000000' class='latex' /> to be an edge on <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat E}&amp;fg=000000' title='{&#92;hat E}&amp;fg=000000' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=%7B%28v_n%2Cv_m%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(v_n,v_m)}&amp;fg=000000' title='{(v_n,v_m)}&amp;fg=000000' class='latex' /> is an edge in <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> (which in particular requires <img src='http://s0.wp.com/latex.php?latex=%7Bv_n+%5Cneq+v_m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_n &#92;neq v_m}&amp;fg=000000' title='{v_n &#92;neq v_m}&amp;fg=000000' class='latex' />).
</li>
</ul>
<p>
More succinctly, <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> is the pullback of <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> under a random map from <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb Z}}&amp;fg=000000' title='{{&#92;mathbb Z}}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' />.
</p>
<p>
The random graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> captures all the &#8220;local&#8221; information of <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' />, while obscuring all the &#8220;global&#8221; information. For instance, the edge density of <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> is essentially just the probability that a given edge, say <img src='http://s0.wp.com/latex.php?latex=%7B%281%2C2%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(1,2)}&amp;fg=000000' title='{(1,2)}&amp;fg=000000' class='latex' />, lies in <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />. (There is a small error term due to the presence of collisions, but this goes away in the limit <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV%26%23124%3B+%5Crightarrow+%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' title='{&#124;V&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' class='latex' />.) Similarly, the triangle density of <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> is essentially the probability that a given triangle, say <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B%281%2C2%29%2C+%282%2C3%29%2C+%283%2C1%29%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{(1,2), (2,3), (3,1)&#92;}}&amp;fg=000000' title='{&#92;{(1,2), (2,3), (3,1)&#92;}}&amp;fg=000000' class='latex' />, lies in <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />. On the other hand, it is difficult to read off global properties of <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' />, such as being connected or <img src='http://s0.wp.com/latex.php?latex=%7B4%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{4}&amp;fg=000000' title='{4}&amp;fg=000000' class='latex' />-colourable, just from <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />.
</p>
<p>
At first glance, it may seem a poor bargain to trade in a finite deterministic graph <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> for an infinite random graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />, which is a more complicated and less elementary object. However, there are three major advantages of working with <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> rather than <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' />:
</p>
<p><ul>
<li> <b>Exchangeability</b>. The probability distribution of <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> has a powerful symmetry or <a href="http://en.wikipedia.org/wiki/Exchangeable_random_variables">exchangeability</a> property: if one takes the random graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> and interchanges any two vertices in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb Z}}&amp;fg=000000' title='{{&#92;mathbb Z}}&amp;fg=000000' class='latex' />, e.g. <img src='http://s0.wp.com/latex.php?latex=%7B3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{3}&amp;fg=000000' title='{3}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B5%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{5}&amp;fg=000000' title='{5}&amp;fg=000000' class='latex' />, one obtains a new graph which is not equal to <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />, but nevertheless has the same probability distribution as <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />, basically because the <img src='http://s0.wp.com/latex.php?latex=%7Bv_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_n}&amp;fg=000000' title='{v_n}&amp;fg=000000' class='latex' /> were selected in an <a href="http://en.wikipedia.org/wiki/Independent_and_identically_distributed">iid</a> manner. More generally, given any permutation <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%3A+%7B%5Cmathbb+Z%7D+%5Crightarrow+%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma: {&#92;mathbb Z} &#92;rightarrow {&#92;mathbb Z}}&amp;fg=000000' title='{&#92;sigma: {&#92;mathbb Z} &#92;rightarrow {&#92;mathbb Z}}&amp;fg=000000' class='latex' />, the pullback <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%5E%2A%28%5Chat+G%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma^*(&#92;hat G)}&amp;fg=000000' title='{&#92;sigma^*(&#92;hat G)}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> by <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' /> has the same probability distribution as <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />; thus we have a measure-preserving action of the symmetric group <img src='http://s0.wp.com/latex.php?latex=%7BS_%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{S_&#92;infty}&amp;fg=000000' title='{S_&#92;infty}&amp;fg=000000' class='latex' />, which places us in the general framework of ergodic theory. </li>
<li> <b>Limits</b>. The space of probability measures on the space <img src='http://s0.wp.com/latex.php?latex=%7B2%5E%7B%5Cbinom%7B%7B%5Cmathbb+Z%7D%7D%7B2%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^{&#92;binom{{&#92;mathbb Z}}{2}}}&amp;fg=000000' title='{2^{&#92;binom{{&#92;mathbb Z}}{2}}}&amp;fg=000000' class='latex' /> of infinite graphs is sequentially compact; given any sequence <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_n+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_n = ({&#92;mathbb Z}, &#92;hat E_n)}&amp;fg=000000' title='{&#92;hat G_n = ({&#92;mathbb Z}, &#92;hat E_n)}&amp;fg=000000' class='latex' /> of infinite random graphs, one can find a subsequence <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_%7Bn_j%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_{n_j}}&amp;fg=000000' title='{&#92;hat G_{n_j}}&amp;fg=000000' class='latex' /> which converges in the <a href="http://en.wikipedia.org/wiki/Vague_topology">vague topology</a> to another infinite random graph. What this means is that given any event <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> on infinite graphs that involve only finitely many of the edges, the probability that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_%7Bn_j%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_{n_j}}&amp;fg=000000' title='{&#92;hat G_{n_j}}&amp;fg=000000' class='latex' /> obeys <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> converges to the probability that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> obeys <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' />. (Thus, for instance, the probability that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_%7Bn_j%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_{n_j}}&amp;fg=000000' title='{&#92;hat G_{n_j}}&amp;fg=000000' class='latex' /> contains the triangle <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B%281%2C2%29%2C+%282%2C3%29%2C+%283%2C1%29%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{(1,2), (2,3), (3,1)&#92;}}&amp;fg=000000' title='{&#92;{(1,2), (2,3), (3,1)&#92;}}&amp;fg=000000' class='latex' /> will converge to the probability that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> contains the same triangle.) Note that properties that involve infinitely many edges (e.g. connectedness) need not be preserved under vague limits. </li>
<li> <b>Factors</b>. The underlying probability space for the random variable <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> is the space <img src='http://s0.wp.com/latex.php?latex=%7B2%5E%7B%5Cbinom%7B%7B%5Cmathbb+Z%7D%7D%7B2%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^{&#92;binom{{&#92;mathbb Z}}{2}}}&amp;fg=000000' title='{2^{&#92;binom{{&#92;mathbb Z}}{2}}}&amp;fg=000000' class='latex' /> of infinite graphs, and it is natural to give this space the Borel <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebra <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;mathbb Z}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;mathbb Z}}&amp;fg=000000' class='latex' />, which is the <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebra generated by the <a href="http://en.wikipedia.org/wiki/Cylinder_set">cylinder events</a> &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29+%5Cin+%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j) &#92;in &#92;hat G}&amp;fg=000000' title='{(i,j) &#92;in &#92;hat G}&amp;fg=000000' class='latex' />&#8221; for <img src='http://s0.wp.com/latex.php?latex=%7Bi%2Cj+%5Cin+%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i,j &#92;in {&#92;mathbb Z}}&amp;fg=000000' title='{i,j &#92;in {&#92;mathbb Z}}&amp;fg=000000' class='latex' />. But this <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebra also has a number of useful sub-<img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebras or <em>factors</em>, representing various partial information on the graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />. In particular, given any subset <img src='http://s0.wp.com/latex.php?latex=%7BI%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{I}&amp;fg=000000' title='{I}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb Z}}&amp;fg=000000' title='{{&#92;mathbb Z}}&amp;fg=000000' class='latex' />, one can create the factor <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_I%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_I}&amp;fg=000000' title='{{&#92;mathcal B}_I}&amp;fg=000000' class='latex' />, defined as the <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebra generated by the events &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29+%5Cin+%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j) &#92;in &#92;hat G}&amp;fg=000000' title='{(i,j) &#92;in &#92;hat G}&amp;fg=000000' class='latex' />&#8221; for <img src='http://s0.wp.com/latex.php?latex=%7Bi%2Cj+%5Cin+I%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i,j &#92;in I}&amp;fg=000000' title='{i,j &#92;in I}&amp;fg=000000' class='latex' />. Thus for instance, the event that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> contains the triangle is measurable in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B1%2C2%2C3%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{1,2,3&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{1,2,3&#92;}}}&amp;fg=000000' class='latex' />, but not in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B1%2C2%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{1,2&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{1,2&#92;}}}&amp;fg=000000' class='latex' />. One can also look at compound factors such as <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_I+%5Cwedge+%7B%5Cmathcal+B%7D_J%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_I &#92;wedge {&#92;mathcal B}_J}&amp;fg=000000' title='{{&#92;mathcal B}_I &#92;wedge {&#92;mathcal B}_J}&amp;fg=000000' class='latex' />, the factor generated by the union of <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_I%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_I}&amp;fg=000000' title='{{&#92;mathcal B}_I}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_J%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_J}&amp;fg=000000' title='{{&#92;mathcal B}_J}&amp;fg=000000' class='latex' />. For instance, the event that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> contains the edges <img src='http://s0.wp.com/latex.php?latex=%7B%281%2C2%29%2C+%281%2C3%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(1,2), (1,3)}&amp;fg=000000' title='{(1,2), (1,3)}&amp;fg=000000' class='latex' /> is measurable in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B1%2C2%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_%7B%5C%7B1%2C3%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{1,2&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{1,3&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{1,2&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{1,3&#92;}}}&amp;fg=000000' class='latex' />, but the event that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> contains the triangle <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B%281%2C2%29%2C+%282%2C3%29%2C+%283%2C1%29%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{(1,2), (2,3), (3,1)&#92;}}&amp;fg=000000' title='{&#92;{(1,2), (2,3), (3,1)&#92;}}&amp;fg=000000' class='latex' /> is not.
</li>
</ul>
<p>
The connection between the infinite random graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> and partitioning through random neighbourhoods comes when contemplating the relative difference between a factor such as <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-n%2C%5Cldots%2C-1%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-n,&#92;ldots,-1&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-n,&#92;ldots,-1&#92;}}}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-n%2C%5Cldots%2C-1%5C%7D+%5Ccup+%5C%7B1%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-n,&#92;ldots,-1&#92;} &#92;cup &#92;{1&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-n,&#92;ldots,-1&#92;} &#92;cup &#92;{1&#92;}}}&amp;fg=000000' class='latex' /> (say). The latter factor is generated by the former factor, together with the events &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%281%2C-i%29+%5Cin+%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(1,-i) &#92;in &#92;hat E}&amp;fg=000000' title='{(1,-i) &#92;in &#92;hat E}&amp;fg=000000' class='latex' />&#8221; for <img src='http://s0.wp.com/latex.php?latex=%7Bi%3D1%2C%5Cldots%2Cn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i=1,&#92;ldots,n}&amp;fg=000000' title='{i=1,&#92;ldots,n}&amp;fg=000000' class='latex' />. But observe if <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' class='latex' /> is generated from a finite deterministic graph <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%7B%281%2C-i%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(1,-i)}&amp;fg=000000' title='{(1,-i)}&amp;fg=000000' class='latex' /> lies in <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat E}&amp;fg=000000' title='{&#92;hat E}&amp;fg=000000' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1}&amp;fg=000000' title='{v_1}&amp;fg=000000' class='latex' /> lies in the vertex neighbourhood of <img src='http://s0.wp.com/latex.php?latex=%7Bv_%7B-i%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_{-i}}&amp;fg=000000' title='{v_{-i}}&amp;fg=000000' class='latex' />. Thus, if one uses the vertex neighbourhoods of <img src='http://s0.wp.com/latex.php?latex=%7Bv_%7B-1%7D%2C%5Cldots%2Cv_%7B-n%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_{-1},&#92;ldots,v_{-n}}&amp;fg=000000' title='{v_{-1},&#92;ldots,v_{-n}}&amp;fg=000000' class='latex' /> to subdivide the original vertex set <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> into <img src='http://s0.wp.com/latex.php?latex=%7B2%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^n}&amp;fg=000000' title='{2^n}&amp;fg=000000' class='latex' /> cells of varying sizes, the factor <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-n%2C%5Cldots%2C-1%5C%7D+%5Ccup+%5C%7B1%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-n,&#92;ldots,-1&#92;} &#92;cup &#92;{1&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-n,&#92;ldots,-1&#92;} &#92;cup &#92;{1&#92;}}}&amp;fg=000000' class='latex' /> is generated from <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-n%2C%5Cldots%2C-1%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-n,&#92;ldots,-1&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-n,&#92;ldots,-1&#92;}}}&amp;fg=000000' class='latex' />, together with the random variable that computes which of these <img src='http://s0.wp.com/latex.php?latex=%7B2%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^n}&amp;fg=000000' title='{2^n}&amp;fg=000000' class='latex' /> cells the random vertex <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1}&amp;fg=000000' title='{v_1}&amp;fg=000000' class='latex' /> falls into. We will see this connection in more detail later in this post, when we use the correspondence principle to prove Lemma <a href="#regularity4">2</a>.
</p>
<p>
Combining the exchangeability and limit properties (and noting that the vague limit of exchangeable random graphs is still exchangeable), we obtain
</p>
<blockquote><p><b>Lemma 3 (Graph correspondence principle)</b>  Let <img src='http://s0.wp.com/latex.php?latex=%7BG_n+%3D+%28V_n%2CE_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n = (V_n,E_n)}&amp;fg=000000' title='{G_n = (V_n,E_n)}&amp;fg=000000' class='latex' /> be a sequence of finite deterministic graphs, and let <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_n+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_n = ({&#92;mathbb Z}, &#92;hat E_n)}&amp;fg=000000' title='{&#92;hat G_n = ({&#92;mathbb Z}, &#92;hat E_n)}&amp;fg=000000' class='latex' /> be their infinite random counterparts. Then there exists a subsequence <img src='http://s0.wp.com/latex.php?latex=%7Bn_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n_j}&amp;fg=000000' title='{n_j}&amp;fg=000000' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_%7Bn_j%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_{n_j}}&amp;fg=000000' title='{&#92;hat G_{n_j}}&amp;fg=000000' class='latex' /> converges in the vague topology to an exchangeable infinite random graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
We can illustrate this principle with three main examples, two from opposing extremes of the &#8220;dichotomy between structure and randomness&#8221;, and one intermediate one.
</p>
<blockquote><p><b>Example 1 (Random example)</b> <a name="randex"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7BG_n+%3D+%28V_n%2CE_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n = (V_n,E_n)}&amp;fg=000000' title='{G_n = (V_n,E_n)}&amp;fg=000000' class='latex' /> be a sequence of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon_n}&amp;fg=000000' title='{&#92;varepsilon_n}&amp;fg=000000' class='latex' />-regular graphs of edge density <img src='http://s0.wp.com/latex.php?latex=%7Bp_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p_n}&amp;fg=000000' title='{p_n}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV_n%26%23124%3B+%5Crightarrow+%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V_n&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' title='{&#124;V_n&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon_n+%5Crightarrow+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon_n &#92;rightarrow 0}&amp;fg=000000' title='{&#92;varepsilon_n &#92;rightarrow 0}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7Bp_n+%5Crightarrow+p%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p_n &#92;rightarrow p}&amp;fg=000000' title='{p_n &#92;rightarrow p}&amp;fg=000000' class='latex' /> as <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Crightarrow+%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n &#92;rightarrow &#92;infty}&amp;fg=000000' title='{n &#92;rightarrow &#92;infty}&amp;fg=000000' class='latex' />. Then any graph limit <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C%5Chat+G%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z},&#92;hat G)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z},&#92;hat G)}&amp;fg=000000' class='latex' /> of this sequence will be an <a href="http://en.wikipedia.org/wiki/Erd&#37;C5&#37;91s&#37;E2&#37;80&#37;93R&#37;C3&#37;A9nyi_model">Erd&#246;s-R&#233;nyi graph</a> <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+G%28%5Cinfty%2Cp%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = G(&#92;infty,p)}&amp;fg=000000' title='{&#92;hat G = G(&#92;infty,p)}&amp;fg=000000' class='latex' />, where each edge <img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j)}&amp;fg=000000' title='{(i,j)}&amp;fg=000000' class='latex' /> lies in <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> with an independent probability of <img src='http://s0.wp.com/latex.php?latex=%7Bp%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p}&amp;fg=000000' title='{p}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<blockquote><p><b>Example 2 (Structured example)</b> <a name="structex"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7BG_n+%3D+%28V_n%2CE_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n = (V_n,E_n)}&amp;fg=000000' title='{G_n = (V_n,E_n)}&amp;fg=000000' class='latex' /> be a sequence of complete bipartite graphs, where the two cells of the bipartite graph have vertex density <img src='http://s0.wp.com/latex.php?latex=%7Bq_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q_n}&amp;fg=000000' title='{q_n}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B1-q_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-q_n}&amp;fg=000000' title='{1-q_n}&amp;fg=000000' class='latex' /> respectively, with <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV_n%26%23124%3B+%5Crightarrow+%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V_n&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' title='{&#124;V_n&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bq_n+%5Crightarrow+q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q_n &#92;rightarrow q}&amp;fg=000000' title='{q_n &#92;rightarrow q}&amp;fg=000000' class='latex' />. Then any graph limit <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z},&#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z},&#92;hat E)}&amp;fg=000000' class='latex' /> of this sequence will be a random complete bipartite graph, constructed as follows: first, randomly colour each vertex <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb Z}}&amp;fg=000000' title='{{&#92;mathbb Z}}&amp;fg=000000' class='latex' /> red with probability <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' /> and blue with probability <img src='http://s0.wp.com/latex.php?latex=%7B1-q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-q}&amp;fg=000000' title='{1-q}&amp;fg=000000' class='latex' />, independently for each vertex. Then define <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> to be the complete bipartite graph between the red vertices and the blue vertices. </p></blockquote>
</p>
<blockquote><p><b>Example 3 (Random+structured example)</b> <a name="combex"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7BG_n+%3D+%28V_n%2CE_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n = (V_n,E_n)}&amp;fg=000000' title='{G_n = (V_n,E_n)}&amp;fg=000000' class='latex' /> be a sequence of <em>incomplete</em> bipartite graphs, where the two cells of the bipartite graph have vertex density <img src='http://s0.wp.com/latex.php?latex=%7Bp_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p_n}&amp;fg=000000' title='{p_n}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B1-p_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-p_n}&amp;fg=000000' title='{1-p_n}&amp;fg=000000' class='latex' /> respectively, and the graph <img src='http://s0.wp.com/latex.php?latex=%7BG_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n}&amp;fg=000000' title='{G_n}&amp;fg=000000' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon_n}&amp;fg=000000' title='{&#92;varepsilon_n}&amp;fg=000000' class='latex' />-regular between these two cells with edge density <img src='http://s0.wp.com/latex.php?latex=%7Bp_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p_n}&amp;fg=000000' title='{p_n}&amp;fg=000000' class='latex' />, with <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV_n%26%23124%3B+%5Crightarrow+%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V_n&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' title='{&#124;V_n&#124; &#92;rightarrow &#92;infty}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon_n+%5Crightarrow+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon_n &#92;rightarrow 0}&amp;fg=000000' title='{&#92;varepsilon_n &#92;rightarrow 0}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7Bp_n+%5Crightarrow+p%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p_n &#92;rightarrow p}&amp;fg=000000' title='{p_n &#92;rightarrow p}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7Bq_n+%5Crightarrow+q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q_n &#92;rightarrow q}&amp;fg=000000' title='{q_n &#92;rightarrow q}&amp;fg=000000' class='latex' />. Then any graph limit <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z},&#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z},&#92;hat E)}&amp;fg=000000' class='latex' /> of this sequence will be a random bipartite graph, constructed as follows: first, randomly colour each vertex <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb Z}}&amp;fg=000000' title='{{&#92;mathbb Z}}&amp;fg=000000' class='latex' /> red with probability <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' /> and blue with probability <img src='http://s0.wp.com/latex.php?latex=%7B1-q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-q}&amp;fg=000000' title='{1-q}&amp;fg=000000' class='latex' />, independently for each vertex. Then define <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> to be the bipartite graph between the red vertices and the blue vertices, with each edge between red and blue having an independent probability of <img src='http://s0.wp.com/latex.php?latex=%7Bp%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p}&amp;fg=000000' title='{p}&amp;fg=000000' class='latex' /> of lying in <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat E}&amp;fg=000000' title='{&#92;hat E}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
One can use the graph correspondence principle to prove statements about finite deterministic graphs, by the usual <a class="snap_noshots" href="http://www.tricki.org/article/Use_the_compactness_and_contradiction_method_to_derive_finitary_quantitative_results_from_infinitary_qualitative_ones">compactness and contradiction approach</a>: argue by contradiction, create a sequence of finite deterministic graph counterexamples, use the correspondence principle to pass to an infinite random exchangeable limit, and obtain the desired contradiction in the infinitary setting. (See <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/08/30/the-correspondence-principle-and-finitary-ergodic-theory/">this earlier blog post of mine</a> for further discussion.) This will be how we shall approach the proof of Lemma <a href="#regularity4">2</a>.
</p>
</p>
<p align="center"><b> &#8212;  2. The infinitary regularity lemma  &#8212; </b></p>
<p>
To prove the finitary regularity lemma via the correspondence principle, one must first develop an infinitary counterpart. We will present this infinitary regularity lemma (first introduced in this paper) shortly, but let us motivate it by a discussion based on the three model examples of infinite exchangeable graphs <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' class='latex' /> from the previous section.
</p>
<p>
First, consider the &#8220;random&#8221; graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> from Example <a href="#randex">1</a>. Here, we observe that the events &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29+%5Cin+%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j) &#92;in &#92;hat E}&amp;fg=000000' title='{(i,j) &#92;in &#92;hat E}&amp;fg=000000' class='latex' />&#8221; are jointly independent of each other, thus for instance </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+%281%2C2%29%2C+%282%2C3%29%2C+%283%2C1%29+%5Cin+%5Chat+E+%29+%3D+%5Cprod_%7B%28i%2Cj%29+%3D+%281%2C2%29%2C+%282%2C3%29%2C+%283%2C1%29%7D+%7B%5CBbb+P%7D%28+%28i%2Cj%29+%5Cin+%5Chat+E+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb P}( (1,2), (2,3), (3,1) &#92;in &#92;hat E ) = &#92;prod_{(i,j) = (1,2), (2,3), (3,1)} {&#92;Bbb P}( (i,j) &#92;in &#92;hat E ).&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb P}( (1,2), (2,3), (3,1) &#92;in &#92;hat E ) = &#92;prod_{(i,j) = (1,2), (2,3), (3,1)} {&#92;Bbb P}( (i,j) &#92;in &#92;hat E ).&amp;fg=000000' class='latex' /></p>
<p> More generally, we see that the factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7Bi%2Cj%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' class='latex' /> for all distinct <img src='http://s0.wp.com/latex.php?latex=%7Bi%2Cj+%5Cin+%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i,j &#92;in {&#92;mathbb Z}}&amp;fg=000000' title='{i,j &#92;in {&#92;mathbb Z}}&amp;fg=000000' class='latex' /> are independent, which means that
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+E_1+%5Cwedge+%5Cldots+%5Cwedge+E_n+%29+%3D+%7B%5CBbb+P%7D%28E_1%29+%5Cldots+%7B%5CBbb+P%7D%28E_n%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb P}( E_1 &#92;wedge &#92;ldots &#92;wedge E_n ) = {&#92;Bbb P}(E_1) &#92;ldots {&#92;Bbb P}(E_n)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb P}( E_1 &#92;wedge &#92;ldots &#92;wedge E_n ) = {&#92;Bbb P}(E_1) &#92;ldots {&#92;Bbb P}(E_n)&amp;fg=000000' class='latex' /></p>
<p> whenever <img src='http://s0.wp.com/latex.php?latex=%7BE_1+%5Cin+%7B%5Cmathcal+B%7D_%7B%5C%7Bi_1%2Cj_1%5C%7D%7D%2C+%5Cldots%2C+E_n+%5Cin+%7B%5Cmathcal+B%7D_%7B%5C%7Bi_n%2Cj_n%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_1 &#92;in {&#92;mathcal B}_{&#92;{i_1,j_1&#92;}}, &#92;ldots, E_n &#92;in {&#92;mathcal B}_{&#92;{i_n,j_n&#92;}}}&amp;fg=000000' title='{E_1 &#92;in {&#92;mathcal B}_{&#92;{i_1,j_1&#92;}}, &#92;ldots, E_n &#92;in {&#92;mathcal B}_{&#92;{i_n,j_n&#92;}}}&amp;fg=000000' class='latex' /> and the <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7Bi_1%2Cj_1%5C%7D%2C%5Cldots%2C%5C%7Bi_n%2Cj_n%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{i_1,j_1&#92;},&#92;ldots,&#92;{i_n,j_n&#92;}}&amp;fg=000000' title='{&#92;{i_1,j_1&#92;},&#92;ldots,&#92;{i_n,j_n&#92;}}&amp;fg=000000' class='latex' /> are distinct. </p>
<p>
Next, we consider the &#8220;structured&#8221; graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> from Example <a href="#structex">2</a>, where we take <img src='http://s0.wp.com/latex.php?latex=%7B0+%26%2360%3B+p+%26%2360%3B+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &lt; p &lt; 1}&amp;fg=000000' title='{0 &lt; p &lt; 1}&amp;fg=000000' class='latex' /> to avoid degeneracies. In contrast to the preceding example, the events &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29+%5Cin+%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j) &#92;in &#92;hat E}&amp;fg=000000' title='{(i,j) &#92;in &#92;hat E}&amp;fg=000000' class='latex' />&#8221; are now highly dependent; for instance, if <img src='http://s0.wp.com/latex.php?latex=%7B%281%2C2%29+%5Cin+%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(1,2) &#92;in &#92;hat E}&amp;fg=000000' title='{(1,2) &#92;in &#92;hat E}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%281%2C3%29+%5Cin+%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(1,3) &#92;in &#92;hat E}&amp;fg=000000' title='{(1,3) &#92;in &#92;hat E}&amp;fg=000000' class='latex' />, then this forces <img src='http://s0.wp.com/latex.php?latex=%7B%282%2C3%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(2,3)}&amp;fg=000000' title='{(2,3)}&amp;fg=000000' class='latex' /> to lie outside of <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat E}&amp;fg=000000' title='{&#92;hat E}&amp;fg=000000' class='latex' />, despite the fact that the events &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29+%5Cin+%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j) &#92;in &#92;hat E}&amp;fg=000000' title='{(i,j) &#92;in &#92;hat E}&amp;fg=000000' class='latex' />&#8221; each occur with a non-zero probability of <img src='http://s0.wp.com/latex.php?latex=%7Bp+%281-p%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p (1-p)}&amp;fg=000000' title='{p (1-p)}&amp;fg=000000' class='latex' />. In particular, the factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B1%2C2%5C%7D%7D%2C+%7B%5Cmathcal+B%7D_%7B%5C%7B1%2C3%5C%7D%7D%2C+%7B%5Cmathcal+B%7D_%7B%5C%7B2%2C3%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{1,2&#92;}}, {&#92;mathcal B}_{&#92;{1,3&#92;}}, {&#92;mathcal B}_{&#92;{2,3&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{1,2&#92;}}, {&#92;mathcal B}_{&#92;{1,3&#92;}}, {&#92;mathcal B}_{&#92;{2,3&#92;}}}&amp;fg=000000' class='latex' /> are not jointly independent.
</p>
<p>
However, one can recover a <em>conditional</em> independence by introducing some new factors. Specifically, let <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' /> be the factor generated by the event that the vertex <img src='http://s0.wp.com/latex.php?latex=%7Bi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i}&amp;fg=000000' title='{i}&amp;fg=000000' class='latex' /> is coloured red. Then we see that the factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B1%2C2%5C%7D%7D%2C+%7B%5Cmathcal+B%7D_%7B%5C%7B1%2C3%5C%7D%7D%2C+%7B%5Cmathcal+B%7D_%7B%5C%7B2%2C3%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{1,2&#92;}}, {&#92;mathcal B}_{&#92;{1,3&#92;}}, {&#92;mathcal B}_{&#92;{2,3&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{1,2&#92;}}, {&#92;mathcal B}_{&#92;{1,3&#92;}}, {&#92;mathcal B}_{&#92;{2,3&#92;}}}&amp;fg=000000' class='latex' /> now become <em>conditionally</em> jointly independent, relative to the base factor <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_1+%5Cvee+%7B%5Cmathcal+B%7D_2+%5Cvee+%7B%5Cmathcal+B%7D_3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2 &#92;vee {&#92;mathcal B}_3}&amp;fg=000000' title='{{&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2 &#92;vee {&#92;mathcal B}_3}&amp;fg=000000' class='latex' />, which means that we have conditional independence identities such as </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+%281%2C2%29%2C+%282%2C3%29%2C+%283%2C1%29+%5Cin+%5Chat%7BE%7D+%26%23124%3B+%7B%5Cmathcal+B%7D_1+%5Cvee+%7B%5Cmathcal+B%7D_2+%5Cvee+%7B%5Cmathcal+B%7D_3+%29+%3D+%5Cprod_%7B%28i%2Cj%29+%3D+%281%2C2%29%2C+%282%2C3%29%2C+%283%2C1%29%7D+%7B%5CBbb+P%7D%28+%28i%2Cj%29+%5Cin+%5Chat%7BE%7D+%26%23124%3B+%7B%5Cmathcal+B%7D_1+%5Cvee+%7B%5Cmathcal+B%7D_2+%5Cvee+%7B%5Cmathcal+B%7D_3+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb P}( (1,2), (2,3), (3,1) &#92;in &#92;hat{E} &#124; {&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2 &#92;vee {&#92;mathcal B}_3 ) = &#92;prod_{(i,j) = (1,2), (2,3), (3,1)} {&#92;Bbb P}( (i,j) &#92;in &#92;hat{E} &#124; {&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2 &#92;vee {&#92;mathcal B}_3 ).&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb P}( (1,2), (2,3), (3,1) &#92;in &#92;hat{E} &#124; {&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2 &#92;vee {&#92;mathcal B}_3 ) = &#92;prod_{(i,j) = (1,2), (2,3), (3,1)} {&#92;Bbb P}( (i,j) &#92;in &#92;hat{E} &#124; {&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2 &#92;vee {&#92;mathcal B}_3 ).&amp;fg=000000' class='latex' /></p>
<p> Indeed, once one fixes (conditions) the information in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_1+%5Cvee+%7B%5Cmathcal+B%7D_2+%5Cvee+%7B%5Cmathcal+B%7D_3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2 &#92;vee {&#92;mathcal B}_3}&amp;fg=000000' title='{{&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2 &#92;vee {&#92;mathcal B}_3}&amp;fg=000000' class='latex' /> (i.e. once one knows what colour the vertices <img src='http://s0.wp.com/latex.php?latex=%7B1%2C2%2C3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1,2,3}&amp;fg=000000' title='{1,2,3}&amp;fg=000000' class='latex' /> are), the events &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29+%5Cin+%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j) &#92;in &#92;hat E}&amp;fg=000000' title='{(i,j) &#92;in &#92;hat E}&amp;fg=000000' class='latex' />&#8221; for <img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29%3D%281%2C2%29%2C+%282%2C3%29%2C+%283%2C1%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j)=(1,2), (2,3), (3,1)}&amp;fg=000000' title='{(i,j)=(1,2), (2,3), (3,1)}&amp;fg=000000' class='latex' /> either occur with probability <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> (if <img src='http://s0.wp.com/latex.php?latex=%7Bi%2C+j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i, j}&amp;fg=000000' title='{i, j}&amp;fg=000000' class='latex' /> have distinct colours) or probability <img src='http://s0.wp.com/latex.php?latex=%7B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0}&amp;fg=000000' title='{0}&amp;fg=000000' class='latex' /> (if <img src='http://s0.wp.com/latex.php?latex=%7Bi%2Cj%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i,j}&amp;fg=000000' title='{i,j}&amp;fg=000000' class='latex' /> have the same colour), and so the conditional independence is trivially true.</p>
<p>
A similar phenomenon holds for the &#8220;random+structured&#8221; graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> from Example <a href="#combex">3</a>, with <img src='http://s0.wp.com/latex.php?latex=%7B0+%26%2360%3B+p%2Cq+%26%2360%3B+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &lt; p,q &lt; 1}&amp;fg=000000' title='{0 &lt; p,q &lt; 1}&amp;fg=000000' class='latex' />. Again, the factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7Bi%2Cj%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' class='latex' /> are not jointly independent in an absolute sense, but once one introduces the factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' /> based on the colour of the vertex <img src='http://s0.wp.com/latex.php?latex=%7Bi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i}&amp;fg=000000' title='{i}&amp;fg=000000' class='latex' />, we see once again that the <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7Bi%2Cj%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' class='latex' /> become conditionally jointly independent relative to the <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' />.
</p>
<p>
These examples suggest, more generally, that we should be able to <em>regularise</em> the graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> (or more precisely, the system of edge factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7Bi%2Cj%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' class='latex' />) by introducing some single-vertex factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' />, with respect to which the edge factors become conditionally independent; this is the infinitary analogue of a finite graph becoming <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon}&amp;fg=000000' title='{&#92;varepsilon}&amp;fg=000000' class='latex' />-regular relative to a suitably chosen partition of the vertex set into cells.
</p>
<p>
Now, in Examples <a href="#structex">2</a>, <a href="#combex">3</a> we were able to obtain this regularisation because the vertices of the graph were conveniently coloured for us (red or blue). But for general infinite exchangeable graphs <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' />, such a vertex colouring is not provided to us, so how is one to generate the vertex factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' />?
</p>
<p>
The key trick &#8211; which is the infinitary analogue of using random neighbourhoods to regularise a finitary graph &#8211; is to sequester half of the infinite vertices in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathbb+Z%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathbb Z}}&amp;fg=000000' title='{{&#92;mathbb Z}}&amp;fg=000000' class='latex' /> &#8211; e.g. the negative vertices <img src='http://s0.wp.com/latex.php?latex=%7B-1%2C+-2%2C+%5Cldots%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{-1, -2, &#92;ldots}&amp;fg=000000' title='{-1, -2, &#92;ldots}&amp;fg=000000' class='latex' /> &#8211; away as &#8220;reference&#8221; or &#8220;training&#8221; vertices, and then and colorise the remaining vertices <img src='http://s0.wp.com/latex.php?latex=%7Bi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i}&amp;fg=000000' title='{i}&amp;fg=000000' class='latex' /> of the graph based on how that vertex interacts with the reference vertices. More formally, we define <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7Bi%3D0%2C1%2C2%2C%5Cldots%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i=0,1,2,&#92;ldots}&amp;fg=000000' title='{i=0,1,2,&#92;ldots}&amp;fg=000000' class='latex' /> by the formula
</p>
</p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5Cmathcal+B%7D_i+%3A%3D+%7B%5Cmathcal+B%7D_%7B%5C%7B+-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7Bi%5C%7D%7D.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;mathcal B}_i := {&#92;mathcal B}_{&#92;{ -1,-2,&#92;ldots&#92;} &#92;cup &#92;{i&#92;}}.&amp;fg=000000' title='&#92;displaystyle  {&#92;mathcal B}_i := {&#92;mathcal B}_{&#92;{ -1,-2,&#92;ldots&#92;} &#92;cup &#92;{i&#92;}}.&amp;fg=000000' class='latex' /></p>
<p>
We then have
</p>
<blockquote><p><b>Lemma 4 (Infinitary regularity lemma)</b>  Let <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z},&#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z},&#92;hat E)}&amp;fg=000000' class='latex' /> be a infinite exchangeable random graph. Then the <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7Bi%2Cj%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_i+%5Cvee+%7B%5Cmathcal+B%7D_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{i,j&#92;}} &#92;vee {&#92;mathcal B}_i &#92;vee {&#92;mathcal B}_j}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{i,j&#92;}} &#92;vee {&#92;mathcal B}_i &#92;vee {&#92;mathcal B}_j}&amp;fg=000000' class='latex' /> for natural numbers <img src='http://s0.wp.com/latex.php?latex=%7Bi%2Cj%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i,j}&amp;fg=000000' title='{i,j}&amp;fg=000000' class='latex' /> are conditinally jointly independent relative to the <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' />. More precisely, if <img src='http://s0.wp.com/latex.php?latex=%7BI%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{I}&amp;fg=000000' title='{I}&amp;fg=000000' class='latex' /> is a set of natural numbers, <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' /> is a subset of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cbinom%7BI%7D%7B2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;binom{I}{2}}&amp;fg=000000' title='{&#92;binom{I}{2}}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7BE_e%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_e}&amp;fg=000000' title='{E_e}&amp;fg=000000' class='latex' /> is a <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_e+%5Cwedge+%5Cbigwedge_%7Bi+%5Cin+e%7D+%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_e &#92;wedge &#92;bigwedge_{i &#92;in e} {&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_e &#92;wedge &#92;bigwedge_{i &#92;in e} {&#92;mathcal B}_i}&amp;fg=000000' class='latex' />-measurable event for all <img src='http://s0.wp.com/latex.php?latex=%7Be+%5Cin+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{e &#92;in E}&amp;fg=000000' title='{e &#92;in E}&amp;fg=000000' class='latex' />, then
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cmathbb+P%7D%28+%5Cbigwedge_%7Be+%5Cin+E%7D+E_%7Be%7D+%26%23124%3B+%5Cbigwedge_%7Bi+%5Cin+I%7D+%7B%5Cmathcal+B%7D_i+%29+%3D+%5Cprod_%7Be+%5Cin+E%7D+%5Cmathop%7B%5Cmathbb+P%7D%28+E_e+%26%23124%3B+%5Cbigwedge_%7Bi+%5Cin+I%7D+%7B%5Cmathcal+B%7D_i+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;mathbb P}( &#92;bigwedge_{e &#92;in E} E_{e} &#124; &#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i ) = &#92;prod_{e &#92;in E} &#92;mathop{&#92;mathbb P}( E_e &#124; &#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i ).&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;mathbb P}( &#92;bigwedge_{e &#92;in E} E_{e} &#124; &#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i ) = &#92;prod_{e &#92;in E} &#92;mathop{&#92;mathbb P}( E_e &#124; &#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i ).&amp;fg=000000' class='latex' /></p>
</blockquote>
</p>
<p>
<em>Proof:</em>  By induction on <img src='http://s0.wp.com/latex.php?latex=%7BE%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E}&amp;fg=000000' title='{E}&amp;fg=000000' class='latex' />, it suffices to show that for any <img src='http://s0.wp.com/latex.php?latex=%7Be_0+%5Cin+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{e_0 &#92;in E}&amp;fg=000000' title='{e_0 &#92;in E}&amp;fg=000000' class='latex' />, the event <img src='http://s0.wp.com/latex.php?latex=%7BE_%7Be_0%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{e_0}}&amp;fg=000000' title='{E_{e_0}}&amp;fg=000000' class='latex' /> and the event <img src='http://s0.wp.com/latex.php?latex=%7B%5Cbigwedge_%7Be+%5Cin+E+%5Cbackslash+%5C%7Be_0%5C%7D%7D+E_e%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;bigwedge_{e &#92;in E &#92;backslash &#92;{e_0&#92;}} E_e}&amp;fg=000000' title='{&#92;bigwedge_{e &#92;in E &#92;backslash &#92;{e_0&#92;}} E_e}&amp;fg=000000' class='latex' /> are independent relative to <img src='http://s0.wp.com/latex.php?latex=%7B%5Cbigwedge_%7Bi+%5Cin+I%7D+%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i}&amp;fg=000000' title='{&#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i}&amp;fg=000000' class='latex' />.
</p>
<p>
By relabeling we may take <img src='http://s0.wp.com/latex.php?latex=%7BI+%3D+%5C%7B1%2C%5Cldots%2Cn%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{I = &#92;{1,&#92;ldots,n&#92;}}&amp;fg=000000' title='{I = &#92;{1,&#92;ldots,n&#92;}}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Be_0%3D%5C%7B1%2C2%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{e_0=&#92;{1,2&#92;}}&amp;fg=000000' title='{e_0=&#92;{1,2&#92;}}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Cgeq+2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n &#92;geq 2}&amp;fg=000000' title='{n &#92;geq 2}&amp;fg=000000' class='latex' />. We use the exchangeability of <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> (and <a href="http://en.wikipedia.org/wiki/Hilbert&#37;27s_paradox_of_the_Grand_Hotel">Hilbert&#8217;s hotel</a>) to observe that the random variables </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D%28+1_%7BE_%7Be_0%7D%7D+%26%23124%3B+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7B1%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7B2%5C%7D%7D+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}( 1_{E_{e_0}} &#124; {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{2&#92;}} )&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}( 1_{E_{e_0}} &#124; {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{2&#92;}} )&amp;fg=000000' class='latex' /></p>
<p> and
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D%28+1_%7BE_%7Be_0%7D%7D+%26%23124%3B+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7B1%5C%7D+%5Ccup+%5C%7B3%2C%5Cldots%2Cn%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7B2%5C%7D+%5Ccup+%5C%7B3%2C%5Cldots%2Cn%5C%7D%7D+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}( 1_{E_{e_0}} &#124; {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{1&#92;} &#92;cup &#92;{3,&#92;ldots,n&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{2&#92;} &#92;cup &#92;{3,&#92;ldots,n&#92;}} )&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}( 1_{E_{e_0}} &#124; {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{1&#92;} &#92;cup &#92;{3,&#92;ldots,n&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{2&#92;} &#92;cup &#92;{3,&#92;ldots,n&#92;}} )&amp;fg=000000' class='latex' /></p>
<p> have the same distribution; in particular, they have the same <img src='http://s0.wp.com/latex.php?latex=%7BL%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{L^2}&amp;fg=000000' title='{L^2}&amp;fg=000000' class='latex' /> norm. By Pythagoras&#8217; theorem, they must therefore be equal almost surely; furthermore, for any intermediate <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebra <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}}&amp;fg=000000' title='{{&#92;mathcal B}}&amp;fg=000000' class='latex' /> between <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7B1%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7B2%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{2&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{2&#92;}}}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7B1%5C%7D+%5Ccup+%5C%7B3%2C%5Cldots%2Cn%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%5C%7D+%5Ccup+%5C%7B2%5C%7D+%5Ccup+%5C%7B3%2C%5Cldots%2Cn%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{1&#92;} &#92;cup &#92;{3,&#92;ldots,n&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{2&#92;} &#92;cup &#92;{3,&#92;ldots,n&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{1&#92;} &#92;cup &#92;{3,&#92;ldots,n&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots&#92;} &#92;cup &#92;{2&#92;} &#92;cup &#92;{3,&#92;ldots,n&#92;}}}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D%28+1_%7BE_%7Be_0%7D%7D+%26%23124%3B+%7B%5Cmathcal+B%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E}( 1_{E_{e_0}} &#124; {&#92;mathcal B} )}&amp;fg=000000' title='{{&#92;Bbb E}( 1_{E_{e_0}} &#124; {&#92;mathcal B} )}&amp;fg=000000' class='latex' /> is also equal almost surely to the above two expressions. (The astute reader will observe that we have just run the &#8220;energy increment argument&#8221;; in the infinitary world, it is somewhat slicker than in the finitary world, due to the convenience of the Hilbert&#8217;s hotel trick, and the fact that the existence of orthogonal projections (and in particular, conditional expectation) is itself encoding an energy increment argument.)</p>
<p>
As a special case of the above observation, we see that </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D%28+1_%7BE_%7Be_0%7D%7D+%26%23124%3B+%5Cbigwedge_%7Bi+%5Cin+I%7D+%7B%5Cmathcal+B%7D_i+%29+%3D+%7B%5CBbb+E%7D%28+1_%7BE_%7Be_0%7D%7D+%26%23124%3B+%5Cbigwedge_%7Bi+%5Cin+I%7D+%7B%5Cmathcal+B%7D_i+%5Cwedge+%5Cbigwedge_%7Be+%5Cin+E+%5Cbackslash+%5C%7Be_0%5C%7D%7D+%7B%5Cmathcal+B%7D_e+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}( 1_{E_{e_0}} &#124; &#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i ) = {&#92;Bbb E}( 1_{E_{e_0}} &#124; &#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i &#92;wedge &#92;bigwedge_{e &#92;in E &#92;backslash &#92;{e_0&#92;}} {&#92;mathcal B}_e ).&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}( 1_{E_{e_0}} &#124; &#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i ) = {&#92;Bbb E}( 1_{E_{e_0}} &#124; &#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i &#92;wedge &#92;bigwedge_{e &#92;in E &#92;backslash &#92;{e_0&#92;}} {&#92;mathcal B}_e ).&amp;fg=000000' class='latex' /></p>
<p> In particular, this implies that <img src='http://s0.wp.com/latex.php?latex=%7BE_0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_0}&amp;fg=000000' title='{E_0}&amp;fg=000000' class='latex' /> is conditionally independent of every event measurable in <img src='http://s0.wp.com/latex.php?latex=%7B%5Cbigwedge_%7Bi+%5Cin+I%7D+%7B%5Cmathcal+B%7D_i+%5Cwedge+%5Cbigwedge_%7Be+%5Cin+E+%5Cbackslash+%5C%7Be_0%5C%7D%7D+%7B%5Cmathcal+B%7D_e%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i &#92;wedge &#92;bigwedge_{e &#92;in E &#92;backslash &#92;{e_0&#92;}} {&#92;mathcal B}_e}&amp;fg=000000' title='{&#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i &#92;wedge &#92;bigwedge_{e &#92;in E &#92;backslash &#92;{e_0&#92;}} {&#92;mathcal B}_e}&amp;fg=000000' class='latex' />, relative to <img src='http://s0.wp.com/latex.php?latex=%7B%5Cbigwedge_%7Bi+%5Cin+I%7D+%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i}&amp;fg=000000' title='{&#92;bigwedge_{i &#92;in I} {&#92;mathcal B}_i}&amp;fg=000000' class='latex' />, and the claim follows. <img src='http://s0.wp.com/latex.php?latex=%5CBox%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box&amp;fg=000000' title='&#92;Box&amp;fg=000000' class='latex' /></p>
<p>
We remark that the same argument also allows one to easily regularise infinite exchangeable hypergraphs; see <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.CO/0602037">my paper</a> for details. In fact one can go further and obtain a structural theorem for these hypergraphs generalising <a href="http://en.wikipedia.org/wiki/De_Finetti&#37;27s_theorem">de Finetti&#8217;s theorem</a>, and also closely related to the graphons of Lovasz and Szegedy; see <a class="snap_noshots" href="http://aps.arxiv.org/abs/0801.1698">this paper of Austin</a> for details.
</p>
</p>
<p align="center"><b> &#8212;  3. Proof of finitary regularity lemma  &#8212; </b></p>
<p>
Having proven the infinitary regularity lemma, we now use the correspondence principle and the <a class="snap_noshots" href="http://www.tricki.org/article/Use_the_compactness_and_contradiction_method_to_derive_finitary_quantitative_results_from_infinitary_qualitative_ones">compactness and contradiction argument</a> to recover the finitary regularity lemma, Lemma <a href="#regularity4">2</a>.
</p>
<p>
Suppose this lemma failed. Carefully negating all the quantifiers, this means that there exists <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon &gt; 0}&amp;fg=000000' title='{&#92;varepsilon &gt; 0}&amp;fg=000000' class='latex' />, a sequence <img src='http://s0.wp.com/latex.php?latex=%7BM_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_n}&amp;fg=000000' title='{M_n}&amp;fg=000000' class='latex' /> going to infinity, and a sequence of finite deterministic graphs <img src='http://s0.wp.com/latex.php?latex=%7BG_n+%3D+%28V_n%2CE_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n = (V_n,E_n)}&amp;fg=000000' title='{G_n = (V_n,E_n)}&amp;fg=000000' class='latex' /> such that for every <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+M+%5Cleq+M_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq M &#92;leq M_n}&amp;fg=000000' title='{1 &#92;leq M &#92;leq M_n}&amp;fg=000000' class='latex' />, if one selects vertices <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_M+%5Cin+V_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_M &#92;in V_n}&amp;fg=000000' title='{v_1,&#92;ldots,v_M &#92;in V_n}&amp;fg=000000' class='latex' /> uniformly from <img src='http://s0.wp.com/latex.php?latex=%7BV_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_n}&amp;fg=000000' title='{V_n}&amp;fg=000000' class='latex' />, then the <img src='http://s0.wp.com/latex.php?latex=%7B2%5E%7BM%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^{M}}&amp;fg=000000' title='{2^{M}}&amp;fg=000000' class='latex' /> vertex cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM%7D_1%2C%5Cldots%2CV%5E%7BM%7D_%7B2%5E%7BM%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M}_1,&#92;ldots,V^{M}_{2^{M}}}&amp;fg=000000' title='{V^{M}_1,&#92;ldots,V^{M}_{2^{M}}}&amp;fg=000000' class='latex' /> generated by the vertex neighbourhoods <img src='http://s0.wp.com/latex.php?latex=%7BA_t+%3A%3D+%5C%7B+v+%5Cin+V%3A+%28v%2Cv_t%29+%5Cin+E+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t := &#92;{ v &#92;in V: (v,v_t) &#92;in E &#92;}}&amp;fg=000000' title='{A_t := &#92;{ v &#92;in V: (v,v_t) &#92;in E &#92;}}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+t+%5Cleq+M%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq t &#92;leq M}&amp;fg=000000' title='{1 &#92;leq t &#92;leq M}&amp;fg=000000' class='latex' />, will obey the regularity property <a href="#regular">(1)</a> with probability less than <img src='http://s0.wp.com/latex.php?latex=%7B1-%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-&#92;varepsilon}&amp;fg=000000' title='{1-&#92;varepsilon}&amp;fg=000000' class='latex' />.
</p>
<p>
We convert each of the finite deterministic graphs <img src='http://s0.wp.com/latex.php?latex=%7BG_n+%3D+%28V_n%2CE_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n = (V_n,E_n)}&amp;fg=000000' title='{G_n = (V_n,E_n)}&amp;fg=000000' class='latex' /> to an infinite random exchangeable graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_n+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_n = ({&#92;mathbb Z}, &#92;hat E_n)}&amp;fg=000000' title='{&#92;hat G_n = ({&#92;mathbb Z}, &#92;hat E_n)}&amp;fg=000000' class='latex' />; invoking the ocrrespondence principle and passing to a subsequence if necessary, we can assume that this graph converges in the vague topology to an exchangeable limit <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z}, &#92;hat E)}&amp;fg=000000' class='latex' />. Applying the infinitary regularity lemma to this graph, we see that the edge factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7Bi%2Cj%5C%7D%7D+%5Cwedge+%7B%5Cmathcal+B%7D_i+%5Cwedge+%7B%5Cmathcal+B%7D_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{i,j&#92;}} &#92;wedge {&#92;mathcal B}_i &#92;wedge {&#92;mathcal B}_j}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{i,j&#92;}} &#92;wedge {&#92;mathcal B}_i &#92;wedge {&#92;mathcal B}_j}&amp;fg=000000' class='latex' /> for natural numbers <img src='http://s0.wp.com/latex.php?latex=%7Bi%2Cj%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i,j}&amp;fg=000000' title='{i,j}&amp;fg=000000' class='latex' /> are conditionally jointly independent relative to the vertex factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' />.
</p>
<p>
Now for any distinct natural numbers <img src='http://s0.wp.com/latex.php?latex=%7Bi%2Cj%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i,j}&amp;fg=000000' title='{i,j}&amp;fg=000000' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=%7Bf%28i%2Cj%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(i,j)}&amp;fg=000000' title='{f(i,j)}&amp;fg=000000' class='latex' /> be the indicator of the event &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j)}&amp;fg=000000' title='{(i,j)}&amp;fg=000000' class='latex' /> lies in <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat E}&amp;fg=000000' title='{&#92;hat E}&amp;fg=000000' class='latex' />&#8220;, thus <img src='http://s0.wp.com/latex.php?latex=%7Bf%3D1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f=1}&amp;fg=000000' title='{f=1}&amp;fg=000000' class='latex' /> when <img src='http://s0.wp.com/latex.php?latex=%7B%28i%2Cj%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(i,j)}&amp;fg=000000' title='{(i,j)}&amp;fg=000000' class='latex' /> lies in <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat E}&amp;fg=000000' title='{&#92;hat E}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bf%28i%2Cj%29%3D0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(i,j)=0}&amp;fg=000000' title='{f(i,j)=0}&amp;fg=000000' class='latex' /> otherwise. Clearly <img src='http://s0.wp.com/latex.php?latex=%7Bf%28i%2Cj%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(i,j)}&amp;fg=000000' title='{f(i,j)}&amp;fg=000000' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7Bi%2Cj%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{i,j&#92;}}}&amp;fg=000000' class='latex' />-measurable. We can write </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f%28i%2Cj%29+%3D+f_%7BU%5E%5Cperp%7D%28i%2Cj%29+%2B+f_U%28i%2Cj%29+%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f(i,j) = f_{U^&#92;perp}(i,j) + f_U(i,j) &amp;fg=000000' title='&#92;displaystyle  f(i,j) = f_{U^&#92;perp}(i,j) + f_U(i,j) &amp;fg=000000' class='latex' /></p>
<p> where
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f_%7BU%5E%5Cperp%7D%28i%2Cj%29+%3A%3D+%7B%5CBbb+E%7D%28+f%28i%2Cj%29+%26%23124%3B+%7B%5Cmathcal+B%7D_i+%5Cwedge+%7B%5Cmathcal+B%7D_j+%29+%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f_{U^&#92;perp}(i,j) := {&#92;Bbb E}( f(i,j) &#124; {&#92;mathcal B}_i &#92;wedge {&#92;mathcal B}_j ) &amp;fg=000000' title='&#92;displaystyle  f_{U^&#92;perp}(i,j) := {&#92;Bbb E}( f(i,j) &#124; {&#92;mathcal B}_i &#92;wedge {&#92;mathcal B}_j ) &amp;fg=000000' class='latex' /></p>
<p> and
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f_U%28i%2Cj%29+%3A%3D+f%28i%2Cj%29+-+f_%7BU%5E%5Cperp%7D%28i%2Cj%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f_U(i,j) := f(i,j) - f_{U^&#92;perp}(i,j).&amp;fg=000000' title='&#92;displaystyle  f_U(i,j) := f(i,j) - f_{U^&#92;perp}(i,j).&amp;fg=000000' class='latex' /></p>
<p>
The exchangeability of <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G}&amp;fg=000000' title='{&#92;hat G}&amp;fg=000000' class='latex' /> ensures that <img src='http://s0.wp.com/latex.php?latex=%7Bf%2C+f_U%2C+f_%7BU%5E%5Cperp%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f, f_U, f_{U^&#92;perp}}&amp;fg=000000' title='{f, f_U, f_{U^&#92;perp}}&amp;fg=000000' class='latex' /> are exchangeable with respect to permutations of the natural numbers, in particular <img src='http://s0.wp.com/latex.php?latex=%7Bf_U%28i%2Cj%29%3Df_U%28j%2Ci%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_U(i,j)=f_U(j,i)}&amp;fg=000000' title='{f_U(i,j)=f_U(j,i)}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%28i%2Cj%29%3Df_%7BU%5E%5Cperp%7D%28j%2Ci%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}(i,j)=f_{U^&#92;perp}(j,i)}&amp;fg=000000' title='{f_{U^&#92;perp}(i,j)=f_{U^&#92;perp}(j,i)}&amp;fg=000000' class='latex' />.
</p>
<p>
By the infinitary regularity lemma, the <img src='http://s0.wp.com/latex.php?latex=%7Bf_U%28i%2Cj%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_U(i,j)}&amp;fg=000000' title='{f_U(i,j)}&amp;fg=000000' class='latex' /> are jointly independent relative to the <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_i}&amp;fg=000000' title='{{&#92;mathcal B}_i}&amp;fg=000000' class='latex' />, and also have mean zero relative to these factors, so in particular they are infinitely pseudorandom in the sense that </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+f_U%281%2C2%29+f_U%283%2C2%29+f_U%281%2C4%29+f_U%283%2C4%29+%3D+0.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} f_U(1,2) f_U(3,2) f_U(1,4) f_U(3,4) = 0.&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} f_U(1,2) f_U(3,2) f_U(1,4) f_U(3,4) = 0.&amp;fg=000000' class='latex' /></p>
<p> Meanwhile, the random variable <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%281%2C2%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}(1,2)}&amp;fg=000000' title='{f_{U^&#92;perp}(1,2)}&amp;fg=000000' class='latex' /> is measurable with respect to the factor <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_1+%5Cvee+%7B%5Cmathcal+B%7D_2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2}&amp;fg=000000' title='{{&#92;mathcal B}_1 &#92;vee {&#92;mathcal B}_2}&amp;fg=000000' class='latex' />, which is the limit of the factors <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%2C-M%5C%7D+%5Ccup+%5C%7B1%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%2C-M%5C%7D+%5Ccup+%5C%7B2%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#92;} &#92;cup &#92;{2&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#92;} &#92;cup &#92;{2&#92;}}}&amp;fg=000000' class='latex' /> as <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' /> increases. Thus, given any <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+%5Cvarepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde &#92;varepsilon &gt; 0}&amp;fg=000000' title='{&#92;tilde &#92;varepsilon &gt; 0}&amp;fg=000000' class='latex' /> (to be chosen later), one can find an approximation <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+f_%7BU%5E%5Cperp%7D%281%2C2%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde f_{U^&#92;perp}(1,2)}&amp;fg=000000' title='{&#92;tilde f_{U^&#92;perp}(1,2)}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%281%2C2%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}(1,2)}&amp;fg=000000' title='{f_{U^&#92;perp}(1,2)}&amp;fg=000000' class='latex' />, bounded between <img src='http://s0.wp.com/latex.php?latex=%7B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0}&amp;fg=000000' title='{0}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />, which is <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%2C-M%5C%7D+%5Ccup+%5C%7B1%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%2C-M%5C%7D+%5Ccup+%5C%7B2%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#92;} &#92;cup &#92;{2&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#92;} &#92;cup &#92;{2&#92;}}}&amp;fg=000000' class='latex' />-measurable for some <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' />, and such that
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%26%23124%3B%5Ctilde+f_%7BU%5E%5Cperp%7D%281%2C2%29+-+f_%7BU%5E%5Cperp%7D%281%2C2%29%26%23124%3B+%5Cleq+%5Ctilde+%5Cvarepsilon.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#124;&#92;tilde f_{U^&#92;perp}(1,2) - f_{U^&#92;perp}(1,2)&#124; &#92;leq &#92;tilde &#92;varepsilon.&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#124;&#92;tilde f_{U^&#92;perp}(1,2) - f_{U^&#92;perp}(1,2)&#124; &#92;leq &#92;tilde &#92;varepsilon.&amp;fg=000000' class='latex' /></p>
<p> We can also impose the symmetry condition <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+f_%7BU%5E%5Cperp%7D%281%2C2%29+%3D%5Ctilde+f_%7BU%5E%5Cperp%7D%282%2C1%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde f_{U^&#92;perp}(1,2) =&#92;tilde f_{U^&#92;perp}(2,1)}&amp;fg=000000' title='{&#92;tilde f_{U^&#92;perp}(1,2) =&#92;tilde f_{U^&#92;perp}(2,1)}&amp;fg=000000' class='latex' />. Now let <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+%5Cvarepsilon%27+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde &#92;varepsilon&#039; &gt; 0}&amp;fg=000000' title='{&#92;tilde &#92;varepsilon&#039; &gt; 0}&amp;fg=000000' class='latex' /> be an extremely small number (depending on <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+%5Cvarepsilon%2Cn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde &#92;varepsilon,n}&amp;fg=000000' title='{&#92;tilde &#92;varepsilon,n}&amp;fg=000000' class='latex' />) to be chosen later. Then one can find an approximation <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+f_U%281%2C2%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde f_U(1,2)}&amp;fg=000000' title='{&#92;tilde f_U(1,2)}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7Bf_U%281%2C2%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_U(1,2)}&amp;fg=000000' title='{f_U(1,2)}&amp;fg=000000' class='latex' />, bounded between <img src='http://s0.wp.com/latex.php?latex=%7B-1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{-1}&amp;fg=000000' title='{-1}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />, which is <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%2C-M%27%5C%7D+%5Ccup+%5C%7B1%5C%7D%7D+%5Cvee+%7B%5Cmathcal+B%7D_%7B%5C%7B-1%2C-2%2C%5Cldots%2C-M%27%5C%7D+%5Ccup+%5C%7B2%5C%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#039;&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#039;&#92;} &#92;cup &#92;{2&#92;}}}&amp;fg=000000' title='{{&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#039;&#92;} &#92;cup &#92;{1&#92;}} &#92;vee {&#92;mathcal B}_{&#92;{-1,-2,&#92;ldots,-M&#039;&#92;} &#92;cup &#92;{2&#92;}}}&amp;fg=000000' class='latex' />-measurable for some <img src='http://s0.wp.com/latex.php?latex=%7BM%27%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M&#039;}&amp;fg=000000' title='{M&#039;}&amp;fg=000000' class='latex' />, and such that
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%26%23124%3B%5Ctilde+f_%7BU%7D%281%2C2%29+-+f_%7BU%7D%281%2C2%29%26%23124%3B+%5Cleq+%5Ctilde+%5Cvarepsilon%27.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#124;&#92;tilde f_{U}(1,2) - f_{U}(1,2)&#124; &#92;leq &#92;tilde &#92;varepsilon&#039;.&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#124;&#92;tilde f_{U}(1,2) - f_{U}(1,2)&#124; &#92;leq &#92;tilde &#92;varepsilon&#039;.&amp;fg=000000' class='latex' /></p>
<p> Again we can impose the symmetry condition <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+f_%7BU%7D%281%2C2%29+%3D+%5Ctilde+f_%7BU%7D%282%2C1%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde f_{U}(1,2) = &#92;tilde f_{U}(2,1)}&amp;fg=000000' title='{&#92;tilde f_{U}(1,2) = &#92;tilde f_{U}(2,1)}&amp;fg=000000' class='latex' />. We can then extend <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+f_U%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde f_U}&amp;fg=000000' title='{&#92;tilde f_U}&amp;fg=000000' class='latex' /> by exchangeability, so that
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%26%23124%3B%5Ctilde+f_%7BU%7D%28i%2Cj%29+-+f_%7BU%7D%28i%2Cj%29%26%23124%3B+%5Cleq+%5Ctilde+%5Cvarepsilon%27.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#124;&#92;tilde f_{U}(i,j) - f_{U}(i,j)&#124; &#92;leq &#92;tilde &#92;varepsilon&#039;.&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#124;&#92;tilde f_{U}(i,j) - f_{U}(i,j)&#124; &#92;leq &#92;tilde &#92;varepsilon&#039;.&amp;fg=000000' class='latex' /></p>
<p> for all distinct natural numbers <img src='http://s0.wp.com/latex.php?latex=%7Bi%2Cj%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i,j}&amp;fg=000000' title='{i,j}&amp;fg=000000' class='latex' />. By the triangle inequality we then have <a name="f1">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5Ctilde+f_U%281%2C2%29+%5Ctilde+f_U%283%2C2%29+%5Ctilde+f_U%281%2C4%29+%5Ctilde+f_U%283%2C4%29+%3D+O%28%5Ctilde+%5Cvarepsilon%27%29+%5C+%5C+%5C+%5C+%5C+%282%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;tilde f_U(1,2) &#92;tilde f_U(3,2) &#92;tilde f_U(1,4) &#92;tilde f_U(3,4) = O(&#92;tilde &#92;varepsilon&#039;) &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;tilde f_U(1,2) &#92;tilde f_U(3,2) &#92;tilde f_U(1,4) &#92;tilde f_U(3,4) = O(&#92;tilde &#92;varepsilon&#039;) &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' class='latex' /></p>
<p></a> and by a separate application of the triangle inequality <a name="f2">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%26%23124%3Bf%28i%2Cj%29+-+%5Ctilde+f_%7BU%5E%5Cperp%7D%28i%2Cj%29+-+%5Ctilde+f_U%28i%2Cj%29%26%23124%3B+%3D+O%28%5Ctilde+%5Cvarepsilon%29.+%5C+%5C+%5C+%5C+%5C+%283%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#124;f(i,j) - &#92;tilde f_{U^&#92;perp}(i,j) - &#92;tilde f_U(i,j)&#124; = O(&#92;tilde &#92;varepsilon). &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#124;f(i,j) - &#92;tilde f_{U^&#92;perp}(i,j) - &#92;tilde f_U(i,j)&#124; = O(&#92;tilde &#92;varepsilon). &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' class='latex' /></p>
<p></a></p>
<p>
The bounds <a href="#f1">(2)</a>, <a href="#f2">(3)</a> apply to the limiting infinite random graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G+%3D+%28%7B%5Cmathbb+Z%7D%2C%5Chat+E%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G = ({&#92;mathbb Z},&#92;hat E)}&amp;fg=000000' title='{&#92;hat G = ({&#92;mathbb Z},&#92;hat E)}&amp;fg=000000' class='latex' />. On the other hand, all the random variables appearing in <a href="#f1">(2)</a>, <a href="#f2">(3)</a> involve at most finitely many of the edges of the graph. Thus, by vague convergence, the bounds <a href="#f1">(2)</a>, <a href="#f2">(3)</a> also apply to the graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_n+%3D+%28%7B%5Cmathbb+Z%7D%2C+%5Chat+E_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_n = ({&#92;mathbb Z}, &#92;hat E_n)}&amp;fg=000000' title='{&#92;hat G_n = ({&#92;mathbb Z}, &#92;hat E_n)}&amp;fg=000000' class='latex' /> for sufficiently large <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />.
</p>
<p>
Now we unwind the definitions to move back to the finite graphs <img src='http://s0.wp.com/latex.php?latex=%7BG_n+%3D+%28V_n%2CE_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n = (V_n,E_n)}&amp;fg=000000' title='{G_n = (V_n,E_n)}&amp;fg=000000' class='latex' />. Observe that, when applied to the graph <img src='http://s0.wp.com/latex.php?latex=%7B%5Chat+G_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hat G_n}&amp;fg=000000' title='{&#92;hat G_n}&amp;fg=000000' class='latex' />, one has </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Ctilde+f_%7BU%5E%5Cperp%7D%281%2C2%29+%3D+F_%7BU%5E%5Cperp%2Cn%7D%28+v_1%2C+v_2+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;tilde f_{U^&#92;perp}(1,2) = F_{U^&#92;perp,n}( v_1, v_2 )&amp;fg=000000' title='&#92;displaystyle  &#92;tilde f_{U^&#92;perp}(1,2) = F_{U^&#92;perp,n}( v_1, v_2 )&amp;fg=000000' class='latex' /></p>
<p> where <img src='http://s0.wp.com/latex.php?latex=%7BF_%7BU%2Cn%7D%3A+V_n+%5Ctimes+V_n+%5Crightarrow+%5B0%2C1%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F_{U,n}: V_n &#92;times V_n &#92;rightarrow [0,1]}&amp;fg=000000' title='{F_{U,n}: V_n &#92;times V_n &#92;rightarrow [0,1]}&amp;fg=000000' class='latex' /> is a symmetric function which is constant on the pairs of cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM%7D_1%2C%5Cldots%2CV%5E%7BM%7D_%7B2%5E%7BM%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M}_1,&#92;ldots,V^{M}_{2^{M}}}&amp;fg=000000' title='{V^{M}_1,&#92;ldots,V^{M}_{2^{M}}}&amp;fg=000000' class='latex' /> generated the vertex neighbourhoods of <img src='http://s0.wp.com/latex.php?latex=%7Bv_%7B-1%7D%2C%5Cldots%2Cv_%7B-M%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_{-1},&#92;ldots,v_{-M}}&amp;fg=000000' title='{v_{-1},&#92;ldots,v_{-M}}&amp;fg=000000' class='latex' />. Similarly,
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Ctilde+f_%7BU%7D%281%2C2%29+%3D+F_%7BU%2Cn%7D%28+v_1%2C+v_2+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;tilde f_{U}(1,2) = F_{U,n}( v_1, v_2 )&amp;fg=000000' title='&#92;displaystyle  &#92;tilde f_{U}(1,2) = F_{U,n}( v_1, v_2 )&amp;fg=000000' class='latex' /></p>
<p> for some symmetric function <img src='http://s0.wp.com/latex.php?latex=%7BF_%7BU%2Cn%7D%3A+V_n+%5Ctimes+V_n+%5Crightarrow+%5B-1%2C1%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F_{U,n}: V_n &#92;times V_n &#92;rightarrow [-1,1]}&amp;fg=000000' title='{F_{U,n}: V_n &#92;times V_n &#92;rightarrow [-1,1]}&amp;fg=000000' class='latex' />. The estimate <a href="#f1">(2)</a> can then be converted to a uniformity estimate on <img src='http://s0.wp.com/latex.php?latex=%7BF_%7BU%2Cn%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F_{U,n}}&amp;fg=000000' title='{F_{U,n}}&amp;fg=000000' class='latex' />
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+F_%7BU%2Cn%7D%28v_1%2Cv_2%29+F_%7BU%2Cn%7D%28v_3%2Cv_2%29+F_%7BU%2Cn%7D%28v_1%2Cv_4%29+F_%7BU%2Cn%7D%28v_3%2Cv_4%29+%3D+O%28%5Ctilde+%5Cvarepsilon%27%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} F_{U,n}(v_1,v_2) F_{U,n}(v_3,v_2) F_{U,n}(v_1,v_4) F_{U,n}(v_3,v_4) = O(&#92;tilde &#92;varepsilon&#039;)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} F_{U,n}(v_1,v_2) F_{U,n}(v_3,v_2) F_{U,n}(v_1,v_4) F_{U,n}(v_3,v_4) = O(&#92;tilde &#92;varepsilon&#039;)&amp;fg=000000' class='latex' /></p>
<p> while the estimate <a href="#f2">(3)</a> can be similarly converted to
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%26%23124%3B1_%7BE_n%7D%28v_1%2Cv_2%29+-+F_%7BU%5E%5Cperp%2Cn%7D%28v_1%2Cv_2%29+-+F_%7BU%2Cn%7D%28v_1%2Cv_2%29%26%23124%3B+%3D+O%28%5Ctilde+%5Cvarepsilon%29.+%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#124;1_{E_n}(v_1,v_2) - F_{U^&#92;perp,n}(v_1,v_2) - F_{U,n}(v_1,v_2)&#124; = O(&#92;tilde &#92;varepsilon). &amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#124;1_{E_n}(v_1,v_2) - F_{U^&#92;perp,n}(v_1,v_2) - F_{U,n}(v_1,v_2)&#124; = O(&#92;tilde &#92;varepsilon). &amp;fg=000000' class='latex' /></p>
<p> If one then repeats the arguments in the preceding blog post, we conclude (if <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde &#92;varepsilon}&amp;fg=000000' title='{&#92;tilde &#92;varepsilon}&amp;fg=000000' class='latex' /> is sufficiently small depending on <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon}&amp;fg=000000' title='{&#92;varepsilon}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+%5Cvarepsilon%27%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde &#92;varepsilon&#039;}&amp;fg=000000' title='{&#92;tilde &#92;varepsilon&#039;}&amp;fg=000000' class='latex' /> is sufficiently small depending on <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon}&amp;fg=000000' title='{&#92;varepsilon}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%5Ctilde+eps%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;tilde eps}&amp;fg=000000' title='{&#92;tilde eps}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' />) that for <img src='http://s0.wp.com/latex.php?latex=%7B1-%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-&#92;varepsilon}&amp;fg=000000' title='{1-&#92;varepsilon}&amp;fg=000000' class='latex' /> of the choices for <img src='http://s0.wp.com/latex.php?latex=%7Bv_%7B-1%7D%2C%5Cldots%2Cv_%7B-M%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_{-1},&#92;ldots,v_{-M}}&amp;fg=000000' title='{v_{-1},&#92;ldots,v_{-M}}&amp;fg=000000' class='latex' />, the partition <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM%7D_1%2C%5Cldots%2CV%5E%7BM%7D_%7B2%5E%7BM%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M}_1,&#92;ldots,V^{M}_{2^{M}}}&amp;fg=000000' title='{V^{M}_1,&#92;ldots,V^{M}_{2^{M}}}&amp;fg=000000' class='latex' /> induced by the corresponding vertex neighbourhoods will obey <a href="#regular">(1)</a>. But this contradicts the construction of the <img src='http://s0.wp.com/latex.php?latex=%7BG_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_n}&amp;fg=000000' title='{G_n}&amp;fg=000000' class='latex' />, and the claim follows.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Szemeredi's regularity lemma via random partitions]]></title>
<link>http://terrytao.wordpress.com/2009/04/26/szemeredis-regularity-lemma-via-random-partitions/</link>
<pubDate>Mon, 27 Apr 2009 02:46:18 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/04/26/szemeredis-regularity-lemma-via-random-partitions/</guid>
<description><![CDATA[In the theory of dense graphs on vertices, where is large, a fundamental role is played by the Szeme]]></description>
<content:encoded><![CDATA[<p>
 In the theory of dense graphs on <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> vertices, where <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> is large, a fundamental role is played by the <a href="http://en.wikipedia.org/wiki/Szemer&#37;C3&#37;A9di_regularity_lemma">Szemer&#233;di regularity lemma</a>:
</p>
<blockquote><p><b>Lemma 1 (Regularity lemma, standard version)</b> <a name="regularity"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' /> be a graph on <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> vertices, and let <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bk_0+%5Cgeq+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k_0 &#92;geq 0}&amp;fg=000000' title='{k_0 &#92;geq 0}&amp;fg=000000' class='latex' />. Then there exists a partition of the vertices <img src='http://s0.wp.com/latex.php?latex=%7BV+%3D+V_1+%5Ccup+%5Cldots+%5Ccup+V_k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V = V_1 &#92;cup &#92;ldots &#92;cup V_k}&amp;fg=000000' title='{V = V_1 &#92;cup &#92;ldots &#92;cup V_k}&amp;fg=000000' class='latex' />, with <img src='http://s0.wp.com/latex.php?latex=%7Bk_0+%5Cleq+k+%5Cleq+C%28k_0%2C%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k_0 &#92;leq k &#92;leq C(k_0,&#92;epsilon)}&amp;fg=000000' title='{k_0 &#92;leq k &#92;leq C(k_0,&#92;epsilon)}&amp;fg=000000' class='latex' /> bounded below by <img src='http://s0.wp.com/latex.php?latex=%7Bk_0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k_0}&amp;fg=000000' title='{k_0}&amp;fg=000000' class='latex' /> and above by a quantity <img src='http://s0.wp.com/latex.php?latex=%7BC%28k_0%2C%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C(k_0,&#92;epsilon)}&amp;fg=000000' title='{C(k_0,&#92;epsilon)}&amp;fg=000000' class='latex' /> depending only on <img src='http://s0.wp.com/latex.php?latex=%7Bk_0%2C+%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k_0, &#92;epsilon}&amp;fg=000000' title='{k_0, &#92;epsilon}&amp;fg=000000' class='latex' />, obeying the following properties: </p>
<ul>
<li> (Equitable partition) For any <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+i%2Cj+%5Cleq+k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq i,j &#92;leq k}&amp;fg=000000' title='{1 &#92;leq i,j &#92;leq k}&amp;fg=000000' class='latex' />, the cardinalities <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV_i%26%23124%3B%2C+%26%23124%3BV_j%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V_i&#124;, &#124;V_j&#124;}&amp;fg=000000' title='{&#124;V_i&#124;, &#124;V_j&#124;}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7BV_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i}&amp;fg=000000' title='{V_i}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BV_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_j}&amp;fg=000000' title='{V_j}&amp;fg=000000' class='latex' /> differ by at most <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />. </li>
<li> (Regularity) For all but at most <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+k%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon k^2}&amp;fg=000000' title='{&#92;epsilon k^2}&amp;fg=000000' class='latex' /> pairs <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+i+%26%2360%3B+j+%5Cleq+k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq i &lt; j &#92;leq k}&amp;fg=000000' title='{1 &#92;leq i &lt; j &#92;leq k}&amp;fg=000000' class='latex' />, the portion of the graph <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> between <img src='http://s0.wp.com/latex.php?latex=%7BV_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i}&amp;fg=000000' title='{V_i}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BV_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_j}&amp;fg=000000' title='{V_j}&amp;fg=000000' class='latex' /> is <em><img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon}&amp;fg=000000' title='{&#92;epsilon}&amp;fg=000000' class='latex' />-regular</em> in the sense that one has
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3Bd%28+A%2C+B+%29+-+d%28+V_i%2C+V_j+%29%26%23124%3B+%5Cleq+%5Cepsilon%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;d( A, B ) - d( V_i, V_j )&#124; &#92;leq &#92;epsilon&amp;fg=000000' title='&#92;displaystyle  &#124;d( A, B ) - d( V_i, V_j )&#124; &#92;leq &#92;epsilon&amp;fg=000000' class='latex' /></p>
<p> for any <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Csubset+V_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;subset V_i}&amp;fg=000000' title='{A &#92;subset V_i}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BB+%5Csubset+V_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B &#92;subset V_j}&amp;fg=000000' title='{B &#92;subset V_j}&amp;fg=000000' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BA%26%23124%3B+%5Cgeq+%5Cepsilon+%26%23124%3BV_i%26%23124%3B%2C+%26%23124%3BB%26%23124%3B+%5Cgeq+%5Cepsilon+%26%23124%3BV_j%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;A&#124; &#92;geq &#92;epsilon &#124;V_i&#124;, &#124;B&#124; &#92;geq &#92;epsilon &#124;V_j&#124;}&amp;fg=000000' title='{&#124;A&#124; &#92;geq &#92;epsilon &#124;V_i&#124;, &#124;B&#124; &#92;geq &#92;epsilon &#124;V_j&#124;}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7Bd%28A%2CB%29+%3A%3D+%26%23124%3BE+%5Ccap+%28A+%5Ctimes+B%29%26%23124%3B%2F%26%23124%3BA%26%23124%3B+%26%23124%3BB%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d(A,B) := &#124;E &#92;cap (A &#92;times B)&#124;/&#124;A&#124; &#124;B&#124;}&amp;fg=000000' title='{d(A,B) := &#124;E &#92;cap (A &#92;times B)&#124;/&#124;A&#124; &#124;B&#124;}&amp;fg=000000' class='latex' /> is the density of edges between <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BB%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B}&amp;fg=000000' title='{B}&amp;fg=000000' class='latex' />.
</li>
</ul>
</blockquote>
</p>
<p>
This lemma becomes useful in the regime when <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> is very large compared to <img src='http://s0.wp.com/latex.php?latex=%7Bk_0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k_0}&amp;fg=000000' title='{k_0}&amp;fg=000000' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%7B1%2F%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1/&#92;epsilon}&amp;fg=000000' title='{1/&#92;epsilon}&amp;fg=000000' class='latex' />, because all the conclusions of the lemma are uniform in <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />. Very roughly speaking, it says that &#8220;up to errors of size <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon}&amp;fg=000000' title='{&#92;epsilon}&amp;fg=000000' class='latex' />&#8220;, a large graph can be more or less described completely by a bounded number of quantities <img src='http://s0.wp.com/latex.php?latex=%7Bd%28V_i%2C+V_j%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d(V_i, V_j)}&amp;fg=000000' title='{d(V_i, V_j)}&amp;fg=000000' class='latex' />. This can be interpreted as saying that the space of all graphs is <a href="http://en.wikipedia.org/wiki/Totally_bounded">totally bounded</a> (and hence <a href="http://en.wikipedia.org/wiki/Precompact">precompact</a>) in a suitable metric space, thus allowing one to take formal limits of sequences (or subsequences) of graphs; see for instance <a class="snap_noshots" href="http://arxiv.org/abs/math/0408173">this paper of Lovasz and Szegedy</a> for a discussion.
</p>
<p>
For various technical reasons it is easier to work with a slightly weaker version of the lemma, which allows for the cells <img src='http://s0.wp.com/latex.php?latex=%7BV_1%2C%5Cldots%2CV_k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_1,&#92;ldots,V_k}&amp;fg=000000' title='{V_1,&#92;ldots,V_k}&amp;fg=000000' class='latex' /> to have unequal sizes:
</p>
<blockquote><p><b>Lemma 2 (Regularity lemma, weighted version)</b> <a name="regularity2"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' /> be a graph on <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> vertices, and let <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' />. Then there exists a partition of the vertices <img src='http://s0.wp.com/latex.php?latex=%7BV+%3D+V_1+%5Ccup+%5Cldots+%5Ccup+V_k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V = V_1 &#92;cup &#92;ldots &#92;cup V_k}&amp;fg=000000' title='{V = V_1 &#92;cup &#92;ldots &#92;cup V_k}&amp;fg=000000' class='latex' />, with <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+k+%5Cleq+C%28%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq k &#92;leq C(&#92;epsilon)}&amp;fg=000000' title='{1 &#92;leq k &#92;leq C(&#92;epsilon)}&amp;fg=000000' class='latex' /> bounded above by a quantity <img src='http://s0.wp.com/latex.php?latex=%7BC%28%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C(&#92;epsilon)}&amp;fg=000000' title='{C(&#92;epsilon)}&amp;fg=000000' class='latex' /> depending only on <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon}&amp;fg=000000' title='{&#92;epsilon}&amp;fg=000000' class='latex' />, obeying the following properties: </p>
<ul>
<li> (Regularity) One has <a name="regular">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Csum_%7B%28V_i%2CV_j%29+%5Chbox%7B+not+%7D+%5Cepsilon-%5Chbox%7Bregular%7D%7D+%26%23124%3BV_i%26%23124%3B+%26%23124%3BV_j%26%23124%3B+%3D+O%28%5Cepsilon+%26%23124%3BV%26%23124%3B%5E2%29+%5C+%5C+%5C+%5C+%5C+%281%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;sum_{(V_i,V_j) &#92;hbox{ not } &#92;epsilon-&#92;hbox{regular}} &#124;V_i&#124; &#124;V_j&#124; = O(&#92;epsilon &#124;V&#124;^2) &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' title='&#92;displaystyle  &#92;sum_{(V_i,V_j) &#92;hbox{ not } &#92;epsilon-&#92;hbox{regular}} &#124;V_i&#124; &#124;V_j&#124; = O(&#92;epsilon &#124;V&#124;^2) &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' class='latex' /></p>
<p></a> where the sum is over all pairs <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+i+%5Cleq+j+%5Cleq+k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq i &#92;leq j &#92;leq k}&amp;fg=000000' title='{1 &#92;leq i &#92;leq j &#92;leq k}&amp;fg=000000' class='latex' /> for which <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> is not <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon}&amp;fg=000000' title='{&#92;epsilon}&amp;fg=000000' class='latex' />-regular between <img src='http://s0.wp.com/latex.php?latex=%7BV_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i}&amp;fg=000000' title='{V_i}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BV_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_j}&amp;fg=000000' title='{V_j}&amp;fg=000000' class='latex' />.
</li>
</ul>
</blockquote>
</p>
<p>
While Lemma <a href="#regularity2">2</a> is, strictly speaking, weaker than Lemma <a href="#regularity">1</a> in that it does not enforce the equitable size property between the atoms, in practice it seems that the two lemmas are roughly of equal utility; most of the combinatorial consequences of Lemma <a href="#regularity">1</a> can also be proven using Lemma <a href="#regularity2">2</a>. The point is that one always has to remember to weight each cell <img src='http://s0.wp.com/latex.php?latex=%7BV_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i}&amp;fg=000000' title='{V_i}&amp;fg=000000' class='latex' /> by its density <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV_i%26%23124%3B%2F%26%23124%3BV%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V_i&#124;/&#124;V&#124;}&amp;fg=000000' title='{&#124;V_i&#124;/&#124;V&#124;}&amp;fg=000000' class='latex' />, rather than by giving each cell an equal weight as in Lemma <a href="#regularity">1</a>. Lemma <a href="#regularity2">2</a> also has the advantage that one can easily generalise the result from finite vertex sets <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> to other probability spaces (for instance, one could weight <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> with something other than the uniform distribution). For applications to hypergraph regularity, it turns out to be slightly more convenient to have <em>two</em> partitions (coarse and fine) rather than just one; see for instance <a class="snap_noshots" href="http://arxiv.org/abs/math.CO/0503572">my own paper on this topic</a>. In any event the arguments below that we give to prove Lemma <a href="#regularity2">2</a> can be modified to give a proof of Lemma <a href="#regularity">1</a> also. The proof of the regularity lemma is usually conducted by a greedy algorithm. Very roughly speaking, one starts with the trivial partition of <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' />. If this partition already regularises the graph, we are done; if not, this means that there are some sets <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BB%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B}&amp;fg=000000' title='{B}&amp;fg=000000' class='latex' /> in which there is a significant density fluctuation beyond what has already been detected by the original partition. One then adds these sets to the partition and iterates the argument. Every time a new density fluctuation is incorporated into the partition that models the original graph, this increases a certain &#8220;index&#8221; or &#8220;energy&#8221; of the partition. On the other hand, this energy remains bounded no matter how complex the partition, so eventually one must reach a long &#8220;energy plateau&#8221; in which no further refinement is possible, at which point one can find the regular partition.
</p>
<p>
One disadvantage of the greedy algorithm is that it is not efficient in the limit <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Crightarrow+%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n &#92;rightarrow &#92;infty}&amp;fg=000000' title='{n &#92;rightarrow &#92;infty}&amp;fg=000000' class='latex' />, as it requires one to search over <em>all</em> pairs of subsets <img src='http://s0.wp.com/latex.php?latex=%7BA%2C+B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A, B}&amp;fg=000000' title='{A, B}&amp;fg=000000' class='latex' /> of a given pair <img src='http://s0.wp.com/latex.php?latex=%7BV_i%2C+V_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i, V_j}&amp;fg=000000' title='{V_i, V_j}&amp;fg=000000' class='latex' /> of cells, which is an exponentially long search. There are more algorithmically efficient ways to regularise, for instance a polynomial time algorithm was given <a href="http://www.ams.org/mathscinet-getitem?mr=1251840">by Alon, Duke, Lefmann, R&#246;dl, and Yuster</a>. However, one can do even better, if one is willing to (a) allow cells of unequal size, (b) allow a small probability of failure, (c) have the ability to sample vertices from <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> at random, and (d) allow for the cells to be defined &#8220;implicitly&#8221; (via their relationships with a fixed set of reference vertices) rather than &#8220;explicitly&#8221; (as a list of vertices). In that case, one can regularise a graph in a number of operations <em>bounded</em> in <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />. Indeed, one has
</p>
<blockquote><p><b>Lemma 3 (Regularity lemma via random neighbourhoods)</b> <a name="regularity3"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' />. Then there exists integers <img src='http://s0.wp.com/latex.php?latex=%7BM_1%2C%5Cldots%2CM_m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_1,&#92;ldots,M_m}&amp;fg=000000' title='{M_1,&#92;ldots,M_m}&amp;fg=000000' class='latex' /> with the following property: whenever <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' /> be a graph on finitely many vertices, if one selects one of the integers <img src='http://s0.wp.com/latex.php?latex=%7BM_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_r}&amp;fg=000000' title='{M_r}&amp;fg=000000' class='latex' /> at random from <img src='http://s0.wp.com/latex.php?latex=%7BM_1%2C%5Cldots%2CM_m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_1,&#92;ldots,M_m}&amp;fg=000000' title='{M_1,&#92;ldots,M_m}&amp;fg=000000' class='latex' />, then selects <img src='http://s0.wp.com/latex.php?latex=%7BM_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_r}&amp;fg=000000' title='{M_r}&amp;fg=000000' class='latex' /> vertices <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_%7BM_r%7D+%5Cin+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_{M_r} &#92;in V}&amp;fg=000000' title='{v_1,&#92;ldots,v_{M_r} &#92;in V}&amp;fg=000000' class='latex' /> uniformly from <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> at random, then the <img src='http://s0.wp.com/latex.php?latex=%7B2%5E%7BM_r%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^{M_r}}&amp;fg=000000' title='{2^{M_r}}&amp;fg=000000' class='latex' /> vertex cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM_r%7D_1%2C%5Cldots%2CV%5E%7BM_r%7D_%7B2%5E%7BM_r%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M_r}_1,&#92;ldots,V^{M_r}_{2^{M_r}}}&amp;fg=000000' title='{V^{M_r}_1,&#92;ldots,V^{M_r}_{2^{M_r}}}&amp;fg=000000' class='latex' /> (some of which can be empty) generated by the vertex neighbourhoods <img src='http://s0.wp.com/latex.php?latex=%7BA_t+%3A%3D+%5C%7B+v+%5Cin+V%3A+%28v%2Cv_t%29+%5Cin+E+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t := &#92;{ v &#92;in V: (v,v_t) &#92;in E &#92;}}&amp;fg=000000' title='{A_t := &#92;{ v &#92;in V: (v,v_t) &#92;in E &#92;}}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+t+%5Cleq+M_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq t &#92;leq M_r}&amp;fg=000000' title='{1 &#92;leq t &#92;leq M_r}&amp;fg=000000' class='latex' />, will obey the conclusions of Lemma <a href="#regularity2">2</a> with probability at least <img src='http://s0.wp.com/latex.php?latex=%7B1-O%28%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-O(&#92;epsilon)}&amp;fg=000000' title='{1-O(&#92;epsilon)}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
Thus, roughly speaking, one can regularise a graph simply by taking a large number of random vertex neighbourhoods, and using the partition (or Venn diagram) generated by these neighbourhoods as the partition. The intuition is that if there is any non-uniformity in the graph (e.g. if the graph exhibits bipartite behaviour), this will bias the random neighbourhoods to seek out the partitions that would regularise that non-uniformity (e.g. vertex neighbourhoods would begin to fill out the two vertex cells associated to the bipartite property); if one takes sufficiently many such random neighbourhoods, the probability that all detectable non-uniformity is captured by the partition should converge to <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />. (It is more complicated than this, because the finer one makes the partition, the finer the types of non-uniformity one can begin to detect, but this is the basic idea.)
</p>
<p>
This fact seems to be reasonably well-known folklore, discovered independently by many authors; it is for instance quite close to the graph property testing results <a href="http://www.ams.org/mathscinet-getitem?mr=2411033">of Alon and Shapira</a>, and also appears implicitly in a <a href="http://arxiv.org/abs/math/0612838">paper of Ishigami</a>, as well as a <a href="http://arxiv.org/abs/0801.1698">paper of Austin</a> (and perhaps even more implicitly in a <a href="http://arxiv.org/abs/math/0602037">paper of myself</a>). However, in none of these papers is the above lemma stated explicitly. I was asked about this lemma recently, so I decided to provide a proof here.
</p>
<p>
<!--more-->
</p>
</p>
<p align="center"><b> &#8212;  1. Warmup: a weak regularity lemma  &#8212; </b></p>
<p>
To motivate the idea, let&#8217;s first prove a weaker but simpler (and more quantitatively effective) regularity lemma, analogous to that established by Frieze and Kannan:
</p>
<blockquote><p><b>Lemma 4 (Weak regularity lemma via random neighbourhoods)</b> <a name="regularity4"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' />. Then there exists an integer <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' /> with the following property: whenever <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' /> be a graph on finitely many vertices, if one selects <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+t+%5Cleq+M%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq t &#92;leq M}&amp;fg=000000' title='{1 &#92;leq t &#92;leq M}&amp;fg=000000' class='latex' /> at random, then selects <img src='http://s0.wp.com/latex.php?latex=%7Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t}&amp;fg=000000' title='{t}&amp;fg=000000' class='latex' /> vertices <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_t+%5Cin+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_t &#92;in V}&amp;fg=000000' title='{v_1,&#92;ldots,v_t &#92;in V}&amp;fg=000000' class='latex' /> uniformly from <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> at random, then the <img src='http://s0.wp.com/latex.php?latex=%7B2%5E%7Bt%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2^{t}}&amp;fg=000000' title='{2^{t}}&amp;fg=000000' class='latex' /> vertex cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5Et_1%2C%5Cldots%2CV%5Et_%7B2%5Et%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^t_1,&#92;ldots,V^t_{2^t}}&amp;fg=000000' title='{V^t_1,&#92;ldots,V^t_{2^t}}&amp;fg=000000' class='latex' /> (some of which can be empty) generated by the vertex neighbourhoods <img src='http://s0.wp.com/latex.php?latex=%7BA_%7Bt%27%7D+%3A%3D+%5C%7B+v+%5Cin+V%3A+%28v%2Cv_%7Bt%27%7D%29+%5Cin+E+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_{t&#039;} := &#92;{ v &#92;in V: (v,v_{t&#039;}) &#92;in E &#92;}}&amp;fg=000000' title='{A_{t&#039;} := &#92;{ v &#92;in V: (v,v_{t&#039;}) &#92;in E &#92;}}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+t%27+%5Cleq+t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq t&#039; &#92;leq t}&amp;fg=000000' title='{1 &#92;leq t&#039; &#92;leq t}&amp;fg=000000' class='latex' />, obey the following property with probability at least <img src='http://s0.wp.com/latex.php?latex=%7B1-O%28%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-O(&#92;epsilon)}&amp;fg=000000' title='{1-O(&#92;epsilon)}&amp;fg=000000' class='latex' />: for any vertex sets <img src='http://s0.wp.com/latex.php?latex=%7BA%2C+B+%5Csubset+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A, B &#92;subset V}&amp;fg=000000' title='{A, B &#92;subset V}&amp;fg=000000' class='latex' />, the number of edges <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BE+%5Ccap+%28A+%5Ctimes+B%29%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;E &#92;cap (A &#92;times B)&#124;}&amp;fg=000000' title='{&#124;E &#92;cap (A &#92;times B)&#124;}&amp;fg=000000' class='latex' /> connecting <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BB%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B}&amp;fg=000000' title='{B}&amp;fg=000000' class='latex' /> can be approximated by the formula <a name="youf">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3BE+%5Ccap+%28A+%5Ctimes+B%29%26%23124%3B+%3D+%5Csum_%7Bi%3D1%7D%5E%7B2%5Et%7D+%5Csum_%7Bj%3D1%7D%5E%7B2%5Et%7D+d%28V%5Et_i%2CV%5Et_j%29+%26%23124%3BA+%5Ccap+V%5Et_i%26%23124%3B+%26%23124%3BB+%5Ccap+V%5Et_j%26%23124%3B+%2B+O%28+%5Cepsilon+%26%23124%3BV%26%23124%3B%5E2+%29.+%5C+%5C+%5C+%5C+%5C+%282%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;E &#92;cap (A &#92;times B)&#124; = &#92;sum_{i=1}^{2^t} &#92;sum_{j=1}^{2^t} d(V^t_i,V^t_j) &#124;A &#92;cap V^t_i&#124; &#124;B &#92;cap V^t_j&#124; + O( &#92;epsilon &#124;V&#124;^2 ). &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' title='&#92;displaystyle  &#124;E &#92;cap (A &#92;times B)&#124; = &#92;sum_{i=1}^{2^t} &#92;sum_{j=1}^{2^t} d(V^t_i,V^t_j) &#124;A &#92;cap V^t_i&#124; &#124;B &#92;cap V^t_j&#124; + O( &#92;epsilon &#124;V&#124;^2 ). &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' class='latex' /></p>
<p></a> </p></blockquote>
</p>
<p>
This weaker lemma only lets us count &#8220;macroscopic&#8221; edge densities <img src='http://s0.wp.com/latex.php?latex=%7Bd%28A%2CB%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d(A,B)}&amp;fg=000000' title='{d(A,B)}&amp;fg=000000' class='latex' />, when <img src='http://s0.wp.com/latex.php?latex=%7BA%2C+B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A, B}&amp;fg=000000' title='{A, B}&amp;fg=000000' class='latex' /> are dense subsets of <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' />, whereas the full regularity lemma is stronger in that it also controls &#8220;microscopic&#8221; edge densities <img src='http://s0.wp.com/latex.php?latex=%7Bd%28A%2CB%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d(A,B)}&amp;fg=000000' title='{d(A,B)}&amp;fg=000000' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=%7BA%2C+B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A, B}&amp;fg=000000' title='{A, B}&amp;fg=000000' class='latex' /> are now dense subsets of the cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM_r%7D_i%2C+V%5E%7BM_r%7D_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M_r}_i, V^{M_r}_j}&amp;fg=000000' title='{V^{M_r}_i, V^{M_r}_j}&amp;fg=000000' class='latex' />. Nevertheless this weaker lemma is easier to prove and already illustrates many of the ideas.
</p>
<p>
Let&#8217;s now prove this lemma. Fix <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' /> be chosen later, let <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' /> be a graph, and select <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_M%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_M}&amp;fg=000000' title='{v_1,&#92;ldots,v_M}&amp;fg=000000' class='latex' /> at random. (There can of course be many vertices selected more than once; this will not bother us.) Let <img src='http://s0.wp.com/latex.php?latex=%7BA_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t}&amp;fg=000000' title='{A_t}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BV%5Et_1%2C%5Cldots%2CV%5Et_%7B2%5Et%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^t_1,&#92;ldots,V^t_{2^t}}&amp;fg=000000' title='{V^t_1,&#92;ldots,V^t_{2^t}}&amp;fg=000000' class='latex' /> be as in the above lemma. For notational purposes it is more convenient to work with the (random) <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebra <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_t}&amp;fg=000000' title='{{&#92;mathcal B}_t}&amp;fg=000000' class='latex' /> generated by the <img src='http://s0.wp.com/latex.php?latex=%7BA_1%2C%5Cldots%2CA_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_1,&#92;ldots,A_t}&amp;fg=000000' title='{A_1,&#92;ldots,A_t}&amp;fg=000000' class='latex' /> (i.e. the collection of all sets that can be formed from <img src='http://s0.wp.com/latex.php?latex=%7BA_1%2C%5Cldots%2CA_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_1,&#92;ldots,A_t}&amp;fg=000000' title='{A_1,&#92;ldots,A_t}&amp;fg=000000' class='latex' /> by boolean operations); this is an atomic <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebra whose atoms are precisely the (non-empty) cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5Et_1%2C%5Cldots%2CV%5Et_%7B2%5Et%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^t_1,&#92;ldots,V^t_{2^t}}&amp;fg=000000' title='{V^t_1,&#92;ldots,V^t_{2^t}}&amp;fg=000000' class='latex' /> in the partition. Observe that these <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebras are nested: <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_t+%5Csubset+%7B%5Cmathcal+B%7D_%7Bt%2B1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_t &#92;subset {&#92;mathcal B}_{t+1}}&amp;fg=000000' title='{{&#92;mathcal B}_t &#92;subset {&#92;mathcal B}_{t+1}}&amp;fg=000000' class='latex' />.
</p>
<p>
We will use the <a class="snap_noshots" href="http://www.tricki.org/article/Turn_sets_into_functions">trick of turning sets into functions</a>, and view the graph as a function <img src='http://s0.wp.com/latex.php?latex=%7B1_E%3A+V+%5Ctimes+V+%5Crightarrow+%7B%5Cmathbb+R%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1_E: V &#92;times V &#92;rightarrow {&#92;mathbb R}}&amp;fg=000000' title='{1_E: V &#92;times V &#92;rightarrow {&#92;mathbb R}}&amp;fg=000000' class='latex' />. One can then form the <a href="http://en.wikipedia.org/wiki/Conditional_expectation">conditional expectation</a> <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_t+%5Ctimes+%7B%5Cmathcal+B%7D_t+%29%3A+V+%5Ctimes+V+%5Crightarrow+%7B%5Cmathbb+R%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t ): V &#92;times V &#92;rightarrow {&#92;mathbb R}}&amp;fg=000000' title='{{&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t ): V &#92;times V &#92;rightarrow {&#92;mathbb R}}&amp;fg=000000' class='latex' /> of this function to the product <img src='http://s0.wp.com/latex.php?latex=%7B%5Csigma%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sigma}&amp;fg=000000' title='{&#92;sigma}&amp;fg=000000' class='latex' />-algebra <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_t+%5Ctimes+%7B%5Cmathcal+B%7D_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t}&amp;fg=000000' title='{{&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t}&amp;fg=000000' class='latex' />, whose value on <img src='http://s0.wp.com/latex.php?latex=%7BV%5Et_i+%5Ctimes+V%5Et_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^t_i &#92;times V^t_j}&amp;fg=000000' title='{V^t_i &#92;times V^t_j}&amp;fg=000000' class='latex' /> is simply the average value of <img src='http://s0.wp.com/latex.php?latex=%7B1_E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1_E}&amp;fg=000000' title='{1_E}&amp;fg=000000' class='latex' /> on the product set <img src='http://s0.wp.com/latex.php?latex=%7BV%5Et_i+%5Ctimes+V%5Et_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^t_i &#92;times V^t_j}&amp;fg=000000' title='{V^t_i &#92;times V^t_j}&amp;fg=000000' class='latex' />. (When <img src='http://s0.wp.com/latex.php?latex=%7Bi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i}&amp;fg=000000' title='{i}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bj%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{j}&amp;fg=000000' title='{j}&amp;fg=000000' class='latex' /> are different, this is simply the edge density <img src='http://s0.wp.com/latex.php?latex=%7Bd%28V%5Et_i%2C+V%5Et_j%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d(V^t_i, V^t_j)}&amp;fg=000000' title='{d(V^t_i, V^t_j)}&amp;fg=000000' class='latex' />). One can view <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_t+%5Ctimes+%7B%5Cmathcal+B%7D_t+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t )}&amp;fg=000000' title='{{&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t )}&amp;fg=000000' class='latex' /> more combinatorially, as a weighted graph on <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> such that all edges between two distinct cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5Et_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^t_i}&amp;fg=000000' title='{V^t_i}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BV%5Et_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^t_j}&amp;fg=000000' title='{V^t_j}&amp;fg=000000' class='latex' /> have the same constant weight of <img src='http://s0.wp.com/latex.php?latex=%7Bd%28V%5Et_i%2CV%5Et_j%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d(V^t_i,V^t_j)}&amp;fg=000000' title='{d(V^t_i,V^t_j)}&amp;fg=000000' class='latex' />.
</p>
<p>
We give <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> (and <img src='http://s0.wp.com/latex.php?latex=%7BV+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V &#92;times V}&amp;fg=000000' title='{V &#92;times V}&amp;fg=000000' class='latex' />) the uniform probability measure, and define the <em>energy</em> <img src='http://s0.wp.com/latex.php?latex=%7Be_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{e_t}&amp;fg=000000' title='{e_t}&amp;fg=000000' class='latex' /> at time <img src='http://s0.wp.com/latex.php?latex=%7Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t}&amp;fg=000000' title='{t}&amp;fg=000000' class='latex' /> to be the (random) quantity </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++e_t+%3A%3D+%5C%26%23124%3B%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_t+%5Ctimes+%7B%5Cmathcal+B%7D_t+%29+%5C%26%23124%3B_%7BL%5E2%28V+%5Ctimes+V%29%7D%5E2+%3D+%5Cfrac%7B1%7D%7B%26%23124%3BV%26%23124%3B%5E2%7D+%5Csum_%7Bv%2Cw+%5Cin+V%7D+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_t+%5Ctimes+%7B%5Cmathcal+B%7D_t+%29%5E2.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  e_t := &#92;&#124;{&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t ) &#92;&#124;_{L^2(V &#92;times V)}^2 = &#92;frac{1}{&#124;V&#124;^2} &#92;sum_{v,w &#92;in V} {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t )^2.&amp;fg=000000' title='&#92;displaystyle  e_t := &#92;&#124;{&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t ) &#92;&#124;_{L^2(V &#92;times V)}^2 = &#92;frac{1}{&#124;V&#124;^2} &#92;sum_{v,w &#92;in V} {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t )^2.&amp;fg=000000' class='latex' /></p>
<p> one can interpret this as the mean square of the edge densities <img src='http://s0.wp.com/latex.php?latex=%7Bd%28V%5Et_i%2C+V%5Et_j%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d(V^t_i, V^t_j)}&amp;fg=000000' title='{d(V^t_i, V^t_j)}&amp;fg=000000' class='latex' />, weighted by the size of the cells <img src='http://s0.wp.com/latex.php?latex=%7BV%5Et_i%2C+V%5Et_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^t_i, V^t_j}&amp;fg=000000' title='{V^t_i, V^t_j}&amp;fg=000000' class='latex' />. From Pythagoras&#8217; theorem we have the identity
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++e_%7Bt%27%7D+%3D+e_t+%2B+%5C%26%23124%3B+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_%7Bt%27%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7Bt%27%7D+%29+-+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_t+%5Ctimes+%7B%5Cmathcal+B%7D_t+%29+%5C%26%23124%3B_%7BL%5E2%28V+%5Ctimes+V%29%7D%5E2%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  e_{t&#039;} = e_t + &#92;&#124; {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{t&#039;} &#92;times {&#92;mathcal B}_{t&#039;} ) - {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t ) &#92;&#124;_{L^2(V &#92;times V)}^2&amp;fg=000000' title='&#92;displaystyle  e_{t&#039;} = e_t + &#92;&#124; {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{t&#039;} &#92;times {&#92;mathcal B}_{t&#039;} ) - {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t ) &#92;&#124;_{L^2(V &#92;times V)}^2&amp;fg=000000' class='latex' /></p>
<p> for all <img src='http://s0.wp.com/latex.php?latex=%7Bt%27%26%2362%3Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t&#039;&gt;t}&amp;fg=000000' title='{t&#039;&gt;t}&amp;fg=000000' class='latex' />; in particular, the <img src='http://s0.wp.com/latex.php?latex=%7Be_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{e_t}&amp;fg=000000' title='{e_t}&amp;fg=000000' class='latex' /> are increasing in <img src='http://s0.wp.com/latex.php?latex=%7Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t}&amp;fg=000000' title='{t}&amp;fg=000000' class='latex' />. This implies that the expectations <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D+e_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E} e_t}&amp;fg=000000' title='{{&#92;Bbb E} e_t}&amp;fg=000000' class='latex' /> are also increasing in <img src='http://s0.wp.com/latex.php?latex=%7Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t}&amp;fg=000000' title='{t}&amp;fg=000000' class='latex' />. On the other hand, these expectations are bounded between <img src='http://s0.wp.com/latex.php?latex=%7B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0}&amp;fg=000000' title='{0}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />. Thus, if we select <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+t+%5Cleq+M%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq t &#92;leq M}&amp;fg=000000' title='{1 &#92;leq t &#92;leq M}&amp;fg=000000' class='latex' /> at random, expectation of
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%28e_%7Bt%2B2%7D+-+e_%7Bt%7D%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} (e_{t+2} - e_{t})&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} (e_{t+2} - e_{t})&amp;fg=000000' class='latex' /></p>
<p> telescopes to be <img src='http://s0.wp.com/latex.php?latex=%7BO%281%2FM%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(1/M)}&amp;fg=000000' title='{O(1/M)}&amp;fg=000000' class='latex' />. Thus, by Markov&#8217;s inequality, with probability <img src='http://s0.wp.com/latex.php?latex=%7B1-O%28%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-O(&#92;epsilon)}&amp;fg=000000' title='{1-O(&#92;epsilon)}&amp;fg=000000' class='latex' /> we can freeze <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_t}&amp;fg=000000' title='{v_1,&#92;ldots,v_t}&amp;fg=000000' class='latex' /> such that we have the conditional expectation bound <a name="josh">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D%28e_%7Bt%2B2%7D+-+e_t%26%23124%3B+v_1%2C%5Cldots%2Cv_t%29+%3D+O%28+%5Cfrac%7B1%7D%7BM%5Cepsilon%7D+%29.+%5C+%5C+%5C+%5C+%5C+%283%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}(e_{t+2} - e_t&#124; v_1,&#92;ldots,v_t) = O( &#92;frac{1}{M&#92;epsilon} ). &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}(e_{t+2} - e_t&#124; v_1,&#92;ldots,v_t) = O( &#92;frac{1}{M&#92;epsilon} ). &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' class='latex' /></p>
<p></a> Suppose <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_t}&amp;fg=000000' title='{v_1,&#92;ldots,v_t}&amp;fg=000000' class='latex' /> have this property. We split
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++1_E+%3D+f_%7BU%5E%5Cperp%7D+%2B+f_U%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  1_E = f_{U^&#92;perp} + f_U&amp;fg=000000' title='&#92;displaystyle  1_E = f_{U^&#92;perp} + f_U&amp;fg=000000' class='latex' /></p>
<p> where
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f_%7BU%5E%5Cperp%7D+%3A%3D+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_t+%5Ctimes+%7B%5Cmathcal+B%7D_t+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f_{U^&#92;perp} := {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t )&amp;fg=000000' title='&#92;displaystyle  f_{U^&#92;perp} := {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_t &#92;times {&#92;mathcal B}_t )&amp;fg=000000' class='latex' /></p>
<p> and
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f_U+%3A%3D+1_E+-+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_%7Bt%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7Bt%7D+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f_U := 1_E - {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{t} &#92;times {&#92;mathcal B}_{t} ).&amp;fg=000000' title='&#92;displaystyle  f_U := 1_E - {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{t} &#92;times {&#92;mathcal B}_{t} ).&amp;fg=000000' class='latex' /></p>
<p>
We now assert that the partition <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7Bt%7D_1%2C%5Cldots%2CV%5E%7Bt%7D_%7B2%5Et%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{t}_1,&#92;ldots,V^{t}_{2^t}}&amp;fg=000000' title='{V^{t}_1,&#92;ldots,V^{t}_{2^t}}&amp;fg=000000' class='latex' /> induced by <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7Bt%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{t}}&amp;fg=000000' title='{{&#92;mathcal B}_{t}}&amp;fg=000000' class='latex' /> obeys the conclusions of Lemma <a href="#regularity3">3</a>. For this, we observe various properties on the two components of <img src='http://s0.wp.com/latex.php?latex=%7B1_E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1_E}&amp;fg=000000' title='{1_E}&amp;fg=000000' class='latex' />:
</p>
<blockquote><p><b>Lemma 5 (<img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}}&amp;fg=000000' title='{f_{U^&#92;perp}}&amp;fg=000000' class='latex' /> is structured)</b>  <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}}&amp;fg=000000' title='{f_{U^&#92;perp}}&amp;fg=000000' class='latex' /> is constant on each product set <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7Bt%7D_i+%5Ctimes+V%5E%7Bt%7D_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{t}_i &#92;times V^{t}_j}&amp;fg=000000' title='{V^{t}_i &#92;times V^{t}_j}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
<em>Proof:</em>  This is clear from construction. <img src='http://s0.wp.com/latex.php?latex=%5CBox%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box&amp;fg=000000' title='&#92;Box&amp;fg=000000' class='latex' /></p>
<blockquote><p><b>Lemma 6 (<img src='http://s0.wp.com/latex.php?latex=%7Bf_U%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_U}&amp;fg=000000' title='{f_U}&amp;fg=000000' class='latex' /> is pseudorandom)</b> <a name="fu"></a> The expression
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cfrac%7B1%7D%7B%26%23124%3BV%26%23124%3B%5E4%7D+%5Csum_%7Bv%2Cw%2Cv%27%2Cw%27+%5Cin+V%7D+f_U%28v%2Cw%29+f_U%28v%2Cw%27%29+f_U%28v%27%2Cw%29+f_U%28v%27%2Cw%27%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;frac{1}{&#124;V&#124;^4} &#92;sum_{v,w,v&#039;,w&#039; &#92;in V} f_U(v,w) f_U(v,w&#039;) f_U(v&#039;,w) f_U(v&#039;,w&#039;)&amp;fg=000000' title='&#92;displaystyle  &#92;frac{1}{&#124;V&#124;^4} &#92;sum_{v,w,v&#039;,w&#039; &#92;in V} f_U(v,w) f_U(v,w&#039;) f_U(v&#039;,w) f_U(v&#039;,w&#039;)&amp;fg=000000' class='latex' /></p>
<p> is of size <img src='http://s0.wp.com/latex.php?latex=%7BO%28+%5Cfrac%7B1%7D%7B%5Csqrt%7BM+%5Cepsilon%7D%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O( &#92;frac{1}{&#92;sqrt{M &#92;epsilon}} )}&amp;fg=000000' title='{O( &#92;frac{1}{&#92;sqrt{M &#92;epsilon}} )}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
<em>Proof:</em>  The left-hand side can be rewritten as </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5Cfrac%7B1%7D%7B%26%23124%3BV%26%23124%3B%5E2%7D+%5Csum_%7Bv%2Cw+%5Cin+V%7D+f_U%28v%2Cw%29+f_U%28v%2Cv_%7Bt%2B2%7D%29+f_U%28v_%7Bt%2B1%7D%2Cw%29+f_U%28v_%7Bt%2B1%7D%2Cv_%7Bt%2B2%7D%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;frac{1}{&#124;V&#124;^2} &#92;sum_{v,w &#92;in V} f_U(v,w) f_U(v,v_{t+2}) f_U(v_{t+1},w) f_U(v_{t+1},v_{t+2}).&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;frac{1}{&#124;V&#124;^2} &#92;sum_{v,w &#92;in V} f_U(v,w) f_U(v,v_{t+2}) f_U(v_{t+1},w) f_U(v_{t+1},v_{t+2}).&amp;fg=000000' class='latex' /></p>
<p> Observe that the function <img src='http://s0.wp.com/latex.php?latex=%7B%28v%2Cw%29+%5Cmapsto+f_U%28v%2Cv_%7Bt%2B2%7D%29+f_U%28v_%7Bt%2B1%7D%2Cw%29+f_U%28v_%7Bt%2B1%7D%2Cv_%7Bt%2B2%7D%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(v,w) &#92;mapsto f_U(v,v_{t+2}) f_U(v_{t+1},w) f_U(v_{t+1},v_{t+2})}&amp;fg=000000' title='{(v,w) &#92;mapsto f_U(v,v_{t+2}) f_U(v_{t+1},w) f_U(v_{t+1},v_{t+2})}&amp;fg=000000' class='latex' /> is measurable with respect to <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7Bt%2B2%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7Bt%2B2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{t+2} &#92;times {&#92;mathcal B}_{t+2}}&amp;fg=000000' title='{{&#92;mathcal B}_{t+2} &#92;times {&#92;mathcal B}_{t+2}}&amp;fg=000000' class='latex' />, so we can rewrite this expression as
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5Cfrac%7B1%7D%7B%26%23124%3BV%26%23124%3B%5E2%7D+%5Csum_%7Bv%2Cw+%5Cin+V%7D+%7B%5CBbb+E%7D%28f_U%26%23124%3B%7B%5Cmathcal+B%7D_%7Bt%2B2%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7Bt%2B2%7D%29%28v%2Cw%29+f_U%28v%2Cv_%7Bt%2B2%7D%29+f_U%28v_%7Bt%2B1%7D%2Cw%29+f_U%28v_%7Bt%2B1%7D%2Cv_%7Bt%2B2%7D%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;frac{1}{&#124;V&#124;^2} &#92;sum_{v,w &#92;in V} {&#92;Bbb E}(f_U&#124;{&#92;mathcal B}_{t+2} &#92;times {&#92;mathcal B}_{t+2})(v,w) f_U(v,v_{t+2}) f_U(v_{t+1},w) f_U(v_{t+1},v_{t+2}).&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;frac{1}{&#124;V&#124;^2} &#92;sum_{v,w &#92;in V} {&#92;Bbb E}(f_U&#124;{&#92;mathcal B}_{t+2} &#92;times {&#92;mathcal B}_{t+2})(v,w) f_U(v,v_{t+2}) f_U(v_{t+1},w) f_U(v_{t+1},v_{t+2}).&amp;fg=000000' class='latex' /></p>
<p> Applying Cauchy-Schwarz, one can bound this by
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5C%26%23124%3B%7B%5CBbb+E%7D%28f_U%26%23124%3B%7B%5Cmathcal+B%7D_%7Bt%2B2%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7Bt%2B2%7D%29%5C%26%23124%3B_%7BL%5E2%28V+%5Ctimes+V%29%7D.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;&#124;{&#92;Bbb E}(f_U&#124;{&#92;mathcal B}_{t+2} &#92;times {&#92;mathcal B}_{t+2})&#92;&#124;_{L^2(V &#92;times V)}.&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;&#124;{&#92;Bbb E}(f_U&#124;{&#92;mathcal B}_{t+2} &#92;times {&#92;mathcal B}_{t+2})&#92;&#124;_{L^2(V &#92;times V)}.&amp;fg=000000' class='latex' /></p>
<p> But from Pythagoras we have
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D%28f_U%26%23124%3B%7B%5Cmathcal+B%7D_%7Bt%2B2%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7Bt%2B2%7D%29%5E2+%3D+e_%7Bt%2B2%7D+-+e_t%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}(f_U&#124;{&#92;mathcal B}_{t+2} &#92;times {&#92;mathcal B}_{t+2})^2 = e_{t+2} - e_t&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}(f_U&#124;{&#92;mathcal B}_{t+2} &#92;times {&#92;mathcal B}_{t+2})^2 = e_{t+2} - e_t&amp;fg=000000' class='latex' /></p>
<p> and so the claim follows from <a href="#josh">(3)</a> and another application of Cauchy-Schwarz. <img src='http://s0.wp.com/latex.php?latex=%5CBox%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box&amp;fg=000000' title='&#92;Box&amp;fg=000000' class='latex' /></p>
<p>
Now we can prove Lemma <a href="#regularity4">4</a>. Observe that </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3BE+%5Ccap+%28A+%5Ctimes+B%29%26%23124%3B+-+%5Csum_%7Bi%3D1%7D%5E%7B2%5Et%7D+%5Csum_%7Bj%3D1%7D%5E%7B2%5Et%7D+d%28V%5Et_i%2CV%5Et_j%29+%26%23124%3BA+%5Ccap+V%5Et_i%26%23124%3B+%26%23124%3BB+%5Ccap+V%5Et_j%26%23124%3B+%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;E &#92;cap (A &#92;times B)&#124; - &#92;sum_{i=1}^{2^t} &#92;sum_{j=1}^{2^t} d(V^t_i,V^t_j) &#124;A &#92;cap V^t_i&#124; &#124;B &#92;cap V^t_j&#124; &amp;fg=000000' title='&#92;displaystyle  &#124;E &#92;cap (A &#92;times B)&#124; - &#92;sum_{i=1}^{2^t} &#92;sum_{j=1}^{2^t} d(V^t_i,V^t_j) &#124;A &#92;cap V^t_i&#124; &#124;B &#92;cap V^t_j&#124; &amp;fg=000000' class='latex' /></p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%3D+%5Csum_%7Bv%2Cw+%5Cin+V%7D+1_A%28v%29+1_B%28w%29+f_U%28v%2Cw%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle = &#92;sum_{v,w &#92;in V} 1_A(v) 1_B(w) f_U(v,w).&amp;fg=000000' title='&#92;displaystyle = &#92;sum_{v,w &#92;in V} 1_A(v) 1_B(w) f_U(v,w).&amp;fg=000000' class='latex' /></p>
<p> Applying Cauchy-Schwarz twice in <img src='http://s0.wp.com/latex.php?latex=%7Bv%2C+w%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v, w}&amp;fg=000000' title='{v, w}&amp;fg=000000' class='latex' /> and using Lemma <a href="#fu">6</a>, we see that the RHS is <img src='http://s0.wp.com/latex.php?latex=%7BO%28+%28M%5Cepsilon%29%5E%7B-1%2F8%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O( (M&#92;epsilon)^{-1/8} )}&amp;fg=000000' title='{O( (M&#92;epsilon)^{-1/8} )}&amp;fg=000000' class='latex' />; choosing <img src='http://s0.wp.com/latex.php?latex=%7BM+%5Cgg+%5Cepsilon%5E%7B-9%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M &#92;gg &#92;epsilon^{-9}}&amp;fg=000000' title='{M &#92;gg &#92;epsilon^{-9}}&amp;fg=000000' class='latex' /> we obtain the claim.</p>
<p align="center"><b> &#8212;  2. Strong regularity via random neighbourhoods  &#8212; </b></p>
<p>
We now prove Lemma <a href="#regularity3">3</a>, which of course implies Lemma <a href="#regularity2">2</a>.
</p>
<p>
Fix <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon &gt; 0}&amp;fg=000000' title='{&#92;epsilon &gt; 0}&amp;fg=000000' class='latex' /> and a graph <img src='http://s0.wp.com/latex.php?latex=%7BG+%3D+%28V%2CE%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G = (V,E)}&amp;fg=000000' title='{G = (V,E)}&amp;fg=000000' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> vertices. We randomly select an infinite sequence <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C+v_2%2C+%5Cldots+%5Cin+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1, v_2, &#92;ldots &#92;in V}&amp;fg=000000' title='{v_1, v_2, &#92;ldots &#92;in V}&amp;fg=000000' class='latex' /> of vertices in <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' />, drawn uniformly and independently at random. We define <img src='http://s0.wp.com/latex.php?latex=%7BA_t%2C+V%5Et_i%2C+%7B%5Cmathcal+B%7D_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t, V^t_i, {&#92;mathcal B}_t}&amp;fg=000000' title='{A_t, V^t_i, {&#92;mathcal B}_t}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7Be_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{e_t}&amp;fg=000000' title='{e_t}&amp;fg=000000' class='latex' />, as before.
</p>
<p>
Now let <img src='http://s0.wp.com/latex.php?latex=%7Bm%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m}&amp;fg=000000' title='{m}&amp;fg=000000' class='latex' /> be a large number depending on <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon &gt; 0}&amp;fg=000000' title='{&#92;varepsilon &gt; 0}&amp;fg=000000' class='latex' /> to be chosen later, let <img src='http://s0.wp.com/latex.php?latex=%7BF%3A+%7B%5Cmathbb+Z%7D%5E%2B+%5Crightarrow+%7B%5Cmathbb+Z%7D%5E%2B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F: {&#92;mathbb Z}^+ &#92;rightarrow {&#92;mathbb Z}^+}&amp;fg=000000' title='{F: {&#92;mathbb Z}^+ &#92;rightarrow {&#92;mathbb Z}^+}&amp;fg=000000' class='latex' /> be a rapidly growing function (also to be chosen later), and set <img src='http://s0.wp.com/latex.php?latex=%7BM_1+%3A%3D+F%281%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_1 := F(1)}&amp;fg=000000' title='{M_1 := F(1)}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BM_r+%3A%3D+2%28M_%7Br-1%7D+%2B+F%28M_%7Br-1%7D%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_r := 2(M_{r-1} + F(M_{r-1}))}&amp;fg=000000' title='{M_r := 2(M_{r-1} + F(M_{r-1}))}&amp;fg=000000' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+r+%5Cleq+m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq r &#92;leq m}&amp;fg=000000' title='{1 &#92;leq r &#92;leq m}&amp;fg=000000' class='latex' />, thus <img src='http://s0.wp.com/latex.php?latex=%7BM_1+%26%2360%3B+M_2+%26%2360%3B+%5Cldots+%26%2360%3B+M_%7Bm%2B1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_1 &lt; M_2 &lt; &#92;ldots &lt; M_{m+1}}&amp;fg=000000' title='{M_1 &lt; M_2 &lt; &#92;ldots &lt; M_{m+1}}&amp;fg=000000' class='latex' /> grows rapidly to infinity. The expected energies <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D+e_%7BM_r%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E} e_{M_r}}&amp;fg=000000' title='{{&#92;Bbb E} e_{M_r}}&amp;fg=000000' class='latex' /> are increasing from <img src='http://s0.wp.com/latex.php?latex=%7B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0}&amp;fg=000000' title='{0}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />, thus if we pick <img src='http://s0.wp.com/latex.php?latex=%7B1+%5Cleq+r+%5Cleq+m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1 &#92;leq r &#92;leq m}&amp;fg=000000' title='{1 &#92;leq r &#92;leq m}&amp;fg=000000' class='latex' /> uniformly at random, the expectation of </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+e_%7BM_%7Br%2B1%7D%7D+-+e_%7BM_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} e_{M_{r+1}} - e_{M_r}&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} e_{M_{r+1}} - e_{M_r}&amp;fg=000000' class='latex' /></p>
<p> telescopes to be <img src='http://s0.wp.com/latex.php?latex=%7BO%281%2Fm%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(1/m)}&amp;fg=000000' title='{O(1/m)}&amp;fg=000000' class='latex' />. Thus, by Markov&#8217;s inequality, with probability <img src='http://s0.wp.com/latex.php?latex=%7B1-O%28%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-O(&#92;epsilon)}&amp;fg=000000' title='{1-O(&#92;epsilon)}&amp;fg=000000' class='latex' /> we will have
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+e_%7BM_%7Br%2B1%7D%7D+-+e_%7BM_r%7D+%3D+O%28+%5Cfrac%7B1%7D%7Bm%5Cepsilon%7D+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} e_{M_{r+1}} - e_{M_r} = O( &#92;frac{1}{m&#92;epsilon} ).&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} e_{M_{r+1}} - e_{M_r} = O( &#92;frac{1}{m&#92;epsilon} ).&amp;fg=000000' class='latex' /></p>
<p> Assume that <img src='http://s0.wp.com/latex.php?latex=%7Br%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r}&amp;fg=000000' title='{r}&amp;fg=000000' class='latex' /> is chosen to obey this. Then, by another application of the pigeonhole principle, we can find <img src='http://s0.wp.com/latex.php?latex=%7BM_%7Br%2B1%7D%2F2+%5Cleq+t+%26%2360%3B+M_%7Br%2B1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_{r+1}/2 &#92;leq t &lt; M_{r+1}}&amp;fg=000000' title='{M_{r+1}/2 &#92;leq t &lt; M_{r+1}}&amp;fg=000000' class='latex' /> such that
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%28e_%7Bt%2B2%7D+-+e_%7Bt%7D%29+%3D+O%28+%5Cfrac%7B1%7D%7Bm%5Cepsilon+M_%7Br%2B1%7D%7D+%29+%3D+O%28+%5Cfrac%7B1%7D%7Bm%5Cepsilon+F%28M_r%29%7D+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} (e_{t+2} - e_{t}) = O( &#92;frac{1}{m&#92;epsilon M_{r+1}} ) = O( &#92;frac{1}{m&#92;epsilon F(M_r)} ).&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} (e_{t+2} - e_{t}) = O( &#92;frac{1}{m&#92;epsilon M_{r+1}} ) = O( &#92;frac{1}{m&#92;epsilon F(M_r)} ).&amp;fg=000000' class='latex' /></p>
<p> Fix this <img src='http://s0.wp.com/latex.php?latex=%7Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t}&amp;fg=000000' title='{t}&amp;fg=000000' class='latex' />. We have
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%28e_%7Bt%7D+-+e_%7BM_r%7D%29+%3D+O%28+%5Cfrac%7B1%7D%7Bm%5Cepsilon%7D+%29%2C%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} (e_{t} - e_{M_r}) = O( &#92;frac{1}{m&#92;epsilon} ),&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} (e_{t} - e_{M_r}) = O( &#92;frac{1}{m&#92;epsilon} ),&amp;fg=000000' class='latex' /></p>
<p> so by Markov&#8217;s inequality, with probability <img src='http://s0.wp.com/latex.php?latex=%7B1-O%28%5Cepsilon%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1-O(&#92;epsilon)}&amp;fg=000000' title='{1-O(&#92;epsilon)}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7Bv_1%2C%5Cldots%2Cv_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v_1,&#92;ldots,v_t}&amp;fg=000000' title='{v_1,&#92;ldots,v_t}&amp;fg=000000' class='latex' /> are such that <a name="etmr">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++e_%7Bt%7D+-+e_%7BM_r%7D+%3D+O%28+%5Cfrac%7B1%7D%7Bm%5Cepsilon%5E2%7D+%29+%5C+%5C+%5C+%5C+%5C+%284%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  e_{t} - e_{M_r} = O( &#92;frac{1}{m&#92;epsilon^2} ) &#92; &#92; &#92; &#92; &#92; (4)&amp;fg=000000' title='&#92;displaystyle  e_{t} - e_{M_r} = O( &#92;frac{1}{m&#92;epsilon^2} ) &#92; &#92; &#92; &#92; &#92; (4)&amp;fg=000000' class='latex' /></p>
<p></a> and also obey the conditional expectation bound <a name="cond">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D%28e_%7Bt%2B2%7D+-+e_t+%26%23124%3B+v_1%2C%5Cldots%2Cv_t%29+%3D+O%28+%5Cfrac%7B1%7D%7Bm%5Cepsilon+F%28M_r%29%7D+%29.+%5C+%5C+%5C+%5C+%5C+%285%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}(e_{t+2} - e_t &#124; v_1,&#92;ldots,v_t) = O( &#92;frac{1}{m&#92;epsilon F(M_r)} ). &#92; &#92; &#92; &#92; &#92; (5)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}(e_{t+2} - e_t &#124; v_1,&#92;ldots,v_t) = O( &#92;frac{1}{m&#92;epsilon F(M_r)} ). &#92; &#92; &#92; &#92; &#92; (5)&amp;fg=000000' class='latex' /></p>
<p></a> Assume that this is the case. We split
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++1_E+%3D+f_%7BU%5E%5Cperp%7D+%2B+f_%7Berr%7D+%2B+f_U%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  1_E = f_{U^&#92;perp} + f_{err} + f_U&amp;fg=000000' title='&#92;displaystyle  1_E = f_{U^&#92;perp} + f_{err} + f_U&amp;fg=000000' class='latex' /></p>
<p> where
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f_%7BU%5E%5Cperp%7D+%3A%3D+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_%7BM_r%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7BM_r%7D+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f_{U^&#92;perp} := {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{M_r} &#92;times {&#92;mathcal B}_{M_r} )&amp;fg=000000' title='&#92;displaystyle  f_{U^&#92;perp} := {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{M_r} &#92;times {&#92;mathcal B}_{M_r} )&amp;fg=000000' class='latex' /></p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f_%7Berr%7D+%3A%3D+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_%7Bt%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7Bt%7D+%29+-+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_%7BM_r%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7BM_r%7D+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f_{err} := {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{t} &#92;times {&#92;mathcal B}_{t} ) - {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{M_r} &#92;times {&#92;mathcal B}_{M_r} )&amp;fg=000000' title='&#92;displaystyle  f_{err} := {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{t} &#92;times {&#92;mathcal B}_{t} ) - {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{M_r} &#92;times {&#92;mathcal B}_{M_r} )&amp;fg=000000' class='latex' /></p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f_U+%3A%3D+1_E+-+%7B%5CBbb+E%7D%281_E+%26%23124%3B+%7B%5Cmathcal+B%7D_%7Bt%7D+%5Ctimes+%7B%5Cmathcal+B%7D_%7Bt%7D+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f_U := 1_E - {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{t} &#92;times {&#92;mathcal B}_{t} ).&amp;fg=000000' title='&#92;displaystyle  f_U := 1_E - {&#92;Bbb E}(1_E &#124; {&#92;mathcal B}_{t} &#92;times {&#92;mathcal B}_{t} ).&amp;fg=000000' class='latex' /></p>
<p>
We now assert that the partition <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM_r%7D_1%2C%5Cldots%2CV%5E%7BM_r%7D_%7B2%5E%7BM_r%7D%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M_r}_1,&#92;ldots,V^{M_r}_{2^{M_r}}}&amp;fg=000000' title='{V^{M_r}_1,&#92;ldots,V^{M_r}_{2^{M_r}}}&amp;fg=000000' class='latex' /> induced by <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cmathcal+B%7D_%7BM_r%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;mathcal B}_{M_r}}&amp;fg=000000' title='{{&#92;mathcal B}_{M_r}}&amp;fg=000000' class='latex' /> obeys the conclusions of Lemma <a href="#regularity2">2</a>. For this, we observe various properties on the three components of <img src='http://s0.wp.com/latex.php?latex=%7B1_E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1_E}&amp;fg=000000' title='{1_E}&amp;fg=000000' class='latex' />:
</p>
<blockquote><p><b>Lemma 7 (<img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}}&amp;fg=000000' title='{f_{U^&#92;perp}}&amp;fg=000000' class='latex' /> locally constant)</b>  <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}}&amp;fg=000000' title='{f_{U^&#92;perp}}&amp;fg=000000' class='latex' /> is constant on each product set <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%7BM_r%7D_i+%5Ctimes+V%5E%7BM_r%7D_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^{M_r}_i &#92;times V^{M_r}_j}&amp;fg=000000' title='{V^{M_r}_i &#92;times V^{M_r}_j}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
<em>Proof:</em>  This is clear from construction. <img src='http://s0.wp.com/latex.php?latex=%5CBox%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box&amp;fg=000000' title='&#92;Box&amp;fg=000000' class='latex' /></p>
<blockquote><p><b>Lemma 8 (<img src='http://s0.wp.com/latex.php?latex=%7Bf_%7Berr%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{err}}&amp;fg=000000' title='{f_{err}}&amp;fg=000000' class='latex' /> small)</b> <a name="fsm"></a> We have <img src='http://s0.wp.com/latex.php?latex=%7B%5C%26%23124%3B+f_%7Berr%7D+%5C%26%23124%3B_%7BL%5E2%28V+%5Ctimes+V%29%7D%5E2+%3D+O%28+%5Cfrac%7B1%7D%7Bm%5Cepsilon%5E2%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;&#124; f_{err} &#92;&#124;_{L^2(V &#92;times V)}^2 = O( &#92;frac{1}{m&#92;epsilon^2} )}&amp;fg=000000' title='{&#92;&#124; f_{err} &#92;&#124;_{L^2(V &#92;times V)}^2 = O( &#92;frac{1}{m&#92;epsilon^2} )}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
<em>Proof:</em>  This follows from <a href="#etmr">(4)</a> and Pythagoras&#8217; theorem. <img src='http://s0.wp.com/latex.php?latex=%5CBox%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box&amp;fg=000000' title='&#92;Box&amp;fg=000000' class='latex' /></p>
<blockquote><p><b>Lemma 9 (<img src='http://s0.wp.com/latex.php?latex=%7Bf_U%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_U}&amp;fg=000000' title='{f_U}&amp;fg=000000' class='latex' /> uniform)</b> <a name="frum"></a> The expression
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cfrac%7B1%7D%7B%26%23124%3BV%26%23124%3B%5E4%7D+%5Csum_%7Bv%2Cw%2Cv%27%2Cw%27+%5Cin+V%7D+f_U%28v%2Cw%29+f_U%28v%2Cw%27%29+f_U%28v%27%2Cw%29+f_U%28v%27%2Cw%27%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;frac{1}{&#124;V&#124;^4} &#92;sum_{v,w,v&#039;,w&#039; &#92;in V} f_U(v,w) f_U(v,w&#039;) f_U(v&#039;,w) f_U(v&#039;,w&#039;)&amp;fg=000000' title='&#92;displaystyle  &#92;frac{1}{&#124;V&#124;^4} &#92;sum_{v,w,v&#039;,w&#039; &#92;in V} f_U(v,w) f_U(v,w&#039;) f_U(v&#039;,w) f_U(v&#039;,w&#039;)&amp;fg=000000' class='latex' /></p>
<p> is of size <img src='http://s0.wp.com/latex.php?latex=%7BO%28+%5Cfrac%7B1%7D%7B%5Csqrt%7Bm+%5Cepsilon+F%28M_r%29%7D%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O( &#92;frac{1}{&#92;sqrt{m &#92;epsilon F(M_r)}} )}&amp;fg=000000' title='{O( &#92;frac{1}{&#92;sqrt{m &#92;epsilon F(M_r)}} )}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
<em>Proof:</em>  This follows by repeating the proof of Lemma <a href="#fu">6</a>, but using <a href="#cond">(5)</a> instead of <a href="#josh">(3)</a>. <img src='http://s0.wp.com/latex.php?latex=%5CBox%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box&amp;fg=000000' title='&#92;Box&amp;fg=000000' class='latex' /></p>
<p>
Now we verify the regularity.
</p>
<p>
First, we eliminate <em>small atoms</em>: the pairs <img src='http://s0.wp.com/latex.php?latex=%7B%28V_i%2CV_j%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(V_i,V_j)}&amp;fg=000000' title='{(V_i,V_j)}&amp;fg=000000' class='latex' /> for which <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV%5E%7BM_r%7D_i%26%23124%3B+%5Cleq+%5Cepsilon+%26%23124%3BV%26%23124%3B+%2F+2%5E%7BM_r%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V^{M_r}_i&#124; &#92;leq &#92;epsilon &#124;V&#124; / 2^{M_r}}&amp;fg=000000' title='{&#124;V^{M_r}_i&#124; &#92;leq &#92;epsilon &#124;V&#124; / 2^{M_r}}&amp;fg=000000' class='latex' /> clearly give a net contribution of at most <img src='http://s0.wp.com/latex.php?latex=%7BO%28+%5Cepsilon+%26%23124%3BV%26%23124%3B%5E2+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O( &#92;epsilon &#124;V&#124;^2 )}&amp;fg=000000' title='{O( &#92;epsilon &#124;V&#124;^2 )}&amp;fg=000000' class='latex' /> and are acceptable; similarly for those pairs for which <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BV%5E%7BM_r%7D_j%26%23124%3B+%5Cleq+%5Cepsilon+%26%23124%3BV%26%23124%3B+%2F+2%5E%7BM_r%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;V^{M_r}_j&#124; &#92;leq &#92;epsilon &#124;V&#124; / 2^{M_r}}&amp;fg=000000' title='{&#124;V^{M_r}_j&#124; &#92;leq &#92;epsilon &#124;V&#124; / 2^{M_r}}&amp;fg=000000' class='latex' />. So we may henceforth assume that <a name="vij-small">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3BV%5E%7BM_r%7D_i%26%23124%3B%2C+%26%23124%3BV%5E%7BM_r%7D_j%26%23124%3B+%5Cleq+%5Cepsilon+%26%23124%3BV%26%23124%3B+%2F+2%5E%7BM_r%7D.+%5C+%5C+%5C+%5C+%5C+%286%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;V^{M_r}_i&#124;, &#124;V^{M_r}_j&#124; &#92;leq &#92;epsilon &#124;V&#124; / 2^{M_r}. &#92; &#92; &#92; &#92; &#92; (6)&amp;fg=000000' title='&#92;displaystyle  &#124;V^{M_r}_i&#124;, &#124;V^{M_r}_j&#124; &#92;leq &#92;epsilon &#124;V&#124; / 2^{M_r}. &#92; &#92; &#92; &#92; &#92; (6)&amp;fg=000000' class='latex' /></p>
<p></a>
</p>
<p>
Now, let <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Csubset+V%5E%7BM_r%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;subset V^{M_r}_i}&amp;fg=000000' title='{A &#92;subset V^{M_r}_i}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BB+%5Csubset+V%5E%7BM_r%7D_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{B &#92;subset V^{M_r}_i}&amp;fg=000000' title='{B &#92;subset V^{M_r}_i}&amp;fg=000000' class='latex' /> have densities </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Calpha+%3A%3D+%26%23124%3BA%26%23124%3B%2F%26%23124%3BV%5E%7BM_r%7D_i%26%23124%3B+%5Cge+%5Cepsilon%3B+%5Cbeta+%3A%3D+%26%23124%3BB%26%23124%3B%2F%26%23124%3BV%5E%7BM_r%7D_j%26%23124%3B+%5Cgeq+%5Cepsilon%2C%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;alpha := &#124;A&#124;/&#124;V^{M_r}_i&#124; &#92;ge &#92;epsilon; &#92;beta := &#124;B&#124;/&#124;V^{M_r}_j&#124; &#92;geq &#92;epsilon,&amp;fg=000000' title='&#92;displaystyle  &#92;alpha := &#124;A&#124;/&#124;V^{M_r}_i&#124; &#92;ge &#92;epsilon; &#92;beta := &#124;B&#124;/&#124;V^{M_r}_j&#124; &#92;geq &#92;epsilon,&amp;fg=000000' class='latex' /></p>
<p> then
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Calpha+%5Cbeta+d%28A%2CB%29+%3D+%5Cfrac%7B1%7D%7B%26%23124%3BV%5E%7BM_r%7D_i%26%23124%3B+%26%23124%3BV%5E%7BM_r%7D_j%26%23124%3B%7D+%5Csum_%7Bv+%5Cin+V%5E%7BM_r%7D_i%7D+%5Csum_%7Bw+%5Cin+V%5E%7BM_r%7D_i%7D+1_A%28v%29+1_B%28w%29+1_E%28v%2Cw%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;alpha &#92;beta d(A,B) = &#92;frac{1}{&#124;V^{M_r}_i&#124; &#124;V^{M_r}_j&#124;} &#92;sum_{v &#92;in V^{M_r}_i} &#92;sum_{w &#92;in V^{M_r}_i} 1_A(v) 1_B(w) 1_E(v,w).&amp;fg=000000' title='&#92;displaystyle  &#92;alpha &#92;beta d(A,B) = &#92;frac{1}{&#124;V^{M_r}_i&#124; &#124;V^{M_r}_j&#124;} &#92;sum_{v &#92;in V^{M_r}_i} &#92;sum_{w &#92;in V^{M_r}_i} 1_A(v) 1_B(w) 1_E(v,w).&amp;fg=000000' class='latex' /></p>
<p> We divide <img src='http://s0.wp.com/latex.php?latex=%7B1_E%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1_E}&amp;fg=000000' title='{1_E}&amp;fg=000000' class='latex' /> into the three pieces <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}}&amp;fg=000000' title='{f_{U^&#92;perp}}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7Berr%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{err}}&amp;fg=000000' title='{f_{err}}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7Bf_U%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_U}&amp;fg=000000' title='{f_U}&amp;fg=000000' class='latex' />. </p>
<p>
The contribution of <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7BU%5E%5Cperp%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{U^&#92;perp}}&amp;fg=000000' title='{f_{U^&#92;perp}}&amp;fg=000000' class='latex' /> is exactly <img src='http://s0.wp.com/latex.php?latex=%7B%5Calpha+%5Cbeta+d%28V%5E%7BM_r%7D_i%2C+V%5E%7BM_r%7D_j%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;alpha &#92;beta d(V^{M_r}_i, V^{M_r}_j)}&amp;fg=000000' title='{&#92;alpha &#92;beta d(V^{M_r}_i, V^{M_r}_j)}&amp;fg=000000' class='latex' />.
</p>
<p>
The contribution of <img src='http://s0.wp.com/latex.php?latex=%7Bf_%7Berr%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_{err}}&amp;fg=000000' title='{f_{err}}&amp;fg=000000' class='latex' /> can be bounded using Cauchy-Schwarz as </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++O%28+%5Cfrac%7B1%7D%7B%26%23124%3BV%5E%7BM_r%7D_i%26%23124%3B+%26%23124%3BV%5E%7BM_r%7D_j%26%23124%3B%7D+%5Csum_%7Bv+%5Cin+V%5E%7BM_r%7D_i%7D+%5Csum_%7Bw+%5Cin+V%5E%7BM_r%7D_i%7D+%26%23124%3Bf_%7Berr%7D%28v%2Cw%29%26%23124%3B%5E2%29%5E%7B1%2F2%7D.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  O( &#92;frac{1}{&#124;V^{M_r}_i&#124; &#124;V^{M_r}_j&#124;} &#92;sum_{v &#92;in V^{M_r}_i} &#92;sum_{w &#92;in V^{M_r}_i} &#124;f_{err}(v,w)&#124;^2)^{1/2}.&amp;fg=000000' title='&#92;displaystyle  O( &#92;frac{1}{&#124;V^{M_r}_i&#124; &#124;V^{M_r}_j&#124;} &#92;sum_{v &#92;in V^{M_r}_i} &#92;sum_{w &#92;in V^{M_r}_i} &#124;f_{err}(v,w)&#124;^2)^{1/2}.&amp;fg=000000' class='latex' /></p>
<p> Using Lemma <a href="#fsm">8</a> and Chebyshev&#8217;s inequality, we see that the pairs <img src='http://s0.wp.com/latex.php?latex=%7B%28V_i%2CV_j%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(V_i,V_j)}&amp;fg=000000' title='{(V_i,V_j)}&amp;fg=000000' class='latex' /> for which this quantity exceeds <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%5E3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon^3}&amp;fg=000000' title='{&#92;epsilon^3}&amp;fg=000000' class='latex' /> will contribute at most <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%5E%7B-8%7D%2Fm%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon^{-8}/m}&amp;fg=000000' title='{&#92;epsilon^{-8}/m}&amp;fg=000000' class='latex' /> to <a href="#regular">(1)</a>, which is acceptable if we choose <img src='http://s0.wp.com/latex.php?latex=%7Bm%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m}&amp;fg=000000' title='{m}&amp;fg=000000' class='latex' /> so that <img src='http://s0.wp.com/latex.php?latex=%7Bm+%5Cgg+%5Cepsilon%5E%7B-9%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m &#92;gg &#92;epsilon^{-9}}&amp;fg=000000' title='{m &#92;gg &#92;epsilon^{-9}}&amp;fg=000000' class='latex' />. Let us now discard these bad pairs.</p>
<p>
Finally, the contribution of <img src='http://s0.wp.com/latex.php?latex=%7Bf_U%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f_U}&amp;fg=000000' title='{f_U}&amp;fg=000000' class='latex' /> can be bounded by two applications of Cauchy-Schwarz and <a href="#frum">(9)</a> as </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++O%28+%5Cfrac%7B%26%23124%3BV%26%23124%3B%5E2%7D%7B%26%23124%3BV%5E%7BM_r%7D_i%26%23124%3B+%26%23124%3BV%5E%7BM_r%7D_j%26%23124%3B%7D+%5Cfrac%7B1%7D%7B%28m+%5Cepsilon+F%28M_r%29%29%5E%7B1%2F8%7D%7D+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  O( &#92;frac{&#124;V&#124;^2}{&#124;V^{M_r}_i&#124; &#124;V^{M_r}_j&#124;} &#92;frac{1}{(m &#92;epsilon F(M_r))^{1/8}} )&amp;fg=000000' title='&#92;displaystyle  O( &#92;frac{&#124;V&#124;^2}{&#124;V^{M_r}_i&#124; &#124;V^{M_r}_j&#124;} &#92;frac{1}{(m &#92;epsilon F(M_r))^{1/8}} )&amp;fg=000000' class='latex' /></p>
<p> which by <a href="#vij-small">(6)</a> is bounded by
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++O%28+2%5E%7B2M_r%7D+%5Cepsilon%5E%7B-2%7D+%2F+%28m+%5Cepsilon+F%28M_r%29+%29%5E%7B1%2F8%7D+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  O( 2^{2M_r} &#92;epsilon^{-2} / (m &#92;epsilon F(M_r) )^{1/8} ).&amp;fg=000000' title='&#92;displaystyle  O( 2^{2M_r} &#92;epsilon^{-2} / (m &#92;epsilon F(M_r) )^{1/8} ).&amp;fg=000000' class='latex' /></p>
<p> This can be made <img src='http://s0.wp.com/latex.php?latex=%7BO%28%5Cepsilon%5E3%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(&#92;epsilon^3)}&amp;fg=000000' title='{O(&#92;epsilon^3)}&amp;fg=000000' class='latex' /> by selecting <img src='http://s0.wp.com/latex.php?latex=%7BF%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F}&amp;fg=000000' title='{F}&amp;fg=000000' class='latex' /> sufficiently rapidly growing depending on <img src='http://s0.wp.com/latex.php?latex=%7B%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;epsilon}&amp;fg=000000' title='{&#92;epsilon}&amp;fg=000000' class='latex' />. Putting this all together we see that
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Calpha+%5Cbeta+d%28A%2CB%29+%3D+%5Calpha+%5Cbeta+d%28V%5E%7BM_r%7D_i%2C+V%5E%7BM_r%7D_j%29+%2B+O%28%5Cepsilon%5E3%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;alpha &#92;beta d(A,B) = &#92;alpha &#92;beta d(V^{M_r}_i, V^{M_r}_j) + O(&#92;epsilon^3)&amp;fg=000000' title='&#92;displaystyle  &#92;alpha &#92;beta d(A,B) = &#92;alpha &#92;beta d(V^{M_r}_i, V^{M_r}_j) + O(&#92;epsilon^3)&amp;fg=000000' class='latex' /></p>
<p> which (since <img src='http://s0.wp.com/latex.php?latex=%7B%5Calpha%2C%5Cbeta+%5Cgeq+%5Cepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;alpha,&#92;beta &#92;geq &#92;epsilon}&amp;fg=000000' title='{&#92;alpha,&#92;beta &#92;geq &#92;epsilon}&amp;fg=000000' class='latex' />) gives the desired regularity.</p>
<blockquote><p><b>Remark 1</b>  Of course, this argument gives tower-exponential bounds (as <img src='http://s0.wp.com/latex.php?latex=%7BF%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F}&amp;fg=000000' title='{F}&amp;fg=000000' class='latex' /> is exponential and needs to be iterated <img src='http://s0.wp.com/latex.php?latex=%7Bm%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m}&amp;fg=000000' title='{m}&amp;fg=000000' class='latex' /> times), which will be familiar to any reader already acquainted with the regularity lemma. </p></blockquote>
</p>
<blockquote><p><b>Remark 2</b>  One can take the partition induced by random neighbourhoods here and carve it up further to be both equitable and (mostly) regular, thus recovering a proof of Lemma 1, by following the arguments <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.CO/0504472">in this paper of mine</a>. Of course, when one does so, one no longer has a partition created purely from random neighbourhoods, but it is pretty clear that one is not going to be able to make an equitable partition just from boolean operations applied to a few random neighbourhoods. </p></blockquote></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Air Heart]]></title>
<link>http://mathography.wordpress.com/2009/04/15/air-heart/</link>
<pubDate>Thu, 16 Apr 2009 06:10:15 +0000</pubDate>
<dc:creator>momar06</dc:creator>
<guid>http://mathography.wordpress.com/2009/04/15/air-heart/</guid>
<description><![CDATA[Let be a d-dimensional polytope (defined below) whose vertices have integer coordinates. For any pos]]></description>
<content:encoded><![CDATA[<p>Let <img src='http://s0.wp.com/latex.php?latex=P+%5Csubset+%5Cmathbb%7BR%7D%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P &#92;subset &#92;mathbb{R}^d' title='P &#92;subset &#92;mathbb{R}^d' class='latex' /> be a d-dimensional polytope (defined below) whose vertices have integer coordinates. For any positive integer <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n' title='n' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=f_P+%28n%29+%3D+%26%23124%3B%5C%7Bx+%5Cin+P+%5C+%26%23124%3B+%5C+nx+%5Cin+%5Cmathbb%7BZ%7D%5Ed+%5C%7D%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_P (n) = &#124;&#92;{x &#92;in P &#92; &#124; &#92; nx &#92;in &#92;mathbb{Z}^d &#92;}&#124;' title='f_P (n) = &#124;&#92;{x &#92;in P &#92; &#124; &#92; nx &#92;in &#92;mathbb{Z}^d &#92;}&#124;' class='latex' />.  It is known that for any such polytope, <img src='http://s0.wp.com/latex.php?latex=f_P%28n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_P(n)' title='f_P(n)' class='latex' /> is a degree d polynomial in n with rational coefficients.  Given this as fact, prove that the volume of <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P' title='P' class='latex' /> is the leading coefficient in <img src='http://s0.wp.com/latex.php?latex=f_P%28n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_P(n)' title='f_P(n)' class='latex' />.</p>
<p>As an example, if P is the square in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mathbb{R}^2' title='&#92;mathbb{R}^2' class='latex' /> with vertices (0,0),(2,0),(0,2),(2,2), then <img src='http://s0.wp.com/latex.php?latex=f_P%28n%29+%3D+%7B%282n%2B1%29%7D%5E2+%3D+4n%5E2+%2B+4n%2B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_P(n) = {(2n+1)}^2 = 4n^2 + 4n+ 1' title='f_P(n) = {(2n+1)}^2 = 4n^2 + 4n+ 1' class='latex' /> , and the volume of the square in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mathbb{R}^2' title='&#92;mathbb{R}^2' class='latex' /> is 4.</p>
<p>Background:</p>
<p><img src='http://s0.wp.com/latex.php?latex=P+%5Csubset+%5Cmathbb%7BR%7D%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P &#92;subset &#92;mathbb{R}^d' title='P &#92;subset &#92;mathbb{R}^d' class='latex' /> is a d-dimensional polytope if :</p>
<p>1) P is the intersection of finitely many halfspaces.</p>
<p>2) P is bounded</p>
<p>3) P is d-dimensional</p>
<p>Some examples of such polytopes in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mathbb{R}^3' title='&#92;mathbb{R}^3' class='latex' /> are cubes, rectangular prisms, simplices, octahedra, dodecahedra, etc.  A square in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mathbb{R}^3' title='&#92;mathbb{R}^3' class='latex' /> is a polytope but is not 3-dimensional this is not an example of such a <img src='http://s0.wp.com/latex.php?latex=P&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P' title='P' class='latex' />.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Polymath1 and three new proofs of the density Hales-Jewett theorem]]></title>
<link>http://terrytao.wordpress.com/2009/04/02/polymath1-and-three-new-proofs-of-the-density-hales-jewett-theorem/</link>
<pubDate>Fri, 03 Apr 2009 01:00:32 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/04/02/polymath1-and-three-new-proofs-of-the-density-hales-jewett-theorem/</guid>
<description><![CDATA[This week I gave a talk at UCLA on some aspects of polymath1 project, and specifically on the three]]></description>
<content:encoded><![CDATA[<p>
 This week I gave a talk at UCLA on some aspects of <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1 project</a>, and specifically on the three new combinatorial proofs of the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Density_Hales-Jewett_theorem">density Hales-Jewett theorem</a> that have been obtained as a consequence of that project. (There have been a number of other achievements of this project, including some accomplished on the <a class="snap_noshots" href="http://terrytao.wordpress.com/tag/polymath1/">polymath1 threads hosted on this blog</a>, but I will have to postpone a presentation of those to a later post.) It seems that at least two of the proofs will extend to prove this theorem for arbitrary alphabet lengths, but for this talk I will focus on the <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> case. The theorem to prove is then
</p>
<blockquote><p><b>Theorem 1 (<img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> density Hales-Jewett theorem)</b> <a name="dhj"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B0+%26%2360%3B+%5Cdelta+%5Cleq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &lt; &#92;delta &#92;leq 1}&amp;fg=000000' title='{0 &lt; &#92;delta &#92;leq 1}&amp;fg=000000' class='latex' />. Then if <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> is a sufficiently large integer, any subset <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> of the cube <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En+%3D+%5C%7B1%2C2%2C3%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n = &#92;{1,2,3&#92;}^n}&amp;fg=000000' title='{[3]^n = &#92;{1,2,3&#92;}^n}&amp;fg=000000' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BA%26%23124%3B%2F3%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;A&#124;/3^n}&amp;fg=000000' title='{&#124;A&#124;/3^n}&amp;fg=000000' class='latex' /> at least <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> contains at least one combinatorial line <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+%5Cell%281%29%2C+%5Cell%282%29%2C+%5Cell%283%29%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ &#92;ell(1), &#92;ell(2), &#92;ell(3)&#92;}}&amp;fg=000000' title='{&#92;{ &#92;ell(1), &#92;ell(2), &#92;ell(3)&#92;}}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell+%5Cin+%5C%7B1%2C2%2C3%2Cx%5C%7D%5En+%5Cbackslash+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell &#92;in &#92;{1,2,3,x&#92;}^n &#92;backslash [3]^n}&amp;fg=000000' title='{&#92;ell &#92;in &#92;{1,2,3,x&#92;}^n &#92;backslash [3]^n}&amp;fg=000000' class='latex' /> is a string of <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />s, <img src='http://s0.wp.com/latex.php?latex=%7B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2}&amp;fg=000000' title='{2}&amp;fg=000000' class='latex' />s, <img src='http://s0.wp.com/latex.php?latex=%7B3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{3}&amp;fg=000000' title='{3}&amp;fg=000000' class='latex' />s, and <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' />&#8216;s containing at least one &#8220;wildcard&#8221; <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%28i%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell(i)}&amp;fg=000000' title='{&#92;ell(i)}&amp;fg=000000' class='latex' /> is the string formed from <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell}&amp;fg=000000' title='{&#92;ell}&amp;fg=000000' class='latex' /> by replacing all <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' />&#8216;s with <img src='http://s0.wp.com/latex.php?latex=%7Bi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i}&amp;fg=000000' title='{i}&amp;fg=000000' class='latex' />&#8216;s. </p></blockquote>
</p>
<p>
The full density Hales-Jewett theorem is the same statement, but with <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]}&amp;fg=000000' title='{[3]}&amp;fg=000000' class='latex' /> replaced by <img src='http://s0.wp.com/latex.php?latex=%7B%5Bk%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[k]}&amp;fg=000000' title='{[k]}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7Bk+%5Cgeq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k &#92;geq 1}&amp;fg=000000' title='{k &#92;geq 1}&amp;fg=000000' class='latex' />. (The case <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=1}&amp;fg=000000' title='{k=1}&amp;fg=000000' class='latex' /> is trivial, and the case <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=2}&amp;fg=000000' title='{k=2}&amp;fg=000000' class='latex' /> follows from <a href="http://en.wikipedia.org/wiki/Sperner&#37;27s_theorem">Sperner&#8217;s theorem</a>.)
</p>
<p>
This theorem was first proven <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1001397">by Furstenberg and Katznelson</a>, by first converting it to a statement in ergodic theory; the original paper of Furstenberg-Katznelson argument was for the <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> case only, and gave only part of the proof in detail, but in <a class="snap_noshots" href="http://math.stanford.edu/ katznel/dhj12.pdf">a subsequent paper</a> a full proof in general <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> was provided. The remaining components of the original <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> argument were later completed in <a class="snap_noshots" href="http://www.msci.memphis.edu/ randall/preprints/HJk3.pdf">unpublished notes of McCutcheon</a>. One of the new proofs is essentially a finitary translation of this <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> argument; in principle one could also finitise the significantly more complicated argument of Furstenberg and Katznelson for general <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' />, but this has not been properly carried out yet (the other two proofs are likely to generalise much more easily to higher <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' />). The result is considered quite deep; for instance, the general <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> case of the density Hales-Jewett theorem already implies <a href="http://en.wikipedia.org/wiki/Szemeredi&#37;27s_theorem">Szemer&#233;di&#8217;s theorem</a>, which is a highly non-trivial theorem in its own right, as a special case.
</p>
<p>
Another of the proofs is based primarily on the density increment method that goes back to Roth, and also incorporates some ideas from a <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=369299">paper of Ajtai and Szemer&#233;di</a> establishing what we have called the &#8220;<a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Corners_theorem">corners theorem</a>&#8221; (and which is also implied by the <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> case of the density Hales-Jewett theorem). A key new idea involved studying the correlations of the original set <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> with special subsets of <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' />, such as <img src='http://s0.wp.com/latex.php?latex=%7Bij%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{ij}&amp;fg=000000' title='{ij}&amp;fg=000000' class='latex' />-insensitive sets, or intersections of <img src='http://s0.wp.com/latex.php?latex=%7Bij%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{ij}&amp;fg=000000' title='{ij}&amp;fg=000000' class='latex' />-insensitive and <img src='http://s0.wp.com/latex.php?latex=%7Bik%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{ik}&amp;fg=000000' title='{ik}&amp;fg=000000' class='latex' />-insensitive sets. Work is currently ongoing to extend this argument to higher <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' />, and it seems that there are no major obstacles in doing so.
</p>
<p>
This correlations idea inspired a new ergodic proof of the density Hales-Jewett theorem for all values of <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> <a class="snap_noshots" href="http://arxiv.org/abs/0903.1633">by Austin</a>, which is in the spirit of the triangle removal lemma (or hypergraph removal lemma) proofs of Roth&#8217;s theorem (or the multidimensional Szemer&#233;di theorem). A finitary translation of this argument in the <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> case has been sketched out; I believe it also extends in a relatively straightforward manner to the higher <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> case (in analogy with some proofs of the hypergraph removal lemma ).
</p>
<p>
<!--more-->
</p>
</p>
<p align="center"><b> &#8212;  1. Simpler cases of density Hales-Jewett  &#8212; </b></p>
<p>
In order to motivate the known proofs of the density Hales-Jewett theorem, it is instructive to consider some simpler theorems which are implied by this theorem. The first is the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Corners_theorem">corners theorem</a> of Ajtai and Szemer&#233;di:
</p>
<blockquote><p><b>Theorem 2 (Corners theorem)</b> <a name="corners"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B0+%26%2360%3B+%5Cdelta+%5Cleq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &lt; &#92;delta &#92;leq 1}&amp;fg=000000' title='{0 &lt; &#92;delta &#92;leq 1}&amp;fg=000000' class='latex' />. Then if <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> is a sufficiently large integer, any subset <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> of the square <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%5D%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n]^2}&amp;fg=000000' title='{[n]^2}&amp;fg=000000' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BA%26%23124%3B%2Fn%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;A&#124;/n^2}&amp;fg=000000' title='{&#124;A&#124;/n^2}&amp;fg=000000' class='latex' /> at least <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> contains at least one right-angled triangle (or &#8220;corner&#8221;) <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+%28x%2Cy%29%2C+%28x%2Br%2Cy%29%2C+%28x%2Cy%2Br%29+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ (x,y), (x+r,y), (x,y+r) &#92;}}&amp;fg=000000' title='{&#92;{ (x,y), (x+r,y), (x,y+r) &#92;}}&amp;fg=000000' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%7Br+%5Cneq+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r &#92;neq 0}&amp;fg=000000' title='{r &#92;neq 0}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
The <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> density Hales-Jewett theorem implies the corners theorem; this is proven by utilising the map <img src='http://s0.wp.com/latex.php?latex=%7B%5Cphi%3A+%5B3%5D%5En+%5Crightarrow+%5Bn%5D%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;phi: [3]^n &#92;rightarrow [n]^2}&amp;fg=000000' title='{&#92;phi: [3]^n &#92;rightarrow [n]^2}&amp;fg=000000' class='latex' /> from the cube to the square, defined by mapping a string <img src='http://s0.wp.com/latex.php?latex=%7Bx+%5Cin+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x &#92;in [3]^n}&amp;fg=000000' title='{x &#92;in [3]^n}&amp;fg=000000' class='latex' /> to a pair <img src='http://s0.wp.com/latex.php?latex=%7B%28a%2Cb%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(a,b)}&amp;fg=000000' title='{(a,b)}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7Ba%2C+b%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a, b}&amp;fg=000000' title='{a, b}&amp;fg=000000' class='latex' /> are the number of <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />s and <img src='http://s0.wp.com/latex.php?latex=%7B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2}&amp;fg=000000' title='{2}&amp;fg=000000' class='latex' />s respectively in <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' />. The key point is that <img src='http://s0.wp.com/latex.php?latex=%7B%5Cphi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;phi}&amp;fg=000000' title='{&#92;phi}&amp;fg=000000' class='latex' /> maps combinatorial lines to corners. (Strictly speaking, this mapping only establishes the corners theorem for dense subsets of <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%2F3+-+%5Csqrt%7Bn%7D%2C+n%2F3+%2B+%5Csqrt%7Bn%7D%5D%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n/3 - &#92;sqrt{n}, n/3 + &#92;sqrt{n}]^2}&amp;fg=000000' title='{[n/3 - &#92;sqrt{n}, n/3 + &#92;sqrt{n}]^2}&amp;fg=000000' class='latex' />, but it is not difficult to obtain the general case from this by replacing <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> by <img src='http://s0.wp.com/latex.php?latex=%7Bn%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n^2}&amp;fg=000000' title='{n^2}&amp;fg=000000' class='latex' /> and using translation-invariance.)
</p>
<p>
(The corners theorem is also closely related to the problem of finding dense sets of points in a triangular grid without any equilateral triangles, a problem which we have called <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Fujimura&#37;27s_problem">Fujimura&#8217;s problem</a>.
</p>
<p>
The corners theorem in turn implies
</p>
<blockquote><p><b>Theorem 3 (Roth&#8217;s theorem)</b> <a name="roth"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B0+%26%2360%3B+%5Cdelta+%5Cleq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &lt; &#92;delta &#92;leq 1}&amp;fg=000000' title='{0 &lt; &#92;delta &#92;leq 1}&amp;fg=000000' class='latex' />. Then if <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> is a sufficiently large integer, any subset <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> of the interval <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n]}&amp;fg=000000' title='{[n]}&amp;fg=000000' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BA%26%23124%3B%2Fn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;A&#124;/n}&amp;fg=000000' title='{&#124;A&#124;/n}&amp;fg=000000' class='latex' /> at least <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> contains at least one arithmetic progression <img src='http://s0.wp.com/latex.php?latex=%7Ba%2C+a%2Br%2C+a%2B2r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a, a+r, a+2r}&amp;fg=000000' title='{a, a+r, a+2r}&amp;fg=000000' class='latex' /> of length three. </p></blockquote>
</p>
<p>
Roth&#8217;s theorem can be deduced from the corners theorem by considering the map <img src='http://s0.wp.com/latex.php?latex=%7B%5Cpsi%3A+%5Bn%5D%5E2+%5Crightarrow+%5B3n%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;psi: [n]^2 &#92;rightarrow [3n]}&amp;fg=000000' title='{&#92;psi: [n]^2 &#92;rightarrow [3n]}&amp;fg=000000' class='latex' /> defined by <img src='http://s0.wp.com/latex.php?latex=%7B%5Cpsi%28a%2Cb%29+%3A%3D+a%2B2b%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;psi(a,b) := a+2b}&amp;fg=000000' title='{&#92;psi(a,b) := a+2b}&amp;fg=000000' class='latex' />; the key point is that <img src='http://s0.wp.com/latex.php?latex=%7B%5Cpsi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;psi}&amp;fg=000000' title='{&#92;psi}&amp;fg=000000' class='latex' /> maps corners to arithmetic progressions of length three.
</p>
<p>
There are higher <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> analogues of these implications; the general <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> version of the density Hales-Jewett theorem implies a general <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' /> version of the corners theorem known as the multidimensional Szemer&#233;di theorem, which in term implies a general version of Roth&#8217;s theorem known as <a href="http://en.wikipedia.org/wiki/Szemeredi&#37;27s_theorem">Szemer&#233;di&#8217;s theorem</a>.
</p>
</p>
<p align="center"><b> &#8212;  2. The density increment argument  &#8212; </b></p>
<p>
The strategy of the density increment argument, which goes back to Roth&#8217;s proof of Theorem <a href="#roth">3</a> in 1956, is to perform a downward induction on the density <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />. Indeed, the theorem is obvious for high enough values of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />; for instance, if <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta+%26%2362%3B+2%2F3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta &gt; 2/3}&amp;fg=000000' title='{&#92;delta &gt; 2/3}&amp;fg=000000' class='latex' />, then partitioning the cube <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' /> into lines and applying the pigeonhole principle will already give a combinatorial line. So the idea is to deduce the claim for a fixed density <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> from that of a higher density <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />.
</p>
<p>
A key concept here is that of an <img src='http://s0.wp.com/latex.php?latex=%7Bm%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m}&amp;fg=000000' title='{m}&amp;fg=000000' class='latex' />-dimensional <em>combinatorial subspace</em> of <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' /> &#8211; a set of the form <img src='http://s0.wp.com/latex.php?latex=%7B%5Cphi%28%5B3%5D%5Em%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;phi([3]^m)}&amp;fg=000000' title='{&#92;phi([3]^m)}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7B%5Cphi+%5Cin+%5C%7B1%2C2%2C3%2C%2A_1%2C%5Cldots%2C%2A_m%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;phi &#92;in &#92;{1,2,3,*_1,&#92;ldots,*_m&#92;}^n}&amp;fg=000000' title='{&#92;phi &#92;in &#92;{1,2,3,*_1,&#92;ldots,*_m&#92;}^n}&amp;fg=000000' class='latex' /> is a string formed using the base alphabet and <img src='http://s0.wp.com/latex.php?latex=%7Bm%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m}&amp;fg=000000' title='{m}&amp;fg=000000' class='latex' /> wildcards <img src='http://s0.wp.com/latex.php?latex=%7B%2A_1%2C%5Cldots%2C%2A_m%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{*_1,&#92;ldots,*_m}&amp;fg=000000' title='{*_1,&#92;ldots,*_m}&amp;fg=000000' class='latex' /> (with each wildcard appearing at least opnce), and <img src='http://s0.wp.com/latex.php?latex=%7B%5Cphi%28a_1+%5Cldots+a_m%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;phi(a_1 &#92;ldots a_m)}&amp;fg=000000' title='{&#92;phi(a_1 &#92;ldots a_m)}&amp;fg=000000' class='latex' /> is the string formed by substituting <img src='http://s0.wp.com/latex.php?latex=%7Ba_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a_i}&amp;fg=000000' title='{a_i}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7B%2A_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{*_i}&amp;fg=000000' title='{*_i}&amp;fg=000000' class='latex' /> for each <img src='http://s0.wp.com/latex.php?latex=%7Bi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{i}&amp;fg=000000' title='{i}&amp;fg=000000' class='latex' />. (Thus, for instance, a combinatorial line is a combinatorial subspace of dimension <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />.) The identification <img src='http://s0.wp.com/latex.php?latex=%7B%5Cphi%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;phi}&amp;fg=000000' title='{&#92;phi}&amp;fg=000000' class='latex' /> between <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5Em%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^m}&amp;fg=000000' title='{[3]^m}&amp;fg=000000' class='latex' /> and the combinatorial space <img src='http://s0.wp.com/latex.php?latex=%7B%5Cphi%28%5B3%5D%5Em%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;phi([3]^m)}&amp;fg=000000' title='{&#92;phi([3]^m)}&amp;fg=000000' class='latex' /> maps combinatorial lines to combinatorial lines. Thus, to prove Theorem <a href="#dhj">1</a>, it suffices to show
</p>
<blockquote><p><b>Proposition 4 (Lack of lines implies density increment)</b> <a name="lack"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B0+%26%2360%3B+%5Cdelta+%5Cleq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &lt; &#92;delta &#92;leq 1}&amp;fg=000000' title='{0 &lt; &#92;delta &#92;leq 1}&amp;fg=000000' class='latex' />. Then if <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> is a sufficiently large integer, and <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Csubset+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;subset [3]^n}&amp;fg=000000' title='{A &#92;subset [3]^n}&amp;fg=000000' class='latex' /> has density at least <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> and has no combinatorial lines, then there exists an <img src='http://s0.wp.com/latex.php?latex=%7Bm%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m}&amp;fg=000000' title='{m}&amp;fg=000000' class='latex' />-dimensional subspace <img src='http://s0.wp.com/latex.php?latex=%7B%5Cphi%28%5B3%5D%5Em%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;phi([3]^m)}&amp;fg=000000' title='{&#92;phi([3]^m)}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' /> on which <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> has density at least <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta+%2B+c%28%5Cdelta%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta + c(&#92;delta)}&amp;fg=000000' title='{&#92;delta + c(&#92;delta)}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7Bc%28%5Cdelta%29+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c(&#92;delta) &gt; 0}&amp;fg=000000' title='{c(&#92;delta) &gt; 0}&amp;fg=000000' class='latex' /> depends only on <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> (and is bounded away from zero on any compact range of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />), and <img src='http://s0.wp.com/latex.php?latex=%7Bm+%5Cgeq+m_0%28n%2C%5Cdelta%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m &#92;geq m_0(n,&#92;delta)}&amp;fg=000000' title='{m &#92;geq m_0(n,&#92;delta)}&amp;fg=000000' class='latex' /> for some function <img src='http://s0.wp.com/latex.php?latex=%7Bm_0%28n%2C%5Cdelta%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{m_0(n,&#92;delta)}&amp;fg=000000' title='{m_0(n,&#92;delta)}&amp;fg=000000' class='latex' /> that goes to infinity as <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Crightarrow+%5Cinfty%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n &#92;rightarrow &#92;infty}&amp;fg=000000' title='{n &#92;rightarrow &#92;infty}&amp;fg=000000' class='latex' /> for fixed <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
It is easy to see that Proposition <a href="#lack">4</a> implies Theorem <a href="#dhj">1</a> (for instance, one could consider the infimum of all <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta}&amp;fg=000000' title='{&#92;delta}&amp;fg=000000' class='latex' /> for which the theorem holds, and show that having this infimum non-zero would lead to a contradiction).
</p>
<p>
Now we have to figure out how to get that density increment. The original argument of Roth relied on Fourier analysis, which in turn relies on an underlying translation-invariant structure which is not present in the density Hales-Jewett setting. (Arithmetic progressions are translation-invariant, but combinatorial lines are not.) It turns out that one can proceed instead by adapting a (<a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Modification_of_the_Ajtai-Szemer&#37;C3&#37;A9di_argument">modification of</a>) <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Ajtai-Szemer&#37;C3&#37;A9di&#37;27s_proof_of_the_corners_theorem">an argument of Ajtai and Szemer&#233;di</a>, which gave the first proof of Theorem <a href="#corners">2</a>.
</p>
<p>
The (modified) Ajtai-Szemer&#233;di argument uses the density increment method, assuming that <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> has no right-angled triangles and showing that <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> has an increased density on a <em>subgrid</em> &#8211; a product <img src='http://s0.wp.com/latex.php?latex=%7BP+%5Ctimes+Q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P &#92;times Q}&amp;fg=000000' title='{P &#92;times Q}&amp;fg=000000' class='latex' /> of fairly long arithmetic progressions with the same spacing. The argument proceeds in two stages, which we describe slightly informally (in particular, glossing over some technical details regarding quantitative parameters such as <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon}&amp;fg=000000' title='{&#92;varepsilon}&amp;fg=000000' class='latex' />) as follows:
</p>
<p><ul>
<li> Step 1. If <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Csubset+%5Bn%5D%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;subset [n]^2}&amp;fg=000000' title='{A &#92;subset [n]^2}&amp;fg=000000' class='latex' /> is dense but has no right-angled triangles, then <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> has an increased density on a cartesian product <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' /> of dense sets <img src='http://s0.wp.com/latex.php?latex=%7BU%2C+V+%5Csubset+%5Bn%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U, V &#92;subset [n]}&amp;fg=000000' title='{U, V &#92;subset [n]}&amp;fg=000000' class='latex' /> (which are not necessarily arithmetic progressions). </li>
<li> Step 2. Any Cartesian product <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%5D%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n]^2}&amp;fg=000000' title='{[n]^2}&amp;fg=000000' class='latex' /> can be partitioned into reasonably large grids <img src='http://s0.wp.com/latex.php?latex=%7BP+%5Ctimes+Q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P &#92;times Q}&amp;fg=000000' title='{P &#92;times Q}&amp;fg=000000' class='latex' />, plus a remainder term of small density.
</li>
</ul>
<p>
From Step 1, Step 2 and the pigeonhole principle we obtain the desired density increment of <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> on a grid <img src='http://s0.wp.com/latex.php?latex=%7BP+%5Ctimes+Q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P &#92;times Q}&amp;fg=000000' title='{P &#92;times Q}&amp;fg=000000' class='latex' />, and then the density increment argument gives us the corners theorem.
</p>
<p>
Step 1 is actually quite easy. If <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is dense, then it must also be dense on some diagonal <img src='http://s0.wp.com/latex.php?latex=%7BD+%3D+%5C%7B+%28x%2Cy%29%3A+x%2By+%3D+const+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{D = &#92;{ (x,y): x+y = const &#92;}}&amp;fg=000000' title='{D = &#92;{ (x,y): x+y = const &#92;}}&amp;fg=000000' class='latex' />, by the pigeonhole principle. Let <img src='http://s0.wp.com/latex.php?latex=%7BU%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U}&amp;fg=000000' title='{U}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> denote the rows and columns that <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Ccap+D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;cap D}&amp;fg=000000' title='{A &#92;cap D}&amp;fg=000000' class='latex' /> occupies. Every pair of points in <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Ccap+D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;cap D}&amp;fg=000000' title='{A &#92;cap D}&amp;fg=000000' class='latex' /> forms the hypotenuse of some corner, whose third vertex lies in <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' />. Thus, if <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> has no corners, then <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> must avoid all the points formed by <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' /> (except for those of the diagonal <img src='http://s0.wp.com/latex.php?latex=%7BD%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{D}&amp;fg=000000' title='{D}&amp;fg=000000' class='latex' />). Thus <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> has a significant density <em>decrease</em> on the Cartesian product <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' />. Dividing the remainder <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%5D%5E2+%5Cbackslash+%28U+%5Ctimes+V%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n]^2 &#92;backslash (U &#92;times V)}&amp;fg=000000' title='{[n]^2 &#92;backslash (U &#92;times V)}&amp;fg=000000' class='latex' /> into three further Cartesian products <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+%28%5Bn%5D%5Cbackslash+V%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times ([n]&#92;backslash V)}&amp;fg=000000' title='{U &#92;times ([n]&#92;backslash V)}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%28%5Bn%5D+%5Cbackslash+U%29+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{([n] &#92;backslash U) &#92;times V}&amp;fg=000000' title='{([n] &#92;backslash U) &#92;times V}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%28%5Bn%5D+%5Cbackslash+U%29+%5Ctimes+%28%5Bn%5D+%5Cbackslash+V%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{([n] &#92;backslash U) &#92;times ([n] &#92;backslash V)}&amp;fg=000000' title='{([n] &#92;backslash U) &#92;times ([n] &#92;backslash V)}&amp;fg=000000' class='latex' /> and using the pigeonhole principle we obtain the claim (after redefining <img src='http://s0.wp.com/latex.php?latex=%7BU%2C+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U, V}&amp;fg=000000' title='{U, V}&amp;fg=000000' class='latex' /> appropriately).
</p>
<p>
Step 2 can be obtained by iterating a one-dimensional version:
</p>
<p><ul>
<li> Step 2a. Any set <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Csubset+%5Bn%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;subset [n]}&amp;fg=000000' title='{U &#92;subset [n]}&amp;fg=000000' class='latex' /> can be partitioned into reasonably long arithmetic progressions <img src='http://s0.wp.com/latex.php?latex=%7BP%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P}&amp;fg=000000' title='{P}&amp;fg=000000' class='latex' />, plus a remainder term of small density.
</li>
</ul>
<p>
Indeed, from Step 2a, one can partition <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+%5Bn%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times [n]}&amp;fg=000000' title='{U &#92;times [n]}&amp;fg=000000' class='latex' /> into products <img src='http://s0.wp.com/latex.php?latex=%7BP+%5Ctimes+%5Bn%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P &#92;times [n]}&amp;fg=000000' title='{P &#92;times [n]}&amp;fg=000000' class='latex' /> (plus a small remainder), which can be easily repartitioned into grids <img src='http://s0.wp.com/latex.php?latex=%7BP+%5Ctimes+Q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P &#92;times Q}&amp;fg=000000' title='{P &#92;times Q}&amp;fg=000000' class='latex' /> (plus small remainder). This partitions <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' /> into sets <img src='http://s0.wp.com/latex.php?latex=%7BP+%5Ctimes+%28V+%5Ccap+Q%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P &#92;times (V &#92;cap Q)}&amp;fg=000000' title='{P &#92;times (V &#92;cap Q)}&amp;fg=000000' class='latex' /> (plus small remainder). Applying Step 2a again, each <img src='http://s0.wp.com/latex.php?latex=%7BV+%5Ccap+Q%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V &#92;cap Q}&amp;fg=000000' title='{V &#92;cap Q}&amp;fg=000000' class='latex' /> can be partitioned further into progressions <img src='http://s0.wp.com/latex.php?latex=%7BQ%27%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{Q&#039;}&amp;fg=000000' title='{Q&#039;}&amp;fg=000000' class='latex' /> (plus small remainder), which allows us to partition each <img src='http://s0.wp.com/latex.php?latex=%7BP+%5Ctimes+%28V+%5Ccap+Q%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P &#92;times (V &#92;cap Q)}&amp;fg=000000' title='{P &#92;times (V &#92;cap Q)}&amp;fg=000000' class='latex' /> into grids <img src='http://s0.wp.com/latex.php?latex=%7BP%27+%5Ctimes+Q%27%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P&#039; &#92;times Q&#039;}&amp;fg=000000' title='{P&#039; &#92;times Q&#039;}&amp;fg=000000' class='latex' /> (plus small remainder).
</p>
<p>
So all one has left to do is establish Step 2a. But this can be done by the greedy algorithm: locate one long arithmetic progression <img src='http://s0.wp.com/latex.php?latex=%7BP%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P}&amp;fg=000000' title='{P}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BU%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U}&amp;fg=000000' title='{U}&amp;fg=000000' class='latex' /> and remove it from <img src='http://s0.wp.com/latex.php?latex=%7BU%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U}&amp;fg=000000' title='{U}&amp;fg=000000' class='latex' />, then locate another to remove, and so forth until no further long progressions remain in the set. But <a href="http://en.wikipedia.org/wiki/Szemeredi&#37;27s_theorem">Szemer&#233;di&#8217;s theorem</a> then tells us the remaining set has low density, and one is done!
</p>
<p>
This argument has the apparent disadvantage of requiring a deep theorem (Szemer&#233;di&#8217;s theorem) in order to complete the proof. However, interestingly enough, when one adapts the argument to the density Hales-Jewett theorem, one gets to replace Szemer&#233;di&#8217;s theorem by a more elementary result &#8211; one which in fact follows from the (easy) <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=2}&amp;fg=000000' title='{k=2}&amp;fg=000000' class='latex' /> version of the density Hales-Jewett theorem, i.e. <a href="http://en.wikipedia.org/wiki/Sperner&#37;27s_theorem">Sperner&#8217;s theorem</a>.
</p>
<p>
We first need to understand the analogue of the Cartesian products <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' />. Note that <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' /> is the intersection of a &#8220;vertically insensitive set&#8221; <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+%5Bn%5D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times [n]}&amp;fg=000000' title='{U &#92;times [n]}&amp;fg=000000' class='latex' /> and a &#8220;horizontally insensitive set&#8221; <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%5D+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n] &#92;times V}&amp;fg=000000' title='{[n] &#92;times V}&amp;fg=000000' class='latex' />. By &#8220;vertically insensitive&#8221; we mean that membership of a point <img src='http://s0.wp.com/latex.php?latex=%7B%28x%2Cy%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(x,y)}&amp;fg=000000' title='{(x,y)}&amp;fg=000000' class='latex' /> in that set is unaffected if one moves that point in a vertical direction, and similarly for &#8220;horizontally insensitive&#8221;. In a similar fashion, define a &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive set&#8221; to be a subset of <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' />, membership in which is unaffected if one flips a coordinate from a <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> to a <img src='http://s0.wp.com/latex.php?latex=%7B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2}&amp;fg=000000' title='{2}&amp;fg=000000' class='latex' /> or vice versa (e.g. if <img src='http://s0.wp.com/latex.php?latex=%7B1223%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1223}&amp;fg=000000' title='{1223}&amp;fg=000000' class='latex' /> lies in the set, then so must <img src='http://s0.wp.com/latex.php?latex=%7B1213%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1213}&amp;fg=000000' title='{1213}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B1113%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1113}&amp;fg=000000' title='{1113}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B2113%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2113}&amp;fg=000000' title='{2113}&amp;fg=000000' class='latex' />, etc.). Similarly define the notion of a &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B13%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{13}&amp;fg=000000' title='{13}&amp;fg=000000' class='latex' />-insensitive set&#8221;. We then define a &#8220;complexity <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> set&#8221; to be the intersection <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D+%5Ccap+E_%7B13%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12} &#92;cap E_{13}}&amp;fg=000000' title='{E_{12} &#92;cap E_{13}}&amp;fg=000000' class='latex' /> of a <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive set <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}}&amp;fg=000000' title='{E_{12}}&amp;fg=000000' class='latex' /> and a <img src='http://s0.wp.com/latex.php?latex=%7B13%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{13}&amp;fg=000000' title='{13}&amp;fg=000000' class='latex' />-insensitive set <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}}&amp;fg=000000' title='{E_{13}}&amp;fg=000000' class='latex' />; these are analogous to the Cartesian products <img src='http://s0.wp.com/latex.php?latex=%7BU+%5Ctimes+V%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{U &#92;times V}&amp;fg=000000' title='{U &#92;times V}&amp;fg=000000' class='latex' />.
</p>
<p>
(For technical reasons, one actually has to deal with <em>local</em> versions of insensitive sets and complexity <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> sets, in which one is only allowed to flip a moderately small number of the <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> coordinates rather than all of them. But to simplify the discussion let me ignore this (important) detail, which is also a major issue to address in the other two proofs of this theorem.)
</p>
<p>
The analogues of Steps 1, 2 for the density Hales-Jewett theorem are then
</p>
<p><ul>
<li> Step 1. If <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Csubset+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;subset [3]^n}&amp;fg=000000' title='{A &#92;subset [3]^n}&amp;fg=000000' class='latex' /> is dense but has no combinatorial lines, then <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> has an increased density on a (local) complexity <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> set <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D+%5Ccap+E_%7B13%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12} &#92;cap E_{13}}&amp;fg=000000' title='{E_{12} &#92;cap E_{13}}&amp;fg=000000' class='latex' />. </li>
<li> Step 2. Any (local) complexity <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> set <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D+%5Ccap+E_%7B13%7D+%5Csubset+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12} &#92;cap E_{13} &#92;subset [3]^n}&amp;fg=000000' title='{E_{12} &#92;cap E_{13} &#92;subset [3]^n}&amp;fg=000000' class='latex' /> can be partitioned into moderately large combinatorial subspaces (plus a small remainder).
</li>
</ul>
<p>
We can sketch how Step 1 works as follows. Given any <img src='http://s0.wp.com/latex.php?latex=%7Bx+%5Cin+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x &#92;in [3]^n}&amp;fg=000000' title='{x &#92;in [3]^n}&amp;fg=000000' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=%7B%5Cpi_%7B1+%5Crightarrow+2%7D%28x%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;pi_{1 &#92;rightarrow 2}(x)}&amp;fg=000000' title='{&#92;pi_{1 &#92;rightarrow 2}(x)}&amp;fg=000000' class='latex' /> denote the string formed by replacing all <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />s with <img src='http://s0.wp.com/latex.php?latex=%7B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2}&amp;fg=000000' title='{2}&amp;fg=000000' class='latex' />s, e.g. <img src='http://s0.wp.com/latex.php?latex=%7B%5Cpi_%7B1+%5Crightarrow+2%7D%281321%29+%3D+2322%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;pi_{1 &#92;rightarrow 2}(1321) = 2322}&amp;fg=000000' title='{&#92;pi_{1 &#92;rightarrow 2}(1321) = 2322}&amp;fg=000000' class='latex' />. Similarly define <img src='http://s0.wp.com/latex.php?latex=%7B%5Cpi_%7B1+%5Crightarrow+3%7D%28x%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;pi_{1 &#92;rightarrow 3}(x)}&amp;fg=000000' title='{&#92;pi_{1 &#92;rightarrow 3}(x)}&amp;fg=000000' class='latex' />. Observe that <img src='http://s0.wp.com/latex.php?latex=%7Bx%2C+%5Cpi_%7B1+%5Crightarrow+2%7D%28x%29%2C+%5Cpi_%7B1+%5Crightarrow+3%7D%28x%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x, &#92;pi_{1 &#92;rightarrow 2}(x), &#92;pi_{1 &#92;rightarrow 3}(x)}&amp;fg=000000' title='{x, &#92;pi_{1 &#92;rightarrow 2}(x), &#92;pi_{1 &#92;rightarrow 3}(x)}&amp;fg=000000' class='latex' /> forms a combinatorial line (except in the rare case when <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' /> doesn&#8217;t contain any <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />s). Thus if we let <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D+%3A%3D+%5C%7B+x%3A+%5Cpi_%7B1+%5Crightarrow+2%7D%28x%29+%5Cin+A%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12} := &#92;{ x: &#92;pi_{1 &#92;rightarrow 2}(x) &#92;in A&#92;}}&amp;fg=000000' title='{E_{12} := &#92;{ x: &#92;pi_{1 &#92;rightarrow 2}(x) &#92;in A&#92;}}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D+%3A%3D+%5C%7B+x%3A+%5Cpi_%7B1+%5Crightarrow+3%7D%28x%29+%5Cin+A%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13} := &#92;{ x: &#92;pi_{1 &#92;rightarrow 3}(x) &#92;in A&#92;}}&amp;fg=000000' title='{E_{13} := &#92;{ x: &#92;pi_{1 &#92;rightarrow 3}(x) &#92;in A&#92;}}&amp;fg=000000' class='latex' />, we see that <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> must avoid essentially all of <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D+%5Ccap+E_%7B13%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12} &#92;cap E_{13}}&amp;fg=000000' title='{E_{12} &#92;cap E_{13}}&amp;fg=000000' class='latex' />. On the other hand, observe that <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}}&amp;fg=000000' title='{E_{12}}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}}&amp;fg=000000' title='{E_{13}}&amp;fg=000000' class='latex' /> are <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive and <img src='http://s0.wp.com/latex.php?latex=%7B13%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{13}&amp;fg=000000' title='{13}&amp;fg=000000' class='latex' />-insensitive sets respectively. Taking complements and using the same sort of pigeonhole argument as before, we obtain the claim. (Actually, this argument doesn&#8217;t quite work because <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}}&amp;fg=000000' title='{E_{12}}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}}&amp;fg=000000' title='{E_{13}}&amp;fg=000000' class='latex' /> could be very sparse; this problem can be fixed, but requires one to use local complexity <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> sets rather than global ones, and also to introduce the concept of &#8220;equal-slices measure&#8221;; I will not discuss these issues here.)
</p>
<p>
Step 2 can be reduced, much as before, to the following analogue of Step 2a:
</p>
<p><ul>
<li> Step 2a. Any <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive set <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D+%5Csubset+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12} &#92;subset [3]^n}&amp;fg=000000' title='{E_{12} &#92;subset [3]^n}&amp;fg=000000' class='latex' /> can be partitioned into moderately large combinatorial subspaces (plus a small remainder).
</li>
</ul>
<p>
Identifying the letters <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2}&amp;fg=000000' title='{2}&amp;fg=000000' class='latex' /> together, one can quotient <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' /> down to <img src='http://s0.wp.com/latex.php?latex=%7B%5B2%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[2]^n}&amp;fg=000000' title='{[2]^n}&amp;fg=000000' class='latex' />; the preimages of this projection are precisely the <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive sets. Because of this, Step 2a is basically equivalent (modulo some technicalities about measure) to
</p>
<p><ul>
<li> Step 2a&#8217;. Any <img src='http://s0.wp.com/latex.php?latex=%7BE+%5Csubset+%5B2%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E &#92;subset [2]^n}&amp;fg=000000' title='{E &#92;subset [2]^n}&amp;fg=000000' class='latex' /> can be partitioned into moderately large combinatorial subspaces (plus a small remainder).
</li>
</ul>
<p>
By the greedy algorithm, we will be able to accomplish this step if we can show that every dense subset of <img src='http://s0.wp.com/latex.php?latex=%7B%5B2%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[2]^n}&amp;fg=000000' title='{[2]^n}&amp;fg=000000' class='latex' /> contains moderately large subspaces. But this turns out to be possible by carefully iterating Sperner&#8217;s theorem (which shows that every dense subset of <img src='http://s0.wp.com/latex.php?latex=%7B%5B2%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[2]^n}&amp;fg=000000' title='{[2]^n}&amp;fg=000000' class='latex' /> contains combinatorial lines).
</p>
<p>
This proof of Theorem <a href="#dhj">1</a> seems to extend without major difficulty to the case of higher <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' />, though details are still being worked out.
</p>
</p>
<p align="center"><b> &#8212;  3. The triangle removal argument  &#8212; </b></p>
<p>
The <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Triangle_removal_lemma">triangle removal lemma</a> of Ruzsa and Szemer&#233;di is a graph-theoretic result which implies the corners theorem (and hence Roth&#8217;s theorem). It asserts the following:
</p>
<blockquote><p><b>Lemma 5 (Triangle removal lemma)</b>  For every <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon &gt; 0}&amp;fg=000000' title='{&#92;varepsilon &gt; 0}&amp;fg=000000' class='latex' /> there exists <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta &gt; 0}&amp;fg=000000' title='{&#92;delta &gt; 0}&amp;fg=000000' class='latex' /> such that if a graph <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' /> vertices has fewer than <img src='http://s0.wp.com/latex.php?latex=%7B%5Cdelta+n%5E3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;delta n^3}&amp;fg=000000' title='{&#92;delta n^3}&amp;fg=000000' class='latex' /> triangles, then the triangles can be deleted entirely by removing at most <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon+n%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon n^2}&amp;fg=000000' title='{&#92;varepsilon n^2}&amp;fg=000000' class='latex' /> edges. </p></blockquote>
</p>
<p>
Let&#8217;s see how the triangle removal lemma implies the corners theorem. A corner is, of course, already a triangle in the geometric sense, but we need to convert it to a triangle in the graph-theoretic sense, as follows. Let <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> be a subset of <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%5D%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n]^2}&amp;fg=000000' title='{[n]^2}&amp;fg=000000' class='latex' /> with no corners; the aim is to show that <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> has small density. Let <img src='http://s0.wp.com/latex.php?latex=%7BV_h%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_h}&amp;fg=000000' title='{V_h}&amp;fg=000000' class='latex' /> be the set of all horizontal lines in <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%5D%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n]^2}&amp;fg=000000' title='{[n]^2}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BV_v%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_v}&amp;fg=000000' title='{V_v}&amp;fg=000000' class='latex' /> the set of vertical lines, and <img src='http://s0.wp.com/latex.php?latex=%7BV_d%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_d}&amp;fg=000000' title='{V_d}&amp;fg=000000' class='latex' /> the set of diagonal lines (thus all three sets have size about <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />). We create a tripartite graph <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> on the vertex sets <img src='http://s0.wp.com/latex.php?latex=%7BV_h+%5Ccup+V_v+%5Ccup+V_d%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_h &#92;cup V_v &#92;cup V_d}&amp;fg=000000' title='{V_h &#92;cup V_v &#92;cup V_d}&amp;fg=000000' class='latex' /> by joining a horizontal line <img src='http://s0.wp.com/latex.php?latex=%7Bh+%5Cin+V_h%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{h &#92;in V_h}&amp;fg=000000' title='{h &#92;in V_h}&amp;fg=000000' class='latex' /> to a vertical line <img src='http://s0.wp.com/latex.php?latex=%7Bv+%5Cin+V_v%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v &#92;in V_v}&amp;fg=000000' title='{v &#92;in V_v}&amp;fg=000000' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=%7Bh%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{h}&amp;fg=000000' title='{h}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bv%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{v}&amp;fg=000000' title='{v}&amp;fg=000000' class='latex' /> intersect at a point in <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' />, and similarly connecting <img src='http://s0.wp.com/latex.php?latex=%7BV_h%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_h}&amp;fg=000000' title='{V_h}&amp;fg=000000' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%7BV_v%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_v}&amp;fg=000000' title='{V_v}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7BV_d%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_d}&amp;fg=000000' title='{V_d}&amp;fg=000000' class='latex' />. Observe that a triangle in <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> corresponds either to a corner in <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' />, or to a &#8220;degenerate&#8221; corner in which the horizontal, vertical, and diagonal line are all concurrent. In particular, there are very few triangle in <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' />, which can then be deleted by removing a small number of edges from <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> by the triangle removal lemma. But each edge removed can delete at most one degenerate corner, and the number of degenerate corners is <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BA%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;A&#124;}&amp;fg=000000' title='{&#124;A&#124;}&amp;fg=000000' class='latex' />, and so <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3BA%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;A&#124;}&amp;fg=000000' title='{&#124;A&#124;}&amp;fg=000000' class='latex' /> is small as required.
</p>
<p>
All known proofs of the triangle removal lemma proceed by some version of the following three steps:
</p>
<p><ul>
<li> &#8220;Regularity lemma step&#8221;: Applying tools such as the <a href="http://en.wikipedia.org/wiki/Szemer&#37;C3&#37;A9di_regularity_lemma">Szemer&#233;di regularity lemma</a>, one can partition the graph <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> into components <img src='http://s0.wp.com/latex.php?latex=%7BG_%7Bij%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_{ij}}&amp;fg=000000' title='{G_{ij}}&amp;fg=000000' class='latex' /> between cells <img src='http://s0.wp.com/latex.php?latex=%7BV_i%2C+V_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i, V_j}&amp;fg=000000' title='{V_i, V_j}&amp;fg=000000' class='latex' /> of vertices, such that most of the <img src='http://s0.wp.com/latex.php?latex=%7BG_%7Bij%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_{ij}}&amp;fg=000000' title='{G_{ij}}&amp;fg=000000' class='latex' /> are &#8220;pseudorandom&#8221;. One way to define what pseudorandom means is to view each graph component <img src='http://s0.wp.com/latex.php?latex=%7BG_%7Bij%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_{ij}}&amp;fg=000000' title='{G_{ij}}&amp;fg=000000' class='latex' /> as a subset of the Cartesian product <img src='http://s0.wp.com/latex.php?latex=%7BV_i+%5Ctimes+V_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i &#92;times V_j}&amp;fg=000000' title='{V_i &#92;times V_j}&amp;fg=000000' class='latex' />, in which case <img src='http://s0.wp.com/latex.php?latex=%7BG_%7Bij%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_{ij}}&amp;fg=000000' title='{G_{ij}}&amp;fg=000000' class='latex' /> is pseudorandom if it does not have a significant density increment on any smaller Cartesian product <img src='http://s0.wp.com/latex.php?latex=%7BV%27_i+%5Ctimes+V%27_j%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V&#039;_i &#92;times V&#039;_j}&amp;fg=000000' title='{V&#039;_i &#92;times V&#039;_j}&amp;fg=000000' class='latex' /> of non-trivial size. </li>
<li> &#8221;Counting lemma step&#8221;: By exploiting the pseudorandomness property, one shows that if <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> has a triple <img src='http://s0.wp.com/latex.php?latex=%7BG_%7Bij%7D%2C+G_%7Bjk%7D%2C+G_%7Bki%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G_{ij}, G_{jk}, G_{ki}}&amp;fg=000000' title='{G_{ij}, G_{jk}, G_{ki}}&amp;fg=000000' class='latex' /> of dense pseudorandom graphs between cells <img src='http://s0.wp.com/latex.php?latex=%7BV_i%2C+V_j%2C+V_k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i, V_j, V_k}&amp;fg=000000' title='{V_i, V_j, V_k}&amp;fg=000000' class='latex' /> of non-trivial size, then this triple must generate a large number of triangles; hence, if <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> has very few triangles, then one cannot find such a triple of dense pseudorandom graphs. </li>
<li> &#8221;Cleaning step&#8221;: If one then removes all components of <img src='http://s0.wp.com/latex.php?latex=%7BG%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{G}&amp;fg=000000' title='{G}&amp;fg=000000' class='latex' /> which are too sparse or insufficiently pseudorandom, one can thus eliminate all triangles.
</li>
</ul>
<p>
Pulling this argument back to the corners theorem, we see that cells such as <img src='http://s0.wp.com/latex.php?latex=%7BV_i%2C+V_j%2C+V_k%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i, V_j, V_k}&amp;fg=000000' title='{V_i, V_j, V_k}&amp;fg=000000' class='latex' /> will correspond either to horizontally insensitive sets, vertically insensitive sets, or diagonally insensitive sets. Thus this proof of the corners theorem proceeds by partitioning <img src='http://s0.wp.com/latex.php?latex=%7B%5Bn%5D%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[n]^2}&amp;fg=000000' title='{[n]^2}&amp;fg=000000' class='latex' /> in three different ways into insensitive sets in such a way that <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is pseudorandom with respect to many of the cells created by any two of these partitions, counting the corners generated by any triple of large cells in which A is pseudorandom and dense, and cleaning out all the other cells.
</p>
<p>
It turns out that a variant of this argument can give Theorem <a href="#dhj">1</a>; this was in fact the original approach studied by the polymath1 project, though it was only after a detour through ergodic theory (as well as the development of the density-increment argument discussed above) that the triangle-removal approach could be properly executed. In particular, an ergodic argument based on the infinitary analogue of the triangle removal lemma (and its hypergraph generalisations) was developed <a class="snap_noshots" href="http://arxiv.org/abs/0903.1633">by Austin</a>, which then inspired the combinatorial version sketched here.
</p>
<p>
The analogue of the vertex cells <img src='http://s0.wp.com/latex.php?latex=%7BV_i%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V_i}&amp;fg=000000' title='{V_i}&amp;fg=000000' class='latex' /> are given by certain <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive sets <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a}&amp;fg=000000' title='{E_{12}^a}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B13%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{13}&amp;fg=000000' title='{13}&amp;fg=000000' class='latex' />-insensitive sets <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}^b}&amp;fg=000000' title='{E_{13}^b}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-insensitive sets <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{23}^c}&amp;fg=000000' title='{E_{23}^c}&amp;fg=000000' class='latex' />. Roughly speaking, a set <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Csubset+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;subset [3]^n}&amp;fg=000000' title='{A &#92;subset [3]^n}&amp;fg=000000' class='latex' /> would be said to be pseudorandom with respect to a cell <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' title='{E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Ccap+E_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;cap E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' title='{A &#92;cap E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' class='latex' /> has no further density increment on any smaller cell <img src='http://s0.wp.com/latex.php?latex=%7BE%27_%7B12%7D+%5Ccap+E%27_%7B13%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E&#039;_{12} &#92;cap E&#039;_{13}}&amp;fg=000000' title='{E&#039;_{12} &#92;cap E&#039;_{13}}&amp;fg=000000' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%7BE%27_%7B12%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E&#039;_{12}}&amp;fg=000000' title='{E&#039;_{12}}&amp;fg=000000' class='latex' /> a <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive subset of <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a}&amp;fg=000000' title='{E_{12}^a}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7BE%27_%7B13%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E&#039;_{13}}&amp;fg=000000' title='{E&#039;_{13}}&amp;fg=000000' class='latex' /> a <img src='http://s0.wp.com/latex.php?latex=%7B13%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{13}&amp;fg=000000' title='{13}&amp;fg=000000' class='latex' />-insensitive subset of <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}^b}&amp;fg=000000' title='{E_{13}^b}&amp;fg=000000' class='latex' />. (This is an oversimplification, glossing over an important refinement of the concept of pseudorandomness involving the discrepancy between global densities in <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' /> and local densities in subspaces of <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' />.) There is a similar notion of <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> being pseudorandom with respect to a cell <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%5Eb+%5Ccap+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' title='{E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B23%7D%5Ec+%5Ccap+E_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' title='{E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' class='latex' />.
</p>
<p>
We briefly describe the &#8220;regularity lemma&#8221; step. By modifying the proof of the regularity lemma, one can obtain three partitions </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5B3%5D%5En+%3D+E_%7B12%7D%5E1+%5Ccup+%5Cldots+%5Ccup+E_%7B12%7D%5E%7BM_%7B12%7D%7D+%3D+E_%7B13%7D%5E1+%5Ccup+%5Cldots+%5Ccup+E_%7B13%7D%5E%7BM_%7B13%7D%7D+%3D+E_%7B23%7D%5E1+%5Ccup+%5Cldots+%5Ccup+E_%7B23%7D%5E%7BM_%7B23%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  [3]^n = E_{12}^1 &#92;cup &#92;ldots &#92;cup E_{12}^{M_{12}} = E_{13}^1 &#92;cup &#92;ldots &#92;cup E_{13}^{M_{13}} = E_{23}^1 &#92;cup &#92;ldots &#92;cup E_{23}^{M_{23}}&amp;fg=000000' title='&#92;displaystyle  [3]^n = E_{12}^1 &#92;cup &#92;ldots &#92;cup E_{12}^{M_{12}} = E_{13}^1 &#92;cup &#92;ldots &#92;cup E_{13}^{M_{13}} = E_{23}^1 &#92;cup &#92;ldots &#92;cup E_{23}^{M_{23}}&amp;fg=000000' class='latex' /></p>
<p> into <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive, <img src='http://s0.wp.com/latex.php?latex=%7B13%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{13}&amp;fg=000000' title='{13}&amp;fg=000000' class='latex' />-insensitive, and <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-insensitive components respectively, where <img src='http://s0.wp.com/latex.php?latex=%7BM_%7B12%7D%2C+M_%7B13%7D%2C+M_%7B23%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M_{12}, M_{13}, M_{23}}&amp;fg=000000' title='{M_{12}, M_{13}, M_{23}}&amp;fg=000000' class='latex' /> are not too large, and <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is pseudorandom with respect to most cells <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' title='{E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%5Eb+%5Ccap+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' title='{E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B23%7D%5Ec+%5Ccap+E_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' title='{E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' class='latex' />.</p>
<p>
In order for the counting step to work, one also needs an additional &#8220;stationarity&#8221; reduction, which is difficult to state precisely, but roughly speaking asserts that the &#8220;local&#8221; statistics of sets such as <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a}&amp;fg=000000' title='{E_{12}^a}&amp;fg=000000' class='latex' /> on medium-dimensional subspaces are close to the corresponding &#8220;global&#8221; statistics of such sets; this can be achieved by an additional pigeonholing argument. We will gloss over this issue, pretending that there is no distinction between local statistics and global statistics. (Thus, for instance, if <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a}&amp;fg=000000' title='{E_{12}^a}&amp;fg=000000' class='latex' /> has large global density in <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' />, we shall assume that <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a}&amp;fg=000000' title='{E_{12}^a}&amp;fg=000000' class='latex' /> also has large density on most medium-sized subspaces of <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' />.)
</p>
<p>
Now for the &#8220;counting lemma&#8221; step. Suppose we can find <img src='http://s0.wp.com/latex.php?latex=%7Ba%2Cb%2Cc%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a,b,c}&amp;fg=000000' title='{a,b,c}&amp;fg=000000' class='latex' /> such that the cells <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea%2C+E_%7B13%7D%5Eb%2C+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a, E_{13}^b, E_{23}^c}&amp;fg=000000' title='{E_{12}^a, E_{13}^b, E_{23}^c}&amp;fg=000000' class='latex' /> are large, and that <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> intersects <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' title='{E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%5Eb+%5Ccap+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' title='{E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B23%7D%5Ec+%5Ccap+E_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' title='{E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' class='latex' /> in a dense pseudorandom manner. We claim that this will force <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> to have a large number of combinatorial lines <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell}&amp;fg=000000' title='{&#92;ell}&amp;fg=000000' class='latex' />, with <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%281%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell(1)}&amp;fg=000000' title='{&#92;ell(1)}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Ccap+E_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;cap E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' title='{A &#92;cap E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%282%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell(2)}&amp;fg=000000' title='{&#92;ell(2)}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Ccap+E_%7B23%7D%5Ec+%5Ccap+E_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;cap E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' title='{A &#92;cap E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%283%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell(3)}&amp;fg=000000' title='{&#92;ell(3)}&amp;fg=000000' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Ccap+E_%7B13%7D%5Eb+%5Ccap+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;cap E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' title='{A &#92;cap E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' class='latex' />. Because of the dense pseudorandom nature of <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> in these cells, it turns out that it will suffice to show that there are a lot of lines <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%281%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell(1)}&amp;fg=000000' title='{&#92;ell(1)}&amp;fg=000000' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%281%29+%5Cin+E_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell(1) &#92;in E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' title='{&#92;ell(1) &#92;in E_{12}^a &#92;cap E_{13}^b}&amp;fg=000000' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%282%29+%5Cin+E_%7B23%7D%5Ec+%5Ccap+E_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell(2) &#92;in E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' title='{&#92;ell(2) &#92;in E_{23}^c &#92;cap E_{12}^a}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%283%29+%5Cin+E_%7B13%7D%5Eb+%5Ccap+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell(3) &#92;in E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' title='{&#92;ell(3) &#92;in E_{13}^b &#92;cap E_{23}^c}&amp;fg=000000' class='latex' />.
</p>
<p>
One way to generate a line <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell}&amp;fg=000000' title='{&#92;ell}&amp;fg=000000' class='latex' /> is by taking the triple <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+x%2C+%5Cpi_%7B1+%5Crightarrow+2%7D%28x%29%2C+%5Cpi_%7B1+%5Crightarrow+3%7D%28x%29+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ x, &#92;pi_{1 &#92;rightarrow 2}(x), &#92;pi_{1 &#92;rightarrow 3}(x) &#92;}}&amp;fg=000000' title='{&#92;{ x, &#92;pi_{1 &#92;rightarrow 2}(x), &#92;pi_{1 &#92;rightarrow 3}(x) &#92;}}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7Bx+%5Cin+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x &#92;in [3]^n}&amp;fg=000000' title='{x &#92;in [3]^n}&amp;fg=000000' class='latex' /> is a generic point. (Actually, as we will see below, we would have to to a subspace of <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^n}&amp;fg=000000' title='{[3]^n}&amp;fg=000000' class='latex' /> before using this recipe to generate lines.) Then we need to find many <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' /> obeying the constraints </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++x+%5Cin+E_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb%3B+%5Cquad+%5Cpi_%7B1+%5Crightarrow+2%7D%28x%29+%5Cin+E_%7B23%7D%5Ec+%5Ccap+E_%7B12%7D%5Ea%3B+%5Cquad+%5Cpi_%7B1+%5Crightarrow+3%7D%28x%29+%5Cin+E_%7B13%7D%5Eb+%5Ccap+E_%7B23%7D%5Ec.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  x &#92;in E_{12}^a &#92;cap E_{13}^b; &#92;quad &#92;pi_{1 &#92;rightarrow 2}(x) &#92;in E_{23}^c &#92;cap E_{12}^a; &#92;quad &#92;pi_{1 &#92;rightarrow 3}(x) &#92;in E_{13}^b &#92;cap E_{23}^c.&amp;fg=000000' title='&#92;displaystyle  x &#92;in E_{12}^a &#92;cap E_{13}^b; &#92;quad &#92;pi_{1 &#92;rightarrow 2}(x) &#92;in E_{23}^c &#92;cap E_{12}^a; &#92;quad &#92;pi_{1 &#92;rightarrow 3}(x) &#92;in E_{13}^b &#92;cap E_{23}^c.&amp;fg=000000' class='latex' /></p>
<p> Because of the various insensitivity properties, many of these conditions are redundant, and we can simplify to
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++x+%5Cin+E_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb%3B+%5Cquad+%5Cpi_%7B1+%5Crightarrow+2%7D%28x%29+%5Cin+E_%7B23%7D%5Ec.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  x &#92;in E_{12}^a &#92;cap E_{13}^b; &#92;quad &#92;pi_{1 &#92;rightarrow 2}(x) &#92;in E_{23}^c.&amp;fg=000000' title='&#92;displaystyle  x &#92;in E_{12}^a &#92;cap E_{13}^b; &#92;quad &#92;pi_{1 &#92;rightarrow 2}(x) &#92;in E_{23}^c.&amp;fg=000000' class='latex' /></p>
<p> Now note that the property &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%5Cpi_%7B1+%5Crightarrow+2%7D%28x%29+%5Cin+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;pi_{1 &#92;rightarrow 2}(x) &#92;in E_{23}^c}&amp;fg=000000' title='{&#92;pi_{1 &#92;rightarrow 2}(x) &#92;in E_{23}^c}&amp;fg=000000' class='latex' />&#8221; is 123-insensitive; it is simultaneously 12-insensitive, 23-insensitive, and 13-insensitive. As <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{23}^c}&amp;fg=000000' title='{E_{23}^c}&amp;fg=000000' class='latex' /> is assumed to be large, there will be large combinatorial subspaces on which (a suitably localised version of) this property &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%5Cpi_%7B1+%5Crightarrow+2%7D%28x%29+%5Cin+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;pi_{1 &#92;rightarrow 2}(x) &#92;in E_{23}^c}&amp;fg=000000' title='{&#92;pi_{1 &#92;rightarrow 2}(x) &#92;in E_{23}^c}&amp;fg=000000' class='latex' />&#8221; will be always true. Localising to this space (taking advantage of the stationarity properties alluded to earlier), we are now looking for solutions to
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++x+%5Cin+E_%7B12%7D%5Ea+%5Ccap+E_%7B13%7D%5Eb.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  x &#92;in E_{12}^a &#92;cap E_{13}^b.&amp;fg=000000' title='&#92;displaystyle  x &#92;in E_{12}^a &#92;cap E_{13}^b.&amp;fg=000000' class='latex' /></p>
<p> We&#8217;ll pick <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' /> to be of the form <img src='http://s0.wp.com/latex.php?latex=%7B%5Cpi_%7B2+%5Crightarrow+1%7D%28y%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;pi_{2 &#92;rightarrow 1}(y)}&amp;fg=000000' title='{&#92;pi_{2 &#92;rightarrow 1}(y)}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7By%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{y}&amp;fg=000000' title='{y}&amp;fg=000000' class='latex' />. We can then rewrite the constraints on <img src='http://s0.wp.com/latex.php?latex=%7By%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{y}&amp;fg=000000' title='{y}&amp;fg=000000' class='latex' /> as
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++y+%5Cin+E_%7B12%7D%5Ea%3B+%5Cquad+%5Cpi_%7B2+%5Crightarrow+1%7D%28y%29+%5Cin+E_%7B13%7D%5Eb.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  y &#92;in E_{12}^a; &#92;quad &#92;pi_{2 &#92;rightarrow 1}(y) &#92;in E_{13}^b.&amp;fg=000000' title='&#92;displaystyle  y &#92;in E_{12}^a; &#92;quad &#92;pi_{2 &#92;rightarrow 1}(y) &#92;in E_{13}^b.&amp;fg=000000' class='latex' /></p>
<p> The property &#8220;<img src='http://s0.wp.com/latex.php?latex=%7B%5Cpi_%7B2+%5Crightarrow+1%7D%28y%29+%5Cin+E_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;pi_{2 &#92;rightarrow 1}(y) &#92;in E_{13}^b}&amp;fg=000000' title='{&#92;pi_{2 &#92;rightarrow 1}(y) &#92;in E_{13}^b}&amp;fg=000000' class='latex' />&#8221; is 123-invariant, and <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B13%7D%5Eb%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{13}^b}&amp;fg=000000' title='{E_{13}^b}&amp;fg=000000' class='latex' /> is large, so by arguing as before we can pass to a large subspace where this property is always true. The largeness of <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a}&amp;fg=000000' title='{E_{12}^a}&amp;fg=000000' class='latex' /> then gives us a large number of solutions.</p>
<p>
Taking contrapositives, we conclude that if <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> in fact has no combinatorial lines, then there do not exist any triple <img src='http://s0.wp.com/latex.php?latex=%7BE_%7B12%7D%5Ea%2C+E_%7B13%7D%5Eb%2C+E_%7B23%7D%5Ec%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{E_{12}^a, E_{13}^b, E_{23}^c}&amp;fg=000000' title='{E_{12}^a, E_{13}^b, E_{23}^c}&amp;fg=000000' class='latex' /> of large cells with respect to which <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is dense and pseudorandom. This forces <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> to be confined either to very small cells, or to very sparse subsets of cells, or to the rare cells which fail to be pseudorandom. None of these cases can contribute much to the density of <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' />, and so <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> itself is very sparse &#8211; contradicting the hypothesis in Theorem <a href="#dhj">1</a> that <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is dense (this is the &#8220;cleaning step&#8221;). This concludes the sketch of the triangle-removal proof of this theorem.
</p>
<p>
The ergodic version of this argument, <a class="snap_noshots" href="http://arxiv.org/abs/0903.1633">due to Austin</a>, works for all values of <img src='http://s0.wp.com/latex.php?latex=%7Bk%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k}&amp;fg=000000' title='{k}&amp;fg=000000' class='latex' />, so I expect the combinatorial version to do so as well.
</p>
</p>
<p align="center"><b> &#8212;  4. The finitary Furstenberg-Katznelson argument  &#8212; </b></p>
<p>
In 1989, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1001397">Furstenberg and Katznelson</a> gave the first proof of Theorem <a href="#dhj">1</a>, by translating it into a recurrence statement about a certain type of stationary process indexed by an infinite cube <img src='http://s0.wp.com/latex.php?latex=%7B%5B3%5D%5E%5Comega+%3A%3D+%5Cbigcup_%7Bn%3D1%7D%5E%5Cinfty+%5B3%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[3]^&#92;omega := &#92;bigcup_{n=1}^&#92;infty [3]^n}&amp;fg=000000' title='{[3]^&#92;omega := &#92;bigcup_{n=1}^&#92;infty [3]^n}&amp;fg=000000' class='latex' />. This argument was inspired by a long string of other successful proofs of density Ramsey theorems via ergodic means, starting with the initial 1977 <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=498471">paper of Furstenberg</a> giving an ergodic theory proof of Szemeredi&#8217;s theorem. The latter proof was transcribed into a finitary language <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2274314">by myself</a>, so it was reasonable to expect that the Furstenberg-Katznelson argument could similarly be translated into a combinatorial framework.
</p>
<p>
Let us first briefly describe the original strategy of Furstenberg to establish Roth&#8217;s theorem, but phrased in an informal, and vaguely combinatorial, language. The basic task is to get a non-trivial lower bound on averages of the form <a name="fff">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7Ba%2Cr%7D+f%28a%29+f%28a%2Br%29+f%28a%2B2r%29+%5C+%5C+%5C+%5C+%5C+%281%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{a,r} f(a) f(a+r) f(a+2r) &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{a,r} f(a) f(a+r) f(a+2r) &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' class='latex' /></p>
<p></a> where we will be a bit vague about what <img src='http://s0.wp.com/latex.php?latex=%7Ba%2Cr%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a,r}&amp;fg=000000' title='{a,r}&amp;fg=000000' class='latex' /> are ranging over, and where <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> is some non-negative function of positive mean. It is then natural to study more general averages of the form <a name="fgh">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7Ba%2Cr%7D+f%28a%29+g%28a%2Br%29+h%28a%2B2r%29.+%5C+%5C+%5C+%5C+%5C+%282%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{a,r} f(a) g(a+r) h(a+2r). &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{a,r} f(a) g(a+r) h(a+2r). &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' class='latex' /></p>
<p></a> Now, it turns out that certain types of functions <img src='http://s0.wp.com/latex.php?latex=%7Bf%2Cg%2Ch%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f,g,h}&amp;fg=000000' title='{f,g,h}&amp;fg=000000' class='latex' /> give a negligible contribution to expressions such as <a href="#fgh">(2)</a>. In particular, if <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> is <em>weakly mixing</em>, which roughly means that the pair correlations </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_a+f%28a%29+f%28a%2Br%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_a f(a) f(a+r)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_a f(a) f(a+r)&amp;fg=000000' class='latex' /></p>
<p> are small for most <img src='http://s0.wp.com/latex.php?latex=%7Br%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r}&amp;fg=000000' title='{r}&amp;fg=000000' class='latex' />, then the average <a href="#fgh">(2)</a> is small no matter what <img src='http://s0.wp.com/latex.php?latex=%7Bg%2C+h%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g, h}&amp;fg=000000' title='{g, h}&amp;fg=000000' class='latex' /> are (so long as they are bounded). This can be established by some applications of the Cauchy-Schwarz inequality (or its close cousin, the <em>van der Corput lemma</em>). As a consequence of this, all weakly mixing components of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> can essentially be discarded when considering an average such as <a href="#fff">(1)</a>.</p>
<p>
After getting rid of the weakly mixing components, what is left? Being weakly mixing is like saying that almost all the shifts <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Ccdot%2Br%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;cdot+r)}&amp;fg=000000' title='{f(&#92;cdot+r)}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> are close to orthogonal to each other. At the other extreme is that of <em>periodicity</em> &#8211; the shifts <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Ccdot+%2B+r%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;cdot + r)}&amp;fg=000000' title='{f(&#92;cdot + r)}&amp;fg=000000' class='latex' /> periodically recur to become equal to <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> again. There is a slightly more general notion of <em>almost periodicity</em> &#8211; roughly, this means that the shifts <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Ccdot+%2B+r%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;cdot + r)}&amp;fg=000000' title='{f(&#92;cdot + r)}&amp;fg=000000' class='latex' /> don&#8217;t have to recur exactly to <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> again, but they are forced to range in a precompact set, which basically means that for every <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon &gt; 0}&amp;fg=000000' title='{&#92;varepsilon &gt; 0}&amp;fg=000000' class='latex' />, that <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Ccdot%2Br%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;cdot+r)}&amp;fg=000000' title='{f(&#92;cdot+r)}&amp;fg=000000' class='latex' /> lies within <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon}&amp;fg=000000' title='{&#92;varepsilon}&amp;fg=000000' class='latex' /> (in some suitable norm) of some finite-dimensional space. A good example of an almost periodic function is an <em>eigenfunction</em>, in which we have <img src='http://s0.wp.com/latex.php?latex=%7Bf%28a%2Br%29+%3D+%5Clambda_r+f%28a%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(a+r) = &#92;lambda_r f(a)}&amp;fg=000000' title='{f(a+r) = &#92;lambda_r f(a)}&amp;fg=000000' class='latex' /> for each <img src='http://s0.wp.com/latex.php?latex=%7Br%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r}&amp;fg=000000' title='{r}&amp;fg=000000' class='latex' /> and some quantity <img src='http://s0.wp.com/latex.php?latex=%7B%5Clambda_r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;lambda_r}&amp;fg=000000' title='{&#92;lambda_r}&amp;fg=000000' class='latex' /> independent of <img src='http://s0.wp.com/latex.php?latex=%7Ba%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a}&amp;fg=000000' title='{a}&amp;fg=000000' class='latex' /> (e.g. one can take <img src='http://s0.wp.com/latex.php?latex=%7Bf%28a%29+%3D+e%5E%7B2%5Cpi+i+%5Calpha+a%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(a) = e^{2&#92;pi i &#92;alpha a}}&amp;fg=000000' title='{f(a) = e^{2&#92;pi i &#92;alpha a}}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7B%5Calpha+%5Cin+%7B%5Cmathbb+R%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;alpha &#92;in {&#92;mathbb R}}&amp;fg=000000' title='{&#92;alpha &#92;in {&#92;mathbb R}}&amp;fg=000000' class='latex' />). In this case, the finite-dimensional space is simply the scalar multiples of <img src='http://s0.wp.com/latex.php?latex=%7Bf%28a%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(a)}&amp;fg=000000' title='{f(a)}&amp;fg=000000' class='latex' /> (and one can even take <img src='http://s0.wp.com/latex.php?latex=%7B%5Cvarepsilon%3D0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;varepsilon=0}&amp;fg=000000' title='{&#92;varepsilon=0}&amp;fg=000000' class='latex' /> in this special case).
</p>
<p>
It is easy to see that non-trivial almost periodic functions are not weakly mixing; more generally, any function which correlates non-trivially with an almost periodic function can also be seen to not be weakly mixing. In the converse direction, it is also fairly easy to show that any function which is not weakly mixing must have non-trivial correlation with an almost periodic function. Because of this, it turns out that one can basically decompose <em>any</em> function into almost periodic and weakly mixing components. For the purposes of getting lower bounds on <a href="#fff">(1)</a>, this allows us to essentially reduce matters to the special case when <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> is almost periodic. But then the shifts <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Ccdot+%2B+r%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;cdot + r)}&amp;fg=000000' title='{f(&#92;cdot + r)}&amp;fg=000000' class='latex' /> are almost ranging in a finite-dimensional set, which allows one to essentially assign each shift <img src='http://s0.wp.com/latex.php?latex=%7Br%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r}&amp;fg=000000' title='{r}&amp;fg=000000' class='latex' /> a colour from a finite range of colours. If one then applies the <a href="http://en.wikipedia.org/wiki/Van_der_Waerden&#37;27s_theorem">van der Waerden theorem</a>, one can find many arithmetic progressions <img src='http://s0.wp.com/latex.php?latex=%7Ba%2Ca%2Br%2Ca%2B2r%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a,a+r,a+2r}&amp;fg=000000' title='{a,a+r,a+2r}&amp;fg=000000' class='latex' /> which have the same colour, and this can be used to give a non-trivial lower bound on <a href="#fff">(1)</a>. (Thus we see that the role of a compactness property such as almost periodicity is to reduce density Ramsey theorems to colouring Ramsey theorems.)
</p>
<p>
This type of argument can be extended to more advanced recurrence theorems, but certain things become more complicated. For instance, suppose one wanted to count progressions of length <img src='http://s0.wp.com/latex.php?latex=%7B4%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{4}&amp;fg=000000' title='{4}&amp;fg=000000' class='latex' />; this amounts to lower bounding expressions such as <a name="ffff">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7Ba%2Cr%7D+f%28a%29+f%28a%2Br%29+f%28a%2B2r%29+f%28a%2B3r%29.+%5C+%5C+%5C+%5C+%5C+%283%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{a,r} f(a) f(a+r) f(a+2r) f(a+3r). &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{a,r} f(a) f(a+r) f(a+2r) f(a+3r). &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' class='latex' /></p>
<p></a> It turns out that <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> being weakly mixing is no longer enough to give a negligible contribution to expressions such as <a href="#ffff">(3)</a>. For that, one needs the stronger property of being <em>weakly mixing relative to almost periodic functions</em>; roughly speaking, this means that for most <img src='http://s0.wp.com/latex.php?latex=%7Br%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r}&amp;fg=000000' title='{r}&amp;fg=000000' class='latex' />, the expression <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Ccdot%29+f%28%5Ccdot%2Br%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;cdot) f(&#92;cdot+r)}&amp;fg=000000' title='{f(&#92;cdot) f(&#92;cdot+r)}&amp;fg=000000' class='latex' /> is not merely of small mean (which is what weak mixing would mean), but that this expression furthermore does not correlate strongly with any almost periodic function (i.e. <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D_a+f%28a%29+f%28a%2Br%29+g%28a%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E}_a f(a) f(a+r) g(a)}&amp;fg=000000' title='{{&#92;Bbb E}_a f(a) f(a+r) g(a)}&amp;fg=000000' class='latex' /> is small for any almost periodic <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' />). Once one has this stronger weak mixing property, then one can discard all components of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> which are weakly mixing relative to almost periodic functions.
</p>
<p>
One then has to figure out what is left after all these components are discarded. Because we strengthened the notion of weak mixing, we have to weaken the notion of almost periodicity to compensate. The correct notion is no longer that of almost periodicity &#8211; in which the shifts <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Ccdot+%2B+r%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;cdot + r)}&amp;fg=000000' title='{f(&#92;cdot + r)}&amp;fg=000000' class='latex' /> almost take values in a finite-dimensional vector space &#8211; but that of almost periodicity <em>relative</em> to almost periodic functions, in which the shifts almost take values in a finite-dimensional <em>module</em> over the algebra of almost periodic functions. A good example of such a beast is that of a <em>quadratic eigenfunction</em>, in which we have <img src='http://s0.wp.com/latex.php?latex=%7Bf%28a%2Br%29+%3D+%5Clambda_r%28a%29+f%28a%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(a+r) = &#92;lambda_r(a) f(a)}&amp;fg=000000' title='{f(a+r) = &#92;lambda_r(a) f(a)}&amp;fg=000000' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=%7B%5Clambda_r%28a%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;lambda_r(a)}&amp;fg=000000' title='{&#92;lambda_r(a)}&amp;fg=000000' class='latex' /> is itself an ordinary eigenfunction, and thus almost periodic in the ordinary sense; here, the relative module is the one-dimensional module formed by almost periodic multiples of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' />. (A typical example of a quadratic eigenfunction is <img src='http://s0.wp.com/latex.php?latex=%7Bf%28a%29+%3D+e%5E%7B2%5Cpi+i+%5Calpha+a%5E2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(a) = e^{2&#92;pi i &#92;alpha a^2}}&amp;fg=000000' title='{f(a) = e^{2&#92;pi i &#92;alpha a^2}}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7B%5Calpha+%5Cin+%7B%5Cmathbb+R%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;alpha &#92;in {&#92;mathbb R}}&amp;fg=000000' title='{&#92;alpha &#92;in {&#92;mathbb R}}&amp;fg=000000' class='latex' />.)
</p>
<p>
It turns out that one can &#8220;relativise&#8221; all of the previous arguments to the almost periodic &#8220;factor&#8221;, and decompose an arbitrary <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> into a component which is weakly mixing relative to almost periodic functions, and another component which is almost periodic relative to almost periodic functions. The former type of components can be discarded. For the latter, we can once again start colouring the shifts <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Ccdot%2Br%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;cdot+r)}&amp;fg=000000' title='{f(&#92;cdot+r)}&amp;fg=000000' class='latex' /> with a finite number of colours, but with the caveat that the colour assigned is no longer independent of <img src='http://s0.wp.com/latex.php?latex=%7Ba%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a}&amp;fg=000000' title='{a}&amp;fg=000000' class='latex' />, but depends in an almost periodic fashion on <img src='http://s0.wp.com/latex.php?latex=%7Ba%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{a}&amp;fg=000000' title='{a}&amp;fg=000000' class='latex' />. Nevertheless, it is still possible to combine the van der Waerden colouring Ramsey theorem with the theory of recurrence for ordinary almost periodic functions to get a lower bound on <a href="#ffff">(3)</a> in this case. One can then iterate this argument to deal with arithmetic progressions of longer length, but one now needs to consider even more intricate notions of almost periodicity, e.g. almost periodicity relative to (almost periodic functions relative to almost periodic functions), etc.
</p>
<p>
It turns out that these types of ideas can be adapted (with some effort) to the density Hales-Jewett setting. It&#8217;s simplest to begin with the <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=2}&amp;fg=000000' title='{k=2}&amp;fg=000000' class='latex' /> situation rather than the <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> situation. Here, we are trying to obtain non-trivial lower bounds for averages of the form <a name="fl">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7B%5Cell%7D+f%28%5Cell%281%29%29+f%28%5Cell%282%29%29+%5C+%5C+%5C+%5C+%5C+%284%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) f(&#92;ell(2)) &#92; &#92; &#92; &#92; &#92; (4)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) f(&#92;ell(2)) &#92; &#92; &#92; &#92; &#92; (4)&amp;fg=000000' class='latex' /></p>
<p></a> where <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell}&amp;fg=000000' title='{&#92;ell}&amp;fg=000000' class='latex' /> ranges in some fashion over combinatorial lines in <img src='http://s0.wp.com/latex.php?latex=%7B%5B2%5D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{[2]^n}&amp;fg=000000' title='{[2]^n}&amp;fg=000000' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> is some non-negative function with large mean.
</p>
<p>
The analogues of weakly mixing and almost periodic in this setting are the <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-uniform and <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-low influence functions respectively. Roughly speaking, a function is <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-low influence if its value usually doesn&#8217;t change much if a <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> is flipped to a <img src='http://s0.wp.com/latex.php?latex=%7B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2}&amp;fg=000000' title='{2}&amp;fg=000000' class='latex' /> or vice versa (e.g. the indicator function of a <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-insensitive set is <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-low influence); conversely, a <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-uniform function is a function <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D_%7B%5Cell%7D+f%28%5Cell%281%29%29+g%28%5Cell%282%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) g(&#92;ell(2))}&amp;fg=000000' title='{{&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) g(&#92;ell(2))}&amp;fg=000000' class='latex' /> is small for all (bounded) <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' />. One can show that any function can be decomposed, more or less orthogonally, into a <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-uniform function and a <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-low influence function, with the upshot being that one can basically reduce the task of lower bounding <a href="#fl">(4)</a> to the case when <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-low influence. But then <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Cell%281%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;ell(1))}&amp;fg=000000' title='{f(&#92;ell(1))}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bf%28%5Cell%282%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(&#92;ell(2))}&amp;fg=000000' title='{f(&#92;ell(2))}&amp;fg=000000' class='latex' /> are approximately equal to each other, and it is straightforward to get a lower-bound in this case.
</p>
<p>
Now we turn to the <img src='http://s0.wp.com/latex.php?latex=%7Bk%3D3%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{k=3}&amp;fg=000000' title='{k=3}&amp;fg=000000' class='latex' /> setting, where we are looking at lower-bounding expressions such as <a name="fl3">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7B%5Cell%7D+f%28%5Cell%281%29%29+g%28%5Cell%282%29%29+h%28%5Cell%283%29%29+%5C+%5C+%5C+%5C+%5C+%285%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) g(&#92;ell(2)) h(&#92;ell(3)) &#92; &#92; &#92; &#92; &#92; (5)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) g(&#92;ell(2)) h(&#92;ell(3)) &#92; &#92; &#92; &#92; &#92; (5)&amp;fg=000000' class='latex' /></p>
<p></a> with <img src='http://s0.wp.com/latex.php?latex=%7Bf%3Dg%3Dh%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f=g=h}&amp;fg=000000' title='{f=g=h}&amp;fg=000000' class='latex' />.
</p>
<p>
It turns out that <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' /> (say) being <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-uniform is no longer enough to give a negligible contribution to the average <a href="#fl3">(5)</a>. Instead, one needs the more complicated notion of <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' /> being <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-uniform relative to <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-low influence functions; this means that not only are the averages <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D_%7B%5Cell%7D+f%28%5Cell%281%29%29+g%28%5Cell%282%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) g(&#92;ell(2))}&amp;fg=000000' title='{{&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) g(&#92;ell(2))}&amp;fg=000000' class='latex' /> small for all bounded <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' />, but furthermore <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D_%7B%5Cell%7D+f%28%5Cell%281%29%29+g%28%5Cell%282%29%29+h%28%5Cell%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) g(&#92;ell(2)) h(&#92;ell)}&amp;fg=000000' title='{{&#92;Bbb E}_{&#92;ell} f(&#92;ell(1)) g(&#92;ell(2)) h(&#92;ell)}&amp;fg=000000' class='latex' /> is small for all bounded <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> and all <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-low influence <img src='http://s0.wp.com/latex.php?latex=%7Bh%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{h}&amp;fg=000000' title='{h}&amp;fg=000000' class='latex' /> (there is a minor technical point here that <img src='http://s0.wp.com/latex.php?latex=%7Bh%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{h}&amp;fg=000000' title='{h}&amp;fg=000000' class='latex' /> is a function of a line rather than of a point, but this should be ignored). Any component of <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' /> in <a href="#fl3">(5)</a> which is <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-uniform relative to <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-low influence functions are negligible and so can be removed.
</p>
<p>
One then needs to figure out what is left in <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' /> when these components are removed. The answer turns out to be functions <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' /> that are <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-almost periodic relative to <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-low influence. The precise definition of this concept is technical, but very roughly speaking it means that if one flips a digit from a <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> to a <img src='http://s0.wp.com/latex.php?latex=%7B2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2}&amp;fg=000000' title='{2}&amp;fg=000000' class='latex' />, then the value of <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' /> changes in a manner which is controlled by <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-low influence functions. Anyway, the upshot is that one can reduce <img src='http://s0.wp.com/latex.php?latex=%7Bg%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g}&amp;fg=000000' title='{g}&amp;fg=000000' class='latex' /> in <a href="#fl3">(5)</a> from <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> to the components of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> which are <img src='http://s0.wp.com/latex.php?latex=%7B12%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{12}&amp;fg=000000' title='{12}&amp;fg=000000' class='latex' />-almost periodic relative to <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-low influence. Similarly, one can reduce <img src='http://s0.wp.com/latex.php?latex=%7Bh%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{h}&amp;fg=000000' title='{h}&amp;fg=000000' class='latex' /> in <a href="#fl3">(5)</a> from <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> to the components of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> which are <img src='http://s0.wp.com/latex.php?latex=%7B13%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{13}&amp;fg=000000' title='{13}&amp;fg=000000' class='latex' />-almost periodic relative to <img src='http://s0.wp.com/latex.php?latex=%7B23%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{23}&amp;fg=000000' title='{23}&amp;fg=000000' class='latex' />-low influence.
</p>
<p>
At this point, one has to use a colouring Ramsey theorem &#8211; in this case, the Graham-Rothschild theorem &#8211; in conjunction with the relative almost periodicity to locate lots of places in which <img src='http://s0.wp.com/latex.php?latex=%7Bg%28%5Cell%282%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g(&#92;ell(2))}&amp;fg=000000' title='{g(&#92;ell(2))}&amp;fg=000000' class='latex' /> is close to <img src='http://s0.wp.com/latex.php?latex=%7Bg%28%5Cell%281%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g(&#92;ell(1))}&amp;fg=000000' title='{g(&#92;ell(1))}&amp;fg=000000' class='latex' /> while <img src='http://s0.wp.com/latex.php?latex=%7Bh%28%5Cell%283%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{h(&#92;ell(3))}&amp;fg=000000' title='{h(&#92;ell(3))}&amp;fg=000000' class='latex' /> is simultaneously close to <img src='http://s0.wp.com/latex.php?latex=%7Bh%28%5Cell%281%29%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{h(&#92;ell(1))}&amp;fg=000000' title='{h(&#92;ell(1))}&amp;fg=000000' class='latex' />. This turns <a href="#fl3">(5)</a> into an expression of the form <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D_x+f%28x%29+g%28x%29+h%28x%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E}_x f(x) g(x) h(x)}&amp;fg=000000' title='{{&#92;Bbb E}_x f(x) g(x) h(x)}&amp;fg=000000' class='latex' />, which turns out to be relatively easy to lower bound (because <img src='http://s0.wp.com/latex.php?latex=%7Bg%2C+h%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{g, h}&amp;fg=000000' title='{g, h}&amp;fg=000000' class='latex' />, being projections of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' />, tend to be large wherever <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> is large).
</p></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[DHJ(k): 1200-1299 (Density Hales-Jewett type numbers)]]></title>
<link>http://terrytao.wordpress.com/2009/03/30/dhjk-1200-1299-density-hales-jewett-type-numbers/</link>
<pubDate>Mon, 30 Mar 2009 21:25:54 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/03/30/dhjk-1200-1299-density-hales-jewett-type-numbers/</guid>
<description><![CDATA[This is a continuation of the 1100-1199 thread of the polymath1 project, which is now full.  The foc]]></description>
<content:encoded><![CDATA[<p>This is a continuation of the <a class="snap_noshots" href="http://terrytao.wordpress.com/2009/03/14/dhj3-1100-1199-density-hales-jewett-type-numbers/">1100-1199 thread</a> of the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1</a> project, which is now full.  The focus has now mostly shifted to generalisations of the previous problems being studied to larger alphabet sizes k, so I am changing the title here from DHJ(3) to DHJ(k) to reflect this.</p>
<p>The discussion is evolving rapidly, but here are some of the topics currently being discussed:</p>
<ul>
<li>Understanding the density Hales-Jewett numbers <img src='http://s0.wp.com/latex.php?latex=c_%7Bn%2Ck%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{n,k}' title='c_{n,k}' class='latex' />, defined as the size of the largest subset of <img src='http://s0.wp.com/latex.php?latex=%5Bk%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[k]^n' title='[k]^n' class='latex' /> that does not contain a combinatorial line.  The progress on that problem is summarised <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Higher-dimensional_DHJ_numbers">here</a>.</li>
<li><a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Higher-dimensional_Fujimura">Fujimura&#8217;s problem in higher dimensions</a>.</li>
<li>The <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Hyper-optimistic_conjecture">hyper-optimistic conjecture</a> in higher dimensions.  It looks like the conjecture in fact fails in higher dimensions, though perhaps it could be salvaged by reformulating it.</li>
<li>Reducing the need for computer assistance in<a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Moser%27s_cube_problem"> our result</a> <img src='http://s0.wp.com/latex.php?latex=c%27_5%3D124&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_5=124' title='c&#039;_5=124' class='latex' /> on Moser&#8217;s problem.</li>
<li>New lower bounds for the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Hales-Jewett_theorem">coloring Hales-Jewett numbers</a>.</li>
</ul>
<p>Note that much of the most recent progress has not yet been ported to the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">wiki</a>.  In order to help everyone else catch up, it may useful if authors of comments (particularly comments with lengthy computations, or with corrections to previous comments) put their work on the relevant page of the wiki (not necessarily in the most polished format), and perhaps only place a summary of it on the thread itself.</p>
<p>[Incidentally, for the more casual followers of this project, a non-technical introduction to this project can be found at <a HREF="http://numberwarrior.wordpress.com/2009/03/25/a-gentle-introduction-to-the-polymath-project/">this post of Jason Dyer</a>.]</p>
<p>Comments here should start from 1200.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[DHJ(3): 1100-1199 (Density Hales-Jewett type numbers)]]></title>
<link>http://terrytao.wordpress.com/2009/03/14/dhj3-1100-1199-density-hales-jewett-type-numbers/</link>
<pubDate>Sat, 14 Mar 2009 16:35:20 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/03/14/dhj3-1100-1199-density-hales-jewett-type-numbers/</guid>
<description><![CDATA[This is a continuation of the 900-999 thread of the polymath1 project, which is now full.  We&#8217;]]></description>
<content:encoded><![CDATA[<p>This is a continuation of the <a class="snap_noshots" href="http://terrytao.wordpress.com/2009/03/04/dhj3-900-999-density-hales-jewett-type-numbers">900-999 thread</a> of the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1</a> project, which is now full.  We&#8217;ve made quite a bit of progress so far on our original mission of bounding density Hales-Jewett numbers.  In particular, we have shown</p>
<ul>
<li><img src='http://s0.wp.com/latex.php?latex=c_6%3D450&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_6=450' title='c_6=450' class='latex' />: Any subset of <img src='http://s0.wp.com/latex.php?latex=%5B3%5D%5E6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[3]^6' title='[3]^6' class='latex' /> with 451 points contains a <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Line">combinatorial line</a>; and there is exactly one example with 450 points with no combinatorial line.  (We have two proofs, one by a large integer program, and the other being purely human-verifiable; the latter can be <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Upper_and_lower_bounds">found here</a>.)</li>
<li><img src='http://s0.wp.com/latex.php?latex=c%27_5%3D124&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_5=124' title='c&#039;_5=124' class='latex' />: Any subset of <img src='http://s0.wp.com/latex.php?latex=%5B3%5D%5E5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[3]^5' title='[3]^5' class='latex' /> with 125 points contains a <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Line">geometric line</a>; and we have several examples with 124 points with no geometric line.  (The proof is partly computer-assisted; <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Moser%27s_cube_problem">details are here</a>.)</li>
<li><img src='http://s0.wp.com/latex.php?latex=353+%5Cleq+c%27_6+%5Cleq+361&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='353 &#92;leq c&#039;_6 &#92;leq 361' title='353 &#92;leq c&#039;_6 &#92;leq 361' class='latex' />: A genetic algorithm has constructed 353-point solutions in <img src='http://s0.wp.com/latex.php?latex=%5B3%5D%5E6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[3]^6' title='[3]^6' class='latex' /> with no geometric line; in the other direction, <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Maple_calculations">linear and integer programming methods have shown</a> that any set with 362 points must have a geometric line.</li>
<li><img src='http://s0.wp.com/latex.php?latex=%5Coverline%7Bc%7D%5E%5Cmu_%7B12%7D+%3D+40&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;overline{c}^&#92;mu_{12} = 40' title='&#92;overline{c}^&#92;mu_{12} = 40' class='latex' />: In the triangular grid <img src='http://s0.wp.com/latex.php?latex=%5C%7B%28a%2Cb%2Cc%29+%5Cin+%7B%5CBbb+N%7D%5E3%3Aa%2Bb%2Bc%3D12%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{(a,b,c) &#92;in {&#92;Bbb N}^3:a+b+c=12&#92;}' title='&#92;{(a,b,c) &#92;in {&#92;Bbb N}^3:a+b+c=12&#92;}' class='latex' />, any set of 41 points contains an upwards-pointing equilateral triangle <img src='http://s0.wp.com/latex.php?latex=%28a%2Br%2Cb%2Cc%29%2C%28a%2Cb%2Br%2Cc%29%2C%28a%2Cb%2Cc%2Br%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(a+r,b,c),(a,b+r,c),(a,b,c+r)' title='(a+r,b,c),(a,b+r,c),(a,b,c+r)' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=r%26%2362%3B0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='r&gt;0' title='r&gt;0' class='latex' />; and we have 40-point examples without such triangles.  This was conducted by an integer program.</li>
</ul>
<p>This <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Timeline">timeline</a> shows the history of these and other developments in the project.</p>
<p>There are still several numbers that look feasible to compute.  For instance, the bounds on <img src='http://s0.wp.com/latex.php?latex=c%27_6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_6' title='c&#039;_6' class='latex' /> should be able to be narrowed further.  Work is slowly progressing also on the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Hyper-optimistic_conjecture">equal-slices Hales-Jewett numbers</a> <img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_n' title='c^&#92;mu_n' class='latex' />, which are <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Hyper-optimistic_conjecture">hyper-optimistically conjectured</a> to equal the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Fujimura%27s_problem">Fujimura numbers</a> <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7Bc%7D%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;overline{c}^&#92;mu_n' title='&#92;overline{c}^&#92;mu_n' class='latex' />; this has been verified by hand up to n=3 and by integer programming up to n=5.  We are also looking at trying to reduce the dependence on computer assistance in establishing the <img src='http://s0.wp.com/latex.php?latex=c%27_5+%3D+124&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_5 = 124' title='c&#039;_5 = 124' class='latex' /> result; the best human result so far is <img src='http://s0.wp.com/latex.php?latex=124+%5Cleq+c%27_5+%5Cleq+125&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='124 &#92;leq c&#039;_5 &#92;leq 125' title='124 &#92;leq c&#039;_5 &#92;leq 125' class='latex' />.</p>
<p>Some progress has recently been made on some other related questions.  For instance, we now have a precise description of the lower bound on <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> coming from the Behrend-Elkin construction, namely</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_n+%26%2362%3B+C+3%5E%7Bn+-+4%5Csqrt%7B%5Clog+2%7D%5Csqrt%7B%5Clog+n%7D%2B%5Cfrac+12+%5Clog+%5Clog+n%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n &gt; C 3^{n - 4&#92;sqrt{&#92;log 2}&#92;sqrt{&#92;log n}+&#92;frac 12 &#92;log &#92;log n}' title='c_n &gt; C 3^{n - 4&#92;sqrt{&#92;log 2}&#92;sqrt{&#92;log n}+&#92;frac 12 &#92;log &#92;log n}' class='latex' /></p>
<p>for some absolute constant c.</p>
<p>Also, it is probably a good time to transport some of the discussion in earlier threads to <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">the wiki</a> and make it easier for outsiders to catch up.  (Incidentally, we need a logo for that wiki; any suggestions would be welcome!)</p>
<p>Comments on this thread should be numbered starting at 1100.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[The Kakeya set and maximal conjectures for algebraic varieties over finite fields]]></title>
<link>http://terrytao.wordpress.com/2009/03/12/the-kakeya-set-and-maximal-conjectures-for-algebraic-varieties-over-finite-fields/</link>
<pubDate>Thu, 12 Mar 2009 22:42:15 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/03/12/the-kakeya-set-and-maximal-conjectures-for-algebraic-varieties-over-finite-fields/</guid>
<description><![CDATA[Jordan Ellenberg, Richard Oberlin, and I have just uploaded to the arXiv the paper &#8220;The Kakeya]]></description>
<content:encoded><![CDATA[<p><a class="snap_noshots" href="http://www.math.wisc.edu/~ellenber/">Jordan Ellenberg</a>, <a class="snap_noshots" href="http://www.math.ucla.edu/people/pages/oberlin.shtml">Richard Oberlin</a>, and I have just uploaded to the <a class="snap_noshots" href="http://www.arxiv.org">arXiv</a> the paper &#8220;<a class="snap_noshots" href="http://arxiv.org/abs/0903.1879">The Kakeya set and maximal conjectures for algebraic varieties over finite fields</a>&#8220;, submitted to <a class="snap_noshots" href="http://www.ucl.ac.uk/Mathematics/Mathematika/">Mathematika</a>.  This paper builds upon some <a class="snap_noshots" href="http://arxiv.org/abs/0803.2336">work of Dvir</a> and later authors on the Kakeya problem in finite fields, which I have discussed in this <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/03/24/dvirs-proof-of-the-finite-field-kakeya-conjecture/">earlier blog post</a>.  Dvir established the following:</p>
<blockquote><p><strong>Kakeya set conjecture for finite fields.</strong> Let F be a finite field, and let E be a subset of <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> that contains a line in every direction.  Then E has cardinality at least <img src='http://s0.wp.com/latex.php?latex=c_n+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n &#124;F&#124;^n' title='c_n &#124;F&#124;^n' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=c_n+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n &gt; 0' title='c_n &gt; 0' class='latex' />.</p></blockquote>
<p>The initial argument of Dvir gave <img src='http://s0.wp.com/latex.php?latex=c_n+%3D+1%2Fn%21&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n = 1/n!' title='c_n = 1/n!' class='latex' />.  This was improved to <img src='http://s0.wp.com/latex.php?latex=c_n+%3D+c%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n = c^n' title='c_n = c^n' class='latex' /> for some explicit <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+c+%26%2360%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; c &lt; 1' title='0 &lt; c &lt; 1' class='latex' /> by <a class="snap_noshots" href="http://arxiv.org/abs/0808.2499">Saraf and Sudan</a>, and recently to <img src='http://s0.wp.com/latex.php?latex=c_n+%3D1%2F2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n =1/2^n' title='c_n =1/2^n' class='latex' /> by <a class="snap_noshots" href="http://arxiv.org/abs/0901.2529">Dvir, Kopparty, Saraf, and Sudan</a>, which is within a factor 2 of the optimal result.</p>
<p>In our work we investigate a somewhat different set of improvements to Dvir&#8217;s result.  The first concerns the <em>Kakeya maximal function</em> <img src='http://s0.wp.com/latex.php?latex=f%5E%2A%3A+%7B%5CBbb+P%7D%5E%7Bn-1%7D%28F%29+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f^*: {&#92;Bbb P}^{n-1}(F) &#92;to {&#92;Bbb R}' title='f^*: {&#92;Bbb P}^{n-1}(F) &#92;to {&#92;Bbb R}' class='latex' /> of a function <img src='http://s0.wp.com/latex.php?latex=f%3A+F%5En+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F^n &#92;to {&#92;Bbb R}' title='f: F^n &#92;to {&#92;Bbb R}' class='latex' />, defined for all directions <img src='http://s0.wp.com/latex.php?latex=%5Cxi+%5Cin+%7B%5CBbb+P%7D%5E%7Bn-1%7D%28F%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi &#92;in {&#92;Bbb P}^{n-1}(F)' title='&#92;xi &#92;in {&#92;Bbb P}^{n-1}(F)' class='latex' /> in the projective hyperplane at infinity by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=f%5E%2A%28%5Cxi%29+%3D+%5Csup_%7B%5Cell+%2F%2F+%5Cxi%7D+%5Csum_%7Bx+%5Cin+%5Cell%7D+%26%23124%3Bf%28x%29%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f^*(&#92;xi) = &#92;sup_{&#92;ell // &#92;xi} &#92;sum_{x &#92;in &#92;ell} &#124;f(x)&#124;' title='f^*(&#92;xi) = &#92;sup_{&#92;ell // &#92;xi} &#92;sum_{x &#92;in &#92;ell} &#124;f(x)&#124;' class='latex' /></p>
<p>where the supremum ranges over all lines <img src='http://s0.wp.com/latex.php?latex=%5Cell&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell' title='&#92;ell' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> oriented in the direction <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' />.  Our first result is the endpoint <img src='http://s0.wp.com/latex.php?latex=L%5Ep&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L^p' title='L^p' class='latex' /> estimate for this operator, namely</p>
<blockquote><p><strong>Kakeya maximal function conjecture in finite fields.</strong> We have <img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3B+f%5E%2A+%5C%26%23124%3B_%7B%5Cell%5En%28%7B%5CBbb+P%7D%5E%7Bn-1%7D%28F%29%29%7D+%5Cleq+C_n+%26%23124%3BF%26%23124%3B%5E%7B%28n-1%29%2Fn%7D+%5C%26%23124%3Bf%5C%26%23124%3B_%7B%5Cell%5En%28F%5En%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124; f^* &#92;&#124;_{&#92;ell^n({&#92;Bbb P}^{n-1}(F))} &#92;leq C_n &#124;F&#124;^{(n-1)/n} &#92;&#124;f&#92;&#124;_{&#92;ell^n(F^n)}' title='&#92;&#124; f^* &#92;&#124;_{&#92;ell^n({&#92;Bbb P}^{n-1}(F))} &#92;leq C_n &#124;F&#124;^{(n-1)/n} &#92;&#124;f&#92;&#124;_{&#92;ell^n(F^n)}' class='latex' /> for some constant <img src='http://s0.wp.com/latex.php?latex=C_n+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='C_n &gt; 0' title='C_n &gt; 0' class='latex' />.</p></blockquote>
<p>This result implies Dvir&#8217;s result, since if f is the indicator function of the set E in Dvir&#8217;s result, then <img src='http://s0.wp.com/latex.php?latex=f%5E%2A%28%5Cxi%29+%3D+%26%23124%3BF%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f^*(&#92;xi) = &#124;F&#124;' title='f^*(&#92;xi) = &#124;F&#124;' class='latex' /> for every <img src='http://s0.wp.com/latex.php?latex=%5Cxi+%5Cin+%7B%5CBbb+P%7D%5E%7Bn-1%7D%28F%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi &#92;in {&#92;Bbb P}^{n-1}(F)' title='&#92;xi &#92;in {&#92;Bbb P}^{n-1}(F)' class='latex' />.  However, it also gives information on more general sets E which do not necessarily contain a line in every direction, but instead contain a certain fraction of a line in a subset of directions.  The exponents here are best possible in the sense that all other <img src='http://s0.wp.com/latex.php?latex=%5Cell%5Ep+%5Cto+%5Cell%5Eq&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell^p &#92;to &#92;ell^q' title='&#92;ell^p &#92;to &#92;ell^q' class='latex' /> mapping properties of the operator can be deduced (with bounds that are optimal up to constants) by interpolating the above estimate with more trivial estimates.  This result is the finite field analogue of a long-standing (and still open) conjecture for the Kakeya maximal function in Euclidean spaces; we rely on the polynomial method of Dvir, which thus far has not extended to the Euclidean setting (but note the very interesting variant of this method <a class="snap_noshots" href="http://arxiv.org/abs/0811.2251">by Guth</a> that has established the endpoint multilinear Kakeya maximal function estimate in this setting, see <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/11/27/the-kakeya-conjecture-and-the-ham-sandwich-theorem/">this blog post</a> for further discussion).</p>
<p>It turns out that a direct application of the polynomial method is not sufficient to recover the full strength of the maximal function estimate; but by combining the polynomial method with the Nikishin-Maurey-Pisier-Stein &#8220;method of random rotations&#8221; (as interpreted nowadays by Stein and later by Bourgain, and originally inspired by the factorisation theorems of Nikishin, Maurey, and Pisier), one can already recover a &#8220;restricted weak type&#8221; version of the above estimate.  If one then enhances the polynomial method with the &#8220;method of multiplicities&#8221; (as introduced by <a class="snap_noshots" href="http://arxiv.org/abs/0808.2499">Saraf and Sudan</a>) we can then recover the full &#8220;strong type&#8221; estimate; a few more details below the fold.</p>
<p>It turns out that one can generalise the above results to more general affine or projective algebraic varieties over finite fields.  In particular, we showed</p>
<blockquote><p><strong>Kakeya maximal function conjecture in algebraic varieties.</strong> Suppose that <img src='http://s0.wp.com/latex.php?latex=W+%5Csubset+%7B%5CBbb+P%7D%5EN&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='W &#92;subset {&#92;Bbb P}^N' title='W &#92;subset {&#92;Bbb P}^N' class='latex' /> is an (n-1)-dimensional algebraic variety.  Let <img src='http://s0.wp.com/latex.php?latex=d+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &#92;geq 1' title='d &#92;geq 1' class='latex' /> be an integer. Then we have</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3B+%5Csup_%7B%5Cgamma+%5Cni+x%3B+%5Cgamma+%5Cnot+%5Csubset+W%7D+%5Csum_%7By+%5Cin+%5Cgamma%7D+f%28y%29+%5C%26%23124%3B_%7B%5Cell%5En_x%28W%28F%29%29%7D+%5Cleq+C_%7Bn%2Cd%2CN%2CW%7D+%26%23124%3BF%26%23124%3B%5E%7B%28n-1%29%2Fn%7D+%5C%26%23124%3Bf%5C%26%23124%3B_%7B%5Cell%5En%28%7B%5CBbb+P%7D%5EN%28F%29%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124; &#92;sup_{&#92;gamma &#92;ni x; &#92;gamma &#92;not &#92;subset W} &#92;sum_{y &#92;in &#92;gamma} f(y) &#92;&#124;_{&#92;ell^n_x(W(F))} &#92;leq C_{n,d,N,W} &#124;F&#124;^{(n-1)/n} &#92;&#124;f&#92;&#124;_{&#92;ell^n({&#92;Bbb P}^N(F))}' title='&#92;&#124; &#92;sup_{&#92;gamma &#92;ni x; &#92;gamma &#92;not &#92;subset W} &#92;sum_{y &#92;in &#92;gamma} f(y) &#92;&#124;_{&#92;ell^n_x(W(F))} &#92;leq C_{n,d,N,W} &#124;F&#124;^{(n-1)/n} &#92;&#124;f&#92;&#124;_{&#92;ell^n({&#92;Bbb P}^N(F))}' class='latex' /></p>
<p>for some constant <img src='http://s0.wp.com/latex.php?latex=C_%7Bn%2Cd%2CN%2CW%7D+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='C_{n,d,N,W} &gt; 0' title='C_{n,d,N,W} &gt; 0' class='latex' />, where the supremum is over all irreducible algebraic curves <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' /> of degree at most d that pass through x but do not lie in W, and W(F) denotes the F-points of W.</p></blockquote>
<p>The ordinary Kakeya maximal function conjecture corresponds to the case when N=n, W is the hyperplane at infinity, and the degree d is equal to 1.  One corollary of this estimate is a Dvir-type result: a subset of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%5EN%28F%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}^N(F)' title='{&#92;Bbb P}^N(F)' class='latex' /> which contains, for each x in W, an irreducible algebraic curve of degree d passing through x but not lying in W, has cardinality <img src='http://s0.wp.com/latex.php?latex=%5Cgg+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gg &#124;F&#124;^n' title='&#92;gg &#124;F&#124;^n' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BW%26%23124%3B+%5Cgg+%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;W&#124; &#92;gg &#124;F&#124;^{n-1}' title='&#124;W&#124; &#92;gg &#124;F&#124;^{n-1}' class='latex' />.  (In particular this implies a lower bound for Nikodym sets worked out <a class="snap_noshots" href="http://arxiv.org/abs/0803.3525">by Li</a>.)  The dependence of the implied constant on W is only via the degree of W.</p>
<p>The techniques used in the flat case can easily handle curves <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' /> of higher degree (provided that we allow the implied constants to depend on d), but the method of random rotations does not seem to work directly on the algebraic variety W as there are usually no symmetries of this variety to exploit.  Fortunately, we can get around this by using a &#8220;random projection trick&#8221; to &#8220;flatten&#8221; W into a hyperplane (after first expressing W as the zero locus of some polynomials, and then composing with the graphing map for such polynomials), reducing the non-flat case to the flat case.</p>
<p>Below the fold, I wish to sketch two of the key ingredients in our arguments, the random rotations method and the random projections trick.  (We of course also use some algebraic geometry, but mostly low-tech stuff, on the level of <a href="http://en.wikipedia.org/wiki/Bezout%27s_theorem">Bezout&#8217;s theorem</a>, though we do need one non-trivial result of Kleiman (from <a href="http://www.ams.org/mathscinet-getitem?mr=354655">SGA6</a>), that asserts that bounded degree varieties can be cut out by a bounded number of polynomials of bounded degree.)</p>
<p>[Update, March 14: See also <a class="snap_noshots" href="http://quomodocumque.wordpress.com/2009/03/14/blogging-is-good-for-math-or-kakeya-problems-over-finite-fields/">Jordan's own blog post on our paper</a>.]</p>
<p style="text-align:center;"><!--more-->&#8211; The random rotations method &#8211;</p>
<p>The random rotations method allows one to amplify a result about large sets into an estimate about small sets, by observing that large sets can be created by taking many random rotations (or translations, etc.) of a given small set; it is useful in any situation which enjoys symmetries that one can push small sets around this way (e.g. a <a href="http://en.wikipedia.org/wiki/Homogeneous_space">homogeneous space</a> will do nicely).  Let&#8217;s see how it works in this context.  Dvir&#8217;s argument lets us give the following conclusion (stated somewhat informally):</p>
<blockquote><p><strong>Proposition 1.</strong> Let E be a subset of <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> which contains lines in J different directions.  If <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BJ%26%23124%3B+%5Csim+%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;J&#124; &#92;sim &#124;F&#124;^{n-1}' title='&#124;J&#124; &#92;sim &#124;F&#124;^{n-1}' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Cgtrsim+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;gtrsim &#124;F&#124;^n' title='&#124;E&#124; &#92;gtrsim &#124;F&#124;^n' class='latex' />.  (Implied constants here are allowed to depend on n.)</p></blockquote>
<p>We now amplify this by the random rotations method to extend from the large J case to the small J case:</p>
<blockquote><p><strong>Proposition 2.</strong> Let E be a subset of <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> which contains lines in J different directions.  Then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Cgtrsim+%26%23124%3BJ%26%23124%3B+%26%23124%3BF%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;gtrsim &#124;J&#124; &#124;F&#124;' title='&#124;E&#124; &#92;gtrsim &#124;J&#124; &#124;F&#124;' class='latex' />.</p></blockquote>
<p><strong>Proof. </strong>If <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BJ%26%23124%3B+%5Csim+%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;J&#124; &#92;sim &#124;F&#124;^{n-1}' title='&#124;J&#124; &#92;sim &#124;F&#124;^{n-1}' class='latex' /> then we are done by Proposition 1 already, so suppose that J is much smaller than <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;F&#124;^{n-1}' title='&#124;F&#124;^{n-1}' class='latex' />.  (There are only <img src='http://s0.wp.com/latex.php?latex=O%28%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(&#124;F&#124;^{n-1})' title='O(&#124;F&#124;^{n-1})' class='latex' /> directions available in <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' />.)  Let m be an integer such that <img src='http://s0.wp.com/latex.php?latex=Jm+%5Csim+%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Jm &#92;sim &#124;F&#124;^{n-1}' title='Jm &#92;sim &#124;F&#124;^{n-1}' class='latex' />.</p>
<p>We use the <a href="http://en.wikipedia.org/wiki/Probabilistic_method">probabilistic method</a> (and more precisely, the first moment method).  Now let <img src='http://s0.wp.com/latex.php?latex=R_1%2C+R_2%2C+%5Cldots%2C+R_m%3A+F%5En+%5Cto+F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='R_1, R_2, &#92;ldots, R_m: F^n &#92;to F^n' title='R_1, R_2, &#92;ldots, R_m: F^n &#92;to F^n' class='latex' /> be m randomly chosen &#8220;rotations&#8221;, or more precisely invertible linear transformations, and consider the set <img src='http://s0.wp.com/latex.php?latex=E%27+%3A%3D+%5Cbigcup_%7Bi%3D1%7D%5Em+R_i%28E%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E&#039; := &#92;bigcup_{i=1}^m R_i(E)' title='E&#039; := &#92;bigcup_{i=1}^m R_i(E)' class='latex' />.  Clearly, <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%27%26%23124%3B+%5Cleq+m+%26%23124%3BE%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#039;&#124; &#92;leq m &#124;E&#124;' title='&#124;E&#039;&#124; &#92;leq m &#124;E&#124;' class='latex' />.  On the other hand, we expect <img src='http://s0.wp.com/latex.php?latex=E%27&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E&#039;' title='E&#039;' class='latex' /> to contain lines in about <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;F&#124;^{n-1}' title='&#124;F&#124;^{n-1}' class='latex' /> different directions.  Indeed, pick any direction <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' />.  Then <img src='http://s0.wp.com/latex.php?latex=R_i%5E%7B-1%7D%28%5Cxi%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='R_i^{-1}(&#92;xi)' title='R_i^{-1}(&#92;xi)' class='latex' /> has a probability about <img src='http://s0.wp.com/latex.php?latex=J%2F%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='J/&#124;F&#124;^{n-1}' title='J/&#124;F&#124;^{n-1}' class='latex' /> of being one of the J directions of the lines that E contains; this is also the probability that <img src='http://s0.wp.com/latex.php?latex=R_i%28E%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='R_i(E)' title='R_i(E)' class='latex' /> contains a line in the direction <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' />.  If we choose each of the <img src='http://s0.wp.com/latex.php?latex=R_i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='R_i' title='R_i' class='latex' /> independently at random, then basic probability then tells us that for each direction <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' />, that E&#8217; will contain a line in the direction <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' /> with probability about <img src='http://s0.wp.com/latex.php?latex=1+-+%281+-+J%2F%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D%29%5Em+%5Csim+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 - (1 - J/&#124;F&#124;^{n-1})^m &#92;sim 1' title='1 - (1 - J/&#124;F&#124;^{n-1})^m &#92;sim 1' class='latex' />.  From linearity of expectation we thus see that the expected number of directions along which E&#8217; contains a line is <img src='http://s0.wp.com/latex.php?latex=%5Csim+%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sim &#124;F&#124;^{n-1}' title='&#92;sim &#124;F&#124;^{n-1}' class='latex' />, so in particular we will have <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%26%23124%3BF%26%23124%3B%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#124;F&#124;^{n-1}' title='&#92;gtrsim &#124;F&#124;^{n-1}' class='latex' /> such lines for at least one choice of E&#8217;.  Applying Proposition 1 to that choice we obtain <img src='http://s0.wp.com/latex.php?latex=m%26%23124%3BE%26%23124%3B+%5Cgeq+%26%23124%3BE%27%26%23124%3B+%5Cgtrsim+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='m&#124;E&#124; &#92;geq &#124;E&#039;&#124; &#92;gtrsim &#124;F&#124;^n' title='m&#124;E&#124; &#92;geq &#124;E&#039;&#124; &#92;gtrsim &#124;F&#124;^n' class='latex' />, and so <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Cgtrsim+J+%26%23124%3BF%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;gtrsim J &#124;F&#124;' title='&#124;E&#124; &#92;gtrsim J &#124;F&#124;' class='latex' /> as required.</p>
<p style="text-align:center;">&#8211; The random projection trick &#8211;</p>
<p>The random projection trick is a different application of the probabilistic method.  It uses the fact that random maps tend to be &#8220;mostly injective&#8221; whenever the range is larger than the domain (and, dually, are &#8220;mostly surjective&#8221; when the domain is larger than the range); this often allows one to cut either the domain or range down in size until they are roughly equal.  It is used in a number of places in the literature, for instance in <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1281447">Ruzsa&#8217;s proof of Freiman&#8217;s theorem</a>.</p>
<p>For instance, we have the following statement (stated a bit informally to simplify the formulation):</p>
<blockquote><p><strong>Lemma 1.</strong> Let E be a subset of <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=T%3A+F%5En+%5Cto+F%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T: F^n &#92;to F^m' title='T: F^n &#92;to F^m' class='latex' /> be a random linear transformation.  If <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Clesssim+%26%23124%3BF%26%23124%3B%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;lesssim &#124;F&#124;^m' title='&#124;E&#124; &#92;lesssim &#124;F&#124;^m' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BT%28E%29%26%23124%3B+%5Csim+%26%23124%3BE%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;T(E)&#124; &#92;sim &#124;E&#124;' title='&#124;T(E)&#124; &#92;sim &#124;E&#124;' class='latex' /> with high probability.</p></blockquote>
<p><strong>Proof. </strong>Consider the number of &#8220;collisions&#8221;,</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=X+%3A%3D+%26%23124%3B+%5C%7B+%28x%2Cy%29+%5Cin+E%3A+T%28x%29+%3D+T%28y%29+%5C%7D+%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X := &#124; &#92;{ (x,y) &#92;in E: T(x) = T(y) &#92;} &#124;' title='X := &#124; &#92;{ (x,y) &#92;in E: T(x) = T(y) &#92;} &#124;' class='latex' />.</p>
<p>On the one hand, each pair <img src='http://s0.wp.com/latex.php?latex=%28x%2Cy%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x,y)' title='(x,y)' class='latex' /> has a probability of <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BF%26%23124%3B%5E%7B-m%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;F&#124;^{-m}' title='&#124;F&#124;^{-m}' class='latex' /> of contributing a collision if x and y are distinct, and a probability 1 of contributing a collision if they are equal.  Thus</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D+X+%5Csim+%26%23124%3BE%26%23124%3B+%2B+%26%23124%3BF%26%23124%3B%5E%7B-m%7D+%26%23124%3BE%26%23124%3B%5E2+%5Csim+%26%23124%3BE%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E} X &#92;sim &#124;E&#124; + &#124;F&#124;^{-m} &#124;E&#124;^2 &#92;sim &#124;E&#124;' title='{&#92;Bbb E} X &#92;sim &#124;E&#124; + &#124;F&#124;^{-m} &#124;E&#124;^2 &#92;sim &#124;E&#124;' class='latex' />.</p>
<p>From <a href="http://en.wikipedia.org/wiki/Markov%27s_inequality">Markov&#8217;s inequality</a> we conclude that <img src='http://s0.wp.com/latex.php?latex=X+%3D+O%28%26%23124%3BE%26%23124%3B%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X = O(&#124;E&#124;)' title='X = O(&#124;E&#124;)' class='latex' /> with high probability.</p>
<p>On the other hand, we can write <img src='http://s0.wp.com/latex.php?latex=X+%3A%3D+%5Csum_%7Bz+%5Cin+T%28E%29%7D+%26%23124%3BT%5E%7B-1%7D%28z%29+%5Ccap+E%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X := &#92;sum_{z &#92;in T(E)} &#124;T^{-1}(z) &#92;cap E&#124;^2' title='X := &#92;sum_{z &#92;in T(E)} &#124;T^{-1}(z) &#92;cap E&#124;^2' class='latex' />.  Since <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%3D+%5Csum_%7Bz+%5Cin+T%28E%29%7D+%26%23124%3BT%5E%7B-1%7D%28z%29+%5Ccap+E%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; = &#92;sum_{z &#92;in T(E)} &#124;T^{-1}(z) &#92;cap E&#124;' title='&#124;E&#124; = &#92;sum_{z &#92;in T(E)} &#124;T^{-1}(z) &#92;cap E&#124;' class='latex' />, we conclude from the Cauchy-Schwarz inequality that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%26%23124%3BX%26%23124%3B+%5Cgeq+%26%23124%3BE%26%23124%3B%5E2+%2F+%26%23124%3BT%28E%29%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;X&#124; &#92;geq &#124;E&#124;^2 / &#124;T(E)&#124;' title='&#124;X&#124; &#92;geq &#124;E&#124;^2 / &#124;T(E)&#124;' class='latex' />,</p>
<p>and the claim follows. <img src='http://s0.wp.com/latex.php?latex=%5CBox&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box' title='&#92;Box' class='latex' /></p>
<p><strong>Remark. </strong> While T is &#8220;mostly injective&#8221; once <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Clesssim+%26%23124%3BF%26%23124%3B%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;lesssim &#124;F&#124;^m' title='&#124;E&#124; &#92;lesssim &#124;F&#124;^m' class='latex' /> (in the sense that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BT%28E%29%26%23124%3B+%5Csim+%26%23124%3BE%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;T(E)&#124; &#92;sim &#124;E&#124;' title='&#124;T(E)&#124; &#92;sim &#124;E&#124;' class='latex' />), T will not be <em>completely</em> injective with high probability until <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Clesssim+%26%23124%3BF%26%23124%3B%5E%7Bm%2F2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;lesssim &#124;F&#124;^{m/2}' title='&#124;E&#124; &#92;lesssim &#124;F&#124;^{m/2}' class='latex' />, thanks to the <a href="http://en.wikipedia.org/wiki/Birthday_paradox">birthday paradox</a>. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p>Here is a typical way in which one can hope to use this trick for Kakeya-type problems.  A simple modification of Dvir&#8217;s original argument (as done in this <a class="snap_noshots" href="http://arxiv.org/abs/0803.3525">paper of Li</a>) gives</p>
<blockquote><p><strong>Proposition 3.</strong> (Nikodym-type estimate)  Let E be a subset of <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> of size <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Csim+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;sim &#124;F&#124;^n' title='&#124;E&#124; &#92;sim &#124;F&#124;^n' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G' title='G' class='latex' /> be another set in <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> such that for every point x in E, there exists a line <img src='http://s0.wp.com/latex.php?latex=%5Cell_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell_x' title='&#92;ell_x' class='latex' /> passing through x such that G contains <img src='http://s0.wp.com/latex.php?latex=%5Cell_x+%5Cbackslash+%5C%7Bx%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell_x &#92;backslash &#92;{x&#92;}' title='&#92;ell_x &#92;backslash &#92;{x&#92;}' class='latex' />.  Then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BG%26%23124%3B+%5Csim+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;G&#124; &#92;sim &#124;F&#124;^n' title='&#124;G&#124; &#92;sim &#124;F&#124;^n' class='latex' />.</p></blockquote>
<p>The random rotations trick (or more precisely, the random translations trick) from the preceding section lets us extend this result to smaller values of &#124;E&#124; (shrinking &#124;G&#124; proportionally).  But here I wish to discuss a different extension, using the random projections trick instead:</p>
<blockquote><p><strong>Proposition 3.</strong> (Nikodym-type estimate)  Let <img src='http://s0.wp.com/latex.php?latex=N+%5Cgeq+n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N &#92;geq n' title='N &#92;geq n' class='latex' />.  Let E be a subset of <img src='http://s0.wp.com/latex.php?latex=F%5EN&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^N' title='F^N' class='latex' /> of size <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Csim+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;sim &#124;F&#124;^n' title='&#124;E&#124; &#92;sim &#124;F&#124;^n' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G' title='G' class='latex' /> be another set in <img src='http://s0.wp.com/latex.php?latex=F%5EN&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^N' title='F^N' class='latex' /> such that for every point x in E, there exists a line <img src='http://s0.wp.com/latex.php?latex=%5Cell_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell_x' title='&#92;ell_x' class='latex' /> passing through x such that G contains <img src='http://s0.wp.com/latex.php?latex=%5Cell_x+%5Cbackslash+%5C%7Bx%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell_x &#92;backslash &#92;{x&#92;}' title='&#92;ell_x &#92;backslash &#92;{x&#92;}' class='latex' />.  Then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BG%26%23124%3B+%5Cgtrsim+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;G&#124; &#92;gtrsim &#124;F&#124;^n' title='&#124;G&#124; &#92;gtrsim &#124;F&#124;^n' class='latex' />.</p></blockquote>
<p><strong>Proof. </strong>By  Lemma 1, we can find a linear map <img src='http://s0.wp.com/latex.php?latex=T%3A+F%5EN+%5Cto+F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T: F^N &#92;to F^n' title='T: F^N &#92;to F^n' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BT%28E%29%26%23124%3B+%5Csim+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;T(E)&#124; &#92;sim &#124;F&#124;^n' title='&#124;T(E)&#124; &#92;sim &#124;F&#124;^n' class='latex' />.  Then observe that T(G) obeys the same hypotheses as G, but with E replaced by T(E).  By Proposition 2, <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BT%28G%29%26%23124%3B+%5Csim+%26%23124%3BF%26%23124%3B%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;T(G)&#124; &#92;sim &#124;F&#124;^n' title='&#124;T(G)&#124; &#92;sim &#124;F&#124;^n' class='latex' />, and the claim follows. <img src='http://s0.wp.com/latex.php?latex=%5CBox&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box' title='&#92;Box' class='latex' /></p>
<p>&#8211;</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[DHJ(3): 900-999 (Density Hales-Jewett type numbers)]]></title>
<link>http://terrytao.wordpress.com/2009/03/04/dhj3-900-999-density-hales-jewett-type-numbers/</link>
<pubDate>Wed, 04 Mar 2009 21:27:13 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/03/04/dhj3-900-999-density-hales-jewett-type-numbers/</guid>
<description><![CDATA[This is a continuation of the 700-799 thread of the polymath1 project, which is now full.  During th]]></description>
<content:encoded><![CDATA[<p>This is a continuation of the <a href="http://terrytao.wordpress.com/2009/02/13/bounds-for-the-first-few-density-hales-jewett-numbers-and-related-quantities/">700-799 thread</a> of the <a href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1</a> project, which is now full.  During the course of that thread, we have made significant progress on the three problems being focused on:</p>
<p><strong>1.  Upper and lower bounds for <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> for small n.</strong></p>
<p>Let <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> be the largest size of a set in <img src='http://s0.wp.com/latex.php?latex=%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[3]^n' title='[3]^n' class='latex' /> without a <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Line">combinatorial line</a>.   We now have <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Upper_and_lower_bounds">both human and computer-assisted proofs</a> of the first few values of this sequence:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_0%3D1%3B+c_1%3D2%3B+c_2%3D6%3B+c_3%3D18%3B+c_4%3D52%3B+c_5+%3D+150%3B+c_6+%3D+450&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_0=1; c_1=2; c_2=6; c_3=18; c_4=52; c_5 = 150; c_6 = 450' title='c_0=1; c_1=2; c_2=6; c_3=18; c_4=52; c_5 = 150; c_6 = 450' class='latex' />.</p>
<p>The current best-known bounds for <img src='http://s0.wp.com/latex.php?latex=c_7&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_7' title='c_7' class='latex' /> are <img src='http://s0.wp.com/latex.php?latex=1302+%5Cleq+c_7+%5Cleq+1348&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1302 &#92;leq c_7 &#92;leq 1348' title='1302 &#92;leq c_7 &#92;leq 1348' class='latex' />.  Given the gap involved here, and the rate at which the complexity of the problem has increased with n, it seems unlikely that we will be able to compute <img src='http://s0.wp.com/latex.php?latex=c_7&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_7' title='c_7' class='latex' /> exactly any time soon, but it is possible that some improvement can still be made here.</p>
<p><strong>2.  A hyper-optimistic conjecture</strong></p>
<p>Consider a variant of the above problem in which each element of <img src='http://s0.wp.com/latex.php?latex=%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[3]^n' title='[3]^n' class='latex' /> with a 1s, b 2s, and c 3s is weighted by the factor <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7Ba%21+b%21+c%21%7D%7Bn%21%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{a! b! c!}{n!}' title='&#92;frac{a! b! c!}{n!}' class='latex' />; this gives <img src='http://s0.wp.com/latex.php?latex=%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[3]^n' title='[3]^n' class='latex' /> a total weight of <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B%28n%2B1%29%28n%2B2%29%7D%7B2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{(n+1)(n+2)}{2}' title='&#92;frac{(n+1)(n+2)}{2}' class='latex' />.  Let <img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_n' title='c^&#92;mu_n' class='latex' /> be the largest weight of a line-free set of <img src='http://s0.wp.com/latex.php?latex=%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[3]^n' title='[3]^n' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7Bc%7D%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;overline{c}^&#92;mu_n' title='&#92;overline{c}^&#92;mu_n' class='latex' /> be the largest size of a subset of</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5CDelta_n+%3A%3D+%5C%7B+%28a%2Cb%2Cc%29+%5Cin+%7B%5CBbb+Z%7D_%2B%5E3%3A+a%2Bb%2Bc%3Dn+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta_n := &#92;{ (a,b,c) &#92;in {&#92;Bbb Z}_+^3: a+b+c=n &#92;}' title='&#92;Delta_n := &#92;{ (a,b,c) &#92;in {&#92;Bbb Z}_+^3: a+b+c=n &#92;}' class='latex' /></p>
<p>which contains no upward-pointing equilateral triangles <img src='http://s0.wp.com/latex.php?latex=%28a%2Br%2Cb%2Cc%29%2C+%28a%2Cb%2Br%2Cc%29%2C+%28a%2Cb%2Cc%2Br%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(a+r,b,c), (a,b+r,c), (a,b,c+r)' title='(a+r,b,c), (a,b+r,c), (a,b,c+r)' class='latex' /> with r&#62;0.  It is known that <img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_n+%5Cgeq+%5Coverline%7Bc%7D%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_n &#92;geq &#92;overline{c}^&#92;mu_n' title='c^&#92;mu_n &#92;geq &#92;overline{c}^&#92;mu_n' class='latex' />; the “<a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Hyper-optimistic_conjecture">hyper-optimistic conjecture</a>” is that one in fact has <img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_n+%3D+%5Coverline%7Bc%7D%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_n = &#92;overline{c}^&#92;mu_n' title='c^&#92;mu_n = &#92;overline{c}^&#92;mu_n' class='latex' />.  This would imply density Hales-Jewett for k=3.</p>
<p>Currently, the conjecture is verified for <img src='http://s0.wp.com/latex.php?latex=n+%5Cleq+5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;leq 5' title='n &#92;leq 5' class='latex' />, where the values of <img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_n+%3D+%5Coverline%7Bc%7D%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_n = &#92;overline{c}^&#92;mu_n' title='c^&#92;mu_n = &#92;overline{c}^&#92;mu_n' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=n%3D0%2C1%2C2%2C3%2C4%2C5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n=0,1,2,3,4,5' title='n=0,1,2,3,4,5' class='latex' /> are 1,2,4,6,9,12 respectively; see <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Fujimura%27s_problem">this page</a> and <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Hyper-optimistic_conjecture">this page</a> for details.  It seems feasible to handle <img src='http://s0.wp.com/latex.php?latex=n%3D6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n=6' title='n=6' class='latex' />.  Currently we know that <img src='http://s0.wp.com/latex.php?latex=15+%5Cleq+%5Coverline%7Bc%7D%5E%5Cmu_6+%5Cleq+17&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='15 &#92;leq &#92;overline{c}^&#92;mu_6 &#92;leq 17' title='15 &#92;leq &#92;overline{c}^&#92;mu_6 &#92;leq 17' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7Bc%7D%5E%5Cmu_6+%5Cleq+c%5E%5Cmu_6+%5Cleq+28&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;overline{c}^&#92;mu_6 &#92;leq c^&#92;mu_6 &#92;leq 28' title='&#92;overline{c}^&#92;mu_6 &#92;leq c^&#92;mu_6 &#92;leq 28' class='latex' />.</p>
<p style="text-align:left;"><strong>3.  Asymptotics for Moser’s cube problem</strong></p>
<p style="text-align:left;"><a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Moser%27s_cube_problem">Moser’s cube problem</a> asks to compute the largest size <img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' /> of a subset of the cube <img src='http://s0.wp.com/latex.php?latex=%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='[3]^n' title='[3]^n' class='latex' /> without geometric lines.  The first few values of <img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' /> are known:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c%27_0%3D1%3B+c%27_1+%3D+2%3B+c%27_2+%3D+6%3B+c%27_3+%3D+16%3B+c%27_4+%3D+43&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_0=1; c&#039;_1 = 2; c&#039;_2 = 6; c&#039;_3 = 16; c&#039;_4 = 43' title='c&#039;_0=1; c&#039;_1 = 2; c&#039;_2 = 6; c&#039;_3 = 16; c&#039;_4 = 43' class='latex' />.</p>
<p>The best asymptotic lower bound known is still of the order of <img src='http://s0.wp.com/latex.php?latex=3%5En%2F%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='3^n/&#92;sqrt{n}' title='3^n/&#92;sqrt{n}' class='latex' />.  Improving this bound seems related to the well-known problem of improving the bounds in Behrend&#8217;s construction of an AP-3 free set of integers.</p>
<p>We are quite close now to pinning down <img src='http://s0.wp.com/latex.php?latex=c%27_5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_5' title='c&#039;_5' class='latex' />; we know that it is equal to either 124 or 125, and it is looking increasingly unlikely that it is 125.</p>
<p>Comments on this thread should start at 900.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[A sharp inverse Littlewood-Offord theorem]]></title>
<link>http://terrytao.wordpress.com/2009/02/16/a-sharp-inverse-littlewood-offord-theorem/</link>
<pubDate>Mon, 16 Feb 2009 17:46:13 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/02/16/a-sharp-inverse-littlewood-offord-theorem/</guid>
<description><![CDATA[Van Vu and I have just uploaded to the arXiv our preprint &#8220;A sharp inverse Littlewood-Offord t]]></description>
<content:encoded><![CDATA[<p><a class="snap_noshots" href="http://www.math.rutgers.edu/~vanvu/">Van Vu</a> and I have just uploaded to the <a class="snap_noshots" href="http://www.arxiv.org">arXiv</a> our preprint &#8220;<a class="snap_noshots" href="http://arxiv.org/abs/0902.2357">A sharp inverse Littlewood-Offord theorem</a>&#8220;, which we have submitted to <a class="snap_noshots" href="http://www3.interscience.wiley.com/journal/38107/home">Random Structures and Algorithms</a>.  This paper gives a solution to the (inverse) Littlewood-Offord problem of understanding when random walks are concentrated in the case when the concentration is of polynomial size in the length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n' title='n' class='latex' /> of the walk; our description is sharp up to epsilon powers of <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n' title='n' class='latex' />.  The theory of inverse Littlewood-Offord problems and related topics has been of importance in recent developments in the spectral theory of discrete random matrices (e.g. a &#8220;robust&#8221; variant of these theorems was crucial in <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/08/02/random-matrices-universality-of-esds-and-the-circular-law/">our work on the circular law</a>).</p>
<p>For simplicity I will restrict attention to the Bernoulli random walk.  Given <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n' title='n' class='latex' /> real numbers <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />, one can form the random variable</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=S+%3A%3D+%5Cepsilon_1+v_1+%2B+%5Cldots+%2B+%5Cepsilon_n+v_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S := &#92;epsilon_1 v_1 + &#92;ldots + &#92;epsilon_n v_n' title='S := &#92;epsilon_1 v_1 + &#92;ldots + &#92;epsilon_n v_n' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon_1%2C%5Cldots%2C%5Cepsilon_n+%5Cin+%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;epsilon_1,&#92;ldots,&#92;epsilon_n &#92;in &#92;{-1,+1&#92;}' title='&#92;epsilon_1,&#92;ldots,&#92;epsilon_n &#92;in &#92;{-1,+1&#92;}' class='latex' /> are iid random signs (with either sign +1, -1 chosen with probability 1/2).  This is a discrete random variable which typically takes <img src='http://s0.wp.com/latex.php?latex=2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^n' title='2^n' class='latex' /> values.  However, if there are various arithmetic relations between the step sizes <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />, then many of the <img src='http://s0.wp.com/latex.php?latex=2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^n' title='2^n' class='latex' /> possible sums collide, and certain values may then arise with much higher probability.  To measure this, define the <em>concentration probability</em> <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n)' title='p(v_1,&#92;ldots,v_n)' class='latex' /> by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29+%3D+%5Csup_x+%7B%5CBbb+P%7D%28S%3Dx%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n) = &#92;sup_x {&#92;Bbb P}(S=x)' title='p(v_1,&#92;ldots,v_n) = &#92;sup_x {&#92;Bbb P}(S=x)' class='latex' />.</p>
<p>Intuitively, this probability measures the amount of additive structure present between the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />.  There are two (opposing) problems in the subject:</p>
<ul>
<li>(Forward Littlewood-Offord problem) Given some structural assumptions on <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />, what bounds can one place on <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n)' title='p(v_1,&#92;ldots,v_n)' class='latex' />?</li>
<li>(Inverse Littlewood-Offord problem) Given some bounds on <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n)' title='p(v_1,&#92;ldots,v_n)' class='latex' />, what structural assumptions can one then conclude about <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />?</li>
</ul>
<p>Ideally one would like answers to both of these problems which come close to inverting each other, and this is the guiding motivation for our paper.</p>
<p><!--more-->One of the first forward Littlewood-Offord theorems was <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=14608">by Erdős</a>, who showed</p>
<blockquote><p><strong>Theorem 1. </strong> If at least k of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> are non-zero, then <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29+%5Cleq+O%28k%5E%7B-1%2F2%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n) &#92;leq O(k^{-1/2})' title='p(v_1,&#92;ldots,v_n) &#92;leq O(k^{-1/2})' class='latex' />.</p></blockquote>
<p>In fact the sharp bound was computed by Erdős as $\binom{k}{\lfloor k/2\rfloor}/2^k$; the proof relies, incidentally, on <a href="http://en.wikipedia.org/wiki/Sperner%27s_theorem">Sperner&#8217;s theorem</a>, which is of relevance to the ongoing <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1 project</a>.  (An earlier result of Littlewood and Offord gave a weaker bound of <img src='http://s0.wp.com/latex.php?latex=O%28+k%5E%7B-1%2F2%7D+%5Clog+k+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O( k^{-1/2} &#92;log k )' title='O( k^{-1/2} &#92;log k )' class='latex' />.)  Taking contrapositives, we obtain an inverse Littlewood-Offord theorem:</p>
<blockquote><p><strong>Corollary 1. </strong> If <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29+%5Cgeq+p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n) &#92;geq p' title='p(v_1,&#92;ldots,v_n) &#92;geq p' class='latex' />, then at most <img src='http://s0.wp.com/latex.php?latex=O%281%2Fp%5E2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(1/p^2)' title='O(1/p^2)' class='latex' /> of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> are non-zero.</p></blockquote>
<p>The bound is sharp in the sense that it is attained in the case when all the <img src='http://s0.wp.com/latex.php?latex=v_i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_i' title='v_i' class='latex' /> are equal.  However, the theorem is far from sharp in other cases; if k of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> are non-zero, then <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n)' title='p(v_1,&#92;ldots,v_n)' class='latex' /> can be as small as <img src='http://s0.wp.com/latex.php?latex=2%5E%7B-k%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^{-k}' title='2^{-k}' class='latex' />.</p>
<p>Another forward Littlewood-Offord theorem in a similar spirit is <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=182619">by Särkőzy and Szemerédi</a>, who showed</p>
<blockquote><p><strong>Theorem 2. </strong> If at least k of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> are distinct, then <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29+%5Cleq+O%28k%5E%7B-3%2F2%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n) &#92;leq O(k^{-3/2})' title='p(v_1,&#92;ldots,v_n) &#92;leq O(k^{-3/2})' class='latex' />.</p></blockquote>
<p>(The slightly weaker bound of <img src='http://s0.wp.com/latex.php?latex=O%28+k%5E%7B-3%2F2%7D+%5Clog+k+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O( k^{-3/2} &#92;log k )' title='O( k^{-3/2} &#92;log k )' class='latex' /> was obtained <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=174539">by Erdős and Moser</a>.)  Again, it has a contrapositive:</p>
<blockquote><p><strong>Corollary 2. </strong>If <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29+%5Cgeq+p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n) &#92;geq p' title='p(v_1,&#92;ldots,v_n) &#92;geq p' class='latex' />, then the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> take on at most <img src='http://s0.wp.com/latex.php?latex=O%28+p%5E%7B-2%2F3%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O( p^{-2/3} )' title='O( p^{-2/3} )' class='latex' /> distinct values.</p></blockquote>
<p>Again, Theorem 2 is optimal in the sense that the bound is attained when <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> lie in an arithmetic progression (e.g. <img src='http://s0.wp.com/latex.php?latex=v_i+%3D+i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_i = i' title='v_i = i' class='latex' />), but is not always sharp otherwise.  There are many further forward and inverse Littlewood-Offord results; see our paper for a discussion of some of these.</p>
<p>In recent years it has become clearer that the concentration probability is connected to the extent to which the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> lie in a <em>generalised arithmetic progression</em> (GAP)</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=P+%3D+%5C%7B+n_1+w_1+%2B+%5Cldots+%2B+n_d+w_d%3A+-N_i+%5Cleq+n_i+%5Cleq+N_i+%5Chbox%7B+for+%7D+i%3D1%2C%5Cldots%2Cd+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P = &#92;{ n_1 w_1 + &#92;ldots + n_d w_d: -N_i &#92;leq n_i &#92;leq N_i &#92;hbox{ for } i=1,&#92;ldots,d &#92;}' title='P = &#92;{ n_1 w_1 + &#92;ldots + n_d w_d: -N_i &#92;leq n_i &#92;leq N_i &#92;hbox{ for } i=1,&#92;ldots,d &#92;}' class='latex' />;</p>
<p>the quantity d is known as the <em>rank</em> of the GAP.  For instance, an easy computation gives</p>
<blockquote><p><strong>Theorem 3.</strong> (Forward Littlewood-Offord)  If <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> lie in a GAP P of rank d, then</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=p%28n_1%2C%5Cldots%2Cn_d%29+%5Cll_d+n%5E%7B-d%2F2%7D+%26%23124%3BP%26%23124%3B%5E%7B-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(n_1,&#92;ldots,n_d) &#92;ll_d n^{-d/2} &#124;P&#124;^{-1}' title='p(n_1,&#92;ldots,n_d) &#92;ll_d n^{-d/2} &#124;P&#124;^{-1}' class='latex' />.</p>
</blockquote>
<p>Intuitively, a random sum of n vectors in P should lie in a dilate <img src='http://s0.wp.com/latex.php?latex=O%28%5Csqrt%7Bn%7D%29+P&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(&#92;sqrt{n}) P' title='O(&#92;sqrt{n}) P' class='latex' /> of P, which has size about <img src='http://s0.wp.com/latex.php?latex=n%5E%7Bd%2F2%7D+%26%23124%3BP%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^{d/2} &#124;P&#124;' title='n^{d/2} &#124;P&#124;' class='latex' />, which leads to the stated concentration result.</p>
<p>One of our main results is a sort of converse to the above claim, in the regime where the concentration probability is of polynomial size:</p>
<blockquote><p><strong>Theorem 4. </strong>(Inverse Littlewood-Offord)  If <img src='http://s0.wp.com/latex.php?latex=p%28v_1%2C%5Cldots%2Cv_n%29+%5Cgeq+n%5E%7B-A%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(v_1,&#92;ldots,v_n) &#92;geq n^{-A}' title='p(v_1,&#92;ldots,v_n) &#92;geq n^{-A}' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon &gt; 0' title='&#92;varepsilon &gt; 0' class='latex' />, then there exists a GAP P of rank d at most 2A that contains all but <img src='http://s0.wp.com/latex.php?latex=O_%7BA%2C%5Cvarepsilon%7D%28n%5E%7B1-%5Cvarepsilon%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O_{A,&#92;varepsilon}(n^{1-&#92;varepsilon})' title='O_{A,&#92;varepsilon}(n^{1-&#92;varepsilon})' class='latex' /> of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' />, and such that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=p%28n_1%2C%5Cldots%2Cn_d%29+%5Cll_%7BA%2C%5Cvarepsilon%7D+n%5E%7B-d%2F2%2B%5Cvarepsilon%7D+%26%23124%3BP%26%23124%3B%5E%7B-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p(n_1,&#92;ldots,n_d) &#92;ll_{A,&#92;varepsilon} n^{-d/2+&#92;varepsilon} &#124;P&#124;^{-1}' title='p(n_1,&#92;ldots,n_d) &#92;ll_{A,&#92;varepsilon} n^{-d/2+&#92;varepsilon} &#124;P&#124;^{-1}' class='latex' />.</p>
</blockquote>
<p>Thus, Theorem 3 is sharp up to epsilon losses, and up to throwing away a small number of vectors.  One can use this theorem to deduce several earlier theorems, such as Theorem 1 and Theorem 2, except for some epsilon losses.  (It is certainly of interest to try to see how one can remove these epsilon losses; I know this problem is currently being looked at.)</p>
<p><a class="snap_noshots" href="http://arxiv.org/abs/math.PR/0511215">In an earlier paper</a> we had proven a weaker version of Theorem 4, in which the rank d was only bounded by <img src='http://s0.wp.com/latex.php?latex=O_A%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O_A(1)' title='O_A(1)' class='latex' /> instead of the optimal value of 2A, and the nearly-sharp <img src='http://s0.wp.com/latex.php?latex=n%5E%7B-d%2F2%2B%5Cvarepsilon%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^{-d/2+&#92;varepsilon}' title='n^{-d/2+&#92;varepsilon}' class='latex' /> factor was replaced by <img src='http://s0.wp.com/latex.php?latex=n%5E%7B-O_A%281%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^{-O_A(1)}' title='n^{-O_A(1)}' class='latex' />.</p>
<p>Our methods are purely combinatorial, based on &#8220;growing&#8221; the GAP P by a greedy algorithm.  Roughly speaking, the idea is as follows:</p>
<ol>
<li>Start with the trivial progression <img src='http://s0.wp.com/latex.php?latex=Q+%3D+%5C%7B0%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q = &#92;{0&#92;}' title='Q = &#92;{0&#92;}' class='latex' />.  Also, pick an integer k a little bit smaller than <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' />.</li>
<li>Call an element x of Q &#8220;bad&#8221; if adding the progression <img src='http://s0.wp.com/latex.php?latex=%5C%7B+-kx%2C+%5Cldots%2C+-x%2C0%2Cx%2C%5Cldots%2C+kx%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ -kx, &#92;ldots, -x,0,x,&#92;ldots, kx&#92;}' title='&#92;{ -kx, &#92;ldots, -x,0,x,&#92;ldots, kx&#92;}' class='latex' /> to Q significantly increases the size of Q, and &#8220;good&#8221; otherwise.</li>
<li>If only a few of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> are bad, STOP.</li>
<li>Otherwise, if there are a lot of bad <img src='http://s0.wp.com/latex.php?latex=v_i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_i' title='v_i' class='latex' />, there is a way to use Hölder&#8217;s inequality to find a bad x with the property that S is much more likely to fall into a translate of <img src='http://s0.wp.com/latex.php?latex=Q+%2B+%5C%7B-kx%2C%5Cldots%2Ckx%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q + &#92;{-kx,&#92;ldots,kx&#92;}' title='Q + &#92;{-kx,&#92;ldots,kx&#92;}' class='latex' /> than it is to Q.    Replace Q with this larger GAP and return to step 2.</li>
</ol>
<p>This algorithm turns out to terminate in a bounded number of steps if the concentration probability of S was initially of polynomial size, and will trap most of the <img src='http://s0.wp.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1,&#92;ldots,v_n' title='v_1,&#92;ldots,v_n' class='latex' /> in the set of good points relating to Q, which turns out to essentially be a GAP P of size about <img src='http://s0.wp.com/latex.php?latex=n%5E%7Bd%2F2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^{d/2}' title='n^{d/2}' class='latex' /> times bigger than that of Q, where d is the rank of Q.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Bounds for the first few density Hales-Jewett numbers, and related quantities]]></title>
<link>http://terrytao.wordpress.com/2009/02/13/bounds-for-the-first-few-density-hales-jewett-numbers-and-related-quantities/</link>
<pubDate>Sat, 14 Feb 2009 04:13:05 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/02/13/bounds-for-the-first-few-density-hales-jewett-numbers-and-related-quantities/</guid>
<description><![CDATA[This thread is a continuation of the previous thread here on the polymath1 project.  Currently, acti]]></description>
<content:encoded><![CDATA[<p>This thread is a continuation of the <a class="snap_noshots" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/">previous thread here</a> on the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1 project</a>.  Currently, activity is focusing on the following problems:</p>
<p><strong>1.  Upper and lower bounds for <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> for small n.</strong></p>
<p>Let <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> be the largest size of a set in <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> without a <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Line">combinatorial line</a>.  Thanks to efforts from previous threads, we have the first five values of <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' />:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_0%3D1%3B+c_1%3D2%3B+c_2%3D6%3B+c_3%3D18%3B+c_4%3D52&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_0=1; c_1=2; c_2=6; c_3=18; c_4=52' title='c_0=1; c_1=2; c_2=6; c_3=18; c_4=52' class='latex' />.</p>
<p style="text-align:left;">We also know that <img src='http://s0.wp.com/latex.php?latex=150+%5Cleq+c_5+%5Cleq+154&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='150 &#92;leq c_5 &#92;leq 154' title='150 &#92;leq c_5 &#92;leq 154' class='latex' />, and are working to narrow this further.  The arguments justifying these bounds can be <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Upper_and_lower_bounds">found here</a>.  The latest bounds for the next few values of <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> can be <a class="snap_noshots" href="http://spreadsheets.google.com/ccc?key=p5T0SktZY9DsU-uZ1tK7VEg">found here</a>.</p>
<p style="text-align:left;"><strong>2.  A hyper-optimistic conjecture</strong></p>
<p style="text-align:left;">Consider a variant of the above problem in which each element of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> with a 1s, b 2s, and c 3s is weighted by the factor <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7Ba%21+b%21+c%21%7D%7Bn%21%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{a! b! c!}{n!}' title='&#92;frac{a! b! c!}{n!}' class='latex' />; this gives <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> a total weight of <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B%28n%2B1%29%28n%2B2%29%7D%7B2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{(n+1)(n+2)}{2}' title='&#92;frac{(n+1)(n+2)}{2}' class='latex' />.  Let <img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_n' title='c^&#92;mu_n' class='latex' /> be the largest weight of a line-free set of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7Bc%7D%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;overline{c}^&#92;mu_n' title='&#92;overline{c}^&#92;mu_n' class='latex' /> be the largest size of a subset of</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CDelta_n+%3A%3D+%5C%7B+%28a%2Cb%2Cc%29+%5Cin+%7B%5CBbb+Z%7D_%2B%5E3%3A+a%2Bb%2Bc%3Dn+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta_n := &#92;{ (a,b,c) &#92;in {&#92;Bbb Z}_+^3: a+b+c=n &#92;}' title='&#92;Delta_n := &#92;{ (a,b,c) &#92;in {&#92;Bbb Z}_+^3: a+b+c=n &#92;}' class='latex' /></p>
<p style="text-align:left;">which contains no upward-pointing equilateral triangles <img src='http://s0.wp.com/latex.php?latex=%28a%2Br%2Cb%2Cc%29%2C+%28a%2Cb%2Br%2Cc%29%2C+%28a%2Cb%2Cc%2Br%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(a+r,b,c), (a,b+r,c), (a,b,c+r)' title='(a+r,b,c), (a,b+r,c), (a,b,c+r)' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=r%26%2362%3B0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='r&gt;0' title='r&gt;0' class='latex' />.  It is known that <img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_n+%5Cgeq+%5Coverline%7Bc%7D%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_n &#92;geq &#92;overline{c}^&#92;mu_n' title='c^&#92;mu_n &#92;geq &#92;overline{c}^&#92;mu_n' class='latex' />; the &#8220;<a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Hyper-optimistic_conjecture">hyper-optimistic conjecture</a>&#8221; is that one in fact has <img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_n+%3D+%5Coverline%7Bc%7D%5E%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_n = &#92;overline{c}^&#92;mu_n' title='c^&#92;mu_n = &#92;overline{c}^&#92;mu_n' class='latex' />.  This would imply density Hales-Jewett for k=3.</p>
<p style="text-align:left;">Currently, we know the following values for these sequences:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c%5E%5Cmu_0+%3D+1%3B+c%5E%5Cmu_1+%3D+2%3B+c%5E%5Cmu_2+%3D+4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c^&#92;mu_0 = 1; c^&#92;mu_1 = 2; c^&#92;mu_2 = 4' title='c^&#92;mu_0 = 1; c^&#92;mu_1 = 2; c^&#92;mu_2 = 4' class='latex' /></p>
<p style="text-align:left;">and</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Coverline%7Bc%7D%5E%5Cmu_0+%3D+1%3B+%5Coverline%7Bc%7D%5E%5Cmu_1+%3D+2%3B+%5Coverline%7Bc%7D%5E%5Cmu_2+%3D+4%3B+%5Coverline%7Bc%7D%5E%5Cmu_3+%3D+6%3B+%5Coverline%7Bc%7D%5E%5Cmu_4+%3D+9%3B+%5Coverline%7Bc%7D%5E%5Cmu_5+%3D+12&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;overline{c}^&#92;mu_0 = 1; &#92;overline{c}^&#92;mu_1 = 2; &#92;overline{c}^&#92;mu_2 = 4; &#92;overline{c}^&#92;mu_3 = 6; &#92;overline{c}^&#92;mu_4 = 9; &#92;overline{c}^&#92;mu_5 = 12' title='&#92;overline{c}^&#92;mu_0 = 1; &#92;overline{c}^&#92;mu_1 = 2; &#92;overline{c}^&#92;mu_2 = 4; &#92;overline{c}^&#92;mu_3 = 6; &#92;overline{c}^&#92;mu_4 = 9; &#92;overline{c}^&#92;mu_5 = 12' class='latex' />.</p>
<p style="text-align:left;">There are also some further upper and lower bounds known; see <a class="snap_noshots" href="http://spreadsheets.google.com/ccc?key=p5T0SktZY9DsU-uZ1tK7VEg">this spreadsheet</a> for the latest bounds, and <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Fujimura's_problem">this page</a> for proofs of the bounds.  The data so far is consistent with the conjecture, but more work would be needed to obtain a more convincing case for it.</p>
<p style="text-align:left;"><strong>3.  Asymptotics for Moser&#8217;s cube problem</strong></p>
<p style="text-align:left;"><a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Moser%27s_cube_problem">Moser&#8217;s cube problem</a> asks to compute the largest size <img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' /> of a subset of the cube <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> without geometric lines.  The first few values of <img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' /> are known:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c%27_0%3D1%3B+c%27_1+%3D+2%3B+c%27_2+%3D+6%3B+c%27_3+%3D+14%3B+c%27_4+%3D+43&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_0=1; c&#039;_1 = 2; c&#039;_2 = 6; c&#039;_3 = 14; c&#039;_4 = 43' title='c&#039;_0=1; c&#039;_1 = 2; c&#039;_2 = 6; c&#039;_3 = 14; c&#039;_4 = 43' class='latex' />.</p>
<p style="text-align:left;">The best asymptotic lower bound known is <img src='http://s0.wp.com/latex.php?latex=c%27_n+%5Cgg+3%5En%2F%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n &#92;gg 3^n/&#92;sqrt{n}' title='c&#039;_n &#92;gg 3^n/&#92;sqrt{n}' class='latex' />.  Given that we have a significantly superior lower bound of <img src='http://s0.wp.com/latex.php?latex=c_n+%5Cgeq+3%5E%7Bn-O%28%5Csqrt%7B%5Clog+n%7D%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n &#92;geq 3^{n-O(&#92;sqrt{&#92;log n})}' title='c_n &#92;geq 3^{n-O(&#92;sqrt{&#92;log n})}' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' />, one might hope to similarly improve the lower bound for Moser&#8217;s problem.  But there seems to be a technical snag, based on the observation that between two slices <img src='http://s0.wp.com/latex.php?latex=%5CGamma_%7Ba%2Cb%2Cc%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_{a,b,c}' title='&#92;Gamma_{a,b,c}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5CGamma_%7Ba%27%2Cb%27%2Cc%27%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_{a&#039;,b&#039;,c&#039;}' title='&#92;Gamma_{a&#039;,b&#039;,c&#039;}' class='latex' /> with the <img src='http://s0.wp.com/latex.php?latex=c-a%3Dc%27-a%27&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c-a=c&#039;-a&#039;' title='c-a=c&#039;-a&#039;' class='latex' />, there are in fact quite a few combinatorial lines connecting them, and so it is not obvious how to create a geometric line-free set that involves more than one slice per value of c-a.</p>
<p style="text-align:left;">It is also of interest to work out what asymptotic lower bound for <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> one can get for larger values of k than 3.</p>
<p style="text-align:left;">Comments on this thread should start at 700.  We will also try to record progress made at the <a class="snap_noshots" href="http://michaelnielsen.org/polymath1/index.php?title=Main_Page">polymath1 wiki</a> and at the <a class="snap_noshots" href="http://spreadsheets.google.com/ccc?key=p5T0SktZY9DsU-uZ1tK7VEg">polymath1 spreadsheet</a>, as appropriate.</p>
<p style="text-align:left;">
<p style="text-align:left;">
]]></content:encoded>
</item>
<item>
<title><![CDATA[Upper and lower bounds for the density Hales-Jewett problem]]></title>
<link>http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/</link>
<pubDate>Fri, 06 Feb 2009 01:05:33 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/</guid>
<description><![CDATA[This is a continuation of several threads from Tim Gowers&#8217; massively collaborative mathematics]]></description>
<content:encoded><![CDATA[<p>This is a continuation of several threads from Tim Gowers&#8217; massively collaborative mathematics project, “<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/">A combinatorial approach to density Hales-Jewett</a>&#8220;, which I discussed in my <a class="snap_noshots" href="http://terrytao.wordpress.com/2009/02/01/a-massively-collaborative-mathematical-project/">previous post</a>.</p>
<p>For any positive integer n, let <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> be the set of strings of length n consisting of 1s, 2s, and 3s, and define a combinatorial line to be a triple of such strings arising by taking a string in <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C3%2Cx%5C%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,2,3,x&#92;}^n' title='&#92;{1,2,3,x&#92;}^n' class='latex' /> with at least one wildcard x, and substituting x=1, x=2, x=3 in that string (e.g. xx1x3 would give the combinatorial line <img src='http://s0.wp.com/latex.php?latex=%5C%7B11113%2C+22123%2C+33133%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{11113, 22123, 33133&#92;}' title='&#92;{11113, 22123, 33133&#92;}' class='latex' />).  Call a set <img src='http://s0.wp.com/latex.php?latex=A+%5Csubset+%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;subset [3]^n' title='A &#92;subset [3]^n' class='latex' /> of strings <em>line-free</em> if it contains no combinatorial lines, and let <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> be the size of the largest set A which is line-free.  We then have</p>
<blockquote><p><strong>Density Hales-Jewett theorem (k=3)</strong>: <img src='http://s0.wp.com/latex.php?latex=c_n+%3D+o%283%5En%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n = o(3^n)' title='c_n = o(3^n)' class='latex' />, i.e. <img src='http://s0.wp.com/latex.php?latex=%5Clim_%7Bn+%5Cto+%5Cinfty%7D+c_n%2F3%5En+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lim_{n &#92;to &#92;infty} c_n/3^n = 0' title='&#92;lim_{n &#92;to &#92;infty} c_n/3^n = 0' class='latex' />.</p></blockquote>
<p>This theorem implies several other important results, most notably Roth&#8217;s theorem on length three progressions (in an arbitrary abelian group!) and also the corners theorem of Ajtai and Szemeredi (again in an arbitrary abelian group).</p>
<p>Here are some of the questions we are initially exploring in this thread (and which I will comment further on below the fold):</p>
<ol>
<li>What are the best upper and lower bounds for <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> for small n?  Currently we know <img src='http://s0.wp.com/latex.php?latex=c_0%3D1%2C+c_1%3D2%2C+c_2%3D6%2C+c_3%3D18%2C+c_4%3D52&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_0=1, c_1=2, c_2=6, c_3=18, c_4=52' title='c_0=1, c_1=2, c_2=6, c_3=18, c_4=52' class='latex' />, with some partial results for higher n.    We also have some relationships between different <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' />.</li>
<li>What do extremal or near-extremal sets (i.e. line-free sets with cardinality close to <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' />) look like?</li>
<li>What are some good constructions of line-free sets that lead to asymptotic lower bounds on <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' />?  The best asymptotic lower bound we have currently is <img src='http://s0.wp.com/latex.php?latex=c_n+%5Cgeq+3%5E%7Bn+-+O%28+%5Csqrt%7B%5Clog+n%7D+%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n &#92;geq 3^{n - O( &#92;sqrt{&#92;log n} )}' title='c_n &#92;geq 3^{n - O( &#92;sqrt{&#92;log n} )}' class='latex' />.</li>
<li>Can these methods extend to other related problems, such as the Moser cube problem or the capset problem?</li>
<li>How feasible is it to extend the existing combinatorial proofs of Roth&#8217;s theorem (or the corners theorem), in particular the arguments of Szemeredi, to the Density Hales-Jewett theorem?</li>
</ol>
<p>But I imagine that further threads might develop in the course of the discussion.</p>
<p>As with the rest of the project, this is supposed to be an open collaboration: please feel free to pose a question or a comment, even if (or especially if) it is just barely non-trivial.  (For more on the &#8220;rules of the game&#8221;, see <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/questions-of-procedure/">this post</a>.)</p>
<p><!--more--></p>
<p style="text-align:center;">&#8211; I. <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> for small n &#8211;</p>
<p>Because the Cartesian product of two line-free sets is line-free, we have a lower bound</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_%7Bn%2Bm%7D+%5Cgeq+c_n+c_m&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{n+m} &#92;geq c_n c_m' title='c_{n+m} &#92;geq c_n c_m' class='latex' /> (1)</p>
<p>for all <img src='http://s0.wp.com/latex.php?latex=n%2Cm+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n,m &#92;geq 1' title='n,m &#92;geq 1' class='latex' />, although this seems to be an inferior bound in practice.</p>
<p>There is a trivial bound</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_%7Bn%2B1%7D+%5Cleq+3+c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{n+1} &#92;leq 3 c_n' title='c_{n+1} &#92;leq 3 c_n' class='latex' /> (2)</p>
<p>which comes from the observation that any line-free subset of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5E%7Bn%2B1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^{n+1}' title='{}[3]^{n+1}' class='latex' /> can be split into three slices, each of which is essentially a line-free subset of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' />.  In particular, since <img src='http://s0.wp.com/latex.php?latex=c_1+%3D+2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_1 = 2' title='c_1 = 2' class='latex' />, we see that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_n+%5Cleq+2+%5Ctimes+3%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n &#92;leq 2 &#92;times 3^{n-1}' title='c_n &#92;leq 2 &#92;times 3^{n-1}' class='latex' /> (3)</p>
<p>for <img src='http://s0.wp.com/latex.php?latex=n+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;geq 1' title='n &#92;geq 1' class='latex' />.</p>
<p>If we identify <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D+%3D+%5C%7B1%2C2%2C3%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3] = &#92;{1,2,3&#92;}' title='{}[3] = &#92;{1,2,3&#92;}' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%2F3%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}/3{&#92;Bbb Z}' title='{&#92;Bbb Z}/3{&#92;Bbb Z}' class='latex' /> and consider the set</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=D_n+%3A%3D+%5C%7B+%28x_1%2C%5Cldots%2Cx_n%29+%5Cin+%7B%5CBbb+Z%7D%2F3%7B%5CBbb+Z%7D%3A+x_1%2B%5Cldots%2Bx_n+%5Cneq+0+%5Chbox%7B+mod+%7D+3+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='D_n := &#92;{ (x_1,&#92;ldots,x_n) &#92;in {&#92;Bbb Z}/3{&#92;Bbb Z}: x_1+&#92;ldots+x_n &#92;neq 0 &#92;hbox{ mod } 3 &#92;}' title='D_n := &#92;{ (x_1,&#92;ldots,x_n) &#92;in {&#92;Bbb Z}/3{&#92;Bbb Z}: x_1+&#92;ldots+x_n &#92;neq 0 &#92;hbox{ mod } 3 &#92;}' class='latex' />, (4)</p>
<p>thus <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BD_n%26%23124%3B+%3D+2+%5Ctimes+3%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;D_n&#124; = 2 &#92;times 3^{n-1}' title='&#124;D_n&#124; = 2 &#92;times 3^{n-1}' class='latex' />.  Observe that a combinatorial line with a,b,c,k 1s, 2s, 3s, and wildcards respectively lies in <img src='http://s0.wp.com/latex.php?latex=D_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='D_n' title='D_n' class='latex' /> if and only if k is a multiple of 3, and a is not equal to b mod 3.  In particular, <img src='http://s0.wp.com/latex.php?latex=D_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='D_n' title='D_n' class='latex' /> is line-free for <img src='http://s0.wp.com/latex.php?latex=n+%3D+1%2C2%2C3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n = 1,2,3' title='n = 1,2,3' class='latex' />, thus matching the upper bound (3) and concluding that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_1+%3D+2%3B+c_2+%3D+6%3B+c_3+%3D+18.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_1 = 2; c_2 = 6; c_3 = 18.' title='c_1 = 2; c_2 = 6; c_3 = 18.' class='latex' /> (5)</p>
<p>For n=4, the only combinatorial lines remaining in <img src='http://s0.wp.com/latex.php?latex=D_4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='D_4' title='D_4' class='latex' /> are <img src='http://s0.wp.com/latex.php?latex=1xxx%2C+2xxx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1xxx, 2xxx' title='1xxx, 2xxx' class='latex' /> and their permutations.  Thus (as observed in <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1776">Neylon.83</a>) the set <img src='http://s0.wp.com/latex.php?latex=D_4+%5Cbackslash+%5C%7B1111%2C+2222%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='D_4 &#92;backslash &#92;{1111, 2222&#92;}' title='D_4 &#92;backslash &#92;{1111, 2222&#92;}' class='latex' /> is line-free, leading to the lower bound <img src='http://s0.wp.com/latex.php?latex=c_4+%5Cgeq+52&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_4 &#92;geq 52' title='c_4 &#92;geq 52' class='latex' />.</p>
<p>This bound was complemented with the lower bound <img src='http://s0.wp.com/latex.php?latex=c_4+%5Cleq+52&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_4 &#92;leq 52' title='c_4 &#92;leq 52' class='latex' /> in <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1790">Jakobsen.90</a>.  Thus <img src='http://s0.wp.com/latex.php?latex=c_4+%3D+52&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_4 = 52' title='c_4 = 52' class='latex' />.</p>
<p>For <img src='http://s0.wp.com/latex.php?latex=c_5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_5' title='c_5' class='latex' />, the best lower bound currently is <img src='http://s0.wp.com/latex.php?latex=c_5+%5Cgeq+145&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_5 &#92;geq 145' title='c_5 &#92;geq 145' class='latex' /> (<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1776">Neylon.83</a>) and <img src='http://s0.wp.com/latex.php?latex=c_5+%5Cleq+3c_4+%3D+156&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_5 &#92;leq 3c_4 = 156' title='c_5 &#92;leq 3c_4 = 156' class='latex' />.</p>
<p>No attempt at getting reasonable bounds for <img src='http://s0.wp.com/latex.php?latex=c_6%2C+c_7&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_6, c_7' title='c_6, c_7' class='latex' />, etc. appears in the previous threads.</p>
<p><strong>Question I.A:</strong> Can we improve the upper and lower bounds on <img src='http://s0.wp.com/latex.php?latex=c_5%2C+c_6%2C+c_7&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_5, c_6, c_7' title='c_5, c_6, c_7' class='latex' />?</p>
<p><strong>Question I.B:</strong> Is there some reasonably efficient way to automate this process?</p>
<p><strong>Question I.C:</strong> What do the extremal line-free sets (sets of size close to <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' />) look like?</p>
<p>At some point we should submit this sequence to the OEIS.</p>
<p style="text-align:center;">&#8211; II. Lower bounds on <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> for large n &#8211;</p>
<p>For any integers a,b,c adding up to n, let <img src='http://s0.wp.com/latex.php?latex=%5CGamma_%7Ba%2Cb%2Cc%7D+%5Csubset+%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_{a,b,c} &#92;subset [3]^n' title='&#92;Gamma_{a,b,c} &#92;subset [3]^n' class='latex' /> be the set of all strings with a 1s, b 2s, and c 3s.  Observe that if <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B' title='B' class='latex' /> is any set of triples (a,b,c) that avoids equilateral triangles <img src='http://s0.wp.com/latex.php?latex=%28a%2Br%2Cb%2Cc%29%2C+%28a%2Cb%2Br%2Cc%29%2C+%28a%2Cb%2Cc%2Br%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(a+r,b,c), (a,b+r,c), (a,b,c+r)' title='(a+r,b,c), (a,b+r,c), (a,b,c+r)' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=r+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='r &gt; 0' title='r &gt; 0' class='latex' />, then the set <img src='http://s0.wp.com/latex.php?latex=%5CGamma_B+%3A%3D+%5Cbigcup_%7B%28a%2Cb%2Cc%29+%5Cin+B%7D+%5CGamma_%7Ba%2Cb%2Cc%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_B := &#92;bigcup_{(a,b,c) &#92;in B} &#92;Gamma_{a,b,c}' title='&#92;Gamma_B := &#92;bigcup_{(a,b,c) &#92;in B} &#92;Gamma_{a,b,c}' class='latex' /> is line-free.  In particular, if we choose B to be the set of all triples <img src='http://s0.wp.com/latex.php?latex=%28a%2Cb%2Cc%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(a,b,c)' title='(a,b,c)' class='latex' /> such that a+2b lies in a set free of length three arithmetic progressions (e.g. a Behrend set), then <img src='http://s0.wp.com/latex.php?latex=%5CGamma_B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_B' title='&#92;Gamma_B' class='latex' /> is line-free.  This seems to give a lower bound</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_n+%5Cgeq+3%5E%7Bn+-+O%28%5Csqrt%7B%5Clog+n%7D%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n &#92;geq 3^{n - O(&#92;sqrt{&#92;log n})}' title='c_n &#92;geq 3^{n - O(&#92;sqrt{&#92;log n})}' class='latex' />. (6)</p>
<p><strong>Question II.A:</strong> Using the <a class="snap_noshots" href="http://arxiv.org/abs/0801.4310">Elkin bound</a> on Behrend sets, what is the precise lower bound one gets? (<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1715">Solymosi.46.5</a> notes that we should restrict the Behrend set to scale <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7Bn%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sqrt{n}' title='&#92;sqrt{n}' class='latex' /> rather than n).</p>
<p><strong>Question II.B:</strong> What does one get for higher k than 3?  Presumably one would use the O&#8217;Bryant bound (<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1842">O&#8217;Bryant.126</a>).</p>
<p><strong>Question II.C:</strong> Do the examples from Section I extend to give competitive lower bounds for large n?</p>
<p><strong>Question II.D:</strong> Are there better ways to avoid triangles or corners in two dimensions than simply lifting up the Behrend example from one dimension?</p>
<p style="text-align:center;">&#8211; III. Miscellanous facts about <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> &#8211;</p>
<p>Using the colouring Hales-Jewett theorem, one can show that for every m, one has <img src='http://s0.wp.com/latex.php?latex=c_n+%26%2360%3B+3%5E%7Bn-m%7D+c_m&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n &lt; 3^{n-m} c_m' title='c_n &lt; 3^{n-m} c_m' class='latex' /> for sufficiently large m, improving (2) very slightly; see (<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1779">Tao.85</a>).</p>
<p><strong>Question III.A:</strong> Anything else we can say about the <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' />?  For instance, is there a relation to Hales-Jewett numbers HJ(3,r) (defined as the first n such that every r-colouring of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> contains a monochromatic combinatorial line?)</p>
<p>Note: as pointed out in <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1713">Gasarch.45.5</a>, <a class="snap_noshots" href="http://www.math.ucsd.edu/~etressle/hj32.pdf">Hindman and Tressler</a> have recently established that <img src='http://s0.wp.com/latex.php?latex=HJ%283%2C2%29%3D4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='HJ(3,2)=4' title='HJ(3,2)=4' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=HJ%283%2C3%29+%26%2362%3B+6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='HJ(3,3) &gt; 6' title='HJ(3,3) &gt; 6' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=HJ%284%2C2%29+%26%2362%3B+6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='HJ(4,2) &gt; 6' title='HJ(4,2) &gt; 6' class='latex' />.</p>
<p style="text-align:center;">&#8211; IV. Related problems &#8211;</p>
<p>Let <img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' /> denote the largest subset of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> which does not contain any geometric line (which is the same as a combinatorial line, but has a second wildcard y which goes from 3 to 1 whilst x goes from 1 to 3, e.g. xx2yy gives the geometric line <img src='http://s0.wp.com/latex.php?latex=11233%2C+22222%2C+33211&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='11233, 22222, 33211' title='11233, 22222, 33211' class='latex' />).  The Moser cube problem is to understand the behaviour of <img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' />.  The first few values are (see <a class="snap_noshots" href="http://www.research.att.com/~njas/sequences/A003142">OEIS A003142</a>):</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c%27_0+%3D+1%3B+c%27_1+%3D+2%3B+c%27_2+%3D+6%3B+c%27_3+%3D+16%3B+c%27_4+%3D+43&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_0 = 1; c&#039;_1 = 2; c&#039;_2 = 6; c&#039;_3 = 16; c&#039;_4 = 43' title='c&#039;_0 = 1; c&#039;_1 = 2; c&#039;_2 = 6; c&#039;_3 = 16; c&#039;_4 = 43' class='latex' />. (7)</p>
<p>It is clear that <img src='http://s0.wp.com/latex.php?latex=c%27_n+%5Cleq+c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n &#92;leq c_n' title='c&#039;_n &#92;leq c_n' class='latex' />.</p>
<p><a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1707">Elsholtz.43</a> proposed modifying the construction of (6) to give a similar lower bound on <img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' />, but a difficulty was found in <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1719">Solymosi.48</a>.  Currently, the best known asymptotic bounds for <img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' /> are</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=3%5En+%2F+%5Csqrt%7Bn%7D+%5Cll+c%27_n+%5Cll+o%283%5En%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='3^n / &#92;sqrt{n} &#92;ll c&#039;_n &#92;ll o(3^n)' title='3^n / &#92;sqrt{n} &#92;ll c&#039;_n &#92;ll o(3^n)' class='latex' /> (8)</p>
<p><strong>Question IV.A</strong>.  Can we improve the lower bound in (8)?</p>
<p><strong>Question IV.B.</strong> Can we get good bounds on <img src='http://s0.wp.com/latex.php?latex=c%27_5%2C+c%27_6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_5, c&#039;_6' title='c&#039;_5, c&#039;_6' class='latex' />?</p>
<p>Define <img src='http://s0.wp.com/latex.php?latex=c%27%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;&#039;_n' title='c&#039;&#039;_n' class='latex' /> to be the largest cardinality of a subset of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> which contains no algebraic lines, defined as a triple <img src='http://s0.wp.com/latex.php?latex=%5C%7B+x%2C+x%2Br%2C+x%2B2r%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ x, x+r, x+2r&#92;}' title='&#92;{ x, x+r, x+2r&#92;}' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=x%2C+r+%5Cin+%28%7B%5CBbb+Z%7D%2F3%7B%5CBbb+Z%7D%29%5En+%5Cequiv+%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x, r &#92;in ({&#92;Bbb Z}/3{&#92;Bbb Z})^n &#92;equiv [3]^n' title='x, r &#92;in ({&#92;Bbb Z}/3{&#92;Bbb Z})^n &#92;equiv [3]^n' class='latex' />; such sets are known as capsets.  Clearly <img src='http://s0.wp.com/latex.php?latex=c%27%27_n+%5Cleq+c%27_n+%5Cleq+c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;&#039;_n &#92;leq c&#039;_n &#92;leq c_n' title='c&#039;&#039;_n &#92;leq c&#039;_n &#92;leq c_n' class='latex' />.  As noted <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/02/23/open-question-best-bounds-for-cap-sets/">my previous blog post</a>, the best asymptotic bounds are</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%282.2174%5Cldots%29%5En+%5Cleq+c%27%27_n+%5Cll+3%5En%2Fn&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(2.2174&#92;ldots)^n &#92;leq c&#039;&#039;_n &#92;ll 3^n/n' title='(2.2174&#92;ldots)^n &#92;leq c&#039;&#039;_n &#92;ll 3^n/n' class='latex' /> (9)</p>
<p><strong>Question IV.C. </strong> Can we improve on these bounds in any way?</p>
<p><strong>Question IV.D.</strong> What are the first few values of <img src='http://s0.wp.com/latex.php?latex=c%27%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;&#039;_n' title='c&#039;&#039;_n' class='latex' />?  Brute force calculation reveals <img src='http://s0.wp.com/latex.php?latex=c%27%27_0+%3D+1%2C+c%27%27_1+%3D+2%2C+c%27%27_2+%3D+4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;&#039;_0 = 1, c&#039;&#039;_1 = 2, c&#039;&#039;_2 = 4' title='c&#039;&#039;_0 = 1, c&#039;&#039;_1 = 2, c&#039;&#039;_2 = 4' class='latex' />.  (Presumably this sequence is also in the OEIS.)</p>
<p>[<em>Update</em>, Feb 7: I will now maintain a running table of the current "world records" for <img src='http://s0.wp.com/latex.php?latex=c_n%2C+c%27_n%2C+c%27%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n, c&#039;_n, c&#039;&#039;_n' title='c_n, c&#039;_n, c&#039;&#039;_n' class='latex' /> for small n, formed by combining all the bounds already mentioned in this post and comments.]</p>
<p>[<em>Update</em>, Feb 8: A more complete (and up-to-date) spreadsheet can now be found <a class="snap_noshots" href="http://spreadsheets.google.com/ccc?key=p5T0SktZY9DsU-uZ1tK7VEg">here</a>.]</p>
<table border="1">
<tbody>
<tr>
<td valign="top">n</td>
<td valign="top"><img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /></td>
<td valign="top"><img src='http://s0.wp.com/latex.php?latex=c%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;_n' title='c&#039;_n' class='latex' /></td>
<td valign="top"><img src='http://s0.wp.com/latex.php?latex=c%27%27_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;&#039;_n' title='c&#039;&#039;_n' class='latex' /></td>
</tr>
<tr>
<td valign="top">0</td>
<td valign="top">1</td>
<td valign="top">1</td>
<td valign="top">1</td>
</tr>
<tr>
<td valign="top">1</td>
<td valign="top">2</td>
<td valign="top">2</td>
<td valign="top">2</td>
</tr>
<tr>
<td valign="top">2</td>
<td valign="top">6</td>
<td valign="top">6</td>
<td valign="top">4</td>
</tr>
<tr>
<td valign="top">3</td>
<td valign="top">18</td>
<td valign="top">16</td>
<td valign="top"><a class="snap_noshot" href="http://www.math.rutgers.edu/~maclagan/papers/set.pdf">9</a></td>
</tr>
<tr>
<td valign="top">4</td>
<td valign="top">52</td>
<td valign="top">43</td>
<td valign="top"><a class="snap_noshot" href="http://www.math.rutgers.edu/~maclagan/papers/set.pdf">20</a></td>
</tr>
<tr>
<td valign="top">5</td>
<td valign="top">[<a class="snap_noshot" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35537">150</a>,<a href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35681">154</a>]</td>
<td valign="top">[<span style="color:#008000;">96</span>,<span style="color:#ff6600;">129</span>]</td>
<td valign="top"><a class="snap_noshot" href="http://www.math.rutgers.edu/~maclagan/papers/set.pdf">45</a></td>
</tr>
<tr>
<td valign="top">6</td>
<td valign="top">[<a class="snap_noshot" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35537">450</a>,<span style="color:#ff6600;">462</span>]</td>
<td valign="top">[<span style="color:#008000;">258</span>,<span style="color:#ff6600;">387</span>]</td>
<td valign="top"><a href="http://www.springerlink.com/content/h003577g11112308/">112</a><a class="snap_noshot" href="http://www.math.rutgers.edu/~maclagan/papers/set.pdf"></a></td>
</tr>
<tr>
<td valign="top">7</td>
<td valign="top">[<a class="snap_noshots" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35585">1302</a>,<span style="color:#ff6600;">1386</span>]</td>
<td valign="top">[<span style="color:#008000;">688</span>,<span style="color:#ff6600;">1161</span>]</td>
<td valign="top">[<a class="snap_noshots" href="http://www.springerlink.com/content/m55136x765161240/">236</a>,<span style="color:#cc99ff;">292</span>]</td>
</tr>
<tr>
<td valign="top">8</td>
<td valign="top">[<a class="snap_noshots" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35585">3780</a>,<span style="color:#ff6600;">4158</span>]</td>
<td valign="top">[<span style="color:#008000;">1849</span>,<span style="color:#ff6600;">3483</span>]</td>
<td valign="top">[<span style="color:#008000;">472</span>,<span style="color:#cc99ff;">773</span>]</td>
</tr>
<tr>
<td valign="top">9</td>
<td valign="top">[<a class="snap_noshot" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35495">11340</a>,<span style="color:#ff6600;">12474</span>]</td>
<td valign="top">[<span style="color:#008000;">4128</span>,<span style="color:#ff6600;">10449</span>]</td>
<td valign="top">[<span style="color:#008000;">1008</span>,<span style="color:#cc99ff;">2075</span>]</td>
</tr>
<tr>
<td valign="top">10</td>
<td valign="top">[<a class="snap_noshots" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35604">32864</a>,<span style="color:#ff6600;">37422</span>]</td>
<td valign="top">[<span style="color:#008000;">11094</span>,<span style="color:#ff6600;">31347</span>]</td>
<td valign="top">[<span style="color:#008000;">2240</span>,<span style="color:#cc99ff;">5632</span>]</td>
</tr>
<tr>
<td valign="top">11</td>
<td valign="top">[<span style="color:#ff6600;">95964</span>,<span style="color:#ff6600;">112266</span>]</td>
<td valign="top">[<span style="color:#008000;">29584</span>,<span style="color:#ff6600;">94041</span>]</td>
<td valign="top">[<span style="color:#008000;">5040</span>,<span style="color:#cc99ff;">15425</span>]</td>
</tr>
<tr>
<td valign="top">12</td>
<td valign="top">[<a class="snap_noshots" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35597">287892</a>,<span style="color:#ff6600;">336798</span>]</td>
<td valign="top">[<span style="color:#008000;">79507</span>,<span style="color:#ff6600;">282123</span>]</td>
<td valign="top">[<span style="color:#008000;">12544</span>,<span style="color:#cc99ff;">42569</span>]</td>
</tr>
<tr>
<td valign="top">13</td>
<td valign="top">[<a class="snap_noshots" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35604">837850</a>,<span style="color:#ff6600;">1010394</span>]</td>
<td valign="top">[<span style="color:#008000;">175504</span>,<span style="color:#ff6600;">846369</span>]</td>
<td valign="top">[<span style="color:#008000;">26432</span>,<span style="color:#cc99ff;">118237</span>]</td>
</tr>
<tr>
<td valign="top">14</td>
<td valign="top">[<span style="color:#ff6600;">2458950</span>,<span style="color:#ff6600;">3031182</span>]</td>
<td valign="top">[<span style="color:#008000;">477042</span>,<span style="color:#ff6600;">2539107</span>]</td>
<td valign="top">[<span style="color:#008000;">52864</span>,<span style="color:#cc99ff;">330222</span>]</td>
</tr>
<tr>
<td valign="top">15</td>
<td valign="top">[<a class="snap_noshots" href="http://terrytao.wordpress.com/2009/02/05/upper-and-lower-bounds-for-the-density-hales-jewett-problem/#comment-35593">7376850</a>,<span style="color:#ff6600;">9093946</span>]</td>
<td valign="top">[<span style="color:#008000;">1272112</span>,<span style="color:#ff6600;">7617321</span>]</td>
<td valign="top">[<span style="color:#008000;">112896</span>,<span style="color:#cc99ff;">926687</span>]</td>
</tr>
<tr>
<td valign="top"><img src='http://s0.wp.com/latex.php?latex=%5Cto+%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;to &#92;infty' title='&#92;to &#92;infty' class='latex' /></td>
<td valign="top"><img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5E%7Bn-O%28%5Csqrt%7B%5Clog+n%7D%29%7D%2C+o%283%5En%29%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3^{n-O(&#92;sqrt{&#92;log n})}, o(3^n)]' title='{}[3^{n-O(&#92;sqrt{&#92;log n})}, o(3^n)]' class='latex' /></td>
<td valign="top"><img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B%5COmega%28%5Cfrac%7B3%5En%7D%7B%5Csqrt%7Bn%7D%7D%29%2C+o%283%5En%29%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[&#92;Omega(&#92;frac{3^n}{&#92;sqrt{n}}), o(3^n)]' title='{}[&#92;Omega(&#92;frac{3^n}{&#92;sqrt{n}}), o(3^n)]' class='latex' /></td>
<td valign="top"><img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B%282.21%5Cldots%29%5En%2C+O%28%5Cfrac%7B3%5En%7D%7Bn%7D%29%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[(2.21&#92;ldots)^n, O(&#92;frac{3^n}{n})]' title='{}[(2.21&#92;ldots)^n, O(&#92;frac{3^n}{n})]' class='latex' /></td>
</tr>
</tbody>
</table>
<ul>
<li><span style="color:#008000;">green</span> &#8211; obtained via the inequality (1)</li>
<li><span style="color:#ff6600;">orange</span> &#8211; obtained via the inequality (2)</li>
<li><span style="color:#cc99ff;">lavender</span> &#8211; uses <a class="snap_noshots" href="http://www.mathi.uni-heidelberg.de/~yves/Papers/ABound.pdf">the inequality</a> <img src='http://s0.wp.com/latex.php?latex=c%27%27_n+%5Cleq+%283c%27%27_%7Bn-1%7D%2B1%29%2F%281+%2B+c%27%27_%7Bn-1%7D%2F3%5E%7Bn-1%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&#039;&#039;_n &#92;leq (3c&#039;&#039;_{n-1}+1)/(1 + c&#039;&#039;_{n-1}/3^{n-1})' title='c&#039;&#039;_n &#92;leq (3c&#039;&#039;_{n-1}+1)/(1 + c&#039;&#039;_{n-1}/3^{n-1})' class='latex' /></li>
</ul>
<p style="text-align:center;">&#8211; V. Szemeredi&#8217;s proof of Roth&#8217;s theorem &#8211;</p>
<p>Szemeredi provided a short proof of Roth&#8217;s theorem by purely combinatorial means, and also famously proved the full case of Szemeredi&#8217;s theorem by much more intricate combinatorial argument.  Both arguments are based on the density increment method.  In the setting of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' />, the density increment method revolves around the critical density <img src='http://s0.wp.com/latex.php?latex=%5Calpha+%3D+%5Clim_%7Bn+%5Cto+%5Cinfty%7D+c_n+%2F+3%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha = &#92;lim_{n &#92;to &#92;infty} c_n / 3^n' title='&#92;alpha = &#92;lim_{n &#92;to &#92;infty} c_n / 3^n' class='latex' />, which exists by (2); the density Hales-Jewett theorem is precisely the statement that <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> vanishes, so suppose for contradiction that <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> is positive.</p>
<p>Then one can find a line-free set <img src='http://s0.wp.com/latex.php?latex=A+%5Csubset+%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;subset [3]^n' title='A &#92;subset [3]^n' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%5Calpha+%2B+o_%7Bn+%5Cto+%5Cinfty%7D%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha + o_{n &#92;to &#92;infty}(1)' title='&#92;alpha + o_{n &#92;to &#92;infty}(1)' class='latex' /> such that A has density at most <img src='http://s0.wp.com/latex.php?latex=%5Calpha+%2B+o_%7Bm+%5Cto+%5Cinfty%7D%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha + o_{m &#92;to &#92;infty}(1)' title='&#92;alpha + o_{m &#92;to &#92;infty}(1)' class='latex' /> for every m-dimensional subspace of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> (and one can complement this lower bound with an upper bound for &#8220;most&#8221; subspaces, see <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1803">Tao.100</a>).  One can also regularise the density on other sets than subspaces, see <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1834">Tao.121</a>).  So, basically, the moment one gets a significant density increment on a non-trivial subspace, one is done (see <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1833">Tao.120</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1835">O&#8217;Donnell.122</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1843">Tao.127</a>).</p>
<p>Szemeredi&#8217;s proof of Roth&#8217;s theorem proceeds by first showing that a dense set A contains a large cube Q, and thus (if it is free of algebraic lines) must completely avoid the set <img src='http://s0.wp.com/latex.php?latex=2.Q+-+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2.Q - A' title='2.Q - A' class='latex' />.  This set contains (most of) lots of parallel subspaces and so squeezes A to have higher density on some other subspace, closing the argument.  (See <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1781">Tao.86</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1831">Tao.118</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1854">Tao.131</a>)</p>
<p><strong>Question V.A</strong>: Is there some way to adapt this argument to density Hales-Jewett, or at least to the corners problem?  The sparsity of lines is a serious difficulty, see <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1831">Tao.118</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1859">Solymosi.135</a>.</p>
<p>One can at least generate lots of cubes inside A without difficulty (<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1806">Tao.102</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1859">Solymosi.135</a>), lots of combinatorial lines with two elements inside A (<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1853">Tao.130</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1856">Bukh.132</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1857">O&#8217;Donnell.133</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1859">Solymosi.135</a>), and many unorganised subspaces in the complement of A (<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1831">Tao.118</a>, <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/#comment-1859">Solymosi.135</a>), but this does not seem to be enough as yet.</p>
<p><strong>Question V.B:</strong> What about using the techniques from Szemeredi&#8217;s big paper on Szemeredi&#8217;s theorem?  (The immediate issue is that there is a key point where one needs to regularise a graph, and the relevant graph in HJ or even for corners is far too sparse; I&#8217;ll try to clarify this at some point.)</p>
<p style="text-align:center;">&#8211; A note on comments &#8211;</p>
<p>As in the first thread, please number your comments (starting with 200, then 201, etc.) and provide a title, to assist with following the comments.  (Presumably, in future projects of this type, we will use a platform that allows for comment threading; see <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/04/quick-question/">this post</a> for further discussion.)</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[A massively collaborative mathematical project]]></title>
<link>http://terrytao.wordpress.com/2009/02/01/a-massively-collaborative-mathematical-project/</link>
<pubDate>Sun, 01 Feb 2009 19:11:32 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/02/01/a-massively-collaborative-mathematical-project/</guid>
<description><![CDATA[My good friend Tim Gowers has just started an experimental &#8220;massively collaborative mathematic]]></description>
<content:encoded><![CDATA[<p>My good friend <a href="http://en.wikipedia.org/wiki/Timothy_Gowers">Tim Gowers</a> has just started an experimental &#8220;<a class="snap_noshots" href="http://gowers.wordpress.com/2009/01/27/is-massively-collaborative-mathematics-possible/">massively collaborative mathematical project</a>&#8221; over at <a class="snap_noshots" href="http://gowers.wordpress.com/">his blog</a>.  The project is entitled &#8220;<a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/">A combinatorial approach to density Hales-Jewett</a>&#8220;, and the aim is to see if progress can be made on this problem by many small contributions by a large number of people, as opposed to the traditional model of a few very large contributions by a small number of people (see <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/questions-of-procedure/">this article</a> for more on the &#8220;rules of the game&#8221;, and <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/why-this-particular-problem/">this article</a> for why this particular project was picked as a test project).  I think this is an interesting experiment, and hopefully a successful one, though it is far too early to tell as yet.</p>
<p>I can describe the problem here.  Let n be a large integer, and let <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> be the set of all strings of length n using the alphabet <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C3%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,2,3&#92;}' title='&#92;{1,2,3&#92;}' class='latex' />, thus for instance <img src='http://s0.wp.com/latex.php?latex=13323+%5Cin+%5B3%5D%5E5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='13323 &#92;in [3]^5' title='13323 &#92;in [3]^5' class='latex' />.  A <em>combinatorial line</em> in <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> is a triple of points in <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> that can be formed by taking a string of length n using the alphabet <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C2%2C3%2Cx%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,2,3,x&#92;}' title='&#92;{1,2,3,x&#92;}' class='latex' /> with at least one occurrence of the &#8220;wildcard&#8221; x, and then substituting the values of 1, 2, 3 for the wildcard.  For instance, the string <img src='http://s0.wp.com/latex.php?latex=1xx2x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1xx2x' title='1xx2x' class='latex' /> would lead to the combinatorial line <img src='http://s0.wp.com/latex.php?latex=%5C%7B11121%2C+12222%2C+13323%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{11121, 12222, 13323&#92;}' title='&#92;{11121, 12222, 13323&#92;}' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5E5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^5' title='{}[3]^5' class='latex' />.  The (k=3) case of the <em>density Hales-Jewett theorem</em> <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1001397">of Furstenberg and Katznelson</a> asserts:</p>
<blockquote><p><strong>Density Hales-Jewett theorem.</strong> Let <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+%5Cdelta+%26%2360%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; &#92;delta &lt; 1' title='0 &lt; &#92;delta &lt; 1' class='latex' />.  Then if n is sufficiently large depending on <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />, every subset of <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B3%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[3]^n' title='{}[3]^n' class='latex' /> of density at least <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> contains a combinatorial line.</p></blockquote>
<p>[Furstenberg and Katznelson handled the case of general k <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1191743">in a subsequent paper</a>.  The k=1 case is trivial, and as pointed out in <a class="snap_noshots" href="http://gilkalai.wordpress.com/2009/01/28/mathematics-science-and-blogs/">this post by Gil Kalai</a>, the k=2 case follows from <a href="http://en.wikipedia.org/wiki/Sperner%27s_theorem">Sperner's theorem</a>.]</p>
<p>Furstenberg and Katznelson&#8217;s proof uses ergodic theory, and in particular does not obviously give any bound as to how large n has to be depending on <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> before the theorem takes effect.  No other proofs of this theorem are currently known.  So it would be desirable to have a combinatorial proof of the k=3 density Hales-Jewett theorem.  Since this theorem implies <a href="http://en.wikipedia.org/wiki/Szemer%C3%A9di%27s_theorem">Roth&#8217;s theorem</a>, and Roth&#8217;s theorem has a combinatorial proof based on the triangle removal lemma (see e.g. my <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/04/07/simons-lecture-ii-structure-and-randomness-in-ergodic-theory-and-graph-theory/">Simons lecture on the subject</a>, or Tim Gowers&#8217; <a class="snap_noshots" href="http://gowers.wordpress.com/2009/01/30/background-to-a-polymath-project/">background article</a> for the project), it is thus natural to ask whether the density Hales-Jewett theorem has a proof based on something similar to the triangle removal lemma.  This is basically the question being explored in <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/">the above project</a>.  (Some further motivation for this problem can be <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/why-this-particular-problem/">found here</a>.)</p>
<p>Further articles on <a class="snap_noshots" href="http://gowers.wordpress.com/2009/02/01/a-combinatorial-approach-to-density-hales-jewett/">this project</a> are collected at <a class="snap_noshots" href="http://wordpress.com/tag/polymath1/">this page</a>.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Tricks Wiki: Use basic examples to calibrate exponents]]></title>
<link>http://terrytao.wordpress.com/2008/12/27/tricks-wiki-use-basic-examples-to-calibrate-exponents/</link>
<pubDate>Sat, 27 Dec 2008 08:21:21 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/12/27/tricks-wiki-use-basic-examples-to-calibrate-exponents/</guid>
<description><![CDATA[Title: Use basic examples to calibrate exponents Motivation: In the more quantitative areas of mathe]]></description>
<content:encoded><![CDATA[<p><strong>Title:</strong> Use basic examples to calibrate exponents</p>
<p><strong>Motivation:</strong> In the more quantitative areas of mathematics, such as analysis and combinatorics, one has to frequently keep track of a large number of exponents in one&#8217;s identities, inequalities, and estimates.  For instance, if one is studying a set of N elements, then many expressions that one is faced with will often involve some power <img src='http://s0.wp.com/latex.php?latex=N%5Ep&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^p' title='N^p' class='latex' /> of N; if one is instead studying a function f on a measure space X, then perhaps it is an <img src='http://s0.wp.com/latex.php?latex=L%5Ep&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L^p' title='L^p' class='latex' /> norm <img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3Bf%5C%26%23124%3B_%7BL%5Ep%28X%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124;f&#92;&#124;_{L^p(X)}' title='&#92;&#124;f&#92;&#124;_{L^p(X)}' class='latex' /> which will appear instead.  The exponent <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p' title='p' class='latex' /> involved will typically evolve slowly over the course of the argument, as various algebraic or analytic manipulations are applied.  In some cases, the exact value of this exponent is immaterial, but at other times it is crucial to have the correct value of <img src='http://s0.wp.com/latex.php?latex=p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p' title='p' class='latex' /> at hand.   One can (and should) of course carefully go through one&#8217;s arguments line by line to work out the exponents correctly, but it is all too easy to make a sign error or other mis-step at one of the lines, causing all the exponents on subsequent lines to be incorrect.  However, one can guard against this (and avoid some tedious line-by-line exponent checking) by continually <em>calibrating</em> these exponents at key junctures of the arguments by using basic examples of the object of study (sets, functions, graphs, etc.) as test cases.  This is a simple trick, but it lets one avoid many unforced errors with exponents, and also lets one compute more rapidly.</p>
<p><strong>Quick description: </strong> When trying to quickly work out what an exponent p in an estimate, identity, or inequality should be without deriving that statement line-by-line, test that statement with a simple example which has non-trivial behaviour with respect to that exponent p, but trivial behaviour with respect to as many other components of that statement as one is able to manage.   The &#8220;non-trivial&#8221; behaviour should be parametrised by some very large or very small parameter.  By matching the dependence on this parameter on both sides of the estimate, identity, or inequality, one should recover p (or at least a good prediction as to what p should be).</p>
<p><strong>General discussion:</strong> The test examples should be as basic as possible; ideally they should have trivial behaviour in all aspects except for one feature that relates to the exponent p that one is trying to calibrate, thus being only &#8220;barely&#8221; non-trivial.   When the object of study is a function, then (appropriately rescaled, or otherwise modified) bump functions are very typical test objects, as are Dirac masses, constant functions, Gaussians, or other functions that are simple and easy to compute with.  In additive combinatorics, when the object of study is a subset of a group, then subgroups, arithmetic progressions, or random sets are typical test objects.  In graph theory, typical examples of test objects include complete graphs, complete bipartite graphs, and random graphs. And so forth.</p>
<p>This trick is closely related to that of using <a href="http://en.wikipedia.org/wiki/Dimensional_analysis">dimensional analysis</a> to recover exponents; indeed, one can view dimensional analysis as the special case of exponent calibration when using test objects which are non-trivial in one dimensional aspect (e.g. they exist at a single very large or very small length scale) but are otherwise of a trivial or &#8220;featureless&#8221; nature.   But the calibration trick is more general, as it can involve parameters (such as probabilities, angles, or eccentricities) which are not commonly associated with the physical concept of a dimension.  And personally, I find example-based calibration to be a much more satisfying (and convincing) explanation of an exponent than a calibration arising from formal dimensional analysis.</p>
<p>When one is trying to calibrate an inequality or estimate, one should try to pick a basic example which one expects to <em>saturate</em> that inequality or estimate, i.e. an example for which the inequality is close to being an equality.  Otherwise, one would only expect to obtain some partial information on the desired exponent p (e.g. a lower bound or an upper bound only).  Knowing the examples that saturate an estimate that one is trying to prove is also useful for several other reasons &#8211; for instance, it strongly suggests that any technique which is not efficient when applied to the saturating example, is unlikely to be strong enough to prove the estimate in general, thus eliminating fruitless approaches to a problem and (hopefully) refocusing one&#8217;s attention on those strategies which actually have a chance of working.</p>
<p>Calibration is best used for the type of quick-and-dirty calculations one uses when trying to rapidly map out an argument that one has roughly worked out already, but without precise details; in particular, I find it particularly useful when <a class="snap_noshots" href="http://terrytao.wordpress.com/advice-on-writing-papers/write-a-rapid-prototype-first/">writing up a rapid prototype</a>.  When the time comes to write out the paper in full detail, then of course one should instead carefully work things out line by line, but if all goes well, the exponents obtained in that process should match up with the preliminary guesses for those exponents obtained by calibration, which adds confidence that there are no exponent errors have been committed.</p>
<p><strong>Prerequisites:</strong> Undergraduate analysis and combinatorics.</p>
<p><!--more--></p>
<p><strong>Example 1. </strong>(Elementary identities)  There is a familiar identity for the sum of the first n squares:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+1%5E2+%2B+2%5E2+%2B+3%5E2+%2B+%5Cldots+%2B+n%5E2+%3D+%3F%3F%3F&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle 1^2 + 2^2 + 3^2 + &#92;ldots + n^2 = ???' title='&#92;displaystyle 1^2 + 2^2 + 3^2 + &#92;ldots + n^2 = ???' class='latex' /> (1)</p>
<p>But imagine that one has forgotten exactly what the RHS of (1) was supposed to be&#8230; one remembers that it was some polynomial in n, but can&#8217;t remember what the degree or coefficients of the polynomial were.  Now one can of course try to rederive the identity, but there are faster (albeit looser) ways to reconstruct the right-hand side.  Firstly, we can look at the asymptotic test case <img src='http://s0.wp.com/latex.php?latex=n+%5Cto+%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;to &#92;infty' title='n &#92;to &#92;infty' class='latex' />.  On the LHS, we are summing n terms of size at most <img src='http://s0.wp.com/latex.php?latex=n%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^2' title='n^2' class='latex' />, so the LHS is at most <img src='http://s0.wp.com/latex.php?latex=n%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^3' title='n^3' class='latex' />; thus, if we believe the RHS to be a polynomial in n, it should be at most cubic in n.  We can do a bit better by approximating the sum in the LHS by the integral <img src='http://s0.wp.com/latex.php?latex=%5Cint_0%5En+x%5E2%5C+dx+%3D+n%5E3%2F3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;int_0^n x^2&#92; dx = n^3/3' title='&#92;int_0^n x^2&#92; dx = n^3/3' class='latex' />, which strongly suggests that the cubic term on the RHS should be <img src='http://s0.wp.com/latex.php?latex=n%5E3%2F3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^3/3' title='n^3/3' class='latex' />.  So now we have</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+1%5E2+%2B+2%5E2+%2B+3%5E2+%2B+%5Cldots+%2B+n%5E2+%3D+%5Cfrac%7B1%7D%7B3%7D+n%5E3+%2B+a+n%5E2+%2B+b+n+%2B+c&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle 1^2 + 2^2 + 3^2 + &#92;ldots + n^2 = &#92;frac{1}{3} n^3 + a n^2 + b n + c' title='&#92;displaystyle 1^2 + 2^2 + 3^2 + &#92;ldots + n^2 = &#92;frac{1}{3} n^3 + a n^2 + b n + c' class='latex' /></p>
<p>for some coefficients a,b,c that we still have to work out.</p>
<p>We can plug in some other basic cases.  A simple one is n=0.  The LHS is now zero, and so the constant coefficient c on the RHS should also vanish.  A slightly less obvious base case is n=-1.  The LHS is still zero (note that the LHS for n-1 should be the LHS for n, minus <img src='http://s0.wp.com/latex.php?latex=n%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^2' title='n^2' class='latex' />), and so the RHS still vanishes here; thus by the factor theorem, the RHS should have both n and n+1 as factors.  We are now looking at</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+1%5E2+%2B+2%5E2+%2B+3%5E2+%2B+%5Cldots+%2B+n%5E2+%3D+n%28n%2B1%29+%28+%5Cfrac%7B1%7D%7B3%7D+n+%2B+d+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle 1^2 + 2^2 + 3^2 + &#92;ldots + n^2 = n(n+1) ( &#92;frac{1}{3} n + d )' title='&#92;displaystyle 1^2 + 2^2 + 3^2 + &#92;ldots + n^2 = n(n+1) ( &#92;frac{1}{3} n + d )' class='latex' /></p>
<p>for some unspecified constant d.  But now we just need to try one more test case, e.g. n=1, and we learn that <img src='http://s0.wp.com/latex.php?latex=d+%3D+1%2F6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d = 1/6' title='d = 1/6' class='latex' />, thus recovering the correct formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+1%5E2+%2B+2%5E2+%2B+3%5E2+%2B+%5Cldots+%2B+n%5E2+%3D+%5Cfrac%7Bn%28n%2B1%29+%282n%2B1%29%7D%7B6%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle 1^2 + 2^2 + 3^2 + &#92;ldots + n^2 = &#92;frac{n(n+1) (2n+1)}{6}' title='&#92;displaystyle 1^2 + 2^2 + 3^2 + &#92;ldots + n^2 = &#92;frac{n(n+1) (2n+1)}{6}' class='latex' />. (1&#8242;)</p>
<p style="text-align:left;">Once one has the formula (1&#8242;) in hand, of course, it is not difficult to verify by a textbook use of mathematical induction that the formula is in fact valid.  (Alternatively, one can prove a more abstract theorem that the sum of the first n <img src='http://s0.wp.com/latex.php?latex=k%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k^{th}' title='k^{th}' class='latex' /> powers is necessarily a polynomial in n for any given k, at which point the above analysis actually becomes a rigorous derivation of (1&#8242;).)</p>
<p>Note that the optimal strategy here is to start with the most basic test cases (<img src='http://s0.wp.com/latex.php?latex=n+%5Cto+%5Cinfty%2C+n+%3D+0%2C+n+%3D+-1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;to &#92;infty, n = 0, n = -1' title='n &#92;to &#92;infty, n = 0, n = -1' class='latex' />) first before moving on to less trivial cases.  If instead one used, e.g. n=1, n=2, n=3, n=4 as the test cases, one would eventually have obtained the right answer, but it would have been more work.</p>
<p><strong>Exercise 1.</strong> (<a href="http://en.wikipedia.org/wiki/Partial_fraction">Partial fractions</a>)  If <img src='http://s0.wp.com/latex.php?latex=w_1%2C%5Cldots%2Cw_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w_1,&#92;ldots,w_k' title='w_1,&#92;ldots,w_k' class='latex' /> are distinct complex numbers, and P(z) is a polynomial of degree less than k, establish the existence of a partial fraction decomposition</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cfrac%7BP%28z%29%7D%7B%28z-w_1%29+%5Cldots+%28z-w_k%29%7D+%3D+%5Cfrac%7Bc_1%7D%7Bz-w_1%7D+%2B+%5Cldots+%2B+%5Cfrac%7Bc_k%7D%7Bz-w_k%7D%2C&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;frac{P(z)}{(z-w_1) &#92;ldots (z-w_k)} = &#92;frac{c_1}{z-w_1} + &#92;ldots + &#92;frac{c_k}{z-w_k},' title='&#92;displaystyle &#92;frac{P(z)}{(z-w_1) &#92;ldots (z-w_k)} = &#92;frac{c_1}{z-w_1} + &#92;ldots + &#92;frac{c_k}{z-w_k},' class='latex' /></p>
<p>(Hint: use the <a href="http://en.wikipedia.org/wiki/Polynomial_remainder_theorem">remainder theorem</a> and induction) and use the test cases <img src='http://s0.wp.com/latex.php?latex=z+%5Cto+w_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='z &#92;to w_j' title='z &#92;to w_j' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=j%3D1%2C%5Cldots%2Ck&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='j=1,&#92;ldots,k' title='j=1,&#92;ldots,k' class='latex' /> to compute the coefficients <img src='http://s0.wp.com/latex.php?latex=c_1%2C%5Cldots%2Cc_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_1,&#92;ldots,c_k' title='c_1,&#92;ldots,c_k' class='latex' />.  Use this to deduce the <a href="http://en.wikipedia.org/wiki/Lagrange_polynomial">Lagrange interpolation formula</a>. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 2.</strong> (Counting cycles in a graph)  Suppose one has a <a href="http://en.wikipedia.org/wiki/Graph_(mathematics)">graph</a> G on N vertices with an edge density of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> (thus, the number of edges is <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%5Cbinom%7BN%7D%7B2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &#92;binom{N}{2}' title='&#92;delta &#92;binom{N}{2}' class='latex' />, or roughly <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta N^2' title='&#92;delta N^2' class='latex' /> up to constants).   There is a standard <a href="http://en.wikipedia.org/wiki/Cauchy%E2%80%93Schwarz_inequality">Cauchy-Schwarz</a> argument that gives a lower bound on the number of four-<a href="http://en.wikipedia.org/wiki/Cycle_(graph_theory)">cycles</a> <img src='http://s0.wp.com/latex.php?latex=C_4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='C_4' title='C_4' class='latex' /> (i.e. a circuit of four vertices connected by four edges) present in G, as a function of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> and N.  It only takes a few minutes to reconstruct this argument to obtain the precise bound, but suppose one was in a hurry and wanted to guess the bound rapidly.  Given the &#8220;polynomial&#8221; nature of the Cauchy-Schwarz inequality, the bound is likely to be some polynomial combination of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> and N, such as <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5Ep+N%5Eq&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta^p N^q' title='&#92;delta^p N^q' class='latex' /> (omitting constants and lower order terms).   But what should p and q be?</p>
<p>Well, one can test things with some basic examples.  A really trivial example is the empty graph (where <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta = 0' title='&#92;delta = 0' class='latex' />), but this is too trivial to tell us anything much (other than that p should probably be positive).   At the other extreme, consider the complete graph on N vertices, where <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%3D+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta = 1' title='&#92;delta = 1' class='latex' />; this renders p irrelevant, but still makes q non-trivial (and thus, hopefully, computable).  In the complete graph, every set of four points yields a four-cycle <img src='http://s0.wp.com/latex.php?latex=C_4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='C_4' title='C_4' class='latex' />, so the number of four-cycles here should be about <img src='http://s0.wp.com/latex.php?latex=N%5E4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^4' title='N^4' class='latex' /> (give or take some constant factors, such as 4! &#8211; remember that we are in a hurry here, and are ignoring these sorts of constant factors).  This tells us that q should be at most 4, and if we expect the Cauchy-Schwarz bound to be saturated for the complete graph (which is a good bet &#8211; arguments based on the Cauchy-Schwarz inequality tend to work well in very &#8220;uniformly distributed&#8221; situations) &#8211; then we would expect q to be exactly 4.</p>
<p>To calibrate p, we need to test with graphs of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> less than 1.  Given the previous intuition that Cauchy-Schwarz arguments work well in uniformly distributed situations, we would want to use a test graph of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> that is more or less uniformly distributed.  A good example of such a graph is a <em>random</em> graph G on N vertices, in which each edge has an independent probability of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> of lying in G.  By the <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/06/18/the-strong-law-of-large-numbers/">law of large numbers</a>, we expect the edge density of such a random graph to be close to <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> on the average.  On the other hand, each one of the roughly <img src='http://s0.wp.com/latex.php?latex=N%5E4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^4' title='N^4' class='latex' /> four-cycles <img src='http://s0.wp.com/latex.php?latex=C_4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='C_4' title='C_4' class='latex' /> connecting the N vertices has a probability about <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5E4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta^4' title='&#92;delta^4' class='latex' /> of lying in the graph, since the <img src='http://s0.wp.com/latex.php?latex=C_4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='C_4' title='C_4' class='latex' /> has four edges, each with an independent probability of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> of lying in the edge.   The events that each of the four-cycles lies in the graph G aren&#8217;t completely independent of each other, but they are still close enough to being so that one can guess using the law of large numbers that the total number of 4-cycles should be about <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5E4+N%5E4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta^4 N^4' title='&#92;delta^4 N^4' class='latex' /> on the average (up to constants).  [Actually, linearity of expectation will give us this claim even without any independence whatsoever.]  So this leads one to predict p=4, thus the number of 4-cycles in any graph on N vertices of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> should be <img src='http://s0.wp.com/latex.php?latex=%5Cgeq+c+%5Cdelta%5E4+N%5E4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;geq c &#92;delta^4 N^4' title='&#92;geq c &#92;delta^4 N^4' class='latex' /> for some absolute constant <img src='http://s0.wp.com/latex.php?latex=c%26%2362%3B0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&gt;0' title='c&gt;0' class='latex' />, and this is indeed the case (provided that one also counts degenerate cycles, in which some vertices are repeated).</p>
<p>If one is nervous about using the random graph as the test graph, one could try a graph at the other end of the spectrum &#8211; e.g. the complete graph on about <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Cdelta%7D+N&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sqrt{&#92;delta} N' title='&#92;sqrt{&#92;delta} N' class='latex' /> vertices, which also has edge density about <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />.  Here one quickly calculates that the number of 4-cycles is about <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5E2+N%5E4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta^2 N^4' title='&#92;delta^2 N^4' class='latex' />, which is a larger quantity than in the random case (and this fits with the intuition that this graph is packing the same number of edges into a tighter space, and should thus increase the number of cycles).  So the random graph is still the best candidate for a near-extremiser for this bound.  (Actually, if the number of 4-cycles is close to the Cauchy-Schwarz lower bound, then the graph becomes <em>pseudorandom</em>, which roughly speaking means any randomly selected small subgraph of that graph is indistinguishable from a random graph.)</p>
<p>One should caution that sometimes the random object is <em>not</em> the extremiser, and so does not always calibrate an estimate correctly.  For instance, consider <a href="http://en.wikipedia.org/wiki/Szemer%C3%A9di%27s_theorem">Szemerédi&#8217;s theorem</a>, that asserts that given any <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+%5Cdelta+%26%2360%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; &#92;delta &lt; 1' title='0 &lt; &#92;delta &lt; 1' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=k+%26%2362%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &gt; 1' title='k &gt; 1' class='latex' />, that any subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> should contain at least one arithmetic progression of length k, if N is large enough.  One can then ask what is the minimum number of k-term arithmetic progressions such a set would contain.  Using the random subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> as a test case, we would guess that there should be about <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5Ek+N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta^k N^2' title='&#92;delta^k N^2' class='latex' /> (up to constants depending on k).    However, it turns out that the number of progressions can be significantly less than this (basically thanks to the old <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=18694">counterexample of Behrend</a>): given any constant C, one can get significantly fewer than <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5EC+N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta^C N^2' title='&#92;delta^C N^2' class='latex' /> k-term progressions.  But, thanks to an <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=106865">averaging argument of Varnavides</a>, it is known that there are at least <img src='http://s0.wp.com/latex.php?latex=c%28k%2C%5Cdelta%29+N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c(k,&#92;delta) N^2' title='c(k,&#92;delta) N^2' class='latex' /> k-term progressions (for N large enough), where <img src='http://s0.wp.com/latex.php?latex=c%28k%2C%5Cdelta%29+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c(k,&#92;delta) &gt; 0' title='c(k,&#92;delta) &gt; 0' class='latex' /> is a positive quantity.  (Determining the exact order of magnitude of <img src='http://s0.wp.com/latex.php?latex=c%28k%2C%5Cdelta%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c(k,&#92;delta)' title='c(k,&#92;delta)' class='latex' /> is still an important open problem in this subject.)  So one can at least calibrate the correct dependence on N, even if the dependence on <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> is still unknown.</p>
<p><strong>Example 3.</strong> (Sobolev embedding)  Given a reasonable function <img src='http://s0.wp.com/latex.php?latex=f%3A+%7B%5CBbb+R%7D%5En+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: {&#92;Bbb R}^n &#92;to {&#92;Bbb R}' title='f: {&#92;Bbb R}^n &#92;to {&#92;Bbb R}' class='latex' /> (e.g. a <a href="http://en.wikipedia.org/wiki/Schwartz_space">Schwartz class function</a> will do), the <a href="http://en.wikipedia.org/wiki/Sobolev_inequality">Sobolev embedding theorem</a> gives estimates such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3B+f+%5C%26%23124%3B_%7BL%5Eq%28%7B%5CBbb+R%7D%5En%29%7D+%5Cleq+C_%7Bn%2Cp%2Cq%7D+%5C%26%23124%3B%5Cnabla+f%5C%26%23124%3B_%7BL%5Ep%28%7B%5CBbb+R%7D%5En%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124; f &#92;&#124;_{L^q({&#92;Bbb R}^n)} &#92;leq C_{n,p,q} &#92;&#124;&#92;nabla f&#92;&#124;_{L^p({&#92;Bbb R}^n)}' title='&#92;displaystyle &#92;&#124; f &#92;&#124;_{L^q({&#92;Bbb R}^n)} &#92;leq C_{n,p,q} &#92;&#124;&#92;nabla f&#92;&#124;_{L^p({&#92;Bbb R}^n)}' class='latex' /> (2)</p>
<p>for various exponents p, q.  Suppose one has forgotten the exact relationship between p, q, and n and wants to quickly reconstruct it, without rederiving the proof of the theorem or looking it up.  One could use dimensional analysis to work out the relationship (and we will come to that shortly), but an equivalent way to achieve the same result is to test the inequality (2) against a suitably basic example, preferably one that one expects to saturate (2).</p>
<p>To come as close to saturating (2) as possible, one wants to keep the gradient of f small, while making f large; among other things, this suggests that unnecessary oscillations in f should be kept to a minimum.  A natural candidate for an extremiser, then, would be a <em>rescaled bump function</em> <img src='http://s0.wp.com/latex.php?latex=f%28x%29+%3D+A%5Cphi%28x%2FL%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x) = A&#92;phi(x/L)' title='f(x) = A&#92;phi(x/L)' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Cphi+%5Cin+C%5E%5Cinfty_0%28%7B%5CBbb+R%7D%5En%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;phi &#92;in C^&#92;infty_0({&#92;Bbb R}^n)' title='&#92;phi &#92;in C^&#92;infty_0({&#92;Bbb R}^n)' class='latex' /> is some fixed bump function, <img src='http://s0.wp.com/latex.php?latex=A+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &gt; 0' title='A &gt; 0' class='latex' /> is an amplitude parameter, and <img src='http://s0.wp.com/latex.php?latex=L+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L &gt; 0' title='L &gt; 0' class='latex' /> is a parameter, thus f is a rescaled bump function of bounded amplitude O(A) that is supported on a ball of radius O(r) centred at the origin.  [As the estimate (2) is linear, the amplitude A turns out to ultimately be irrelevant here, but the amplitude plays a more crucial role in nonlinear estimates; for instance, it explains why nonlienar estimates typically have the same number of appearances of a given unknown function f in each term.  Also, it is sometimes convenient to carefully choose the amplitude in order to attain a convenient normalisation, e.g. to set one of the norms in (2) equal to 1.]</p>
<p>The ball that f is supported on has volume about <img src='http://s0.wp.com/latex.php?latex=O%28L%5En%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(L^n)' title='O(L^n)' class='latex' /> (allowing implied constants to depend on n), and so the <img src='http://s0.wp.com/latex.php?latex=L%5Eq&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L^q' title='L^q' class='latex' /> norm of f should be about <img src='http://s0.wp.com/latex.php?latex=O%28L%5E%7Bn%2Fq%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(L^{n/q} )' title='O(L^{n/q} )' class='latex' /> (allowing implied constants to depend on q as well).  As for the gradient of f, since f oscillates by O(A) over a length scale of O(L), one expects <img src='http://s0.wp.com/latex.php?latex=%5Cnabla+f&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla f' title='&#92;nabla f' class='latex' /> to have size about <img src='http://s0.wp.com/latex.php?latex=O%28A%2FL%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(A/L)' title='O(A/L)' class='latex' /> on this ball (remember, derivatives measure &#8220;<a href="http://en.wikipedia.org/wiki/Slope">rise over run</a>&#8220;!), and so the <img src='http://s0.wp.com/latex.php?latex=L%5Ep&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L^p' title='L^p' class='latex' /> norm of <img src='http://s0.wp.com/latex.php?latex=%5Cnabla+f&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla f' title='&#92;nabla f' class='latex' /> should be about <img src='http://s0.wp.com/latex.php?latex=O%28+%5Cfrac%7BA%7D%7BL%7D+L%5E%7Bn%2Fp%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O( &#92;frac{A}{L} L^{n/p} )' title='O( &#92;frac{A}{L} L^{n/p} )' class='latex' />.  Inserting this numerology into (2), and equating powers of L (note A cancels itself into irrelevance, and could in any case be set to equal 1), we are led to the relation</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cfrac%7Bn%7D%7Bp%7D+-+1+%3D+%5Cfrac%7Bn%7D%7Bq%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;frac{n}{p} - 1 = &#92;frac{n}{q}' title='&#92;displaystyle &#92;frac{n}{p} - 1 = &#92;frac{n}{q}' class='latex' /> (2)</p>
<p>which is indeed one of the necessary conditions for (2).  (The other necessary conditions are that p and q lie strictly between 1 and infinity, but these require a more complicated test example to establish.)</p>
<p>One can efficiently perform the above argument using the language of dimensional analysis.  Giving f the units of amplitude A, and giving space the units of length L, we see that the n-dimensional integral <img src='http://s0.wp.com/latex.php?latex=%5Cint_%7B%7B%5CBbb+R%7D%5En%7D%5C+dx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;int_{{&#92;Bbb R}^n}&#92; dx' title='&#92;int_{{&#92;Bbb R}^n}&#92; dx' class='latex' /> has units of <img src='http://s0.wp.com/latex.php?latex=L%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L^n' title='L^n' class='latex' />, and thus <img src='http://s0.wp.com/latex.php?latex=L%5Ep%28%7B%5CBbb+R%7D%5En%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L^p({&#92;Bbb R}^n)' title='L^p({&#92;Bbb R}^n)' class='latex' /> norms have units of <img src='http://s0.wp.com/latex.php?latex=L%5E%7Bn%2Fp%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L^{n/p}' title='L^{n/p}' class='latex' />.  Meanwhile, from the rise-over-run interpretation of the derivative, <img src='http://s0.wp.com/latex.php?latex=%5Cnabla+f&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla f' title='&#92;nabla f' class='latex' /> has units of <img src='http://s0.wp.com/latex.php?latex=A%2FL&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A/L' title='A/L' class='latex' />, thus the LHS and RHS of (2) have units of <img src='http://s0.wp.com/latex.php?latex=A+L%5E%7Bn%2Fq%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A L^{n/q}' title='A L^{n/q}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7BA%7D%7BL%7D+L%5E%7Bn%2Fp%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{A}{L} L^{n/p}' title='&#92;frac{A}{L} L^{n/p}' class='latex' /> respectively.  Equating these dimensions gives (3).  Observe how this argument is basically a shorthand form of the argument based on using the rescaled bump function as a test case; with enough practice one can use this shorthand to calibrate exponents rapidly for a wide variety of estimates.</p>
<p><strong>Exercise 2.</strong> Convert the above discussion into a rigorous proof that (3) is a necessary condition for (2).  (Hint: exploit the freedom to send L to zero or to infinity.)  What happens to the necessary conditions if <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' /> is replaced with a bounded domain (such as the unit cube <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5B0%2C1%5D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[0,1]^n' title='{}[0,1]^n' class='latex' />, assuming Dirichlet boundary conditions) or a discrete domain (such as the lattice <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}^n' title='{&#92;Bbb Z}^n' class='latex' />, replacing the gradient with a discrete gradient of course)? <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Exercise 3. </strong> If one replaces (2) by the variant estimate</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3B+f+%5C%26%23124%3B_%7BL%5Eq%28%7B%5CBbb+R%7D%5En%29%7D+%5Cleq+C_%7Bn%2Cp%2Cq%7D+%28%5C%26%23124%3Bf%5C%26%23124%3B_%7BL%5Ep%28%7B%5CBbb+R%7D%5En%29%7D+%2B+%5C%26%23124%3B%5Cnabla+f%5C%26%23124%3B_%7BL%5Ep%28%7B%5CBbb+R%7D%5En%29%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124; f &#92;&#124;_{L^q({&#92;Bbb R}^n)} &#92;leq C_{n,p,q} (&#92;&#124;f&#92;&#124;_{L^p({&#92;Bbb R}^n)} + &#92;&#124;&#92;nabla f&#92;&#124;_{L^p({&#92;Bbb R}^n)})' title='&#92;displaystyle &#92;&#124; f &#92;&#124;_{L^q({&#92;Bbb R}^n)} &#92;leq C_{n,p,q} (&#92;&#124;f&#92;&#124;_{L^p({&#92;Bbb R}^n)} + &#92;&#124;&#92;nabla f&#92;&#124;_{L^p({&#92;Bbb R}^n)})' class='latex' /> (2&#8242;)</p>
<p>establish the necessary condition</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cfrac%7Bn%7D%7Bp%7D+-+1+%5Cleq+%5Cfrac%7Bn%7D%7Bq%7D+%5Cleq+%5Cfrac%7Bn%7D%7Bp%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;frac{n}{p} - 1 &#92;leq &#92;frac{n}{q} &#92;leq &#92;frac{n}{p}' title='&#92;displaystyle &#92;frac{n}{p} - 1 &#92;leq &#92;frac{n}{q} &#92;leq &#92;frac{n}{p}' class='latex' />. (3&#8242;)</p>
<p style="text-align:left;">What happens to the dimensional analysis argument in this case? <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Remark 1. </strong> There are many other estimates in harmonic analysis which are saturated by some modification of a bump function; in addition to the isotropically rescaled bump functions used above, one could also rescale bump functions by some non-isotropic linear transformation (thus creating various &#8220;squashed&#8221; or &#8220;stretched&#8221; bumps adapted to disks, tubes, rectangles, or other sets), or modulate bumps by various frequencies, or translate them around in space.  One can also try to superimpose several such transformed bump functions together to amplify the counterexample.  The art of selecting good counterexamples can be somewhat difficult, although with enough trial and error one gets a sense of what kind of arrangement of bump functions are needed to make the right-hand side small and the left-hand side large in the estimate under study. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 3 </strong> (Scale-invariance in nonlinear PDE)  The model equations and systems studied in nonlinear PDE often enjoy various symmetries, notably scale-invariance symmetry, that can then be used to calibrate various identities and estimates regarding solutions to those equations.  For sake of discussion, let us work with the <a href="http://en.wikipedia.org/wiki/Nonlinear_Schr%C3%B6dinger_equation">nonlinear Schrödinger equation</a> (NLS)</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+i+u_t+%2B+%5CDelta+u+%3D+%26%23124%3Bu%26%23124%3B%5E%7Bp-1%7D+u&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle i u_t + &#92;Delta u = &#124;u&#124;^{p-1} u' title='&#92;displaystyle i u_t + &#92;Delta u = &#124;u&#124;^{p-1} u' class='latex' /> (4)</p>
<p>where <img src='http://s0.wp.com/latex.php?latex=u%3A+%7B%5CBbb+R%7D+%5Ctimes+%7B%5CBbb+R%7D%5En+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u: {&#92;Bbb R} &#92;times {&#92;Bbb R}^n &#92;to {&#92;Bbb C}' title='u: {&#92;Bbb R} &#92;times {&#92;Bbb R}^n &#92;to {&#92;Bbb C}' class='latex' /> is the unknown field, <img src='http://s0.wp.com/latex.php?latex=%5CDelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta' title='&#92;Delta' class='latex' /> is the spatial Laplacian, and <img src='http://s0.wp.com/latex.php?latex=p+%26%2362%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &gt; 1' title='p &gt; 1' class='latex' /> is a fixed exponent.  (One can also place in some other constants in (4), such as Planck&#8217;s constant <img src='http://s0.wp.com/latex.php?latex=%5Chbar&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbar' title='&#92;hbar' class='latex' />, but we have normalised this constant to equal 1 here, although it is sometimes useful to reinstate this constant for calibration purposes.) If u is one solution to (4), then we can form a rescaled family <img src='http://s0.wp.com/latex.php?latex=u%5E%7B%28%5Clambda%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u^{(&#92;lambda)}' title='u^{(&#92;lambda)}' class='latex' /> of such solutions by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+u%5E%7B%28%5Clambda%29%7D%28t%2Cx%29+%3A%3D+%5Cfrac%7B1%7D%7B%5Clambda%5Ea%7D+u%28+%5Cfrac%7Bt%7D%7B%5Clambda%5Eb%7D%2C+%5Cfrac%7Bx%7D%7B%5Clambda%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle u^{(&#92;lambda)}(t,x) := &#92;frac{1}{&#92;lambda^a} u( &#92;frac{t}{&#92;lambda^b}, &#92;frac{x}{&#92;lambda} )' title='&#92;displaystyle u^{(&#92;lambda)}(t,x) := &#92;frac{1}{&#92;lambda^a} u( &#92;frac{t}{&#92;lambda^b}, &#92;frac{x}{&#92;lambda} )' class='latex' /> (5)</p>
<p>for some specific exponents a, b; these play the role of the rescaled bump functions in Example 2.  The exponents a,b can be worked out by testing (4) using (5), and we leave this as an exercise to the reader, but let us instead use the shorthand of dimensional analysis to work these exponents out.  Let&#8217;s give u the units of amplitude A, space the units of length L, and time the units of duration T.  Then the three terms in (4) have units <img src='http://s0.wp.com/latex.php?latex=A%2FT&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A/T' title='A/T' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=A%2FL%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A/L^2' title='A/L^2' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=A%5Ep&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A^p' title='A^p' class='latex' /> respectively; equating these dimensions gives <img src='http://s0.wp.com/latex.php?latex=T%3DL%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T=L^2' title='T=L^2' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A%3DL%5E%7B-2%2F%28p-1%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A=L^{-2/(p-1)}' title='A=L^{-2/(p-1)}' class='latex' />.  (In particular, time has &#8220;twice the dimension&#8221; of space; this is a feature of many non-relativistic equations such as Schrödinger, heat, or viscosity equations.  For relativistic equations, of course, time and space have the same dimension with respect to scaling.)  On the other hand, the scaling (5) multiplies A, T, and L by <img src='http://s0.wp.com/latex.php?latex=%5Clambda%5E%7B-a%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda^{-a}' title='&#92;lambda^{-a}' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%5Clambda%5Eb&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda^b' title='&#92;lambda^b' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> respectively; to maintain consistency with the relations <img src='http://s0.wp.com/latex.php?latex=T%3DL%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T=L^2' title='T=L^2' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A%3DL%5E%7B-2%2F%28p-1%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A=L^{-2/(p-1)}' title='A=L^{-2/(p-1)}' class='latex' /> we must thus have <img src='http://s0.wp.com/latex.php?latex=a%3D2%2F%28p-1%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a=2/(p-1)' title='a=2/(p-1)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b%3D2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='b=2' title='b=2' class='latex' />.</p>
<p><strong>Exercise 4. </strong> Solutions to (4) (with suitable smoothness and decay properties) enjoy a conserved Hamiltonian <img src='http://s0.wp.com/latex.php?latex=H%28u%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='H(u)' title='H(u)' class='latex' />, of the form</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+H%28u%29+%3D+%5Cint_%7B%7B%5CBbb+R%7D%5En%7D+%5Cfrac%7B1%7D%7B2%7D+%26%23124%3B%5Cnabla+u%26%23124%3B%5E2+%2B+%5Calpha+%26%23124%3Bu%26%23124%3B%5Eq%5C+dx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle H(u) = &#92;int_{{&#92;Bbb R}^n} &#92;frac{1}{2} &#124;&#92;nabla u&#124;^2 + &#92;alpha &#124;u&#124;^q&#92; dx' title='&#92;displaystyle H(u) = &#92;int_{{&#92;Bbb R}^n} &#92;frac{1}{2} &#124;&#92;nabla u&#124;^2 + &#92;alpha &#124;u&#124;^q&#92; dx' class='latex' /></p>
<p>for some constants <img src='http://s0.wp.com/latex.php?latex=%5Calpha%2C+q&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha, q' title='&#92;alpha, q' class='latex' />.  Use dimensional analysis (or the rescaled solutions (5) as test cases) to compute q.  (The constant <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' />, unfortunately, cannot be recovered from dimensional analysis, and other model test cases, such as solitons or other solutions obtained via <a href="http://en.wikipedia.org/wiki/Separation_of_variables">separation of variables</a>, also turn out unfortunately to not be sensitive enough to <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> to calibrate this parameter.) <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Remark 2.</strong> The scaling symmetry (5) is not the only symmetry that can be deployed to calibrate identities and estimates for solutions to NLS.  For instance, we have a simple phase rotation symmetry <img src='http://s0.wp.com/latex.php?latex=u+%5Cmapsto+e%5E%7Bi%5Ctheta%7D+u&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u &#92;mapsto e^{i&#92;theta} u' title='u &#92;mapsto e^{i&#92;theta} u' class='latex' /> for such solutions, where <img src='http://s0.wp.com/latex.php?latex=%5Ctheta+%5Cin+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;theta &#92;in {&#92;Bbb R}' title='&#92;theta &#92;in {&#92;Bbb R}' class='latex' /> is an arbitrary phase.  This symmetry suggests that in any identity involving u and its complex conjugate <img src='http://s0.wp.com/latex.php?latex=%5Cbar%7Bu%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;bar{u}' title='&#92;bar{u}' class='latex' />, the net number of factors of u, minus the factors of <img src='http://s0.wp.com/latex.php?latex=%5Cbar%7Bu%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;bar{u}' title='&#92;bar{u}' class='latex' />, in each term of the identity should be the same.  (Factors without phase, such as &#124;u&#124;, should be ignored for this analysis.)  Other important symmetries of NLS, which can also be used for calibration, include space translation symmetry, time translation symmetry, and <a class="snap_noshots" href="tosio.math.toronto.edu/wiki/index.php/Galilean">Galilean invariance</a>.  (While these symmetries can of course be joined together, to create a large-dimensional family of transformed solutions arising from a single base solution u, for the purposes of calibration it is usually best to just use each of the generating symmetries separately.)  For gauge field equations, gauge invariance is of course a crucial symmetry, though one can make the calibration procedure with respect to this symmetry automatic by working exclusively with gauge-invariant notation (see also my earlier post on <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/09/27/what-is-a-gauge/">gauge theory</a>).  Another important test case for Schrödinger equations is the <a class="snap_noshots" href="http://tosio.math.toronto.edu/wiki/index.php/High-frequency_limit">high-frequency limit</a> <img src='http://s0.wp.com/latex.php?latex=%26%23124%3B%5Cxi%26%23124%3B+%5Cto+%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;&#92;xi&#124; &#92;to &#92;infty' title='&#124;&#92;xi&#124; &#92;to &#92;infty' class='latex' />, closely related to the <em>semi-classical limit</em> <img src='http://s0.wp.com/latex.php?latex=%5Chbar+%5Cto+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbar &#92;to 0' title='&#92;hbar &#92;to 0' class='latex' />, that allows one to use classical mechanics to calibrate various identities and estimates in quantum mechanics.  <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Exercise 5.</strong> Solutions to (4) (again assuming suitable smoothness and decay) also enjoy a <a class="snap_noshots" href="tosio.math.toronto.edu/wiki/index.php/Virial_identity">virial identity</a> of the form</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cpartial_%7Btt%7D+%5Cint_%7B%7B%5CBbb+R%7D%5En%7D+x%5E2+%26%23124%3Bu%28t%2Cx%29%26%23124%3B%5E2%5C+dx+%3D+%5Cint_%7B%7B%5CBbb+R%7D%5En%7D+%3F%3F%3F%5C+dx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;partial_{tt} &#92;int_{{&#92;Bbb R}^n} x^2 &#124;u(t,x)&#124;^2&#92; dx = &#92;int_{{&#92;Bbb R}^n} ???&#92; dx' title='&#92;displaystyle &#92;partial_{tt} &#92;int_{{&#92;Bbb R}^n} x^2 &#124;u(t,x)&#124;^2&#92; dx = &#92;int_{{&#92;Bbb R}^n} ???&#92; dx' class='latex' /></p>
<p>where the right-hand side only involves u and its spatial derivatives <img src='http://s0.wp.com/latex.php?latex=%5Cnabla+u&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla u' title='&#92;nabla u' class='latex' />, and does not explicitly involve the spatial variable x.  Using the various symmetries, predict the type of terms that should go on the right-hand side.  (Again, the coefficients of these terms are unable to be calibrated using these methods, but the exponents should be accessible.) <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Remark 3.</strong> Einstein used this sort of calibration technique (using the symmetry of spacetime diffeomorphisms, better known as the <a href="http://en.wikipedia.org/wiki/Principle_of_relativity">general principle of relativity</a>, as well as the non-relativistic limit of Newtonian gravity as another test case) to derive the <a href="http://en.wikipedia.org/wiki/Einstein_field_equations">Einstein equations of gravity</a>, although the one constant that he was unable to calibrate in this fashion was the <a href="http://en.wikipedia.org/wiki/Cosmological_constant">cosmological constant</a>. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 4</strong> (Fourier-analytic identities in additive combinatorics).   Fourier analysis is a useful tool in additive combinatorics for counting various configurations in sets, such as arithmetic progressions <img src='http://s0.wp.com/latex.php?latex=n%2C+n%2Br%2C+n%2B2r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n, n+r, n+2r' title='n, n+r, n+2r' class='latex' /> of length three.  (It turns out that classical Fourier analysis is not able to handle progressions of any longer length, but that is a story for another time &#8211; see e.g. <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1631259">this paper of Gowers</a> for some discussion.)  A typical situation arises when working in a finite group such as <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}/N{&#92;Bbb Z}' title='{&#92;Bbb Z}/N{&#92;Bbb Z}' class='latex' />, and one has to compute an expression such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Csum_%7Bn%2C+r+%5Cin+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D%7D+f%28n%29+g%28n%2Br%29+h%28n%2B2r%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;sum_{n, r &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(n) g(n+r) h(n+2r)' title='&#92;displaystyle &#92;sum_{n, r &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(n) g(n+r) h(n+2r)' class='latex' /> (6)</p>
<p>for some functions <img src='http://s0.wp.com/latex.php?latex=f%2Cg%2Ch%3A+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f,g,h: {&#92;Bbb Z}/N{&#92;Bbb Z} &#92;to {&#92;Bbb C}' title='f,g,h: {&#92;Bbb Z}/N{&#92;Bbb Z} &#92;to {&#92;Bbb C}' class='latex' /> (for instance, these functions could all be the indicator function of a single set <img src='http://s0.wp.com/latex.php?latex=A+%5Csubset+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;subset {&#92;Bbb Z}/N{&#92;Bbb Z}' title='A &#92;subset {&#92;Bbb Z}/N{&#92;Bbb Z}' class='latex' />).  The quantity (6) can be expressed neatly in terms of the Fourier transforms <img src='http://s0.wp.com/latex.php?latex=%5Chat+f%2C+%5Chat+g%2C+%5Chat+h%3A+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hat f, &#92;hat g, &#92;hat h: {&#92;Bbb Z}/N{&#92;Bbb Z} &#92;to {&#92;Bbb C}' title='&#92;hat f, &#92;hat g, &#92;hat h: {&#92;Bbb Z}/N{&#92;Bbb Z} &#92;to {&#92;Bbb C}' class='latex' />, which we normalise as <img src='http://s0.wp.com/latex.php?latex=%5Chat+f%28%5Cxi%29+%3A%3D+%5Cfrac%7B1%7D%7BN%7D+%5Csum_%7Bx+%5Cin+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D%7D+f%28x%29+e%5E%7B-2%5Cpi+i+x+%5Cxi%2FN%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hat f(&#92;xi) := &#92;frac{1}{N} &#92;sum_{x &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(x) e^{-2&#92;pi i x &#92;xi/N}' title='&#92;hat f(&#92;xi) := &#92;frac{1}{N} &#92;sum_{x &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(x) e^{-2&#92;pi i x &#92;xi/N}' class='latex' />.  It is not too difficult to compute this expression by means of the Fourier inversion formula and some routine calculation, but suppose one was in a hurry and only had a vague recollection of what the Fourier-analytic expression of (6) was &#8211; something like</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+N%5Ep+%5Csum_%7B%5Cxi+%5Cin+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D%7D%5Chat+f%28+a+%5Cxi+%29+%5Chat+g%28+b+%5Cxi+%29+%5Chat+h%28+c+%5Cxi+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle N^p &#92;sum_{&#92;xi &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}}&#92;hat f( a &#92;xi ) &#92;hat g( b &#92;xi ) &#92;hat h( c &#92;xi )' title='&#92;displaystyle N^p &#92;sum_{&#92;xi &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}}&#92;hat f( a &#92;xi ) &#92;hat g( b &#92;xi ) &#92;hat h( c &#92;xi )' class='latex' /> (7)</p>
<p>for some coefficients p, a, b, c, but the precise values of which have been forgotten.  (In view of some other Fourier-analytic formulae, one might think that some of the Fourier transforms <img src='http://s0.wp.com/latex.php?latex=%5Chat+f%2C+%5Chat+g%2C+%5Chat+h&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hat f, &#92;hat g, &#92;hat h' title='&#92;hat f, &#92;hat g, &#92;hat h' class='latex' /> might need to be complex conjugated for (7), but this should not happen here, because (6) is linear in f,g,h rather than anti-linear; cf. the discussion in Example 3 about factors of u and <img src='http://s0.wp.com/latex.php?latex=%5Cbar%7Bu%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;bar{u}' title='&#92;bar{u}' class='latex' />.)  How can one quickly calibrate the values of p,a,b,c without doing the full calculation?</p>
<p>To isolate the exponent p, we can consider the basic case <img src='http://s0.wp.com/latex.php?latex=f+%5Cequiv+g+%5Cequiv+h+%5Cequiv+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f &#92;equiv g &#92;equiv h &#92;equiv 1' title='f &#92;equiv g &#92;equiv h &#92;equiv 1' class='latex' />, in which case the Fourier transforms are just the <a href="http://en.wikipedia.org/wiki/Kronecker_delta">Kronecker delta</a> function (e.g. <img src='http://s0.wp.com/latex.php?latex=%5Chat+f%28%5Cxi%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hat f(&#92;xi)' title='&#92;hat f(&#92;xi)' class='latex' /> equals 1 for <img src='http://s0.wp.com/latex.php?latex=%5Cxi%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi=0' title='&#92;xi=0' class='latex' /> and vanishes otherwise).  The expression (6) is just <img src='http://s0.wp.com/latex.php?latex=N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^2' title='N^2' class='latex' />, while the expression (7) is <img src='http://s0.wp.com/latex.php?latex=N%5Ep&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^p' title='N^p' class='latex' /> (because only one of the summands is non-trivial); thus p must equal 2.  (Exercise: reinterpret the above analysis as a dimensional analysis.)</p>
<p>Next, to calibrate a,b,c, we modify the above basic test case slightly, modulating the f,g,h so that a different element of the sum in (7) is non-zero.  Let us take <img src='http://s0.wp.com/latex.php?latex=f%28x%29+%3A%3D+e%5E%7B2%5Cpi+i+a+x+%5Cxi%2FN%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x) := e^{2&#92;pi i a x &#92;xi/N}' title='f(x) := e^{2&#92;pi i a x &#92;xi/N}' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=g%28x%29+%3A%3D+e%5E%7B2%5Cpi+i+b+x+%5Cxi%2FN%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g(x) := e^{2&#92;pi i b x &#92;xi/N}' title='g(x) := e^{2&#92;pi i b x &#92;xi/N}' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=h%28x%29+%3A%3D+e%5E%7B2%5Cpi+i+c+x+%5Cxi%2FN%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h(x) := e^{2&#92;pi i c x &#92;xi/N}' title='h(x) := e^{2&#92;pi i c x &#92;xi/N}' class='latex' /> for some fixed frequency <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' />; then (4) is again equal to <img src='http://s0.wp.com/latex.php?latex=N%5Ep%3DN%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^p=N^2' title='N^p=N^2' class='latex' />, while (6) is equal to</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Csum_%7Bn%2Cr+%5Cin+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D%7D+e%5E%7B2%5Cpi+i+%5B+a+n+%2B+b+%28n%2Br%29+%2B+c%28n%2B2r%29%5D+%5Cxi+%2F+N%7D.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;sum_{n,r &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} e^{2&#92;pi i [ a n + b (n+r) + c(n+2r)] &#92;xi / N}.' title='&#92;displaystyle &#92;sum_{n,r &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} e^{2&#92;pi i [ a n + b (n+r) + c(n+2r)] &#92;xi / N}.' class='latex' /></p>
<p>In order for this to equal <img src='http://s0.wp.com/latex.php?latex=N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^2' title='N^2' class='latex' /> for any <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' />, we need the linear form <img src='http://s0.wp.com/latex.php?latex=an%2Bb%28n%2Br%29%2Bc%28n%2B2r%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='an+b(n+r)+c(n+2r)' title='an+b(n+r)+c(n+2r)' class='latex' /> to vanish identically, which forces a=c and b=-2a.  We can normalise a=1 (by using the change of variables <img src='http://s0.wp.com/latex.php?latex=%5Cxi+%5Cmapsto+a+%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi &#92;mapsto a &#92;xi' title='&#92;xi &#92;mapsto a &#92;xi' class='latex' />), thus leading us to the correct expression for (7), namely</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+N%5E2+%5Csum_%7B%5Cxi+%5Cin+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D%7D%5Chat+f%28+%5Cxi+%29+%5Chat+g%28+-2+%5Cxi+%29+%5Chat+h%28+%5Cxi+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle N^2 &#92;sum_{&#92;xi &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}}&#92;hat f( &#92;xi ) &#92;hat g( -2 &#92;xi ) &#92;hat h( &#92;xi )' title='&#92;displaystyle N^2 &#92;sum_{&#92;xi &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}}&#92;hat f( &#92;xi ) &#92;hat g( -2 &#92;xi ) &#92;hat h( &#92;xi )' class='latex' />.</p>
<p>Once one actually has this formula, of course, it is a routine matter to check that it actually is the right answer.</p>
<p><strong>Remark 4.</strong> One can also calibrate a,b,c in (7) by observing the identity <img src='http://s0.wp.com/latex.php?latex=n+-+2%28n%2Br%29+%2B+%28n%2B2r%29%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n - 2(n+r) + (n+2r)=0' title='n - 2(n+r) + (n+2r)=0' class='latex' /> (which reflects the fact that the second derivative of a linear function is necessarily zero), which gives a modulation symmetry <img src='http://s0.wp.com/latex.php?latex=f%28x%29+%5Cmapsto+f%28x%29+e%5E%7B2%5Cpi+i+%5Calpha+x%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x) &#92;mapsto f(x) e^{2&#92;pi i &#92;alpha x}' title='f(x) &#92;mapsto f(x) e^{2&#92;pi i &#92;alpha x}' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=g%28x%29+%5Cmapsto+g%28x%29+e%5E%7B-4%5Cpi+i+%5Calpha+x%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g(x) &#92;mapsto g(x) e^{-4&#92;pi i &#92;alpha x}' title='g(x) &#92;mapsto g(x) e^{-4&#92;pi i &#92;alpha x}' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=h%28x%29+%5Cmapsto+h%28x%29+e%5E%7B2%5Cpi+i+%5Calpha+x%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h(x) &#92;mapsto h(x) e^{2&#92;pi i &#92;alpha x}' title='h(x) &#92;mapsto h(x) e^{2&#92;pi i &#92;alpha x}' class='latex' /> to (6).  Inserting this symmetry into (7) reveals that <img src='http://s0.wp.com/latex.php?latex=a%3Dc&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a=c' title='a=c' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b%3D-2a&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='b=-2a' title='b=-2a' class='latex' /> as before. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Remark 5.</strong> By choosing appropriately normalised conventions, one can avoid some calibration duties altogether.  For instance, when using Fourier analysis on a finite group such as <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}/N{&#92;Bbb Z}' title='{&#92;Bbb Z}/N{&#92;Bbb Z}' class='latex' />, if one expects to be analysing functions that are close to constant (or subsets of the group of positive density), then it is natural to endow physical space with normalised counting measure (and thus, by Pontryagin duality, frequency space should be given non-normalised counting measure).  [Conversely, if one is analysing functions concentrated on only a bounded number of points, then it may be more convenient to give physical space counting measure and frequency space normalised counting measure.]  In practical terms, this means that any physical space sum, such as <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bx+%5Cin+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D%7D+f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{x &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(x)' title='&#92;sum_{x &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(x)' class='latex' />, should instead be replaced with a physical space average <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D_%7Bx+%5Cin+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D%7D+f%28x%29+%3D+%5Cfrac%7B1%7D%7BN%7D+%5Csum_%7Bx+%5Cin+%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D%7D+f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}_{x &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(x) = &#92;frac{1}{N} &#92;sum_{x &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(x)' title='{&#92;Bbb E}_{x &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(x) = &#92;frac{1}{N} &#92;sum_{x &#92;in {&#92;Bbb Z}/N{&#92;Bbb Z}} f(x)' class='latex' />, while keeping sums over frequency space variables unchanged; when one does so, all powers of N &#8220;miraculously&#8221; disappear, and there is no longer any need to calibrate using the constant function 1 as was done above. Of course, this does not eliminate the need to perform other calibrations, such as that of the coefficients a,b,c above. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[The Kakeya conjecture and the Ham Sandwich theorem]]></title>
<link>http://terrytao.wordpress.com/2008/11/27/the-kakeya-conjecture-and-the-ham-sandwich-theorem/</link>
<pubDate>Thu, 27 Nov 2008 23:35:22 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/11/27/the-kakeya-conjecture-and-the-ham-sandwich-theorem/</guid>
<description><![CDATA[One of my favourite family of conjectures (and one that has preoccupied a significant fraction of my]]></description>
<content:encoded><![CDATA[<p>One of my favourite family of conjectures (and one that has preoccupied a <a class="snap_noshots" href="http://www.math.ucla.edu/~tao/preprints/kakeya.html">significant fraction</a> of my own research) is the family of <a href="http://en.wikipedia.org/wiki/Kakeya_conjecture#Kakeya_conjecture">Kakeya conjectures</a> in geometric measure theory and harmonic analysis.  There are many (not quite equivalent) conjectures in this family.  The cleanest one to state is the set conjecture:</p>
<blockquote><p><a href="http://en.wikipedia.org/wiki/Kakeya_conjecture#Kakeya_conjecture"><strong>Kakeya set conjecture</strong>:</a> Let <img src='http://s0.wp.com/latex.php?latex=n+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;geq 1' title='n &#92;geq 1' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=E+%5Csubset+%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E &#92;subset {&#92;Bbb R}^n' title='E &#92;subset {&#92;Bbb R}^n' class='latex' /> contain a unit line segment in every direction (such sets are known as <em>Kakeya sets</em> or <em>Besicovitch sets</em>).  Then E has <a href="http://en.wikipedia.org/wiki/Hausdorff_dimension">Hausdorff dimension</a> and <a href="http://en.wikipedia.org/wiki/Minkowski_dimension">Minkowski dimension </a>equal to n.</p></blockquote>
<p>One reason why I find these conjectures fascinating is the sheer variety of mathematical fields that arise both in the partial results towards this conjecture, and in the applications of those results to other problems.  See for instance <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1660476">this survey of Wolff</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1820041">my Notices article</a> and <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2358378">this article of Łaba</a> on the connections between this problem and other problems in Fourier analysis, PDE, and additive combinatorics; there have even been some connections <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1863693">to number theory</a> and <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2206765">to cryptography</a>.  At the other end of the pipeline, the mathematical tools that have gone <em>into</em> the proofs of various partial results have included:</p>
<ul>
<li>Maximal functions, covering lemmas, <img src='http://s0.wp.com/latex.php?latex=L%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L^2' title='L^2' class='latex' /> methods (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=447949">Cordoba</a>, Strömberg, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=433117">Cordoba-Fefferman</a>);</li>
<li>Fourier analysis (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=466470">Nagel-Stein-Wainger</a>);</li>
<li>Multilinear integration (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=684547">Drury</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=763948">Christ</a>)</li>
<li>Paraproducts (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1711029">Katz</a>);</li>
<li>Combinatorial incidence geometry (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1097257">Bourgain</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1363209">Wolff</a>);</li>
<li>Multi-scale analysis (<a class="snap_noshots" href="http://www.ams.org/mathscinet/search/publdoc.html?pg1=MR&#38;s1=1393383&#38;loc=fromreflist">Barrionuevo</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1804528">Katz-Łaba-Tao</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1866801">Łaba-Tao</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1960122">Alfonseca-Soria-Vargas</a>);</li>
<li>Probabilistic constructions (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2367175">Bateman-Katz</a>, <a class="snap_noshots" href="http://arxiv.org/abs/math/0703559">Bateman</a>);</li>
<li>Additive combinatorics and graph theory (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1692486">Bourgain</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1804528">Katz-Łaba-Tao</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1739220">Katz-Tao</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1945284">Katz-Tao</a>);</li>
<li>Sum-product theorems (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2053599">Bourgain-Katz-Tao</a>);</li>
<li>Bilinear estimates (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1625056">Tao-Vargas-Vega</a>);</li>
<li>Perron trees (Perron, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=146698">Schoenberg</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1664129">Keich</a>);</li>
<li>Group theory (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2157367">Katz</a>);</li>
<li>Low-degree algebraic geometry (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1631271">Schlag</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2225076">Tao</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2031165">Mockenhaupt-Tao</a>);</li>
<li>High-degree algebraic geometry (<a class="snap_noshots" href="http://arxiv.org/abs/0803.2336">Dvir</a>, <a class="snap_noshots" href="http://arxiv.org/abs/0808.2499">Saraf-Sudan</a>);</li>
<li>Heat flow monotonicity formulae (<a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2275834">Bennett-Carbery-Tao</a>)</li>
</ul>
<p>[This list is not exhaustive.]</p>
<p>Very recently, I was pleasantly surprised to see yet another mathematical tool used to obtain new progress on the Kakeya conjecture, namely (a generalisation of) the famous <a href="http://en.wikipedia.org/wiki/Ham_sandwich_theorem">Ham Sandwich theorem</a> from algebraic topology.  This was recently used <a class="snap_noshots" href="http://arxiv.org/abs/0811.2251">by Guth</a> to establish a certain endpoint multilinear Kakeya estimate left open by the work <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2275834">of Bennett, Carbery, and myself</a>.  With regards to the Kakeya set conjecture, Guth&#8217;s arguments assert, roughly speaking, that the only Kakeya sets that can fail to have full dimension are those which obey a certain &#8220;planiness&#8221; property, which informally means that the line segments that pass through a typical point in the set must be essentially coplanar.  (This property first surfaced in my <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1804528">paper with Katz and Łaba</a>.)  Guth&#8217;s arguments can be viewed as a partial analogue of <a class="snap_noshots" href="http://arxiv.org/abs/0803.2336">Dvir&#8217;s arguments</a> in the finite field setting (which I discussed in <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/03/24/dvirs-proof-of-the-finite-field-kakeya-conjecture/">this blog post</a>) to the Euclidean setting; in particular, both arguments rely crucially on the ability to create a polynomial of controlled degree that vanishes at or near a large number of points.  Unfortunately, while these arguments fully settle the Kakeya conjecture in the finite field setting, it appears that some new ideas are still needed to finish off the problem in the Euclidean setting.  Nevertheless this is an interesting new development in the long history of this conjecture, in particular demonstrating that the polynomial method can be successfully applied to continuous Euclidean problems (i.e. it is not confined to the finite field setting).</p>
<p>In this post I would like to sketch some of the key ideas in Guth&#8217;s paper, in particular the role of the Ham Sandwich theorem (or more precisely, a polynomial generalisation of this theorem first observed <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1978494">by Gromov</a>).</p>
<p style="text-align:center;"><!--more-->&#8211; The polynomial Ham Sandwich theorem &#8211;</p>
<p>Let us first recall the classical Ham Sandwich theorem:</p>
<blockquote><p><a href="http://en.wikipedia.org/wiki/Ham_sandwich_theorem"><strong>Ham Sandwich theorem. </strong></a> Let <img src='http://s0.wp.com/latex.php?latex=U_1%2C+%5Cldots%2C+U_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_1, &#92;ldots, U_n' title='U_1, &#92;ldots, U_n' class='latex' /> be n bounded open sets in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' />.  Then there exists a <a href="http://en.wikipedia.org/wiki/Hyperplane">hyperplane</a> in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' /> that divides each of the open sets <img src='http://s0.wp.com/latex.php?latex=U_1%2C%5Cldots%2CU_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_1,&#92;ldots,U_n' title='U_1,&#92;ldots,U_n' class='latex' /> into two sets of equal volume.</p></blockquote>
<p>(The name of the theorem derives from the special case when <img src='http://s0.wp.com/latex.php?latex=n%3D3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n=3' title='n=3' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=U_1%2CU_2%2CU_3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_1,U_2,U_3' title='U_1,U_2,U_3' class='latex' /> are two slices of bread and a slice of ham.  One can view this theorem as a &#8220;thickened&#8221; version of the Euclidean geometry axiom that every n points in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' /> determine at least one hyperplane.)</p>
<p>There are many proofs of this theorem, but I will focus on the proof that is based on the Borsuk-Ulam theorem:</p>
<blockquote><p><a href="http://en.wikipedia.org/wiki/Borsuk%E2%80%93Ulam_theorem"><strong>Borsuk-Ulam theorem</strong></a>: Let <img src='http://s0.wp.com/latex.php?latex=f%3A+S%5En+%5Cto+%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: S^n &#92;to {&#92;Bbb R}^n' title='f: S^n &#92;to {&#92;Bbb R}^n' class='latex' /> be a continuous map from the n-dimensional sphere <img src='http://s0.wp.com/latex.php?latex=S%5En+%5Csubset+%7B%5CBbb+R%7D%5E%7Bn%2B1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^n &#92;subset {&#92;Bbb R}^{n+1}' title='S^n &#92;subset {&#92;Bbb R}^{n+1}' class='latex' /> to the Euclidean space <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' /> which is antipodal (which means that <img src='http://s0.wp.com/latex.php?latex=f%28-x%29%3D-f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(-x)=-f(x)' title='f(-x)=-f(x)' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+S%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;in S^n' title='x &#92;in S^n' class='latex' />.  Then <img src='http://s0.wp.com/latex.php?latex=f%28x%29%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x)=0' title='f(x)=0' class='latex' /> for at least one <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+S%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;in S^n' title='x &#92;in S^n' class='latex' />.</p></blockquote>
<p><strong>Proof.</strong> (Sketch)  The set of zeroes of an antipodal map automatically come in antipodal pairs x,-x.   To prove the theorem, we shall establish the stronger fact that <img src='http://s0.wp.com/latex.php?latex=f%28x%29%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x)=0' title='f(x)=0' class='latex' /> for an odd number of disjoint antipodal pairs, counting multiplicity (avoiding the degenerate antipodal maps which vanish at an infinite set of points).  To see this, first observe that this is true for at least one antipodal map (e.g. one can use the horizontal projection map <img src='http://s0.wp.com/latex.php?latex=%28x_1%2C%5Cldots%2Cx_%7Bn%2B1%7D%29+%5Cmapsto+%28x_1%2C%5Cldots%2Cx_n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x_1,&#92;ldots,x_{n+1}) &#92;mapsto (x_1,&#92;ldots,x_n)' title='(x_1,&#92;ldots,x_{n+1}) &#92;mapsto (x_1,&#92;ldots,x_n)' class='latex' />).  Also, the space of all antipodal maps is a vector space, and thus connected (though it takes some effort to show that the space of <em>non-degenerate</em> antipodal maps is still connected).  So one just needs to show that the parity of the number of pairs of antipodal points where f vanishes (counting multiplicity) is unchanged with respect to continuous deformations of f.  But some elementary <a href="http://en.wikipedia.org/wiki/Degree_(continuous_map)">degree</a> theory (or <a href="http://en.wikipedia.org/wiki/Morse_theory">Morse theory</a>) shows that any (non-degenerate) perturbation of f can annihilate two such antipodal pairs by collision, or (by the reverse procedure) spontaneously create two such antipodal pairs from nothing, but cannot otherwise affect the number of pairs; thus the parity of the number of such pairs remains invariant.  (It takes some non-trivial effort to make this informal argument rigorous; see for instance <a class="snap_noshots" href="http://kam.mff.cuni.cz/~matousek/bu-zhbu.ps.gz">this chapter</a> of <a class="snap_noshots" href="http://kam.mff.cuni.cz/~matousek/akt.html">Matousek&#8217;s book </a>on the Borsuk-Ulam theorem, which also contains a number of other proofs of this result.  [Thanks to Benny Sudakov for this great reference.] One can also formalise this argument using the language of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}_2' title='{&#92;Bbb Z}_2' class='latex' /> <a href="http://en.wikipedia.org/wiki/Singular_cohomology">singular cohomology</a>.) <img src='http://s0.wp.com/latex.php?latex=%5CBox&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box' title='&#92;Box' class='latex' /></p>
<p><strong>Remark 1.</strong> The Borsuk-Ulam theorem is tied to the more general theory of <a href="http://en.wikipedia.org/wiki/Lyusternik-Schnirelmann_category">Lyusternik-Schnirelmann category</a>, which is the viewpoint taken in Guth&#8217;s paper, but we will not explicitly use this theory here. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Proof of the Ham-Sandwich theorem using the Borsuk-Ulam theorem.</strong> We can identify <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%7Bn%2B1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^{n+1}' title='{&#92;Bbb R}^{n+1}' class='latex' /> with the space of affine-linear forms <img src='http://s0.wp.com/latex.php?latex=%28x_1%2C%5Cldots%2Cx_n%29+%5Cmapsto+a_1+x_1+%2B+%5Cldots+%2B+a_n+x_n+%2B+a_0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x_1,&#92;ldots,x_n) &#92;mapsto a_1 x_1 + &#92;ldots + a_n x_n + a_0' title='(x_1,&#92;ldots,x_n) &#92;mapsto a_1 x_1 + &#92;ldots + a_n x_n + a_0' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' />.  Each non-trivial affine-linear form <img src='http://s0.wp.com/latex.php?latex=P+%5Cin+%7B%5CBbb+R%7D%5E%7Bn%2B1%7D+%5Cbackslash+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P &#92;in {&#92;Bbb R}^{n+1} &#92;backslash 0' title='P &#92;in {&#92;Bbb R}^{n+1} &#92;backslash 0' class='latex' /> determines a hyperplane <img src='http://s0.wp.com/latex.php?latex=%5C%7BP%3D0%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{P=0&#92;}' title='&#92;{P=0&#92;}' class='latex' /> that divides <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' /> into two half-spaces <img src='http://s0.wp.com/latex.php?latex=%5C%7B+P+%26%2362%3B+0+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ P &gt; 0 &#92;}' title='&#92;{ P &gt; 0 &#92;}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5C%7BP+%26%2360%3B+0%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{P &lt; 0&#92;}' title='&#92;{P &lt; 0&#92;}' class='latex' />.  We can then define <img src='http://s0.wp.com/latex.php?latex=f%3A+%7B%5CBbb+R%7D%5E%7Bn%2B1%7D+%5Cbackslash+%5C%7B0%5C%7D+%5Cto+%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: {&#92;Bbb R}^{n+1} &#92;backslash &#92;{0&#92;} &#92;to {&#92;Bbb R}^n' title='f: {&#92;Bbb R}^{n+1} &#92;backslash &#92;{0&#92;} &#92;to {&#92;Bbb R}^n' class='latex' /> to be the function whose <img src='http://s0.wp.com/latex.php?latex=j%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='j^{th}' title='j^{th}' class='latex' /> coordinate <img src='http://s0.wp.com/latex.php?latex=f_j%28P%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_j(P)' title='f_j(P)' class='latex' /> at P is the volume of <img src='http://s0.wp.com/latex.php?latex=U_j+%5Ccap+%5C%7B+P+%26%2362%3B+0+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_j &#92;cap &#92;{ P &gt; 0 &#92;}' title='U_j &#92;cap &#92;{ P &gt; 0 &#92;}' class='latex' /> minus the volume of <img src='http://s0.wp.com/latex.php?latex=U_j+%5Ccap+%5C%7B+P+%26%2360%3B+0+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_j &#92;cap &#92;{ P &lt; 0 &#92;}' title='U_j &#92;cap &#92;{ P &lt; 0 &#92;}' class='latex' />; thus f measures the extent to which the hyperplane <img src='http://s0.wp.com/latex.php?latex=%5C%7BP%3D0%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{P=0&#92;}' title='&#92;{P=0&#92;}' class='latex' /> fails to bisect all of the <img src='http://s0.wp.com/latex.php?latex=U_1%2C%5Cldots%2CU_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_1,&#92;ldots,U_n' title='U_1,&#92;ldots,U_n' class='latex' />.  It is easy to see that f is continuous, homogeneous of degree zero, and odd, and so its restriction to <img src='http://s0.wp.com/latex.php?latex=S%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^n' title='S^n' class='latex' /> is an antipodal map.  By the Borsuk-Ulam theorem, there exists P such that <img src='http://s0.wp.com/latex.php?latex=f%28P%29%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(P)=0' title='f(P)=0' class='latex' />, and the claim follows. <img src='http://s0.wp.com/latex.php?latex=%5CBox&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box' title='&#92;Box' class='latex' /></p>
<p><a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1978494">Gromov observed</a> the following polynomial generalisation of the Ham Sandwich theorem:</p>
<blockquote><p><strong>Polynomial Ham Sandwich theorem.</strong> Let <img src='http://s0.wp.com/latex.php?latex=d+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &#92;geq 1' title='d &#92;geq 1' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=U_1%2C+%5Cldots%2C+U_%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D+-+1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_1, &#92;ldots, U_{&#92;binom{n+d}{d} - 1}' title='U_1, &#92;ldots, U_{&#92;binom{n+d}{d} - 1}' class='latex' /> be bounded open sets in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' />.  Then there exists a non-trivial polynomial <img src='http://s0.wp.com/latex.php?latex=P%3A+%7B%5CBbb+R%7D%5En+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P: {&#92;Bbb R}^n &#92;to {&#92;Bbb R}' title='P: {&#92;Bbb R}^n &#92;to {&#92;Bbb R}' class='latex' /> of degree at most d such that the sets <img src='http://s0.wp.com/latex.php?latex=%5C%7B+P+%26%2362%3B+0+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ P &gt; 0 &#92;}' title='&#92;{ P &gt; 0 &#92;}' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%5C%7B+P+%26%2360%3B+0+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ P &lt; 0 &#92;}' title='&#92;{ P &lt; 0 &#92;}' class='latex' /> partition each of the <img src='http://s0.wp.com/latex.php?latex=U_1%2C%5Cldots%2CU_%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_1,&#92;ldots,U_{&#92;binom{n+d}{d}-1}' title='U_1,&#92;ldots,U_{&#92;binom{n+d}{d}-1}' class='latex' /> into two sets of equal measure.</p></blockquote>
<p>Note that the ordinary Ham-Sandwich theorem corresponds to the d=1 case of this theorem.  This theorem can be deduced from the Borsuk-Ulam theorem in exactly the same way that the ordinary one is (note that the space of polynomials of degree at most d has dimension <img src='http://s0.wp.com/latex.php?latex=%5Cbinom%7Bn%2Bd%7D%7Bd%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;binom{n+d}{d}' title='&#92;binom{n+d}{d}' class='latex' />; the continuity of the appropriate antipodal function <img src='http://s0.wp.com/latex.php?latex=f%3A+S%5E%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1%7D+%5Cto+%7B%5CBbb+R%7D%5E%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: S^{&#92;binom{n+d}{d}-1} &#92;to {&#92;Bbb R}^{&#92;binom{n+d}{d}-1}' title='f: S^{&#92;binom{n+d}{d}-1} &#92;to {&#92;Bbb R}^{&#92;binom{n+d}{d}-1}' class='latex' /> follows from the <a href="http://en.wikipedia.org/wiki/Dominated_convergence_theorem">dominated convergence theorem</a> and the basic observation that a non-trivial polynomial is non-zero almost everywhere).</p>
<p><strong>Remark 2. </strong> One can also deduce the polynomial Ham Sandwich theorem directly from the ordinary Ham Sandwich theorem (in <img src='http://s0.wp.com/latex.php?latex=%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;binom{n+d}{d}-1' title='&#92;binom{n+d}{d}-1' class='latex' /> dimensions) by embedding <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' /> into <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^{&#92;binom{n+d}{d}-1}' title='{&#92;Bbb R}^{&#92;binom{n+d}{d}-1}' class='latex' /> via the <a href="http://en.wikipedia.org/wiki/Veronese_surface">Veronese embedding</a>, and then thickening the images of <img src='http://s0.wp.com/latex.php?latex=U_1%2C%5Cldots%2CU_%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_1,&#92;ldots,U_{&#92;binom{n+d}{d}-1}' title='U_1,&#92;ldots,U_{&#92;binom{n+d}{d}-1}' class='latex' /> slightly in an appropriate fashion; we leave the details as an exercise to the reader.   <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p>The polynomial Ham Sandwich theorem should be compared with the following finitary counterpart, which morally corresponds to the case when all the <img src='http://s0.wp.com/latex.php?latex=U_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U_j' title='U_j' class='latex' /> are points (but works over arbitrary fields F), and which was a key tool in <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/03/24/dvirs-proof-of-the-finite-field-kakeya-conjecture/">Dvir&#8217;s proof of the Kakeya conjecture in finite fields</a>:</p>
<blockquote><p><strong>Lemma.</strong> Let <img src='http://s0.wp.com/latex.php?latex=d+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &#92;geq 1' title='d &#92;geq 1' class='latex' /> and let <img src='http://s0.wp.com/latex.php?latex=u_1%2C%5Cldots%2Cu_%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u_1,&#92;ldots,u_{&#92;binom{n+d}{d}-1}' title='u_1,&#92;ldots,u_{&#92;binom{n+d}{d}-1}' class='latex' /> be points in <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> for some field F.  Then there exists a non-trivial polynomial <img src='http://s0.wp.com/latex.php?latex=P%3A+F%5En+%5Cto+F&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P: F^n &#92;to F' title='P: F^n &#92;to F' class='latex' /> of degree at most d which vanishes on all of <img src='http://s0.wp.com/latex.php?latex=u_1%2C%5Cldots%2Cu_%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u_1,&#92;ldots,u_{&#92;binom{n+d}{d}-1}' title='u_1,&#92;ldots,u_{&#92;binom{n+d}{d}-1}' class='latex' />.</p></blockquote>
<p><strong>Proof.</strong> The evaluation map <img src='http://s0.wp.com/latex.php?latex=P+%5Cmapsto+%28P%28u_1%29%2C%5Cldots%2CP%28u_%7B%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P &#92;mapsto (P(u_1),&#92;ldots,P(u_{&#92;binom{n+d}{d}-1})' title='P &#92;mapsto (P(u_1),&#92;ldots,P(u_{&#92;binom{n+d}{d}-1})' class='latex' /> is a linear map from a <img src='http://s0.wp.com/latex.php?latex=%5Cbinom%7Bn%2Bd%7D%7Bd%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;binom{n+d}{d}' title='&#92;binom{n+d}{d}' class='latex' />-dimensional space to a <img src='http://s0.wp.com/latex.php?latex=%5Cbinom%7Bn%2Bd%7D%7Bd%7D-1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;binom{n+d}{d}-1' title='&#92;binom{n+d}{d}-1' class='latex' />-dimensional space, and must therefore have a non-trivial kernel. <img src='http://s0.wp.com/latex.php?latex=%5CBox&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box' title='&#92;Box' class='latex' /></p>
<p style="text-align:center;">&#8211; Connection with the Kakeya problem &#8211;</p>
<p>Now we connect the polynomial Ham Sandwich theorem to the Kakeya problem.  We begin by replacing the continuous Kakeya set conjecture with a more quantitative &#8220;<img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />-discretised&#8221; problem:</p>
<blockquote><p><a href="http://en.wikipedia.org/wiki/Kakeya_conjecture#Kakeya_maximal_function"><strong>Kakeya maximal conjecture.</strong></a> Let <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+%5Cdelta+%26%2360%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; &#92;delta &lt; 1' title='0 &lt; &#92;delta &lt; 1' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=T_1%2C+%5Cldots%2C+T_M&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_1, &#92;ldots, T_M' title='T_1, &#92;ldots, T_M' class='latex' /> be a collection of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%5Ctimes+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &#92;times 1' title='&#92;delta &#92;times 1' class='latex' /> cylindrical tubes pointing in a <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />-separated set of directions (thus the directions of any two of the tubes make an angle of at least <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />).  For each <img src='http://s0.wp.com/latex.php?latex=%5Cmu+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu &#92;geq 1' title='&#92;mu &#92;geq 1' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=E_%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E_&#92;mu' title='E_&#92;mu' class='latex' /> be the set of points x which are contained in at least <img src='http://s0.wp.com/latex.php?latex=%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu' title='&#92;mu' class='latex' /> of the tubes <img src='http://s0.wp.com/latex.php?latex=T_1%2C%5Cldots%2CT_M&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_1,&#92;ldots,T_M' title='T_1,&#92;ldots,T_M' class='latex' />.  Then the volume <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE_%5Cmu%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E_&#92;mu&#124;' title='&#124;E_&#92;mu&#124;' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=E_%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E_&#92;mu' title='E_&#92;mu' class='latex' /> obeys the bound <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE_%5Cmu%26%23124%3B+%5Clesssim_%7B%5Cvarepsilon%7D+%5Cdelta%5E%7B-%5Cvarepsilon%7D+%5Cmu%5E%7B-n%2F%28n-1%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E_&#92;mu&#124; &#92;lesssim_{&#92;varepsilon} &#92;delta^{-&#92;varepsilon} &#92;mu^{-n/(n-1)}' title='&#124;E_&#92;mu&#124; &#92;lesssim_{&#92;varepsilon} &#92;delta^{-&#92;varepsilon} &#92;mu^{-n/(n-1)}' class='latex' /> for any <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon &gt; 0' title='&#92;varepsilon &gt; 0' class='latex' />.</p></blockquote>
<p>Here we are using the asymptotic notation that <img src='http://s0.wp.com/latex.php?latex=X+%5Cgtrsim+Y&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;gtrsim Y' title='X &#92;gtrsim Y' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=X+%5Cgeq+cY&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;geq cY' title='X &#92;geq cY' class='latex' /> for some positive constant c (if the <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim' title='&#92;gtrsim' class='latex' /> is subscripted by parameters, this indicates that c is allowed to depend on those parameters); we always allow constants to depend on the dimension n.  This conjecture (which is limiting the extent to which tubes in different directions can overlap) implies the Kakeya set conjecture (for both Minkowski and Hausdorff dimension) by fairly standard arguments from geometric measure theory, see e.g. <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1097257">this paper of Bourgain</a> or <a class="snap_noshots" href="http://www.math.ucla.edu/%7Etao/254b.1.99s/notes5.dvi">these lecture notes of myself</a>. The factor of <img src='http://s0.wp.com/latex.php?latex=%5Cmu%5E%7B-n%2F%28n-1%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu^{-n/(n-1)}' title='&#92;mu^{-n/(n-1)}' class='latex' /> is natural (and best possible), as can be seen by considering the example in which <img src='http://s0.wp.com/latex.php?latex=M+%5Csim+%5Cdelta%5E%7B1-n%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='M &#92;sim &#92;delta^{1-n}' title='M &#92;sim &#92;delta^{1-n}' class='latex' /> and all the tubes pass through a common point.</p>
<p>[The name "<a href="http://en.wikipedia.org/wiki/Kakeya_conjecture#Kakeya_maximal_function">maximal conjecture</a>" has to do with the formulation of the above conjecture involving the Kakeya maximal function, which I will not discuss here.]</p>
<p>The maximal conjecture (and the set conjecture) is verified in the two-dimensional case <img src='http://s0.wp.com/latex.php?latex=n%3D2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n=2' title='n=2' class='latex' /> (with the one-dimensional case <img src='http://s0.wp.com/latex.php?latex=n%3D1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n=1' title='n=1' class='latex' /> being trivial), but only partial results are known in higher dimensions.  However, one can do better if one only considers certain types of overlap.  Let us say (somewhat informally) that a point x has <em>non-planar multiplicity</em> <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;mu' title='&#92;gtrsim &#92;mu' class='latex' /> with respect to a given collection of tubes <img src='http://s0.wp.com/latex.php?latex=T_1%2C%5Cldots%2CT_M&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_1,&#92;ldots,T_M' title='T_1,&#92;ldots,T_M' class='latex' /> if there exist <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n' title='n' class='latex' /> separate families of <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;mu' title='&#92;gtrsim &#92;mu' class='latex' /> tubes each passing through x, such that given any <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n' title='n' class='latex' /> tubes from each of these three families, the solid angle between the <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n' title='n' class='latex' /> directions is comparable to 1.  (Informally, this is a stronger assertion than saying that x has <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;mu' title='&#92;gtrsim &#92;mu' class='latex' /> tubes passing through it, because we prohibit these tubes from being essentially contained in a hyperplane).  Then, as a special case of Guth&#8217;s results, one has</p>
<blockquote><p><strong>Multilinear Kakeya conjecture (special case):</strong> Let <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%2C+n%2C+T_1%2C%5Cldots%2CT_M%2C+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta, n, T_1,&#92;ldots,T_M, &#92;mu' title='&#92;delta, n, T_1,&#92;ldots,T_M, &#92;mu' class='latex' /> be as in the Kakeya maximal conjecture, and let <img src='http://s0.wp.com/latex.php?latex=E_%5Cmu%5E%2A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E_&#92;mu^*' title='E_&#92;mu^*' class='latex' /> be the set of points with non-planar multiplicity <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;mu' title='&#92;gtrsim &#92;mu' class='latex' />.  Then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE_%5Cmu%5E%2A%26%23124%3B+%5Clesssim+%5Cmu%5E%7B-n%2F%28n-1%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E_&#92;mu^*&#124; &#92;lesssim &#92;mu^{-n/(n-1)}' title='&#124;E_&#92;mu^*&#124; &#92;lesssim &#92;mu^{-n/(n-1)}' class='latex' />.</p></blockquote>
<p>Informally, this implies that the only counterexamples to the Kakeya maximal conjecture can come from configurations of tubes such that the tubes that pass through a typical point largely lie in a hyperplane.  A previous <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2275834">paper of Bennett, Carbery, and myself</a> established this estimate with an additional loss of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5E%5Cvarepsilon&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta^&#92;varepsilon' title='&#92;delta^&#92;varepsilon' class='latex' /> by a totally different method (based on heat flow monotonicity formulae).  For a precise statement of the full multilinear Kakeya conjecture (which is now proven without any epsilon loss), I refer you to that paper (or the <a class="snap_noshots" href="http://arxiv.org/abs/0811.2251">paper of Guth</a>).</p>
<p>Let&#8217;s now sketch why the above result is true (details can be found in the <a class="snap_noshots" href="http://arxiv.org/abs/0811.2251">paper of Guth</a>).  I&#8217;ll drop the dependence of implied constants on n.  Let <img src='http://s0.wp.com/latex.php?latex=x_1%2C%5Cldots%2Cx_A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_1,&#92;ldots,x_A' title='x_1,&#92;ldots,x_A' class='latex' /> be a maximal <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />-net of <img src='http://s0.wp.com/latex.php?latex=E%5E%2A_%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E^*_&#92;mu' title='E^*_&#92;mu' class='latex' /> (i.e. a set of <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />-separated points in <img src='http://s0.wp.com/latex.php?latex=E%5E%2A_%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E^*_&#92;mu' title='E^*_&#92;mu' class='latex' /> that is maximal with respect to set inclusion), then it will suffice to show that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=A+%5Clesssim+%5Cdelta%5E%7B-n%7D+%5Cmu%5E%7B-n%2F%28n-1%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;lesssim &#92;delta^{-n} &#92;mu^{-n/(n-1)}' title='A &#92;lesssim &#92;delta^{-n} &#92;mu^{-n/(n-1)}' class='latex' /> (1).</p>
<p>Let <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> be the cube of sidelength <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> centred at <img src='http://s0.wp.com/latex.php?latex=x_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_j' title='x_j' class='latex' /> with sides parallel to the axes.  Applying the polynomial Ham Sandwich theorem, we can find a non-trivial polynomial P of degree <img src='http://s0.wp.com/latex.php?latex=O%28+A%5E%7B1%2Fn%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O( A^{1/n})' title='O( A^{1/n})' class='latex' /> whose zero locus <img src='http://s0.wp.com/latex.php?latex=V+%3A%3D+%5C%7BP%3D0%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V := &#92;{P=0&#92;}' title='V := &#92;{P=0&#92;}' class='latex' /> bisects each of the cubes <img src='http://s0.wp.com/latex.php?latex=Q_1%2C%5Cldots%2CQ_A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_1,&#92;ldots,Q_A' title='Q_1,&#92;ldots,Q_A' class='latex' />.</p>
<p>For each j, we claim that the hypersurfaces <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap Q_j' title='V &#92;cap Q_j' class='latex' /> have surface area <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Cdelta%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;delta^{n-1}' title='&#92;gtrsim &#92;delta^{n-1}' class='latex' />.  Indeed, if instead one of the <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap Q_j' title='V &#92;cap Q_j' class='latex' /> had surface area <img src='http://s0.wp.com/latex.php?latex=o%28%5Cdelta%5E%7Bn-1%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='o(&#92;delta^{n-1})' title='o(&#92;delta^{n-1})' class='latex' />, this would imply that the projection of <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap Q_j' title='V &#92;cap Q_j' class='latex' /> to any (n-1)-dimensional coordinate subspace of <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> has area <img src='http://s0.wp.com/latex.php?latex=o%28%5Cdelta%5E%7Bn-1%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='o(&#92;delta^{n-1})' title='o(&#92;delta^{n-1})' class='latex' />, in contrast with the projection of <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> itself which has area <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta^{n-1}' title='&#92;delta^{n-1}' class='latex' />.  Thus for each <img src='http://s0.wp.com/latex.php?latex=1+%5Cleq+i+%5Cleq+n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 &#92;leq i &#92;leq n' title='1 &#92;leq i &#92;leq n' class='latex' /> the complement of V in <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> contains a subset of <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> of relative density <img src='http://s0.wp.com/latex.php?latex=1-o%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1-o(1)' title='1-o(1)' class='latex' /> that consists entirely of line segments of length <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> in the basis direction <img src='http://s0.wp.com/latex.php?latex=e_i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e_i' title='e_i' class='latex' />.  From this it is not hard to see that <img src='http://s0.wp.com/latex.php?latex=Q_j+%5Cbackslash+V&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j &#92;backslash V' title='Q_j &#92;backslash V' class='latex' /> contains a path-connected component of relative density <img src='http://s0.wp.com/latex.php?latex=1-o%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1-o(1)' title='1-o(1)' class='latex' />, which contradicts the claim that <img src='http://s0.wp.com/latex.php?latex=V&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V' title='V' class='latex' /> bisects <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' />.</p>
<p>On the other hand, we know that <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> meets <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;mu' title='&#92;gtrsim &#92;mu' class='latex' /> tubes <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' />, which are arranged in a non-planar fashion.  Because of this, one can show that for a &#8220;typical&#8221; tube <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> hitting <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' />, the projection of <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap Q_j' title='V &#92;cap Q_j' class='latex' /> to the orthogonal complement of the direction of <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> has area <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Cdelta%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;delta^{n-1}' title='&#92;gtrsim &#92;delta^{n-1}' class='latex' />.  (Basically, the point is that at any given point of <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap Q_j' title='V &#92;cap Q_j' class='latex' />, the normal vector cannot be perpendicular (or close to perpendicular) to all the directions of all the <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> simultaneously, due to non-planarity.)  To simplify the exposition, let us assume that in fact <em>all</em> tubes <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> touching <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> are typical.</p>
<p>Each <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> touches <img src='http://s0.wp.com/latex.php?latex=%5Csim+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sim &#92;mu' title='&#92;sim &#92;mu' class='latex' /> tubes <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> (they may touch more than this, but for sake of exposition let us suppose that they touch exactly this number of tubes).  By <a href="http://en.wikipedia.org/wiki/Double_counting_(proof_technique)">double counting</a>, this means that each tube <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> touches about</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Clambda+%3A%3D+A+%5Cmu+%2F+M+%5Cgtrsim+%5Cdelta%5E%7Bn-1%7D+A+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda := A &#92;mu / M &#92;gtrsim &#92;delta^{n-1} A &#92;mu' title='&#92;lambda := A &#92;mu / M &#92;gtrsim &#92;delta^{n-1} A &#92;mu' class='latex' /> (2)</p>
<p>cubes <img src='http://s0.wp.com/latex.php?latex=Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q_j' title='Q_j' class='latex' /> on the average, where the inequality in (2) comes from the <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />-separated directions of the tubes.  In particular, we can find a (typical) tube <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> which touches at least <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> such balls. Let <img src='http://s0.wp.com/latex.php?latex=v_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_k' title='v_k' class='latex' /> be the direction vector of <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' />.</p>
<p>Now look at <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap T_k' title='V &#92;cap T_k' class='latex' />.  (Technically, one has to replace <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> by a slight thickening of itself here, but let us ignore this technicality.)  This set contains <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;lambda' title='&#92;gtrsim &#92;lambda' class='latex' /> disjoint sets of the form <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap Q_j' title='V &#92;cap Q_j' class='latex' />.  Each of these sets, when projected to the orthogonal complement of <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' />, has measure <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Cdelta%5E%7Bn-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;delta^{n-1}' title='&#92;gtrsim &#92;delta^{n-1}' class='latex' />.  On the other hand, <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> itself, when projected to this complement, has a measure of <img src='http://s0.wp.com/latex.php?latex=O%28%5Cdelta%5E%7Bn-1%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(&#92;delta^{n-1})' title='O(&#92;delta^{n-1})' class='latex' />.  By the pigeonhole principle, we may thus find a positive measure family of lines <img src='http://s0.wp.com/latex.php?latex=%5Cell&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell' title='&#92;ell' class='latex' /> in the direction <img src='http://s0.wp.com/latex.php?latex=v_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_k' title='v_k' class='latex' /> passing through <img src='http://s0.wp.com/latex.php?latex=T_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T_k' title='T_k' class='latex' /> which intersect at  <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;lambda' title='&#92;gtrsim &#92;lambda' class='latex' /> of the <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+Q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap Q_j' title='V &#92;cap Q_j' class='latex' />.  In particular, all lines <img src='http://s0.wp.com/latex.php?latex=%5Cell&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell' title='&#92;ell' class='latex' /> in this family intersect V in <img src='http://s0.wp.com/latex.php?latex=%5Cgtrsim+%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gtrsim &#92;lambda' title='&#92;gtrsim &#92;lambda' class='latex' /> different points.</p>
<p>On the other hand, the restriction of P to <img src='http://s0.wp.com/latex.php?latex=%5Cell&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell' title='&#92;ell' class='latex' /> is a polynomial of degree <img src='http://s0.wp.com/latex.php?latex=O%28+A%5E%7B1%2Fn%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O( A^{1/n} )' title='O( A^{1/n} )' class='latex' />.  If this degree is much less than <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' />, this forces P to vanish on each line <img src='http://s0.wp.com/latex.php?latex=%5Cell&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;ell' title='&#92;ell' class='latex' /> [cf. <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/03/24/dvirs-proof-of-the-finite-field-kakeya-conjecture/">Dvir's argument</a>]; since the set of such lines has positive measure, this forces P to be identically zero, a contradiction.  Hence we must have</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=A%5E%7B1%2Fn%7D+%5Cgtrsim+%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A^{1/n} &#92;gtrsim &#92;lambda' title='A^{1/n} &#92;gtrsim &#92;lambda' class='latex' /></p>
<p>which when combined with (2), gives (1).</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Marker lecture IV: "Sieving for almost primes and expanders"]]></title>
<link>http://terrytao.wordpress.com/2008/11/20/marker-lecture-iv-sieving-for-almost-primes-and-expanders/</link>
<pubDate>Thu, 20 Nov 2008 21:09:00 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/11/20/marker-lecture-iv-sieving-for-almost-primes-and-expanders/</guid>
<description><![CDATA[In this final lecture in the Marker lecture series, I discuss the recent work of Bourgain, Gamburd,]]></description>
<content:encoded><![CDATA[<p>In this final lecture in the <a class="snap_noshots" href="http://www.math.psu.edu/seminars/marker2008.php">Marker lecture series</a>, I discuss the recent work of Bourgain, Gamburd, and Sarnak on how <a href="http://en.wikipedia.org/wiki/Arithmetic_combinatorics">arithmetic combinatorics</a> and <a href="http://en.wikipedia.org/wiki/Expander_graph">expander graphs</a> were used to <a href="http://en.wikipedia.org/wiki/Sieve_theory">sieve</a> for <a href="http://en.wikipedia.org/wiki/Almost_prime">almost primes</a> in various <a href="http://en.wikipedia.org/wiki/Algebraic_set">algebraic sets</a>.</p>
<p><!--more--></p>
<p>In previous lectures, we considered the problem of detecting tuples of primes in various linear or convex sets; in particular, we considered the size of sets of the form <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+%7B%5Cmathcal+P%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap {&#92;mathcal P}^k' title='V &#92;cap {&#92;mathcal P}^k' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+P%7D+%3D+%5C%7B2%2C3%2C5%2C%5Cldots%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal P} = &#92;{2,3,5,&#92;ldots&#92;}' title='{&#92;mathcal P} = &#92;{2,3,5,&#92;ldots&#92;}' class='latex' /> is the set of primes, and V is some <a href="http://en.wikipedia.org/wiki/Affine_space">affine subspace</a> of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^k' title='{&#92;Bbb R}^k' class='latex' />.  (For instance, the <a href="http://en.wikipedia.org/wiki/Twin_prime_conjecture">twin prime conjecture</a> would correspond to the case when k=2 and <img src='http://s0.wp.com/latex.php?latex=V+%3D+%5C%7B+%28x%2Cx%2B2%29%3A+x+%5Cin+%7B%5CBbb+R%7D%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V = &#92;{ (x,x+2): x &#92;in {&#92;Bbb R}&#92;}' title='V = &#92;{ (x,x+2): x &#92;in {&#92;Bbb R}&#92;}' class='latex' />, while the <a href="http://en.wikipedia.org/wiki/Green-Tao_theorem">Green-Tao theorem</a> would correspond to the case <img src='http://s0.wp.com/latex.php?latex=V+%3D+%5C%7B+%28x%2Cx%2Br%2C%5Cldots%2Cx%2B%28k-1%29r%29%3A+x%2Cr+%5Cin+%7B%5CBbb+R%7D%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V = &#92;{ (x,x+r,&#92;ldots,x+(k-1)r): x,r &#92;in {&#92;Bbb R}&#92;}' title='V = &#92;{ (x,x+r,&#92;ldots,x+(k-1)r): x,r &#92;in {&#92;Bbb R}&#92;}' class='latex' />.  We refer to elements of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+P%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal P}^k' title='{&#92;mathcal P}^k' class='latex' /> as <em>prime points</em>.  The <a class="snap_noshots" href="http://mathworld.wolfram.com/k-TupleConjecture.html">prime tuples conjecture</a> implies the following qualitative criterion for when such a set of prime points should be &#8220;large&#8221;:</p>
<blockquote><p><strong>Qualitative prime tuples conjecture.</strong> Let V be an affine subspace of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^k' title='{&#92;Bbb R}^k' class='latex' />.  Suppose that</p>
<ol>
<li>(No obstructions at infinity)  For any N, <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+%7B%5CBbb+Z%7D_%7B%26%2362%3BN%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap {&#92;Bbb Z}_{&gt;N}^k' title='V &#92;cap {&#92;Bbb Z}_{&gt;N}^k' class='latex' /> affinely spans all of V, where <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D_%7B%26%2362%3BN%7D+%3A%3D+%5C%7B+n+%5Cin+%7B%5CBbb+Z%7D%3A+n+%26%2362%3B+N+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}_{&gt;N} := &#92;{ n &#92;in {&#92;Bbb Z}: n &gt; N &#92;}' title='{&#92;Bbb Z}_{&gt;N} := &#92;{ n &#92;in {&#92;Bbb Z}: n &gt; N &#92;}' class='latex' />.  (In particular, <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+%7B%5CBbb+Z%7D_%7B%26%2362%3BN%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap {&#92;Bbb Z}_{&gt;N}^k' title='V &#92;cap {&#92;Bbb Z}_{&gt;N}^k' class='latex' /> is non-empty.)</li>
<li>(No obstructions at q)  For any <img src='http://s0.wp.com/latex.php?latex=q+%26%2362%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='q &gt; 1' title='q &gt; 1' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+%28%7B%5CBbb+Z%7D_q%5E%2A%29%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap ({&#92;Bbb Z}_q^*)^k' title='V &#92;cap ({&#92;Bbb Z}_q^*)^k' class='latex' /> affinely spans all of V, where <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D_q%5E%2A+%3A%3D+%5C%7B+n+%5Cin+%7B%5CBbb+Z%7D%3A+%28n%2Cq%29+%3D+1+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}_q^* := &#92;{ n &#92;in {&#92;Bbb Z}: (n,q) = 1 &#92;}' title='{&#92;Bbb Z}_q^* := &#92;{ n &#92;in {&#92;Bbb Z}: (n,q) = 1 &#92;}' class='latex' />.  (In particular,<img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+%28%7B%5CBbb+Z%7D_q%5E%2A%29%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap ({&#92;Bbb Z}_q^*)^k' title='V &#92;cap ({&#92;Bbb Z}_q^*)^k' class='latex' /> is non-empty.)</li>
</ol>
<p>Then <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+%7B%5Cmathcal+P%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap {&#92;mathcal P}^k' title='V &#92;cap {&#92;mathcal P}^k' class='latex' /> affinely spans all of V.  (In particular, V contains at least one prime point.)</p></blockquote>
<p>Both of the hypotheses in this conjecture are easily verified for any given V, the first by (integer) linear programming and the second by modular arithmetic.  This conjecture would imply several other results and conjectures in number theory, including the twin prime conjecture and the Green-Tao theorem.  Needless to say, it remains open in general (though the results mentioned in the previous lecture give partial results in the case when V is at least two-dimensional and non-degenerate).</p>
<p>Now we attempt to generalise the above conjecture to the setting in which V is an <a href="http://en.wikipedia.org/wiki/Algebraic_variety">algebraic variety</a> rather than an affine subspace.  (This would cover some famous open problems in number theory, for instance the <a href="http://en.wikipedia.org/wiki/Landau%27s_problems">Landau problem</a> that asks whether there are infinitely many primes of the form <img src='http://s0.wp.com/latex.php?latex=n%5E2%2B1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^2+1' title='n^2+1' class='latex' />.) The notion of a set affinely spanning V is then naturally replaced by the notion of a set being <a href="http://en.wikipedia.org/wiki/Zariski_topology">Zariski dense</a> in V, which means that the set is not contained in any strictly smaller subvariety of V.  One could then formulate a naive generalisation of the above conjecture by replacing &#8220;affine space&#8221; and &#8220;affinely spans all of&#8221; with &#8220;algebraic variety&#8221; and &#8220;is Zariski dense in&#8221; respectively.  However, the hypotheses are now no longer easy to verify; indeed, just the problem of determining whether V contains an integer point <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}^k' title='{&#92;Bbb Z}^k' class='latex' /> is essentially <a href="http://en.wikipedia.org/wiki/Hilbert%27s_tenth_problem">Hilbert&#8217;s tenth problem</a>, which by <a href="http://en.wikipedia.org/wiki/Matiyasevich%27s_theorem">Matiyasevich&#8217;s theorem</a> is known to be undecidable for general V.  Indeed, since one can encode any computable set in terms of the integer points of a variety V, it is not too difficult to see that this conjecture fails in general.  (An amusing historical connection here: one of the first demonstrations of the undecidability of Hilbert&#8217;s tenth problem by Robinson, Davis, and Putnam was conditional on the existence of arbitrarily long progressions of primes (i.e. the Green-Tao theorem), although subsequent proofs did not need this fact.)</p>
<p>Since arbitrary algebraic varieties are far too general to have any hope of a reasonable theory, one should look for prime points in much more special sets.  An important class here is that of an orbit <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b' title='&#92;Lambda b' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}^k' title='{&#92;Bbb Z}^k' class='latex' />, where b is some vector in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}^k' title='{&#92;Bbb Z}^k' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' /> is some finitely generated subgroup of <img src='http://s0.wp.com/latex.php?latex=SL_k%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_k({&#92;Bbb Z})' title='SL_k({&#92;Bbb Z})' class='latex' />.  (One can also consider the slightly more general set of images <img src='http://s0.wp.com/latex.php?latex=F%28%5CLambda+b%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F(&#92;Lambda b)' title='F(&#92;Lambda b)' class='latex' /> under a polynomial map, but for simplicity let us stick to just orbits.) Of course one should take b to be primitive (not a multiple of any smaller vector), since one clearly will have a difficult time finding prime points in <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b' title='&#92;Lambda b' class='latex' /> otherwise.</p>
<p>The orbit <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b' title='&#92;Lambda b' class='latex' /> will be Zariski dense in some algebraic variety V, and is clearly a collection of integer points (though it may not cover all of <img src='http://s0.wp.com/latex.php?latex=V+%5Ccap+%7B%5CBbb+Z%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V &#92;cap {&#92;Bbb Z}^k' title='V &#92;cap {&#92;Bbb Z}^k' class='latex' />).  Assuming no local obstructions at infinity or at q (which means that <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b+%5Ccap+%7B%5CBbb+Z%7D_%7B%26%2362%3BN%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b &#92;cap {&#92;Bbb Z}_{&gt;N}^k' title='&#92;Lambda b &#92;cap {&#92;Bbb Z}_{&gt;N}^k' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b+%5Ccap+%28%7B%5CBbb+Z%7D_q%5E%2A%29%5Ek%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b &#92;cap ({&#92;Bbb Z}_q^*)^k)' title='&#92;Lambda b &#92;cap ({&#92;Bbb Z}_q^*)^k)' class='latex' /> are Zariski dense in V), one could then conjecture that <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b+%5Ccap+%7B%5CBbb+P%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b &#92;cap {&#92;Bbb P}^k' title='&#92;Lambda b &#92;cap {&#92;Bbb P}^k' class='latex' /> is also Zariski dense in V (which, if V is infinite, would in particular imply that the orbit <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b' title='&#92;Lambda b' class='latex' /> contains infinitely many prime points).</p>
<p>For simplicity let us restrict attention to the two-dimensional case k=2, which is already highly non-trivial; Bourgain, Gamburd and Sarnak have recently begun to get some preliminary results in k=3 but I will not discuss them here.  Thus <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' /> is now a finitely generated subgroup of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z})' title='SL_2({&#92;Bbb Z})' class='latex' />.  If this subgroup is <em>elementary</em> &#8211; e.g. if it is cyclic &#8211; then the orbit <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b' title='&#92;Lambda b' class='latex' /> can be exponentially sparse (a ball of radius R may only contain <img src='http://s0.wp.com/latex.php?latex=O%28%5Clog+R%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(&#92;log R)' title='O(&#92;log R)' class='latex' /> points), and it becomes extremely difficult to do any sieving or primality detection.  (In this case, the problem becomes comparable to such notoriously difficult questions as whether there are infinitely many Mersenne primes.)  It thus makes sense to restrict attention to non-elementary subgroups of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z})' title='SL_2({&#92;Bbb Z})' class='latex' /> &#8211; groups which contain a copy of the <a href="http://en.wikipedia.org/wiki/Free_group">free non-abelian group</a> on two generators, or equivalently any group whose Zariski closure is all of <img src='http://s0.wp.com/latex.php?latex=SL_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2' title='SL_2' class='latex' /> (or equivalently yet again, a group whose limit set consists of more than one point).  In this situation, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2246331">Bourgain, Gamburd, and Sarnak conjectured</a>:</p>
<blockquote><p><strong>Conjecture 2.</strong> Let <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' /> be a non-elementary subgroup of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z})' title='SL_2({&#92;Bbb Z})' class='latex' />, and let b be a primitive element of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}^2' title='{&#92;Bbb Z}^2' class='latex' />.  Suppose that there are no local obstructions at infinity or at finite places q.  Then <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b+%5Ccap+%7B%5Cmathcal+P%7D%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b &#92;cap {&#92;mathcal P}^2' title='&#92;Lambda b &#92;cap {&#92;mathcal P}^2' class='latex' /> is Zariski dense in the plane (in particular, <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b+%5Ccap+%7B%5Cmathcal+P%7D%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b &#92;cap {&#92;mathcal P}^2' title='&#92;Lambda b &#92;cap {&#92;mathcal P}^2' class='latex' /> is infinite).</p></blockquote>
<p>This conjecture remains open.  However, as in the linear situation, one can make progress if one replaces primes with <a href="http://en.wikipedia.org/wiki/Almost_prime">almost primes</a> &#8211; products of at most r primes for some bounded r.  (There are certainly prior results for nonlinear patterns in the almost primes; for instance, it is a famous <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=485740">result of Iwaniec</a> that there are infinitely many numbers of the form <img src='http://s0.wp.com/latex.php?latex=n%5E2%2B1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n^2+1' title='n^2+1' class='latex' /> that are the product of at most two primes.)  In particular, Bourgain, Gamburd, and Sarnak were able to show</p>
<blockquote><p><strong>Theorem.</strong> Let <img src='http://s0.wp.com/latex.php?latex=%5CLambda%2C+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda, b' title='&#92;Lambda, b' class='latex' /> be as in Conjecture 2.  Then there exists an r such that <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b+%5Ccap+%7B%5Cmathcal+P%7D_r%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b &#92;cap {&#92;mathcal P}_r^2' title='&#92;Lambda b &#92;cap {&#92;mathcal P}_r^2' class='latex' /> is Zariski dense in the plane, where <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+P%7D_r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal P}_r' title='{&#92;mathcal P}_r' class='latex' /> is the set of numbers that are the product of at most r primes.</p></blockquote>
<p>Several further generalisations and extensions of this result, with a similar flavour, are known, but will not be discussed here.  There are a number of amusing special cases of these results, for instance one can show that there exist infinitely many <a href="http://en.wikipedia.org/wiki/Apollonian_gasket">Appollonian circle packings</a> of the unit circle by four other mutually tangent circles, all of whose radii is the reciprocal of an almost prime, or infinitely many <a href="http://en.wikipedia.org/wiki/Pythagorean_triple">Pythagorean triples</a> whose area is an almost prime (for a sufficiently large r in the definition of &#8220;almost prime&#8221;).</p>
<p>Let me now discuss some of the key ideas in the proof of this theorem.  One begins by rephrasing the question in a more quantitative (or finitary) manner.  In the linear case, this would be done by counting the number of points in <img src='http://s0.wp.com/latex.php?latex=%5CLambda+b+%5Ccap+%7B%5Cmathcal+P%7D_r%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda b &#92;cap {&#92;mathcal P}_r^2' title='&#92;Lambda b &#92;cap {&#92;mathcal P}_r^2' class='latex' /> that lie inside some large Euclidean ball, thus using the Euclidean (or Archimedean) notion of distance to localise the problem.  This can also be done here, but it turns out to be more convenient to instead use the <a href="http://en.wikipedia.org/wiki/Word_metric">word metric</a> induced by the finite generating set S of <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' /> (which we can take to be symmetric for convenience, thus <img src='http://s0.wp.com/latex.php?latex=S+%3D+S%5E%7B-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S = S^{-1}' title='S = S^{-1}' class='latex' />).  One thus looks at sets of the form <img src='http://s0.wp.com/latex.php?latex=B_R+b+%5Ccap+%7B%5Cmathcal+P%7D_r%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_R b &#92;cap {&#92;mathcal P}_r^2' title='B_R b &#92;cap {&#92;mathcal P}_r^2' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=B_R+%5Csubset+%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_R &#92;subset &#92;Lambda' title='B_R &#92;subset &#92;Lambda' class='latex' /> consists of all words formed by products of at most R elements of S.  A major new difficulty here compared to the linear theory is the exponential growth of <img src='http://s0.wp.com/latex.php?latex=B_R&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_R' title='B_R' class='latex' /> (a consequence of the non-elementary nature of <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' />).  (Though it is not immediately apparent, the same problem also arises if one uses Euclidean balls instead of word metric balls, due to the multiplicative rather than additive nature of the group <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' />.)</p>
<p>The next step is to use sieve theory.  Recall the <a href="http://en.wikipedia.org/wiki/Sieve_of_Eratosthenes">sieve of Eratosthenes</a>, which expresses the set of all (large) primes as the integers, minus the multiples of two, minus the multiples of three, and so forth.  Using the <a href="http://en.wikipedia.org/wiki/Inclusion-exclusion_principle">inclusion-exclusion principle</a>, we can thus view the indicator function <img src='http://s0.wp.com/latex.php?latex=1_%7B%7B%5Cmathcal+P%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1_{{&#92;mathcal P}}' title='1_{{&#92;mathcal P}}' class='latex' /> of the primes, when restricted to an interval such as <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5BN%2C2N%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[N,2N]' title='{}[N,2N]' class='latex' />, as equal to 1, minus the indicator function <img src='http://s0.wp.com/latex.php?latex=1_%7B2%7B%5CBbb+Z%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1_{2{&#92;Bbb Z}}' title='1_{2{&#92;Bbb Z}}' class='latex' /> of the even numbers, minus the indicator function <img src='http://s0.wp.com/latex.php?latex=1_%7B3%7B%5CBbb+Z%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1_{3{&#92;Bbb Z}}' title='1_{3{&#92;Bbb Z}}' class='latex' /> of the multiples of three, plus the indicator function <img src='http://s0.wp.com/latex.php?latex=1_%7B6%7B%5CBbb+Z%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1_{6{&#92;Bbb Z}}' title='1_{6{&#92;Bbb Z}}' class='latex' /> of the multiples of six, and so forth.  This leads to the <a href="http://en.wikipedia.org/wiki/Legendre_sieve">Legendre sieve</a></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+1_%7B%7B%5Cmathcal+P%7D%7D+%3D+%5Csum_d+%5Cmu%28d%29+1_%7Bd+%7B%5CBbb+Z%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle 1_{{&#92;mathcal P}} = &#92;sum_d &#92;mu(d) 1_{d {&#92;Bbb Z}}' title='&#92;displaystyle 1_{{&#92;mathcal P}} = &#92;sum_d &#92;mu(d) 1_{d {&#92;Bbb Z}}' class='latex' />, (1)</p>
<p>valid in an interval <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5BN%2C2N%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[N,2N]' title='{}[N,2N]' class='latex' /> as long as one restricts d to those integers which are products of primes less than N.  Here <img src='http://s0.wp.com/latex.php?latex=%5Cmu%28d%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu(d)' title='&#92;mu(d)' class='latex' /> is the <a href="http://en.wikipedia.org/wiki/M%C3%B6bius_function">Möbius function</a>.</p>
<p>The basic idea of sieve theory is to replace the indicator function of the primes (or almost primes) by a more general divisor sum</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Csum_d+c_d+1_%7Bd%7B%5CBbb+Z%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;sum_d c_d 1_{d{&#92;Bbb Z}}' title='&#92;displaystyle &#92;sum_d c_d 1_{d{&#92;Bbb Z}}' class='latex' />,</p>
<p>where the sieve weights <img src='http://s0.wp.com/latex.php?latex=c_d&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_d' title='c_d' class='latex' /> are chosen in order to optimise the final bounds in the sieve (they typically resemble &#8220;smoothed out&#8221; versions of the Möbius functionin order that these sieves be large on the almost primes and small elsewhere).  In order for the sieve to be practical, one wants to restrict d in this sum to be relatively small, for instance <img src='http://s0.wp.com/latex.php?latex=d+%5Cleq+N%5E%5Ctheta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &#92;leq N^&#92;theta' title='d &#92;leq N^&#92;theta' class='latex' /> for some absolute constant <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+%5Ctheta+%26%2360%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; &#92;theta &lt; 1' title='0 &lt; &#92;theta &lt; 1' class='latex' /> (values such as <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%3D1%2F4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;theta=1/4' title='&#92;theta=1/4' class='latex' /> are fairly typical).  The selection of the sieve weights <img src='http://s0.wp.com/latex.php?latex=c_d&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_d' title='c_d' class='latex' /> is now a well-developed science (see also <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/06/05/open-question-the-parity-problem-in-sieve-theory/">my earlier post</a>, and <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/08/08/emmanuel-kowalski-the-large-sieve-inequalities/">Kowalski&#8217;s guest post</a>, on this topic), and Bourgain, Gamburd and Sarnak basically use off-the-shelf sieves (in particular, combinatorial sieves and the Selberg sieve) in their work.  Inserting these standard sieves into the problem at hand, the task of counting almost primes in the finite set <img src='http://s0.wp.com/latex.php?latex=B_R+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_R b' title='B_R b' class='latex' /> then quickly reduces to the question of getting good estimates on sets such as <img src='http://s0.wp.com/latex.php?latex=B_R+b+%5Ccap+%28q+%7B%5CBbb+Z%7D%29%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_R b &#92;cap (q {&#92;Bbb Z})^2' title='B_R b &#92;cap (q {&#92;Bbb Z})^2' class='latex' /> for various q.  This amounts to much the same thing as asking for good equidistribution bounds for <img src='http://s0.wp.com/latex.php?latex=B_R&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_R' title='B_R' class='latex' /> modulo q, thus we project the generating set S, and the ball <img src='http://s0.wp.com/latex.php?latex=B_R&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_R' title='B_R' class='latex' /> it produces, from <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z})' title='SL_2({&#92;Bbb Z})' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />.  For sieving purposes it turns out to be necessary to consider all squarefree moduli q, but for simplicity we shall only discuss the (massively easier) case when q is prime.</p>
<p>The reduction to an equidistribution problem converts the original sieving problem to a more combinatorial one, involving the <a href="http://en.wikipedia.org/wiki/Cayley_graph">Cayley graph</a> G on <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> induced by the set S, thus two vertices <img src='http://s0.wp.com/latex.php?latex=x%2C+y+%5Cin+SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x, y &#92;in SL_2({&#92;Bbb Z}_q)' title='x, y &#92;in SL_2({&#92;Bbb Z}_q)' class='latex' /> are connected by an edge in G if <img src='http://s0.wp.com/latex.php?latex=y+x%5E%7B-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='y x^{-1}' title='y x^{-1}' class='latex' /> lie in S (modulo q).  The image of the ball <img src='http://s0.wp.com/latex.php?latex=B_R&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_R' title='B_R' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> is then the set of points one can reach in the graph G from the origin by walking on a path of length at most R.  The desired equidistribution result one needs can then be viewed as a mixing result for the random walk along the graph G.</p>
<p>Standard graph theory then tells us that the task reduces to showing that the graphs G form a family of <a href="http://en.wikipedia.org/wiki/Expander_graph">expander graphs</a> as <img src='http://s0.wp.com/latex.php?latex=q+%5Cto+%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='q &#92;to &#92;infty' title='q &#92;to &#92;infty' class='latex' /> (recall we are restricting q to be prime for simplicity).  There are many equivalent definitions of what an expander graph is, but let us give a spectral definition that is specialised to Cayley graphs.  The symmetric generating set S induces a natural measure</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cmu+%3A%3D+%5Cfrac%7B1%7D%7B%26%23124%3BS%26%23124%3B%7D+%5Csum_%7Bs+%5Cin+S%7D+%5Cdelta_s&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;mu := &#92;frac{1}{&#124;S&#124;} &#92;sum_{s &#92;in S} &#92;delta_s' title='&#92;displaystyle &#92;mu := &#92;frac{1}{&#124;S&#124;} &#92;sum_{s &#92;in S} &#92;delta_s' class='latex' /></p>
<p>that is the uniform distribution on S, which controls the random walk along G; note that if <img src='http://s0.wp.com/latex.php?latex=f%3A+SL_2%28%7B%5CBbb+Z%7D_q%29+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: SL_2({&#92;Bbb Z}_q) &#92;to {&#92;Bbb C}' title='f: SL_2({&#92;Bbb Z}_q) &#92;to {&#92;Bbb C}' class='latex' /> is a function, then the convolution <img src='http://s0.wp.com/latex.php?latex=f%2A%5Cmu%3A+SL_2%28%7B%5CBbb+Z%7D_q%29+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f*&#92;mu: SL_2({&#92;Bbb Z}_q) &#92;to {&#92;Bbb C}' title='f*&#92;mu: SL_2({&#92;Bbb Z}_q) &#92;to {&#92;Bbb C}' class='latex' /> is another function, whose value at any vertex x is the average value of f at all the neighbours of x.  The operation <img src='http://s0.wp.com/latex.php?latex=f+%5Cmapsto+f%2A%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f &#92;mapsto f*&#92;mu' title='f &#92;mapsto f*&#92;mu' class='latex' /> is then a self-adjoint contraction on <img src='http://s0.wp.com/latex.php?latex=l%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='l^2(SL_2({&#92;Bbb Z}_q))' title='l^2(SL_2({&#92;Bbb Z}_q))' class='latex' /> which leaves the function 1 invariant, so its largest eigenvalue <img src='http://s0.wp.com/latex.php?latex=%5Clambda_1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda_1' title='&#92;lambda_1' class='latex' /> is equal to 1.  The expander graph condition is then equivalent to the existence of a <a href="http://en.wikipedia.org/wiki/Spectral_gap">spectral gap</a> <img src='http://s0.wp.com/latex.php?latex=%5Clambda_2+%5Cleq+1-c&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda_2 &#92;leq 1-c' title='&#92;lambda_2 &#92;leq 1-c' class='latex' /> for the second largest eigenvalue, where <img src='http://s0.wp.com/latex.php?latex=c+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c &gt; 0' title='c &gt; 0' class='latex' /> is a constant independent of q.</p>
<p>Of course, to have a spectral gap, one necessary condition is that <img src='http://s0.wp.com/latex.php?latex=%5Clambda_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda_2' title='&#92;lambda_2' class='latex' /> be strictly less than 1.  This can be easily seen to be equivalent to the statement that G is connected, which in turn is equivalent to the statement that the projection of S to <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> generates all of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />.  This statement can be verified to be true, either by direct consideration of all possible subgroups of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />, or by the <a href="http://en.wikipedia.org/wiki/Approximation_in_algebraic_groups">strong approximation property</a>.  However, mere connectedness is not enough to ensure that a Cayley graph is an expander family (which can be viewed as a sort of &#8220;robust&#8221; version of connectedness, which can survive the deletion of large numbers of edges).  For instance, the Cayley graph of the generating set <img src='http://s0.wp.com/latex.php?latex=%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{-1,+1&#92;}' title='&#92;{-1,+1&#92;}' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}/N{&#92;Bbb Z}' title='{&#92;Bbb Z}/N{&#92;Bbb Z}' class='latex' /> is connected, but does not form an expander family as <img src='http://s0.wp.com/latex.php?latex=N+%5Cto+%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N &#92;to &#92;infty' title='N &#92;to &#92;infty' class='latex' />; the second largest eigenvalue is about <img src='http://s0.wp.com/latex.php?latex=1+-+O%281%2FN%5E2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 - O(1/N^2)' title='1 - O(1/N^2)' class='latex' /> only.  (One can also see that the random walk on this Cayley graph takes a long time (about O(N^2) steps) before it mixes to be close to the uniform distribution; with expander graphs on a set of N vertices, mixing instead occurs in time <img src='http://s0.wp.com/latex.php?latex=O%28%5Clog+N%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(&#92;log N)' title='O(&#92;log N)' class='latex' />, thanks to the spectral gap.)</p>
<p>Obtaining the spectral gap property requires more work.  When the original subgroup <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z})' title='SL_2({&#92;Bbb Z})' class='latex' /> is as large as a finite index subgroup (in particular, if it is a <a href="http://en.wikipedia.org/wiki/Congruence_subgroup">congruence subgroup</a>), this gap follows from a celebrated <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=182610">theorem of Selberg</a> providing a similar spectral gap for arithmetic quotients of the upper half-plane.  Smaller examples (in which the index is now infinite) were first constructed <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1645694">by Shalom</a> and <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1900698">by Gamburd</a>, with the latter following the method of <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1131400">Sarnak and Xue</a>.  Then (in the case of prime q), <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2415383">Bourgain and Gamburd</a> extended the method using additional tools from additive combinatorics to handle all non-elementary subgroups.</p>
<p>Let us now describe the method of proof.  As mentioned briefly earlier, the existence of a spectral gap implies a strong mixing property: the iterated convolutions <img src='http://s0.wp.com/latex.php?latex=%5Cmu%5E%7B%28n%29%7D+%3A%3D+%5Cmu+%2A+%5Cldots+%2A+%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu^{(n)} := &#92;mu * &#92;ldots * &#92;mu' title='&#92;mu^{(n)} := &#92;mu * &#92;ldots * &#92;mu' class='latex' /> of the probability measure <img src='http://s0.wp.com/latex.php?latex=%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu' title='&#92;mu' class='latex' /> (which can be interpreted as the probability distribution of a random walk on n steps) converges exponentially fast to the constant distribution on <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />.  Since the latter distribution has an l^2 norm of <img src='http://s0.wp.com/latex.php?latex=O%28+q%5E%7B-3%2F2%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O( q^{-3/2} )' title='O( q^{-3/2} )' class='latex' />, we see in particular that for any fixed <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon &gt; 0' title='&#92;varepsilon &gt; 0' class='latex' />, we will have</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3B+%5Cmu%5E%7B%28n%29%7D%5C%26%23124%3B_%7Bl%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29%7D+%3D+O%28+q%5E%7B-3%2F2%2B%5Cvarepsilon%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124; &#92;mu^{(n)}&#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} = O( q^{-3/2+&#92;varepsilon} )' title='&#92;displaystyle &#92;&#124; &#92;mu^{(n)}&#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} = O( q^{-3/2+&#92;varepsilon} )' class='latex' /> (2)</p>
<p>once n is a sufficiently large multiple of <img src='http://s0.wp.com/latex.php?latex=%5Clog+q&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;log q' title='&#92;log q' class='latex' />.  This can also be seen explicitly from the trace formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3B+%5Cmu%5E%7B%28n%29%7D%5C%26%23124%3B_%7Bl%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29%7D%5E2+%3D+%5Cfrac%7B1%7D%7B%26%23124%3BSL_2%28%7B%5CBbb+Z%7D_q%29%26%23124%3B%7D+%5Csum_j+%5Clambda_j%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124; &#92;mu^{(n)}&#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))}^2 = &#92;frac{1}{&#124;SL_2({&#92;Bbb Z}_q)&#124;} &#92;sum_j &#92;lambda_j^n' title='&#92;displaystyle &#92;&#124; &#92;mu^{(n)}&#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))}^2 = &#92;frac{1}{&#124;SL_2({&#92;Bbb Z}_q)&#124;} &#92;sum_j &#92;lambda_j^n' class='latex' />. (3)</p>
<p>In general, this implication between spectral gap and rapid mixing (2) cannot be reversed; the problem is that <img src='http://s0.wp.com/latex.php?latex=%5Clambda_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda_2' title='&#92;lambda_2' class='latex' /> only directly influences one term in the summation on the right-hand side of (3), and so upper bounds on the left-hand side do not translate efficiently to upper bounds on <img src='http://s0.wp.com/latex.php?latex=%5Clambda_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda_2' title='&#92;lambda_2' class='latex' />. However, there is an algebraic miracle that happens in the case of groups such as <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> that allows one to reverse the implication:</p>
<blockquote><p><strong>Lemma</strong> (Frobenius)  Let q be prime.  Then every non-trivial finite-dimensional unitary representation of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> has dimension at least (q-1)/2.</p></blockquote>
<p><strong>Proof.</strong> Observe that <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> can be generated by parabolic elements, so given a non-trivial representation <img src='http://s0.wp.com/latex.php?latex=%5Crho%3A+SL_2%28%7B%5CBbb+Z%7D_q%29+%5Cto+U%28V%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho: SL_2({&#92;Bbb Z}_q) &#92;to U(V)' title='&#92;rho: SL_2({&#92;Bbb Z}_q) &#92;to U(V)' class='latex' />, there exists a parabolic element a whose representation <img src='http://s0.wp.com/latex.php?latex=%5Crho%28a%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho(a)' title='&#92;rho(a)' class='latex' /> is non-trivial.  By a change of basis we may take</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+a+%3D+%5Cbegin%7Bpmatrix%7D+1+%26%2338%3B+1+%5C%5C+0+%26%2338%3B+1+%5Cend%7Bpmatrix%7D.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle a = &#92;begin{pmatrix} 1 &amp; 1 &#92;&#92; 0 &amp; 1 &#92;end{pmatrix}.' title='&#92;displaystyle a = &#92;begin{pmatrix} 1 &amp; 1 &#92;&#92; 0 &amp; 1 &#92;end{pmatrix}.' class='latex' /></p>
<p>On the one hand, we have <img src='http://s0.wp.com/latex.php?latex=a%5Eq%3D1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a^q=1' title='a^q=1' class='latex' /> and hence <img src='http://s0.wp.com/latex.php?latex=%5Crho%28a%29%5Eq%3D1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho(a)^q=1' title='&#92;rho(a)^q=1' class='latex' />; thus all eigenvalues of <img src='http://s0.wp.com/latex.php?latex=%5Crho%28a%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho(a)' title='&#92;rho(a)' class='latex' /> are <img src='http://s0.wp.com/latex.php?latex=q%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='q^{th}' title='q^{th}' class='latex' /> roots of unity.  On another hand, <img src='http://s0.wp.com/latex.php?latex=%5Crho%28a%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho(a)' title='&#92;rho(a)' class='latex' /> is non-trivial, so at least one of the eigenvalues of <img src='http://s0.wp.com/latex.php?latex=%5Crho%28a%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho(a)' title='&#92;rho(a)' class='latex' /> differs from 1.  Thirdly, conjugating a by diagonal matrices in <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />, we see that a is conjugate to <img src='http://s0.wp.com/latex.php?latex=a%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a^m' title='a^m' class='latex' /> whenever m is a quadratic residue mod q, and so the eigenvalues of <img src='http://s0.wp.com/latex.php?latex=%5Crho%28a%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho(a)' title='&#92;rho(a)' class='latex' /> must be stable under the operation of taking <img src='http://s0.wp.com/latex.php?latex=m%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='m^{th}' title='m^{th}' class='latex' /> powers.  On the other hand, there are (q-1)/2 quadratic residues.  Putting all this together we see that <img src='http://s0.wp.com/latex.php?latex=%5Crho%28a%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho(a)' title='&#92;rho(a)' class='latex' /> must take at least (q-1)/2 distinct eigenvalues, and the claim follows. <img src='http://s0.wp.com/latex.php?latex=%5CBox&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box' title='&#92;Box' class='latex' /></p>
<p><strong>Remark. </strong>For our purposes, the exact value of <img src='http://s0.wp.com/latex.php?latex=%28q-1%29%2F2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(q-1)/2' title='(q-1)/2' class='latex' /> is irrelevant; any multiplicity which grows like a power of q would suffice. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p>Applying this lemma to the eigenspace of <img src='http://s0.wp.com/latex.php?latex=%5Clambda_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda_2' title='&#92;lambda_2' class='latex' />, we obtain</p>
<blockquote><p><strong>Corollary.</strong> The second eigenvalue <img src='http://s0.wp.com/latex.php?latex=%5Clambda_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda_2' title='&#92;lambda_2' class='latex' /> of the operation <img src='http://s0.wp.com/latex.php?latex=f+%5Cmapsto+f%2A%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f &#92;mapsto f*&#92;mu' title='f &#92;mapsto f*&#92;mu' class='latex' /> appears with multiplicity at least <img src='http://s0.wp.com/latex.php?latex=%28q-1%29%2F2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(q-1)/2' title='(q-1)/2' class='latex' />.</p></blockquote>
<p>Combining this corollary with (3), one can now reverse the previous implication and obtain a spectral gap <img src='http://s0.wp.com/latex.php?latex=%5Clambda_2+%5Cleq+1-c&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda_2 &#92;leq 1-c' title='&#92;lambda_2 &#92;leq 1-c' class='latex' /> as soon as one gets a mixing estimate (2) for some <img src='http://s0.wp.com/latex.php?latex=n+%3D+O%28%5Clog+q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n = O(&#92;log q)' title='n = O(&#92;log q)' class='latex' /> and some sufficiently small <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon' title='&#92;varepsilon' class='latex' />.</p>
<p>The task is now to obtain the mixing estimate (2).  The quantity <img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3B%5Cmu%5E%7B%28n%29%7D%5C%26%23124%3B_%7Bl%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124;&#92;mu^{(n)}&#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))}' title='&#92;&#124;&#92;mu^{(n)}&#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))}' class='latex' /> starts at 1 when <img src='http://s0.wp.com/latex.php?latex=n%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n=0' title='n=0' class='latex' /> and decreases with n.  If we assume (as we may) that S generates a free group, then it is not hard to see that <img src='http://s0.wp.com/latex.php?latex=%5Cmu%5E%7B%28n%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu^{(n)}' title='&#92;mu^{(n)}' class='latex' /> expands rapidly for <img src='http://s0.wp.com/latex.php?latex=n+%5Cll+%5Clog+q&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;ll &#92;log q' title='n &#92;ll &#92;log q' class='latex' /> (because all the words generated by S will be distinct until one encounters the &#8220;wrap-around&#8221; effect of taking residues modulo q).  Using this one can get a preliminary mixing bound</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3B+%5Cmu%5E%7B%28n%29%7D+%5C%26%23124%3B_%7Bl%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29%7D+%5Cleq+q%5E%7B-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124; &#92;mu^{(n)} &#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} &#92;leq q^{-&#92;delta}' title='&#92;displaystyle &#92;&#124; &#92;mu^{(n)} &#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} &#92;leq q^{-&#92;delta}' class='latex' /></p>
<p>for some absolute constant <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' /> and some <img src='http://s0.wp.com/latex.php?latex=n+%3D+O%28%5Clog+q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n = O(&#92;log q)' title='n = O(&#92;log q)' class='latex' />.  Also, since S modulo q generates all of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />, we know that the probability measure <img src='http://s0.wp.com/latex.php?latex=%5Cmu%5E%7B%28n%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu^{(n)}' title='&#92;mu^{(n)}' class='latex' /> is not trapped inside any proper subgroup H of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />; indeed, using the classification of subgroups of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> (or some general &#8220;escape from subvarieties&#8221; machinery of <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2129706">Eskin, Moses, and Oh</a>) one can show that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cmu%5E%7B%28n%29%7D%28H%29+%5Cleq+q%5E%7B-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;mu^{(n)}(H) &#92;leq q^{-&#92;delta}' title='&#92;displaystyle &#92;mu^{(n)}(H) &#92;leq q^{-&#92;delta}' class='latex' /></p>
<p>for any such subgroup, and some <img src='http://s0.wp.com/latex.php?latex=n+%3D+O%28%5Clog+q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n = O(&#92;log q)' title='n = O(&#92;log q)' class='latex' />.  The result now follows from iterating the following lemma, which is the heart of the argument:</p>
<blockquote><p><strong><img src='http://s0.wp.com/latex.php?latex=l%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='l^2' title='l^2' class='latex' /> flattening lemma.</strong> Let <img src='http://s0.wp.com/latex.php?latex=%5Cnu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nu' title='&#92;nu' class='latex' /> be a symmetric probability measure on <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> which is a little bit dispersed in the sense that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3B+%5Cnu+%5C%26%23124%3B_%7Bl%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29%7D+%5Cleq+q%5E%7B-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124; &#92;nu &#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} &#92;leq q^{-&#92;delta}' title='&#92;displaystyle &#92;&#124; &#92;nu &#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} &#92;leq q^{-&#92;delta}' class='latex' /></p>
<p>for some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' />, and is not concentrated in a subvariety in the sense that <img src='http://s0.wp.com/latex.php?latex=%5Cnu%2A%5Cnu%28H%29+%5Cleq+q%5E%7B-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nu*&#92;nu(H) &#92;leq q^{-&#92;delta}' title='&#92;nu*&#92;nu(H) &#92;leq q^{-&#92;delta}' class='latex' /> for any proper subgroup H of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />.  Suppose also that <img src='http://s0.wp.com/latex.php?latex=%5Cnu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nu' title='&#92;nu' class='latex' /> is not entirely flat in the sense that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3B+%5Cnu+%5C%26%23124%3B_%7Bl%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29%7D+%5Cgeq+q%5E%7B-3%2F2%2B%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124; &#92;nu &#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} &#92;geq q^{-3/2+&#92;delta}' title='&#92;displaystyle &#92;&#124; &#92;nu &#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} &#92;geq q^{-3/2+&#92;delta}' class='latex' /></p>
<p>(note that the minimal <img src='http://s0.wp.com/latex.php?latex=l%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='l^2' title='l^2' class='latex' /> norm for a probability measure is comparable to <img src='http://s0.wp.com/latex.php?latex=q%5E%7B-3%2F2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='q^{-3/2}' title='q^{-3/2}' class='latex' />, attained for the uniform distribution).  Then <img src='http://s0.wp.com/latex.php?latex=%5Cnu+%2A+%5Cnu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nu * &#92;nu' title='&#92;nu * &#92;nu' class='latex' /> is &#8220;flatter&#8221; than <img src='http://s0.wp.com/latex.php?latex=%5Cnu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nu' title='&#92;nu' class='latex' /> in the sense that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3B+%5Cnu+%2A+%5Cnu+%5C%26%23124%3B_%7Bl%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29%7D+%5Cleq+q%5E%7B-%5Cvarepsilon%7D+%5C%26%23124%3B%5Cnu%5C%26%23124%3B_%7Bl%5E2%28SL_2%28%7B%5CBbb+Z%7D_q%29%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124; &#92;nu * &#92;nu &#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} &#92;leq q^{-&#92;varepsilon} &#92;&#124;&#92;nu&#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))}' title='&#92;displaystyle &#92;&#124; &#92;nu * &#92;nu &#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))} &#92;leq q^{-&#92;varepsilon} &#92;&#124;&#92;nu&#92;&#124;_{l^2(SL_2({&#92;Bbb Z}_q))}' class='latex' /></p>
<p>for some <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon &gt; 0' title='&#92;varepsilon &gt; 0' class='latex' /> depending on <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />.</p></blockquote>
<p>In the special case when <img src='http://s0.wp.com/latex.php?latex=%5Cnu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nu' title='&#92;nu' class='latex' /> is the uniform distribution on some set A, the flattening lemma is very close to the following theorem <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2415382">of Helfgott</a>:</p>
<blockquote><p><strong>Product theorem.</strong> Let q be a prime.  Let A be a subset of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' /> which is not too big in the sense that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B+%5Cleq+q%5E%7B3-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124; &#92;leq q^{3-&#92;delta}' title='&#124;A&#124; &#92;leq q^{3-&#92;delta}' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' />, and which is not contained in any proper subgroup H of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_q%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_q)' title='SL_2({&#92;Bbb Z}_q)' class='latex' />.  Then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA+%5Ccdot+A+%5Ccdot+A%26%23124%3B+%5Cgeq+%26%23124%3BA%26%23124%3B%5E%7B1%2B%5Cvarepsilon%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A &#92;cdot A &#92;cdot A&#124; &#92;geq &#124;A&#124;^{1+&#92;varepsilon}' title='&#124;A &#92;cdot A &#92;cdot A&#124; &#92;geq &#124;A&#124;^{1+&#92;varepsilon}' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon &gt; 0' title='&#92;varepsilon &gt; 0' class='latex' /> depending on <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />.</p></blockquote>
<p>Indeed, by using some standard additive combinatorics, in particular a (non-commutative version of) the Balog-Szemerédi-Gowers lemma (which can be found for instance in my <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2289012">book with Van Vu</a>), which connects &#8220;statistical&#8221; multiplication, such as that provided by convolution <img src='http://s0.wp.com/latex.php?latex=%5Cmu%2C+%5Cnu+%5Cmapsto+%5Cmu+%2A+%5Cnu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu, &#92;nu &#92;mapsto &#92;mu * &#92;nu' title='&#92;mu, &#92;nu &#92;mapsto &#92;mu * &#92;nu' class='latex' />, with &#8220;combinatorial&#8221; multiplication, coming from the product set operation <img src='http://s0.wp.com/latex.php?latex=A%2C+B+%5Cmapsto+A+%5Ccdot+B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A, B &#92;mapsto A &#92;cdot B' title='A, B &#92;mapsto A &#92;cdot B' class='latex' />, one can show that these two statements are in fact equivalent to each other.</p>
<p>The product theorem is a manifestation of certain &#8220;nonlinear&#8221; or &#8220;noncommutative&#8221; behaviour in the group <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_p)' title='SL_2({&#92;Bbb Z}_p)' class='latex' />; see my <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/12/06/milliman-lecture-iii-sum-product-estimates-expanders-and-exponential-sums/">Milliman lecture</a> for a bit more discussion on this.  For now, let me just say that Helfgott&#8217;s proof on this uses a variety of algebraic and combinatorial computations exploiting the special structure of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_p)' title='SL_2({&#92;Bbb Z}_p)' class='latex' /> (especially how commutativity or non-commutativity of various elements in this group interact with the trace of various combinations of these elements), as well as the following sum-product estimate of <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2053599">Bourgain-Katz-Tao</a> and <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1998834">Bourgain-Glibichuk-Konyagin</a>:</p>
<blockquote><p><strong>Sum-product theorem.</strong> Let q be prime. Let A be a subset of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D_q&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}_q' title='{&#92;Bbb Z}_q' class='latex' /> which is not too big in the sense that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B+%5Cleq+q%5E%7B1-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124; &#92;leq q^{1-&#92;delta}' title='&#124;A&#124; &#92;leq q^{1-&#92;delta}' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' />.  Then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%2BA%26%23124%3B+%2B+%26%23124%3BA+%5Ccdot+A%26%23124%3B+%5Cgeq+%26%23124%3BA%26%23124%3B%5E%7B1%2B%5Cvarepsilon%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A+A&#124; + &#124;A &#92;cdot A&#124; &#92;geq &#124;A&#124;^{1+&#92;varepsilon}' title='&#124;A+A&#124; + &#124;A &#92;cdot A&#124; &#92;geq &#124;A&#124;^{1+&#92;varepsilon}' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon &gt; 0' title='&#92;varepsilon &gt; 0' class='latex' /> depending only on <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />.</p></blockquote>
<p>There are now some quite elementary proofs of this theorem, but I will not discuss them here (see e.g. my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/06/17/the-sum-product-phenomenon-in-arbitrary-rings/">earlier blog post on this topic)</a>.  I should note, though, that the bulk of the Bourgain-Gamburd-Sarnak work is preoccupied with establishing a suitable extension of this sum-product theorem to the case when q is not prime, in a manner which is uniform in the number of prime factors; this turns out to be a surprisingly difficult task.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Marker lectures II, "Linear equations in primes" ]]></title>
<link>http://terrytao.wordpress.com/2008/11/18/marker-lectures-ii-linear-equations-in-primes/</link>
<pubDate>Tue, 18 Nov 2008 20:45:12 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/11/18/marker-lectures-ii-linear-equations-in-primes/</guid>
<description><![CDATA[This week I am at Penn State University, giving this year&#8217;s Marker lectures.  My chosen theme]]></description>
<content:encoded><![CDATA[<p>This week I am at <a class="snap_noshots" href="http://www.math.psu.edu/">Penn State University</a>, giving this <a class="snap_noshots" href="http://www.math.psu.edu/seminars/marker2008.php">year&#8217;s Marker lectures</a>.  My chosen theme for my four lectures here is &#8220;recent developments in additive prime number theory&#8221;.  My first lecture, &#8220;Long arithmetic progressions in primes&#8221;, is similar to my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/01/07/ams-lecture-structure-and-randomness-in-the-prime-numbers/">AMS lecture on the same topic</a> and so I am not reposting it here.  The second lecture, the notes for which begin after the fold, is on &#8220;Linear equations in primes&#8221;.  These two lectures focus primarily on work of myself and Ben Green.  The third and fourth lectures, entitled &#8220;Small gaps between primes&#8221; and &#8220;Sieving for almost primes and expander graphs&#8221;, will instead be focused on the work of Goldston-Yildirim-Pintz and Bourgain-Gamburd-Sarnak respectively.<br />
<!--more-->In my previous lecture (or the <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/01/07/ams-lecture-structure-and-randomness-in-the-prime-numbers/">AMS lecture</a>), I focused on finding a specific type of pattern inside the prime numbers, namely that of an arithmetic progression <img src='http://s0.wp.com/latex.php?latex=n%2C+n%2Br%2C+%5Cldots%2C+n%2B%28k-1%29r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n, n+r, &#92;ldots, n+(k-1)r' title='n, n+r, &#92;ldots, n+(k-1)r' class='latex' />.  The main reason why the analysis there is specific to progressions is because of its reliance on <a href="http://en.wikipedia.org/wiki/Szemer%C3%A9di%27s_theorem">Szemerédi&#8217;s theorem</a>, which shows that arithmetic progressions are necessarily abundant in sufficiently &#8220;large&#8221; sets of integers.  There are several other variants and generalisations of this theorem known to a few other types of patterns (e.g. polynomial progressions <img src='http://s0.wp.com/latex.php?latex=n%2BP_1%28r%29%2C+%5Cldots%2C+n%2BP_k%28r%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n+P_1(r), &#92;ldots, n+P_k(r)' title='n+P_1(r), &#92;ldots, n+P_k(r)' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=P_1%2C%5Cldots%2CP_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P_1,&#92;ldots,P_k' title='P_1,&#92;ldots,P_k' class='latex' /> are polynomials from the integers to the integers with <img src='http://s0.wp.com/latex.php?latex=P_1%280%29+%3D+%5Cldots+%3D+P_k%280%29%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P_1(0) = &#92;ldots = P_k(0)=0' title='P_1(0) = &#92;ldots = P_k(0)=0' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=r+%5Cneq+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='r &#92;neq 0' title='r &#92;neq 0' class='latex' />), and in some cases the analogous results about primes are known (e.g. <a class="snap_noshots" href="http://arxiv.org/abs/math.NT/0610050">Tamar Ziegler and myself showed</a> that for any given <img src='http://s0.wp.com/latex.php?latex=P_1%2C%5Cldots%2CP_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P_1,&#92;ldots,P_k' title='P_1,&#92;ldots,P_k' class='latex' /> as above, there are infinitely many polynomial progressions of primes).</p>
<p>However, for most patterns, there is no analogue of Szemerédi&#8217;s theorem, and the strategy used in the previous lecture or <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/01/07/ams-lecture-structure-and-randomness-in-the-prime-numbers/">AMS lecture</a> cannot be directly applied.  For instance, it is certainly not true that any subset of integers with positive upper density contains any twins <img src='http://s0.wp.com/latex.php?latex=n%2Cn%2B2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n,n+2' title='n,n+2' class='latex' />; the multiples of three, for instance, form a counterexample, among many others.  (In fact, there are so many counterexamples here, that it looks unlikely that the <a href="http://en.wikipedia.org/wiki/Twin_prime_conjecture">twin prime conjecture</a> can be attacked by this method without a significant new idea.)</p>
<p>Furthermore, even in the cases when these methods do work, for instance in demonstrating for each k that there are infinitely many progressions of length k inside the primes, they do not settle the more quantitative problem of how many progressions of length k there are asymptotically in any given finite range of primes, e.g. the primes less than a number N in the asymptotic limit <img src='http://s0.wp.com/latex.php?latex=N+%5Cto+%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N &#92;to &#92;infty' title='N &#92;to &#92;infty' class='latex' />.  This is because Szemerédi&#8217;s theorem provides a lower bound for the number of progressions in a large finite set, but not a matching upper bound.  (For instance, given a subset A of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of density 1/2, the number of progressions of length 3 in A can be as large as <img src='http://s0.wp.com/latex.php?latex=%281%2F4+%2B+o%281%29%29+N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(1/4 + o(1)) N^2' title='(1/4 + o(1)) N^2' class='latex' /> (if <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A' title='A' class='latex' /> consists of the even integers from 1 to N, for instance) and as small as <img src='http://s0.wp.com/latex.php?latex=%281%2F8+%2B+o%281%29%29+N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(1/8 + o(1)) N^2' title='(1/8 + o(1)) N^2' class='latex' /> (if <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A' title='A' class='latex' /> is a randomly chosen set of density 1/2), and can even be a little bit smaller by perturbing this example slightly.)</p>
<p>On the other hand, as discussed in the previous lecture or <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/01/07/ams-lecture-structure-and-randomness-in-the-prime-numbers/">AMS lecture</a>, one can use standard random models for the primes to predict what the correct asymptotic for these questions should be.  For instance, the number of arithmetic progressions <img src='http://s0.wp.com/latex.php?latex=n%2C+n%2Br%2C+%5Cldots%2C+n%2B%28k-1%29r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n, n+r, &#92;ldots, n+(k-1)r' title='n, n+r, &#92;ldots, n+(k-1)r' class='latex' /> of a fixed length k consisting of primes less than N should be asymptotically</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%28%5Cfrac%7B1%7D%7B2%28k-1%29%7D+%28%5Cprod_p+%5Cbeta_p%29+%2B+o%281%29%29+%5Cfrac%7BN%5E2%7D%7B%5Clog%5Ek+N%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle (&#92;frac{1}{2(k-1)} (&#92;prod_p &#92;beta_p) + o(1)) &#92;frac{N^2}{&#92;log^k N}' title='&#92;displaystyle (&#92;frac{1}{2(k-1)} (&#92;prod_p &#92;beta_p) + o(1)) &#92;frac{N^2}{&#92;log^k N}' class='latex' /> (1)</p>
<p>where the product is over all primes p, and the quantity <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_p' title='&#92;beta_p' class='latex' /> is defined as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cbeta_p+%3A%3D+%5Cfrac%7B1%7D%7Bp%7D+%28%5Cfrac%7Bp%7D%7Bp-1%7D%29%5E%7Bk-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;beta_p := &#92;frac{1}{p} (&#92;frac{p}{p-1})^{k-1}' title='&#92;displaystyle &#92;beta_p := &#92;frac{1}{p} (&#92;frac{p}{p-1})^{k-1}' class='latex' /></p>
<p>for <img src='http://s0.wp.com/latex.php?latex=p+%5Cleq+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &#92;leq k' title='p &#92;leq k' class='latex' />, and</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cbeta_p+%3A%3D+%281-%5Cfrac%7Bk-1%7D%7Bp%7D%29+%28%5Cfrac%7Bp%7D%7Bp-1%7D%29%5E%7Bk-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;beta_p := (1-&#92;frac{k-1}{p}) (&#92;frac{p}{p-1})^{k-1}' title='&#92;displaystyle &#92;beta_p := (1-&#92;frac{k-1}{p}) (&#92;frac{p}{p-1})^{k-1}' class='latex' /></p>
<p>for <img src='http://s0.wp.com/latex.php?latex=p+%5Cgeq+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &#92;geq k' title='p &#92;geq k' class='latex' />.</p>
<p>The various terms in this complicated-looking formula can be explained as follows.  The &#8220;Archimedean&#8221; factors <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B1%7D%7B2%28k-1%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{1}{2(k-1)}' title='&#92;frac{1}{2(k-1)}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^2' title='N^2' class='latex' /> come from the fact that the number of arithmetic progressions <img src='http://s0.wp.com/latex.php?latex=n%2C+n%2Br%2C+%5Cldots%2Cn%2B%28k-1%29r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n, n+r, &#92;ldots,n+(k-1)r' title='n, n+r, &#92;ldots,n+(k-1)r' class='latex' /> of <em>natural numbers</em> less than N is <img src='http://s0.wp.com/latex.php?latex=%28%5Cfrac%7B1%7D%7B2%28k-1%29%7D+%2Bo%281%29%29+N%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;frac{1}{2(k-1)} +o(1)) N^2' title='(&#92;frac{1}{2(k-1)} +o(1)) N^2' class='latex' />.  The &#8220;density&#8221; factor <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B1%7D%7B%5Clog%5Ek+N%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{1}{&#92;log^k N}' title='&#92;frac{1}{&#92;log^k N}' class='latex' /> comes from the <a href="http://en.wikipedia.org/wiki/Prime_number_theorem">prime number theorem</a>, which roughly speaking asserts that each of the k elements <img src='http://s0.wp.com/latex.php?latex=n%2C+n%2Br%2C+%5Cldots%2Cn%2B%28k-1%29r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n, n+r, &#92;ldots,n+(k-1)r' title='n, n+r, &#92;ldots,n+(k-1)r' class='latex' /> in a typical arithmetic progression has a <img src='http://s0.wp.com/latex.php?latex=%281%2Bo%281%29%29+%5Cfrac%7B1%7D%7B%5Clog+N%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(1+o(1)) &#92;frac{1}{&#92;log N}' title='(1+o(1)) &#92;frac{1}{&#92;log N}' class='latex' /> &#8220;probability&#8221; of being prime.  The &#8220;local&#8221; factors <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_p' title='&#92;beta_p' class='latex' /> measures how much bias arithmetic progressions with respect to being coprime to a fixed prime p, which is relevant for progressions of primes, since primes of course tend to be coprime to p.  More precisely, <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_p' title='&#92;beta_p' class='latex' /> can be defined as the probability that a random arithmetic progression of length k has all entries coprime to p, divided by the probability that a random collection of k independent numbers are all coprime to p.  It is not difficult to show that the product <img src='http://s0.wp.com/latex.php?latex=%5Cprod_p+%5Cbeta_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;prod_p &#92;beta_p' title='&#92;prod_p &#92;beta_p' class='latex' /> converges to some finite non-zero number for each k.</p>
<p>Similar heuristic asymptotic formulae exist for the number of many other patterns of primes; for instance, the number of representations <img src='http://s0.wp.com/latex.php?latex=N%3Dp_1%2Bp_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N=p_1+p_2' title='N=p_1+p_2' class='latex' /> of a large integer N as the sum of two primes should be equal to <img src='http://s0.wp.com/latex.php?latex=%28%5Cprod_p+%5Cbeta_%7Bp%2CN%7D+%2B+o%281%29%29+N&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;prod_p &#92;beta_{p,N} + o(1)) N' title='(&#92;prod_p &#92;beta_{p,N} + o(1)) N' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_%7Bp%2CN%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_{p,N}' title='&#92;beta_{p,N}' class='latex' /> is the probability that two randomly chosen numbers <a href="http://en.wikipedia.org/wiki/Conditional_probability">conditioned</a> to be coprime to p sum to N modulo p, divided by the probability that two randomly chosen numbers sum to N modulo p.  (In particular, <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_%7B2%2CN%7D%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_{2,N}=0' title='&#92;beta_{2,N}=0' class='latex' /> for odd N, reflecting the fact that it is very difficult for an odd number to be representable as the sum of two primes.  More generally, one can compute that <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_%7Bp%2CN%7D+%3D+1+%2B+%5Cfrac%7B1%7D%7Bp-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_{p,N} = 1 + &#92;frac{1}{p-1}' title='&#92;beta_{p,N} = 1 + &#92;frac{1}{p-1}' class='latex' /> when p divides N, and <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_%7Bp%2CN%7D+%3D+1+-+%5Cfrac%7B1%7D%7B%28p-1%29%5E2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_{p,N} = 1 - &#92;frac{1}{(p-1)^2}' title='&#92;beta_{p,N} = 1 - &#92;frac{1}{(p-1)^2}' class='latex' /> otherwise.)  A more general prediction for counting linear patterns inside primes exists, and is essentially the <a class="snap_noshots" href="http://mathworld.wolfram.com/k-TupleConjecture.html">Hardy-Littlewood prime tuples conjecture</a>.  This conjecture, which is widely believed to be true, would imply many other conjectures in the subject, such as the twin primes conjecture and the <a href="http://en.wikipedia.org/wiki/Goldbach_conjecture">Goldbach conjecture</a> (for sufficiently large even numbers).  Unfortunately, the cases of the prime tuples conjecture which would have these consequences remain out of reach of current technology.</p>
<p>Using some elementary linear algebra, one can recast the prime tuples conjecture not as a question of finding linear patterns inside primes, but rather that of solving linear equations in which all the unknowns are required to be prime, subject to some additional linear inequalities.  For instance, finding progressions of length k consisting entirely of primes less than N is essentially the same as asking for solutions to the system of equations</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=p_2+-+p_1+%3D+p_3+-+p_2+%3D+%5Cldots+%3D+p_k+-+p_%7Bk-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p_2 - p_1 = p_3 - p_2 = &#92;ldots = p_k - p_{k-1}' title='p_2 - p_1 = p_3 - p_2 = &#92;ldots = p_k - p_{k-1}' class='latex' /></p>
<p>and inequalities</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=0+%5Cleq+p_1%2C%5Cldots%2Cp_k+%5Cleq+N&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &#92;leq p_1,&#92;ldots,p_k &#92;leq N' title='0 &#92;leq p_1,&#92;ldots,p_k &#92;leq N' class='latex' /></p>
<p>where the unknowns <img src='http://s0.wp.com/latex.php?latex=p_1%2C%5Cldots%2Cp_k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p_1,&#92;ldots,p_k' title='p_1,&#92;ldots,p_k' class='latex' /> are required to be primes.  More generally, one could imagine the question of asking the number of k-tuples <img src='http://s0.wp.com/latex.php?latex=%28p_1%2C%5Cldots%2Cp_k%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(p_1,&#92;ldots,p_k)' title='(p_1,&#92;ldots,p_k)' class='latex' /> consisting entirely of primes which is contained in some convex set B in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^k' title='{&#92;Bbb R}^k' class='latex' /> of some intermediate dimension <img src='http://s0.wp.com/latex.php?latex=1+%5Cleq+d+%5Cleq+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 &#92;leq d &#92;leq k' title='1 &#92;leq d &#92;leq k' class='latex' />, which is contained in a ball of radius O(N) around the origin.  For instance, in the above example B is the 2-dimensional set</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5C%7B+%28x_1%2C%5Cldots%2Cx_k%29+%5Cin+%7B%5CBbb+R%7D%5Ek%3A+x_2-x_1+%3D+%5Cldots+%3D+x_k-x_%7Bk-1%7D%3B+0+%5Cleq+x_1%2C%5Cldots%2Cx_k+%5Cleq+N+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ (x_1,&#92;ldots,x_k) &#92;in {&#92;Bbb R}^k: x_2-x_1 = &#92;ldots = x_k-x_{k-1}; 0 &#92;leq x_1,&#92;ldots,x_k &#92;leq N &#92;}' title='&#92;{ (x_1,&#92;ldots,x_k) &#92;in {&#92;Bbb R}^k: x_2-x_1 = &#92;ldots = x_k-x_{k-1}; 0 &#92;leq x_1,&#92;ldots,x_k &#92;leq N &#92;}' class='latex' />.</p>
<p>(We make the technical assumption that the linear coefficients of the equations defining the d-dimensional subspace that B lives in are independent of N; k and d are of course also assumed to be independent of N.  The constant coefficients, however, are allowed to vary with N; this is the situation that comes up for instance in the Goldbach conjectures.)  One can think of the problem of finding points in B as that of solving k-d equations in k unknowns.  One can also generalise this problem slightly by enforcing some residue constraints <img src='http://s0.wp.com/latex.php?latex=x_j+%3D+a_j+%5Cmod+q_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_j = a_j &#92;mod q_j' title='x_j = a_j &#92;mod q_j' class='latex' /> on the unknowns, but we will ignore this minor extension to simplify the discussion.</p>
<p>The prime tuples conjecture for this problem can roughly speaking be phrased as follows.  Suppose that the number if k-tuples <img src='http://s0.wp.com/latex.php?latex=%28n_1%2C%5Cldots%2Cn_k%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(n_1,&#92;ldots,n_k)' title='(n_1,&#92;ldots,n_k)' class='latex' /> in B consisting of <em>natural numbers</em> is known to be</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%28%5Cbeta_%5Cinfty+%2B+o%281%29%29+N%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;beta_&#92;infty + o(1)) N^d' title='(&#92;beta_&#92;infty + o(1)) N^d' class='latex' /></p>
<p>for some constant <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_&#92;infty' title='&#92;beta_&#92;infty' class='latex' /> independent of N (one can think of this constant as the normalised volume of B).  Suppose also that for any fixed prime p, the number of k-tuples in B consisting of natural numbers coprime to p is known to be</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%28%5Cbeta_%5Cinfty+%5Cbeta_p+%2B+o%281%29%29+%281+-+%5Cfrac%7B1%7D%7Bp%7D%29%5Ek+N%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle (&#92;beta_&#92;infty &#92;beta_p + o(1)) (1 - &#92;frac{1}{p})^k N^d' title='&#92;displaystyle (&#92;beta_&#92;infty &#92;beta_p + o(1)) (1 - &#92;frac{1}{p})^k N^d' class='latex' /></p>
<p>for some constant $latex\ beta_p$ independent of n.  (The factor <img src='http://s0.wp.com/latex.php?latex=%281+-+%5Cfrac%7B1%7D%7Bp%7D%29%5Ek&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(1 - &#92;frac{1}{p})^k' title='(1 - &#92;frac{1}{p})^k' class='latex' /> is natural, as it represents the proportion of tuples of k natural numbers in which all the entries are coprime to p.)  Then the prime tuples conjecture asserts that the number of k-tuples in B consisting of primes should be</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%28%5Cbeta_%5Cinfty+%5Cprod_p+%5Cbeta_p+%2B+o%281%29%29+%5Cfrac%7BN%5Ed%7D%7B%5Clog%5Ek+N%7D.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle (&#92;beta_&#92;infty &#92;prod_p &#92;beta_p + o(1)) &#92;frac{N^d}{&#92;log^k N}.' title='&#92;displaystyle (&#92;beta_&#92;infty &#92;prod_p &#92;beta_p + o(1)) &#92;frac{N^d}{&#92;log^k N}.' class='latex' /> (2)</p>
<p>In particular, this can be shown to imply that if <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_%5Cinfty+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_&#92;infty &gt; 0' title='&#92;beta_&#92;infty &gt; 0' class='latex' /> (thus there are no obstructions to solving the system of equations at infinity) and if <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_p+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_p &gt; 0' title='&#92;beta_p &gt; 0' class='latex' /> for all p (thus there are no obstructions to solvability mod p for any p) then there will exist many solutions to the system of equations in primes when N is large enough.</p>
<p>As mentioned earlier, this conjecture remains open in several important cases, most particularly in the one-dimensional case d=1.  For instance, the twin prime conjecture would follow from the case <img src='http://s0.wp.com/latex.php?latex=B+%3A%3D+%5C%7B+%28x_1%2Cx_2%29%3A+x_2+-x_1+%3D+2%2C+0+%5Cleq+x_1%2Cx_2+%5Cleq+N+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B := &#92;{ (x_1,x_2): x_2 -x_1 = 2, 0 &#92;leq x_1,x_2 &#92;leq N &#92;}' title='B := &#92;{ (x_1,x_2): x_2 -x_1 = 2, 0 &#92;leq x_1,x_2 &#92;leq N &#92;}' class='latex' />, but this case remains open.  However, there has now been significant progress in the higher dimensional cases <img src='http://s0.wp.com/latex.php?latex=d+%5Cgeq+2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &#92;geq 2' title='d &#92;geq 2' class='latex' />, especially when the <em>codimension</em> <img src='http://s0.wp.com/latex.php?latex=k-d&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k-d' title='k-d' class='latex' /> (representing the number of equations in the system) is low.  Firstly, the prime number theorem settles the zero codimension case <img src='http://s0.wp.com/latex.php?latex=d%3Dk&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d=k' title='d=k' class='latex' /> (and is pretty much the only situation in which we can handle a $d=1$ case).  The Hardy-Littlewood circle method, based on Fourier analysis, settles all &#8220;non-degenerate&#8221; cases when <img src='http://s0.wp.com/latex.php?latex=d+%5Cgeq+%5Cmax%28k-1%2C+2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &#92;geq &#92;max(k-1, 2)' title='d &#92;geq &#92;max(k-1, 2)' class='latex' />, where &#8220;non-degenerate&#8221; roughly speaking means that the problem does not secretly contain a <img src='http://s0.wp.com/latex.php?latex=d%3D1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d=1' title='d=1' class='latex' /> problem inside it as a lower-dimensional projection (e.g. <img src='http://s0.wp.com/latex.php?latex=B+%3A%3D+%5C%7B+%28x_1%2Cx_2%2Cx_3%29%3A+x_2-x_1%3D2%2C+0+%5Cleq+x_1%2Cx_2%2Cx_3+%5Cleq+N+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B := &#92;{ (x_1,x_2,x_3): x_2-x_1=2, 0 &#92;leq x_1,x_2,x_3 &#92;leq N &#92;}' title='B := &#92;{ (x_1,x_2,x_3): x_2-x_1=2, 0 &#92;leq x_1,x_2,x_3 &#92;leq N &#92;}' class='latex' /> would be degenerate; more generally, B is non-degenerate if it is not contained in any hyperplane that can be defined using at most two of the unknowns).  It can also handle some cases in which the codimension k-d exceeds 1 (e.g. one could take the Cartesian product of some codimension 1 examples); the precise description of what problems are within reach of this method is a little technical to state and will not be given here.</p>
<p>The main result of <a class="snap_noshots" href="http://arxiv.org/abs/math.NT/0606088">this paper</a> of Ben Green and myself (combined with results from <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.NT/0503014">these</a> <a class="snap_noshots" href="http://arxiv.org/abs/math.NT/0606087">other</a> <a class="snap_noshots" href="http://arxiv.org/abs/0807.1736">papers</a> of ours) is the following:</p>
<blockquote><p><strong>Theorem.</strong> The prime tuples conjecture is true in all non-degenerate situations in which <img src='http://s0.wp.com/latex.php?latex=d+%5Cgeq+%5Cmax%28k-2%2C+2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &#92;geq &#92;max(k-2, 2)' title='d &#92;geq &#92;max(k-2, 2)' class='latex' />.  If the <em>inverse conjecture for the Gowers norms over the integers</em> is true, then the prime tuples conjecture is true in all non-degenerate situations in which <img src='http://s0.wp.com/latex.php?latex=d+%5Cgeq+2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &#92;geq 2' title='d &#92;geq 2' class='latex' />.</p></blockquote>
<p>I will say a little bit more about what the inverse conjecture for the Gowers norms is later.  This theorem unfortunately does not touch the most interesting case <img src='http://s0.wp.com/latex.php?latex=d%3D1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d=1' title='d=1' class='latex' /> (when the patterns one is seeking only have one degree of freedom), but it does largely settle all the other cases.  For instance, this theorem implies the asymptotic (1) for prime progressions of length k for <img src='http://s0.wp.com/latex.php?latex=k+%5Cleq+4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;leq 4' title='k &#92;leq 4' class='latex' /> (the cases <img src='http://s0.wp.com/latex.php?latex=k+%5Cleq+3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;leq 3' title='k &#92;leq 3' class='latex' /> were established earlier by van der Corput using the circle method), and the case of higher k would also follow from the theorem once the inverse conjecture is proven.  As with the circle method, we can also unconditionally handle some cases in which d is less than <img src='http://s0.wp.com/latex.php?latex=%5Cmax%28k-2%2C2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;max(k-2,2)' title='&#92;max(k-2,2)' class='latex' />, but the precise statement here is technical and will be omitted.  (Details and further examples can of course be found in <a class="snap_noshots" href="http://arxiv.org/abs/math.NT/0606088">our paper</a>.)</p>
<p>Now I would like to turn to the proof of this theorem.  At first glance, the result looks like it is going to be quite complicated, due to the presence of all the different factors in the asymptotic (2) that one is trying to prove.  However, most of the factors can be dealt with by various standard tricks.  The Archimedean factor <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_&#92;infty' title='&#92;beta_&#92;infty' class='latex' /> can be eliminated from the problem by working locally (with respect to the infinite <a href="http://en.wikipedia.org/wiki/Place_(mathematics)">place</a>), covering B by cubes of sidelength o(N).  For similar reasons, the local factors <img src='http://s0.wp.com/latex.php?latex=%5Cbeta_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;beta_p' title='&#92;beta_p' class='latex' /> can be eliminated by working locally mod p (i.e. restricting to a single residue class mod p for various small p).  The factors <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B1%7D%7B%5Clog%5Ek+N%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{1}{&#92;log^k N}' title='&#92;frac{1}{&#92;log^k N}' class='latex' />, which come from the density <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B1%7D%7B%5Clog+N%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{1}{&#92;log N}' title='&#92;frac{1}{&#92;log N}' class='latex' /> of primes in the region of interest (i.e. from the <a href="http://en.wikipedia.org/wiki/Prime_number_theorem">prime number theorem</a>), can largely be compensated for (with some effort) from the transference principle technology developed in our earlier paper on long progressions in the primes, which was discussed in the previous lecture (or at the <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/01/07/ams-lecture-structure-and-randomness-in-the-prime-numbers/">AMS lecture</a>).  After all this, the problem basically boils down to the following.  We have a certain subset A of the integers <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> with some density <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+%5Cdelta+%26%2360%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; &#92;delta &lt; 1' title='0 &lt; &#92;delta &lt; 1' class='latex' /> (one should think of A as being a &#8220;model&#8221; for the primes, after all the distorting structure coming from local obstructions has been stripped out; in actuality, one has to replace the set A by a weight function <img src='http://s0.wp.com/latex.php?latex=f%3A+%5C%7B1%2C%5Cldots%2CN%5C%7D+%5Cto+%5B0%2C1%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: &#92;{1,&#92;ldots,N&#92;} &#92;to [0,1]' title='f: &#92;{1,&#92;ldots,N&#92;} &#92;to [0,1]' class='latex' /> of mean value <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />, but let us ignore this technicality to simplify the discussion).  We pick a random instance of some linear pattern inside <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> (for sake of concreteness, let us pick a random arithmetic progression <img src='http://s0.wp.com/latex.php?latex=n%2C+n%2Br%2C+%5Cldots%2Cn%2B%28k-1%29+r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n, n+r, &#92;ldots,n+(k-1) r' title='n, n+r, &#92;ldots,n+(k-1) r' class='latex' />) and ask what is the probability that all the elements of this pattern lie in A.  Since A has density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />, we expect each element <img src='http://s0.wp.com/latex.php?latex=n%2Bjr&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n+jr' title='n+jr' class='latex' /> of our random progression to have a probability <img src='http://s0.wp.com/latex.php?latex=%5Cdelta%2Bo%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta+o(1)' title='&#92;delta+o(1)' class='latex' /> to lie in A.  (This is not quite the case if A is biased to lie on one side of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> than on the other, but it turns out that one can ignore this possibility.)  Since our pattern consists of k elements, we thus expect</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%28+n%2C+n%2Br%2C+%5Cldots%2C+n%2B%28k-1%29r+%5Cin+A+%29+%3D+%5Cdelta%5Ek+%2B+o%281%29.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}( n, n+r, &#92;ldots, n+(k-1)r &#92;in A ) = &#92;delta^k + o(1).' title='{&#92;Bbb P}( n, n+r, &#92;ldots, n+(k-1)r &#92;in A ) = &#92;delta^k + o(1).' class='latex' />  (3)</p>
<p>Roughly speaking, the key issue in proving the theorem is to work out some &#8220;easily checkable&#8221; conditions on A that would guarantee that the heuristic (3) is in fact valid.  One then verifies that these &#8220;easily checkable&#8221; conditions do indeed hold for the set A of interest (which is a proxy for the set of primes).</p>
<p>As stated before, we expect <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%28n%2Bjr+%5Cin+A%29+%3D+%5Cdelta%2Bo%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}(n+jr &#92;in A) = &#92;delta+o(1)' title='{&#92;Bbb P}(n+jr &#92;in A) = &#92;delta+o(1)' class='latex' /> for each <img src='http://s0.wp.com/latex.php?latex=0+%5Cleq+j+%26%2360%3B+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &#92;leq j &lt; k' title='0 &#92;leq j &lt; k' class='latex' />.  Thus (3) is asserting in some sense that the events <img src='http://s0.wp.com/latex.php?latex=n%2Bjr+%5Cin+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n+jr &#92;in A' title='n+jr &#92;in A' class='latex' /> are &#8220;approximately independent&#8221;.  This would be a reasonable assertion if A was <em>pseudorandom</em> (i.e. it behaved like a random subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of the given density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />), and is consistent with the general heuristic from number theory that we expect the prime numbers to behave randomly once all the &#8220;obvious&#8221; irregularities in distribution (in particular, irregularity modulo p for small p) has been dealt with.  But if A exhibits certain types of <em>structure</em> (or at least some bias towards structure), then (3) can fail.  For instance, suppose that A consists entirely of odd numbers.  Then, if the first two elements n, n+r of an arithmetic progression lie in A, they are necessarily odd, which then forces the rest of the elements of this progression to be odd.  As A is concentrated entirely in these odd numbers, these elements of the progression are thus expected to have an elevated probability of lying in A, and so the left-hand side of (3) would be expected to significantly exceed the former once <img src='http://s0.wp.com/latex.php?latex=k+%5Cgeq+3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;geq 3' title='k &#92;geq 3' class='latex' />.  (The asymptotic (3) becomes trivially true for <img src='http://s0.wp.com/latex.php?latex=k%26%2360%3B3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k&lt;3' title='k&lt;3' class='latex' />.) A similar distorting effect occurs if A is not entirely contained in the odd numbers, but is merely <em>biased</em> towards them, in that odd numbers are more likely to lie in A than even numbers.  In this example, the bias in A caused the number of progressions to go up from the expected number predicted by (3); it is also possible (but more tricky) to concoct examples in which bias in A forces the number of progressions to go down somewhat, though Szemerédi&#8217;s theorem does prevent one from extinguishing these progressions completely when N is large.</p>
<p>Bias towards odd or even numbers is equivalent to a correlation between A and the <a href="http://en.wikipedia.org/wiki/Character_(mathematics)">linear character</a> <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3A%3D+%28-1%29%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) := (-1)^n' title='&#92;chi(n) := (-1)^n' class='latex' />; the algebraic constraints between the <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%2Bjr%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n+jr)' title='&#92;chi(n+jr)' class='latex' />, and in particular the relationship</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%2B2r%29+%5Cchi%28n%2Br%29%5E%7B-2%7D+%5Cchi%28n%29%3D1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n+2r) &#92;chi(n+r)^{-2} &#92;chi(n)=1' title='&#92;chi(n+2r) &#92;chi(n+r)^{-2} &#92;chi(n)=1' class='latex' /> (4)</p>
<p>can be viewed as the underlying source of the distorting effect that can prevent (3) from holding for <img src='http://s0.wp.com/latex.php?latex=k+%5Cgeq+3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;geq 3' title='k &#92;geq 3' class='latex' />.  The same algebraic constraint holds for any other linear character, e.g. the Fourier character <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3A%3D+e%28%5Cxi+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) := e(&#92;xi n)' title='&#92;chi(n) := e(&#92;xi n)' class='latex' /> (where <img src='http://s0.wp.com/latex.php?latex=e%28x%29+%3A%3D+e%5E%7B2%5Cpi+i+x%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e(x) := e^{2&#92;pi i x}' title='e(x) := e^{2&#92;pi i x}' class='latex' />) for some fixed frequency <img src='http://s0.wp.com/latex.php?latex=%5Cxi+%5Cin+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi &#92;in {&#92;Bbb R}' title='&#92;xi &#92;in {&#92;Bbb R}' class='latex' />, for much the same reason that two points on a line determine the rest of the line (it is also closely related to the fact that the second derivative of a linear function vanishes).  Because of this, we expect (3) to be distorted when A <em>correlates</em> with such a character (which means that the Fourier coefficient <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn+%5Cin+A%7D+%5Cchi%28n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{n &#92;in A} &#92;chi(n)' title='&#92;sum_{n &#92;in A} &#92;chi(n)' class='latex' /> is unexpectedly large in magnitude).</p>
<p>It turns out that in the case k=3 of progressions of length three, correlation with a linear character is the <em>only</em> source of distortion in the count (3).  A sign of this can be seen from the identity</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%23+%5C%7B+%28n%2Cr%29%3A+n%2Cn%2Br%2Cn%2B2r+%5Cin+A+%5C%7D+%3D+%5Cint_0%5E1+%28%5Csum_%7Bn_1+%5Cin+A%7D+e%28%5Cxi+n_1%29+%28%5Csum_%7Bn_2+%5Cin+A%7D+e%28-2%5Cxi+n_2%29%29+%28%5Csum_%7Bn_3+%5Cin+A%7D+e%28%5Cxi+n_3%29%29%5C+d%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;# &#92;{ (n,r): n,n+r,n+2r &#92;in A &#92;} = &#92;int_0^1 (&#92;sum_{n_1 &#92;in A} e(&#92;xi n_1) (&#92;sum_{n_2 &#92;in A} e(-2&#92;xi n_2)) (&#92;sum_{n_3 &#92;in A} e(&#92;xi n_3))&#92; d&#92;xi' title='&#92;displaystyle &#92;# &#92;{ (n,r): n,n+r,n+2r &#92;in A &#92;} = &#92;int_0^1 (&#92;sum_{n_1 &#92;in A} e(&#92;xi n_1) (&#92;sum_{n_2 &#92;in A} e(-2&#92;xi n_2)) (&#92;sum_{n_3 &#92;in A} e(&#92;xi n_3))&#92; d&#92;xi' class='latex' /></p>
<p>which can be viewed as a &#8220;Fourier transform&#8221; of the algebraic identity (4).  One can formalise this using (a slight variant of) the above identity and some other Fourier-analytic tools (in particular, the <a href="http://en.wikipedia.org/wiki/Plancherel_theorem">Plancherel identity</a>) to conclude</p>
<blockquote><p><strong>Inverse theorem for length three progressions. </strong>(Informal statement)  Let k=3. Suppose that A is a subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> for which (3) fails.  Then A correlates with a non-trivial linear character <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+e%28%5Cxi+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = e(&#92;xi n)' title='&#92;chi(n) = e(&#92;xi n)' class='latex' />.  (&#8220;Non-trivial&#8221; basically means that <img src='http://s0.wp.com/latex.php?latex=%5Cchi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi' title='&#92;chi' class='latex' /> oscillates at least once on the interval <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' />.)</p></blockquote>
<p>Applying this theorem in the contrapositive, we conclude that we can justify the asymptotic (3) in the k=3 case as long as we can show that A does not correlate with a linear character.  In the case when A is a proxy for the primes, this task essentially boils down to that of establishing non-trivial estimates for exponential sums over primes, such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bp+%26%2360%3B+N%7D+e%28%5Cxi+p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{p &lt; N} e(&#92;xi p)' title='&#92;sum_{p &lt; N} e(&#92;xi p)' class='latex' />;</p>
<p>for technical reasons it is more convenient to deal with slight variants of this sum such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%3D1%7D%5EN+%5CLambda%28n%29+e%28%5Cxi+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{n=1}^N &#92;Lambda(n) e(&#92;xi n)' title='&#92;sum_{n=1}^N &#92;Lambda(n) e(&#92;xi n)' class='latex' /> (5)</p>
<p>where <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' /> is the <a href="http://en.wikipedia.org/wiki/Von_Mangoldt_function">von Mangoldt function</a>, or</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%3D1%7D%5EN+%5Cmu%28n%29+e%28%5Cxi+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{n=1}^N &#92;mu(n) e(&#92;xi n)' title='&#92;sum_{n=1}^N &#92;mu(n) e(&#92;xi n)' class='latex' /> (6)</p>
<p>where <img src='http://s0.wp.com/latex.php?latex=%5Cmu&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu' title='&#92;mu' class='latex' /> is the <a href="http://en.wikipedia.org/wiki/M%C3%B6bius_function">Möbius function</a>.  (There are various elementary identities, such as summation by parts, that allow one to express one of these sums in terms of the others.  One has a lot of flexibility in here as long as one retains a factor in the sum which is somehow sensitive to the prime factorisation of n.)  The reason for using these functions instead is that they enjoy a number of very useful identities, such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Csum_%7Bn%3D1%7D%5E%5Cinfty+%5Cfrac%7B%5CLambda%28n%29+%5Cchi%28n%29%7D%7Bn%5Es%7D+%3D+-%5Cfrac%7BL%27%28s%2C%5Cchi%29%7D%7BL%28s%2C%5Cchi%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;sum_{n=1}^&#92;infty &#92;frac{&#92;Lambda(n) &#92;chi(n)}{n^s} = -&#92;frac{L&#039;(s,&#92;chi)}{L(s,&#92;chi)}' title='&#92;displaystyle &#92;sum_{n=1}^&#92;infty &#92;frac{&#92;Lambda(n) &#92;chi(n)}{n^s} = -&#92;frac{L&#039;(s,&#92;chi)}{L(s,&#92;chi)}' class='latex' /> (7)</p>
<p>and</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Csum_%7Bn%3D1%7D%5E%5Cinfty+%5Cfrac%7B%5Cmu%28n%29+%5Cchi%28n%29%7D%7Bn%5Es%7D+%3D+%5Cfrac%7B1%7D%7BL%28s%2C%5Cchi%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;sum_{n=1}^&#92;infty &#92;frac{&#92;mu(n) &#92;chi(n)}{n^s} = &#92;frac{1}{L(s,&#92;chi)}' title='&#92;displaystyle &#92;sum_{n=1}^&#92;infty &#92;frac{&#92;mu(n) &#92;chi(n)}{n^s} = &#92;frac{1}{L(s,&#92;chi)}' class='latex' /> (8)</p>
<p>for any <a href="http://en.wikipedia.org/wiki/Dirichlet_character">Dirichlet character</a> <img src='http://s0.wp.com/latex.php?latex=%5Cchi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi' title='&#92;chi' class='latex' /> (where <img src='http://s0.wp.com/latex.php?latex=L%28s%2C%5Cchi%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L(s,&#92;chi)' title='L(s,&#92;chi)' class='latex' /> is the <a href="http://en.wikipedia.org/wiki/Dirichlet_L-function">Dirichlet L-function</a>), and also multiplicative identities such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5CLambda%28n%29+%3D+%5Csum_%7Bd%26%23124%3Bn%7D+%5Cmu%28d%29+%5Clog+%5Cfrac%7Bn%7D%7Bd%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;Lambda(n) = &#92;sum_{d&#124;n} &#92;mu(d) &#92;log &#92;frac{n}{d}' title='&#92;displaystyle &#92;Lambda(n) = &#92;sum_{d&#124;n} &#92;mu(d) &#92;log &#92;frac{n}{d}' class='latex' /> (9)</p>
<p>and</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cmu%28n%29+%3D+%5Csum_%7Bn%3Dabc%7D+%5Cmu%28a%29+%5Cmu%28b%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;mu(n) = &#92;sum_{n=abc} &#92;mu(a) &#92;mu(b)' title='&#92;displaystyle &#92;mu(n) = &#92;sum_{n=abc} &#92;mu(a) &#92;mu(b)' class='latex' /> (10)</p>
<p>To cut a long story very short, identities such as (7), (8) are useful for estimating (5), (6) respectively in the <em>major arc</em> case when <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' /> is rational or close to rational (with small denominator), while (variants of) identities such as (9) or (10) (in particular, certain truncated versions of (9) and (10) such as <a href="http://en.wikipedia.org/wiki/Vaughan%27s_identity">Vaughan&#8217;s identity</a>) are useful for estimating (5), (6) respectively in the <em>minor arc</em> case when <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' /> is far from a rational with small denominator (or close to a rational with large denominator).  This theory was pioneered by Vinogradov (and also Hardy and Littlewood), and refined and simplified over the years with many contributions by Vaughan, Davenport, Heath-Brown, and others, with the upshot being that we now have a fairly good understanding of sums such as (5) and (6), and in particular that the sum (6) exhibits a strong cancellation (by a factor of <img src='http://s0.wp.com/latex.php?latex=O_A%28%5Clog%5E%7B-A%7D+N%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O_A(&#92;log^{-A} N)' title='O_A(&#92;log^{-A} N)' class='latex' /> for any fixed A) uniformly in <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' /> (i.e. we can handle both major and minor arcs with a uniform estimate).  Combining this with the inverse theorem and the previous reductions, one can eventually establish the asymptotic (1) in the k=3 case.  (This is not exactly how the original proof of (1) by van der Corput in this case proceeded, but both proofs use the same general ingredients and method, i.e. the Hardy-Littlewood circle method and the Vinogradov method for estimating exponential sums.)</p>
<p>Now we turn to progressions of longer length, such as the case k=4.  Here, linear characters <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+e%28%5Cxi+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = e(&#92;xi n)' title='&#92;chi(n) = e(&#92;xi n)' class='latex' /> continue to cause bias that distorts the expected asymptotic (3), and so it is still necessary to control sums such as (5) or (6) to prevent such bias from occuring.  However, a major new difficulty arises that new sources of bias also arise.  For instance, if one takes a quadratic character <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3A%3D+e%28%5Cxi+n%5E2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) := e(&#92;xi n^2)' title='&#92;chi(n) := e(&#92;xi n^2)' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%5Cxi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;xi' title='&#92;xi' class='latex' />, then one easily verifies the identity</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%5Cchi%28n%2Br%29%5E%7B-3%7D+%5Cchi%28n%2B2r%29%5E3+%5Cchi%28n%2B3r%29%5E%7B-1%7D+%3D+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) &#92;chi(n+r)^{-3} &#92;chi(n+2r)^3 &#92;chi(n+3r)^{-1} = 1' title='&#92;chi(n) &#92;chi(n+r)^{-3} &#92;chi(n+2r)^3 &#92;chi(n+3r)^{-1} = 1' class='latex' /> (11)</p>
<p>which reflects the fact that the third derivative of a quadratic function (such as <img src='http://s0.wp.com/latex.php?latex=n+%5Cmapsto+%5Cxi+n%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;mapsto &#92;xi n^2' title='n &#92;mapsto &#92;xi n^2' class='latex' />) is zero (it also reflects the fact that three points on the graph of a quadratic (i.e. a parabola) determine the entire parabola).  One consequence of this is that if <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29%2C+%5Cchi%28n%2Br%29%2C+%5Cchi%28n%2B2r%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n), &#92;chi(n+r), &#92;chi(n+2r)' title='&#92;chi(n), &#92;chi(n+r), &#92;chi(n+2r)' class='latex' /> are all close to 1 (say), then <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%2B3r%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n+3r)' title='&#92;chi(n+3r)' class='latex' /> will be also.  This constraint between the four values of <img src='http://s0.wp.com/latex.php?latex=%5Cchi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi' title='&#92;chi' class='latex' /> along an arithmetic progression suggests that if A exhibits significant correlation with <img src='http://s0.wp.com/latex.php?latex=%5Cchi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi' title='&#92;chi' class='latex' />, then the event that n+3r lies in A will be influenced in some non-trivial manner by whether n, n+r, and n+2r already lie in A, which will lead to some distortion in (3).  Thus one will need to update the inverse theorem by taking quadratic characters into account.  (Easy examples show that it is possible for a set to correlate with a quadratic character without exhibiting any correlation with linear characters, by choosing a quadratic character with irrational coefficients.)  The most optimistic conjecture in this regard would be</p>
<blockquote><p><strong>Proposed inverse theorem for length four progressions. </strong>(Informal statement)  Let k=4. Suppose that A is a subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> for which (3) fails.  Then A correlates with a non-trivial quadratic character <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+e%28+%5Cxi_2+n%5E2+%2B+%5Cxi_1+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = e( &#92;xi_2 n^2 + &#92;xi_1 n)' title='&#92;chi(n) = e( &#92;xi_2 n^2 + &#92;xi_1 n)' class='latex' />.</p></blockquote>
<p>Unfortunately, this conjecture fails.  The easiest way to see this is to consider a <em>bracket quadratic character</em>, such as <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+e%28+%5Clfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+n+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n )' title='&#92;chi(n) = e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n )' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Clfloor+%5Crfloor&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lfloor &#92;rfloor' title='&#92;lfloor &#92;rfloor' class='latex' /> is the <a href="http://en.wikipedia.org/wiki/Floor_and_ceiling_functions">greatest integer function</a>.  This is not quite a quadratic character, because <img src='http://s0.wp.com/latex.php?latex=%5Clfloor+%5Crfloor&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lfloor &#92;rfloor' title='&#92;lfloor &#92;rfloor' class='latex' /> is not quite a linear function.  However, this function is linear &#8220;a positive fraction of the time&#8221;; if one picks x and y to be some generic real numbers, one expects <img src='http://s0.wp.com/latex.php?latex=%5Clfloor+x%2By+%5Crfloor&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lfloor x+y &#92;rfloor' title='&#92;lfloor x+y &#92;rfloor' class='latex' /> to equal <img src='http://s0.wp.com/latex.php?latex=%5Clfloor+x+%5Crfloor+%2B+%5Clfloor+y+%5Crfloor&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lfloor x &#92;rfloor + &#92;lfloor y &#92;rfloor' title='&#92;lfloor x &#92;rfloor + &#92;lfloor y &#92;rfloor' class='latex' /> about half of the time.  Because of this, we see that while the identity (11) certainly doesn&#8217;t hold all the time for <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n)' title='&#92;chi(n)' class='latex' />, it does hold a positive fraction of the time, and this is enough to still cause significant bias to disrupt (3) if A correlates with this object.  It is furthermore possible to concoct examples of sets A that correlate with bracket quadratic characters such as <img src='http://s0.wp.com/latex.php?latex=e%28+%5Clfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+n+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n )' title='e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n )' class='latex' /> but not any linear or quadratic characters.  (The same phenomenon is not visible at the linear level; a bracket linear phase such as <img src='http://s0.wp.com/latex.php?latex=e%28+%5Clfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} )' title='e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} )' class='latex' /> can be rewritten as <img src='http://s0.wp.com/latex.php?latex=e%28+%5Csqrt%7B6%7D+n+%29+e%28+-%5Csqrt%7B3%7D+%5C%7B+%5Csqrt%7B2%7D+n+%5C%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e( &#92;sqrt{6} n ) e( -&#92;sqrt{3} &#92;{ &#92;sqrt{2} n &#92;} )' title='e( &#92;sqrt{6} n ) e( -&#92;sqrt{3} &#92;{ &#92;sqrt{2} n &#92;} )' class='latex' />, which by Fourier series can be expressed as a linear combination of linear characters <img src='http://s0.wp.com/latex.php?latex=e%28+%28%5Csqrt%7B6%7D+%2B+k+%5Csqrt%7B2%7D%29+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e( (&#92;sqrt{6} + k &#92;sqrt{2}) n)' title='e( (&#92;sqrt{6} + k &#92;sqrt{2}) n)' class='latex' /> for integer k.  Note that the same trick does not work for <img src='http://s0.wp.com/latex.php?latex=e%28+%5Clfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n)' title='e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n)' class='latex' />.)  Once one throws in these bracket quadratics, it turns out that these do in fact constitute all the possible obstructions to (3) holding in the k=4 case, as shown <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.NT/0503014">by Ben Green and myself</a>:</p>
<blockquote><p><strong>Inverse theorem for length four progressions. </strong>(Informal statement)  Let k=4. Suppose that A is a subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> for which (3) fails.  Then A correlates with a non-trivial bracket quadratic character <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+e%28+%5Csum_%7Bj%3D1%7D%5EJ+%5Clfloor+%5Calpha_j+n+%5Crfloor+%5Cbeta_n+j+%2B+%5Cxi_2+n%5E2+%2B+%5Cxi_1+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = e( &#92;sum_{j=1}^J &#92;lfloor &#92;alpha_j n &#92;rfloor &#92;beta_n j + &#92;xi_2 n^2 + &#92;xi_1 n)' title='&#92;chi(n) = e( &#92;sum_{j=1}^J &#92;lfloor &#92;alpha_j n &#92;rfloor &#92;beta_n j + &#92;xi_2 n^2 + &#92;xi_1 n)' class='latex' /> for some real numbers <img src='http://s0.wp.com/latex.php?latex=%5Calpha_j%2C+%5Cbeta_j%2C+%5Cxi_1%2C+%5Cxi_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha_j, &#92;beta_j, &#92;xi_1, &#92;xi_2' title='&#92;alpha_j, &#92;beta_j, &#92;xi_1, &#92;xi_2' class='latex' /> and bounded J.</p></blockquote>
<p>The proof of this result involves both Fourier analysis and additive combinatorics, relying heavily on ideas from a <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1631259">paper of Gowers</a> on Szemerédi&#8217;s theorem for progressions of length 4.  It will not be discussed here.</p>
<p>In view of the inverse theorem, the problem of establishing the asymptotic (1) for length 4 progressions then reduces (by suitable generalisations of the various methods discussed previously) to that of estimating exponential sums of which</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%3D1%7D%5EN+%5Cmu%28n%29+e%28+%5Clfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+n+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{n=1}^N &#92;mu(n) e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n )' title='&#92;sum_{n=1}^N &#92;mu(n) e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n )' class='latex' /></p>
<p>is a typical example.  One can begin to apply the methods of Vinogradov and Vaughan to control this type of expression.  But one is soon faced with the problem of understanding the distribution of quadratic phases such as <img src='http://s0.wp.com/latex.php?latex=%5Clfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n' title='&#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n' class='latex' />, and in particular to estimate exponential sums such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%3D1%7D%5EN+e%28+%5Clfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+n+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{n=1}^N e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n )' title='&#92;sum_{n=1}^N e( &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n )' class='latex' />.</p>
<p>This turns out to be somewhat unpleasant; the standard technology of Weyl differencing and the <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/06/14/the-van-der-corputs-trick-and-equidistribution-on-nilmanifolds/">van der Corput lemma</a> eventually works (see <a class="snap_noshots" href="http://arxiv.org/abs/math.NT/0606087">this paper of Ben Green and myself</a>), but does not scale well to bracket polynomials of higher degree such as <img src='http://s0.wp.com/latex.php?latex=e%28+%5Clfloor+%5Clfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+n+%5Crfloor+%5Csqrt%7B5%7D+n+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e( &#92;lfloor &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n &#92;rfloor &#92;sqrt{5} n )' title='e( &#92;lfloor &#92;lfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n &#92;rfloor &#92;sqrt{5} n )' class='latex' />, which would be necessary if one were to extend (1) to k beyond 4.</p>
<p>To resolve this, and inspired by the work in ergodic theory by <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1412607">Furstenberg-Weiss</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2150389">Host-Kra</a>, <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2318563">Bergelson-Leibman</a>, and others, we re-interpreted bracket polynomials from a more dynamical systems perspective.  To motivate this, observe that the linear character <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+e%28%5Calpha+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = e(&#92;alpha n)' title='&#92;chi(n) = e(&#92;alpha n)' class='latex' /> is closely tied to the circle rotation <img src='http://s0.wp.com/latex.php?latex=T%3A+x+%5Cmapsto+x%2B%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T: x &#92;mapsto x+&#92;alpha' title='T: x &#92;mapsto x+&#92;alpha' class='latex' /> on the unit circle <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}/{&#92;Bbb Z}' title='{&#92;Bbb R}/{&#92;Bbb Z}' class='latex' />, in that the character <img src='http://s0.wp.com/latex.php?latex=%5Cchi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi' title='&#92;chi' class='latex' /> can be described as a function <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+F%28T%5En+x_0%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = F(T^n x_0)' title='&#92;chi(n) = F(T^n x_0)' class='latex' /> of an orbit <img src='http://s0.wp.com/latex.php?latex=%28T%5En+x_0%29_%7Bn+%5Cin+%7B%5CBbb+Z%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(T^n x_0)_{n &#92;in {&#92;Bbb Z}}' title='(T^n x_0)_{n &#92;in {&#92;Bbb Z}}' class='latex' /> on this system, where <img src='http://s0.wp.com/latex.php?latex=x_0%3D0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_0=0' title='x_0=0' class='latex' /> is the origin and <img src='http://s0.wp.com/latex.php?latex=F%28x%29+%3A%3D+e%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F(x) := e(x)' title='F(x) := e(x)' class='latex' /> is the exponential function.  In a similar spirit, a quadratic character such as <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+e%28%5Calpha+%5Cfrac%7Bn%28n-1%29%7D%7B2%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = e(&#92;alpha &#92;frac{n(n-1)}{2})' title='&#92;chi(n) = e(&#92;alpha &#92;frac{n(n-1)}{2})' class='latex' /> can be expressed in terms of the <em>skew shift system</em> <img src='http://s0.wp.com/latex.php?latex=%28x%2Cy%29+%5Cmapsto+%28x%2B%5Calpha%2Cy%2Bx%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x,y) &#92;mapsto (x+&#92;alpha,y+x)' title='(x,y) &#92;mapsto (x+&#92;alpha,y+x)' class='latex' /> on the torus <img src='http://s0.wp.com/latex.php?latex=%28%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D%29%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='({&#92;Bbb R}/{&#92;Bbb Z})^2' title='({&#92;Bbb R}/{&#92;Bbb Z})^2' class='latex' />, being of the form <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+F%28T%5En+x_0%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = F(T^n x_0)' title='&#92;chi(n) = F(T^n x_0)' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=x_0+%3A%3D+%280%2C0%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_0 := (0,0)' title='x_0 := (0,0)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=F%28x%2Cy%29+%3A%3D+e%28y%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F(x,y) := e(y)' title='F(x,y) := e(y)' class='latex' />.  More generally, one can also express bracket quadratic polynomials such as <img src='http://s0.wp.com/latex.php?latex=e%28+%5Crfloor+%5Csqrt%7B2%7D+n+%5Crfloor+%5Csqrt%7B3%7D+n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e( &#92;rfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n)' title='e( &#92;rfloor &#92;sqrt{2} n &#92;rfloor &#92;sqrt{3} n)' class='latex' /> in the form <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+F%28T%5En+x_0%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = F(T^n x_0)' title='&#92;chi(n) = F(T^n x_0)' class='latex' />, where T is now the action of a group element <img src='http://s0.wp.com/latex.php?latex=x+%5Cmapsto+gx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;mapsto gx' title='x &#92;mapsto gx' class='latex' /> on a 2-step nilmanifold <img src='http://s0.wp.com/latex.php?latex=G%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G/&#92;Gamma' title='G/&#92;Gamma' class='latex' />, and F is some reasonable (e.g. piecewise smooth) function on this nilmanifold.  (See my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/03/09/254a-lecture-16-a-ratner-type-theorem-for-nilmanifolds/">lecture notes</a>, this <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2318563">paper of Bergelson and Leibman</a>, or <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.NT/0503014">this paper of Ben and myself</a> for details.)  The relevance of 2-step nilpotent groups and nilmanifolds to length 4 progressions can be glimpsed in the identity</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%28g%5En+x%29+%28g%5E%7Bn%2Br%7D+x%29%5E%7B-3%7D+%28g%5E%7Bn%2B3r%7D+x%29%5E%7B-1%7D%C2%A0%28g%5E%7Bn%2B2r%7D+x%29%5E3+%3D+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(g^n x) (g^{n+r} x)^{-3} (g^{n+3r} x)^{-1} (g^{n+2r} x)^3 = 1' title='(g^n x) (g^{n+r} x)^{-3} (g^{n+3r} x)^{-1} (g^{n+2r} x)^3 = 1' class='latex' /></p>
<p>which is valid for all g, x in a 2-step nilpotent group G (compare this with (11)); it is an instructive exercise to prove this identity and to see how the 2-step nilpotency is used.  (To make the connection more precise, one needs a variant of this constraint in which x lies in <img src='http://s0.wp.com/latex.php?latex=G%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G/&#92;Gamma' title='G/&#92;Gamma' class='latex' /> rather than G, which is harder to state; see <a class="snap_noshots" href="http://www.technion.ac.il/~tamarzr/Thesis1.dvi">Ziegler&#8217;s thesis</a>, or this paper of <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.NT/0503014">Ben and myself</a>, for details.)  Indeed, one can reformulate the inverse theorem for length 4 progressions in an equivalent form:</p>
<blockquote><p><strong>Inverse theorem for length four progressions, again. </strong>(Informal statement)  Let k=4. Suppose that A is a subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> for which (3) fails.  Then A correlates with a non-trivial <em>2-step nilsequence</em> <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+F%28T%5En+x_0%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = F(T^n x_0)' title='&#92;chi(n) = F(T^n x_0)' class='latex' /> for some 2-step nilmanifold <img src='http://s0.wp.com/latex.php?latex=G%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G/&#92;Gamma' title='G/&#92;Gamma' class='latex' /> (of bounded &#8220;complexity&#8221;), some group rotation <img src='http://s0.wp.com/latex.php?latex=T%3A+x+%5Cmapsto+gx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T: x &#92;mapsto gx' title='T: x &#92;mapsto gx' class='latex' />, some starting point <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+G%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;in G/&#92;Gamma' title='x &#92;in G/&#92;Gamma' class='latex' />, and some function <img src='http://s0.wp.com/latex.php?latex=F%3A+G%2F%5CGamma+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F: G/&#92;Gamma &#92;to {&#92;Bbb C}' title='F: G/&#92;Gamma &#92;to {&#92;Bbb C}' class='latex' /> (also of &#8220;bounded complexity&#8221;; e.g. bounded Lipschitz norm will do).</p></blockquote>
<p>The precise formulation of the theorem is a little technical; see this <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.NT/0503014">paper of Ben and myself</a> for details.  Using this theorem and all the standard machinery, the task of establishing asymptotics such as (1) in the k=4 case now reduces to that of understanding sums such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bn%3D1%7D%5EN+F%28T%5En+x_0%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{n=1}^N F(T^n x_0)' title='&#92;sum_{n=1}^N F(T^n x_0)' class='latex' />.</p>
<p>At this point, one can start using the existing theory of equidistribution of orbits on homogeneous spaces <img src='http://s0.wp.com/latex.php?latex=G%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G/&#92;Gamma' title='G/&#92;Gamma' class='latex' /> (of which nilmanifolds are an important example).  It turns out that the existing theory is not quite quantitative enough for our purposes, and we had to develop a quantitative analogue of this theory; see <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/09/25/the-quantitative-behaviour-of-polynomial-orbits-on-nilmanifolds/">these</a> <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/07/13/the-mobius-and-nilsequences-conjecture/">blog</a> posts for more discussion.  Anyway, it all works, and gives asymptotics for progressions of length 4 in the primes, as well as other linear patterns of similar &#8220;complexity&#8221; (e.g. any non-degenerate system of two equations in four prime unknowns is OK).  To handle higher patterns, what we need is</p>
<blockquote><p><strong>Inverse conjecture for arithmetic progressions. </strong>(Informal statement)  Let <img src='http://s0.wp.com/latex.php?latex=k+%5Cgeq+3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;geq 3' title='k &#92;geq 3' class='latex' />.  Suppose that A is a subset of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> of density <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' /> for which (3) fails.  Then A correlates with a non-trivial <em><img src='http://s0.wp.com/latex.php?latex=k-2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k-2' title='k-2' class='latex' />-step nilsequence</em> <img src='http://s0.wp.com/latex.php?latex=%5Cchi%28n%29+%3D+F%28T%5En+x_0%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;chi(n) = F(T^n x_0)' title='&#92;chi(n) = F(T^n x_0)' class='latex' /> for some (k-2)-step nilmanifold <img src='http://s0.wp.com/latex.php?latex=G%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G/&#92;Gamma' title='G/&#92;Gamma' class='latex' /> (of bounded &#8220;complexity&#8221;), some group rotation <img src='http://s0.wp.com/latex.php?latex=T%3A+x+%5Cmapsto+gx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T: x &#92;mapsto gx' title='T: x &#92;mapsto gx' class='latex' />, some starting point <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+G%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;in G/&#92;Gamma' title='x &#92;in G/&#92;Gamma' class='latex' />, and some function <img src='http://s0.wp.com/latex.php?latex=F%3A+G%2F%5CGamma+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F: G/&#92;Gamma &#92;to {&#92;Bbb C}' title='F: G/&#92;Gamma &#92;to {&#92;Bbb C}' class='latex' /> (also of &#8220;bounded complexity&#8221;).</p></blockquote>
<p>This is a consequence of (and very closely related to) the inverse conjecture for the Gowers norm.  It is already known for k=3 and k=4, and hopefully the higher k cases will be resolved in the near future, presumably using a mix of techniques from Fourier analysis, additive combinatorics, and ergodic theory.  At this time of writing, this is the only remaining obstacle before we can understand the asymptotics of linear patterns in primes which genuinely involve two or more free parameters (as mentioned earlier, one-parameter problems such as the twin prime conjecture seem well out of reach of these methods for a number of reasons, one of which is that there is definitely no analogue of the inverse conjecture for such one-parameter patterns).</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[A counterexample to a strong polynomial Freiman-Ruzsa conjecture]]></title>
<link>http://terrytao.wordpress.com/2008/11/09/a-counterexample-to-a-strong-polynomial-freiman-ruzsa-conjecture/</link>
<pubDate>Sun, 09 Nov 2008 22:56:25 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/11/09/a-counterexample-to-a-strong-polynomial-freiman-ruzsa-conjecture/</guid>
<description><![CDATA[One of my favourite open problems in additive combinatorics is the polynomial Freiman-Ruzsa conjectu]]></description>
<content:encoded><![CDATA[<p>One of my favourite open problems in additive combinatorics is the polynomial Freiman-Ruzsa conjecture, which Ben Green <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/03/11/ben-green-the-polynomial-freiman-ruzsa-conjecture/">guest blogged about</a> here some time ago.  It has many equivalent formulations (which is always a healthy sign when considering a conjecture), but here is one involving &#8220;approximate homomorphisms&#8221;:</p>
<blockquote><p><strong>Polynomial Freiman-Ruzsa conjecture. </strong> Let <img src='http://s0.wp.com/latex.php?latex=f%3A+F_2%5En+%5Cto+F_2%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F_2^n &#92;to F_2^m' title='f: F_2^n &#92;to F_2^m' class='latex' /> be a function which is an approximate homomorphism in the sense that <img src='http://s0.wp.com/latex.php?latex=f%28x%2By%29-f%28x%29-f%28y%29+%5Cin+S&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x+y)-f(x)-f(y) &#92;in S' title='f(x+y)-f(x)-f(y) &#92;in S' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=x%2Cy+%5Cin+F_2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x,y &#92;in F_2^n' title='x,y &#92;in F_2^n' class='latex' /> and some set <img src='http://s0.wp.com/latex.php?latex=S+%5Csubset+F_2%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S &#92;subset F_2^m' title='S &#92;subset F_2^m' class='latex' />.  Then there exists a genuine homomorphism <img src='http://s0.wp.com/latex.php?latex=g%3A+F_2%5En+%5Cto+F_2%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g: F_2^n &#92;to F_2^m' title='g: F_2^n &#92;to F_2^m' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f-g&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f-g' title='f-g' class='latex' /> takes at most <img src='http://s0.wp.com/latex.php?latex=O%28+%26%23124%3BS%26%23124%3B%5E%7BO%281%29%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O( &#124;S&#124;^{O(1)} )' title='O( &#124;S&#124;^{O(1)} )' class='latex' /> values.</p></blockquote>
<p><strong>Remark 1.</strong> The key point here is that the bound on the range of <img src='http://s0.wp.com/latex.php?latex=f-g&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f-g' title='f-g' class='latex' /> is at most polynomial in &#124;S&#124;.  An exponential bound of <img src='http://s0.wp.com/latex.php?latex=2%5E%7B%26%23124%3BS%26%23124%3B%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^{&#124;S&#124;}' title='2^{&#124;S&#124;}' class='latex' /> can be trivially established by splitting <img src='http://s0.wp.com/latex.php?latex=F_2%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_2^m' title='F_2^m' class='latex' /> into the subspace spanned by S (which has size at most <img src='http://s0.wp.com/latex.php?latex=2%5E%7B%26%23124%3BS%26%23124%3B%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^{&#124;S&#124;}' title='2^{&#124;S&#124;}' class='latex' />) and some complementary subspace, and then letting g be the projection of f to that complementary subspace. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p>Recently, Ben Green and I have shown that this conjecture is equivalent to a certain polynomially quantitative strengthening of the inverse conjecture for the Gowers norm <img src='http://s0.wp.com/latex.php?latex=U%5E3%28F_2%5En%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U^3(F_2^n)' title='U^3(F_2^n)' class='latex' />; I hope to talk about this in a future post.  For this (somewhat technical) post, I want to comment on a possible further strengthening of this conjecture, namely</p>
<blockquote><p><strong>Strong Polynomial Freiman-Ruzsa conjecture. </strong> Let <img src='http://s0.wp.com/latex.php?latex=f%3A+F_2%5En+%5Cto+F_2%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F_2^n &#92;to F_2^m' title='f: F_2^n &#92;to F_2^m' class='latex' /> be a function which is an approximate homomorphism in the sense that <img src='http://s0.wp.com/latex.php?latex=f%28x%2By%29-f%28x%29-f%28y%29+%5Cin+S&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x+y)-f(x)-f(y) &#92;in S' title='f(x+y)-f(x)-f(y) &#92;in S' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=x%2Cy+%5Cin+F_2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x,y &#92;in F_2^n' title='x,y &#92;in F_2^n' class='latex' /> and some set <img src='http://s0.wp.com/latex.php?latex=S+%5Csubset+F_2%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S &#92;subset F_2^m' title='S &#92;subset F_2^m' class='latex' />.  Then there exists a genuine homomorphism <img src='http://s0.wp.com/latex.php?latex=g%3A+F_2%5En+%5Cto+F_2%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g: F_2^n &#92;to F_2^m' title='g: F_2^n &#92;to F_2^m' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f-g&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f-g' title='f-g' class='latex' /> takes values in the sumset <img src='http://s0.wp.com/latex.php?latex=CS+%3A%3D+S+%2B+%5Cldots+%2B+S&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='CS := S + &#92;ldots + S' title='CS := S + &#92;ldots + S' class='latex' /> for some fixed <img src='http://s0.wp.com/latex.php?latex=C%3DO%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='C=O(1)' title='C=O(1)' class='latex' />.</p></blockquote>
<p>This conjecture is known to be true for certain types of set S (e.g. for Hamming balls, this is a result <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=1770537">of Farah</a>).  Unfortunately, it is false in general; the purpose of this post is to describe one counterexample (related to the failure of the inverse conjecture for the Gowers norm for <img src='http://s0.wp.com/latex.php?latex=U%5E4%28F_2%5En%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U^4(F_2^n)' title='U^4(F_2^n)' class='latex' /> for classical polynomials; in particular, the arguments here have several features in common with those in the papers of <a class="snap_noshots" href="http://www.cs.huji.ac.il/~salex/papers/Gowers-false.pdf">Lovett-Meshulam-Samorodnitsky</a> and <a class="snap_noshots" href="http://arxiv.org/abs/0711.3191">Green-Tao</a>).  [A somewhat different counterexample also appears in the paper of Farah.]  The verification of the counterexample is surprisingly involved, ultimately relying on the multidimensional Szemerédi theorem of <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=531279">Furstenberg and Katznelson</a>.</p>
<p>(The results here are derived from forthcoming joint work with Ben Green.)</p>
<p style="text-align:center;"><!--more-->&#8211; Description of counterexample &#8211;</p>
<p>We let n be a large number, and replace <img src='http://s0.wp.com/latex.php?latex=F_2%5Em&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_2^m' title='F_2^m' class='latex' /> by the <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7Bn%28n%2B1%29%7D%7B2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;frac{n(n+1)}{2}' title='&#92;frac{n(n+1)}{2}' class='latex' />-dimensional vector space V of quadratic forms <img src='http://s0.wp.com/latex.php?latex=Q%3A+F_2%5En+%5Cto+F_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q: F_2^n &#92;to F_2' title='Q: F_2^n &#92;to F_2' class='latex' /> (with a basis given by the monomials <img src='http://s0.wp.com/latex.php?latex=x_i+x_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_i x_j' title='x_i x_j' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=1+%5Cleq+i+%5Cleq+j+%5Cleq+n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 &#92;leq i &#92;leq j &#92;leq n' title='1 &#92;leq i &#92;leq j &#92;leq n' class='latex' />).  We let <img src='http://s0.wp.com/latex.php?latex=f%3A+F_2%5En+%5Cto+V&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F_2^n &#92;to V' title='f: F_2^n &#92;to V' class='latex' /> be defined by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=f%28h_1%2C%5Cldots%2Ch_n%29%28x_1%2C%5Cldots%2Cx_n%29+%3A%3D+%5Csum_%7B1+%5Cleq+i+%26%2360%3B+j+%5Cleq+n%7D+h_i+x_i+h_j+x_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(h_1,&#92;ldots,h_n)(x_1,&#92;ldots,x_n) := &#92;sum_{1 &#92;leq i &lt; j &#92;leq n} h_i x_i h_j x_j' title='f(h_1,&#92;ldots,h_n)(x_1,&#92;ldots,x_n) := &#92;sum_{1 &#92;leq i &lt; j &#92;leq n} h_i x_i h_j x_j' class='latex' />.</p>
<p>A brief computation shows that for any <img src='http://s0.wp.com/latex.php?latex=h%2C+k+%5Cin+F_2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h, k &#92;in F_2^n' title='h, k &#92;in F_2^n' class='latex' />, the quadratic form <img src='http://s0.wp.com/latex.php?latex=f%28h%2Bk%29-f%28h%29-f%28k%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(h+k)-f(h)-f(k)' title='f(h+k)-f(h)-f(k)' class='latex' /> is of rank at most three, by which we mean that it is a function of at most three linear forms.  More specifically, we have</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=f%28h%2Bk%29-f%28h%29-f%28k%29+%3D+a_%7Bh%2Ck%7D+b_%7Bh%2Ck%7D+%2B+b_%7Bh%2Ck%7D+c_%7Bh%2Ck%7D+%2B+c_%7Bh%2Ck%7D+a_%7Bh%2Ck%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(h+k)-f(h)-f(k) = a_{h,k} b_{h,k} + b_{h,k} c_{h,k} + c_{h,k} a_{h,k}' title='f(h+k)-f(h)-f(k) = a_{h,k} b_{h,k} + b_{h,k} c_{h,k} + c_{h,k} a_{h,k}' class='latex' /> (1)</p>
<p>where</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=a_%7Bh%2Ck%7D%28x%29+%3D+%5Csum_%7Bi%3D1%7D%5En+h_i%281-k_i%29+x_i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a_{h,k}(x) = &#92;sum_{i=1}^n h_i(1-k_i) x_i' title='a_{h,k}(x) = &#92;sum_{i=1}^n h_i(1-k_i) x_i' class='latex' /></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=b_%7Bh%2Ck%7D%28x%29+%3D+%5Csum_%7Bi%3D1%7D%5En+%281-h_i%29+k_i+x_i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='b_{h,k}(x) = &#92;sum_{i=1}^n (1-h_i) k_i x_i' title='b_{h,k}(x) = &#92;sum_{i=1}^n (1-h_i) k_i x_i' class='latex' /></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_%7Bh%2Ck%7D%28x%29+%3D+%5Csum_%7Bi%3D1%7D%5En+h_i+k_i+x_i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{h,k}(x) = &#92;sum_{i=1}^n h_i k_i x_i' title='c_{h,k}(x) = &#92;sum_{i=1}^n h_i k_i x_i' class='latex' /></p>
<p style="text-align:left;">Thus, if we let S be the space of quadratic forms of rank at most 3, the hypotheses of the polynomial Freiman-Ruzsa conjecture hold.</p>
<p style="text-align:center;">&#8211; Verification of counterexample &#8211;</p>
<p>To establish the counterexample, we assume for contradiction that there exists a linear function <img src='http://s0.wp.com/latex.php?latex=g%3A+F_2%5En+%5Cto+V&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g: F_2^n &#92;to V' title='g: F_2^n &#92;to V' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=f%28h%29-g%28h%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(h)-g(h)' title='f(h)-g(h)' class='latex' /> has bounded rank for all h, and deduce a contradiction (for n sufficiently large).</p>
<p>By hypothesis, we have linear forms <img src='http://s0.wp.com/latex.php?latex=L_%7Bh%2C1%7D%2C%5Cldots%2CL_%7Bh%2Cd%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L_{h,1},&#92;ldots,L_{h,d}' title='L_{h,1},&#92;ldots,L_{h,d}' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=h+%5Cin+F_2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h &#92;in F_2^n' title='h &#92;in F_2^n' class='latex' /> and some <img src='http://s0.wp.com/latex.php?latex=d+%3D+O%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d = O(1)' title='d = O(1)' class='latex' /> and coefficients <img src='http://s0.wp.com/latex.php?latex=c_%7Bh%2Ci%2Cj%7D+%5Cin+F_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{h,i,j} &#92;in F_2' title='c_{h,i,j} &#92;in F_2' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=1+%5Cleq+i+%5Cleq+j+%5Cleq+d&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 &#92;leq i &#92;leq j &#92;leq d' title='1 &#92;leq i &#92;leq j &#92;leq d' class='latex' /> such that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+f%28h%29-g%28h%29+%3D+%5Csum_%7B1+%5Cleq+i+%5Cleq+j+%5Cleq+n%7D+c_%7Bh%2Ci%2Cj%7D+L_%7Bh%2Ci%7D+L_%7Bh%2Cj%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle f(h)-g(h) = &#92;sum_{1 &#92;leq i &#92;leq j &#92;leq n} c_{h,i,j} L_{h,i} L_{h,j}' title='&#92;displaystyle f(h)-g(h) = &#92;sum_{1 &#92;leq i &#92;leq j &#92;leq n} c_{h,i,j} L_{h,i} L_{h,j}' class='latex' /></p>
<p>and in particular (by (1) and linearity of g)</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+a_%7Bh%2Ck%7D+b_%7Bh%2Ck%7D+%2B+b_%7Bh%2Ck%7D+c_%7Bh%2Ck%7D+%2B+c_%7Bh%2Ck%7D+a_%7Bh%2Ck%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle a_{h,k} b_{h,k} + b_{h,k} c_{h,k} + c_{h,k} a_{h,k} ' title='&#92;displaystyle a_{h,k} b_{h,k} + b_{h,k} c_{h,k} + c_{h,k} a_{h,k} ' class='latex' /></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%3D+%5Csum_%7B1+%5Cleq+i+%5Cleq+j+%5Cleq+n%7D+c_%7Bh%2Bk%2Ci%2Cj%7D+L_%7Bh%2Bk%2Ci%7D+L_%7Bh%2Bk%2Cj%7D+-+c_%7Bh%2Ci%2Cj%7D+L_%7Bh%2Ci%7D+L_%7Bh%2Cj%7D+-+c_%7Bk%2Ci%2Cj%7D+L_%7Bk%2Ci%7D+L_%7Bk%2Cj%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle = &#92;sum_{1 &#92;leq i &#92;leq j &#92;leq n} c_{h+k,i,j} L_{h+k,i} L_{h+k,j} - c_{h,i,j} L_{h,i} L_{h,j} - c_{k,i,j} L_{k,i} L_{k,j}' title='&#92;displaystyle = &#92;sum_{1 &#92;leq i &#92;leq j &#92;leq n} c_{h+k,i,j} L_{h+k,i} L_{h+k,j} - c_{h,i,j} L_{h,i} L_{h,j} - c_{k,i,j} L_{k,i} L_{k,j}' class='latex' />. (2)</p>
<p>The key point is that the linear forms <img src='http://s0.wp.com/latex.php?latex=a_%7Bh%2Ck%7D%2C+b_%7Bh%2Ck%7D%2C+c_%7Bh%2Ck%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a_{h,k}, b_{h,k}, c_{h,k}' title='a_{h,k}, b_{h,k}, c_{h,k}' class='latex' /> are usually &#8220;independent&#8221; of the linear forms <img src='http://s0.wp.com/latex.php?latex=L_%7Bh%2Ci%7D%2C+L_%7Bk%2Ci%7D%2C+L_%7Bh%2Bk%2Ci%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L_{h,i}, L_{k,i}, L_{h+k,i}' title='L_{h,i}, L_{k,i}, L_{h+k,i}' class='latex' />.  The key lemma in this regard is</p>
<blockquote><p><strong>Lemma 1. </strong>If h, k are selected uniformly and independently at random, then with probability 1-o(1), <img src='http://s0.wp.com/latex.php?latex=a_%7Bh%2Ck%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a_{h,k}' title='a_{h,k}' class='latex' /> is not a linear combination of the <img src='http://s0.wp.com/latex.php?latex=L_%7Bh%2Ci%7D%2C+L_%7Bk%2Ci%7D%2C+L_%7Bh%2Bk%2Ci%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L_{h,i}, L_{k,i}, L_{h+k,i}' title='L_{h,i}, L_{k,i}, L_{h+k,i}' class='latex' />.  Similarly for <img src='http://s0.wp.com/latex.php?latex=b_%7Bh%2Ck%7D%2C+c_%7Bh%2Ck%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='b_{h,k}, c_{h,k}' title='b_{h,k}, c_{h,k}' class='latex' />.</p></blockquote>
<p><strong>Proof.</strong> By cyclically permuting h,k,h+k it suffices to show this for <img src='http://s0.wp.com/latex.php?latex=c_%7Bh%2Ck%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{h,k}' title='c_{h,k}' class='latex' />.  Since there are at most O(1) possible linear combinations amongst the <img src='http://s0.wp.com/latex.php?latex=L_%7Bh%2Ci%7D%2C+L_%7Bk%2Ci%7D%2C+L_%7Bh%2Bk%2Ci%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L_{h,i}, L_{k,i}, L_{h+k,i}' title='L_{h,i}, L_{k,i}, L_{h+k,i}' class='latex' />, it suffices to show that for any given assignments <img src='http://s0.wp.com/latex.php?latex=h+%5Cmapsto+L%27_h%2C+k+%5Cmapsto+L%27%27_k%2C+h%2Bk+%5Cmapsto+L%27%27%27_%7Bh%2Bk%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h &#92;mapsto L&#039;_h, k &#92;mapsto L&#039;&#039;_k, h+k &#92;mapsto L&#039;&#039;&#039;_{h+k}' title='h &#92;mapsto L&#039;_h, k &#92;mapsto L&#039;&#039;_k, h+k &#92;mapsto L&#039;&#039;&#039;_{h+k}' class='latex' /> of linear forms, that the probability of the event</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_%7Bh%2Ck%7D+%3D+L%27_h+%2B+L%27%27_k+%2B+L%27%27%27_%7Bh%2Bk%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{h,k} = L&#039;_h + L&#039;&#039;_k + L&#039;&#039;&#039;_{h+k}' title='c_{h,k} = L&#039;_h + L&#039;&#039;_k + L&#039;&#039;&#039;_{h+k}' class='latex' /> (3)</p>
<p>is o(1).  Suppose for contradiction that the event (3) holds for a set E of pairs (h,k) in <img src='http://s0.wp.com/latex.php?latex=F_2%5En+%5Ctimes+F_2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_2^n &#92;times F_2^n' title='F_2^n &#92;times F_2^n' class='latex' /> of positive density.  Applying the <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=531279">Furstenberg-Katznelson</a> multidimensional Szemerédi theorem) we can find (for n large enough) a square <img src='http://s0.wp.com/latex.php?latex=%28h%2Ck%29%2C+%28h%2Br%2Ck%29%2C+%28h%2Ck%2Br%29%2C+%28h%2Br%2Ck%2Br%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(h,k), (h+r,k), (h,k+r), (h+r,k+r)' title='(h,k), (h+r,k), (h,k+r), (h+r,k+r)' class='latex' /> in E with r non-zero.  Applying (3) for all four pairs and summing, we obtain</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=c_%7Bh%2Ck%7D+%2Bc_%7Bh%2Br%2Ck%7D+%2B+c_%7Bh%2Ck%2Br%7D+%2B+c_%7Bh%2Br%2Ck%2Br%7D+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{h,k} +c_{h+r,k} + c_{h,k+r} + c_{h+r,k+r} = 0' title='c_{h,k} +c_{h+r,k} + c_{h,k+r} + c_{h+r,k+r} = 0' class='latex' /></p>
<p>(recall we are in characteristic 2).  But the left-hand side is equal to the linear form <img src='http://s0.wp.com/latex.php?latex=%5Csum_i+r_i+x_i&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_i r_i x_i' title='&#92;sum_i r_i x_i' class='latex' />, which is non-zero, a contradiction. <img src='http://s0.wp.com/latex.php?latex=%5CBox&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Box' title='&#92;Box' class='latex' /></p>
<p>Now we can obtain the desired contradiction.  For a generic choice of h,k, we now know that none of the <img src='http://s0.wp.com/latex.php?latex=a_%7Bh%2Ck%7D%2C+b_%7Bh%2Ck%7D%2C+c_%7Bh%2Ck%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a_{h,k}, b_{h,k}, c_{h,k}' title='a_{h,k}, b_{h,k}, c_{h,k}' class='latex' /> are linear combinations of the <img src='http://s0.wp.com/latex.php?latex=L_%7Bh%2Ci%7D%2C+L_%7Bk%2Ci%7D%2C+L_%7Bh%2Bk%2Ci%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L_{h,i}, L_{k,i}, L_{h+k,i}' title='L_{h,i}, L_{k,i}, L_{h+k,i}' class='latex' />.  Thus, on a given level set of the  <img src='http://s0.wp.com/latex.php?latex=L_%7Bh%2Ci%7D%2C+L_%7Bk%2Ci%7D%2C+L_%7Bh%2Bk%2Ci%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='L_{h,i}, L_{k,i}, L_{h+k,i}' title='L_{h,i}, L_{k,i}, L_{h+k,i}' class='latex' /> (which form a subspace of <img src='http://s0.wp.com/latex.php?latex=F_2%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_2^n' title='F_2^n' class='latex' />), the linear functions <img src='http://s0.wp.com/latex.php?latex=a_%7Bh%2Ck%7D%2C+b_%7Bh%2Ck%7D%2C+c_%7Bh%2Ck%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a_{h,k}, b_{h,k}, c_{h,k}' title='a_{h,k}, b_{h,k}, c_{h,k}' class='latex' /> are non-constant, and so the range of the triplet <img src='http://s0.wp.com/latex.php?latex=%28a_%7Bh%2Ck%7D%2C+b_%7Bh%2Ck%7D%2C+c_%7Bh%2Ck%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(a_{h,k}, b_{h,k}, c_{h,k})' title='(a_{h,k}, b_{h,k}, c_{h,k})' class='latex' /> must be an affine subspace of <img src='http://s0.wp.com/latex.php?latex=F_2%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_2^3' title='F_2^3' class='latex' /> which is not contained in any plane in <img src='http://s0.wp.com/latex.php?latex=F_2%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_2^3' title='F_2^3' class='latex' /> that is parallel to the two of the coordinate axes (i.e. of the form <img src='http://s0.wp.com/latex.php?latex=a%3Dconst&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a=const' title='a=const' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=b%3Dconst&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='b=const' title='b=const' class='latex' />, or <img src='http://s0.wp.com/latex.php?latex=c%3Dconst&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c=const' title='c=const' class='latex' />.  This forces the subspace to be parallel to (1,1,1).  But this implies that <img src='http://s0.wp.com/latex.php?latex=%28a%2Cb%2Cc%29+%5Cmapsto+ab%2Bbc%2Bca&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(a,b,c) &#92;mapsto ab+bc+ca' title='(a,b,c) &#92;mapsto ab+bc+ca' class='latex' /> is non-constant on this space, contradicting (2).</p>
<p style="text-align:left;"><strong>Remark 2.</strong> The function f appearing in the above example is closely related to the symmetric polynomial</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+S_4%28x%29+%3A%3D+%5Csum_%7B1+%5Cleq+i+%26%2360%3B+j+%26%2360%3B+k+%26%2360%3B+l+%5Cleq+n%7D+x_i+x_j+x_k+x_l&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle S_4(x) := &#92;sum_{1 &#92;leq i &lt; j &lt; k &lt; l &#92;leq n} x_i x_j x_k x_l' title='&#92;displaystyle S_4(x) := &#92;sum_{1 &#92;leq i &lt; j &lt; k &lt; l &#92;leq n} x_i x_j x_k x_l' class='latex' />.</p>
<p style="text-align:left;">Indeed, one can show that the derivative <img src='http://s0.wp.com/latex.php?latex=S_4%28x%2Bh%29-S_4%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_4(x+h)-S_4(x)' title='S_4(x+h)-S_4(x)' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=S_4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_4' title='S_4' class='latex' /> is equal to <img src='http://s0.wp.com/latex.php?latex=f%28h%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(h)' title='f(h)' class='latex' />, plus some additional terms which involve only a finite number of linear forms, and the quadratic polynomial <img src='http://s0.wp.com/latex.php?latex=S_2%28x%29+%3A%3D+%5Csum_%7B1+%5Cleq+i+%26%2360%3B+j+%5Cleq+n%7D+x_i+x_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_2(x) := &#92;sum_{1 &#92;leq i &lt; j &#92;leq n} x_i x_j' title='S_2(x) := &#92;sum_{1 &#92;leq i &lt; j &#92;leq n} x_i x_j' class='latex' />.    If it was the case that <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f' title='f' class='latex' /> could be approximated by a linear map <img src='http://s0.wp.com/latex.php?latex=g&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g' title='g' class='latex' /> modulo low rank errors, then it one could use this to eventually show that <img src='http://s0.wp.com/latex.php?latex=S_4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_4' title='S_4' class='latex' /> correlated with a cubic polynomial; but it is known (from the papers of <a class="snap_noshots" href="http://www.cs.huji.ac.il/~salex/papers/Gowers-false.pdf">Lovett-Meshulam-Samorodnitsky</a> and <a class="snap_noshots" href="http://arxiv.org/abs/0711.3191">Green-Tao</a>) that this is not the case.  Thus there is an alternate way to verify that the above example is indeed a counterexample to the strong polynomial Freiman-Ruzsa conjecture. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[The inverse conjecture for the Gowers norm over finite fields via the correspondence principle]]></title>
<link>http://terrytao.wordpress.com/2008/11/01/the-inverse-conjecture-for-the-gowers-norm-over-finite-fields-via-the-correspondence-principle/</link>
<pubDate>Sat, 01 Nov 2008 19:04:44 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/11/01/the-inverse-conjecture-for-the-gowers-norm-over-finite-fields-via-the-correspondence-principle/</guid>
<description><![CDATA[Tamar Ziegler and I have just uploaded to the arXiv our paper, &#8220;The inverse conjecture for the]]></description>
<content:encoded><![CDATA[<p><a class="snap_noshots" href="http://www.technion.ac.il/~tamarzr/">Tamar Ziegler</a> and I have just uploaded to the <a class="snap_noshots" href="http://www.arxiv.org">arXiv</a> our paper, &#8220;<a class="snap_noshots" href="http://arxiv.org/abs/0810.5527">The inverse conjecture for the Gowers norm over finite fields via the correspondence principle</a>&#8220;, submitted to <a class="snap_noshots" href="http://pjm.math.berkeley.edu/apde/about/cover/cover.html">Analysis &#38; PDE</a>.  As announced a few months ago in <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/08/30/the-correspondence-principle-and-finitary-ergodic-theory/">this blog post</a>, this paper establishes (most of) the inverse conjecture for the Gowers norm from an ergodic theory analogue of this conjecture (in a forthcoming paper by <a class="snap_noshots" href="http://www.math.ohio-state.edu/~vitaly/">Vitaly Bergelson</a>, Tamar Ziegler, and myself, which should be ready shortly), using a variant of the Furstenberg correspondence principle.  Our papers were held up for a while due to some unexpected technical difficulties arising in the low characteristic case; as a consequence, our paper only establishes the full inverse conjecture in the high characteristic case <img src='http://s0.wp.com/latex.php?latex=p+%5Cgeq+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &#92;geq k' title='p &#92;geq k' class='latex' />, and gives a partial result in the low characteristic case <img src='http://s0.wp.com/latex.php?latex=p+%26%2360%3B+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &lt; k' title='p &lt; k' class='latex' />.</p>
<p>In the rest of this post, I would like to describe the inverse conjecture (in both combinatorial and ergodic forms), and sketch how one deduces one from the other via the correspondence principle (together with two additional ingredients, namely a statistical sampling lemma and a local testability result for polynomials).</p>
<p style="text-align:center;"><!--more-->&#8211; Polynomials and the Gowers norm &#8211;</p>
<p>Let F be a finite field, and let <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> be a vector space over that field.  Given any function <img src='http://s0.wp.com/latex.php?latex=P%3A+F%5En+%5Cto+%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P: F^n &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' title='P: F^n &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' class='latex' /> taking values in the unit circle <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}/{&#92;Bbb Z}' title='{&#92;Bbb R}/{&#92;Bbb Z}' class='latex' />, we define the<em> additive derivative</em> <img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%2B+P%3A+F%5En+%5Cto+%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^+ P: F^n &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' title='&#92;Delta^+ P: F^n &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' class='latex' /> of P in the direction h by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%2B_h+f%28x%29+%3A%3D+f%28x%2Bh%29+-+f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^+_h f(x) := f(x+h) - f(x)' title='&#92;Delta^+_h f(x) := f(x+h) - f(x)' class='latex' />.</p>
<p>Now let <img src='http://s0.wp.com/latex.php?latex=k+%5Cgeq+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;geq 0' title='k &#92;geq 0' class='latex' />.  A function <img src='http://s0.wp.com/latex.php?latex=P%3A+F%5En+%5Cto+%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P: F^n &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' title='P: F^n &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' class='latex' /> is said to be a <em>polynomial of degree <img src='http://s0.wp.com/latex.php?latex=%26%2360%3Bk&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&lt;k' title='&lt;k' class='latex' /></em> if one has <img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%2B_%7Bh_1%7D+%5Cldots+%5CDelta%5E%2B_%7Bh_k%7D+P+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^+_{h_1} &#92;ldots &#92;Delta^+_{h_k} P = 0' title='&#92;Delta^+_{h_1} &#92;ldots &#92;Delta^+_{h_k} P = 0' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=h_1%2C%5Cldots%2Ch_k+%5Cin+F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h_1,&#92;ldots,h_k &#92;in F^n' title='h_1,&#92;ldots,h_k &#92;in F^n' class='latex' />.  Thus for instance, the only polynomial of degree &#60;0 is the zero function, the only polynomials of degree &#60;1 are the constants, the only polynomial of degree &#60;2 are the <a href="http://en.wikipedia.org/wiki/Affine_transformation">affine</a> <a href="http://en.wikipedia.org/wiki/Character_(mathematics)">characters</a>, and so forth.  If <img src='http://s0.wp.com/latex.php?latex=F%3DF_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F=F_p' title='F=F_p' class='latex' /> is a field of prime order, and we make the additional assumption that P takes values in the <img src='http://s0.wp.com/latex.php?latex=p%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^{th}' title='p^{th}' class='latex' /> roots of unity (which we can identify with F in the usual manner), then we can express the polynomial P in the customary manner as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=P%28x_1%2C%5Cldots%2Cx_n%29+%3D+%5Csum_%7Bi_1%2B%5Cldots%2Bi_n+%26%2360%3B+k%7D+c_%7Bi_1%2C%5Cldots%2Ci_n%7D+x_1%5E%7Bi_1%7D+%5Cldots+x_n%5E%7Bi_n%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P(x_1,&#92;ldots,x_n) = &#92;sum_{i_1+&#92;ldots+i_n &lt; k} c_{i_1,&#92;ldots,i_n} x_1^{i_1} &#92;ldots x_n^{i_n}' title='P(x_1,&#92;ldots,x_n) = &#92;sum_{i_1+&#92;ldots+i_n &lt; k} c_{i_1,&#92;ldots,i_n} x_1^{i_1} &#92;ldots x_n^{i_n}' class='latex' /> (1)</p>
<p>for some coefficients <img src='http://s0.wp.com/latex.php?latex=c_%7Bi_1%2C%5Cldots%2Ci_n%7D+%5Cin+F&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_{i_1,&#92;ldots,i_n} &#92;in F' title='c_{i_1,&#92;ldots,i_n} &#92;in F' class='latex' />; let us refer to these as <em>classical polynomials of degree &#60;k</em>.  However, if one does not require P to take values in <img src='http://s0.wp.com/latex.php?latex=p%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^{th}' title='p^{th}' class='latex' /> roots of unity, then the above definition also encompasses some non-classical polynomials.  For instance, if <img src='http://s0.wp.com/latex.php?latex=F+%3D+F_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F = F_2' title='F = F_2' class='latex' />, the function <img src='http://s0.wp.com/latex.php?latex=P%3A+F+%5Cto+%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P: F &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' title='P: F &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' class='latex' /> defined by P(0)=0 and P(1)=1/4 is of degree &#60;3 (i.e. is a &#8220;quadratic&#8221; polynomial), but is not classical.  [However, it is a nice exercise to show that in the high characteristic case <img src='http://s0.wp.com/latex.php?latex=p+%5Cgeq+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &#92;geq k' title='p &#92;geq k' class='latex' />, every polynomial of degree <img src='http://s0.wp.com/latex.php?latex=%26%2360%3Bk&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&lt;k' title='&lt;k' class='latex' /> can be expressed as the sum of a classical polynomial and a constant; thus the genuinely non-classical polynomials are a purely low characteristic phenomenon.  I may write more about the relationship between classical and non-classical polynomials in a future post.]</p>
<p>We can define multiplicative analogues of the above concept.  Given a function <img src='http://s0.wp.com/latex.php?latex=f%3A+F%5En+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F^n &#92;to {&#92;Bbb C}' title='f: F^n &#92;to {&#92;Bbb C}' class='latex' />, we define the <em>multiplicative derivative</em> <img src='http://s0.wp.com/latex.php?latex=%5CDelta_h%5E%5Ctimes+f%3A+F%5En+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta_h^&#92;times f: F^n &#92;to {&#92;Bbb C}' title='&#92;Delta_h^&#92;times f: F^n &#92;to {&#92;Bbb C}' class='latex' /> to be the function</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%5Ctimes_h+f%28x%29+%3A%3D+f%28x%2Bh%29+%5Coverline%7Bf%28x%29%7D%2C&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^&#92;times_h f(x) := f(x+h) &#92;overline{f(x)},' title='&#92;Delta^&#92;times_h f(x) := f(x+h) &#92;overline{f(x)},' class='latex' /></p>
<p>and say that f is a <em>phase polynomial of degree &#60;k</em> if <img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%5Ctimes_%7Bh_1%7D+%5Cldots+%5CDelta%5E%5Ctimes_%7Bh_k%7D+f+%3D+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f = 1' title='&#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f = 1' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=h_1%2C%5Cldots%2Ch_k+%5Cin+F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h_1,&#92;ldots,h_k &#92;in F^n' title='h_1,&#92;ldots,h_k &#92;in F^n' class='latex' />.  It is not hard to show that f is a phase polynomial of degree &#60;k if and only if <img src='http://s0.wp.com/latex.php?latex=f%3De%28P%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f=e(P)' title='f=e(P)' class='latex' /> for some polynomial <img src='http://s0.wp.com/latex.php?latex=P%3A+F%5En+%5Cto+%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P: F^n &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' title='P: F^n &#92;to {&#92;Bbb R}/{&#92;Bbb Z}' class='latex' /> of degree &#60;k, where <img src='http://s0.wp.com/latex.php?latex=e%3A%7B%5CBbb+R%7D%2F%7B%5CBbb+Z%7D+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e:{&#92;Bbb R}/{&#92;Bbb Z} &#92;to {&#92;Bbb C}' title='e:{&#92;Bbb R}/{&#92;Bbb Z} &#92;to {&#92;Bbb C}' class='latex' /> is the standard character <img src='http://s0.wp.com/latex.php?latex=e%28x%29+%3D+e%5E%7B2%5Cpi+i+x%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e(x) = e^{2&#92;pi i x}' title='e(x) = e^{2&#92;pi i x}' class='latex' />.</p>
<p>The <em>Gowers uniformity norms</em> are a means to measure the extent to which an arbitrary function <img src='http://s0.wp.com/latex.php?latex=f%3A+F%5En+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F^n &#92;to {&#92;Bbb C}' title='f: F^n &#92;to {&#92;Bbb C}' class='latex' /> behaves like a phase polynomial; if <img src='http://s0.wp.com/latex.php?latex=k+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;geq 1' title='k &#92;geq 1' class='latex' />, they are defined by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3Bf%5C%26%23124%3B_%7BU%5Ek%28F%5En%29%7D+%3A%3D+%5B%7B%5CBbb+E%7D_%7Bx%2Ch_1%2C%5Cldots%2Ch_k+%5Cin+F%5En%7D+%5CDelta%5E%5Ctimes_%7Bh_1%7D+%5Cldots+%5CDelta%5E%5Ctimes_%7Bh_k%7D+f%28x%29%5D%5E%7B1%2F2%5Ek%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124;f&#92;&#124;_{U^k(F^n)} := [{&#92;Bbb E}_{x,h_1,&#92;ldots,h_k &#92;in F^n} &#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f(x)]^{1/2^k}' title='&#92;&#124;f&#92;&#124;_{U^k(F^n)} := [{&#92;Bbb E}_{x,h_1,&#92;ldots,h_k &#92;in F^n} &#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f(x)]^{1/2^k}' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D_%7Ba+%5Cin+A%7D+f%28a%29+%3A%3D+%5Cfrac%7B1%7D%7B%26%23124%3BA%26%23124%3B%7D+%5Csum_%7Ba+%5Cin+A%7D+f%28a%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}_{a &#92;in A} f(a) := &#92;frac{1}{&#124;A&#124;} &#92;sum_{a &#92;in A} f(a)' title='{&#92;Bbb E}_{a &#92;in A} f(a) := &#92;frac{1}{&#124;A&#124;} &#92;sum_{a &#92;in A} f(a)' class='latex' /> denotes the average of f on A.  Thus, for instance, if f is bounded in magnitude by 1, then <img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3Bf%5C%26%23124%3B_%7BU%5Ek%28F%5En%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124;f&#92;&#124;_{U^k(F^n)}' title='&#92;&#124;f&#92;&#124;_{U^k(F^n)}' class='latex' /> is a number between 0 and 1, and equals 1 if and only if f is a phase polynomial of degree &#60;k.</p>
<p>We have just seen that exact extremisers of the Gowers norms are given by phase polynomials.  This statement is in fact robust, in that near-exact extremisers of the Gowers norms are close to phase polynomials:</p>
<blockquote><p><strong>Theorem 1</strong> (local testabilty of phase polynomials)  Suppose that <img src='http://s0.wp.com/latex.php?latex=f%3A+F%5En+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F^n &#92;to {&#92;Bbb C}' title='f: F^n &#92;to {&#92;Bbb C}' class='latex' /> is bounded in magnitude by 1 and obeys the bound <img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3Bf%5C%26%23124%3B_%7BU%5Ek%28F%5En%29%7D+%5Cgeq+1-%5Cvarepsilon&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124;f&#92;&#124;_{U^k(F^n)} &#92;geq 1-&#92;varepsilon' title='&#92;&#124;f&#92;&#124;_{U^k(F^n)} &#92;geq 1-&#92;varepsilon' class='latex' />.  Then there exists a phase polynomial g of degree &#60;k such that <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D_x+%26%23124%3Bf%28x%29-g%28x%29%26%23124%3B+%3D+o_%7B%5Cvarepsilon+%5Cto+0%3Bk%7D%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}_x &#124;f(x)-g(x)&#124; = o_{&#92;varepsilon &#92;to 0;k}(1)' title='{&#92;Bbb E}_x &#124;f(x)-g(x)&#124; = o_{&#92;varepsilon &#92;to 0;k}(1)' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=o_%7B%5Cvarepsilon+%5Cto+0%3Bk%2CF%7D%281%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='o_{&#92;varepsilon &#92;to 0;k,F}(1)' title='o_{&#92;varepsilon &#92;to 0;k,F}(1)' class='latex' /> denotes a quantity that goes to zero as <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon+%5Cto+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon &#92;to 0' title='&#92;varepsilon &#92;to 0' class='latex' /> for fixed k, F, uniformly in the choice of f or n.</p></blockquote>
<p>This fact is essentially due <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2080792">to Alon, Kaufman, Krivelevich, Litsyn, and Ron</a>; their paper is focused on the case when <img src='http://s0.wp.com/latex.php?latex=F%3DF_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F=F_2' title='F=F_2' class='latex' /> and P is a classical polynomial, but it is not too difficult to extend the result to the generality of Theorem 1 (this is done in an appendix to our paper, as we need this local testability result in our arguments).  One consequence of this theorem is that one can test (with high confidence) whether a given function is close to a phase polynomial of degree &#60;k, by randomly computing some derivatives <img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%5Ctimes_%7Bh_1%7D+%5Cldots+%5CDelta%5E%5Ctimes_%7Bh_k%7D+f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f(x)' title='&#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f(x)' class='latex' /> and seeing whether they are consistently close to 1.  Note that this test is <em>local</em>, in the sense that one only has to evaluate f at a bounded number of positions in <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' /> (even as <img src='http://s0.wp.com/latex.php?latex=n+%5Cto+%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;to &#92;infty' title='n &#92;to &#92;infty' class='latex' />) in order to perform the test.</p>
<p>Theorem 1 describes what happens when a function &#8220;behaves like a phase polynomial 99% of the time&#8221;, in the sense that the Gowers norm is very close to 1.  The inverse conjecture for the Gowers norm addresses the more general case in which a function &#8220;behaves like a phase polynomial 51% of the time&#8221;, in the sense that the Gowers norm is bounded below by some small constant <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' />.  (Note that if <img src='http://s0.wp.com/latex.php?latex=F%3DF_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F=F_2' title='F=F_2' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=f%3A+F%5En+%5Cto+%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F^n &#92;to &#92;{-1,+1&#92;}' title='f: F^n &#92;to &#92;{-1,+1&#92;}' class='latex' /> was a random boolean function, then <img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%5Ctimes_%7Bh_1%7D+%5Cldots+%5CDelta%5E%5Ctimes_%7Bh_k%7D+f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f(x)' title='&#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f(x)' class='latex' /> would equal +1 about 50% of the time and -1 for the other 50%, leading to a very small Gowers norm.)  More precisely, we have</p>
<blockquote><p><strong>Inverse Conjecture for the Gowers norm over finite fields.</strong> Let <img src='http://s0.wp.com/latex.php?latex=f%3A+F%5En+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: F^n &#92;to {&#92;Bbb C}' title='f: F^n &#92;to {&#92;Bbb C}' class='latex' /> be a function bounded in magnitude by 1 such that <img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3Bf%5C%26%23124%3B_%7BU%5Ek%28F%5En%29%7D+%5Cgeq+%5Cdelta+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124;f&#92;&#124;_{U^k(F^n)} &#92;geq &#92;delta &gt; 0' title='&#92;&#124;f&#92;&#124;_{U^k(F^n)} &#92;geq &#92;delta &gt; 0' class='latex' />.  Then there exists a phase polynomial g of degree &#60;k such that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3B%7B%5CBbb+E%7D+f%28x%29+%5Coverline%7Bg%28x%29%7D%26%23124%3B+%5Cgeq+c%28k%2CF%2C%5Cdelta%29+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;{&#92;Bbb E} f(x) &#92;overline{g(x)}&#124; &#92;geq c(k,F,&#92;delta) &gt; 0' title='&#124;{&#92;Bbb E} f(x) &#92;overline{g(x)}&#124; &#92;geq c(k,F,&#92;delta) &gt; 0' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=c%28k%2CF%2C%5Cdelta%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c(k,F,&#92;delta)' title='c(k,F,&#92;delta)' class='latex' /> independent of n.</p></blockquote>
<p>This conjecture has a number of applications to the structural theory of additive patterns (e.g. arithmetic progressions) in subsets of <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' />; see for instance this <a class="snap_noshots" href="http://arxiv.org/abs/0711.0185">paper of Gowers-Wolf</a>.  It is also connected to various problems in computer science, such as generation of pseudorandom bits (see e.g. this <a class="snap_noshots" href="http://projectamericano.com/adib/pubs/lowdeggen.pdf">paper of Bogdanov-Viola</a>), and to polynomiality testing (see e.g. this <a class="snap_noshots" href="http://www.cs.huji.ac.il/~salex/papers/gowers.pdf">paper of Samorodnitsky-Trevisan</a>).</p>
<p>Previously, this conjecture had been verified for <img src='http://s0.wp.com/latex.php?latex=k+%5Cleq+3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;leq 3' title='k &#92;leq 3' class='latex' /> by <a class="snap_noshots" href="http://www.cs.huji.ac.il/~salex/papers/low_degree.ps">Samorodnitsky</a> (in even <a href="http://en.wikipedia.org/wiki/Characteristic_(algebra)">characteristic</a>) and by <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.NT/0503014">Ben Green and myself</a> (in odd characteristic).  In high characteristic <img src='http://s0.wp.com/latex.php?latex=p+%5Cgeq+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &#92;geq k' title='p &#92;geq k' class='latex' />, a partial result (assuming that f was already a classical polynomial of bounded degree) was obtained by <a class="snap_noshots" href="http://arxiv.org/abs/0711.3191">Ben and myself</a>; but in low characteristic, it was discovered in that paper and simultaneously by <a class="snap_noshots" href="http://www.cs.huji.ac.il/~salex/papers/Gowers-false.pdf">Lovett-Meshulam-Samorodnitsky</a> that the conjecture can fail if one restricts g to be a classical phase polynomial.  Nevertheless we believe the conjecture should still in the low characteristic case hold if we allow g to be non-classical.</p>
<p>Our main results are as follows:</p>
<blockquote><p><strong>Theorem 2. </strong> The inverse conjecture is true in the high characteristic case <img src='http://s0.wp.com/latex.php?latex=p+%5Cgeq+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &#92;geq k' title='p &#92;geq k' class='latex' />.</p></blockquote>
<blockquote><p><strong>Theorem 3.</strong> In general characteristic, a weaker form of the inverse conjecture is true, in which we allow g to be a phase polynomial of degree <img src='http://s0.wp.com/latex.php?latex=%26%2360%3BC%28k%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&lt;C(k)' title='&lt;C(k)' class='latex' /> rather than <img src='http://s0.wp.com/latex.php?latex=%26%2360%3Bk&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&lt;k' title='&lt;k' class='latex' />, for some C(k) depending only on k.</p></blockquote>
<p>As mentioned above, we expect that we should be able to take C(k)=k in any characteristic, but this seems to require a delicate algebraic analysis and we were not able to establish it here.</p>
<p style="text-align:center;">&#8211; Ergodic theory &#8211;</p>
<p style="text-align:left;">We were able to deduce the combinatorial results in Theorems 2, 3 from ergodic analogues of these results.  Roughly speaking, these ergodic results correspond to &#8220;local&#8221; versions of the &#8220;global&#8221; theorems in Theorem 2, 3, because they involve averaging over only a small portion of the underlying space, as opposed to the global averages used to define the Gowers norms.  [In this regard, the ergodic theory results are weaker than the combinatorial ones.  However, there is another direction in which the ergodic results are stronger, in that they encode some additional "measurability" information that is not present in the combinatorial setting; roughly speaking, this information asserts that the function g appearing in the above conjecture can be "locally reconstructed" from f in some sense, as opposed to merely existing in some non-constructive sense.] Fortunately, thanks to some &#8220;local-to-global&#8221; principles (including the local testability result in Theorem 1), we are able to pass back and forth between the two types of theorems in a fairly &#8220;soft&#8221; manner.</p>
<p style="text-align:left;">Whereas the combinatorial results take place in the finitary setting of a finite-dimensional vector space <img src='http://s0.wp.com/latex.php?latex=F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^n' title='F^n' class='latex' />, the ergodic theory results take place in the infinitary setting of a probability space <img src='http://s0.wp.com/latex.php?latex=%28X%2C+%7B%5Cmathcal+B%7D%2C+%5Cmu%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(X, {&#92;mathcal B}, &#92;mu)' title='(X, {&#92;mathcal B}, &#92;mu)' class='latex' /> with a measure-preserving action <img src='http://s0.wp.com/latex.php?latex=%28T_g%29_%7Bg+%5Cin+F%5E%7B%5Comega%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(T_g)_{g &#92;in F^{&#92;omega}}' title='(T_g)_{g &#92;in F^{&#92;omega}}' class='latex' /> of the countably infinite vector space <img src='http://s0.wp.com/latex.php?latex=F%5E%5Comega+%3A%3D+%5Clim_%7Bn+%5Cto+%5Cinfty%7D+F%5En+%3D+%5Cbigcup_n+F%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^&#92;omega := &#92;lim_{n &#92;to &#92;infty} F^n = &#92;bigcup_n F^n' title='F^&#92;omega := &#92;lim_{n &#92;to &#92;infty} F^n = &#92;bigcup_n F^n' class='latex' />.   Given a measurable function <img src='http://s0.wp.com/latex.php?latex=f%3A+X+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: X &#92;to {&#92;Bbb C}' title='f: X &#92;to {&#92;Bbb C}' class='latex' />, one can define multiplicative derivatives</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%5Ctimes_h+f%28x%29+%3A%3D+f%28+T_h+x+%29+%5Coverline%7Bf%28x%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^&#92;times_h f(x) := f( T_h x ) &#92;overline{f(x)}' title='&#92;Delta^&#92;times_h f(x) := f( T_h x ) &#92;overline{f(x)}' class='latex' /></p>
<p style="text-align:left;">for <img src='http://s0.wp.com/latex.php?latex=h+%5Cin+F%5E%5Comega&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h &#92;in F^&#92;omega' title='h &#92;in F^&#92;omega' class='latex' />, and we say that f is a <em>phase polynomial of degree &#60;k</em> if <img src='http://s0.wp.com/latex.php?latex=%5CDelta%5E%5Ctimes_%7Bh_1%7D+%5Cldots+%5CDelta%5E%5Ctimes_%7Bh_k%7D+f+%3D+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f = 1' title='&#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f = 1' class='latex' /> a.e. for all <img src='http://s0.wp.com/latex.php?latex=h_1%2C%5Cldots%2Ch_k+%5Cin+F%5E%5Comega&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h_1,&#92;ldots,h_k &#92;in F^&#92;omega' title='h_1,&#92;ldots,h_k &#92;in F^&#92;omega' class='latex' />.  We can also define the <em>Gowers-Host-Kra uniformity seminorm</em> <img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3Bf%5C%26%23124%3B_%7BU%5Ek%28X%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124;f&#92;&#124;_{U^k(X)}' title='&#92;&#124;f&#92;&#124;_{U^k(X)}' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=k+%5Cgeq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;geq 1' title='k &#92;geq 1' class='latex' /> by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5C%26%23124%3Bf%5C%26%23124%3B_%7BU%5Ek%28X%29%7D+%3A%3D+%28%5Clim_%7Bn_1+%5Cto+%5Cinfty%7D+%5Cldots+%5Clim_%7Bn_k+%5Cto+%5Cinfty%7D+%7B%5CBbb+E%7D_%7Bh_1+%5Cin+F%5E%7Bn_1%7D%2C+%5Cldots%2C+h_k+%5Cin+F%5E%7Bn_k%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;&#124;f&#92;&#124;_{U^k(X)} := (&#92;lim_{n_1 &#92;to &#92;infty} &#92;ldots &#92;lim_{n_k &#92;to &#92;infty} {&#92;Bbb E}_{h_1 &#92;in F^{n_1}, &#92;ldots, h_k &#92;in F^{n_k}}' title='&#92;displaystyle &#92;&#124;f&#92;&#124;_{U^k(X)} := (&#92;lim_{n_1 &#92;to &#92;infty} &#92;ldots &#92;lim_{n_k &#92;to &#92;infty} {&#92;Bbb E}_{h_1 &#92;in F^{n_1}, &#92;ldots, h_k &#92;in F^{n_k}}' class='latex' /></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cint_X+%5CDelta%5E%5Ctimes_%7Bh_1%7D+%5Cldots+%5CDelta%5E%5Ctimes_%7Bh_k%7D+f%5C+d%5Cmu%29%5E%7B1%2F2%5Ek%7D%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;int_X &#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f&#92; d&#92;mu)^{1/2^k};' title='&#92;displaystyle &#92;int_X &#92;Delta^&#92;times_{h_1} &#92;ldots &#92;Delta^&#92;times_{h_k} f&#92; d&#92;mu)^{1/2^k};' class='latex' /></p>
<p style="text-align:left;">it can be shown that the limits exist and that this is indeed a seminorm.  Thus for instance, as in the combinatorial setting, phase polynomials of degree &#60;k have a Gowers-Host-Kra seminorm of 1.</p>
<p style="text-align:left;">There is an analogue of the inverse conjecture:</p>
<blockquote><p><strong>Ergodic Inverse Conjecture for the Gowers norm over finite fields.</strong> Suppose that X is <a href="http://en.wikipedia.org/wiki/Ergodic_(adjective)">ergodic</a>.  Let <img src='http://s0.wp.com/latex.php?latex=f+%5Cin+L%5E%5Cinfty%28X%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f &#92;in L^&#92;infty(X)' title='f &#92;in L^&#92;infty(X)' class='latex' /> be such that <img src='http://s0.wp.com/latex.php?latex=%5C%26%23124%3Bf%5C%26%23124%3B_%7BU%5Ek%28X%29%7D+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;&#124;f&#92;&#124;_{U^k(X)} &gt; 0' title='&#92;&#124;f&#92;&#124;_{U^k(X)} &gt; 0' class='latex' />.  Then there exists a phase polynomial g of degree &#60;k such that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3B%5Cint_X+f+%5Coverline%7Bg%7D%5C+d%5Cmu%26%23124%3B+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;&#92;int_X f &#92;overline{g}&#92; d&#92;mu&#124; &gt; 0' title='&#124;&#92;int_X f &#92;overline{g}&#92; d&#92;mu&#124; &gt; 0' class='latex' />.</p></blockquote>
<p>In a forthcoming paper of Bergelson, Ziegler, and myself, we show</p>
<blockquote><p><strong>Theorem 2&#8242;. </strong> The ergodic inverse conjecture is true in the high characteristic case <img src='http://s0.wp.com/latex.php?latex=p+%5Cgeq+k&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p &#92;geq k' title='p &#92;geq k' class='latex' />.</p></blockquote>
<blockquote><p><strong>Theorem 3&#8242;.</strong> In general characteristic, a weaker form of the ergodic inverse conjecture is true, in which we allow g to be a phase polynomial of degree <img src='http://s0.wp.com/latex.php?latex=%26%2360%3BC%28k%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&lt;C(k)' title='&lt;C(k)' class='latex' /> rather than <img src='http://s0.wp.com/latex.php?latex=%26%2360%3Bk&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&lt;k' title='&lt;k' class='latex' />, for some C(k) depending only on k.</p></blockquote>
<p style="text-align:left;">The aim of the current paper is to deduce Theorems 2, 3 from Theorems 2&#8242;, 3&#8242; respectively.</p>
<p style="text-align:center;">&#8211; Outline of proof &#8211;</p>
<p style="text-align:left;">The main idea here is to use the compactness and contradiction argument as embodied in the Furstenberg correspondence principle (which was discussed in my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/08/30/the-correspondence-principle-and-finitary-ergodic-theory/">previous blog post</a>).  Suppose for instance that Theorem 2 failed; then one would have a sequence of bounded functions <img src='http://s0.wp.com/latex.php?latex=f_j%3A+F%5E%7Bn_j%7D+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_j: F^{n_j} &#92;to {&#92;Bbb C}' title='f_j: F^{n_j} &#92;to {&#92;Bbb C}' class='latex' /> (where <img src='http://s0.wp.com/latex.php?latex=n_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n_j' title='n_j' class='latex' /> is going off to infinity) with Gowers norms of order k uniformly bounded from below, but such that <img src='http://s0.wp.com/latex.php?latex=f_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_j' title='f_j' class='latex' /> becomes increasingly orthogonal to all phase polynomials of degree &#60;k.  The idea is then to exploit weak compactness to take a &#8220;limit&#8221;, which would be a bounded measurable function <img src='http://s0.wp.com/latex.php?latex=f+%5Cin+L%5E%5Cinfty%28X%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f &#92;in L^&#92;infty(X)' title='f &#92;in L^&#92;infty(X)' class='latex' /> on some measure space X with an <img src='http://s0.wp.com/latex.php?latex=F%5E%5Comega&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^&#92;omega' title='F^&#92;omega' class='latex' /> action.</p>
<p style="text-align:left;">The problem in doing this is that the convergence given by weak compactness is only in the &#8220;weak&#8221; or &#8220;local&#8221; sense; for instance, one can establish convergence results for <em>local</em> statistics, e.g.</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%7B%5CBbb+E%7D_%7Bx+%5Cin+F%5E%7Bn_j%7D%7D+f_j%28x%29+f_j%28x%2Bh%29+f_j%28x%2Bk%29+%5Cto+%5Cint_X+f%28x%29+f%28T_h+x%29+f%28T_k+x%29%5C+d%5Cmu%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle {&#92;Bbb E}_{x &#92;in F^{n_j}} f_j(x) f_j(x+h) f_j(x+k) &#92;to &#92;int_X f(x) f(T_h x) f(T_k x)&#92; d&#92;mu(x)' title='&#92;displaystyle {&#92;Bbb E}_{x &#92;in F^{n_j}} f_j(x) f_j(x+h) f_j(x+k) &#92;to &#92;int_X f(x) f(T_h x) f(T_k x)&#92; d&#92;mu(x)' class='latex' /></p>
<p style="text-align:left;">for any <em>fixed</em> <img src='http://s0.wp.com/latex.php?latex=h%2C+k+%5Cin+F%5E%5Comega&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h, k &#92;in F^&#92;omega' title='h, k &#92;in F^&#92;omega' class='latex' />, but one cannot immediately establish convergence results for <em>global</em> statistics, e.g.</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%7B%5CBbb+E%7D_%7Bx%2Ch%2Ck+%5Cin+F%5E%7Bn_j%7D%7D+f_j%28x%29+f_j%28x%2Bh%29+f_j%28x%2Bk%29+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle {&#92;Bbb E}_{x,h,k &#92;in F^{n_j}} f_j(x) f_j(x+h) f_j(x+k) ' title='&#92;displaystyle {&#92;Bbb E}_{x,h,k &#92;in F^{n_j}} f_j(x) f_j(x+h) f_j(x+k) ' class='latex' /></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cnot+%5Cto+%5Clim_%7BN+%5Cto+%5Cinfty%7D+%5Clim_%7BM+%5Cto+%5Cinfty%7D+%7B%5CBbb+E%7D_%7Bh+%5Cin+F%5EN%2C+k+%5Cin+F%5EM%7D+%5Cint_X+f%28x%29+f%28T_h+x%29+f%28T_k+x%29%5C+d%5Cmu%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;not &#92;to &#92;lim_{N &#92;to &#92;infty} &#92;lim_{M &#92;to &#92;infty} {&#92;Bbb E}_{h &#92;in F^N, k &#92;in F^M} &#92;int_X f(x) f(T_h x) f(T_k x)&#92; d&#92;mu(x)' title='&#92;displaystyle &#92;not &#92;to &#92;lim_{N &#92;to &#92;infty} &#92;lim_{M &#92;to &#92;infty} {&#92;Bbb E}_{h &#92;in F^N, k &#92;in F^M} &#92;int_X f(x) f(T_h x) f(T_k x)&#92; d&#92;mu(x)' class='latex' /> (2)</p>
<p style="text-align:left;">for the limit objects X, f obtained by the traditional Furstenberg correspondence principle.  To get around this, we first &#8220;shuffle&#8221; each of the original spaces <img src='http://s0.wp.com/latex.php?latex=F%5E%7Bn_j%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^{n_j}' title='F^{n_j}' class='latex' /> by a randomly chosen general linear transformation before applying the correspondence principle argument (actually, to be more precise, we pull back <img src='http://s0.wp.com/latex.php?latex=F%5E%7Bn_j%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^{n_j}' title='F^{n_j}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=F%5E%5Cinfty&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F^&#92;infty' title='F^&#92;infty' class='latex' /> using a random linear transformation from the latter to the former, but never mind this technical detail).   The point of doing this shuffling is that it makes local averages approximate global averages with high confidence, for exactly the same reason that random polls with moderately large sample sizes can be accurate even when the overall population is enormous (see <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/10/10/small-samples-and-the-margin-of-error/">my previous post</a> for more discussion on this).  To cut a long story short, this shuffling (or &#8220;statistical sampling&#8221;) enables one to obtain convergence results such as (2).  This serves several useful purposes in our paper.  For instance, it allows us to ensure that the limit system X is ergodic.  But more importantly, it allows us to carry the information that each <img src='http://s0.wp.com/latex.php?latex=f_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_j' title='f_j' class='latex' /> has large Gowers norm into the limit, concluding that f has large Gowers-Host-Kra seminorm.  Thus we are now in a position to apply the hypothesis Theorem 2&#8242; (or Theorem 3&#8242;, if the objective is to prove Theorem 3) and conclude that f correlates with a phase polynomial g.</p>
<p style="text-align:left;">Now we want to pass back from the limit system to the original finitary systems and conclude the desired contradiction.  A new problem emerges: the nature of the convergence of the latter to the former allows one to relate (local) statistics of <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f' title='f' class='latex' /> to (local) statistics of <img src='http://s0.wp.com/latex.php?latex=f_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_j' title='f_j' class='latex' /> (which in turn relate to global statistics of <img src='http://s0.wp.com/latex.php?latex=f_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_j' title='f_j' class='latex' />), but says nothing about g.  Fortunately, the ergodic inverse theorems tell us that g is measurable, which turns out to imply that g is &#8220;nearly local&#8221; in the sense that it can be approximated to arbitrary accuracy as some function g&#8217; of finitely many shifts of f, which is almost a phase polynomial and will continue to correlate with f if the approximation is good enough.  This latter object can be passed back to the original finitary systems, creating functions <img src='http://s0.wp.com/latex.php?latex=g%27_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g&#039;_j' title='g&#039;_j' class='latex' /> correlating with <img src='http://s0.wp.com/latex.php?latex=f_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_j' title='f_j' class='latex' /> which are also almost polynomials.  To finish off the proof, we use the local testability (Theorem 1) to approximate <img src='http://s0.wp.com/latex.php?latex=g%27_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g&#039;_j' title='g&#039;_j' class='latex' /> by a genuine phase polynomial; this means that the <img src='http://s0.wp.com/latex.php?latex=f_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_j' title='f_j' class='latex' /> do, after all, have significant correlation with a phase polynomial, contradicting our hypothesis and establishing the result.</p>
<p style="text-align:left;">It is tempting to try to use this type of argument to handle the inverse conjecture for the Gowers norm on the integers <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}' title='{&#92;Bbb Z}' class='latex' />, especially since ergodic inverse conjectures in this setting are already known thanks to the work of <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2150389">Host-Kra</a> (and also <a class="snap_noshots" href="http://www.ams.org/mathscinet-getitem?mr=2257397">Ziegler</a>).  However, there are two non-trivial obstacles here.  Firstly, one cannot &#8220;shuffle&#8221; the integers (or related objects, such as cyclic groups <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%2FN%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}/N{&#92;Bbb Z}' title='{&#92;Bbb Z}/N{&#92;Bbb Z}' class='latex' />) with the same degree of freedom as one can for a vector space, causing a difficulty in relating local and global statistics.  Secondly, whereas polynomials are known to be local testable, the local testability of the analogous object for the integer case, namely that of nilsequences, is unknown.  These difficulties do not appear to be totally insurmountable, but they would require some additional effort to resolve.</p>
<p style="text-align:left;">
<p style="text-align:left;">
]]></content:encoded>
</item>
<item>
<title><![CDATA[From the Littlewood-Offord problem to the Circular Law: universality of the spectral distribution of random matrices]]></title>
<link>http://terrytao.wordpress.com/2008/10/18/from-the-littlewood-offord-problem-to-the-circular-law-universality-of-the-spectral-distribution-of-random-matrices/</link>
<pubDate>Sat, 18 Oct 2008 20:13:51 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/10/18/from-the-littlewood-offord-problem-to-the-circular-law-universality-of-the-spectral-distribution-of-random-matrices/</guid>
<description><![CDATA[Van Vu and I have just uploaded to the arXiv our survey paper &#8220;From the Littlewood-Offord prob]]></description>
<content:encoded><![CDATA[<p><a class="snap_noshots" href="http://www.math.rutgers.edu/~vanvu/">Van Vu</a> and I have just uploaded to the <a class="snap_noshots" href="http://www.arxiv.org">arXiv</a> our survey paper &#8220;<a class="snap_noshots" href="http://arxiv.org/abs/0810.2994">From the Littlewood-Offord problem to the Circular Law: universality of the spectral distribution of random matrices</a>&#8220;, submitted to <a class="snap_noshots" href="http://www.ams.org/bull/">Bull. Amer. Math. Soc.</a>.  This survey recaps (avoiding most of the technical details) the recent work of ourselves and others that exploits the inverse theory for the Littlewood-Offord problem (which, roughly speaking, amounts to figuring out what types of random walks exhibit concentration at any given point), and how this leads to bounds on condition numbers, least singular values, and resolvents of random matrices; and then how the latter then leads to universality of the empirical spectral distributions (ESDs) of random matrices, and in particular to the circular law for the ESDs for iid random matrices with zero mean and unit variance (see my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/08/02/random-matrices-universality-of-esds-and-the-circular-law/">previous blog post</a> on this topic, or my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/03/18/lewis-lectures/">Lewis lectures</a>).  We conclude by mentioning a few open problems in the subject.</p>
<p>While this subject does unfortunately contain a large amount of technical theory and detail, every so often we find a very elementary observation that simplifies the work required significantly.  One such observation is an identity which we call the <em>negative second moment identity</em>, which I would like to discuss here.    Let A be an <img src='http://s0.wp.com/latex.php?latex=n+%5Ctimes+n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;times n' title='n &#92;times n' class='latex' /> matrix; for simplicity we assume that the entries are real-valued.  Denote the n rows of A by <img src='http://s0.wp.com/latex.php?latex=X_1%2C%5Cldots%2CX_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X_1,&#92;ldots,X_n' title='X_1,&#92;ldots,X_n' class='latex' />, which we view as vectors in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^n' title='{&#92;Bbb R}^n' class='latex' />.  Let <img src='http://s0.wp.com/latex.php?latex=%5Csigma_1%28A%29+%5Cgeq+%5Cldots+%5Cgeq+%5Csigma_n%28A%29+%5Cgeq+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sigma_1(A) &#92;geq &#92;ldots &#92;geq &#92;sigma_n(A) &#92;geq 0' title='&#92;sigma_1(A) &#92;geq &#92;ldots &#92;geq &#92;sigma_n(A) &#92;geq 0' class='latex' /> be the <a href="http://en.wikipedia.org/wiki/Singular_value">singular values</a> of A. In our applications, the vectors <img src='http://s0.wp.com/latex.php?latex=X_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X_j' title='X_j' class='latex' /> are easily described (e.g. they might be randomly distributed on the discrete cube <img src='http://s0.wp.com/latex.php?latex=%5C%7B-1%2C1%5C%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{-1,1&#92;}^n' title='&#92;{-1,1&#92;}^n' class='latex' />), but the distribution of the singular values <img src='http://s0.wp.com/latex.php?latex=%5Csigma_j%28A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sigma_j(A)' title='&#92;sigma_j(A)' class='latex' /> is much more mysterious, and understanding this distribution is a key objective in this entire theory.</p>
<p><!--more--></p>
<p>In general, the relationship between the singular values <img src='http://s0.wp.com/latex.php?latex=%5Csigma_j%28A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sigma_j(A)' title='&#92;sigma_j(A)' class='latex' /> (which encode spectral information about A) and the rows <img src='http://s0.wp.com/latex.php?latex=X_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X_j' title='X_j' class='latex' /> (which encode geometric information about A) are rather complicated.  However, there are some simple identities (or &#8220;trace formulae&#8221;, if you will) that link the two.  For instance, by computing the second moment <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Btr%7D%28A+A%5E%2A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{tr}(A A^*)' title='&#92;hbox{tr}(A A^*)' class='latex' /> in two different ways, one obtains the <em>second moment identity</em></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bj%3D1%7D%5En+%5Csigma_j%28A%29%5E2+%3D+%5Csum_%7Bj%3D1%7D%5En+%26%23124%3BX_j%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{j=1}^n &#92;sigma_j(A)^2 = &#92;sum_{j=1}^n &#124;X_j&#124;^2' title='&#92;sum_{j=1}^n &#92;sigma_j(A)^2 = &#92;sum_{j=1}^n &#124;X_j&#124;^2' class='latex' /> (1)</p>
<p>where <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BX_j%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;X_j&#124;' title='&#124;X_j&#124;' class='latex' /> denotes the length of <img src='http://s0.wp.com/latex.php?latex=X_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X_j' title='X_j' class='latex' />.  This simple identity is already enough to get some crude upper bounds on the &#8220;average&#8221; value of <img src='http://s0.wp.com/latex.php?latex=%5Csigma_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sigma_j' title='&#92;sigma_j' class='latex' />, although it does not preclude the possibility that a lot of singular values are very close to zero, or that a few singular values are extremely large.</p>
<p>The latter scenario (a few very large singular values) can be controlled by higher moment identities, for instance based on the fourth moment <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Btr%7D%28AA%5E%2A+AA%5E%2A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{tr}(AA^* AA^*)' title='&#92;hbox{tr}(AA^* AA^*)' class='latex' />.  But these moments are not good at controlling the former scenario &#8211; when one or more singular values comes close to zero, so that A becomes close to <a href="http://en.wikipedia.org/wiki/Singular_matrix">singular</a> (or more precisely, <a href="http://en.wikipedia.org/wiki/Ill-conditioned">ill-conditioned</a>).  For this, we need a different set of identities.  One such identity comes from computing the unsigned determinant <img src='http://s0.wp.com/latex.php?latex=%26%23124%3B%5Cdet%28A%29%26%23124%3B+%3D+%5Cdet%28AA%5E%2A%29%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;&#92;det(A)&#124; = &#92;det(AA^*)^{1/2}' title='&#124;&#92;det(A)&#124; = &#92;det(AA^*)^{1/2}' class='latex' /> in two different ways, one using the singular values and one using the elementary base-times-height formula for volume of a parallelopiped.  This gives the useful identity</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bj%3D1%7D%5En+%5Csigma_j%28A%29+%3D+%5Cprod_%7Bj%3D1%7D%5En+%5Chbox%7Bdist%7D%28X_j%2C+%5Chbox%7Bspan%7D%28X_1%2C%5Cldots%2CX_%7Bj-1%7D%29%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;prod_{j=1}^n &#92;sigma_j(A) = &#92;prod_{j=1}^n &#92;hbox{dist}(X_j, &#92;hbox{span}(X_1,&#92;ldots,X_{j-1}))' title='&#92;prod_{j=1}^n &#92;sigma_j(A) = &#92;prod_{j=1}^n &#92;hbox{dist}(X_j, &#92;hbox{span}(X_1,&#92;ldots,X_{j-1}))' class='latex' /></p>
<p>or equivalently (assuming A is non-singular)</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bj%3D1%7D%5En+%5Clog+%5Csigma_j%28A%29+%3D+%5Csum_%7Bj%3D1%7D%5En+%5Clog+%5Chbox%7Bdist%7D%28X_j%2C+%5Chbox%7Bspan%7D%28X_1%2C%5Cldots%2CX_%7Bj-1%7D%29%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{j=1}^n &#92;log &#92;sigma_j(A) = &#92;sum_{j=1}^n &#92;log &#92;hbox{dist}(X_j, &#92;hbox{span}(X_1,&#92;ldots,X_{j-1}))' title='&#92;sum_{j=1}^n &#92;log &#92;sigma_j(A) = &#92;sum_{j=1}^n &#92;log &#92;hbox{dist}(X_j, &#92;hbox{span}(X_1,&#92;ldots,X_{j-1}))' class='latex' />. (2)</p>
<p>This identity has some ability to control concentration of singular values near the origin (as the logarithm is large in that region), once one understands the distance between a random vector and a subspace spanned by other random vectors.  This is the philosophy used for instance in <a class="snap_noshots" href="http://front.math.ucdavis.edu/math.CO/0411095">this paper of mine with Van Vu</a>; related ideas also appear in <a class="snap_noshots" href="http://arxiv.org/abs/math/0703503">these</a> <a class="snap_noshots" href="http://arxiv.org/abs/0802.3956">papers</a> <a class="snap_noshots" href="http://arxiv.org/abs/0805.3407">by</a> Rudelson and Vershynin.</p>
<p>The identity (2) is particularly useful for controlling the least singular value <img src='http://s0.wp.com/latex.php?latex=%5Csigma_n%28A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sigma_n(A)' title='&#92;sigma_n(A)' class='latex' />, but is not so useful for controlling other low singular values (e.g. <img src='http://s0.wp.com/latex.php?latex=%5Csigma_%7Bn-k%7D%28A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sigma_{n-k}(A)' title='&#92;sigma_{n-k}(A)' class='latex' /> for some moderately small k).  For this, we found an alternate identity, based on the <em>negative</em> second moment <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Btr%7D%28+%28A%5E%7B-1%7D%29%5E%2A+A%5E%7B-1%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{tr}( (A^{-1})^* A^{-1} )' title='&#92;hbox{tr}( (A^{-1})^* A^{-1} )' class='latex' />.   Observe that the <img src='http://s0.wp.com/latex.php?latex=j%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='j^{th}' title='j^{th}' class='latex' /> column <img src='http://s0.wp.com/latex.php?latex=Y_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Y_j' title='Y_j' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=A%5E%7B-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A^{-1}' title='A^{-1}' class='latex' /> has an inner product of 1 with <img src='http://s0.wp.com/latex.php?latex=X_j&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X_j' title='X_j' class='latex' /> and is orthogonal to all the other rows of A.  A little bit of high school geometry then tells us that the length <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BY_j%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;Y_j&#124;' title='&#124;Y_j&#124;' class='latex' /> of this column is equal to <img src='http://s0.wp.com/latex.php?latex=1+%2F+%5Chbox%7Bdist%7D%28+X_j%2C+%5Chbox%7Bspan%7D%28X_1%2C%5Cldots%2CX_%7Bj-1%7D%2CX_%7Bj%2B1%7D%2C%5Cldots%2CX_n%29+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 / &#92;hbox{dist}( X_j, &#92;hbox{span}(X_1,&#92;ldots,X_{j-1},X_{j+1},&#92;ldots,X_n) )' title='1 / &#92;hbox{dist}( X_j, &#92;hbox{span}(X_1,&#92;ldots,X_{j-1},X_{j+1},&#92;ldots,X_n) )' class='latex' />.  Since <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Btr%7D%28+%28A%5E%7B-1%7D%29%5E%2A+A%5E%7B-1%7D+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{tr}( (A^{-1})^* A^{-1} )' title='&#92;hbox{tr}( (A^{-1})^* A^{-1} )' class='latex' /> is equal to both <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bj%3D1%7D%5En+%5Csigma_j%28A%29%5E%7B-2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{j=1}^n &#92;sigma_j(A)^{-2}' title='&#92;sum_{j=1}^n &#92;sigma_j(A)^{-2}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bj%3D1%7D%5En+%26%23124%3BY_j%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{j=1}^n &#124;Y_j&#124;^2' title='&#92;sum_{j=1}^n &#124;Y_j&#124;^2' class='latex' />, we conclude that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bj%3D1%7D%5En+%5Csigma_j%28A%29%5E%7B-2%7D+%3D+%5Csum_%7Bj%3D1%7D%5En+%5Chbox%7Bdist%7D%28+X_j%2C+%5Chbox%7Bspan%7D%28X_1%2C%5Cldots%2CX_%7Bj-1%7D%2CX_%7Bj%2B1%7D%2C%5Cldots%2CX_n%29+%29%5E%7B-2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sum_{j=1}^n &#92;sigma_j(A)^{-2} = &#92;sum_{j=1}^n &#92;hbox{dist}( X_j, &#92;hbox{span}(X_1,&#92;ldots,X_{j-1},X_{j+1},&#92;ldots,X_n) )^{-2}' title='&#92;sum_{j=1}^n &#92;sigma_j(A)^{-2} = &#92;sum_{j=1}^n &#92;hbox{dist}( X_j, &#92;hbox{span}(X_1,&#92;ldots,X_{j-1},X_{j+1},&#92;ldots,X_n) )^{-2}' class='latex' /> (3)</p>
<p style="text-align:left;">(compare with (1) and (2)).  We found the identity (3) to be useful for preventing too many singular values of A from clustering near the origin, much as (1) prevents too many singular values of A from becoming extremely large, thus saving us from having to deploy more sophisticated and lengthier methods to control the singular values <img src='http://s0.wp.com/latex.php?latex=%5Csigma_j%28A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;sigma_j(A)' title='&#92;sigma_j(A)' class='latex' />.  It may well be that further identities or inequalities of this form may simplify these sorts of arguments further.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[What is a gauge?]]></title>
<link>http://terrytao.wordpress.com/2008/09/27/what-is-a-gauge/</link>
<pubDate>Sat, 27 Sep 2008 19:49:39 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/09/27/what-is-a-gauge/</guid>
<description><![CDATA[&#8220;Gauge theory&#8221; is a term which has connotations of being a fearsomely complicated part o]]></description>
<content:encoded><![CDATA[<p>&#8220;<a href="http://en.wikipedia.org/wiki/Gauge_theory">Gauge theory</a>&#8221; is a term which has connotations of being a fearsomely complicated part of mathematics &#8211; for instance, playing an important role in quantum field theory, general relativity, geometric PDE, and so forth.  But the underlying concept is really quite simple: a <em>gauge</em> is nothing more than a &#8220;coordinate system&#8221; that varies depending on one&#8217;s &#8220;location&#8221; with respect to some &#8220;base space&#8221; or &#8220;parameter space&#8221;, a <em>gauge transform</em> is a change of coordinates applied to each such location, and a <em>gauge theory</em> is a model for some physical or mathematical system to which gauge transforms can be applied (and is typically <em>gauge invariant</em>, in that all physically meaningful quantities are left unchanged (or transform naturally) under gauge transformations).  By <a href="http://en.wikipedia.org/wiki/Gauge_fixing"><em>fixing</em></a> a gauge (thus <em>breaking</em> or <em>spending</em> the gauge symmetry), the model becomes something easier to analyse mathematically, such as a system of partial differential equations (in classical gauge theories) or a perturbative quantum field theory (in quantum gauge theories), though the tractability of the resulting problem can be heavily dependent on the choice of gauge that one fixed.  Deciding exactly how to fix a gauge (or whether one should spend the gauge symmetry at all) is a key question in the analysis of gauge theories, and one that often requires the input of geometric ideas and intuition into that analysis.</p>
<p>I was asked recently to explain what a gauge theory was, and so I will try to do so in this post.  For simplicity, I will focus exclusively on classical gauge theories; quantum gauge theories are the quantization of classical gauge theories and have their own set of conceptual difficulties (coming from quantum field theory) that I will not discuss here. While gauge theories originated from physics, I will not discuss the physical significance of these theories much here, instead focusing just on their mathematical aspects.  My discussion will be informal, as I want to try to convey the geometric intuition rather than the rigorous formalism (which can, of course, be found in any graduate text on differential geometry).</p>
<p><!--more--></p>
<p style="text-align:center;">&#8211; Coordinate systems &#8211;</p>
<p>Before I discuss gauges, I first review the more familiar concept of a <em>coordinate system</em>, which is basically the special case of a gauge when the base space (or parameter space) is trivial.</p>
<p>Classical mathematics, such as practised by the ancient Greeks, could be loosely divided into two disciplines, <em>geometry</em> and <em>number theory</em>, where I use the latter term very broadly, to encompass all sorts of mathematics dealing with any sort of number.  The two disciplines are unified by the concept of a <em>coordinate system</em>, which allows one to convert geometric objects to numeric ones or vice versa.  The most well known example of a coordinate system is the <a href="http://en.wikipedia.org/wiki/Cartesian_coordinate_system">Cartesian coordinate system</a> for the plane (or more generally for a Euclidean space), but this is just one example of many such systems.  For instance:</p>
<ol>
<li>One can convert a length (of, say, an interval) into an (unsigned) real number, or vice versa, once one fixes a unit of length (e.g. the metre or the foot).  In this case, the coordinate system is specified by the choice of length unit.</li>
<li>One can convert a <a href="http://en.wikipedia.org/wiki/Displacement_(vector)">displacement</a> along a line into a (signed) real number, or vice versa, once one fixes a unit of length <em>and</em> an orientation along that line.  In this case, the coordinate system is specified by the length unit together with the choice of orientation.  Alternatively, one can replace the unit of length and the orientation by a unit displacement vector <img src='http://s0.wp.com/latex.php?latex=e&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e' title='e' class='latex' /> along the line.</li>
<li>One can convert a position (i.e. a point) on a line into a real number, or vice versa, once one fixes a unit of length, an orientation along the line, <em>and</em> an origin on that line.  Equivalently, one can pick an origin <img src='http://s0.wp.com/latex.php?latex=O&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O' title='O' class='latex' /> and a unit displacement vector <img src='http://s0.wp.com/latex.php?latex=e&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e' title='e' class='latex' />.  This coordinate system essentially identifies the original line with the standard real line <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}' title='{&#92;Bbb R}' class='latex' />.</li>
<li>One can generalise these systems to higher dimensions.  For instance, one can convert a displacement along a plane into a vector in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^2' title='{&#92;Bbb R}^2' class='latex' />, or vice versa, once one fixes two linearly independent displacement vectors <img src='http://s0.wp.com/latex.php?latex=e_1%2C+e_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e_1, e_2' title='e_1, e_2' class='latex' /> (i.e. a basis) to span that plane; the Cartesian coordinate system is just one special case of this general scheme.  Similarly, one can convert a position on a plane to a vector in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^2' title='{&#92;Bbb R}^2' class='latex' /> once one picks a basis <img src='http://s0.wp.com/latex.php?latex=e_1%2C+e_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e_1, e_2' title='e_1, e_2' class='latex' /> for that plane as well as an origin <img src='http://s0.wp.com/latex.php?latex=O&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O' title='O' class='latex' />, thus identifying that plane with the standard Euclidean plane <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^2' title='{&#92;Bbb R}^2' class='latex' />.  (To put it another way, units of measurement are nothing more than one-dimensional (i.e. scalar) coordinate systems.)</li>
<li>To convert an angle in a plane to a signed number (modulo multiples of <img src='http://s0.wp.com/latex.php?latex=2%5Cpi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2&#92;pi' title='2&#92;pi' class='latex' />), or vice versa, one needs to pick an orientation on the plane (e.g. to decide that anti-clockwise angles are positive).</li>
<li>To convert a <em>direction</em> in a plane to a signed number (again modulo multiples of <img src='http://s0.wp.com/latex.php?latex=2%5Cpi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2&#92;pi' title='2&#92;pi' class='latex' />), or vice versa, one needs to pick an orientation on the plane, as well as a reference direction (e.g. <a href="http://en.wikipedia.org/wiki/True_north">true</a> or <a href="http://en.wikipedia.org/wiki/Magnetic_north_pole">magnetic north</a> is often used in the case of ocean navigation).</li>
<li>Similarly, to convert a position on a circle to a number (modulo multiples of <img src='http://s0.wp.com/latex.php?latex=2%5Cpi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2&#92;pi' title='2&#92;pi' class='latex' />), or vice versa, one needs to pick an orientation on that circle, together with an origin on that circle.  Such a coordinate system then equates the original circle to the standard unit circle <img src='http://s0.wp.com/latex.php?latex=S%5E1+%3A%3D+%5C%7B+z+%5Cin+%7B%5CBbb+C%7D%3A+%26%23124%3Bz%26%23124%3B+%3D+1+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1 := &#92;{ z &#92;in {&#92;Bbb C}: &#124;z&#124; = 1 &#92;}' title='S^1 := &#92;{ z &#92;in {&#92;Bbb C}: &#124;z&#124; = 1 &#92;}' class='latex' /> (with the standard origin <img src='http://s0.wp.com/latex.php?latex=%2B1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='+1' title='+1' class='latex' /> and the standard anticlockwise orientation <img src='http://s0.wp.com/latex.php?latex=%5Ccirclearrowleft&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;circlearrowleft' title='&#92;circlearrowleft' class='latex' />).</li>
<li>To convert a position on a two-dimensional sphere (e.g. the surface of the Earth, as a first approximation) to a point on the standard unit sphere <img src='http://s0.wp.com/latex.php?latex=S%5E2+%3A%3D+%5C%7B+%28x%2Cy%2Cz%29+%5Cin+%7B%5CBbb+R%7D%5E3%3A+x%5E2%2By%5E2%2Bz%5E2+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^2 := &#92;{ (x,y,z) &#92;in {&#92;Bbb R}^3: x^2+y^2+z^2 &#92;}' title='S^2 := &#92;{ (x,y,z) &#92;in {&#92;Bbb R}^3: x^2+y^2+z^2 &#92;}' class='latex' />, one can pick an orientation on that sphere, an &#8220;origin&#8221; (or &#8220;north pole&#8221;) for that sphere, and a &#8220;<a href="http://en.wikipedia.org/wiki/Prime_Meridian">prime meridian</a>&#8221; connecting the north pole to its antipode.  Alternatively, one can view this coordinate system as determining a pair of <a href="http://en.wikipedia.org/wiki/Euler_angles">Euler angles</a> <img src='http://s0.wp.com/latex.php?latex=%5Cphi%2C+%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;phi, &#92;lambda' title='&#92;phi, &#92;lambda' class='latex' /> (or a <a href="http://en.wikipedia.org/wiki/Latitude">latitude</a> and <a href="http://en.wikipedia.org/wiki/Longitude">longitude</a>) to be assigned to every point on one&#8217;s original sphere.</li>
<li>The above examples were all geometric in nature, but one can also consider &#8220;combinatorial&#8221; coordinate systems, which allow one to identify combinatorial objects with numerical ones.  An extremely familiar example of this is <a href="http://en.wikipedia.org/wiki/Enumeration"><em>enumeration</em></a>: one can identify a set A of (say) five elements with the numbers 1,2,3,4,5 simply by choosing an enumeration <img src='http://s0.wp.com/latex.php?latex=a_1%2C+a_2%2C+%5Cldots%2C+a_5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a_1, a_2, &#92;ldots, a_5' title='a_1, a_2, &#92;ldots, a_5' class='latex' /> of the set A.  One can similarly enumerate other combinatorial objects (e.g. <a href="http://en.wikipedia.org/wiki/Graph_(mathematics)">graphs</a>, <a href="http://en.wikipedia.org/wiki/Relation_(mathematics)">relations</a>, <a href="http://en.wikipedia.org/wiki/Tree_(graph_theory)">trees</a>, <a href="http://en.wikipedia.org/wiki/Partially_ordered_set">partial orders</a>, etc.), and indeed this is done all the time in combinatorics.  Similarly for algebraic objects, such as <a href="http://en.wikipedia.org/wiki/Coset">cosets</a> of a subgroup H (or more generally, <a href="http://en.wikipedia.org/wiki/Torsor">torsors</a> of a group G); one can identify such a coset with H itself by designating an element of that coset to be the &#8220;identity&#8221; or &#8220;origin&#8221;.</li>
</ol>
<p>More generally, a coordinate system <img src='http://s0.wp.com/latex.php?latex=%5CPhi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi' title='&#92;Phi' class='latex' /> can be viewed as an isomorphism <img src='http://s0.wp.com/latex.php?latex=%5CPhi%3A+A+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi: A &#92;to G' title='&#92;Phi: A &#92;to G' class='latex' /> between a given geometric (or combinatorial) object A in some class (e.g. a circle), and a standard object G in that class (e.g. the standard unit circle).  (To be pedantic, this is what a <em>global</em> coordinate system is; a <em>local</em> coordinate system, such as the coordinate charts on a manifold, is an isomorphism between a local piece of a geometric or combinatorial object in a class, and a local piece of a standard object in that class.  I will restrict attention to global coordinate systems for this discussion.)</p>
<p>Coordinate systems identify geometric or combinatorial objects with numerical (or standard) ones, but in many cases, there is no natural (or <a href="http://en.wikipedia.org/wiki/Canonical">canonical</a>) choice of this identification; instead, one may be faced with a variety of coordinate systems, all equally valid.  One can of course just fix one such system once and for all, in which case there is no real harm in thinking of the geometric and numeric objects as being equivalent.  If however one plans to change from one system to the next (or to avoid using such systems altogether), then it becomes important to carefully distinguish these two types of objects, to avoid confusion.  For instance, if an interval AB is measured to have a length of 3 yards, then it is OK to write <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B%3D3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124;=3' title='&#124;AB&#124;=3' class='latex' /> (identifying the geometric concept of length with the numeric concept of a positive real number) so long as you plan to stick to having the yard as the unit of length for the rest of one&#8217;s analysis.  But if one was also planning to use, say, feet, as a unit of length also, then to avoid confusing statements such as &#8220;<img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B%3D3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124;=3' title='&#124;AB&#124;=3' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B%3D9&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124;=9' title='&#124;AB&#124;=9' class='latex' />&#8220;,  one should specify the coordinate systems explicitly, e.g. &#8220;<img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B+%3D+3+%5Chbox%7B+yards%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124; = 3 &#92;hbox{ yards}' title='&#124;AB&#124; = 3 &#92;hbox{ yards}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B+%3D+9+%5Chbox%7B+feet%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124; = 9 &#92;hbox{ feet}' title='&#124;AB&#124; = 9 &#92;hbox{ feet}' class='latex' />&#8220;.  Similarly, identifying a point P in a plane with its coordinates (e.g. <img src='http://s0.wp.com/latex.php?latex=P+%3D+%284%2C3%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P = (4,3)' title='P = (4,3)' class='latex' />) is safe as long as one intends to only use a single coordinate system throughout; but if one intends to change coordinates at some point (or to switch to a coordinate-free perspective) then one should be more careful, e.g. writing <img src='http://s0.wp.com/latex.php?latex=P+%3D+4+e_1+%2B+3+e_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P = 4 e_1 + 3 e_2' title='P = 4 e_1 + 3 e_2' class='latex' />, or even <img src='http://s0.wp.com/latex.php?latex=P+%3D+O+%2B+4+e_1+%2B+3+e_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P = O + 4 e_1 + 3 e_2' title='P = O + 4 e_1 + 3 e_2' class='latex' />, if the origin O and basis vectors <img src='http://s0.wp.com/latex.php?latex=e_1%2C+e_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e_1, e_2' title='e_1, e_2' class='latex' /> of one&#8217;s coordinate systems might be subject to future change.</p>
<p>As mentioned above, it is possible to in many cases to dispense with coordinates altogether.  For instance, one can view the length <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124;' title='&#124;AB&#124;' class='latex' /> of a line segment AB not as a number (which requires one to select a unit of length), but more abstractly as the equivalence class of all line segments CD that are <a href="http://en.wikipedia.org/wiki/Congruence_(geometry)">congruent</a> to AB.  With this perspective, <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124;' title='&#124;AB&#124;' class='latex' /> no longer lies in the standard <a href="http://en.wikipedia.org/wiki/Semigroup">semigroup</a> <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%2B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^+' title='{&#92;Bbb R}^+' class='latex' />, but in a more abstract semigroup <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+L%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal L}' title='{&#92;mathcal L}' class='latex' /> (the space of line segments quotiented by congruence), with addition now defined geometrically (by concatenation of intervals) rather than numerically.  A unit of length can now be viewed as just one of many different isomorphisms <img src='http://s0.wp.com/latex.php?latex=%5CPhi%3A+%7B%5Cmathcal+L%7D+%5Cto+%7B%5CBbb+R%7D%5E%2B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi: {&#92;mathcal L} &#92;to {&#92;Bbb R}^+' title='&#92;Phi: {&#92;mathcal L} &#92;to {&#92;Bbb R}^+' class='latex' /> between <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+L%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal L}' title='{&#92;mathcal L}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%2B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^+' title='{&#92;Bbb R}^+' class='latex' />, but one can abandon the use of such units and just work with <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+L%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal L}' title='{&#92;mathcal L}' class='latex' /> directly.  Many statements in Euclidean geometry involving length can be phrased in this manner.  For instance, if B lies in AC, then the statement <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAC%26%23124%3B%3D%26%23124%3BAB%26%23124%3B%2B%26%23124%3BBC%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AC&#124;=&#124;AB&#124;+&#124;BC&#124;' title='&#124;AC&#124;=&#124;AB&#124;+&#124;BC&#124;' class='latex' /> can be stated in <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+L%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal L}' title='{&#92;mathcal L}' class='latex' />, and does not require any units to convert <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+L%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal L}' title='{&#92;mathcal L}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+R%7D%5E%2B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal R}^+' title='{&#92;mathcal R}^+' class='latex' />; with a bit more work, one can also make sense of such statements as <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAC%26%23124%3B%5E2+%3D+%26%23124%3BAB%26%23124%3B%5E2+%2B+%26%23124%3BBC%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AC&#124;^2 = &#124;AB&#124;^2 + &#124;BC&#124;^2' title='&#124;AC&#124;^2 = &#124;AB&#124;^2 + &#124;BC&#124;^2' class='latex' /> for a right-angled triangle ABC (i.e. <a href="http://en.wikipedia.org/wiki/Pythagorean_theorem">Pythagoras&#8217; theorem</a>) while avoiding units, by defining a symmetric bilinear product operation <img src='http://s0.wp.com/latex.php?latex=%5Ctimes%3A+%7B%5Cmathcal+L%7D+%5Ctimes+%7B%5Cmathcal+L%7D+%5Cto+%7B%5Cmathcal+A%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;times: {&#92;mathcal L} &#92;times {&#92;mathcal L} &#92;to {&#92;mathcal A}' title='&#92;times: {&#92;mathcal L} &#92;times {&#92;mathcal L} &#92;to {&#92;mathcal A}' class='latex' /> from the abstract semigroup <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+L%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal L}' title='{&#92;mathcal L}' class='latex' /> of lengths to the abstract semigroup <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+A%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal A}' title='{&#92;mathcal A}' class='latex' /> of areas.  (Indeed, this is basically how the ancient Greeks, who did not quite possess the modern <a href="http://en.wikipedia.org/wiki/Real_number">real number system</a> <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}' title='{&#92;Bbb R}' class='latex' />, viewed geometry, though of course without the assistance of such modern terminology as &#8220;semigroup&#8221; or &#8220;bilinear&#8221;.)</p>
<p>The above abstract <em>coordinate-free perspective</em> is equivalent to a more concrete <em>coordinate-invariant perspective</em>, in which we do allow the use of coordinates to convert all geometric quantities to numeric ones, but insist that every statement that we write down is invariant under changes of coordinates.  For instance, if we shrink our chosen unit of length by a factor <img src='http://s0.wp.com/latex.php?latex=%5Clambda+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda &gt; 0' title='&#92;lambda &gt; 0' class='latex' />, then the numerical length of every interval increases by a factor of <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' />, e.g. <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B+%5Cmapsto+%5Clambda+%26%23124%3BAB%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124; &#92;mapsto &#92;lambda &#124;AB&#124;' title='&#124;AB&#124; &#92;mapsto &#92;lambda &#124;AB&#124;' class='latex' />.  The coordinate-invariant approach to length measurement then treats lengths such as <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124;' title='&#124;AB&#124;' class='latex' /> as numbers, but requires all statements involving such lengths to be invariant under the above scaling symmetry.  For instance, a statement such as <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAC%26%23124%3B%5E2+%3D+%26%23124%3BAB%26%23124%3B%5E2+%2B+%26%23124%3BBC%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AC&#124;^2 = &#124;AB&#124;^2 + &#124;BC&#124;^2' title='&#124;AC&#124;^2 = &#124;AB&#124;^2 + &#124;BC&#124;^2' class='latex' /> is legitimate under this perspective, but a statement such as <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B+%3D+%26%23124%3BBC%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124; = &#124;BC&#124;^2' title='&#124;AB&#124; = &#124;BC&#124;^2' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B+%3D+3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124; = 3' title='&#124;AB&#124; = 3' class='latex' /> is not.  [In other words, co-ordinate invariance here is the same thing as being dimensionally consistent.  Indeed, <a href="http://en.wikipedia.org/wiki/Dimensional_analysis">dimensional analysis</a> is nothing more than the analysis of the scaling symmetries in one's coordinate systems.]  One can retain this coordinate-invariance symmetry throughout one&#8217;s arguments; or one can, at some point, choose to <em>spend</em> (or <em>break</em>) this coordinate invariance by selecting (or <em>fixing</em>) the coordinate system (which, in this case, means selecting a unit length).  The advantage in spending such a symmetry is that one can often normalise one or more quantities to equal a particularly nice value; for instance, if a length <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124;' title='&#124;AB&#124;' class='latex' /> is appearing everywhere in one&#8217;s arguments, and one has carefully retained coordinate-invariance up until some key point, then it can be convenient to spend this invariance to normalise <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BAB%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;AB&#124;' title='&#124;AB&#124;' class='latex' /> to equal 1.  (In this case, one only has a one-dimensional family of symmetries, and so can only normalise one quantity at a time; but when one&#8217;s symmetry group is larger, one can often normalise many more quantities at once; as a rule of thumb, one can normalise one quantity for each degree of freedom in the symmetry group.)  Conversely, if one has already spent the coordinate invariance, one can often buy it back by converting all the facts, hypotheses, and desired conclusions one currently possesses in the situation back to a coordinate-invariant formulation.  Thus one could imagine performing one normalisation to do one set of calculations, then undoing that normalisation to return to a coordinate-free perspective, doing some coordinate-free manipulations, and then performing a different normalisation to work on another part of the problem, and so forth.  (For instance, in Euclidean geometry problems, it is often convenient to temporarily assign one key point to be the origin (thus spending translation invariance symmetry), then another, then switch back to a translation-invariant perspective, and so forth.  As long as one is correctly accounting for what symmetries are being spent and bought at any given time, this can be a very powerful way of simplifying one&#8217;s calculations.)</p>
<p>Given a coordinate system <img src='http://s0.wp.com/latex.php?latex=%5CPhi%3A+A+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi: A &#92;to G' title='&#92;Phi: A &#92;to G' class='latex' /> that identifies some geometric object A with a standard object G, and some isomorphism <img src='http://s0.wp.com/latex.php?latex=%5CPsi%3A+G+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Psi: G &#92;to G' title='&#92;Psi: G &#92;to G' class='latex' /> of that standard object, we can obtain a new coordinate system <img src='http://s0.wp.com/latex.php?latex=%5CPsi+%5Ccirc+%5CPhi%3A+A+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Psi &#92;circ &#92;Phi: A &#92;to G' title='&#92;Psi &#92;circ &#92;Phi: A &#92;to G' class='latex' /> of A by composing the two isomorphisms.  [I will be vague on what "<a href="http://en.wikipedia.org/wiki/Isomorphism">isomorphism</a>" means; one can formalise the concept using the language of <a href="http://en.wikipedia.org/wiki/Category_theory">category theory</a>.] Conversely, every other coordinate system <img src='http://s0.wp.com/latex.php?latex=%5CPhi%27%3A+A+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi&#039;: A &#92;to G' title='&#92;Phi&#039;: A &#92;to G' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A' title='A' class='latex' /> arises in this manner.  Thus, the space of coordinate systems on A is (non-canonically) identifiable with the isomorphism group <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7BIsom%7D%28G%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{Isom}(G)' title='&#92;hbox{Isom}(G)' class='latex' /> of G.  This isomorphism group is called the <a href="http://en.wikipedia.org/wiki/Structure_group#Structure_groups_and_transition_functions"><em>structure group</em></a> (or <a href="http://en.wikipedia.org/wiki/Gauge_group"><em>gauge group</em></a>) of the class of geometric objects.  For example, the structure group for lengths is <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%2B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^+' title='{&#92;Bbb R}^+' class='latex' />; the structure group for angles is <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%2F2%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}/2{&#92;Bbb Z}' title='{&#92;Bbb Z}/2{&#92;Bbb Z}' class='latex' />; the structure group for lines is the <a href="http://en.wikipedia.org/wiki/Affine_group">affine group</a> <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7BAff%7D%28%7B%5CBbb+R%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{Aff}({&#92;Bbb R})' title='&#92;hbox{Aff}({&#92;Bbb R})' class='latex' />; the structure group for <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n' title='n' class='latex' />-dimensional Euclidean geometry is the <a href="http://en.wikipedia.org/wiki/Euclidean_group">Euclidean group</a> <img src='http://s0.wp.com/latex.php?latex=E%28n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='E(n)' title='E(n)' class='latex' />; the structure group for (oriented) 2-spheres is the (special) <a href="http://en.wikipedia.org/wiki/Orthogonal_group">orthogonal group</a> <img src='http://s0.wp.com/latex.php?latex=SO%283%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SO(3)' title='SO(3)' class='latex' />; and so forth.  (Indeed, one can basically describe each of the classical geometries (<a href="http://en.wikipedia.org/wiki/Euclidean_geometry">Euclidean</a>, <a href="http://en.wikipedia.org/wiki/Affine_geometry">affine</a>, <a href="http://en.wikipedia.org/wiki/Projective_geometry">projective</a>, <a href="http://en.wikipedia.org/wiki/Spherical_geometry">spherical</a>, <a href="http://en.wikipedia.org/wiki/Hyperbolic_geometry">hyperbolic</a>, <a href="http://en.wikipedia.org/wiki/Minkowski_space">Minkowski</a>, etc.) as a <a href="http://en.wikipedia.org/wiki/Homogeneous_space">homogeneous space</a> for its structure group, as per the <a href="http://en.wikipedia.org/wiki/Erlangen_program">Erlangen program</a>.)</p>
<p style="text-align:center;">&#8211; Gauges &#8211;</p>
<p>In our discussion of coordinate systems, we focused on a single geometric (or combinatorial) object <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A' title='A' class='latex' />: a single line, a single circle, a single set, etc.  We then used a single coordinate system to identify that object with a standard representative of such an object.</p>
<p>Now let us consider the more general situation in which one has a <em>family</em> (or <a href="http://en.wikipedia.org/wiki/Fiber_bundle"><em>fibre bundle</em></a>) <img src='http://s0.wp.com/latex.php?latex=%28A_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_x)_{x &#92;in X}' title='(A_x)_{x &#92;in X}' class='latex' /> of geometric (or combinatorial) objects (or <em>fibres</em>) <img src='http://s0.wp.com/latex.php?latex=A_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x' title='A_x' class='latex' />: a family of lines (i.e. a line bundle), a family of circles (i.e. a circle bundle), a family of sets, etc.  This family is parameterised by some <em>parameter set</em> or <em>base point</em> x, which ranges in some <em>parameter space</em> or <em>base space</em> X.  In many cases one also requires some topological or differentiable compatibility between the various fibres; for instance, continuous (or smooth) variations of the base point should lead to continuous (or smooth) variations in the fibre.  For sake of discussion, however, let us gloss over these compatibility conditions.</p>
<p>In many cases, each individual fibre <img src='http://s0.wp.com/latex.php?latex=A_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x' title='A_x' class='latex' /> in a bundle <img src='http://s0.wp.com/latex.php?latex=%28A_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_x)_{x &#92;in X}' title='(A_x)_{x &#92;in X}' class='latex' />, being a geometric object of a certain class, can be identified with a standard object <img src='http://s0.wp.com/latex.php?latex=G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G' title='G' class='latex' /> in that class, by means of a separate coordinate system <img src='http://s0.wp.com/latex.php?latex=%5CPhi_x%3A+A_x+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi_x: A_x &#92;to G' title='&#92;Phi_x: A_x &#92;to G' class='latex' /> for each base point x.  The entire collection <img src='http://s0.wp.com/latex.php?latex=%5CPhi+%3D+%28%5CPhi_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi = (&#92;Phi_x)_{x &#92;in X}' title='&#92;Phi = (&#92;Phi_x)_{x &#92;in X}' class='latex' /> is then referred to as a (global) <em>gauge</em> or <em><a href="http://en.wikipedia.org/wiki/Trivialization">trivialisation</a></em> for this bundle (provided that it is compatible with whatever topological or differentiable structures one has placed on the bundle, but never mind that for now).  Equivalently, a gauge is a <a href="http://en.wikipedia.org/wiki/Bundle_isomorphism">bundle isomorphism</a> <img src='http://s0.wp.com/latex.php?latex=%5CPhi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi' title='&#92;Phi' class='latex' /> from the original bundle <img src='http://s0.wp.com/latex.php?latex=%28A_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_x)_{x &#92;in X}' title='(A_x)_{x &#92;in X}' class='latex' /> to the <em>trivial bundle</em> <img src='http://s0.wp.com/latex.php?latex=%28G%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(G)_{x &#92;in X}' title='(G)_{x &#92;in X}' class='latex' />, in which every fibre is the standard geometric object G.  (There are also <em>local</em> gauges, which only trivialise a portion of the bundle, but let&#8217;s ignore this distinction for now.)</p>
<p>Let&#8217;s give three concrete examples of bundles and gauges; one from differential geometry, one from dynamical systems, and one from combinatorics.</p>
<p><strong>Example 1: the circle bundle of the sphere.</strong> Recall from the previous section that the space of directions in a plane (which can be viewed as the circle of unit vectors) can be identified with the standard circle <img src='http://s0.wp.com/latex.php?latex=S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1' title='S^1' class='latex' /> after picking an orientation and a reference direction.  Now let us work not on the plane, but on a sphere, and specifically, on the surface X of the earth.  At each point x on this surface, there is a circle <img src='http://s0.wp.com/latex.php?latex=S_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_x' title='S_x' class='latex' /> of directions that one can travel along the sphere from x; the collection <img src='http://s0.wp.com/latex.php?latex=SX+%3A%3D+%28S_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SX := (S_x)_{x &#92;in X}' title='SX := (S_x)_{x &#92;in X}' class='latex' /> of all such circles is then a circle bundle with base space X (known as <em>the</em> circle bundle; it could also be viewed as the sphere bundle, cosphere bundle, or orthonormal frame bundle of X). The structure group of this bundle is the circle group <img src='http://s0.wp.com/latex.php?latex=U%281%29+%5Cequiv+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U(1) &#92;equiv S^1' title='U(1) &#92;equiv S^1' class='latex' /> if one preserves orientation, or the <a href="http://en.wikipedia.org/wiki/Semidirect_product">semi-direct product</a> <img src='http://s0.wp.com/latex.php?latex=S%5E1+%5Crtimes+%7B%5CBbb+Z%7D%2F2%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1 &#92;rtimes {&#92;Bbb Z}/2{&#92;Bbb Z}' title='S^1 &#92;rtimes {&#92;Bbb Z}/2{&#92;Bbb Z}' class='latex' /> otherwise.</p>
<p>Now suppose, at every point x on the earth X, the wind is blowing in some direction <img src='http://s0.wp.com/latex.php?latex=w_x+%5Cin+S_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w_x &#92;in S_x' title='w_x &#92;in S_x' class='latex' />.  (This is not actually possible globally, thanks to the <a href="http://en.wikipedia.org/wiki/Hairy_ball_theorem">hairy ball theorem</a>, but let&#8217;s ignore this technicality for now.)  Thus wind direction can be thought of as a collection <img src='http://s0.wp.com/latex.php?latex=w+%3D+%28w_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w = (w_x)_{x &#92;in X}' title='w = (w_x)_{x &#92;in X}' class='latex' /> of representatives from the fibres of the fibre bundle <img src='http://s0.wp.com/latex.php?latex=%28S_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(S_x)_{x &#92;in X}' title='(S_x)_{x &#92;in X}' class='latex' />; such a collection is known as a <a href="http://en.wikipedia.org/wiki/Section_(fiber_bundle)"><em>section</em></a> of the fibre bundle (it is to bundles as the concept of a <a href="http://en.wikipedia.org/wiki/Graph_of_a_function">graph</a> <img src='http://s0.wp.com/latex.php?latex=%5C%7B+%28x%2C+f%28x%29%29%3A+x+%5Cin+X+%5C%7D+%5Csubset+X+%5Ctimes+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ (x, f(x)): x &#92;in X &#92;} &#92;subset X &#92;times G' title='&#92;{ (x, f(x)): x &#92;in X &#92;} &#92;subset X &#92;times G' class='latex' /> of a function <img src='http://s0.wp.com/latex.php?latex=f%3A+X+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: X &#92;to G' title='f: X &#92;to G' class='latex' /> is to the trivial bundle <img src='http://s0.wp.com/latex.php?latex=%28G%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(G)_{x &#92;in X}' title='(G)_{x &#92;in X}' class='latex' />).</p>
<p>At present, this section has not been represented in terms of numbers; instead, the wind direction <img src='http://s0.wp.com/latex.php?latex=w+%28w_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w (w_x)_{x &#92;in X}' title='w (w_x)_{x &#92;in X}' class='latex' /> is a collection of points on various different circles in the circle bundle SX.  But one can convert this section w into a collection of numbers (and more specifically, a function <img src='http://s0.wp.com/latex.php?latex=u%3A+X+%5Cto+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u: X &#92;to S^1' title='u: X &#92;to S^1' class='latex' /> from X to <img src='http://s0.wp.com/latex.php?latex=S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1' title='S^1' class='latex' />) by choosing a gauge for this circle bundle &#8211; in other words, by selecting an orientation <img src='http://s0.wp.com/latex.php?latex=%5Cepsilon_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;epsilon_x' title='&#92;epsilon_x' class='latex' /> and a reference direction <img src='http://s0.wp.com/latex.php?latex=N_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N_x' title='N_x' class='latex' /> for each point x on the surface of the Earth X.  For instance, one can pick the anticlockwise orientation <img src='http://s0.wp.com/latex.php?latex=%5Ccirclearrowleft&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;circlearrowleft' title='&#92;circlearrowleft' class='latex' /> and true north for every point x (ignore for now the problem that this is not defined at the north and south poles, and so is merely a local gauge rather than a global one), and then each wind direction <img src='http://s0.wp.com/latex.php?latex=w_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w_x' title='w_x' class='latex' /> can now be identified with a unit complex number <img src='http://s0.wp.com/latex.php?latex=u%28x%29+%5Cin+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u(x) &#92;in S^1' title='u(x) &#92;in S^1' class='latex' /> (e.g. <img src='http://s0.wp.com/latex.php?latex=e%5E%7Bi%5Cpi%2F4%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e^{i&#92;pi/4}' title='e^{i&#92;pi/4}' class='latex' /> if the wind is blowing in the northwest direction at x).  Now that one has a numerical function u to play with, rather than a geometric object w, one can now use analytical tools (e.g. differentiation, integration, Fourier transforms, etc.) to analyse the wind direction if one desires.  But one should be aware that this function reflects the choice of gauge as well as the original object of study.  If one changes the gauge (e.g. by using <a href="http://en.wikipedia.org/wiki/Magnetic_north_pole">magnetic north</a> instead of true north), then the function u changes, even though the wind direction w is still the same.  If one does not want to spend the U(1) gauge symmetry, one would have to take care that all operations one performs on these functions are gauge-invariant; unfortunately, this restrictive requirement eliminates wide swathes of analytic tools (in particular, integration and the Fourier transform) and so one is often forced to break the gauge symmetry in order to use analysis.  The challenge is then to select the gauge that maximises the effectiveness of analytic methods.  <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 2: circle extensions of a dynamical system.</strong> Recall (see e.g. my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/01/08/254a-lecture-1-overview/">lecture notes</a>) that a dynamical system is a pair X = (X,T), where X is a space and <img src='http://s0.wp.com/latex.php?latex=T%3A+X+%5Cto+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T: X &#92;to X' title='T: X &#92;to X' class='latex' /> is an invertible map.  (One can also place additional topological or measure-theoretic structures on this system, as is done in those notes, but we will ignore these structures for this discussion.)  Given such a system, and given a <em>cocycle</em> <img src='http://s0.wp.com/latex.php?latex=%5Crho%3A+X+%5Cto+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho: X &#92;to S^1' title='&#92;rho: X &#92;to S^1' class='latex' /> (which, in this context, is simply a function from X to the unit circle), we can define the <em>skew product</em> <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes_%5Crho+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times_&#92;rho S^1' title='X &#92;times_&#92;rho S^1' class='latex' /> of X and the unit circle <img src='http://s0.wp.com/latex.php?latex=S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1' title='S^1' class='latex' />, twisted by the cocycle <img src='http://s0.wp.com/latex.php?latex=%5Crho&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho' title='&#92;rho' class='latex' />, to be the Cartesian product <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes+S%5E1+%3A%3D+%5C%7B+%28x%2Cu%29%3A+x+%5Cin+X%2C+u+%5Cin+S%5E1+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times S^1 := &#92;{ (x,u): x &#92;in X, u &#92;in S^1 &#92;}' title='X &#92;times S^1 := &#92;{ (x,u): x &#92;in X, u &#92;in S^1 &#92;}' class='latex' /> with the shift <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+T%3A+%28x%2Cu%29+%5Cmapsto+%28Tx%2C+%5Crho%28x%29+u%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde T: (x,u) &#92;mapsto (Tx, &#92;rho(x) u)' title='&#92;tilde T: (x,u) &#92;mapsto (Tx, &#92;rho(x) u)' class='latex' />; this is easily seen to be another dynamical system.  (If one wishes to have a topological or measure-theoretic dynamical system, then <img src='http://s0.wp.com/latex.php?latex=%5Crho&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho' title='&#92;rho' class='latex' /> will have to be continuous or measurable here, but let us ignore such issues for this discussion.)  Observe that there is a <a href="http://en.wikipedia.org/wiki/Free_action">free action</a> <img src='http://s0.wp.com/latex.php?latex=%28S_v%3A+%28x%2Cu%29+%5Cmapsto+%28x%2Cvu%29%29_%7Bv+%5Cin+S%5E1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(S_v: (x,u) &#92;mapsto (x,vu))_{v &#92;in S^1}' title='(S_v: (x,u) &#92;mapsto (x,vu))_{v &#92;in S^1}' class='latex' /> of the circle group <img src='http://s0.wp.com/latex.php?latex=S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1' title='S^1' class='latex' /> on the skew product <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes_%5Crho+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times_&#92;rho S^1' title='X &#92;times_&#92;rho S^1' class='latex' /> that commutes with the shift <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+T&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde T' title='&#92;tilde T' class='latex' />; the <a href="http://en.wikipedia.org/wiki/Quotient_space">quotient space</a> <img src='http://s0.wp.com/latex.php?latex=%28X+%5Ctimes_%5Crho+S%5E1%29%2FS%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(X &#92;times_&#92;rho S^1)/S^1' title='(X &#92;times_&#92;rho S^1)/S^1' class='latex' /> of this action is isomorphic to X, thus leading to a <em>factor map</em> <img src='http://s0.wp.com/latex.php?latex=%5Cpi%3A+X+%5Ctimes_%5Crho+S%5E1+%5Cto+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;pi: X &#92;times_&#92;rho S^1 &#92;to X' title='&#92;pi: X &#92;times_&#92;rho S^1 &#92;to X' class='latex' />, which is of course just the projection map <img src='http://s0.wp.com/latex.php?latex=%5Cpi%3A+%28x%2Cu%29+%5Cmapsto+x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;pi: (x,u) &#92;mapsto x' title='&#92;pi: (x,u) &#92;mapsto x' class='latex' />.  (An example is provided by the <em>skew shift system</em>, described in my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/01/10/254a-lecture-2-three-categories-of-dynamical-systems/">lecture notes</a>.)</p>
<p>Conversely, suppose that one had a dynamical system <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+X+%3D+%28%5Ctilde+X%2C+%5Ctilde+T%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde X = (&#92;tilde X, &#92;tilde T)' title='&#92;tilde X = (&#92;tilde X, &#92;tilde T)' class='latex' /> which had a free <img src='http://s0.wp.com/latex.php?latex=S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1' title='S^1' class='latex' /> action <img src='http://s0.wp.com/latex.php?latex=%28S_v%3A+%5Ctilde+X+%5Cto+%5Ctilde+X%29_%7Bv+%5Cin+S%5E1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(S_v: &#92;tilde X &#92;to &#92;tilde X)_{v &#92;in S^1}' title='(S_v: &#92;tilde X &#92;to &#92;tilde X)_{v &#92;in S^1}' class='latex' /> commuting with the shift <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+T&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde T' title='&#92;tilde T' class='latex' />.  If we set <img src='http://s0.wp.com/latex.php?latex=X+%3A%3D+%5Ctilde+X%2FS%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X := &#92;tilde X/S^1' title='X := &#92;tilde X/S^1' class='latex' /> to be the quotient space, we thus have a factor map <img src='http://s0.wp.com/latex.php?latex=%5Cpi%3A+%5Ctilde+X+%5Cto+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;pi: &#92;tilde X &#92;to X' title='&#92;pi: &#92;tilde X &#92;to X' class='latex' />, whose level sets <img src='http://s0.wp.com/latex.php?latex=%5Cpi%5E%7B-1%7D%28%5C%7Bx%5C%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;pi^{-1}(&#92;{x&#92;})' title='&#92;pi^{-1}(&#92;{x&#92;})' class='latex' /> are all isomorphic to the circle <img src='http://s0.wp.com/latex.php?latex=S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1' title='S^1' class='latex' />; we call <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde X' title='&#92;tilde X' class='latex' /> a <em>circle extension</em> of the dynamical system X.  We can thus view <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde X' title='&#92;tilde X' class='latex' /> as a <em>circle bundle</em> <img src='http://s0.wp.com/latex.php?latex=%28%5Cpi%5E%7B-1%7D%28%5C%7Bx%5C%7D%29%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;pi^{-1}(&#92;{x&#92;}))_{x &#92;in X}' title='(&#92;pi^{-1}(&#92;{x&#92;}))_{x &#92;in X}' class='latex' /> with base space X, thus the level sets <img src='http://s0.wp.com/latex.php?latex=%5Cpi%5E%7B-1%7D%28%5C%7Bx%5C%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;pi^{-1}(&#92;{x&#92;})' title='&#92;pi^{-1}(&#92;{x&#92;})' class='latex' /> are now the fibres of the bundle, and the structure group is <img src='http://s0.wp.com/latex.php?latex=S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1' title='S^1' class='latex' />.  If one picks a <em>gauge </em>for this bundle, by choosing a reference point <img src='http://s0.wp.com/latex.php?latex=p_x+%5Cin+%5Cpi%5E%7B-1%7D%28%5C%7Bx%5C%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p_x &#92;in &#92;pi^{-1}(&#92;{x&#92;})' title='p_x &#92;in &#92;pi^{-1}(&#92;{x&#92;})' class='latex' /> in the fibre for each base point x (thus in this context a gauge is the same thing as a <a href="http://en.wikipedia.org/wiki/Section_(fiber_bundle)"><em>section</em></a> <img src='http://s0.wp.com/latex.php?latex=p+%3D+%28p_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p = (p_x)_{x &#92;in X}' title='p = (p_x)_{x &#92;in X}' class='latex' />; this is basically because this bundle is a <a href="http://en.wikipedia.org/wiki/Principal_bundle">principal bundle</a>), then one can identify <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde X' title='&#92;tilde X' class='latex' /> with a skew product <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes_%5Crho+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times_&#92;rho S^1' title='X &#92;times_&#92;rho S^1' class='latex' /> by identifying the point <img src='http://s0.wp.com/latex.php?latex=S_v+p_x+%5Cin+%5Ctilde+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_v p_x &#92;in &#92;tilde X' title='S_v p_x &#92;in &#92;tilde X' class='latex' /> with the point <img src='http://s0.wp.com/latex.php?latex=%28x%2Cv%29+%5Cin+X+%5Ctimes_%5Crho+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x,v) &#92;in X &#92;times_&#92;rho S^1' title='(x,v) &#92;in X &#92;times_&#92;rho S^1' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+X%2C+v+%5Cin+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;in X, v &#92;in S^1' title='x &#92;in X, v &#92;in S^1' class='latex' />, and letting <img src='http://s0.wp.com/latex.php?latex=%5Crho&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho' title='&#92;rho' class='latex' /> be the cocycle defined by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=S_%7B%5Crho%28x%29%7D+p_%7BTx%7D+%3D+%5Ctilde+T+p_x.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_{&#92;rho(x)} p_{Tx} = &#92;tilde T p_x.' title='S_{&#92;rho(x)} p_{Tx} = &#92;tilde T p_x.' class='latex' /></p>
<p>One can check that this is indeed an isomorphism of dynamical systems; if all the various objects here are continuous (resp. measurable), then one also has an isomorphism of topological dynamical systems (resp. measure-preserving systems).  Thus we see that gauges allow us to write circle extensions as skew products.  However, more than one gauge is available for any given circle extension; two gauges <img src='http://s0.wp.com/latex.php?latex=%28p_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(p_x)_{x &#92;in X}' title='(p_x)_{x &#92;in X}' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%28p%27_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(p&#039;_x)_{x &#92;in X}' title='(p&#039;_x)_{x &#92;in X}' class='latex' /> will give rise to two skew products <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes_%5Crho+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times_&#92;rho S^1' title='X &#92;times_&#92;rho S^1' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes_%7B%5Crho%27%7D+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times_{&#92;rho&#039;} S^1' title='X &#92;times_{&#92;rho&#039;} S^1' class='latex' /> which are isomorphic but not identical.  Indeed, if we let <img src='http://s0.wp.com/latex.php?latex=v%3A+X+%5Cto+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v: X &#92;to S^1' title='v: X &#92;to S^1' class='latex' /> be a rotation map that sends <img src='http://s0.wp.com/latex.php?latex=p_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p_x' title='p_x' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=p%27_%7Bx%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p&#039;_{x}' title='p&#039;_{x}' class='latex' />, thus <img src='http://s0.wp.com/latex.php?latex=p%27_%7Bx%7D+%3D+S_%7Bv%28x%29%7D+p_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p&#039;_{x} = S_{v(x)} p_x' title='p&#039;_{x} = S_{v(x)} p_x' class='latex' />, then we see that the two cocycles <img src='http://s0.wp.com/latex.php?latex=%5Crho%27&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho&#039;' title='&#92;rho&#039;' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Crho&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho' title='&#92;rho' class='latex' /> are related by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Crho%27%28x%29+%3D+v%28Tx%29%5E%7B-1%7D+%5Crho%28x%29+v%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho&#039;(x) = v(Tx)^{-1} &#92;rho(x) v(x)' title='&#92;rho&#039;(x) = v(Tx)^{-1} &#92;rho(x) v(x)' class='latex' />.  (1)</p>
<p>Two cocycles that obey the above relation are called <em>cohomologous</em>; their skew products are isomorphic to each other.  An important general question in dynamical systems is to understand when two given cocycles are in fact cohomologous, for instance by introducing non-trivial cohomological invariants for such cocycles.</p>
<p>As an example of a circle extension, consider the sphere <img src='http://s0.wp.com/latex.php?latex=X+%3D+S%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X = S^2' title='X = S^2' class='latex' /> from Example 1, with a rotation shift T given by, say, rotating anti-clockwise by some given angle <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> around the axis connecting the north and south poles.  This rotation also induces a rotation on the circle bundle <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+X+%3A%3D+SX&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde X := SX' title='&#92;tilde X := SX' class='latex' />, thus giving a circle extension of the original system <img src='http://s0.wp.com/latex.php?latex=%28X%2CT%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(X,T)' title='(X,T)' class='latex' />.  One can then use a gauge to write this system as a skew product.  For instance, if one selects the gauge that chooses <img src='http://s0.wp.com/latex.php?latex=p_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p_x' title='p_x' class='latex' /> to be the true north direction at each point x (ignoring for now the fact that this is not defined at the two poles), then this system becomes the ordinary product <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes_0+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times_0 S^1' title='X &#92;times_0 S^1' class='latex' /> of the original system X with the circle <img src='http://s0.wp.com/latex.php?latex=S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S^1' title='S^1' class='latex' />, with the cocycle being the trivial cocycle 0.  If we were however to use a different gauge, e.g. magnetic north instead of true north, one would obtain a different skew-product <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes_%7B%5Crho%27%7D+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times_{&#92;rho&#039;} S^1' title='X &#92;times_{&#92;rho&#039;} S^1' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Crho%27&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho&#039;' title='&#92;rho&#039;' class='latex' /> is some cocycle which is cohomologous to the trivial cocycle (except at the poles).  (A cocycle which is globally cohomologous to the trivial cocycle is known as a <em>coboundary</em>.  Not every cocycle is a coboundary, especially once one imposes topological or measure-theoretic structure, thanks to the presence of various topological or measure-theoretic invariants, such as <a href="http://en.wikipedia.org/wiki/Degree_(continuous_map)">degree</a>.)</p>
<p>There was nothing terribly special about circles in this example; one can also define group extensions, or more generally homogeneous space extensions, of dynamical systems, and have a similar theory, although one has to take a little care with the order of operations when the structure group is non-abelian; see e.g. my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/01/24/254a-lecture-6-isometric-systems-and-isometric-extensions/">lecture notes</a> on isometric extensions. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 3: Orienting an undirected graph.</strong> The language of gauge theory is not often used in combinatorics, but nevertheless combinatorics does provide some simple discrete examples of bundles and gauges which can be useful in getting an intuitive grasp of the concept.  Consider for instance an <a href="http://en.wikipedia.org/wiki/Undirected_graph">undirected graph</a> G = (V,E) of vertices and edges.  I will let X=E denote the space of edges (not the space of vertices)!.  Every edge <img src='http://s0.wp.com/latex.php?latex=e+%5Cin+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e &#92;in X' title='e &#92;in X' class='latex' /> can be oriented (or directed) in two different ways; let <img src='http://s0.wp.com/latex.php?latex=A_e&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_e' title='A_e' class='latex' /> be the pair of directed edges of e arising in this manner.  Then <img src='http://s0.wp.com/latex.php?latex=%28A_e%29_%7Be+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_e)_{e &#92;in X}' title='(A_e)_{e &#92;in X}' class='latex' /> is a fibre bundle with base space X and with each fibre isomorphic (in the category of sets) to the standard two-element set <img src='http://s0.wp.com/latex.php?latex=%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{-1,+1&#92;}' title='&#92;{-1,+1&#92;}' class='latex' />, with structure group <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%2F2%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}/2{&#92;Bbb Z}' title='{&#92;Bbb Z}/2{&#92;Bbb Z}' class='latex' />.</p>
<p><em>A priori</em>, there is no reason to prefer one orientation of an edge e over another, and so there is no canonical way to identify each fibre <img src='http://s0.wp.com/latex.php?latex=A_e&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_e' title='A_e' class='latex' /> with the standard set <img src='http://s0.wp.com/latex.php?latex=%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{-1,+1&#92;}' title='&#92;{-1,+1&#92;}' class='latex' />.  Nevertheless, we can go ahead and arbitrary select a gauge for X by <em>orienting</em> the graph G.  This orientation assigns an oriented edge <img src='http://s0.wp.com/latex.php?latex=%5Cvec+e+%5Cin+A_e&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec e &#92;in A_e' title='&#92;vec e &#92;in A_e' class='latex' /> to each edge <img src='http://s0.wp.com/latex.php?latex=e+%5Cin+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e &#92;in X' title='e &#92;in X' class='latex' />, thus creating a gauge (or section) <img src='http://s0.wp.com/latex.php?latex=%28%5Cvec+e%29_%7Be+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;vec e)_{e &#92;in X}' title='(&#92;vec e)_{e &#92;in X}' class='latex' /> of the bundle <img src='http://s0.wp.com/latex.php?latex=%28A_e%29_%7Be+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_e)_{e &#92;in X}' title='(A_e)_{e &#92;in X}' class='latex' />.  Once one selects such a gauge, we can now identify the fibre bundle <img src='http://s0.wp.com/latex.php?latex=%28A_e%29_%7Be+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_e)_{e &#92;in X}' title='(A_e)_{e &#92;in X}' class='latex' /> with the trivial bundle <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes+%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times &#92;{-1,+1&#92;}' title='X &#92;times &#92;{-1,+1&#92;}' class='latex' /> by identifying the preferred oriented edge <img src='http://s0.wp.com/latex.php?latex=%5Cvec+e&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec e' title='&#92;vec e' class='latex' /> of each unoriented edge <img src='http://s0.wp.com/latex.php?latex=e+%5Cin+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e &#92;in X' title='e &#92;in X' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%28e%2C%2B1%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(e,+1)' title='(e,+1)' class='latex' />, and the other oriented edge with <img src='http://s0.wp.com/latex.php?latex=%28e%2C-1%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(e,-1)' title='(e,-1)' class='latex' />.  In particular, any other orientation of the graph G can be expressed relative to this reference orientation as a function <img src='http://s0.wp.com/latex.php?latex=f%3A+X+%5Cto+%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: X &#92;to &#92;{-1,+1&#92;}' title='f: X &#92;to &#92;{-1,+1&#92;}' class='latex' />, which measures when the two orientations agree or disagree with each other. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p>Recall that every isomorphism <img src='http://s0.wp.com/latex.php?latex=%5CPsi+%5Cin+%5Chbox%7BIsom%7D%28G%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Psi &#92;in &#92;hbox{Isom}(G)' title='&#92;Psi &#92;in &#92;hbox{Isom}(G)' class='latex' /> of a standard geometric object G allowed one to transform a coordinate system <img src='http://s0.wp.com/latex.php?latex=%5CPhi%3A+A+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Phi: A &#92;to G' title='&#92;Phi: A &#92;to G' class='latex' /> on a geometric object A to another coordinate system <img src='http://s0.wp.com/latex.php?latex=%5CPsi+%5Ccirc+%5CPhi%3A+A+%5Cto+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Psi &#92;circ &#92;Phi: A &#92;to G' title='&#92;Psi &#92;circ &#92;Phi: A &#92;to G' class='latex' />.  We can generalise this observation to gauges: every family <img src='http://s0.wp.com/latex.php?latex=%5CPsi+%3D+%28%5CPsi_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Psi = (&#92;Psi_x)_{x &#92;in X}' title='&#92;Psi = (&#92;Psi_x)_{x &#92;in X}' class='latex' /> of isomorphisms on G allows one to transform a gauge <img src='http://s0.wp.com/latex.php?latex=%28%5CPhi_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;Phi_x)_{x &#92;in X}' title='(&#92;Phi_x)_{x &#92;in X}' class='latex' /> to another gauge <img src='http://s0.wp.com/latex.php?latex=%28%5CPsi_x+%5Ccirc+%5CPhi_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;Psi_x &#92;circ &#92;Phi_x)_{x &#92;in X}' title='(&#92;Psi_x &#92;circ &#92;Phi_x)_{x &#92;in X}' class='latex' /> (again assuming that <img src='http://s0.wp.com/latex.php?latex=%5CPsi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Psi' title='&#92;Psi' class='latex' /> respects whatever topological or differentiable structure is present).  Such a collection <img src='http://s0.wp.com/latex.php?latex=%5CPsi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Psi' title='&#92;Psi' class='latex' /> is known as a <em>gauge transformation</em>.  For instance, in Example 1, one could rotate the reference direction <img src='http://s0.wp.com/latex.php?latex=N_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N_x' title='N_x' class='latex' /> at each point <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;in X' title='x &#92;in X' class='latex' /> anti-clockwise by some angle <img src='http://s0.wp.com/latex.php?latex=%5Ctheta%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;theta(x)' title='&#92;theta(x)' class='latex' />; this would cause the function <img src='http://s0.wp.com/latex.php?latex=u%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u(x)' title='u(x)' class='latex' /> to rotate to <img src='http://s0.wp.com/latex.php?latex=u%28x%29+e%5E%7B-i%5Ctheta%28x%29%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u(x) e^{-i&#92;theta(x)}' title='u(x) e^{-i&#92;theta(x)}' class='latex' />.   In Example 2, a gauge transformation is just a map <img src='http://s0.wp.com/latex.php?latex=v%3A+X+%5Cto+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v: X &#92;to S^1' title='v: X &#92;to S^1' class='latex' /> (which may need to be continuous or measurable, depending on the structures one places on X); it rotates a point <img src='http://s0.wp.com/latex.php?latex=%28x%2Cu%29+%5Cin+X+%5Ctimes_%5Crho+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x,u) &#92;in X &#92;times_&#92;rho S^1' title='(x,u) &#92;in X &#92;times_&#92;rho S^1' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%28x%2C+v%5E%7B-1%7D+u%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x, v^{-1} u)' title='(x, v^{-1} u)' class='latex' />, and it also transforms the cocycle <img src='http://s0.wp.com/latex.php?latex=%5Crho&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho' title='&#92;rho' class='latex' /> by the formula (1).  In Example 3, a gauge transformation would be a map <img src='http://s0.wp.com/latex.php?latex=v%3A+X+%5Cto+%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v: X &#92;to &#92;{-1,+1&#92;}' title='v: X &#92;to &#92;{-1,+1&#92;}' class='latex' />; it rotates a point <img src='http://s0.wp.com/latex.php?latex=%28x%2C+%5Cepsilon%29+%5Cin+X+%5Ctimes+%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x, &#92;epsilon) &#92;in X &#92;times &#92;{-1,+1&#92;}' title='(x, &#92;epsilon) &#92;in X &#92;times &#92;{-1,+1&#92;}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%28x%2C+v%28x%29+%5Cepsilon%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x, v(x) &#92;epsilon)' title='(x, v(x) &#92;epsilon)' class='latex' />.</p>
<p>Gauge transformations transform functions on the base X in many ways, but some things remain gauge-invariant.  For instance, in Example 1, the <a href="http://en.wikipedia.org/wiki/Winding_number">winding number</a> of a function <img src='http://s0.wp.com/latex.php?latex=u%3A+X+%5Cto+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u: X &#92;to S^1' title='u: X &#92;to S^1' class='latex' /> along a closed loop <img src='http://s0.wp.com/latex.php?latex=%5Cgamma+%5Csubset+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma &#92;subset X' title='&#92;gamma &#92;subset X' class='latex' /> would not change under a gauge transformation (as long as no singularities in the gauge are created, moved, or destroyed, and the orientation is not reversed).  But such topological gauge-invariants are not the only gauge invariants of interest; there are important <em>differential</em> gauge-invariants which make gauge theory a crucial component of modern differential geometry and geometric PDE.  But to describe these, one needs an additional gauge-theoretic concept, namely that of a <a href="http://en.wikipedia.org/wiki/Connection_(mathematics)"><em>connection</em></a> on a fibre bundle.</p>
<p style="text-align:center;">&#8211; Connections &#8211;</p>
<p style="text-align:left;">There are many essentially equivalent ways to introduce the concept of a connection; I will use the formulation based primarily on <a href="http://en.wikipedia.org/wiki/Parallel_transport">parallel transport</a>, and on differentiation of sections.  To avoid some technical details I will work (somewhat non-rigorously) with <a href="http://en.wikipedia.org/wiki/Infinitesimal">infinitesimals</a> such as dx.  (There are ways to make the use of infinitesimals rigorous, such as <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/06/25/ultrafilters-nonstandard-analysis-and-epsilon-management/">non-standard analysis</a>, but this is not the focus of my post today.)</p>
<p style="text-align:left;">In single variable calculus, we learn that if we want to differentiate a function <img src='http://s0.wp.com/latex.php?latex=f%3A+%5Ba%2Cb%5D+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: [a,b] &#92;to {&#92;Bbb R}' title='f: [a,b] &#92;to {&#92;Bbb R}' class='latex' /> at some point x, then we need to compare the value f(x) of f at x with its value f(x+dx) at some infinitesimally close point x+dx, take the difference <img src='http://s0.wp.com/latex.php?latex=f%28x%2Bdx%29-f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x+dx)-f(x)' title='f(x+dx)-f(x)' class='latex' />, and then divide by dx, taking limits as <img src='http://s0.wp.com/latex.php?latex=dx+%5Cto+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='dx &#92;to 0' title='dx &#92;to 0' class='latex' />, if one does not like to use infinitesimals:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cnabla+f%28x%29+%3A%3D+%5Clim_%7Bdx+%5Cto+0%7D+%5Cfrac%7Bf%28x%2Bdx%29+-+f%28x%29%7D%7Bdx%7D.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;nabla f(x) := &#92;lim_{dx &#92;to 0} &#92;frac{f(x+dx) - f(x)}{dx}.' title='&#92;displaystyle &#92;nabla f(x) := &#92;lim_{dx &#92;to 0} &#92;frac{f(x+dx) - f(x)}{dx}.' class='latex' /></p>
<p style="text-align:left;">In several variable calculus, we learn several generalisations of this concept in which the domain and range of f to be multi-dimensional.  For instance, if <img src='http://s0.wp.com/latex.php?latex=f%3A+X+%5Cto+%7B%5CBbb+R%7D%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: X &#92;to {&#92;Bbb R}^d' title='f: X &#92;to {&#92;Bbb R}^d' class='latex' /> is now a vector-valued function on some multi-dimensional domain (e.g. a <a href="http://en.wikipedia.org/wiki/Manifold">manifold</a>) X, and v is a <a href="http://en.wikipedia.org/wiki/Tangent_space">tangent vector</a> to X at some point x, we can define the <a href="http://en.wikipedia.org/wiki/Directional_derivative">directional derivative</a> <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_v+f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_v f(x)' title='&#92;nabla_v f(x)' class='latex' /> of f at x by comparing <img src='http://s0.wp.com/latex.php?latex=f%28x%2Bv+dt%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x+v dt)' title='f(x+v dt)' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x)' title='f(x)' class='latex' /> for some infinitesimal dt, take the difference <img src='http://s0.wp.com/latex.php?latex=f%28x%2Bvdt%29+-+f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x+vdt) - f(x)' title='f(x+vdt) - f(x)' class='latex' />, divide by dt, and then take limits as <img src='http://s0.wp.com/latex.php?latex=dt+%5Cto+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='dt &#92;to 0' title='dt &#92;to 0' class='latex' />:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cnabla_v+f%28x%29+%3A%3D+%5Clim_%7Bdt+%5Cto+0%7D+%5Cfrac%7Bf%28x%2Bvdt%29+-+f%28x%29%7D%7Bdt%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;nabla_v f(x) := &#92;lim_{dt &#92;to 0} &#92;frac{f(x+vdt) - f(x)}{dt}' title='&#92;displaystyle &#92;nabla_v f(x) := &#92;lim_{dt &#92;to 0} &#92;frac{f(x+vdt) - f(x)}{dt}' class='latex' />.</p>
<p style="text-align:left;">[Strictly speaking, if X is not flat, then x+vdt is only defined up to an ambiguity of o(dt), but let us ignore this minor issue here, as it is not important in the limit.]  If f is sufficiently smooth (being continuously differentiable will do), the directional derivative is linear in v, thus for instance <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_%7Bv%2Bv%27%7D+f%28x%29+%3D+%5Cnabla_v+f%28x%29+%2B+%5Cnabla_%7Bv%27%7D+f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_{v+v&#039;} f(x) = &#92;nabla_v f(x) + &#92;nabla_{v&#039;} f(x)' title='&#92;nabla_{v+v&#039;} f(x) = &#92;nabla_v f(x) + &#92;nabla_{v&#039;} f(x)' class='latex' />. One can also generalise the range of f to other multi-dimensional domains than <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^d' title='{&#92;Bbb R}^d' class='latex' />; the directional derivative then lives in a tangent space of that domain.</p>
<p style="text-align:left;">In all of the above examples, though, we were differentiating functions <img src='http://s0.wp.com/latex.php?latex=f%3AX+%5Cto+Y&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f:X &#92;to Y' title='f:X &#92;to Y' class='latex' />, thus each element <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;in X' title='x &#92;in X' class='latex' /> in the base (or domain) gets mapped to an element <img src='http://s0.wp.com/latex.php?latex=f%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f(x)' title='f(x)' class='latex' /> in the same range Y.  However, in many geometrical situations we would like to differentiate <em>sections</em> <img src='http://s0.wp.com/latex.php?latex=f+%3D+%28f_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f = (f_x)_{x &#92;in X}' title='f = (f_x)_{x &#92;in X}' class='latex' /> instead of functions, thus f now maps each point <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;in X' title='x &#92;in X' class='latex' /> in the base to an element <img src='http://s0.wp.com/latex.php?latex=f_x+%5Cin+A_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f_x &#92;in A_x' title='f_x &#92;in A_x' class='latex' /> of some fibre in a fibre bundle <img src='http://s0.wp.com/latex.php?latex=%28A_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_x)_{x &#92;in X}' title='(A_x)_{x &#92;in X}' class='latex' />.  For instance, one might want to know how the wind direction <img src='http://s0.wp.com/latex.php?latex=w+%3D+%28w_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w = (w_x)_{x &#92;in X}' title='w = (w_x)_{x &#92;in X}' class='latex' /> changes as one moves x in some direction v; thus computing a directional derivative <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_v+w%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_v w(x)' title='&#92;nabla_v w(x)' class='latex' /> of w at x in direction v.  One can try to mimic the previous definitions in order to define this directional derivative.  For instance, one can move x along v by some infinitesimal amount dt, creating a nearby point <img src='http://s0.wp.com/latex.php?latex=x%2Bv+dt&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x+v dt' title='x+v dt' class='latex' />, and then evaluate w at this point to obtain <img src='http://s0.wp.com/latex.php?latex=w%28x%2Bvdt%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w(x+vdt)' title='w(x+vdt)' class='latex' />.  But here we hit a snag: we cannot directly compare <img src='http://s0.wp.com/latex.php?latex=w%28x%2Bvdt%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w(x+vdt)' title='w(x+vdt)' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=w%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w(x)' title='w(x)' class='latex' />, because the former lives in the fibre <img src='http://s0.wp.com/latex.php?latex=A_%7Bx%2Bvdt%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{x+vdt}' title='A_{x+vdt}' class='latex' /> while the latter lives in the fibre <img src='http://s0.wp.com/latex.php?latex=A_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x' title='A_x' class='latex' />.</p>
<p style="text-align:left;">With a gauge, of course, we can identify all the fibres (and in particular, <img src='http://s0.wp.com/latex.php?latex=A_%7Bx%2Bvdt%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{x+vdt}' title='A_{x+vdt}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x' title='A_x' class='latex' />) with a common object G, in which case there is no difficulty comparing <img src='http://s0.wp.com/latex.php?latex=w%28x%2Bvdt%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w(x+vdt)' title='w(x+vdt)' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=w%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w(x)' title='w(x)' class='latex' />.  But this would lead to a notion of derivative which is not gauge-invariant, known as the <em>non-covariant</em> or <em>ordinary</em> derivative in physics.</p>
<p style="text-align:left;">But there is another way to take a derivative, which does not require the full strength of a gauge (which identifies <em>all</em> fibres simultaneously together).  Indeed, in order to compute a derivative <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_v+w%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_v w(x)' title='&#92;nabla_v w(x)' class='latex' />, one only needs to identify (or <em>connect</em>) two infinitesimally close fibres together: <img src='http://s0.wp.com/latex.php?latex=A_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x' title='A_x' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A_%7Bx%2Bvdt%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{x+vdt}' title='A_{x+vdt}' class='latex' />.  In practice, these two fibres are already &#8220;within O(dt) of each other&#8221; in some sense, but suppose in fact that we have some means <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28x+%5Cto+x%2Bvdt%29%3A+A_x+%5Cto+A_%7Bx%2Bvdt%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(x &#92;to x+vdt): A_x &#92;to A_{x+vdt}' title='&#92;Gamma(x &#92;to x+vdt): A_x &#92;to A_{x+vdt}' class='latex' /> of identifying these two fibres together.  Then, we can pull back <img src='http://s0.wp.com/latex.php?latex=w%28x%2Bvdt%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='w(x+vdt)' title='w(x+vdt)' class='latex' /> from <img src='http://s0.wp.com/latex.php?latex=A_%7Bx%2Bvdt%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{x+vdt}' title='A_{x+vdt}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=A_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x' title='A_x' class='latex' /> through <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28x+%5Cto+x%2Bvdt%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(x &#92;to x+vdt)' title='&#92;Gamma(x &#92;to x+vdt)' class='latex' /> to define the <a href="http://en.wikipedia.org/wiki/Covariant_derivative">covariant derivative</a>:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cnabla_v+w%28x%29+%3A%3D+%5Clim_%7Bdt+%5Cto+0%7D+%5Cfrac%7B%5CGamma%28x+%5Cto+x%2Bvdt%29%5E%7B-1%7D%28+w%28x%2Bvdt%29+%29+-+w%28x%29+%7D%7Bdt%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle &#92;nabla_v w(x) := &#92;lim_{dt &#92;to 0} &#92;frac{&#92;Gamma(x &#92;to x+vdt)^{-1}( w(x+vdt) ) - w(x) }{dt}' title='&#92;displaystyle &#92;nabla_v w(x) := &#92;lim_{dt &#92;to 0} &#92;frac{&#92;Gamma(x &#92;to x+vdt)^{-1}( w(x+vdt) ) - w(x) }{dt}' class='latex' />.</p>
<p style="text-align:left;">
<p style="text-align:left;">
<p>In order to retain the basic property that <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_v+w&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_v w' title='&#92;nabla_v w' class='latex' /> is linear in v, and to allow one to extend the infinitesimal identifications <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28x+%5Cto+x%2Bdx%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(x &#92;to x+dx)' title='&#92;Gamma(x &#92;to x+dx)' class='latex' /> to non-infinitesimal identifications, we impose the property that the <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28x+%5Cto+x%2Bdx%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(x &#92;to x+dx)' title='&#92;Gamma(x &#92;to x+dx)' class='latex' /> to be approximately transitive in that</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CGamma%28x%2Bdx+%5Cto+x%2Bdx%2Bdx%27%29+%5Ccirc+%5CGamma%28x+%5Cto+x+%2B+dx+%29+%5Capprox+%5CGamma%28x+%5Cto+x%2Bdx%2Bdx%27%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(x+dx &#92;to x+dx+dx&#039;) &#92;circ &#92;Gamma(x &#92;to x + dx ) &#92;approx &#92;Gamma(x &#92;to x+dx+dx&#039;)' title='&#92;Gamma(x+dx &#92;to x+dx+dx&#039;) &#92;circ &#92;Gamma(x &#92;to x + dx ) &#92;approx &#92;Gamma(x &#92;to x+dx+dx&#039;)' class='latex' /> (1)</p>
<p>for all x, dx, dx&#8217;, where the <img src='http://s0.wp.com/latex.php?latex=%5Capprox&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;approx' title='&#92;approx' class='latex' /> symbol indicates that the error between the two sides is o(&#124;dx&#124; + &#124;dx&#8217;&#124;).  [The precise nature of this error is actually rather important, being essentially the <a href="http://en.wikipedia.org/wiki/Curvature_form"><em>curvature</em></a> of the connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> at x in the directions <img src='http://s0.wp.com/latex.php?latex=dx%2C+dx%27&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='dx, dx&#039;' title='dx, dx&#039;' class='latex' />, but let us ignore this for now.]  To oversimplify a little bit, any collection <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> of infinitesimal maps <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28x+%5Cto+x%2Bdx%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(x &#92;to x+dx)' title='&#92;Gamma(x &#92;to x+dx)' class='latex' /> obeying this property (and some technical regularity properties) is a <em>connection</em>.</p>
<p>[There are many other important ways to view connections, for instance the <a href="http://en.wikipedia.org/wiki/Christoffel_symbols">Christoffel symbol</a> perspective that we will discuss a bit later.  Another approach is to focus on the differentiation operation <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_v&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_v' title='&#92;nabla_v' class='latex' /> rather than the identifications <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28x+%5Cto+x%2Bdx%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(x &#92;to x+dx)' title='&#92;Gamma(x &#92;to x+dx)' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28%5Cgamma%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(&#92;gamma)' title='&#92;Gamma(&#92;gamma)' class='latex' />, and in particular on the algebraic properties of this operation, such as linearity in v or <a href="http://en.wikipedia.org/wiki/Derivation_(abstract_algebra)">derivation</a>-type properties (in particular, obeying various variants of the <a href="http://en.wikipedia.org/wiki/Leibnitz_rule">Leibnitz rule</a>).  This approach is particularly important in algebraic geometry, in which the notion of an infinitesimal or of a path may not always be obviously available, but we will not discuss it here.]</p>
<p>The way we have defined it, a connection is a means of identifying two infinitesimally close fibres <img src='http://s0.wp.com/latex.php?latex=A_x%2C+A_%7Bx%2Bdx%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x, A_{x+dx}' title='A_x, A_{x+dx}' class='latex' /> of a fibre bundle <img src='http://s0.wp.com/latex.php?latex=%28A_x%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_x)_{x &#92;in X}' title='(A_x)_{x &#92;in X}' class='latex' />.  But, thanks to (1), we can also identify two distant fibres <img src='http://s0.wp.com/latex.php?latex=A_x%2C+A_y&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x, A_y' title='A_x, A_y' class='latex' />, provided that we have a path <img src='http://s0.wp.com/latex.php?latex=%5Cgamma%3A+%5Ba%2Cb%5D+%5Cto+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma: [a,b] &#92;to X' title='&#92;gamma: [a,b] &#92;to X' class='latex' /> from <img src='http://s0.wp.com/latex.php?latex=x+%3D+%5Cgamma%28a%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x = &#92;gamma(a)' title='x = &#92;gamma(a)' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=y+%3D+%5Cgamma%28b%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='y = &#92;gamma(b)' title='y = &#92;gamma(b)' class='latex' />, by concatenating the infinitesimal identifications by a non-commutative variant of a <a href="http://en.wikipedia.org/wiki/Riemann_sum">Riemann sum</a>:</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CGamma%28%5Cgamma%29+%3A%3D+%5Clim_%7B%5Csup+%26%23124%3Bt_%7Bi%2B1%7D-t_i%26%23124%3B+%5Cto+0%7D+%5CGamma%28%5Cgamma%28t_%7Bn-1%7D%29+%5Cto+%5Cgamma%28t_n%29%29+%5Ccirc+%5Cldots+%5Ccirc+%5CGamma%28%5Cgamma%28t_0%29+%5Cto+%5Cgamma%28t_1%29%29%2C&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(&#92;gamma) := &#92;lim_{&#92;sup &#124;t_{i+1}-t_i&#124; &#92;to 0} &#92;Gamma(&#92;gamma(t_{n-1}) &#92;to &#92;gamma(t_n)) &#92;circ &#92;ldots &#92;circ &#92;Gamma(&#92;gamma(t_0) &#92;to &#92;gamma(t_1)),' title='&#92;Gamma(&#92;gamma) := &#92;lim_{&#92;sup &#124;t_{i+1}-t_i&#124; &#92;to 0} &#92;Gamma(&#92;gamma(t_{n-1}) &#92;to &#92;gamma(t_n)) &#92;circ &#92;ldots &#92;circ &#92;Gamma(&#92;gamma(t_0) &#92;to &#92;gamma(t_1)),' class='latex' /> (2)</p>
<p>where <img src='http://s0.wp.com/latex.php?latex=a+%3D+t_0+%26%2360%3B+t_1+%26%2360%3B+%5Cldots+%26%2360%3B+t_n+%3D+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a = t_0 &lt; t_1 &lt; &#92;ldots &lt; t_n = b' title='a = t_0 &lt; t_1 &lt; &#92;ldots &lt; t_n = b' class='latex' /> ranges over partitions.  This gives us a <a href="http://en.wikipedia.org/wiki/Parallel_transport"><em>parallel transport</em></a> map <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28%5Cgamma%29%3A+A_x+%5Cto+A_y&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(&#92;gamma): A_x &#92;to A_y' title='&#92;Gamma(&#92;gamma): A_x &#92;to A_y' class='latex' /> identifying <img src='http://s0.wp.com/latex.php?latex=A_x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_x' title='A_x' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=A_y&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_y' title='A_y' class='latex' />, which in view of its Riemann sum definition, can be viewed as the &#8220;integral&#8221; of the connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> along the curve <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' />.  This map does not depend on how one parametrises the path <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' />, but it can depend on the choice of path used to travel from x to y.</p>
<p>We illustrate these concepts using several examples, including the three examples introduced earlier.</p>
<p><strong>Example 1 continued.</strong> (Circle bundle of the sphere) The geometry of the sphere X in Example 1 provides a natural connection on the circle bundle SX, the <a href="http://en.wikipedia.org/wiki/Levi-Civita_connection">Levi-Civita connection</a> <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' />, that lets one transport directions around the sphere in as &#8220;parallel&#8221; a manner as possible; the precise definition is a little technical (see e.g. my <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/03/26/285g-lecture-0-riemannian-manifolds-and-curvature/">lecture notes</a> for a brief description).  Suppose for instance one starts at some location x on the equator of the earth, and moves to the antipodal point y by a <a href="http://en.wikipedia.org/wiki/Great_circle">great semi-circle</a> <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' /> going through the north pole.  The parallel transport <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28%5Cgamma%29%3A+S_x+%5Cto+S_y&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(&#92;gamma): S_x &#92;to S_y' title='&#92;Gamma(&#92;gamma): S_x &#92;to S_y' class='latex' /> along this path will map the north direction at x to the <em>south</em> direction at y.  On the other hand, if we went from x to y by a great semi-circle <img src='http://s0.wp.com/latex.php?latex=%5Cgamma%27&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma&#039;' title='&#92;gamma&#039;' class='latex' /> going along the equator, then the north direction at x would be transported to the <em>north</em> direction at y.  Given a section u of this circle bundle, the quantity <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_v+u%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_v u(x)' title='&#92;nabla_v u(x)' class='latex' /> can be interpreted as the rate at which u rotates as one travels from x with velocity v. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 2 continued. </strong>(Circle extensions) In Example 2, we change the notion of &#8220;infinitesimally close&#8221; by declaring x and Tx to be infinitesimally close for any x in the base space X (and more generally, x and <img src='http://s0.wp.com/latex.php?latex=T%5En+x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T^n x' title='T^n x' class='latex' /> are non-infinitesimally close for any positive integer n, being connected by the path <img src='http://s0.wp.com/latex.php?latex=x+%5Cto+Tx+%5Cto+%5Cldots+%5Cto+T%5En+x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x &#92;to Tx &#92;to &#92;ldots &#92;to T^n x' title='x &#92;to Tx &#92;to &#92;ldots &#92;to T^n x' class='latex' />, and similarly for negative n).  A cocycle <img src='http://s0.wp.com/latex.php?latex=%5Crho%3A+X+%5Cto+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho: X &#92;to S^1' title='&#92;rho: X &#92;to S^1' class='latex' /> can then be viewed as defining a connection on the skew product <img src='http://s0.wp.com/latex.php?latex=X+%5Ctimes_%5Crho+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X &#92;times_&#92;rho S^1' title='X &#92;times_&#92;rho S^1' class='latex' />, by setting <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28+x+%5Cmapsto+Tx+%29+%3D+%5Crho%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma( x &#92;mapsto Tx ) = &#92;rho(x)' title='&#92;Gamma( x &#92;mapsto Tx ) = &#92;rho(x)' class='latex' /> (and also <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28x+%5Cto+x%29+%3D+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(x &#92;to x) = 1' title='&#92;Gamma(x &#92;to x) = 1' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28Tx+%5Cto+x+%29+%3D+%5Crho%28x%29%5E%7B-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(Tx &#92;to x ) = &#92;rho(x)^{-1}' title='&#92;Gamma(Tx &#92;to x ) = &#92;rho(x)^{-1}' class='latex' /> to ensure compatibility with (1); to avoid notational ambiguities let us assume for sake of discussion that <img src='http://s0.wp.com/latex.php?latex=x%2C+Tx%2C+T%5E%7B-1%7D+x&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x, Tx, T^{-1} x' title='x, Tx, T^{-1} x' class='latex' /> are always distinct from each other).  The non-infinitesimal connections <img src='http://s0.wp.com/latex.php?latex=%5Crho_n%28x%29+%3A%3D+%5CGamma%28x+%5Cto+Tx+%5Cto+%5Cldots+%5Cto+T%5En+x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho_n(x) := &#92;Gamma(x &#92;to Tx &#92;to &#92;ldots &#92;to T^n x)' title='&#92;rho_n(x) := &#92;Gamma(x &#92;to Tx &#92;to &#92;ldots &#92;to T^n x)' class='latex' /> are then given by the formula <img src='http://s0.wp.com/latex.php?latex=%5Crho_n%28x%29+%3D+%5Crho%28x%29+%5Crho%28Tx%29+%5Cldots+%5Crho%28T%5E%7Bn-1%7D+x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho_n(x) = &#92;rho(x) &#92;rho(Tx) &#92;ldots &#92;rho(T^{n-1} x)' title='&#92;rho_n(x) = &#92;rho(x) &#92;rho(Tx) &#92;ldots &#92;rho(T^{n-1} x)' class='latex' /> for positive n (with a similar formula for negative n).  Note that these iterated cocycles <img src='http://s0.wp.com/latex.php?latex=%5Crho_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;rho_n' title='&#92;rho_n' class='latex' /> also describe the iterations of the shift <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+T%3A+%28x%2Cu%29+%5Cmapsto+%28Tx%2C%5Crho%28x%29u%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde T: (x,u) &#92;mapsto (Tx,&#92;rho(x)u)' title='&#92;tilde T: (x,u) &#92;mapsto (Tx,&#92;rho(x)u)' class='latex' />, indeed <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+T%5En+%28x%2Cu%29+%3D+%28T%5En+x%2C+%5Crho_n%28x%29+u%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde T^n (x,u) = (T^n x, &#92;rho_n(x) u)' title='&#92;tilde T^n (x,u) = (T^n x, &#92;rho_n(x) u)' class='latex' />. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 3 continued.</strong> (Oriented graphs) In Example 3, we declare two edges e, e&#8217; in X to be &#8220;infinitesimally close&#8221; if they are adjacent.  Then there is a natural notion of parallel transport on the bundle <img src='http://s0.wp.com/latex.php?latex=%28A_e%29_%7Be+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_e)_{e &#92;in X}' title='(A_e)_{e &#92;in X}' class='latex' />; given two adjacent edges <img src='http://s0.wp.com/latex.php?latex=e+%3D+%5C%7Bu%2Cv%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e = &#92;{u,v&#92;}' title='e = &#92;{u,v&#92;}' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=e%27%3D%5C%7Bv%2Cw%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='e&#039;=&#92;{v,w&#92;}' title='e&#039;=&#92;{v,w&#92;}' class='latex' />, we let <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28e+%5Cto+e%27%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(e &#92;to e&#039;)' title='&#92;Gamma(e &#92;to e&#039;)' class='latex' /> be the isomorphism from <img src='http://s0.wp.com/latex.php?latex=A_e+%3D+%5C%7B+%5Cvec%7Buv%7D%2C+%5Cvec%7Bvu%7D+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_e = &#92;{ &#92;vec{uv}, &#92;vec{vu} &#92;}' title='A_e = &#92;{ &#92;vec{uv}, &#92;vec{vu} &#92;}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=A_%7Be%27%7D+%3D+%5C%7B+%5Cvec%7Bvw%7D%2C+%5Cvec%7Bwv%7D+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{e&#039;} = &#92;{ &#92;vec{vw}, &#92;vec{wv} &#92;}' title='A_{e&#039;} = &#92;{ &#92;vec{vw}, &#92;vec{wv} &#92;}' class='latex' /> that maps <img src='http://s0.wp.com/latex.php?latex=%5Cvec%7Buv%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec{uv}' title='&#92;vec{uv}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%5Cvec%7Bvw%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec{vw}' title='&#92;vec{vw}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cvec%7Bvu%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec{vu}' title='&#92;vec{vu}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%5Cvec%7Bwv%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec{wv}' title='&#92;vec{wv}' class='latex' />.  Any path <img src='http://s0.wp.com/latex.php?latex=%5Cgamma+%3D+%28%5C%7Bv_1%2Cv_2%5C%7D%2C+%5C%7Bv_2%2Cv_3%5C%7D%2C+%5Cldots%2C+%5C%7Bv_%7Bn-1%7D%2Cv_n%5C%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma = (&#92;{v_1,v_2&#92;}, &#92;{v_2,v_3&#92;}, &#92;ldots, &#92;{v_{n-1},v_n&#92;})' title='&#92;gamma = (&#92;{v_1,v_2&#92;}, &#92;{v_2,v_3&#92;}, &#92;ldots, &#92;{v_{n-1},v_n&#92;})' class='latex' /> of edges then gives rise to a connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28%5Cgamma%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(&#92;gamma)' title='&#92;Gamma(&#92;gamma)' class='latex' /> identifying <img src='http://s0.wp.com/latex.php?latex=A_%7B%5C%7Bv_1%2Cv_2%5C%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{&#92;{v_1,v_2&#92;}}' title='A_{&#92;{v_1,v_2&#92;}}' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=A_%7B%5C%7Bv_%7Bn-1%7D%2Cv_n%5C%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{&#92;{v_{n-1},v_n&#92;}}' title='A_{&#92;{v_{n-1},v_n&#92;}}' class='latex' />.  For instance, the triangular path <img src='http://s0.wp.com/latex.php?latex=%28%5C%7Bu%2Cv%5C%7D%2C+%5C%7Bv%2Cw%5C%7D%2C+%5C%7Bw%2Cu%5C%7D%2C+%5C%7Bu%2Cv%5C%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;{u,v&#92;}, &#92;{v,w&#92;}, &#92;{w,u&#92;}, &#92;{u,v&#92;})' title='(&#92;{u,v&#92;}, &#92;{v,w&#92;}, &#92;{w,u&#92;}, &#92;{u,v&#92;})' class='latex' /> induces the identity map on <img src='http://s0.wp.com/latex.php?latex=A_%7B%5C%7Bu%2Cv%5C%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{&#92;{u,v&#92;}}' title='A_{&#92;{u,v&#92;}}' class='latex' />, whereas the U-turn path <img src='http://s0.wp.com/latex.php?latex=%28%5C%7Bu%2Cv%5C%7D%2C+%5C%7Bv%2Cw%5C%7D%2C+%5C%7Bw%2Cx%5C%7D%2C+%5C%7Bx%2Cv%5C%7D%2C+%5C%7Bv%2Cu%5C%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(&#92;{u,v&#92;}, &#92;{v,w&#92;}, &#92;{w,x&#92;}, &#92;{x,v&#92;}, &#92;{v,u&#92;})' title='(&#92;{u,v&#92;}, &#92;{v,w&#92;}, &#92;{w,x&#92;}, &#92;{x,v&#92;}, &#92;{v,u&#92;})' class='latex' /> induces the anti-identity map on <img src='http://s0.wp.com/latex.php?latex=A_%7B%5C%7Bu%2Cv%5C%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_{&#92;{u,v&#92;}}' title='A_{&#92;{u,v&#92;}}' class='latex' />.</p>
<p>Given an orientation <img src='http://s0.wp.com/latex.php?latex=%5Cvec+G+%3D+%28%5Cvec+e%29_%7Be+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec G = (&#92;vec e)_{e &#92;in X}' title='&#92;vec G = (&#92;vec e)_{e &#92;in X}' class='latex' /> of the graph G, one can &#8220;differentiate&#8221; <img src='http://s0.wp.com/latex.php?latex=%5Cvec+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec G' title='&#92;vec G' class='latex' /> at an edge <img src='http://s0.wp.com/latex.php?latex=%5C%7Bu%2Cv%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{u,v&#92;}' title='&#92;{u,v&#92;}' class='latex' /> in the direction <img src='http://s0.wp.com/latex.php?latex=%5C%7Bu%2Cv%5C%7D+%5Cto+%5C%7Bv%2Cw%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{u,v&#92;} &#92;to &#92;{v,w&#92;}' title='&#92;{u,v&#92;} &#92;to &#92;{v,w&#92;}' class='latex' /> to obtain a number <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_%7B%5C%7Bu%2Cv%5C%7D+%5Cto+%5C%7Bv%2Cw%5C%7D%7D+%5Cvec+G%28%5C%7Bu%2Cv%5C%7D%29+%5Cin+%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_{&#92;{u,v&#92;} &#92;to &#92;{v,w&#92;}} &#92;vec G(&#92;{u,v&#92;}) &#92;in &#92;{-1,+1&#92;}' title='&#92;nabla_{&#92;{u,v&#92;} &#92;to &#92;{v,w&#92;}} &#92;vec G(&#92;{u,v&#92;}) &#92;in &#92;{-1,+1&#92;}' class='latex' />, defined as +1 if the parallel transport from <img src='http://s0.wp.com/latex.php?latex=%5C%7Bu%2Cv%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{u,v&#92;}' title='&#92;{u,v&#92;}' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5C%7Bv%2Cw%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{v,w&#92;}' title='&#92;{v,w&#92;}' class='latex' /> preserves the orientations given by <img src='http://s0.wp.com/latex.php?latex=%5Cvec+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec G' title='&#92;vec G' class='latex' />, and -1 otherwise.  This number of course depends on the choice of orientation.  But certain combinations of these numbers are independent of such a choice; for instance, given any closed path <img src='http://s0.wp.com/latex.php?latex=%5Cgamma+%3D+%5C%7Be_1%2Ce_2%2C%5Cldots%2Ce_n%2Ce_%7Bn%2B1%7D%3De_1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma = &#92;{e_1,e_2,&#92;ldots,e_n,e_{n+1}=e_1&#92;}' title='&#92;gamma = &#92;{e_1,e_2,&#92;ldots,e_n,e_{n+1}=e_1&#92;}' class='latex' /> of edges in X, the &#8220;integral&#8221; <img src='http://s0.wp.com/latex.php?latex=%5Cprod_%7Bi%3D1%7D%5En+%5Cnabla_%7Be_i+%5Cto+e_%7Bi%2B1%7D%7D+%5Cvec+G%28e_i%29+%5Cin+%5C%7B-1%2C%2B1%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;prod_{i=1}^n &#92;nabla_{e_i &#92;to e_{i+1}} &#92;vec G(e_i) &#92;in &#92;{-1,+1&#92;}' title='&#92;prod_{i=1}^n &#92;nabla_{e_i &#92;to e_{i+1}} &#92;vec G(e_i) &#92;in &#92;{-1,+1&#92;}' class='latex' /> is independent of the choice of orientation <img src='http://s0.wp.com/latex.php?latex=%5Cvec+G&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;vec G' title='&#92;vec G' class='latex' /> (indeed, it is equal to +1 if <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28%5Cgamma%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(&#92;gamma)' title='&#92;Gamma(&#92;gamma)' class='latex' /> is the identity, and -1 if <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28%5Cgamma%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(&#92;gamma)' title='&#92;Gamma(&#92;gamma)' class='latex' /> is the anti-identity.  <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 4.</strong> (Monodromy)  One can interpret the <a href="http://en.wikipedia.org/wiki/Monodromy">monodromy maps</a> of a <a href="http://en.wikipedia.org/wiki/Covering_space">covering space</a> in the language of connections.  Suppose for instance that we have a covering space <img src='http://s0.wp.com/latex.php?latex=%5Cpi%3A+%5Ctilde+X+%5Cto+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;pi: &#92;tilde X &#92;to X' title='&#92;pi: &#92;tilde X &#92;to X' class='latex' /> of a topological space X whose fibres <img src='http://s0.wp.com/latex.php?latex=%5Cpi%5E%7B-1%7D%28%5C%7Bx%5C%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;pi^{-1}(&#92;{x&#92;})' title='&#92;pi^{-1}(&#92;{x&#92;})' class='latex' /> are discrete; thus <img src='http://s0.wp.com/latex.php?latex=%5Ctilde+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde X' title='&#92;tilde X' class='latex' /> is a discrete fibre bundle over X.  The discreteness induces a natural connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> on this space, which is given by the lifting map; in particular, if one integrates this connection on a closed loop based at some point x, one obtains the monodromy map of that loop at x. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 5.</strong> (Definite integrals) In view of the definition (2), it should not be surprising that the <a href="http://en.wikipedia.org/wiki/Definite_integral">definite integral</a> <img src='http://s0.wp.com/latex.php?latex=%5Cint_a%5Eb+f%28x%29%5C+dx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;int_a^b f(x)&#92; dx' title='&#92;int_a^b f(x)&#92; dx' class='latex' /> of a scalar function <img src='http://s0.wp.com/latex.php?latex=f%3A+%5Ba%2Cb%5D+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f: [a,b] &#92;to {&#92;Bbb R}' title='f: [a,b] &#92;to {&#92;Bbb R}' class='latex' /> can be interpreted as an integral of a connection.  Indeed, set <img src='http://s0.wp.com/latex.php?latex=X+%3A%3D+%5Ba%2Cb%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X := [a,b]' title='X := [a,b]' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=%28%7B%5CBbb+R%7D%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='({&#92;Bbb R})_{x &#92;in X}' title='({&#92;Bbb R})_{x &#92;in X}' class='latex' /> be the trivial line bundle over X.  The function f induces a connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma_f&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_f' title='&#92;Gamma_f' class='latex' /> on this bundle by setting</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CGamma_f%28x+%5Cmapsto+x%2Bdx%29%3A+y+%5Cmapsto+y+%2B+f%28x%29+dx.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_f(x &#92;mapsto x+dx): y &#92;mapsto y + f(x) dx.' title='&#92;Gamma_f(x &#92;mapsto x+dx): y &#92;mapsto y + f(x) dx.' class='latex' /></p>
<p>The integral <img src='http://s0.wp.com/latex.php?latex=%5CGamma_f%28%5Ba%2Cb%5D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_f([a,b])' title='&#92;Gamma_f([a,b])' class='latex' /> of this connection along <img src='http://s0.wp.com/latex.php?latex=%7B%7D%5Ba%2Cb%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{}[a,b]' title='{}[a,b]' class='latex' /> is then just the operation of translation by <img src='http://s0.wp.com/latex.php?latex=%5Cint_a%5Eb+f%28x%29%5C+dx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;int_a^b f(x)&#92; dx' title='&#92;int_a^b f(x)&#92; dx' class='latex' /> in the real line. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 6.</strong> (Line integrals) One can generalise Example 5 to encompass <a href="http://en.wikipedia.org/wiki/Line_integral">line integrals</a> in several variable calculus.  Indeed, if <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X' title='X' class='latex' /> is an n-dimensional domain, then a vector field <img src='http://s0.wp.com/latex.php?latex=f+%3D+%28f_1%2C%5Cldots%2Cf_n%29%3A+X+%5Cto+%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='f = (f_1,&#92;ldots,f_n): X &#92;to {&#92;Bbb R}^n' title='f = (f_1,&#92;ldots,f_n): X &#92;to {&#92;Bbb R}^n' class='latex' /> induces a connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma_f&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_f' title='&#92;Gamma_f' class='latex' /> on the trivial line bundle <img src='http://s0.wp.com/latex.php?latex=%28%7B%5CBbb+R%7D%29_%7Bx+%5Cin+X%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='({&#92;Bbb R})_{x &#92;in X}' title='({&#92;Bbb R})_{x &#92;in X}' class='latex' /> by setting</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CGamma_f%28+x+%5Cmapsto+x%2Bdx+%29%3A+y+%5Cmapsto+y+%2B+f_1%28x%29+dx_1+%2B+%5Cldots+%2B+f_n%28x%29+dx_n.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_f( x &#92;mapsto x+dx ): y &#92;mapsto y + f_1(x) dx_1 + &#92;ldots + f_n(x) dx_n.' title='&#92;Gamma_f( x &#92;mapsto x+dx ): y &#92;mapsto y + f_1(x) dx_1 + &#92;ldots + f_n(x) dx_n.' class='latex' /></p>
<p>The integral <img src='http://s0.wp.com/latex.php?latex=%5CGamma_f%28%5Cgamma%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_f(&#92;gamma)' title='&#92;Gamma_f(&#92;gamma)' class='latex' /> of this connection along a curve <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' /> is then just the operation of translation by the line integral <img src='http://s0.wp.com/latex.php?latex=%5Cint_%5Cgamma+f+%5Ccdot+dx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;int_&#92;gamma f &#92;cdot dx' title='&#92;int_&#92;gamma f &#92;cdot dx' class='latex' /> in the real line.</p>
<p>Note that a gauge transformation in this context is just a vertical translation <img src='http://s0.wp.com/latex.php?latex=%28x%2Cy%29+%5Cmapsto+%28x%2Cy%2BV%28x%29%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(x,y) &#92;mapsto (x,y+V(x))' title='(x,y) &#92;mapsto (x,y+V(x))' class='latex' /> of the bundle <img src='http://s0.wp.com/latex.php?latex=%28%7B%5CBbb+R%7D%29_%7Bx+%5Cin+X%7D+%5Cequiv+X+%5Ctimes+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='({&#92;Bbb R})_{x &#92;in X} &#92;equiv X &#92;times {&#92;Bbb R}' title='({&#92;Bbb R})_{x &#92;in X} &#92;equiv X &#92;times {&#92;Bbb R}' class='latex' /> by some potential function <img src='http://s0.wp.com/latex.php?latex=V%3A+X+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V: X &#92;to {&#92;Bbb R}' title='V: X &#92;to {&#92;Bbb R}' class='latex' />, which we will assume to be smooth for sake of discussion.  This transformation conjugates the connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma_f&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_f' title='&#92;Gamma_f' class='latex' /> to the connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma_%7Bf+-+%5Cnabla+V%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_{f - &#92;nabla V}' title='&#92;Gamma_{f - &#92;nabla V}' class='latex' />.  Note that this is a <a href="http://en.wikipedia.org/wiki/Conservative_force">conservative</a> transformation: the integral of a connection along a closed loop is unchanged by gauge transformation. <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p><strong>Example 7.</strong> (ODE) A different way to generalise Example 5 can be obtained by using the <a href="http://en.wikipedia.org/wiki/Fundamental_theorem_of_calculus">fundamental theorem of calculus </a>to interpret <img src='http://s0.wp.com/latex.php?latex=%5Cint_%7B%5Ba%2Cb%5D%7D+f%28x%29%5C+dx&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;int_{[a,b]} f(x)&#92; dx' title='&#92;int_{[a,b]} f(x)&#92; dx' class='latex' /> as the final value <img src='http://s0.wp.com/latex.php?latex=u%28b%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u(b)' title='u(b)' class='latex' /> of the solution to the initial value problem</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=u%27%28t%29+%3D+f%28t%29%3B+%5Cquad+u%28a%29+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u&#039;(t) = f(t); &#92;quad u(a) = 0' title='u&#039;(t) = f(t); &#92;quad u(a) = 0' class='latex' /></p>
<p>for the ordinary differential equation <img src='http://s0.wp.com/latex.php?latex=u%27%3Df&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u&#039;=f' title='u&#039;=f' class='latex' />.  More generally, the solution u(b) to the initial value problem</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=u%27%28t%29+%3D+F%28+t%2C+u%28t%29+%29%3B+%5Cquad+u%28a%29+%3D+u_0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u&#039;(t) = F( t, u(t) ); &#92;quad u(a) = u_0' title='u&#039;(t) = F( t, u(t) ); &#92;quad u(a) = u_0' class='latex' /></p>
<p>for some <img src='http://s0.wp.com/latex.php?latex=u%3A+%5Ba%2Cb%5D+%5Cto+%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u: [a,b] &#92;to {&#92;Bbb R}^n' title='u: [a,b] &#92;to {&#92;Bbb R}^n' class='latex' /> taking values in some manifold Y, where <img src='http://s0.wp.com/latex.php?latex=F%3A+%5Ba%2Cb%5D+%5Ctimes+%7B%5CBbb+R%7D%5En+%5Cto+%7B%5CBbb+R%7D%5En&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F: [a,b] &#92;times {&#92;Bbb R}^n &#92;to {&#92;Bbb R}^n' title='F: [a,b] &#92;times {&#92;Bbb R}^n &#92;to {&#92;Bbb R}^n' class='latex' /> is a function (let us take it to be Lipschitz, to avoid technical issues), can also be interpreted as the integral of a connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> on the trivial vector space bundle <img src='http://s0.wp.com/latex.php?latex=%28%7B%5CBbb+R%7D%5En%29_%7Bt+%5Cin+%5Ba%2Cb%5D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='({&#92;Bbb R}^n)_{t &#92;in [a,b]}' title='({&#92;Bbb R}^n)_{t &#92;in [a,b]}' class='latex' />, defined by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5CGamma%28t+%5Cmapsto+t%2Bdt%29%3A+y+%5Cmapsto+y+%2B+F%28t%2Cy%29+dt.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma(t &#92;mapsto t+dt): y &#92;mapsto y + F(t,y) dt.' title='&#92;Gamma(t &#92;mapsto t+dt): y &#92;mapsto y + F(t,y) dt.' class='latex' /></p>
<p>Then <img src='http://s0.wp.com/latex.php?latex=%5CGamma%5Ba%2Cb%5D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma[a,b]' title='&#92;Gamma[a,b]' class='latex' /> will map <img src='http://s0.wp.com/latex.php?latex=u_0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u_0' title='u_0' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=u%28b%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='u(b)' title='u(b)' class='latex' />, this is nothing more than the <a href="http://en.wikipedia.org/wiki/Euler_method">Euler method</a> for solving ODE.   Note that the method of <a href="http://en.wikipedia.org/wiki/Integrating_factor">integrating factors</a> in solving ODE can be interpreted as an attempt to simplify the connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> via a gauge transformation.  Indeed, it can be profitable to view the entire theory of connections as a multidimensional &#8220;variable-coefficient&#8221; generalisation of the theory of ODE.  <img src='http://s0.wp.com/latex.php?latex=%5Cdiamond&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;diamond' title='&#92;diamond' class='latex' /></p>
<p>Once one selects a gauge, one can express a connection in terms of that gauge.  In the case of <a href="http://en.wikipedia.org/wiki/Vector_bundle">vector bundles</a> (in which every fibre is a d-dimensional vector space for some fixed d), the covariant derivative <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_v+w%28x%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_v w(x)' title='&#92;nabla_v w(x)' class='latex' /> of a section w of that bundle along some vector v emanating from x can be expressed in any given gauge by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cnabla_v+w%28x%29%5Ei+%3D+v%5E%5Calpha+%5Cpartial_%5Calpha+w%28x%29%5Ei+%2B+v%5E%5Calpha+%5CGamma_%7B%5Calpha+j%7D%5Ei+w%28x%29%5Ej&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_v w(x)^i = v^&#92;alpha &#92;partial_&#92;alpha w(x)^i + v^&#92;alpha &#92;Gamma_{&#92;alpha j}^i w(x)^j' title='&#92;nabla_v w(x)^i = v^&#92;alpha &#92;partial_&#92;alpha w(x)^i + v^&#92;alpha &#92;Gamma_{&#92;alpha j}^i w(x)^j' class='latex' /></p>
<p>where we use the gauge to express w(x) as a vector <img src='http://s0.wp.com/latex.php?latex=%28w%28x%29%5E1%2C%5Cldots%2Cw%28x%29%5Ed%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(w(x)^1,&#92;ldots,w(x)^d)' title='(w(x)^1,&#92;ldots,w(x)^d)' class='latex' />, the indices <img src='http://s0.wp.com/latex.php?latex=i%2C+j+%3D+1%2C%5Cldots%2Cd&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='i, j = 1,&#92;ldots,d' title='i, j = 1,&#92;ldots,d' class='latex' /> are summed over the fibre dimensions (and <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> summed over the base dimensions) as per the <a href="http://en.wikipedia.org/wiki/Einstein_notation">usual conventions</a>, and the <img src='http://s0.wp.com/latex.php?latex=%5CGamma_%7B%5Calpha+j%7D%5Ei+%3A%3D+%28%5Cnabla_%7Be_%5Calpha%7D+e_j%29%5Ei&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_{&#92;alpha j}^i := (&#92;nabla_{e_&#92;alpha} e_j)^i' title='&#92;Gamma_{&#92;alpha j}^i := (&#92;nabla_{e_&#92;alpha} e_j)^i' class='latex' /> are the <a href="http://en.wikipedia.org/wiki/Christoffel_symbols">Christoffel symbols</a> of this connection relative to this gauge.</p>
<p>One example of this, which models <a href="http://en.wikipedia.org/wiki/Electromagnetism">electromagnetism</a>, is a connection on a <a href="http://en.wikipedia.org/wiki/Complex_line_bundle">complex line bundle</a> <img src='http://s0.wp.com/latex.php?latex=V+%3D+%28V_%7Bt%2Cx%7D%29_%7B%28t%2Cx%29+%5Cin+%7B%5CBbb+R%7D%5E%7B1%2B3%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V = (V_{t,x})_{(t,x) &#92;in {&#92;Bbb R}^{1+3}}' title='V = (V_{t,x})_{(t,x) &#92;in {&#92;Bbb R}^{1+3}}' class='latex' /> in <a href="http://en.wikipedia.org/wiki/Spacetime">spacetime</a> <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D%5E%7B1%2B3%7D+%3D+%5C%7B+%28t%2Cx%29%3A+t+%5Cin+%7B%5CBbb+R%7D%2C+x+%5Cin+%7B%5CBbb+R%7D%5E3+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}^{1+3} = &#92;{ (t,x): t &#92;in {&#92;Bbb R}, x &#92;in {&#92;Bbb R}^3 &#92;}' title='{&#92;Bbb R}^{1+3} = &#92;{ (t,x): t &#92;in {&#92;Bbb R}, x &#92;in {&#92;Bbb R}^3 &#92;}' class='latex' />.  Such a bundle assigns a complex line <img src='http://s0.wp.com/latex.php?latex=V_%7Bt%2Cx%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V_{t,x}' title='V_{t,x}' class='latex' /> (i.e. a one-dimensional complex vector space, and thus isomorphic to <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb C}' title='{&#92;Bbb C}' class='latex' />) to every point <img src='http://s0.wp.com/latex.php?latex=%28t%2Cx%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(t,x)' title='(t,x)' class='latex' /> in spacetime.  The structure group here is U(1) (strictly speaking, this means that we view the fibres as <em>normed</em> one-dimensional complex vector spaces, otherwise the structure group would be <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+C%7D%5E%5Ctimes&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb C}^&#92;times' title='{&#92;Bbb C}^&#92;times' class='latex' />). A gauge identifies V with the trivial complex line bundle <img src='http://s0.wp.com/latex.php?latex=%28%7B%5CBbb+C%7D%29_%7B%28t%2Cx%29+%5Cin+%7B%5CBbb+R%7D%5E%7B1%2B3%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='({&#92;Bbb C})_{(t,x) &#92;in {&#92;Bbb R}^{1+3}}' title='({&#92;Bbb C})_{(t,x) &#92;in {&#92;Bbb R}^{1+3}}' class='latex' />, thus converting sections <img src='http://s0.wp.com/latex.php?latex=%28w_%7Bt%2Cx%7D%29_%7B%28t%2Cx%29+%5Cin+%7B%5CBbb+R%7D%5E%7B1%2B3%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(w_{t,x})_{(t,x) &#92;in {&#92;Bbb R}^{1+3}}' title='(w_{t,x})_{(t,x) &#92;in {&#92;Bbb R}^{1+3}}' class='latex' /> of this bundle into complex-valued functions <img src='http://s0.wp.com/latex.php?latex=%5Cphi%3A+%7B%5CBbb+R%7D%5E%7B1%2B3%7D+%5Cto+%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;phi: {&#92;Bbb R}^{1+3} &#92;to {&#92;Bbb C}' title='&#92;phi: {&#92;Bbb R}^{1+3} &#92;to {&#92;Bbb C}' class='latex' />.  A connection on V, when described in this gauge, can be given in terms of fields <img src='http://s0.wp.com/latex.php?latex=A_%5Calpha%3A+%7B%5CBbb+R%7D%5E%7B1%2B3%7D+%5Cto+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_&#92;alpha: {&#92;Bbb R}^{1+3} &#92;to {&#92;Bbb R}' title='A_&#92;alpha: {&#92;Bbb R}^{1+3} &#92;to {&#92;Bbb R}' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%5Calpha+%3D+0%2C1%2C2%2C3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha = 0,1,2,3' title='&#92;alpha = 0,1,2,3' class='latex' />; the covariant derivative of a section in this gauge is then given by the formula</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cnabla_%5Calpha+%5Cphi+%3A%3D+%5Cpartial_%5Calpha+%5Cphi+%2B+i+A_%5Calpha+%5Cphi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_&#92;alpha &#92;phi := &#92;partial_&#92;alpha &#92;phi + i A_&#92;alpha &#92;phi' title='&#92;nabla_&#92;alpha &#92;phi := &#92;partial_&#92;alpha &#92;phi + i A_&#92;alpha &#92;phi' class='latex' />.</p>
<p>In the theory of electromagnetism, <img src='http://s0.wp.com/latex.php?latex=A_0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_0' title='A_0' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%28A_1%2CA_2%2CA_3%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(A_1,A_2,A_3)' title='(A_1,A_2,A_3)' class='latex' /> are known (up to some normalising constants) as the <a href="http://en.wikipedia.org/wiki/Magnetic_vector_potential">electric potential</a> and <a href="http://en.wikipedia.org/wiki/Magnetic_vector_potential">magnetic potential</a> respectively.  Sections of V do not show up directly in Maxwell&#8217;s equations of electromagnetism, but appear in more complicated variants of these equations, such as the <a class="snap_noshots" href="http://tosio.math.toronto.edu/wiki/index.php/Maxwell-Klein-Gordon_equations">Maxwell-Klein-Gordon equation</a>.</p>
<p>A gauge transformation of V is given by a map <img src='http://s0.wp.com/latex.php?latex=U%3A+%7B%5CBbb+R%7D%5E%7B1%2B3%7D+%5Cto+S%5E1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U: {&#92;Bbb R}^{1+3} &#92;to S^1' title='U: {&#92;Bbb R}^{1+3} &#92;to S^1' class='latex' />; it transforms sections by the formula <img src='http://s0.wp.com/latex.php?latex=%5Cphi+%5Cmapsto+U%5E%7B-1%7D+%5Cphi&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;phi &#92;mapsto U^{-1} &#92;phi' title='&#92;phi &#92;mapsto U^{-1} &#92;phi' class='latex' />, and connections by the formula <img src='http://s0.wp.com/latex.php?latex=%5Cnabla_%5Calpha+%5Cmapsto+U%5E%7B-1%7D+%5Cnabla_%5Calpha+U&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla_&#92;alpha &#92;mapsto U^{-1} &#92;nabla_&#92;alpha U' title='&#92;nabla_&#92;alpha &#92;mapsto U^{-1} &#92;nabla_&#92;alpha U' class='latex' />, or equivalently</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=A_%5Calpha+%5Cmapsto+A_%5Calpha+%2B+%5Cfrac%7B1%7D%7Bi%7D+U%5E%7B-1%7D+%5Cpartial_%5Calpha+U+%3D+A_%5Calpha+%2B+%5Cpartial_%5Calpha+%5Cfrac%7B1%7D%7Bi%7D+%5Clog+U&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_&#92;alpha &#92;mapsto A_&#92;alpha + &#92;frac{1}{i} U^{-1} &#92;partial_&#92;alpha U = A_&#92;alpha + &#92;partial_&#92;alpha &#92;frac{1}{i} &#92;log U' title='A_&#92;alpha &#92;mapsto A_&#92;alpha + &#92;frac{1}{i} U^{-1} &#92;partial_&#92;alpha U = A_&#92;alpha + &#92;partial_&#92;alpha &#92;frac{1}{i} &#92;log U' class='latex' />.   (2)</p>
<p>In particular, the electromagnetic potential <img src='http://s0.wp.com/latex.php?latex=A_%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_&#92;alpha' title='A_&#92;alpha' class='latex' /> is not gauge invariant (which broadly corresponds to the concept of being <em>nonphysical</em> or <em>nonmeasurable</em> in physics), as gauge symmetry allows one to add an arbitrary gradient function to this potential.  However, the <a href="http://en.wikipedia.org/wiki/Curvature_form">curvature tensor</a></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=F_%7B%5Calpha+%5Cbeta%7D+%3A%3D+%5B%5Cnabla_%5Calpha%2C+%5Cnabla_%5Cbeta%5D+%3D+%5Cpartial_%5Calpha+A_%5Cbeta+-+%5Cpartial_%5Cbeta+A_%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_{&#92;alpha &#92;beta} := [&#92;nabla_&#92;alpha, &#92;nabla_&#92;beta] = &#92;partial_&#92;alpha A_&#92;beta - &#92;partial_&#92;beta A_&#92;alpha' title='F_{&#92;alpha &#92;beta} := [&#92;nabla_&#92;alpha, &#92;nabla_&#92;beta] = &#92;partial_&#92;alpha A_&#92;beta - &#92;partial_&#92;beta A_&#92;alpha' class='latex' /></p>
<p>of the connection is gauge-invariant, and physically measurable in electromagnetism; the components <img src='http://s0.wp.com/latex.php?latex=F_%7B0i%7D+%3D+-F_%7Bi0%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_{0i} = -F_{i0}' title='F_{0i} = -F_{i0}' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=i%3D1%2C2%2C3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='i=1,2,3' title='i=1,2,3' class='latex' /> of this field have a physical interpretation as the <a href="http://en.wikipedia.org/wiki/Electric_field">electric field</a>, and the components <img src='http://s0.wp.com/latex.php?latex=F_%7Bij%7D+%3D+-F_%7Bji%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F_{ij} = -F_{ji}' title='F_{ij} = -F_{ji}' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=1+%5Cleq+i+%26%2360%3B+j+%5Cleq+3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 &#92;leq i &lt; j &#92;leq 3' title='1 &#92;leq i &lt; j &#92;leq 3' class='latex' /> have a physical interpretation as the <a href="http://en.wikipedia.org/wiki/Magnetic_field">magnetic field</a>.  (The curvature tensor <img src='http://s0.wp.com/latex.php?latex=F&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='F' title='F' class='latex' /> can be interpreted as describing the parallel transport of infinitesimal rectangles; it measures how far off the connection is from being <em>flat</em>, which means that it can be (locally) &#8220;straightened&#8221; via some choice of gauge to be the trivial connection.  In nonabelian gauge theories, in which the structure group is more complicated than just the abelian group U(1), the curvature tensor is non-scalar, but remains gauge-invariant in a tensor sense (gauge transformations will transform the curvature as they would transform a <a href="http://en.wikipedia.org/wiki/Tensor">tensor</a> of the same rank).</p>
<p>Gauge theories can often be expressed succinctly in terms of a connection and its curvatures.  For instance, <a href="http://en.wikipedia.org/wiki/Maxwell%27s_equations">Maxwell&#8217;s equations</a> in free space, which describes how electromagnetic radiation propagates in the presence of charges and currents (but no media other than vacuum), can be written (after normalising away some physical constants) as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cpartial%5E%5Calpha+F_%7B%5Calpha+%5Cbeta%7D+%3D+J_%5Cbeta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;partial^&#92;alpha F_{&#92;alpha &#92;beta} = J_&#92;beta' title='&#92;partial^&#92;alpha F_{&#92;alpha &#92;beta} = J_&#92;beta' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=J_%5Cbeta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='J_&#92;beta' title='J_&#92;beta' class='latex' /> is the <a href="http://en.wikipedia.org/wiki/4-current">4-current</a>.  (Actually, this is only half of Maxwell&#8217;s equations, but the other half are a consequence of the interpretation (*) of the electromagnetic field as a curvature of a U(1) connection.  Thus this purely geometric interpretation of electromagnetism has some non-trivial physical implications, for instance ruling out the possibility of (classical) <a href="http://en.wikipedia.org/wiki/Magnetic_monopole">magnetic monopoles</a>.)  If one generalises from complex line bundles to higher-dimensional vector bundles (with a larger structure group), one can then write down the (classical) <a href="http://en.wikipedia.org/wiki/Yang-Mills_equation">Yang-Mills equation</a></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cnabla%5E%5Calpha+F_%7B%5Calpha+%5Cbeta%7D+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;nabla^&#92;alpha F_{&#92;alpha &#92;beta} = 0' title='&#92;nabla^&#92;alpha F_{&#92;alpha &#92;beta} = 0' class='latex' /></p>
<p>which is the classical model for three of the four fundamental forces in physics: the electromagnetic, weak, and strong nuclear forces (with structure groups U(1), SU(2), and SU(3) respectively).  (The classical model for the fourth force, gravitation, is given by a somewhat different geometric equation, namely the <a href="http://en.wikipedia.org/wiki/Einstein_equations">Einstein equations</a> <img src='http://s0.wp.com/latex.php?latex=G_%7B%5Calpha+%5Cbeta%7D+%3D+8+%5Cpi+T_%7B%5Calpha+%5Cbeta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G_{&#92;alpha &#92;beta} = 8 &#92;pi T_{&#92;alpha &#92;beta}' title='G_{&#92;alpha &#92;beta} = 8 &#92;pi T_{&#92;alpha &#92;beta}' class='latex' />, though this equation is also &#8220;gauge-invariant&#8221; in some sense.)</p>
<p>The gauge invariance (or gauge freedom) inherent in these equations complicates their analysis.  For instance, due to the gauge freedom (2), Maxwell&#8217;s equations, when viewed in terms of the electromagnetic potential <img src='http://s0.wp.com/latex.php?latex=A_%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_&#92;alpha' title='A_&#92;alpha' class='latex' />, are ill-posed: specifying the initial value of this potential at time zero does not uniquely specify the future value of this potential (even if one also specifies any number of additional time derivatives of this potential at time zero), since one can use (2) with a gauge function U that is trivial at time zero but non-trivial at some future time to demonstrate the non-uniqueness.  Thus, in order to use standard PDE methods to solve these equations, it is necessary to first fix the gauge to a sufficient extent that it eliminates this sort of ambiguity.  If one were in a one-dimensional situation (as opposed to the four-dimensional situation of spacetime), with a trivial topology (i.e. the domain is a line rather than a circle), then it is possible to gauge transform the connection to be completely trivial, for reasons generalising both the <a href="http://en.wikipedia.org/wiki/Fundamental_theorem_of_calculus">fundamental theorem of calculus</a> and the <a href="http://en.wikipedia.org/wiki/Picard%E2%80%93Lindel%C3%B6f_theorem">fundamental theorem of ODEs</a>.  (Indeed, to trivialise a connection <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> on a line <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}' title='{&#92;Bbb R}' class='latex' />, one can pick an arbitrary origin <img src='http://s0.wp.com/latex.php?latex=t_0+%5Cin+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='t_0 &#92;in {&#92;Bbb R}' title='t_0 &#92;in {&#92;Bbb R}' class='latex' /> and gauge transform each point <img src='http://s0.wp.com/latex.php?latex=t+%5Cin+%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='t &#92;in {&#92;Bbb R}' title='t &#92;in {&#92;Bbb R}' class='latex' /> by <img src='http://s0.wp.com/latex.php?latex=%5CGamma%28%5Bt_0%2Ct%5D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma([t_0,t])' title='&#92;Gamma([t_0,t])' class='latex' />.)  However, in higher dimensions, one cannot hope to completely trivialise a connection by gauge transforms (mainly because of the possibility of a non-zero curvature form); in general, one cannot hope to do much better than setting a single component of the connection to equal zero.  For instance, for Maxwell&#8217;s equations (or the Yang-Mills equations), one can trivialise the connection <img src='http://s0.wp.com/latex.php?latex=A_%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_&#92;alpha' title='A_&#92;alpha' class='latex' /> in the time direction, leading to the <em>temporal gauge condition</em></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=A_0+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_0 = 0' title='A_0 = 0' class='latex' />.</p>
<p>This gauge is indeed useful for providing an easy proof of local existence for these equations, at least for smooth initial data.  But there are many other useful gauges also that one can fix; for instance one has the <a href="http://en.wikipedia.org/wiki/Lorenz_gauge">Lorenz gauge</a></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cpartial%5E%5Calpha+A_%5Calpha+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;partial^&#92;alpha A_&#92;alpha = 0' title='&#92;partial^&#92;alpha A_&#92;alpha = 0' class='latex' /></p>
<p>which has the nice property of being <a href="http://en.wikipedia.org/wiki/Lorentz_invariance">Lorentz-invariant</a>, and transforms the Maxwell or Yang-Mills equations into linear or nonlinear wave equations respectively.  Another important gauge is the <a href="http://en.wikipedia.org/wiki/Coulomb_gauge#Coulomb_gauge">Coulomb gauge</a></p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cpartial_i+A_i+%3D+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;partial_i A_i = 0' title='&#92;partial_i A_i = 0' class='latex' /></p>
<p>where i only ranges over spatial indices 1,2,3 rather than over spacetime indices 0,1,2,3.  This gauge has an elliptic variational formulation (Coulomb gauges are critical points of the functional <img src='http://s0.wp.com/latex.php?latex=%5Cint_%7B%7B%5CBbb+R%7D%5E3%7D+%5Csum_%7Bi%3D1%7D%5E3+%26%23124%3BA_i%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;int_{{&#92;Bbb R}^3} &#92;sum_{i=1}^3 &#124;A_i&#124;^2' title='&#92;int_{{&#92;Bbb R}^3} &#92;sum_{i=1}^3 &#124;A_i&#124;^2' class='latex' />) and thus are expected to be &#8220;smaller&#8221; and &#8220;smoother&#8221; than many other gauges; this intuition can be borne out by standard elliptic theory (or <a href="http://en.wikipedia.org/wiki/Hodge_theory">Hodge theory</a>, in the case of Maxwell&#8217;s equations).  In some cases, the correct selection of a gauge is crucial in order to establish basic properties of the underlying equation, such as local existence.  For instance, the simplest proof of local existence of the Einstein equations uses a <a href="http://en.wikipedia.org/wiki/Harmonic_coordinates">harmonic gauge</a>, which is analogous to the Lorenz gauge mentioned earlier; the simplest proof of local existence of Ricci flow uses a gauge of de Turck that is also related to harmonic maps (see e.g. <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/03/28/285g-lecture-1-ricci-flow/">my lecture notes</a>); and in my own work on wave maps, a certain &#8220;caloric gauge&#8221; based on harmonic map heat flow is crucial (see e.g. <a class="snap_noshots" href="http://terrytao.wordpress.com/2008/08/06/global-regularity-of-wave-maps-v-large-data-local-wellposedness-in-the-energy-class/">this post</a> of mine).  But in many situations, it is not yet fully understood whether the use of the correct choice of gauge is a mere technical convenience, or is more innate to the equation.  It is definitely conceivable, for instance, that a given gauge field equation is well-posed with one choice of gauge but ill-posed with another.  It would also be desirable to have a more gauge-invariant theory of PDEs that did not rely so heavily on gauge theory at all, but this seems to be rather difficult; many of our most powerful tools in PDE (for instance, the Fourier transform) are highly non-gauge-invariant, which makes it very inconvenient to try to analyse these equations in a purely gauge-invariant setting.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[The correspondence principle and finitary ergodic theory]]></title>
<link>http://terrytao.wordpress.com/2008/08/30/the-correspondence-principle-and-finitary-ergodic-theory/</link>
<pubDate>Sat, 30 Aug 2008 16:43:32 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2008/08/30/the-correspondence-principle-and-finitary-ergodic-theory/</guid>
<description><![CDATA[This month I am at MSRI, for the programs of Ergodic Theory and Additive Combinatorics, and Analysis]]></description>
<content:encoded><![CDATA[<p>This month I am at <a class="snap_noshots" href="http://www.msri.org">MSRI</a>, for the programs of <a class="snap_noshots" href="http://www.msri.org/calendar/programs/ProgramInfo/252/show_program">Ergodic Theory and Additive Combinatorics</a>, and <a class="snap_noshots" href="http://www.msri.org/calendar/programs/ProgramInfo/250/show_program">Analysis on Singular Spaces</a>, that are currently ongoing here.  This week I am giving <a class="snap_noshots" href="http://www.msri.org/calendar/workshops/ScheduleInfo/895/show_abstract">three lectures</a> on the correspondence principle, and on finitary versions of ergodic theory, for the <a class="snap_noshots" href="http://www.msri.org/calendar/workshops/WorkshopInfo/439/show_workshop">introductory workshop in the former program</a>.  The article here is broadly describing the content of these talks (which are slightly different in theme from that announced in the <a class="snap_noshots" href="http://www.msri.org/calendar/workshops/ScheduleInfo/895/show_abstract">abstract</a>, due to some recent developments).  [These lectures were also recorded on video and should be available from the MSRI web site within a few months.]</p>
<p><!--more--></p>
<p>My lectures are devoted to the <em>correspondence principle</em> between finite dynamical systems and infinite dynamical systems, that allows one to convert certain statements about the former to logically equivalent statements about the latter.  (I will be vague here about what &#8220;dynamical system&#8221; means; very broadly, just about anything with a group action could qualify here.)  A little more specifically, the correspondence principle equates four types of results, which we informally describe as follows:</p>
<ol>
<li>Local quantitative results for concrete finite systems.</li>
<li>Local qualitative results for concrete infinite systems.</li>
<li>Continuous quantitative results for abstract finite systems.</li>
<li>Continuous qualitative results for abstract infinite systems.</li>
</ol>
<p>(The meaning of these terms should become clearer once we give some specific examples.)</p>
<p>There are many contexts in which this principle shows up (e.g. in <a href="http://en.wikipedia.org/wiki/Ramsey_theory">Ramsey theory</a>, recurrence theory, graph theory, group theory, etc.) but the basic ingredients are always the same.  Namely, the correspondence between Type 1 and Type 2 (or Type 3 and Type 4) arises from a <em>weak sequential compactness property</em>, which, roughly speaking asserts that given any sequence of (increasingly large) finite systems, there exists a subsequence of such systems which converges (in a suitably &#8220;weak&#8221; sense) to an infinite system.  [We will define these terms more precisely in concrete situations later.]  More informally, any &#8220;sufficiently large&#8221; finite system can be &#8220;approximated&#8221; in some weak sense by an infinite system.   (One can make this informal statement more rigorous using <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/06/25/ultrafilters-nonstandard-analysis-and-epsilon-management/">nonstandard analysis and/or ultrafilters</a>, but we will not take such an approach here.)  Because of this, we obtain a <em>correspondence principle</em>: any <em>qualitative</em> statement about infinite systems (e.g. that a certain quantity is always strictly positive) is equivalent to a <em>quantitative </em>statement about sufficiently large finite systems (e.g. a certain quantity is always uniformly bounded from below).  This principle forms a crucial bridge between finitary (or quantitative) mathematics and infinitary (or qualitative) mathematics; in particular, by taking advantage of this principle, tools from one type of mathematics can be used to prove results in the other.  (See my previous <a class="snap_noshots" href="http://terrytao.wordpress.com/2007/05/23/soft-analysis-hard-analysis-and-the-finite-convergence-principle/">post on soft analysis and hard analysis</a> for further discussion.)</p>
<p>In addition to the use of compactness, the other key pillar of the correspondence principle is that of <em>abstraction</em> &#8211; the ability to generalise from a concrete system (on a very explicit space, e.g. the infinite cube <img src='http://s0.wp.com/latex.php?latex=%5C%7B0%2C1%5C%7D%5E%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{0,1&#92;}^{&#92;Bbb Z}' title='&#92;{0,1&#92;}^{&#92;Bbb Z}' class='latex' />) to an more general abstract setting (e.g. an abstract dynamical system, measure-preserving system, group, etc.)  One of the reasons for doing this is that there are various maneuvres one can do in the abstract setting (e.g. passing from a system to a subsystem, a factor, or an extension, or by reasoning by analogy from other special systems that are different from the original concrete system) which can be quite difficult to execute or motivate if one stays within the confines of a single concrete setting.</p>
<p>We now turn to several specific examples of this principle in various contexts.  We begin with the more &#8220;combinatorial&#8221; or &#8220;non-ergodic theoretical&#8221; instances of this principle, in which there is no underlying probability measure involved; these situations are simpler than the ergodic-theoretic ones, but already illustrate many of the key features of this principle in action.</p>
<p><strong>1.  The correspondence principle in Ramsey theory</strong></p>
<p>We begin with the classical correspondence principle that connects Ramsey results about finite colourings, to Ramsey results about infinite colourings, or (equivalently) about the topological dynamics of covers of open sets.  We illustrate this by demonstrating the equivalence of three statements. The first two are as follows:</p>
<blockquote><p><strong>Theorem 1.</strong> (<a href="http://en.wikipedia.org/wiki/Van_der_Waerden%27s_theorem">van der Waerden theorem</a>, Type 2 formulation)  Suppose the integers are coloured by finitely many colours.  Then there exist arbitrarily long monochromatic arithmetic progressions.</p></blockquote>
<blockquote><p><strong>Theorem 2.</strong> (van der Waerden theorem, Type 1 formulation) For every c and k there exists N such that whenever <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> is coloured by c colours, there exists a monochromatic arithmetic progression of length k.</p></blockquote>
<p>It is easy to see that Theorem 2 implies Theorem 1.  Conversely, to deduce Theorem 2 from Theorem 1 we use the correspondence principle as follows.  Assume for contradiction that Theorem 1 was true, but Theorem 2 was false. Untangling the quantifiers, this means that there exist positive integers k, c, a sequence <img src='http://s0.wp.com/latex.php?latex=N_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N_n' title='N_n' class='latex' /> going to infinity, and colourings <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN_n%5C%7D+%3D+A_%7Bn%2C1%7D+%5Ccup+%5Cldots+%5Ccup+A_%7Bn%2Cc%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N_n&#92;} = A_{n,1} &#92;cup &#92;ldots &#92;cup A_{n,c}' title='&#92;{1,&#92;ldots,N_n&#92;} = A_{n,1} &#92;cup &#92;ldots &#92;cup A_{n,c}' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN_n%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N_n&#92;}' title='&#92;{1,&#92;ldots,N_n&#92;}' class='latex' /> into c colours, none of which contain any monochromatic arithmetic progressions of length k.</p>
<p>By shifting the sets <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN_n%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N_n&#92;}' title='&#92;{1,&#92;ldots,N_n&#92;}' class='latex' /> and redefining <img src='http://s0.wp.com/latex.php?latex=N_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N_n' title='N_n' class='latex' /> a little, we can replace <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN_n%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N_n&#92;}' title='&#92;{1,&#92;ldots,N_n&#92;}' class='latex' /> by <img src='http://s0.wp.com/latex.php?latex=%5C%7B-N_n%2C%5Cldots%2CN_n%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{-N_n,&#92;ldots,N_n&#92;}' title='&#92;{-N_n,&#92;ldots,N_n&#92;}' class='latex' />.  This sequence of colourings on the increasingly large sets <img src='http://s0.wp.com/latex.php?latex=%5C%7B-N_n%2C%5Cldots%2CN_n%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{-N_n,&#92;ldots,N_n&#92;}' title='&#92;{-N_n,&#92;ldots,N_n&#92;}' class='latex' />.  One can now extract a subsequence of such colourings on finite sets of integers that converge <em>pointwise</em> or <em>weakly</em> to a colouring on the whole set of integers by the usual &#8220;<a href="http://en.wikipedia.org/wiki/Arzela-Ascoli_theorem">Arzelà-Ascoli</a> diagonalisation trick&#8221;.   Indeed, by passing to an initial subsequence (and using the infinite pigeonhole principle), one can ensure that all of these colourings eventually become a constant colour at 0; refining to another subsequence, we can ensure it is a constant colour at 1; then at -1, 2, -2, and so forth.  Taking a diagonal subsequence of these sequences, we obtain a final subsequence of finite colourings that converges pointwise to an infinite limit colouring.  By Theorem 1, this limit colouring contains an monochromatic arithmetic progression of length k.  Now note that the property of being monochromatic at this progression is a <em>local</em> one: one only needs to inspect the colour of finitely many of the integers in order to verify this property.  Because of this, this property of the infinite limit colouring will also be shared by the finite colourings that are sufficiently far along the converging sequence.  But we assumed at the very beginning that none of these finite colourings have a monochromatic arithmetic progression of length k, a contradiction, and the claim follows.</p>
<p>The above argument has all the basic ingredients of the correspondence principle in action: a proof by contradiction, use of weak compactness to extract an infinite limiting object, application of the infinitary result to that object, and checking that the conclusion of that result is sufficiently &#8220;finitary&#8221;, &#8220;local&#8221;, or &#8220;continuous&#8221; that it extends back to some of the finitary sequence, leading to the desired contradiction.  [It is essential that one manages to reduce to purely local properties before passing from the converging sequence to the limit, or vice versa, since non-local properties are usually not preserved by the limit. For instance, consider the colouring of <img src='http://s0.wp.com/latex.php?latex=%5C%7B-N%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{-N,&#92;ldots,N&#92;}' title='&#92;{-N,&#92;ldots,N&#92;}' class='latex' /> which colours every integer between -N/2 and N/2 blue, and all the rest red.  Then this converges weakly to the all-blue colouring, and clearly the (non-local) property of containing at least one red element is not preserved by the limit.]</p>
<p>A key disadvantage of the use of the correspondence principle, though, is that it is quite difficult to extract specific quantitative bounds from any argument that uses this principle; for instance, while one can eventually &#8220;proof mine&#8221; the above argument (combined with some standard proof of Theorem 1) to eventually get a bound on N in terms of k and d, such a bound is extremely poor (of <a href="http://en.wikipedia.org/wiki/Ackermann_function">Ackermann function</a> type).</p>
<p>Theorem 1 can be reformulated in a more abstract form:</p>
<blockquote><p><strong>Theorem 3.</strong> (van der Waerden theorem, Type 4 version)  Let X be a compact space, let <img src='http://s0.wp.com/latex.php?latex=T%3A+X+%5Cto+X&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T: X &#92;to X' title='T: X &#92;to X' class='latex' /> be a <a href="http://en.wikipedia.org/wiki/Homeomorphism">homeomorphism</a>, and let <img src='http://s0.wp.com/latex.php?latex=%28V_%5Calpha%29_%7B%5Calpha+%5Cin+A%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(V_&#92;alpha)_{&#92;alpha &#92;in A}' title='(V_&#92;alpha)_{&#92;alpha &#92;in A}' class='latex' /> be an open cover of X.  Then for any k there exists a positive integer n and an open set <img src='http://s0.wp.com/latex.php?latex=V_%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V_&#92;alpha' title='V_&#92;alpha' class='latex' /> in the cover such that <img src='http://s0.wp.com/latex.php?latex=V_%5Calpha+%5Ccap+T%5E%7B-n%7D+V_%5Calpha+%5Ccap+%5Cldots+T%5E%7B-%28k-1%29n%7D+V_%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V_&#92;alpha &#92;cap T^{-n} V_&#92;alpha &#92;cap &#92;ldots T^{-(k-1)n} V_&#92;alpha' title='V_&#92;alpha &#92;cap T^{-n} V_&#92;alpha &#92;cap &#92;ldots T^{-(k-1)n} V_&#92;alpha' class='latex' /> is non-empty.</p></blockquote>
<p>The deduction of Theorem 3 from Theorem 1 is easy, after using the compactness to refine the open cover to a finite subcover, picking a point <img src='http://s0.wp.com/latex.php?latex=x_0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_0' title='x_0' class='latex' /> in X, and then colouring each integer n by the index <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> of the first open set <img src='http://s0.wp.com/latex.php?latex=V_%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V_&#92;alpha' title='V_&#92;alpha' class='latex' /> that contains <img src='http://s0.wp.com/latex.php?latex=T%5En+x_0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T^n x_0' title='T^n x_0' class='latex' />. The converse deduction of Theorem 1 from Theorem 3 is the one which shows the &#8220;dynamical&#8221; aspect of this theorem: we can encode a colouring <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D+%3D+A_1+%5Ccup+%5Cldots+%5Ccup+A_c&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z} = A_1 &#92;cup &#92;ldots &#92;cup A_c' title='{&#92;Bbb Z} = A_1 &#92;cup &#92;ldots &#92;cup A_c' class='latex' /> of the integers as a point <img src='http://s0.wp.com/latex.php?latex=x_0+%3A%3D+%28c_n%29_%7Bn+%5Cin+%7B%5CBbb+Z%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_0 := (c_n)_{n &#92;in {&#92;Bbb Z}}' title='x_0 := (c_n)_{n &#92;in {&#92;Bbb Z}}' class='latex' /> in the infinite product space <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2Cc%5C%7D%5E%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,c&#92;}^{&#92;Bbb Z}' title='&#92;{1,&#92;ldots,c&#92;}^{&#92;Bbb Z}' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=c_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c_n' title='c_n' class='latex' /> is the unique class such that <img src='http://s0.wp.com/latex.php?latex=n+%5Cin+A_%7Bc_n%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='n &#92;in A_{c_n}' title='n &#92;in A_{c_n}' class='latex' /> (indeed, one can think of this product space as the space of all c-colourings of the integers).  The infinite product space is compact with the <a href="http://en.wikipedia.org/wiki/Product_topology">product</a> (or weak) topology used earlier, thus a sequence of colourings converge to a limit iff they converge locally (or pointwise).  This space also comes with the standard shift <img src='http://s0.wp.com/latex.php?latex=T%3A+%28x_n%29_%7Bn+%5Cin+%7B%5CBbb+Z%7D%7D+%5Cto+%28x_%7Bn-1%7D%29_%7Bn+%5Cin+%7B%5CBbb+Z%7D%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='T: (x_n)_{n &#92;in {&#92;Bbb Z}} &#92;to (x_{n-1})_{n &#92;in {&#92;Bbb Z}}' title='T: (x_n)_{n &#92;in {&#92;Bbb Z}} &#92;to (x_{n-1})_{n &#92;in {&#92;Bbb Z}}' class='latex' /> (corresponding to right shift on the space of colourings).  If we let X be the closure of the orbit <img src='http://s0.wp.com/latex.php?latex=%5C%7B+T%5En+x_0%3A+n+%5Cin+%7B%5CBbb+Z%7D+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ T^n x_0: n &#92;in {&#92;Bbb Z} &#92;}' title='&#92;{ T^n x_0: n &#92;in {&#92;Bbb Z} &#92;}' class='latex' />, and let <img src='http://s0.wp.com/latex.php?latex=V_1%2C%5Cldots%2CV_c&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V_1,&#92;ldots,V_c' title='V_1,&#92;ldots,V_c' class='latex' /> be the open cover <img src='http://s0.wp.com/latex.php?latex=V_i+%3A%3D+%5C%7B+%28x_n%29_%7Bn+%5Cin+%7B%5CBbb+Z%7D%7D%3A+x_0+%3D+i+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='V_i := &#92;{ (x_n)_{n &#92;in {&#92;Bbb Z}}: x_0 = i &#92;}' title='V_i := &#92;{ (x_n)_{n &#92;in {&#92;Bbb Z}}: x_0 = i &#92;}' class='latex' />, it is straightforward to show that Theorem 3 implies Theorem 1.</p>
<p><strong>2.  The correspondence principle for finitely generated groups</strong></p>
<p>The combinatorial correspondence used above for colourings can also be applied to other situations, such as that of finitely generated groups.  Recall that if G is a group generated by a finite set S, we say that G has polynomial growth if there exists constants K, d such that the ball <img src='http://s0.wp.com/latex.php?latex=B_r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_r' title='B_r' class='latex' /> of radius r (i.e. the set of words in S of length at most r) has cardinality at most <img src='http://s0.wp.com/latex.php?latex=Kr%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Kr^d' title='Kr^d' class='latex' />.  Such groups were classified by a well-known <a href="http://en.wikipedia.org/wiki/Gromov%27s_theorem_on_groups_of_polynomial_growth">theorem of Gromov</a>:</p>
<blockquote><p><strong>Theorem 4.</strong> (Gromov&#8217;s theorem on polynomial growth, Type 4 version) Let G be a finitely generated group of polynomial growth.  Then G is <a href="http://en.wikipedia.org/wiki/Virtually">virtually</a> <a href="http://en.wikipedia.org/wiki/Nilpotent_group">nilpotent</a> (i.e. it has a finite index subgroup that is nilpotent).</p></blockquote>
<p>As observed in Gromov&#8217;s original paper, this result is equivalent to a finitary version:</p>
<blockquote><p><strong>Theorem 5. </strong> (Gromov&#8217;s theorem on polynomial growth, Type 3)  For every integers s, K, d, there exists R such that any finitely generated group with s generators, such that <img src='http://s0.wp.com/latex.php?latex=B_r&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_r' title='B_r' class='latex' /> has cardinality at most <img src='http://s0.wp.com/latex.php?latex=Kr%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Kr^d' title='Kr^d' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=1+%5Cleq+r+%5Cleq+R&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 &#92;leq r &#92;leq R' title='1 &#92;leq r &#92;leq R' class='latex' />, is virtually nilpotent.</p></blockquote>
<p>It is clear that Theorem 5 implies Theorem 4; for the converse implication, we use the correspondence principle.  We sketch the details as follows.  First, we make things more concrete (i.e. move from Type 3 and Type 4 to Type 1 and Type 2 respectively) by observing that every group G on s generators can be viewed as a quotient <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_s%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_s/&#92;Gamma' title='{&#92;Bbb F}_s/&#92;Gamma' class='latex' /> of the (nonabelian) free group on s generators by some normal subgroup <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' />.</p>
<p>Suppose for contradiction that Theorem 5 failed in this concrete setting; then there exists s, K, d, a sequence <img src='http://s0.wp.com/latex.php?latex=R_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='R_n' title='R_n' class='latex' /> going to infinity, and a sequence <img src='http://s0.wp.com/latex.php?latex=G_n+%3D+%7B%5CBbb+F%7D_s%2F%5CGamma_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G_n = {&#92;Bbb F}_s/&#92;Gamma_n' title='G_n = {&#92;Bbb F}_s/&#92;Gamma_n' class='latex' /> of groups such that each <img src='http://s0.wp.com/latex.php?latex=G_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G_n' title='G_n' class='latex' /> obeys the volume condition <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BB_r%26%23124%3B+%5Cleq+K+r%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;B_r&#124; &#92;leq K r^d' title='&#124;B_r&#124; &#92;leq K r^d' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=1+%5Cleq+r+%5Cleq+R_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 &#92;leq r &#92;leq R_n' title='1 &#92;leq r &#92;leq R_n' class='latex' />.</p>
<p>The next step, as before, is to exploit weak sequential compactness and extract a subsequence of groups <img src='http://s0.wp.com/latex.php?latex=G_n+%3D+%7B%5CBbb+F%7D_s%2F%5CGamma_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G_n = {&#92;Bbb F}_s/&#92;Gamma_n' title='G_n = {&#92;Bbb F}_s/&#92;Gamma_n' class='latex' /> that &#8220;converge&#8221; to some limit <img src='http://s0.wp.com/latex.php?latex=G+%3D+%7B%5CBbb+F%7D_s%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G = {&#92;Bbb F}_s/&#92;Gamma' title='G = {&#92;Bbb F}_s/&#92;Gamma' class='latex' />, in the &#8220;weak&#8221; or &#8220;pointwise&#8221; sense that <img src='http://s0.wp.com/latex.php?latex=%5CGamma_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma_n' title='&#92;Gamma_n' class='latex' /> converges pointwise (or locally) to <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> (much as with the convergence of colourings in the previous setting).  The Arzelà-Ascoli argument as before shows that we can find a subsequence of <img src='http://s0.wp.com/latex.php?latex=G_n+%3D+%7B%5CBbb+F%7D_s%2F%5CGamma_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G_n = {&#92;Bbb F}_s/&#92;Gamma_n' title='G_n = {&#92;Bbb F}_s/&#92;Gamma_n' class='latex' /> which do converge pointwise to some limit object <img src='http://s0.wp.com/latex.php?latex=G+%3D+%7B%5CBbb+F%7D_s%2F%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G = {&#92;Bbb F}_s/&#92;Gamma' title='G = {&#92;Bbb F}_s/&#92;Gamma' class='latex' />; one can check that the property of being a normal subgroup is sufficiently &#8220;local&#8221; that it is preserved under limits, thus <img src='http://s0.wp.com/latex.php?latex=%5CGamma&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Gamma' title='&#92;Gamma' class='latex' /> is a normal subgroup of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_s&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_s' title='{&#92;Bbb F}_s' class='latex' /> and so G is well-defined as a group.  (One way to view this convergence is that algebraic identity obeyed by the generators of G, is eventually obeyed by the groups sufficiently far along the convergent subsequence, and conversely.)</p>
<p>As volume growth is a local condition (involving only words of bounded length for any fixed r), we then easily conclude that G is of polynomial growth, and thus by Theorem 4 is virtually nilpotent.  Some nilpotent algebra reveals that every virtually nilpotent group is finitely presented, so in particular there are a finite list of relations among the generators which guarantee this virtual nilpotency property.  Such properties are local enough that they must then persist to groups <img src='http://s0.wp.com/latex.php?latex=G_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='G_n' title='G_n' class='latex' /> sufficiently far along the subsequence, contradicting Theorem 5.</p>
<p>A slight modification of the above argument also reveals that the step and index of the nilpotent subgroup of G can be bounded by some constant depending only on K, d, s; this gives Theorem 5 meaningful content even when G is finite (in contrast to Theorem 4, which is trivial for finite groups).  On the other hand, no explicit bound for this constant (or for R) in terms of s, K, d is currently known, though presumably some such bound can eventually be extracted from the above argument and the existing proofs of Gromov&#8217;s theorem by proof mining techniques.</p>
<p><strong>3.  The correspondence principle for dense sets of integers</strong></p>
<p>Now we turn to the more &#8220;ergodic&#8221; variants of the correspondence principle, starting with the fundamental <em>Furstenberg correspondence principle</em> connecting combinatorial number theory with ergodic theory.  We will illustrate this with the classic example of <a href="http://en.wikipedia.org/wiki/Szemer%C3%A9di%27s_theorem">Szemerédi&#8217;s theorem</a>.</p>
<p>There are many finitary versions of Szemerédi&#8217;s theorem.  Here is one:</p>
<blockquote><p><strong>Theorem 6.</strong> (Szemerédi&#8217;s theorem, Type 1 version) Let <img src='http://s0.wp.com/latex.php?latex=k+%5Cgeq+2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;geq 2' title='k &#92;geq 2' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+%5Cdelta+%5Cleq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; &#92;delta &#92;leq 1' title='0 &lt; &#92;delta &#92;leq 1' class='latex' />.  Then there exists a positive integer <img src='http://s0.wp.com/latex.php?latex=N+%3D+N%28%5Cdelta%2Ck%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N = N(&#92;delta,k)' title='N = N(&#92;delta,k)' class='latex' /> such that every subset A of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;}' title='&#92;{1,&#92;ldots,N&#92;}' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B+%5Cgeq+%5Cdelta+N&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124; &#92;geq &#92;delta N' title='&#124;A&#124; &#92;geq &#92;delta N' class='latex' /> contains at least one k-term arithmetic progression.</p></blockquote>
<p>The standard &#8220;Type 2&#8243; formulation of this theorem is the assertion that any subset of the integers of positive upper density has arbitrarily long arithmetic progressions.  While this statement is indeed easily shown to be equivalent to Theorem 6, the Furstenberg correspondence principle instead connects this formulation to a rather different one, in which the deterministic infinite set is replaced by a random one. Recall that a random subset of integers <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}' title='{&#92;Bbb Z}' class='latex' /> is a random variable A taking values in the power set <img src='http://s0.wp.com/latex.php?latex=2%5E%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^{&#92;Bbb Z}' title='2^{&#92;Bbb Z}' class='latex' /> of the integers (or more formally, with a distribution that is a <a href="http://en.wikipedia.org/wiki/Borel_measure">Borel probability measure</a> on <img src='http://s0.wp.com/latex.php?latex=2%5E%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^{&#92;Bbb Z}' title='2^{&#92;Bbb Z}' class='latex' /> with the product topology), and so in particular the probabilities of any <em><a href="http://en.wikipedia.org/wiki/Cylinder_set">cylinder events</a> </em>such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%28+3%2C+5+%5Cin+A%3B+7%2C+11+%5Cnot+%5Cin+A+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}( 3, 5 &#92;in A; 7, 11 &#92;not &#92;in A )' title='{&#92;Bbb P}( 3, 5 &#92;in A; 7, 11 &#92;not &#92;in A )' class='latex' /></p>
<p>that involve only finitely many of the elements of A, are well-defined as numbers between 0 and 1.  The <a href="http://en.wikipedia.org/wiki/Caratheodory_extension_theorem">Carathéodory extension theorem</a> (combined with some topological properties of <img src='http://s0.wp.com/latex.php?latex=2%5E%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^{&#92;Bbb Z}' title='2^{&#92;Bbb Z}' class='latex' />) shows, conversely, that any assignment of numbers between 0 and 1 to each cylinder set, which obeys various compatibility conditions such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%28+3+%5Cin+A+%29+%3D+%7B%5CBbb+P%7D%28+3%2C+5+%5Cin+A+%29+%2B+%7B%5CBbb+P%7D%28+3+%5Cin+A%3B+5+%5Cnot+%5Cin+A+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}( 3 &#92;in A ) = {&#92;Bbb P}( 3, 5 &#92;in A ) + {&#92;Bbb P}( 3 &#92;in A; 5 &#92;not &#92;in A )' title='{&#92;Bbb P}( 3 &#92;in A ) = {&#92;Bbb P}( 3, 5 &#92;in A ) + {&#92;Bbb P}( 3 &#92;in A; 5 &#92;not &#92;in A )' class='latex' /></p>
<p style="text-align:left;">can be shown to give rise to a well-defined random set A.</p>
<p style="text-align:left;">We say that a random set A of integers is <a href="http://en.wikipedia.org/wiki/Stationary_process"><em>stationary</em></a> if for every integer h, the shifted set A+h has the same probability distribution as A.  In terms of cylinder events, this is equivalent to a collection of assertions such as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%28+3%2C+5+%5Cin+A%3B+7%2C+11+%5Cnot+%5Cin+A+%29+%3D+%7B%5CBbb+P%7D%28+3-h%2C+5-h+%5Cin+A%3B+7-h%2C+11-h+%5Cnot+%5Cin+A+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}( 3, 5 &#92;in A; 7, 11 &#92;not &#92;in A ) = {&#92;Bbb P}( 3-h, 5-h &#92;in A; 7-h, 11-h &#92;not &#92;in A )' title='{&#92;Bbb P}( 3, 5 &#92;in A; 7, 11 &#92;not &#92;in A ) = {&#92;Bbb P}( 3-h, 5-h &#92;in A; 7-h, 11-h &#92;not &#92;in A )' class='latex' /></p>
<p>and so forth.  One can then equate Theorem 6 with</p>
<blockquote><p><strong>Theorem 7.</strong> (Szemerédi&#8217;s theorem, Type 2 version) Let A be a stationary random infinite set of integers such that <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%280+%5Cin+A%29+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}(0 &#92;in A) &gt; 0' title='{&#92;Bbb P}(0 &#92;in A) &gt; 0' class='latex' /> (which, by stationarity, implies that <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%28n+%5Cin+A%29+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}(n &#92;in A) &gt; 0' title='{&#92;Bbb P}(n &#92;in A) &gt; 0' class='latex' /> for all n), and let <img src='http://s0.wp.com/latex.php?latex=k+%5Cgeq+2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='k &#92;geq 2' title='k &#92;geq 2' class='latex' />.  Then, with positive probability, A contains a k-term arithmetic progression for each k.</p></blockquote>
<p>It is not difficult to show that Theorem 6 implies Theorem 7.  We briefly sketch the converse implication, which goes through the usual compactness-and-contradiction method.  Suppose for contradiction that Theorem 7 is true, but Theorem 6 fails.  Then we can find k and <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />, a sequence of <img src='http://s0.wp.com/latex.php?latex=N_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N_n' title='N_n' class='latex' /> going to infinity, and sets <img src='http://s0.wp.com/latex.php?latex=A_n+%5Csubset+%5C%7B1%2C%5Cldots%2CN_n%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_n &#92;subset &#92;{1,&#92;ldots,N_n&#92;}' title='A_n &#92;subset &#92;{1,&#92;ldots,N_n&#92;}' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA_n%26%23124%3B+%5Cgeq+%5Cdelta+N_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A_n&#124; &#92;geq &#92;delta N_n' title='&#124;A_n&#124; &#92;geq &#92;delta N_n' class='latex' /> with no k-term arithmetic progressions.</p>
<p>We now need to extract a stationary random infinite set A of integers as a limit of the deterministic finite sets <img src='http://s0.wp.com/latex.php?latex=A_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_n' title='A_n' class='latex' />.  The way one does that is by randomly translating each of the <img src='http://s0.wp.com/latex.php?latex=A_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_n' title='A_n' class='latex' />.  More precisely, let <img src='http://s0.wp.com/latex.php?latex=B_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_n' title='B_n' class='latex' /> denote the random finite set <img src='http://s0.wp.com/latex.php?latex=A_n+-+h&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A_n - h' title='A_n - h' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=h&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='h' title='h' class='latex' /> is chosen from <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN_n%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N_n&#92;}' title='&#92;{1,&#92;ldots,N_n&#92;}' class='latex' /> uniformly at random.  The probability distribution <img src='http://s0.wp.com/latex.php?latex=%5Cmu_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;mu_n' title='&#92;mu_n' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=B_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_n' title='B_n' class='latex' /> is a discrete probability measure on <img src='http://s0.wp.com/latex.php?latex=2%5E%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='2^{&#92;Bbb Z}' title='2^{&#92;Bbb Z}' class='latex' /> which is &#8220;almost stationary&#8221; in the sense that <img src='http://s0.wp.com/latex.php?latex=B_n%2B1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_n+1' title='B_n+1' class='latex' /> (say) has a distribution very close to <img src='http://s0.wp.com/latex.php?latex=B_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='B_n' title='B_n' class='latex' />; for instance probabilities such as <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%28+3%2C+5+%5Cin+B_n+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}( 3, 5 &#92;in B_n )' title='{&#92;Bbb P}( 3, 5 &#92;in B_n )' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%283%2C5+%5Cin+B_n%2B1%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}(3,5 &#92;in B_n+1)' title='{&#92;Bbb P}(3,5 &#92;in B_n+1)' class='latex' /> can easily be seen to differ only by <img src='http://s0.wp.com/latex.php?latex=O%281%2FN_n%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(1/N_n)' title='O(1/N_n)' class='latex' />.  Also, the fact that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA_n%26%23124%3B+%5Cgeq+%5Cdelta+N_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A_n&#124; &#92;geq &#92;delta N_n' title='&#124;A_n&#124; &#92;geq &#92;delta N_n' class='latex' /> equates to the assertion that <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+P%7D%280+%5Cin+B_n%29+%5Cgeq+%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb P}(0 &#92;