<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>sum-product-estimates &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/sum-product-estimates/</link>
	<description>Feed of posts on WordPress.com tagged "sum-product-estimates"</description>
	<pubDate>Wed, 22 May 2013 13:30:24 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[Milliman Lecture III: Sum-product estimates, expanders, and exponential sums]]></title>
<link>http://terrytao.wordpress.com/2007/12/06/milliman-lecture-iii-sum-product-estimates-expanders-and-exponential-sums/</link>
<pubDate>Fri, 07 Dec 2007 06:11:11 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2007/12/06/milliman-lecture-iii-sum-product-estimates-expanders-and-exponential-sums/</guid>
<description><![CDATA[This is my final Milliman lecture, in which I talk about the sum-product phenomenon in arithmetic co]]></description>
<content:encoded><![CDATA[<p>This is my <a href="http://www.math.washington.edu/Seminars/milliman_0708.php" class="snap_noshots">final Milliman lecture</a>, in which I talk about the sum-product phenomenon in arithmetic combinatorics, and some selected recent applications of this phenomenon to uniform distribution of exponentials, <a href="http://en.wikipedia.org/wiki/Expander_graph">expander graphs</a>, <a href="http://en.wikipedia.org/wiki/Extractor">randomness extractors</a>, and detecting (<a href="http://en.wikipedia.org/wiki/Sieve_theory">sieving</a>) <a href="http://en.wikipedia.org/wiki/Almost_prime">almost primes</a> in group orbits, particularly as developed by Bourgain and his co-authors.<br />
<!--more-->In the previous two lectures we had concentrated on <i>additive combinatorics</i> &#8211; the study of additive operations and patterns on sets.  Now we will look at <i>arithmetic combinatorics</i> &#8211; the simultaneous study of additive <i>and</i> multiplicative operations on sets; this is sort of a combinatorial analogue of <a href="http://en.wikipedia.org/wiki/Commutative_algebra">commutative algebra</a>.</p>
<p>There are many questions to study here, but the most basic is the sum-product problem, which we can state as follows.  Let A be a finite non-empty set of elements of a ring R (e.g. finite sets of integers, or elements of a cyclic group <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%2Fq%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}/q{&#92;Bbb Z}' title='{&#92;Bbb Z}/q{&#92;Bbb Z}' class='latex' />, or sets of matrices over some ring).  Then we can form the <i>sum set</i></p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=A%2BA+%3A%3D+%5C%7B+a+%2B+b%3A+a%2C+b+%5Cin+A+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A := &#92;{ a + b: a, b &#92;in A &#92;}' title='A+A := &#92;{ a + b: a, b &#92;in A &#92;}' class='latex' /></p>
<p>and the <i>product set</i></p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A+%3A%3D+%5C%7B+a+%5Ccdot+b%3A+a%2C+b+%5Cin+A+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A := &#92;{ a &#92;cdot b: a, b &#92;in A &#92;}' title='A &#92;cdot A := &#92;{ a &#92;cdot b: a, b &#92;in A &#92;}' class='latex' /></p>
<p>To avoid degeneracies, let us assume that none (or very few) of the elements in A are <a href="http://en.wikipedia.org/wiki/Zero_divisor">zero divisors</a> (as this may cause <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> to become very small).  Then it is easy to see that <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> will be at least as large as A itself.</p>
<p>Typically, both of these sets will be much larger than A itself, indeed, if we select A at random, we generically expect <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> to have cardinality comparable to <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124;^2' title='&#124;A&#124;^2' class='latex' />.  But when A enjoys additive or multiplicative structure, the sets <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> can be of size comparable to A.  For instance, if A is an arithmetic progression <img src='http://s0.wp.com/latex.php?latex=%5C%7Ba%2C+a%2Br%2C+a%2B2r%2C+%5Cldots%2C+a%2B%28k-1%29r%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{a, a+r, a+2r, &#92;ldots, a+(k-1)r&#92;}' title='&#92;{a, a+r, a+2r, &#92;ldots, a+(k-1)r&#92;}' class='latex' /> or an additive subgroup in the ring R (modulo zero divisors, such as 0), then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%2BA%26%23124%3B+%5Csim+%26%23124%3BA%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A+A&#124; &#92;sim &#124;A&#124;' title='&#124;A+A&#124; &#92;sim &#124;A&#124;' class='latex' />.  Similarly, if A is a geometric progression <img src='http://s0.wp.com/latex.php?latex=%5C%7Ba%2C+ar%2C+ar%5E2%2C+%5Cldots%2C+ar%5E%7Bk-1%7D+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{a, ar, ar^2, &#92;ldots, ar^{k-1} &#92;}' title='&#92;{a, ar, ar^2, &#92;ldots, ar^{k-1} &#92;}' class='latex' /> or a multiplicative subgroup in the ring R, then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA+%5Ccdot+A%26%23124%3B+%5Csim+%26%23124%3BA%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A &#92;cdot A&#124; &#92;sim &#124;A&#124;' title='&#124;A &#92;cdot A&#124; &#92;sim &#124;A&#124;' class='latex' />.  And of course, if A is both an additive and a multiplicative subgroup of R (modulo zero divisors), i.e. if A is a subring of R, then <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%2BA%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A+A&#124;' title='&#124;A+A&#124;' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA+%5Ccdot+A%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A &#92;cdot A&#124;' title='&#124;A &#92;cdot A&#124;' class='latex' /> are both comparable in size to &#124;A&#124;.  These examples are robust with respect to small perturbations; for instance, if A is a dense subset of an arithmetic progression or additive subgroup, then it is still the case that <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> is comparable in size to A.  There are also slightly more complicated examples of interest, such as <a href="http://en.wikipedia.org/wiki/Generalized_arithmetic_progression">generalised arithmetic progressions</a>, but we will not discuss these here.</p>
<p>Now let us work in the ring of integers <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}' title='{&#92;Bbb Z}' class='latex' />.  This ring has no non-trivial finite additive subgroups or multiplicative subgroups (and it certainly has no non-trivial finite subrings), but it of course has plenty of arithmetic progressions and geometric progressions.  But observe that it is rather difficult for a finite set A of integers to resemble both an arithmetic progression and a geometric progression simultaneously (unless A is very small).  So one expects at least one of <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> to be significantly larger than A itself. This claim was made precise <a href="http://www.ams.org/mathscinet-getitem?mr=820223" class="snap_noshots">by Erdős and Szemerédi</a>, who showed that</p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cmax%28+%26%23124%3BA%2BA%26%23124%3B%2C+%26%23124%3BA+%5Ccdot+A%26%23124%3B+%29+%5Cgg+%26%23124%3BA%26%23124%3B%5E%7B1%2B%5Cvarepsilon%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;max( &#124;A+A&#124;, &#124;A &#92;cdot A&#124; ) &#92;gg &#124;A&#124;^{1+&#92;varepsilon}' title='&#92;max( &#124;A+A&#124;, &#124;A &#92;cdot A&#124; ) &#92;gg &#124;A&#124;^{1+&#92;varepsilon}' class='latex' /> (1)</p>
<p>for some absolute constant <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon &gt; 0' title='&#92;varepsilon &gt; 0' class='latex' />.  The value of this constant as improved steadily over the years; the best result currently is due <a href="http://www.ams.org/mathscinet-getitem?mr=2212132" class="snap_noshots">to Solymosi</a>, who showed that one can take <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon' title='&#92;varepsilon' class='latex' /> arbitrarily close to 3/11.  Erdős and Szemerédi in fact conjectured that one can take <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon' title='&#92;varepsilon' class='latex' /> arbitrarily close to 1 (i.e. for any finite set of integers A, either the sum set or product set has to be very close to its maximal size of <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124;^2' title='&#124;A&#124;^2' class='latex' />), but this conjecture seems out of reach at present.  Nevertheless, even just the epsilon improvement over the trivial bound of &#124;A&#124; is actually quite useful.  It is the first example of what is now called the sum-product phenomenon: if a finite set A is not close to an actual subring, then either the sum set <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> or the product set <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> must be significantly larger than A itself.  One can view (1) as a &#8220;robust&#8221; version of the assertion that the integers contain no non-trivial finite subrings; (1) is asserting that in fact the integers contain no non-trivial finite sets which even come close to behaving like a subring.</p>
<p>In 1999, Tom Wolff posed the question of whether the sum-product phenomenon  held true in finite fields <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' /> of prime order (note that such fields have no non-trivial subrings), and in particular whether (1) was true when <img src='http://s0.wp.com/latex.php?latex=A+%5Csubset+%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;subset {&#92;Bbb F}_p' title='A &#92;subset {&#92;Bbb F}_p' class='latex' />, and A was not close to being all of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' />, in the sense that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B+%5Cleq+p%5E%7B1-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124; &#92;leq p^{1-&#92;delta}' title='&#124;A&#124; &#92;leq p^{1-&#92;delta}' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' />; of course one would need <img src='http://s0.wp.com/latex.php?latex=%5Cvarepsilon&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;varepsilon' title='&#92;varepsilon' class='latex' /> to depend on <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />.  (Actually, Tom only posed the question for <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B+%5Csim+p%5E%7B1%2F2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124; &#92;sim p^{1/2}' title='&#124;A&#124; &#92;sim p^{1/2}' class='latex' />, being motivated by finite field analogues of the Kakeya problem, but the question was clearly of interest for other ranges of A as well.)  This question was solved in the affirmative <a href="http://front.math.ucdavis.edu/math.CO/0301343" class="snap_noshots">by Bourgain, Katz, and myself</a> (in the range <img src='http://s0.wp.com/latex.php?latex=p%5E%5Cdelta+%5Cleq+%26%23124%3BA%26%23124%3B+%5Cleq+p%5E%7B1-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^&#92;delta &#92;leq &#124;A&#124; &#92;leq p^{1-&#92;delta}' title='p^&#92;delta &#92;leq &#124;A&#124; &#92;leq p^{1-&#92;delta}' class='latex' />) and then <a href="http://arxiv.org/abs/math/0304217" class="snap_noshots">by Bourgain, Glibichuk, and Konyagin</a> (in the full range <img src='http://s0.wp.com/latex.php?latex=1+%5Cleq+%26%23124%3BA%26%23124%3B+%5Cleq+p%5E%7B1-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='1 &#92;leq &#124;A&#124; &#92;leq p^{1-&#92;delta}' title='1 &#92;leq &#124;A&#124; &#92;leq p^{1-&#92;delta}' class='latex' />); the result is now known as the <i>sum-product theorem</i> for <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' /> (and there have since been several further proofs and refinements of this theorem).  The fact that the field has prime order is key; if for instance we were working in a field of order <img src='http://s0.wp.com/latex.php?latex=p%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^2' title='p^2' class='latex' />, then by taking A to be the subfield of order p we see that both <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> have exactly the same size as A.  So any proof of the sum-product theorem must use at some point the fact that the field has prime order.</p>
<p>As in the integers, one can view the sum-product theorem as a robust assertion of the obvious statement that the field <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' /> contains no non-trivial subrings.  So the main difficulty in the proof is to find a proof of this latter fact which is robust enough to generalise to this combinatorial setting.  The standard way to classify subrings is to use <a href="http://en.wikipedia.org/wiki/Lagrange%27s_theorem_%28group_theory%29">Lagrange&#8217;s theorem</a> that the order of a subgroup divides the order of the whole group, which is proven by partitioning the whole group into cosets of the subgroup, but this argument is very unstable and does not extend to the combinatorial setting.  But there are other ways to proceed.  The argument of Bourgain, Katz, and myself (which is based on an earlier argument <a href="http://www.ams.org/mathscinet-getitem?mr=1948103">of Edgar and Miller</a>), roughly speaking, proceeds by investigating the &#8220;dimension&#8221; of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' /> relative to A, or in other words the least number of elements <img src='http://s0.wp.com/latex.php?latex=v_1%2C+%5Cldots%2C+v_d&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='v_1, &#92;ldots, v_d' title='v_1, &#92;ldots, v_d' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' /> such that every element of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}' title='{&#92;Bbb F}' class='latex' /> can be expressed in the form <img src='http://s0.wp.com/latex.php?latex=a_1+v_1+%2B+%5Cldots+%2B+a_d+v_d&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a_1 v_1 + &#92;ldots + a_d v_d' title='a_1 v_1 + &#92;ldots + a_d v_d' class='latex' />.  Note that the number of such representations is equal to <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124;^d' title='&#124;A&#124;^d' class='latex' />.  The key observation is that as <img src='http://s0.wp.com/latex.php?latex=%26%23124%3B%7B%5CBbb+F%7D_p%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;{&#92;Bbb F}_p&#124;' title='&#124;{&#92;Bbb F}_p&#124;' class='latex' /> is prime, it cannot equal <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124;^d' title='&#124;A&#124;^d' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=d+%26%2362%3B+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='d &gt; 1' title='d &gt; 1' class='latex' />, and so by the pigeonhole principle some element must have more than one representation.  One can use this &#8220;linear dependence&#8221; to reduce the dimension by 1 (assuming that A behaves a lot like a subring), and so can eventually reduce to the d=1 case, which is prohibited by our assumption <img src='http://s0.wp.com/latex.php?latex=A+%26%2360%3B+p%5E%7B1-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &lt; p^{1-&#92;delta}' title='A &lt; p^{1-&#92;delta}' class='latex' />.  (The hypothesis <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B+%26%2362%3B+p%5E%7B%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124; &gt; p^{&#92;delta}' title='&#124;A&#124; &gt; p^{&#92;delta}' class='latex' /> is needed to ensure that the initial dimension d is bounded, so that the iteration only requires a bounded number of steps.)  The argument of Bourgain, Glibichuk, and Konyagin uses a more algebraic method (a variant of the polynomial method of Stepanov), using the basic observation that the number of zeroes of a polynomial (counting multiplicity) is bounded by the degree of that polynomial to obtain upper bounds for various sets (such as the number of parallelograms in A).  More recently, a <a href="http://arxiv.org/abs/math/0702780" class="snap_noshots">short argument of Garaev</a> proceeds using the simple observation that if A is any non-trivial subset of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' />, then there must exist <img src='http://s0.wp.com/latex.php?latex=a+%5Cin+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a &#92;in A' title='a &#92;in A' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=a%2B1+%5Cnot+%5Cin+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a+1 &#92;not &#92;in A' title='a+1 &#92;not &#92;in A' class='latex' />; applying this to the &#8220;fraction field&#8221; <img src='http://s0.wp.com/latex.php?latex=Q%5BA%5D+%3A%3D+%5C%7B+%28a-b%29%2F%28c-d%29%3A+a%2Cb%2Cc%2Cd+%5Cin+A%2C+c+%5Cneq+d+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Q[A] := &#92;{ (a-b)/(c-d): a,b,c,d &#92;in A, c &#92;neq d &#92;}' title='Q[A] := &#92;{ (a-b)/(c-d): a,b,c,d &#92;in A, c &#92;neq d &#92;}' class='latex' /> of A one can conclude that Q[A] does not in fact behave like a field, and hence A does not behave like a ring.</p>
<p>The sum-product phenomenon implies that if a set <img src='http://s0.wp.com/latex.php?latex=A+%5Csubset+%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;subset {&#92;Bbb F}_p' title='A &#92;subset {&#92;Bbb F}_p' class='latex' /> of medium size <img src='http://s0.wp.com/latex.php?latex=p%5E%5Cdelta+%5Cleq+%26%23124%3BA%26%23124%3B+%5Cleq+p%5E%7B1-%5Cdelta%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^&#92;delta &#92;leq &#124;A&#124; &#92;leq p^{1-&#92;delta}' title='p^&#92;delta &#92;leq &#124;A&#124; &#92;leq p^{1-&#92;delta}' class='latex' /> is multiplicatively structured (e.g. it is a geometric progression or a multiplicative subgroup) then it cannot be additively structured: <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> is significantly larger than A.  It turns out that with a little bit of extra work, this observation can be iterated: A+A+A+A is even larger than A, and so on and so forth, and in fact one can show that <img src='http://s0.wp.com/latex.php?latex=kA+%3D+%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='kA = {&#92;Bbb F}_p' title='kA = {&#92;Bbb F}_p' class='latex' /> for some bounded k depending only on <img src='http://s0.wp.com/latex.php?latex=%5Cdelta&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta' title='&#92;delta' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=kA+%3A%3D+A+%2B+%5Cldots+%2B+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='kA := A + &#92;ldots + A' title='kA := A + &#92;ldots + A' class='latex' /> is the k-fold sumset of A.  (The key to this iteration essentially lies in the inclusion <img src='http://s0.wp.com/latex.php?latex=%28kA%29+%5Ccdot+%28kA%29+%5Csubset+k%5E2+%28A%5E2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(kA) &#92;cdot (kA) &#92;subset k^2 (A^2)' title='(kA) &#92;cdot (kA) &#92;subset k^2 (A^2)' class='latex' />, which is a consequence of the <a href="http://en.wikipedia.org/wiki/Distributive_law">distributive law</a>.  The use of this law unfortunately breaks the symmetry between multiplication and addition that one sees in the sum-product estimates.)  Thus any multiplicatively structured subset A of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' /> of medium size must eventually additively generate the whole field.  As a consequence of this, one can show that A is an additive expander, which roughly speaking means that <img src='http://s0.wp.com/latex.php?latex=A%2BB&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+B' title='A+B' class='latex' /> is spread out on a significantly larger set than B for any medium-sized set B.  (In more probabilistic language, if one considered the random walk whose steps were drawn randomly from A, then this walk would converge extremely rapidly to the uniform distribution.)  From that observation (and some more combinatorial effort), one can in fact conclude that  multiplicatively structured sets must be distributed uniformly in an additive sense; if they concentrated too much in, say, a subinterval of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' />, then this could be used to contradict the additive expansion property.</p>
<p>Let me note one cute application of this technology, due to Bourgain, to the <a href="http://en.wikipedia.org/wiki/Diffie-Hellman_key_exchange">Diffie-Hellman key exchange protocol</a> and its relatives in cryptography.  Suppose we have two people, Alice and Bob, who want to communicate privately and securely, but have never met each other before and can only contact each other via an unsecured network (e.g. the internet, or physical mail), in which anyone can eavesdrop.  How can Alice and Bob achieve this?</p>
<p>If one was sending a physical object (e.g. a physical letter) by physical mail (which could be opened by third parties), one could proceed as follows.</p>
<ol>
<li>Alice places the object in a box, and locks the box with her own padlock, keeping the key.  She then mails the locked box to Bob.  Anyone who intercepts the box cannot open it, since they don&#8217;t have Alice&#8217;s key.</li>
<li>Of course, Bob can&#8217;t open the box either.  But what he can do instead is put his own padlock on the box (keeping the key), and sends the doubly locked box back to Alice.</li>
<li>Alice can&#8217;t unlock Bob&#8217;s padlock&#8230; but she can unlock her own.  So she removes her lock, and sends the singly locked box back to Bob.</li>
<li>Bob can unlock his own padlock, and so retreives the object safely.  At no point was the object available to any interceptor.</li>
</ol>
<p>A similar procedure (a slight variant of the Diffie-Hellman protocol, essentially the <a href="http://en.wikipedia.org/wiki/Massey-Omura_cryptosystem">Massey-Omura cryptosystem</a>) can be used to transmit a digital message g (which one should think of as just being a number) from Alice to Bob over an unsecured network, as follows:</p>
<ol>
<li>Alice and Bob agree (over the unsecured network) on some large prime p (larger than the maximum size of the message g).</li>
<li>Alice &#8220;locks&#8221; the message g by raising it to a power a mod p, where Alice generates the &#8220;key&#8221; a randomly and keeps it secret.  She then sends the locked message <img src='http://s0.wp.com/latex.php?latex=g%5Ea+%5Chbox%7B+mod+%7D+p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g^a &#92;hbox{ mod } p' title='g^a &#92;hbox{ mod } p' class='latex' /> to Bob.</li>
<li>Bob can&#8217;t decode this message (he doesn&#8217;t know a), but he doubly locks the message by raising the message to his own power b, and returns the doubly locked message <img src='http://s0.wp.com/latex.php?latex=g%5E%7Bab%7D+%5Chbox%7B+mod+%7D+p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g^{ab} &#92;hbox{ mod } p' title='g^{ab} &#92;hbox{ mod } p' class='latex' /> back to Alice.</li>
<li>Alice then &#8220;unlocks&#8221; her part of the message by taking the <img src='http://s0.wp.com/latex.php?latex=a%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='a^{th}' title='a^{th}' class='latex' /> root (which can be done by <a href="http://en.wikipedia.org/wiki/Cauchy%27s_theorem_%28group_theory%29">Cauchy&#8217;s theorem</a>) and sends <img src='http://s0.wp.com/latex.php?latex=g%5Eb+%5Chbox%7B+mod+%7D+p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g^b &#92;hbox{ mod } p' title='g^b &#92;hbox{ mod } p' class='latex' /> back to Bob.</li>
<li>Bob then takes the <img src='http://s0.wp.com/latex.php?latex=b%5E%7Bth%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='b^{th}' title='b^{th}' class='latex' /> root of the message and recovers g.</li>
</ol>
<p>An eavesdropper (let&#8217;s call her Eve) could intercept p, as well as the three &#8220;locked&#8221; values <img src='http://s0.wp.com/latex.php?latex=g%5Ea%2C+g%5Eb%2C+g%5E%7Bab%7D+%5Chbox%7B+mod+%7D+p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g^a, g^b, g^{ab} &#92;hbox{ mod } p' title='g^a, g^b, g^{ab} &#92;hbox{ mod } p' class='latex' />, but does not directly recover g.  Now, it is possible that one could use this information to reconstruct g (indeed, if one could quickly take discrete logarithms, then this would be a fairly easy task) but no feasible algorithm for this is known (if p is large, e.g. 500+ digits); the problem is generally believed to be roughly comparable in difficulty to that of factoring large numbers.  But no-one knows how to rigorously prove that the Diffie-Hellman reconstruction problem is hard (e.g. non-polynomial time); indeed, this would imply <a href="http://en.wikipedia.org/wiki/P_%3D_NP_problem"><img src='http://s0.wp.com/latex.php?latex=P+%5Cneq+NP&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P &#92;neq NP' title='P &#92;neq NP' class='latex' /></a>, since this reconstruction problem is easily seen to be in NP (though it is not believed to be <a href="http://en.wikipedia.org/wiki/NP-complete">NP-complete</a>).</p>
<p>Using the sum-product technology, Bourgain was at least able to show that the Diffie-Hellman protocol was secure (for sufficiently large p) if Eve was only able to see the <a href="http://en.wikipedia.org/wiki/Significant_figures">high bits</a> of <img src='http://s0.wp.com/latex.php?latex=g%5Ea%2C+g%5Eb%2C+g%5E%7Bab%7D+%5Chbox%7B+mod+%7D+p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g^a, g^b, g^{ab} &#92;hbox{ mod } p' title='g^a, g^b, g^{ab} &#92;hbox{ mod } p' class='latex' />, thus pinning down <img src='http://s0.wp.com/latex.php?latex=g%5Ea%2C+g%5Eb%2C+g%5E%7Bab%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='g^a, g^b, g^{ab}' title='g^a, g^b, g^{ab}' class='latex' /> to intervals.  The reason for this is that the set <img src='http://s0.wp.com/latex.php?latex=%5C%7B+%28g%5Ea%2C+g%5Eb%2C+g%5E%7Bab%7D%29+%5Cin+%7B%5CBbb+F%7D_p%5E3%3A+a%2Cb+%5Cin+%7B%5CBbb+Z%7D+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ (g^a, g^b, g^{ab}) &#92;in {&#92;Bbb F}_p^3: a,b &#92;in {&#92;Bbb Z} &#92;}' title='&#92;{ (g^a, g^b, g^{ab}) &#92;in {&#92;Bbb F}_p^3: a,b &#92;in {&#92;Bbb Z} &#92;}' class='latex' /> has a lot of multiplicative structure (indeed, it is a multiplicative subgroup of the ring <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p^3' title='{&#92;Bbb F}_p^3' class='latex' />) and so should be uniformly distributed in an additive sense (by adapting the above sum-product technology to <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p^3' title='{&#92;Bbb F}_p^3' class='latex' />).</p>
<p>Another application of sum-product technology was to build efficient <a href="http://en.wikipedia.org/wiki/Extractor">randomness extractors</a> &#8211; deterministic algorithms that can create high-quality (very uniform) random bits from several independent low-quality (non-uniform) random sources; such extractors are of importance in computer science and cryptography.  Basically, the sum-product estimate implies that if <img src='http://s0.wp.com/latex.php?latex=A%2C+B%2C+C+%5Csubset+%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A, B, C &#92;subset {&#92;Bbb F}_p' title='A, B, C &#92;subset {&#92;Bbb F}_p' class='latex' /> are sets of medium size, then the set <img src='http://s0.wp.com/latex.php?latex=A%2BB+%5Ccdot+C&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+B &#92;cdot C' title='A+B &#92;cdot C' class='latex' /> is significantly larger than A, B, or C.  As a consequence, if X, Y, Z are independent random variables in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+F%7D_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb F}_p' title='{&#92;Bbb F}_p' class='latex' /> which are not too narrowly distributed(in particular, they are not deterministic, and thus distributed only on a single value), one can show (with the assistance of some additive combinatorics) that the random variable <img src='http://s0.wp.com/latex.php?latex=X%2BYZ&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='X+YZ' title='X+YZ' class='latex' /> is significantly more uniformly distributed than X, Y, or Z.  Iterating this leads to some surprisingly good randomness extractors, as was first observed <a href="http://www.cs.princeton.edu/~boaz/Papers/msamples.ps" class="snap_noshots">by Barak, Impagliazzo, and Wigderson</a>.</p>
<p>Another application of the above sum-product technology was to get a product estimate in matrix groups, such as <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+F%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb F}_p)' title='SL_2({&#92;Bbb F}_p)' class='latex' />.  Indeed, <a href="http://arxiv.org/abs/math/0509024" class="snap_noshots">Helfgott was able to show</a> that if A was a subset of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+F%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb F}_p)' title='SL_2({&#92;Bbb F}_p)' class='latex' /> of medium or small size, and it was not trapped inside a proper subgroup of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+F%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb F}_p)' title='SL_2({&#92;Bbb F}_p)' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A &#92;cdot A' title='A &#92;cdot A &#92;cdot A' class='latex' /> was significantly larger than A itself.  (One needs to work with triple products here instead of double products for a rather trivial reason: if <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A' title='A' class='latex' /> was the union of a subgroup and some external element, then <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> is still comparable in size to A, but <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A &#92;cdot A' title='A &#92;cdot A &#92;cdot A' class='latex' /> will be much larger.  This result may not immediately look like a sum-product estimate, because there is no obvious addition, but it is concealed within the matrix multiplication law for <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+F%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb F}_p)' title='SL_2({&#92;Bbb F}_p)' class='latex' />.  The key observation in Helfgott&#8217;s argument, which relies crucially on the sum-product estimate, is that if V is a collection of diagonal matrices in <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+F%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb F}_p)' title='SL_2({&#92;Bbb F}_p)' class='latex' /> of medium size, and g is a non-diagonal matrix element, then the set <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Btr%7D%28V+g+V+g%5E%7B-1%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{tr}(V g V g^{-1})' title='&#92;hbox{tr}(V g V g^{-1})' class='latex' /> is significantly larger than V itself.  If one works out explicitly what this trace is, one sees a sum-product type of result emerging.  Conversely, if the trace <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Btr%7D%28A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{tr}(A)' title='&#92;hbox{tr}(A)' class='latex' /> of a group-like set A is large, then the conjugacy classes in A are fairly small (since trace is conjugation-invariant), which forces many pairs in A to commute, which creates large sets V of simultaneously commuting (and hence simultaneously diagonalisable) elements, due to the fact (specific to <img src='http://s0.wp.com/latex.php?latex=SL_2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2' title='SL_2' class='latex' />) that if two elements commute with a third, then they are quite likely to commute with each other.  The tension between these two implications is what underlies Helfgott&#8217;s results.</p>
<p>The estimate of Helfgott shows that multiplication by medium-size sets in <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+F%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb F}_p)' title='SL_2({&#92;Bbb F}_p)' class='latex' /> expands rapidly across the group (unless it is trapped in a subgroup).  As a consequence of Helfgott&#8217;s estimate, <a href="http://www.ams.org/mathscinet-getitem?mr=2227746" class="snap_noshots">Bourgain and Gamburd</a> were able to show that if S was any finite symmetric set of matrices in <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z})' title='SL_2({&#92;Bbb Z})' class='latex' /> which generated a sufficiently large (or more precisely, Zariski dense) subgroup of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z})' title='SL_2({&#92;Bbb Z})' class='latex' />, and <img src='http://s0.wp.com/latex.php?latex=S_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_p' title='S_p' class='latex' /> was the projection of S to <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_p)' title='SL_2({&#92;Bbb Z}_p)' class='latex' />, then the random walk using <img src='http://s0.wp.com/latex.php?latex=S_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_p' title='S_p' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D_p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z}_p)' title='SL_2({&#92;Bbb Z}_p)' class='latex' /> was very rapidly mixing, so that after about <img src='http://s0.wp.com/latex.php?latex=O%28%5Clog+p%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='O(&#92;log p)' title='O(&#92;log p)' class='latex' /> steps, the walk was very close to uniform.  (The precise statement was that the <a href="http://en.wikipedia.org/wiki/Cayley_graph">Cayley graph</a> associated to <img src='http://s0.wp.com/latex.php?latex=S_p&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='S_p' title='S_p' class='latex' /> for each p formed an <a href="http://en.wikipedia.org/wiki/Expander_graph">expander family</a>.)  Quite recently, <a href="http://www.ams.org/mathscinet-getitem?mr=2246331" class="snap_noshots">Bourgain, Gamburd, and Sarnak</a> have applied these results (and generalisations thereof) to the problem of detecting  (or <a href="http://en.wikipedia.org/wiki/Sieve_theory">sieving</a>) almost primes in thin algebraically generated sets.  To motivate the problem, we observe that many classical questions in prime number theory can be rephrased as one of detecting prime points <img src='http://s0.wp.com/latex.php?latex=%28p_1%2C%5Cldots%2Cp_d%29+%5Cin+%7B%5Cmathcal+P%7D%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(p_1,&#92;ldots,p_d) &#92;in {&#92;mathcal P}^d' title='(p_1,&#92;ldots,p_d) &#92;in {&#92;mathcal P}^d' class='latex' /> in algebraic subsets <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O}' title='{&#92;mathcal O}' class='latex' /> of a lattice <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}^d' title='{&#92;Bbb Z}^d' class='latex' />.  For instance, the twin prime problem asks whether the line <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D+%3D+%5C%7B+%28n%2Cn%2B2%29+%5Cin+%7B%5CBbb+Z%7D%5E2+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O} = &#92;{ (n,n+2) &#92;in {&#92;Bbb Z}^2 &#92;}' title='{&#92;mathcal O} = &#92;{ (n,n+2) &#92;in {&#92;Bbb Z}^2 &#92;}' class='latex' /> contains infinitely many prime points.  In general, these problems are very difficult, especially once one considers sets described by polynomials rather than linear functions; even the one-dimensional problem of determining whether the set <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D+%3D+%5C%7B+n%5E2%2B1%3A+n+%5Cin+%7B%5CBbb+Z%7D+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O} = &#92;{ n^2+1: n &#92;in {&#92;Bbb Z} &#92;}' title='{&#92;mathcal O} = &#92;{ n^2+1: n &#92;in {&#92;Bbb Z} &#92;}' class='latex' /> contains infinitely many primes has been open for quite a long time (though it is worth mentioning the celebrated <a href="http://www.ams.org/mathscinet-getitem?mr=1670065" class="snap_noshots">result of Friedlander and Iwaniec</a> that the somewhat larger set <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D+%3D+%5C%7B+n%5E2+%2B+m%5E4%3A+n%2Cm+%5Cin+%7B%5CBbb+Z%7D+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O} = &#92;{ n^2 + m^4: n,m &#92;in {&#92;Bbb Z} &#92;}' title='{&#92;mathcal O} = &#92;{ n^2 + m^4: n,m &#92;in {&#92;Bbb Z} &#92;}' class='latex' /> is known to have infinitely many primes).</p>
<p>So prime points are hard to detect.  However, by using methods from sieve theory, one can often detect almost prime points in various sets <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O}' title='{&#92;mathcal O}' class='latex' /> &#8211; points whose coordinates are the products of only a few primes.  For instance, a famous <a href="http://en.wikipedia.org/wiki/Chen%27s_theorem">theorem of Chen</a> shows that the line <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D+%3D+%5C%7B+%28n%2Cn%2B2%29+%5Cin+%7B%5CBbb+Z%7D%5E2+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O} = &#92;{ (n,n+2) &#92;in {&#92;Bbb Z}^2 &#92;}' title='{&#92;mathcal O} = &#92;{ (n,n+2) &#92;in {&#92;Bbb Z}^2 &#92;}' class='latex' /> contains infinitely many points which are <a href="http://en.wikipedia.org/wiki/Almost_prime">almost prime</a> in the sense that the first coordinate is prime, and the second coordinate is the product of at most two primes.  The basic idea of sieve theory is to sift out primes and almost primes by removing all points whose coordinates are divisible by small factors (and then, due to various generalisations of the <a href="http://en.wikipedia.org/wiki/Inclusion-exclusion_principle">inclusion-exclusion principle</a>, one has to add back in points which are divisible by multiple small factors, and so forth).  See <a href="http://terrytao.wordpress.com/2007/06/05/open-question-the-parity-problem-in-sieve-theory/" class="snap_noshots">my post on sieve theory and the parity problem</a> for further discussion.  In order for sieve theory to work well, one needs to be able to accurately count the size of the original set <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O}' title='{&#92;mathcal O}' class='latex' /> (or more precisely, the size of this set restricted to a ball or a similar object), and also need to count how many points in that set have a certain residue class modulo q, for various values of q.  (For instance, to sieve out twin primes or twin almost primes in the interval {1,..,N}, one needs to count how many elements n in that interval are such that n and n+2 are both invertible modulo q (i.e. coprime to q) for various values of q.)</p>
<p>For arbitrary algebraic sets <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O}' title='{&#92;mathcal O}' class='latex' />, these tasks are very difficult.  For instance, even the basic task of determining whether a set <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O}' title='{&#92;mathcal O}' class='latex' /> described by several polynomials is non-empty is essentially <a href="http://en.wikipedia.org/wiki/Hilbert%27s_tenth_problem">Hilbert&#8217;s tenth problem</a>, which is undecidable in general.  But if the set <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O}' title='{&#92;mathcal O}' class='latex' /> is generated by a group <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' /> acting on <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+Z%7D%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb Z}^d' title='{&#92;Bbb Z}^d' class='latex' /> (in some polynomial fashion), thus <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D+%3D+%5CLambda+b&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O} = &#92;Lambda b' title='{&#92;mathcal O} = &#92;Lambda b' class='latex' /> for some point <img src='http://s0.wp.com/latex.php?latex=b+%5Cin+%7B%5CBbb+Z%7D%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='b &#92;in {&#92;Bbb Z}^d' title='b &#92;in {&#92;Bbb Z}^d' class='latex' />, then the problems become much more tractable.  If the group <img src='http://s0.wp.com/latex.php?latex=%5CLambda&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;Lambda' title='&#92;Lambda' class='latex' /> is generated by some finite set S, and we restrict attention to group elements with some given word length, the problem of understanding how <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal+O%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathcal O}' title='{&#92;mathcal O}' class='latex' /> is distributed modulo q is equivalent to asking how random walks on S of a given length distribute themselves on <img src='http://s0.wp.com/latex.php?latex=%28%7B%5CBbb+Z%7D%2Fq%7B%5CBbb+Z%7D%29%5Ed&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='({&#92;Bbb Z}/q{&#92;Bbb Z})^d' title='({&#92;Bbb Z}/q{&#92;Bbb Z})^d' class='latex' />.  This latter problem is very close to the problem solved by the mixing results of Bourgain and Gamburd mentioned earlier, which is where the link to sum-product estimates arises from.  Indeed, Bourgain, Gamburd, and Sarnak have now shown that rather general classes of algebraic sets generated by subgroups of <img src='http://s0.wp.com/latex.php?latex=SL_2%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_2({&#92;Bbb Z})' title='SL_2({&#92;Bbb Z})' class='latex' /> will contain infinitely many almost primes, as long as there are no obvious algebraic obstructions; the methods should hopefully extend to more general groups, such as subgroups of <img src='http://s0.wp.com/latex.php?latex=SL_n%28%7B%5CBbb+Z%7D%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SL_n({&#92;Bbb Z})' title='SL_n({&#92;Bbb Z})' class='latex' />.</p>
<p>[<i>Update</i>, Dec 7: terminology and typos corrected.]</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[The crossing number inequality]]></title>
<link>http://terrytao.wordpress.com/2007/09/18/the-crossing-number-inequality/</link>
<pubDate>Wed, 19 Sep 2007 05:17:19 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2007/09/18/the-crossing-number-inequality/</guid>
<description><![CDATA[Today I&#8217;d like to discuss a beautiful inequality in graph theory, namely the crossing number i]]></description>
<content:encoded><![CDATA[<p>Today I&#8217;d like to discuss a beautiful inequality in graph theory, namely the <a href="http://en.wikipedia.org/wiki/Crossing_number">crossing number inequality</a>. This inequality gives a useful bound on how far a given graph is from being planar, and has a number of applications, for instance to sum-product estimates. Its proof is also an excellent example of the <a href="http://terrytao.wordpress.com/2007/09/05/amplification-arbitrage-and-the-tensor-power-trick/">amplification trick</a> in action; here the main source of amplification is the freedom to pass to subobjects, which is a freedom which I didn&#8217;t touch upon in the previous post on amplification. The crossing number inequality (and its proof) is well known among graph theorists but perhaps not among the wider mathematical community, so I thought I would publicise it here.</p>
<p>In this post, when I talk about a <a href="http://en.wikipedia.org/wiki/Graph_%28mathematics%29">graph</a>, I mean an abstract collection of vertices V, together with some abstract edges E joining pairs of vertices together. We will assume that the graph is <em>undirected</em> (the edges do not have a preferred orientation), <em>loop-free</em> (an edge cannot begin and start at the same vertex), and <em>multiplicity-free</em> (any pair of vertices is joined by at most one edge). More formally, we can model all this by viewing E as a subset of <img src='http://s0.wp.com/latex.php?latex=%5Cbinom%7BV%7D%7B2%7D+%3A%3D+%5C%7B+e+%5Csubset+V%3A+%26%23124%3Be%26%23124%3B%3D2+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;binom{V}{2} := &#92;{ e &#92;subset V: &#124;e&#124;=2 &#92;}' title='&#92;binom{V}{2} := &#92;{ e &#92;subset V: &#124;e&#124;=2 &#92;}' class='latex' />, the set of 2-element subsets of V, and we view the graph G as an ordered pair G = (V,E). (The notation is set up so that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3B%5Cbinom%7BV%7D%7B2%7D%26%23124%3B+%3D+%5Cbinom%7B%26%23124%3BV%26%23124%3B%7D%7B2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;&#92;binom{V}{2}&#124; = &#92;binom{&#124;V&#124;}{2}' title='&#124;&#92;binom{V}{2}&#124; = &#92;binom{&#124;V&#124;}{2}' class='latex' />.)</p>
<p>Now one of the great features of graphs, as opposed to some other abstract maths concepts, is that they are easy to draw: the abstract vertices become dots on a plane, while the edges become line segments or curves connecting these dots. [To avoid some technicalities we do not allow these curves to pass through the dots, except if the curve is terminating at that dot.] Let us informally refer to such a concrete representation D of a graph G as a <a href="http://en.wikipedia.org/wiki/Graph_drawing">drawing</a> of that graph. Clearly, any non-trivial graph is going to have an infinite number of possible drawings. In some of these drawings, a pair of edges might cross each other; in other drawings, all edges might be disjoint (except of course at the vertices, where edges with a common endpoint are obliged to meet). If G has a drawing D of the latter type, we say that the graph G is <a href="http://en.wikipedia.org/wiki/Planar_graph">planar</a>.</p>
<p>Given an abstract graph G, or a drawing thereof, it is not always obvious as to whether that graph is planar; just because the drawing that you currently possess of G contains crossings, does not necessarily mean that <em>all</em> drawings of G do. The wonderful little web game &#8220;<a href="http://www.planarity.net/">Planarity</a>&#8221; illustrates this point excellently. Nevertheless, there are definitely graphs which are not planar; in particular the <a href="http://en.wikipedia.org/wiki/Complete_graph">complete graph </a><img src='http://s0.wp.com/latex.php?latex=K_5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='K_5' title='K_5' class='latex' /> on five vertices, and the <a href="http://en.wikipedia.org/wiki/Complete_bipartite_graph">complete bipartite graph</a> <img src='http://s0.wp.com/latex.php?latex=K_%7B3%2C3%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='K_{3,3}' title='K_{3,3}' class='latex' /> on two sets of three vertices, are non-planar.</p>
<p><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/cf/Complete_graph_K5.svg/100px-Complete_graph_K5.svg.png" /><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/1/11/Complete_bipartite_graph_K3%2C3.svg/100px-Complete_bipartite_graph_K3%2C3.svg.png" height="76" width="100" /></p>
<p>There is in fact a famous <a href="http://en.wikipedia.org/wiki/Planar_graph#Kuratowski.27s_and_Wagner.27s_theorems">theorem of Kuratowski</a> that says that these two graphs are the only &#8220;source&#8221; of non-planarity, in the sense that any non-planar graph contains (a <a href="http://en.wikipedia.org/wiki/Subdivision_%28graph_theory%29">subdivision</a> of) one of these graphs as a subgraph. (There is of course the even more famous <a href="http://en.wikipedia.org/wiki/Four_color_theorem">four-colour theorem</a> that asserts that every planar graphs is four-colourable, but this is not the topic of my post today.)</p>
<p>Intuitively, if we fix the number of vertices &#124;V&#124;, and increase the number of edges &#124;E&#124;, then the graph should become &#8220;increasingly non-planar&#8221;; conversely, if we keep the same number of edges &#124;E&#124; but spread them amongst a greater number of vertices &#124;V&#124;, then the graph should become &#8220;increasingly planar&#8221;. Is there a quantitative way to measure the &#8220;non-planarity&#8221; of a graph, and to formalise the above intuition as some sort of inequality?<br />
<!--more--></p>
<p>It turns out that there is an elegant inequality that does precisely this, known as the <a href="http://en.wikipedia.org/wiki/Crossing_number#The_crossing_number_inequality">crossing number inequality</a>. It was first discovered by <a href="http://www.ams.org/mathscinet-getitem?mr=806962">Ajtai-Chvátal-Newborn-Szemerédi</a> and <a href="http://www.ams.org/mathscinet-getitem?mr=738751">by Leighton</a> (the latter being motivated by optimising <a href="http://en.wikipedia.org/wiki/VLSI">VLSI</a> designs). Nowadays it can be proven by two elegant amplifications of <a href="http://en.wikipedia.org/wiki/Euler_characteristic">Euler&#8217;s formula</a>, as we shall see.</p>
<p>If D is a drawing of a graph G, we define cr(D) to be the total number of crossings &#8211; where pairs of edges intersect at a point, for a reason other than sharing a common vertex. (If multiple edges intersect at the same point, each pair of edges counts once.) We then define the <em>crossing number</em> cr(G) of G to be the minimal value of cr(D) as D ranges over the drawings of G. Thus for instance cr(G)=0 if and only if G is planar. One can also verify that the two graphs <img src='http://s0.wp.com/latex.php?latex=K_5&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='K_5' title='K_5' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=K_%7B3%2C3%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='K_{3,3}' title='K_{3,3}' class='latex' /> have a crossing number of 1. This quantity cr(G) will be the measure of how non-planar our graph G is. The problem is to relate this quantity in terms of the number of vertices &#124;V&#124; and the number of edges &#124;E&#124;. We of course do not expect an exact identity relating these three quantities (two graphs with the same number of vertices and edges may have a different number of crossing numbers), so will settle for good upper and lower bounds on cr(G) in terms of &#124;V&#124; and &#124;E&#124;.</p>
<p>How big can the crossing number of a graph G = (V,E) be? A trivial upper bound is cr(G) = O( &#124;E&#124;^2 ), because if we place the vertices in general position (or on a circle) and draw the edges as line segments, then every pair of edges crosses at most once. But this bound does not seem very tight; we expect to be able to find drawings in which most pairs of edges in fact do not intersect.</p>
<p>Let&#8217;s turn our attention instead to lower bounds. We of course have the trivial lower bound <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%29+%5Cgeq+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G) &#92;geq 0' title='&#92;hbox{cr}(G) &#92;geq 0' class='latex' />; can we do better? Let&#8217;s first be extremely unambitious and see when one can get the minimal possible improvement on this bound, namely <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%29+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G) &gt; 0' title='&#92;hbox{cr}(G) &gt; 0' class='latex' />. In other words, we want to find some conditions on &#124;V&#124; and &#124;E&#124; which will force G to be non-planar. We can turn this around by taking contrapositives: if G is planar, what does this tell us about &#124;V&#124; and &#124;E&#124;?</p>
<p>Here, the natural tool is <a href="http://en.wikipedia.org/wiki/Euler_characteristic">Euler&#8217;s formula</a> &#124;V&#124;-&#124;E&#124;+&#124;F&#124;=2, valid for any planar drawing, where &#124;F&#124; is the number of faces (including the unbounded face). [This is the one place where we shall really use the topological structure of the plane; the rest of the argument is combinatorial.  There are some minor issues if the graph is disconnected, or if there are vertices of degree one or zero, but these are easily dealt with.] What do we know about &#124;F&#124;? Well, every face is adjacent to at least three edges, whereas every edge is adjacent to exactly two faces. By <a href="http://en.wikipedia.org/wiki/Double_counting_%28proof_technique%29">double counting</a> the edge-face incidences, we conclude that <img src='http://s0.wp.com/latex.php?latex=3%26%23124%3BF%26%23124%3B%5Cleq+2%26%23124%3BE%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='3&#124;F&#124;&#92;leq 2&#124;E&#124;' title='3&#124;F&#124;&#92;leq 2&#124;E&#124;' class='latex' />. Eliminating &#124;F&#124;, we conclude that <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Cleq+3%26%23124%3BV%26%23124%3B-6&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;leq 3&#124;V&#124;-6' title='&#124;E&#124; &#92;leq 3&#124;V&#124;-6' class='latex' /> for all connected planar graphs with at least one cycle (and this bound is tight when the graph is triangular), which then implies <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Cleq+3%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;leq 3&#124;V&#124;' title='&#124;E&#124; &#92;leq 3&#124;V&#124;' class='latex' /> in general. Taking contrapositives, we conclude</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%29+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G) &gt; 0' title='&#92;hbox{cr}(G) &gt; 0' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%26%2362%3B+3%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &gt; 3&#124;V&#124;' title='&#124;E&#124; &gt; 3&#124;V&#124;' class='latex' />. (*)</p>
<p>Now, let us <a href="http://terrytao.wordpress.com/2007/09/05/amplification-arbitrage-and-the-tensor-power-trick/">amplify</a> this inequality by exploiting the freedom to delete edges. Indeed, observe that if a graph G can be drawn with only cr(G) crossings, then we can delete one of the crossings by removing an edge associated to that crossing, and so we can remove all the crossings by deleting at most cr(G) edges, leaving at least &#124;E&#124;-cr(G) edges (and &#124;V&#124; vertices). Combining this with (*) we see that regardless of the number of crossings, we have</p>
<p><img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B-%5Chbox%7Bcr%7D%28G%29+%5Cleq+3%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124;-&#92;hbox{cr}(G) &#92;leq 3&#124;V&#124;' title='&#124;E&#124;-&#92;hbox{cr}(G) &#92;leq 3&#124;V&#124;' class='latex' /></p>
<p>leading to the following amplification of (*):</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%29+%5Cgeq+%26%23124%3BE%26%23124%3B+-+3%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G) &#92;geq &#124;E&#124; - 3&#124;V&#124;' title='&#92;hbox{cr}(G) &#92;geq &#124;E&#124; - 3&#124;V&#124;' class='latex' />. (**)</p>
<p>This is not the best bound, though, as one can already suspect by comparing (**) with the crude upper bound <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%29+%3D+O%28%26%23124%3BE%26%23124%3B%5E2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G) = O(&#124;E&#124;^2)' title='&#92;hbox{cr}(G) = O(&#124;E&#124;^2)' class='latex' />. We can amplify (**) further by exploiting a second freedom, namely the ability to delete vertices. One could try the same sort of trick as before, deleting vertices which are associated to a crossing, but this turns out to be very inefficient (because deleting vertices also deletes an unknown number of edges, many of which had nothing to do with the crossing). Indeed, it would seem that one would have to be fiendishly clever to find an efficient way to delete a lot of crossings by deleting only very few vertices.</p>
<p>However, there is an amazing (and unintuitive) principle in combinatorics which states that when there is no obvious &#8220;best&#8221; choice for some combinatorial object (such as a set of vertices to delete), then often trying a <em>random</em> choice will give a reasonable answer, if the notion of &#8220;random&#8221; is chosen carefully. (See <a href="http://www.dpmms.cam.ac.uk/~wtg10/2cultures.pdf">this paper of Gowers </a>for some further discussion of this principle.) The application of this principle is known as the <em>probabilistic method</em>, first introduced by Erdős.</p>
<p>Here is how it works in this current setting. Let <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+p+%5Cleq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; p &#92;leq 1' title='0 &lt; p &#92;leq 1' class='latex' /> be a parameter to be chosen later. We will randomly delete all but a fraction p of the vertices, by letting each vertex be deleted with an independent probability of 1-p (and thus surviving with a probability of p). Let V&#8217; be the set of vertices that remain. Once one deletes vertices, one also has to delete the edges attached to these vertices; let E&#8217; denote the surviving edges (i.e. the edges connected to vertices in V&#8217;). Let G&#8217;=(V&#8217;,E&#8217;) be the surviving graph (known as the subgraph of G <a href="http://en.wikipedia.org/wiki/Glossary_of_graph_theory#Subgraphs">induced</a> by V&#8217;). Then from (**) we have</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%27%29+%5Cgeq+%26%23124%3BE%27%26%23124%3B+-+3%26%23124%3BV%27%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G&#039;) &#92;geq &#124;E&#039;&#124; - 3&#124;V&#039;&#124;' title='&#92;hbox{cr}(G&#039;) &#92;geq &#124;E&#039;&#124; - 3&#124;V&#039;&#124;' class='latex' />.</p>
<p>Now, how do we get from this back to the original graph G = (V,E)? The quantities &#124;V&#8217;&#124;, &#124;E&#8217;&#124;, and cr(G&#8217;) all fluctuate randomly, and are difficult to compute. However, their <em>expectations</em> are much easier to compute. Accordingly, we take expectations of both sides (this is an example of the <em>first moment method</em>). Using linearity of expectation, we have</p>
<p><img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D%28%5Chbox%7Bcr%7D%28G%27%29%29+%5Cgeq+%7B%5CBbb+E%7D%28%26%23124%3BE%27%26%23124%3B%29+-+3+%7B%5CBbb+E%7D%28%26%23124%3BV%27%26%23124%3B%29.&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}(&#92;hbox{cr}(G&#039;)) &#92;geq {&#92;Bbb E}(&#124;E&#039;&#124;) - 3 {&#92;Bbb E}(&#124;V&#039;&#124;).' title='{&#92;Bbb E}(&#92;hbox{cr}(G&#039;)) &#92;geq {&#92;Bbb E}(&#124;E&#039;&#124;) - 3 {&#92;Bbb E}(&#124;V&#039;&#124;).' class='latex' /></p>
<p>These quantities are all relatively easy to compute. The easiest is <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D%28%26%23124%3BV%27%26%23124%3B%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}(&#124;V&#039;&#124;)' title='{&#92;Bbb E}(&#124;V&#039;&#124;)' class='latex' />. Each vertex in V has a probability p of ending up in V&#8217;, and thus contributing 1 to &#124;V&#8217;&#124;. Summing up (using linearity of expectation again), we obtain <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D%28%26%23124%3BV%27%26%23124%3B%29+%3D+p%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}(&#124;V&#039;&#124;) = p&#124;V&#124;' title='{&#92;Bbb E}(&#124;V&#039;&#124;) = p&#124;V&#124;' class='latex' />.</p>
<p>The quantity <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D%28%26%23124%3BE%27%26%23124%3B%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}(&#124;E&#039;&#124;)' title='{&#92;Bbb E}(&#124;E&#039;&#124;)' class='latex' /> is almost as easy to compute. Each edge e in E will have a probability <img src='http://s0.wp.com/latex.php?latex=p%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^2' title='p^2' class='latex' /> of ending up in E&#8217;, since both vertices have an independent probability of p of surviving. Summing up, we obtain <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D%28%26%23124%3BE%27%26%23124%3B%29+%3D+p%5E2+%26%23124%3BE%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}(&#124;E&#039;&#124;) = p^2 &#124;E&#124;' title='{&#92;Bbb E}(&#124;E&#039;&#124;) = p^2 &#124;E&#124;' class='latex' />. (The events that each edge ends up in E&#8217; are not quite independent, but the great thing about linearity of expectation is that it works even without assuming any independence.)</p>
<p>Finally, we turn to <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D%28%5Chbox%7Bcr%7D%28G%27%29%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E}(&#92;hbox{cr}(G&#039;))' title='{&#92;Bbb E}(&#92;hbox{cr}(G&#039;))' class='latex' />. Let us draw G in the optimal way, with exactly cr(G) crossings. Observe that each crossing involves two edges and four vertices. (If the two edges involved in a crossing share a common vertex as well, thus forming an <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> shape, one can reduce the number of crossings by 1 by swapping the two halves of the loop in the <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' /> shape. So with the optimal drawing, the edges in a crossing do not share any vertices in common.) Passing to G&#8217;, we see that the probability that the crossing survives in this drawing is only <img src='http://s0.wp.com/latex.php?latex=p%5E4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^4' title='p^4' class='latex' />. By one last application of linearity of expectation, the expected number of crossings of this diagram that survive for G&#8217; is <img src='http://s0.wp.com/latex.php?latex=p%5E4+%5Chbox%7Bcr%7D%28G%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^4 &#92;hbox{cr}(G)' title='p^4 &#92;hbox{cr}(G)' class='latex' />. This particular diagram may not be the optimal one for G&#8217;, so we end up with an inequality <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D+%5Chbox%7Bcr%7D%28G%27%29+%5Cleq+p%5E4+%5Chbox%7Bcr%7D%28G%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E} &#92;hbox{cr}(G&#039;) &#92;leq p^4 &#92;hbox{cr}(G)' title='{&#92;Bbb E} &#92;hbox{cr}(G&#039;) &#92;leq p^4 &#92;hbox{cr}(G)' class='latex' />. Fortunately for us, this inequality goes in the right direction, and we get a useful inequality:</p>
<p><img src='http://s0.wp.com/latex.php?latex=p%5E4+%5Chbox%7Bcr%7D%28G%29+%5Cgeq+p%5E2+%26%23124%3BE%26%23124%3B+-+3+p+%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^4 &#92;hbox{cr}(G) &#92;geq p^2 &#124;E&#124; - 3 p &#124;V&#124;' title='p^4 &#92;hbox{cr}(G) &#92;geq p^2 &#124;E&#124; - 3 p &#124;V&#124;' class='latex' />.</p>
<p>In terms of cr(G), we have</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%29+%5Cgeq+p%5E%7B-2%7D+%26%23124%3BE%26%23124%3B+-+3+p%5E%7B-3%7D+%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G) &#92;geq p^{-2} &#124;E&#124; - 3 p^{-3} &#124;V&#124;' title='&#92;hbox{cr}(G) &#92;geq p^{-2} &#124;E&#124; - 3 p^{-3} &#124;V&#124;' class='latex' />.</p>
<p>To finish the amplification, we need to optimise in p, subject of course to the restriction <img src='http://s0.wp.com/latex.php?latex=0+%26%2360%3B+p+%5Cleq+1&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='0 &lt; p &#92;leq 1' title='0 &lt; p &#92;leq 1' class='latex' />, since p is a probability. One can solve the optimisation problem exactly, but we will perform a cheaper computation by settling for a bound which is close to the optimal bound rather than exactly equal to it. A general principle is that optima are often obtained when two of the terms are roughly in balance. A bit of thought reveals that it might be particularly good to have <img src='http://s0.wp.com/latex.php?latex=3+p%5E%7B-3%7D+%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='3 p^{-3} &#124;V&#124;' title='3 p^{-3} &#124;V&#124;' class='latex' /> just barely smaller than <img src='http://s0.wp.com/latex.php?latex=p%5E%7B-2%7D+%26%23124%3BE%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p^{-2} &#124;E&#124;' title='p^{-2} &#124;E&#124;' class='latex' />. (If it is a lot smaller, then p will be large, and we don&#8217;t get a good bound on the right. If instead <img src='http://s0.wp.com/latex.php?latex=3+p%5E%7B-3%7D+%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='3 p^{-3} &#124;V&#124;' title='3 p^{-3} &#124;V&#124;' class='latex' /> is a lot bigger, then we are likely to have a negative right-hand side.) For instance, we could choose p so that <img src='http://s0.wp.com/latex.php?latex=4+p%5E%7B-3%7D+%26%23124%3BV%26%23124%3B+%3D+p%5E%7B-2%7D+%26%23124%3BE%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='4 p^{-3} &#124;V&#124; = p^{-2} &#124;E&#124;' title='4 p^{-3} &#124;V&#124; = p^{-2} &#124;E&#124;' class='latex' />; this is legal as long as <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Cgeq+4%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;geq 4&#124;V&#124;' title='&#124;E&#124; &#92;geq 4&#124;V&#124;' class='latex' />. Substituting this we obtain the <em>crossing number inequality</em></p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%29+%5Cgeq+%5Cfrac%7B%26%23124%3BE%26%23124%3B%5E3%7D%7B64+%26%23124%3BV%26%23124%3B%5E2%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G) &#92;geq &#92;frac{&#124;E&#124;^3}{64 &#124;V&#124;^2}' title='&#92;hbox{cr}(G) &#92;geq &#92;frac{&#124;E&#124;^3}{64 &#124;V&#124;^2}' class='latex' /> whenever <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Cgeq+4%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;geq 4&#124;V&#124;' title='&#124;E&#124; &#92;geq 4&#124;V&#124;' class='latex' />. (***)</p>
<p>This is quite a strong amplification of (*) or (**) (except in the transition region in which &#124;E&#124; is comparable to &#124;V&#124;).</p>
<p>Is it sharp? We can compare it against the trivial bound <img src='http://s0.wp.com/latex.php?latex=%5Chbox%7Bcr%7D%28G%29+%3D+O%28%26%23124%3BE%26%23124%3B%5E2%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hbox{cr}(G) = O(&#124;E&#124;^2)' title='&#92;hbox{cr}(G) = O(&#124;E&#124;^2)' class='latex' />, and we observe that the two bounds match up to constants when &#124;E&#124; is comparable to <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BV%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;V&#124;^2' title='&#124;V&#124;^2' class='latex' />. (Clearly, &#124;E&#124; cannot be larger than <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BV%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;V&#124;^2' title='&#124;V&#124;^2' class='latex' />.) So the crossing number inequality is sharp (up to constants) for dense graphs, such as the complete graph <img src='http://s0.wp.com/latex.php?latex=K_n&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='K_n' title='K_n' class='latex' /> on n vertices.</p>
<p>Are there any other cases where it is sharp? We can answer this by appealing to the symmetries of (***). By the nature of its proof, the inequality is basically symmetric under passage to random induced subgraphs, but this symmetry does not give any further examples, because random induced subgraphs of dense graphs again tend to be dense graphs (cf. the computation of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D+%26%23124%3BV%27%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E} &#124;V&#039;&#124;' title='{&#92;Bbb E} &#124;V&#039;&#124;' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+E%7D+%26%23124%3BE%27%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb E} &#124;E&#039;&#124;' title='{&#92;Bbb E} &#124;E&#039;&#124;' class='latex' /> above). But there is a second symmetry of (***) available, namely that of <em>replication</em>. If one takes k disjoint copies of a graph G = (V,E), one gets a new graph with k&#124;V&#124; vertices and k&#124;E&#124; edges, and a moment&#8217;s thought will reveal that the new graph has a crossing number of k cr(G). Thus replication is a symmetry of (***). Thus, (***) is also sharp up to constants for replicated dense graphs. It is not hard to see that these examples basically cover all possibilities of &#124;V&#124; and &#124;E&#124; for which <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BE%26%23124%3B+%5Cgeq+4%26%23124%3BV%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;E&#124; &#92;geq 4&#124;V&#124;' title='&#124;E&#124; &#92;geq 4&#124;V&#124;' class='latex' />. Thus the crossing number inequality cannot be improved except for the constants. (The best constants known currently can be found in a recent paper of <a href="http://www.ams.org/mathscinet-getitem?mr=2267545">Pach, Radoicic, Tardos, and Tóth</a>.)</p>
<p>[A general principle, by the way, is that one can roughly gauge the "strength" of an inequality by the number of independent symmetries (or approximate symmetries) it has. If for instance there is a three-parameter family of symmetries, then any example that demonstrates that sharpness of that inequality is immediately amplified to a three-parameter family of such examples (unless of course the example is fixed by a significant portion of these symmetries). The more examples that show an inequality is sharp, the more efficient it is - and the harder it is to prove, since one cannot afford to lose anything (other than perhaps some constants) in every one of the sharp example cases. This principle is of course consistent with my <a href="http://terrytao.wordpress.com/2007/09/05/amplification-arbitrage-and-the-tensor-power-trick/">previous post </a>on arbitraging a weak asymmetric inequality into a strong symmetric one.]</p>
<p>&#8211; Application: the Szemerédi-Trotter theorem &#8211;</p>
<p>It was noticed <a href="http://www.ams.org/mathscinet-getitem?mr=1464571">by Székely</a> that the crossing number is powerful enough to give easy proofs of several difficult inequalities in combinatorial <a href="http://en.wikipedia.org/wiki/Incidence_geometry">incidence geometry</a>. For instance, the <a href="http://en.wikipedia.org/wiki/Szemer%C3%A9di-Trotter_theorem">Szemerédi-Trotter theorem</a> concerns the number of <a href="http://en.wikipedia.org/wiki/Incidence_%28geometry%29">incidences</a> <img src='http://s0.wp.com/latex.php?latex=I%28P%2CL%29+%3A%3D+%26%23124%3B%5C%7B+%28p%2Cl%29+%5Cin+P+%5Ctimes+L%3A+p+%5Cin+l+%5C%7D%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='I(P,L) := &#124;&#92;{ (p,l) &#92;in P &#92;times L: p &#92;in l &#92;}&#124;' title='I(P,L) := &#124;&#92;{ (p,l) &#92;in P &#92;times L: p &#92;in l &#92;}&#124;' class='latex' /> between a finite collection of points P and lines L in the plane. For instance, the three lines and three points of a triangle form six incidences; the five lines and ten points of a pentagram form 20 incidences; and so forth.</p>
<p>One can ask the question: given &#124;P&#124; points and &#124;L&#124; lines, what is the maximum number of incidences I(P,L) one can form between these points and lines? (The minimum number is obviously 0, which is a boring answer.) The trivial bound is <img src='http://s0.wp.com/latex.php?latex=I%28P%2CL%29+%5Cleq+%26%23124%3BP%26%23124%3B+%26%23124%3BL%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='I(P,L) &#92;leq &#124;P&#124; &#124;L&#124;' title='I(P,L) &#92;leq &#124;P&#124; &#124;L&#124;' class='latex' />, but one can do better than this, because it is not possible for every point to lie on every line. Indeed, if we use nothing more than the axiom that every two points determine at most one line, combined with the Cauchy-Schwarz inequality, it is not hard to show (by double-counting the space of triples <img src='http://s0.wp.com/latex.php?latex=%28p%2Cp%27%2Cl%29+%5Cin+P+%5Ctimes+P+%5Ctimes+L&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='(p,p&#039;,l) &#92;in P &#92;times P &#92;times L' title='(p,p&#039;,l) &#92;in P &#92;times P &#92;times L' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=p%2C+p%27+%5Cin+l&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='p, p&#039; &#92;in l' title='p, p&#039; &#92;in l' class='latex' />) that</p>
<p><img src='http://s0.wp.com/latex.php?latex=%26%23124%3BI%28P%2CL%29%26%23124%3B+%5Cleq+%26%23124%3BP%26%23124%3B+%26%23124%3BL%26%23124%3B%5E%7B1%2F2%7D+%2B+%26%23124%3BL%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;I(P,L)&#124; &#92;leq &#124;P&#124; &#124;L&#124;^{1/2} + &#124;L&#124;' title='&#124;I(P,L)&#124; &#92;leq &#124;P&#124; &#124;L&#124;^{1/2} + &#124;L&#124;' class='latex' />. (****)</p>
<p>Dually, using the axiom that two lines intersect in at most one point, we obtain</p>
<p><img src='http://s0.wp.com/latex.php?latex=%26%23124%3BI%28P%2CL%29%26%23124%3B+%5Cleq+%26%23124%3BL%26%23124%3B+%26%23124%3BP%26%23124%3B%5E%7B1%2F2%7D+%2B+%26%23124%3BP%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;I(P,L)&#124; &#92;leq &#124;L&#124; &#124;P&#124;^{1/2} + &#124;P&#124;' title='&#124;I(P,L)&#124; &#92;leq &#124;L&#124; &#124;P&#124;^{1/2} + &#124;P&#124;' class='latex' />.</p>
<p>(One can also deduce one inequality from the other by <a href="http://en.wikipedia.org/wiki/Duality_(projective_geometry)">projective duality</a>.)</p>
<p>Can one do better? The answer is yes, if we observe that a configuration of points and lines naturally determines a drawing of a graph, to which the crossing number can be applied. To see this, assume temporarily that every line in L is incident to at least two points in P. A line l in L which is incident to k points in P will thus contain k-1 line segments in P; k-1 is comparable to k. Since the sum of all the k is I(P,L) by definition, we see that there are roughly I(P,L) line segments of L connecting adjacent points in P; this is a diagram with &#124;P&#124; vertices and roughly &#124;I(P,L)&#124; edges. On the other hand, a crossing in this diagram can only occur when two lines in L intersect. Since two lines intersect in at most one point, the total number of crossings is O(&#124;L&#124;^2). Applying the crossing number inequality (***), we obtain</p>
<p><img src='http://s0.wp.com/latex.php?latex=%26%23124%3BL%26%23124%3B%5E2+%5Cgg+I%28P%2CL%29%5E3%2F%26%23124%3BP%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;L&#124;^2 &#92;gg I(P,L)^3/&#124;P&#124;^2' title='&#124;L&#124;^2 &#92;gg I(P,L)^3/&#124;P&#124;^2' class='latex' /> if I(P,L) is much larger than &#124;P&#124;</p>
<p>which leads to</p>
<p><img src='http://s0.wp.com/latex.php?latex=I%28P%2CL%29+%3D+O%28+%26%23124%3BL%26%23124%3B%5E%7B2%2F3%7D+%26%23124%3BP%26%23124%3B%5E%7B2%2F3%7D+%2B+%26%23124%3BP%26%23124%3B+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='I(P,L) = O( &#124;L&#124;^{2/3} &#124;P&#124;^{2/3} + &#124;P&#124; )' title='I(P,L) = O( &#124;L&#124;^{2/3} &#124;P&#124;^{2/3} + &#124;P&#124; )' class='latex' />.</p>
<p>We can then remove our temporary assumption that lines in L are incident to at least two points, by observing that lines that are incident to at most one point will only contribute O(&#124;L&#124;) incidences, leading to the <em>Szemerédi-Trotter theorem</em></p>
<p><img src='http://s0.wp.com/latex.php?latex=I%28P%2CL%29+%3D+O%28+%26%23124%3BL%26%23124%3B%5E%7B2%2F3%7D+%26%23124%3BP%26%23124%3B%5E%7B2%2F3%7D+%2B+%26%23124%3BP%26%23124%3B+%2B+%26%23124%3BL%26%23124%3B%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='I(P,L) = O( &#124;L&#124;^{2/3} &#124;P&#124;^{2/3} + &#124;P&#124; + &#124;L&#124;)' title='I(P,L) = O( &#124;L&#124;^{2/3} &#124;P&#124;^{2/3} + &#124;P&#124; + &#124;L&#124;)' class='latex' />.</p>
<p>This bound is somewhat stronger than the previous bounds, and is in fact surprisingly sharp; a typical example that demonstrates this is when P is the lattice <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C%5Cldots%2CN%5C%7D+%5Ctimes+%5C%7B1%2C%5Cldots%2CN%5E2%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{1,&#92;ldots,N&#92;} &#92;times &#92;{1,&#92;ldots,N^2&#92;}' title='&#92;{1,&#92;ldots,N&#92;} &#92;times &#92;{1,&#92;ldots,N^2&#92;}' class='latex' /> and L is the set of lines <img src='http://s0.wp.com/latex.php?latex=%5C%7B+%28x%2Cy%29%3A+y%3Dmx%2Bb%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ (x,y): y=mx+b&#92;}' title='&#92;{ (x,y): y=mx+b&#92;}' class='latex' /> with slope <img src='http://s0.wp.com/latex.php?latex=m+%5Cin+%5C%7B1%2C%5Cldots%2CN%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='m &#92;in &#92;{1,&#92;ldots,N&#92;}' title='m &#92;in &#92;{1,&#92;ldots,N&#92;}' class='latex' /> and intercept <img src='http://s0.wp.com/latex.php?latex=b+%5Cin+%5C%7B1%2C%5Cldots%2CN%5E2%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='b &#92;in &#92;{1,&#92;ldots,N^2&#92;}' title='b &#92;in &#92;{1,&#92;ldots,N^2&#92;}' class='latex' />; here <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BP%26%23124%3B%3D%26%23124%3BL%26%23124%3B%3DN%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;P&#124;=&#124;L&#124;=N^3' title='&#124;P&#124;=&#124;L&#124;=N^3' class='latex' /> and the number of incidences is roughly <img src='http://s0.wp.com/latex.php?latex=N%5E4&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='N^4' title='N^4' class='latex' />.</p>
<p>The original proof of this theorem, by the way, proceeded by amplifying (****) using the method of <em>cell decomposition</em>; it is thus somewhat similar in spirit to Szekély&#8217;s proof, but was a bit more complicated technically. <a href="http://www.ams.org/mathscinet-getitem?mr=1660476">Wolff conjectured</a> a continuous version of this theorem for fractal sets, sometimes called the <em>Furstenberg set conjecture</em>, and related to the <a href="http://en.wikipedia.org/wiki/Kakeya_conjecture">Kakeya conjecture</a>; a small amount of progress beyond the analogue of (****) is known, thanks to work of <a href="http://www.ams.org/mathscinet-getitem?mr=1856956">Katz and myself</a> and <a href="http://www.ams.org/mathscinet-getitem?mr=1982147">of Bourgain</a>, but we are still far from the best possible result here.</p>
<p>&#8211; Application: sum-product estimates &#8211;</p>
<p>One striking application of the Szemerédi-Trotter theorem (and by extension, the crossing number inequality) is to the arena of <em>sum-product </em>estimates in additive combinatorics, which is currently a very active area of research, especially in finite fields, due to its connections with some long-standing problems in analytic number theory, as well as to some computer science problems concerning randomness extraction and expander graphs. However, our focus here will be on the more classical setting of sum-product estimates in the real line <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}' title='{&#92;Bbb R}' class='latex' />.</p>
<p>Let A be a finite non-empty set of non-zero real numbers. We can form the <em>sum set</em></p>
<p><img src='http://s0.wp.com/latex.php?latex=A%2BA+%3A%3D+%5C%7B+a+%2B+b%3A+a%2C+b+%5Cin+A+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A := &#92;{ a + b: a, b &#92;in A &#92;}' title='A+A := &#92;{ a + b: a, b &#92;in A &#92;}' class='latex' /></p>
<p>and the <em>product set</em></p>
<p><img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A+%3D+%5C%7B+ab%3A+a%2C+b+%5Cin+A+%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A = &#92;{ ab: a, b &#92;in A &#92;}' title='A &#92;cdot A = &#92;{ ab: a, b &#92;in A &#92;}' class='latex' />.</p>
<p>If A is in &#8220;general position&#8221;, it is not hard to see that <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> both have cardinality comparable to <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124;^2' title='&#124;A&#124;^2' class='latex' />. However, in certain cases one can make one or the other sets significantly smaller. For instance, if A is an arithmetic progression <img src='http://s0.wp.com/latex.php?latex=%5C%7B+a%2C+a%2Br%2C+%5Cldots%2C+a%2B%28k-1%29r%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ a, a+r, &#92;ldots, a+(k-1)r&#92;}' title='&#92;{ a, a+r, &#92;ldots, a+(k-1)r&#92;}' class='latex' />, then the sum set <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> has cardinality comparable to just &#124;A&#124;. Similarly, if A is a geometric progression <img src='http://s0.wp.com/latex.php?latex=%5C%7B+a%2C+ar%2C+%5Cldots%2C+ar%5E%7Bk-1%7D%5C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;{ a, ar, &#92;ldots, ar^{k-1}&#92;}' title='&#92;{ a, ar, &#92;ldots, ar^{k-1}&#92;}' class='latex' />, then the product set <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> has cardinality comparable to &#124;A&#124;. But clearly A cannot be an arithmetic progression and a geometric progression at the same time (unless it is very short). So one might conjecture that at least one of the sum set and product set should be significantly larger than A. Informally, this is saying that no finite set of reals can behave much like a subring of <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}' title='{&#92;Bbb R}' class='latex' />.</p>
<p>This intuition was made precise <a href="http://www.ams.org/mathscinet-getitem?mr=820223">by Erdős and Szemerédi</a>, who established the lower bound</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cmax%28+%26%23124%3BA%2BA%26%23124%3B%2C+%26%23124%3BA+%5Ccdot+A%26%23124%3B+%29+%5Cgg+%26%23124%3BA%26%23124%3B%5E%7B1%2Bc%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;max( &#124;A+A&#124;, &#124;A &#92;cdot A&#124; ) &#92;gg &#124;A&#124;^{1+c}' title='&#92;max( &#124;A+A&#124;, &#124;A &#92;cdot A&#124; ) &#92;gg &#124;A&#124;^{1+c}' class='latex' /></p>
<p>for some small <img src='http://s0.wp.com/latex.php?latex=c%26%2362%3B0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='c&gt;0' title='c&gt;0' class='latex' /> which they did not make explicit. They then conjectured that in fact c should be made arbitrary close to the optimal value of 1, and more precisely that</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cmax%28+%26%23124%3BA%2BA%26%23124%3B%2C+%26%23124%3BA+%5Ccdot+A%26%23124%3B+%29+%5Cgg+%26%23124%3BA%26%23124%3B%5E2+%5Cexp%28+-+%5Cdelta+%5Clog+%26%23124%3BA%26%23124%3B%2F%5Clog%5Clog%26%23124%3BA%26%23124%3B+%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;max( &#124;A+A&#124;, &#124;A &#92;cdot A&#124; ) &#92;gg &#124;A&#124;^2 &#92;exp( - &#92;delta &#92;log &#124;A&#124;/&#92;log&#92;log&#124;A&#124; )' title='&#92;max( &#124;A+A&#124;, &#124;A &#92;cdot A&#124; ) &#92;gg &#124;A&#124;^2 &#92;exp( - &#92;delta &#92;log &#124;A&#124;/&#92;log&#92;log&#124;A&#124; )' class='latex' /></p>
<p>for large &#124;A&#124; and some absolute constant <img src='http://s0.wp.com/latex.php?latex=%5Cdelta+%26%2362%3B+0&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;delta &gt; 0' title='&#92;delta &gt; 0' class='latex' />.</p>
<p>The Erdős-Szemerédi conjecture remains open, however the value of c has been improved; currently, the best bound is due <a href="http://www.ams.org/mathscinet-getitem?mr=2143727">to Solymosi</a>, who showed that c can be arbitrarily close to 3/11. Solymosi&#8217;s argument is based on an earlier argument <a href="http://www.ams.org/mathscinet-getitem?mr=1472816">of Elekes</a>, who obtained c=1/4 by a short and elegant argument based on the Szemerédi-Trotter theorem which we will now present. The basic connection between the two problems stems from the familiar formula y=mx+b for a line, which clearly encodes a multiplicative and additive structure. We already used this connection implicitly in the example that demonstrated that the Szemerédi-Trotter theorem was sharp. For Elekes&#8217; argument, the challenge is to show that if <img src='http://s0.wp.com/latex.php?latex=A%2BA&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A+A' title='A+A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=A+%5Ccdot+A&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A &#92;cdot A' title='A &#92;cdot A' class='latex' /> are both small, then a suitable family of lines <img src='http://s0.wp.com/latex.php?latex=y%3Dmx%2Bb&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='y=mx+b' title='y=mx+b' class='latex' /> associated to A will have a high number of incidences with some set of points associated to A, so that the Szemerédi-Trotter may then be profitably applied. It is not immediately obvious exactly how to do this, but Elekes settled upon the choice of letting <img src='http://s0.wp.com/latex.php?latex=P+%3A%3D+%28A+%5Ccdot+A%29+%5Ctimes+%28A+%2B+A%29&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='P := (A &#92;cdot A) &#92;times (A + A)' title='P := (A &#92;cdot A) &#92;times (A + A)' class='latex' />, and letting L be the space of lines y=mx+b with slope in <img src='http://s0.wp.com/latex.php?latex=A%5E%7B-1%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='A^{-1}' title='A^{-1}' class='latex' /> and intercept in A, thus <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BP%26%23124%3B+%3D+%26%23124%3BA%2BA%26%23124%3B+%26%23124%3BA+%5Ccdot+A%26%23124%3B&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;P&#124; = &#124;A+A&#124; &#124;A &#92;cdot A&#124;' title='&#124;P&#124; = &#124;A+A&#124; &#124;A &#92;cdot A&#124;' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BL%26%23124%3B+%3D+%26%23124%3BA%26%23124%3B%5E2&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;L&#124; = &#124;A&#124;^2' title='&#124;L&#124; = &#124;A&#124;^2' class='latex' />. One observes that each line in L is incident to &#124;A&#124; points in P, leading to <img src='http://s0.wp.com/latex.php?latex=%26%23124%3BA%26%23124%3B%5E3&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#124;A&#124;^3' title='&#124;A&#124;^3' class='latex' /> incidences. Applying the Szemerédi-Trotter theorem and doing the algebra one eventually concludes that <img src='http://s0.wp.com/latex.php?latex=%5Cmax%28+%26%23124%3BA%2BA%26%23124%3B%2C+%26%23124%3BA+%5Ccdot+A%26%23124%3B+%29+%5Cgg+%26%23124%3BA%26%23124%3B%5E%7B5%2F4%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;max( &#124;A+A&#124;, &#124;A &#92;cdot A&#124; ) &#92;gg &#124;A&#124;^{5/4}' title='&#92;max( &#124;A+A&#124;, &#124;A &#92;cdot A&#124; ) &#92;gg &#124;A&#124;^{5/4}' class='latex' />. (A more elementary proof of this inequality, not relying on the Szemerédi-Trotter theorem or crossing number bounds, and thus having the advantage on working on other archimedean fields such as <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+C%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb C}' title='{&#92;Bbb C}' class='latex' />, was subsequently found <a href="http://www.ams.org/mathscinet-getitem?mr=2212132">by Solymosi</a>, but the best bounds on the sum-product problem in <img src='http://s0.wp.com/latex.php?latex=%7B%5CBbb+R%7D&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;Bbb R}' title='{&#92;Bbb R}' class='latex' /> still rely very much on the Szemerédi-Trotter inequality.)</p>
]]></content:encoded>
</item>

</channel>
</rss>
