<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>large-deviation-inequality &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/large-deviation-inequality/</link>
	<description>Feed of posts on WordPress.com tagged "large-deviation-inequality"</description>
	<pubDate>Fri, 24 May 2013 06:10:57 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[Talagrand's concentration inequality]]></title>
<link>http://terrytao.wordpress.com/2009/06/09/talagrands-concentration-inequality/</link>
<pubDate>Wed, 10 Jun 2009 00:01:55 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2009/06/09/talagrands-concentration-inequality/</guid>
<description><![CDATA[In the theory of discrete random matrices (e.g. matrices whose entries are random signs ), one often]]></description>
<content:encoded><![CDATA[<p>
 In the theory of discrete random matrices (e.g. matrices whose entries are random signs <img src='http://s0.wp.com/latex.php?latex=%7B%5Cpm+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;pm 1}&amp;fg=000000' title='{&#92;pm 1}&amp;fg=000000' class='latex' />), one often encounters the problem of understanding the distribution of the random variable <img src='http://s0.wp.com/latex.php?latex=%7B%5Chbox%7Bdist%7D%28X%2CV%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hbox{dist}(X,V)}&amp;fg=000000' title='{&#92;hbox{dist}(X,V)}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7BX+%3D+%28x_1%2C%5Cldots%2Cx_n%29+%5Cin+%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X = (x_1,&#92;ldots,x_n) &#92;in &#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{X = (x_1,&#92;ldots,x_n) &#92;in &#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' /> is an <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />-dimensional random sign vector (so <img src='http://s0.wp.com/latex.php?latex=%7BX%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X}&amp;fg=000000' title='{X}&amp;fg=000000' class='latex' /> is uniformly distributed in the discrete cube <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' />), and <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> is some <img src='http://s0.wp.com/latex.php?latex=%7Bd%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{d}&amp;fg=000000' title='{d}&amp;fg=000000' class='latex' />-dimensional subspace of <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cbf+R%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;bf R}^n}&amp;fg=000000' title='{{&#92;bf R}^n}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7B0+%5Cleq+d+%5Cleq+n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &#92;leq d &#92;leq n}&amp;fg=000000' title='{0 &#92;leq d &#92;leq n}&amp;fg=000000' class='latex' />.
</p>
<p>
It is not hard to compute the second moment of this random variable. Indeed, if <img src='http://s0.wp.com/latex.php?latex=%7BP+%3D+%28p_%7Bij%7D%29_%7B1+%5Cleq+i%2Cj+%5Cleq+n%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P = (p_{ij})_{1 &#92;leq i,j &#92;leq n}}&amp;fg=000000' title='{P = (p_{ij})_{1 &#92;leq i,j &#92;leq n}}&amp;fg=000000' class='latex' /> denotes the orthogonal projection matrix from <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cbf+R%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;bf R}^n}&amp;fg=000000' title='{{&#92;bf R}^n}&amp;fg=000000' class='latex' /> to the orthogonal complement <img src='http://s0.wp.com/latex.php?latex=%7BV%5E%5Cperp%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V^&#92;perp}&amp;fg=000000' title='{V^&#92;perp}&amp;fg=000000' class='latex' /> of <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' />, then one observes that </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Chbox%7Bdist%7D%28X%2CV%29%5E2+%3D+X+%5Ccdot+P+X+%3D+%5Csum_%7Bi%3D1%7D%5En+%5Csum_%7Bj%3D1%7D%5En+x_i+x_j+p_%7Bij%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;hbox{dist}(X,V)^2 = X &#92;cdot P X = &#92;sum_{i=1}^n &#92;sum_{j=1}^n x_i x_j p_{ij}&amp;fg=000000' title='&#92;displaystyle  &#92;hbox{dist}(X,V)^2 = X &#92;cdot P X = &#92;sum_{i=1}^n &#92;sum_{j=1}^n x_i x_j p_{ij}&amp;fg=000000' class='latex' /></p>
<p> and so upon taking expectations we see that <a name="secmoment">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5Chbox%7Bdist%7D%28X%2CV%29%5E2+%3D+%5Csum_%7Bi%3D1%7D%5En+p_%7Bii%7D+%3D+%5Chbox%7Btr%7D+P+%3D+n-d+%5C+%5C+%5C+%5C+%5C+%281%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;hbox{dist}(X,V)^2 = &#92;sum_{i=1}^n p_{ii} = &#92;hbox{tr} P = n-d &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;hbox{dist}(X,V)^2 = &#92;sum_{i=1}^n p_{ii} = &#92;hbox{tr} P = n-d &#92; &#92; &#92; &#92; &#92; (1)&amp;fg=000000' class='latex' /></p>
<p></a> since <img src='http://s0.wp.com/latex.php?latex=%7BP%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{P}&amp;fg=000000' title='{P}&amp;fg=000000' class='latex' /> is a rank <img src='http://s0.wp.com/latex.php?latex=%7Bn-d%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n-d}&amp;fg=000000' title='{n-d}&amp;fg=000000' class='latex' /> orthogonal projection. So we expect <img src='http://s0.wp.com/latex.php?latex=%7B%5Chbox%7Bdist%7D%28X%2CV%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hbox{dist}(X,V)}&amp;fg=000000' title='{&#92;hbox{dist}(X,V)}&amp;fg=000000' class='latex' /> to be about <img src='http://s0.wp.com/latex.php?latex=%7B%5Csqrt%7Bn-d%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;sqrt{n-d}}&amp;fg=000000' title='{&#92;sqrt{n-d}}&amp;fg=000000' class='latex' /> on the average.</p>
<p>
In fact, one has sharp concentration around this value, in the sense that <img src='http://s0.wp.com/latex.php?latex=%7B%5Chbox%7Bdist%7D%28X%2CV%29+%3D+%5Csqrt%7Bn-d%7D%2BO%281%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hbox{dist}(X,V) = &#92;sqrt{n-d}+O(1)}&amp;fg=000000' title='{&#92;hbox{dist}(X,V) = &#92;sqrt{n-d}+O(1)}&amp;fg=000000' class='latex' /> with high probability. More precisely, we have
</p>
<blockquote><p><b>Proposition 1 (Large deviation inequality)</b> <a name="ldi"></a> For any <img src='http://s0.wp.com/latex.php?latex=%7Bt%26%2362%3B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t&gt;0}&amp;fg=000000' title='{t&gt;0}&amp;fg=000000' class='latex' />, one has
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+%26%23124%3B%5Chbox%7Bdist%7D%28X%2CV%29+-+%5Csqrt%7Bn-d%7D%26%23124%3B+%5Cgeq+t+%29+%5Cleq+C+%5Cexp%28-+c+t%5E2+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb P}( &#124;&#92;hbox{dist}(X,V) - &#92;sqrt{n-d}&#124; &#92;geq t ) &#92;leq C &#92;exp(- c t^2 )&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb P}( &#124;&#92;hbox{dist}(X,V) - &#92;sqrt{n-d}&#124; &#92;geq t ) &#92;leq C &#92;exp(- c t^2 )&amp;fg=000000' class='latex' /></p>
<p> for some absolute constants <img src='http://s0.wp.com/latex.php?latex=%7BC%2C+c+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C, c &gt; 0}&amp;fg=000000' title='{C, c &gt; 0}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<p>
In fact the constants <img src='http://s0.wp.com/latex.php?latex=%7BC%2C+c%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C, c}&amp;fg=000000' title='{C, c}&amp;fg=000000' class='latex' /> are very civilised; for large <img src='http://s0.wp.com/latex.php?latex=%7Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t}&amp;fg=000000' title='{t}&amp;fg=000000' class='latex' /> one can basically take <img src='http://s0.wp.com/latex.php?latex=%7BC%3D4%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C=4}&amp;fg=000000' title='{C=4}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7Bc%3D1%2F16%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c=1/16}&amp;fg=000000' title='{c=1/16}&amp;fg=000000' class='latex' />, for instance. This type of concentration, particularly for subspaces <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> of moderately large codimension <img src='http://s0.wp.com/latex.php?latex=%7Bn-d%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n-d}&amp;fg=000000' title='{n-d}&amp;fg=000000' class='latex' />, is fundamental to much of my work on random matrices with Van Vu, starting with <a href="http://front.math.ucdavis.edu/math.CO/0411095">our first paper</a> (in which this proposition first appears). (For subspaces of small codimension (such as hyperplanes) one has to use other tools to get good results, such as inverse Littlewood-Offord theory or the Berry-Ess&#233;en central limit theorem, but that is another story.)
</p>
<p>
Proposition <a href="#ldi">1</a> is an easy consequence of the second moment computation and <a href="http://www.ams.org/mathscinet-getitem?mr=1361756">Talagrand&#8217;s inequality</a>, which among other things provides a sharp concentration result for convex Lipschitz functions on the cube <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' />; since <img src='http://s0.wp.com/latex.php?latex=%7B%5Chbox%7Bdist%7D%28x%2CV%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hbox{dist}(x,V)}&amp;fg=000000' title='{&#92;hbox{dist}(x,V)}&amp;fg=000000' class='latex' /> is indeed a convex Lipschitz function, this inequality can be applied immediately. The proof of Talagrand&#8217;s inequality is short and can be found in several textbooks (e.g. <a href="http://www.ams.org/mathscinet-getitem?mr=2437651">Alon and Spencer</a>), but I thought I would reproduce the argument here (specialised to the convex case), mostly to force myself to learn the proof properly. Note the concentration of <img src='http://s0.wp.com/latex.php?latex=%7BO%281%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(1)}&amp;fg=000000' title='{O(1)}&amp;fg=000000' class='latex' /> obtained by Talagrand&#8217;s inequality is much stronger than what one would get from more elementary tools such as <a href="http://en.wikipedia.org/wiki/Azuma&#037;27s_inequality">Azuma&#8217;s inequality</a> or <a href="http://en.wikipedia.org/wiki/McDiarmid&#037;27s_inequality">McDiarmid&#8217;s inequality</a>, which would only give concentration of about <img src='http://s0.wp.com/latex.php?latex=%7BO%28%5Csqrt%7Bn%7D%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(&#92;sqrt{n})}&amp;fg=000000' title='{O(&#92;sqrt{n})}&amp;fg=000000' class='latex' /> or so (which is in fact trivial, since the cube <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' /> has diameter <img src='http://s0.wp.com/latex.php?latex=%7B2%5Csqrt%7Bn%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{2&#92;sqrt{n}}&amp;fg=000000' title='{2&#92;sqrt{n}}&amp;fg=000000' class='latex' />); the point is that Talagrand&#8217;s inequality is very effective at exploiting the convexity of the problem, as well as the Lipschitz nature of the function in all directions, whereas Azuma&#8217;s inequality can only easily take advantage of the Lipschitz nature of the function in coordinate directions. On the other hand, Azuma&#8217;s inequality works just as well if the <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell^2}&amp;fg=000000' title='{&#92;ell^2}&amp;fg=000000' class='latex' /> metric is replaced with the larger <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%5E1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell^1}&amp;fg=000000' title='{&#92;ell^1}&amp;fg=000000' class='latex' /> metric, and one can conclude that the <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%5E1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell^1}&amp;fg=000000' title='{&#92;ell^1}&amp;fg=000000' class='latex' /> distance between <img src='http://s0.wp.com/latex.php?latex=%7BX%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X}&amp;fg=000000' title='{X}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> concentrates around its median to a width <img src='http://s0.wp.com/latex.php?latex=%7BO%28%5Csqrt%7Bn%7D%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(&#92;sqrt{n})}&amp;fg=000000' title='{O(&#92;sqrt{n})}&amp;fg=000000' class='latex' />, which is a more non-trivial fact than the <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell^2}&amp;fg=000000' title='{&#92;ell^2}&amp;fg=000000' class='latex' /> concentration bound given by that inequality. (The computation of the median of the <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%5E1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell^1}&amp;fg=000000' title='{&#92;ell^1}&amp;fg=000000' class='latex' /> distance is more complicated than for the <img src='http://s0.wp.com/latex.php?latex=%7B%5Cell%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;ell^2}&amp;fg=000000' title='{&#92;ell^2}&amp;fg=000000' class='latex' /> distance, though, and depends on the orientation of <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' />.)
</p>
<blockquote><p><b>Remark 1</b>  If one makes the coordinates of <img src='http://s0.wp.com/latex.php?latex=%7BX%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X}&amp;fg=000000' title='{X}&amp;fg=000000' class='latex' /> iid Gaussian variables <img src='http://s0.wp.com/latex.php?latex=%7Bx_i+%5Cequiv+N%280%2C1%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x_i &#92;equiv N(0,1)}&amp;fg=000000' title='{x_i &#92;equiv N(0,1)}&amp;fg=000000' class='latex' /> rather than random signs, then Proposition <a href="#ldi">1</a> is much easier to prove; the probability distribution of a Gaussian vector is rotation-invariant, so one can rotate <img src='http://s0.wp.com/latex.php?latex=%7BV%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{V}&amp;fg=000000' title='{V}&amp;fg=000000' class='latex' /> to be, say, <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cbf+R%7D%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;bf R}^d}&amp;fg=000000' title='{{&#92;bf R}^d}&amp;fg=000000' class='latex' />, at which point <img src='http://s0.wp.com/latex.php?latex=%7B%5Chbox%7Bdist%7D%28X%2CV%29%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hbox{dist}(X,V)^2}&amp;fg=000000' title='{&#92;hbox{dist}(X,V)^2}&amp;fg=000000' class='latex' /> is clearly the sum of <img src='http://s0.wp.com/latex.php?latex=%7Bn-d%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n-d}&amp;fg=000000' title='{n-d}&amp;fg=000000' class='latex' /> independent squares of Gaussians (i.e. a <a href="http://en.wikipedia.org/wiki/Chi-square_distribution">chi-square distribution</a>), and the claim follows from direct computation (or one can use the <a href="http://en.wikipedia.org/wiki/Chernoff_inequality">Chernoff inequality</a>). The gaussian counterpart of Talagrand&#8217;s inequality is more classical, being essentially due to L&#233;vy, and will also be discussed later in this post. </p></blockquote>
</p>
<p>
<!--more-->
</p>
</p>
<p align="center"><b> &#8212;  1. Concentration on the cube  &#8212; </b></p>
<p>
Proposition <a href="#ldi">1</a> follows easily from the following statement, that asserts that if a convex set <img src='http://s0.wp.com/latex.php?latex=%7BA+%5Csubset+%7B%5Cbf+R%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A &#92;subset {&#92;bf R}^n}&amp;fg=000000' title='{A &#92;subset {&#92;bf R}^n}&amp;fg=000000' class='latex' /> occupies a non-trivial fraction of the cube <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' />, then the neighbourhood <img src='http://s0.wp.com/latex.php?latex=%7BA_t+%3A%3D+%5C%7B+x+%5Cin+%7B%5Cbf+R%7D%5En%3A+%5Chbox%7Bdist%7D%28x%2CA%29+%5Cleq+t+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t := &#92;{ x &#92;in {&#92;bf R}^n: &#92;hbox{dist}(x,A) &#92;leq t &#92;}}&amp;fg=000000' title='{A_t := &#92;{ x &#92;in {&#92;bf R}^n: &#92;hbox{dist}(x,A) &#92;leq t &#92;}}&amp;fg=000000' class='latex' /> will occupy almost all of the cube for <img src='http://s0.wp.com/latex.php?latex=%7Bt+%5Cgg+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t &#92;gg 1}&amp;fg=000000' title='{t &#92;gg 1}&amp;fg=000000' class='latex' />:
</p>
<blockquote><p><b>Proposition 2 (Talagrand&#8217;s concentration inequality)</b> <a name="conci"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> be a convex set in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cbf+R%7D%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;bf R}^d}&amp;fg=000000' title='{{&#92;bf R}^d}&amp;fg=000000' class='latex' />. Then
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29+%5Cmathop%7B%5Cbf+P%7D%28+X+%5Cnot+%5Cin+A_t+%29+%5Cleq+%5Cexp%28+-+c+t%5E2+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq &#92;exp( - c t^2 )&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq &#92;exp( - c t^2 )&amp;fg=000000' class='latex' /></p>
<p> for all <img src='http://s0.wp.com/latex.php?latex=%7Bt%26%2362%3B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t&gt;0}&amp;fg=000000' title='{t&gt;0}&amp;fg=000000' class='latex' /> and some absolute constant <img src='http://s0.wp.com/latex.php?latex=%7Bc+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c &gt; 0}&amp;fg=000000' title='{c &gt; 0}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7BX+%5Cin+%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X &#92;in &#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{X &#92;in &#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' /> is chosen uniformly from <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' />. </p></blockquote>
</p>
<blockquote><p><b>Remark 2</b>  It is crucial that <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is convex here. If instead <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is, say, the set of all points in <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' /> with fewer than <img src='http://s0.wp.com/latex.php?latex=%7Bn%2F2-%5Csqrt%7Bn%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n/2-&#92;sqrt{n}}&amp;fg=000000' title='{n/2-&#92;sqrt{n}}&amp;fg=000000' class='latex' /> <img src='http://s0.wp.com/latex.php?latex=%7B%2B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{+1}&amp;fg=000000' title='{+1}&amp;fg=000000' class='latex' />&#8216;s, then <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathop{&#92;bf P}(X &#92;in A)}&amp;fg=000000' title='{&#92;mathop{&#92;bf P}(X &#92;in A)}&amp;fg=000000' class='latex' /> is comparable to <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' />, but <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathop%7B%5Cbf+P%7D%28+X+%5Cnot+%5Cin+A_t+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t )}&amp;fg=000000' title='{&#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t )}&amp;fg=000000' class='latex' /> only starts decaying once <img src='http://s0.wp.com/latex.php?latex=%7Bt+%5Cgg+%5Csqrt%7Bn%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t &#92;gg &#92;sqrt{n}}&amp;fg=000000' title='{t &#92;gg &#92;sqrt{n}}&amp;fg=000000' class='latex' />, rather than <img src='http://s0.wp.com/latex.php?latex=%7Bt+%5Cgg+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t &#92;gg 1}&amp;fg=000000' title='{t &#92;gg 1}&amp;fg=000000' class='latex' />. Indeed, it is not hard to show that Proposition <a href="#conci">2</a> implies the variant
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29+%5Cmathop%7B%5Cbf+P%7D%28+X+%5Cnot+%5Cin+A_t+%29+%5Cleq+%5Cexp%28+-+c+t%5E2+%2F+n%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq &#92;exp( - c t^2 / n)&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq &#92;exp( - c t^2 / n)&amp;fg=000000' class='latex' /></p>
<p> for non-convex <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> (by restricting <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' /> and then passing from <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> to the convex hull, noting that distances to <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' /> may be contracted by a factor of <img src='http://s0.wp.com/latex.php?latex=%7BO%28%5Csqrt%7Bn%7D%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(&#92;sqrt{n})}&amp;fg=000000' title='{O(&#92;sqrt{n})}&amp;fg=000000' class='latex' /> by this latter process); this inequality can also be easily deduced from <a href="http://en.wikipedia.org/wiki/Azuma&#037;27s_inequality">Azuma&#8217;s inequality</a>. </p></blockquote>
</p>
<p>
To apply this proposition to the situation at hand, observe that if <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is the cylindrical region <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+x+%5Cin+%7B%5Cbf+R%7D%5En%3A+%5Chbox%7Bdist%7D%28x%2CV%29+%5Cleq+r+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ x &#92;in {&#92;bf R}^n: &#92;hbox{dist}(x,V) &#92;leq r &#92;}}&amp;fg=000000' title='{&#92;{ x &#92;in {&#92;bf R}^n: &#92;hbox{dist}(x,V) &#92;leq r &#92;}}&amp;fg=000000' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=%7Br%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r}&amp;fg=000000' title='{r}&amp;fg=000000' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> is convex and <img src='http://s0.wp.com/latex.php?latex=%7BA_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t}&amp;fg=000000' title='{A_t}&amp;fg=000000' class='latex' /> is contained in <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B+x+%5Cin+%7B%5Cbf+R%7D%5En%3A+%5Chbox%7Bdist%7D%28x%2CV%29+%5Cleq+r%2Bt+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{ x &#92;in {&#92;bf R}^n: &#92;hbox{dist}(x,V) &#92;leq r+t &#92;}}&amp;fg=000000' title='{&#92;{ x &#92;in {&#92;bf R}^n: &#92;hbox{dist}(x,V) &#92;leq r+t &#92;}}&amp;fg=000000' class='latex' />. Thus </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28%5Chbox%7Bdist%7D%28X%2CV%29+%5Cleq+r%29+%5Cmathop%7B%5Cbf+P%7D%28+%5Chbox%7Bdist%7D%28X%2CV%29+%26%2362%3B+r%2Bt+%29+%5Cleq+%5Cexp%28-ct%5E2%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}(&#92;hbox{dist}(X,V) &#92;leq r) &#92;mathop{&#92;bf P}( &#92;hbox{dist}(X,V) &gt; r+t ) &#92;leq &#92;exp(-ct^2).&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}(&#92;hbox{dist}(X,V) &#92;leq r) &#92;mathop{&#92;bf P}( &#92;hbox{dist}(X,V) &gt; r+t ) &#92;leq &#92;exp(-ct^2).&amp;fg=000000' class='latex' /></p>
<p> Applying this with <img src='http://s0.wp.com/latex.php?latex=%7Br+%3A%3D+M%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r := M}&amp;fg=000000' title='{r := M}&amp;fg=000000' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%7Br+%3A%3D+M-t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{r := M-t}&amp;fg=000000' title='{r := M-t}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7BM%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M}&amp;fg=000000' title='{M}&amp;fg=000000' class='latex' /> is the median value of <img src='http://s0.wp.com/latex.php?latex=%7B%5Chbox%7Bdist%7D%28X%2CV%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hbox{dist}(X,V)}&amp;fg=000000' title='{&#92;hbox{dist}(X,V)}&amp;fg=000000' class='latex' />, one soon obtains concentration around the median:
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28+%26%23124%3B%5Chbox%7Bdist%7D%28X%2CV%29+-+M%26%23124%3B+%26%2362%3B+t+%29+%5Cleq+4+%5Cexp%28-ct%5E2%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}( &#124;&#92;hbox{dist}(X,V) - M&#124; &gt; t ) &#92;leq 4 &#92;exp(-ct^2).&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}( &#124;&#92;hbox{dist}(X,V) - M&#124; &gt; t ) &#92;leq 4 &#92;exp(-ct^2).&amp;fg=000000' class='latex' /></p>
<p> This is only compatible with <a href="#secmoment">(1)</a> if <img src='http://s0.wp.com/latex.php?latex=%7BM+%3D+%5Csqrt%7Bn-d%7D+%2B+O%281%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{M = &#92;sqrt{n-d} + O(1)}&amp;fg=000000' title='{M = &#92;sqrt{n-d} + O(1)}&amp;fg=000000' class='latex' />, and the claim follows.</p>
<p>
To prove Proposition <a href="#conci">2</a>, we use the exponential moment method. Indeed, it suffices by Markov&#8217;s inequality to show that <a name="nvac">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29+%7B%5CBbb+E%7D+%5Cexp%28+c+%5Chbox%7Bdist%7D%28X%2CA%29%5E2+%29+%5Cleq+1+%5C+%5C+%5C+%5C+%5C+%282%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) {&#92;Bbb E} &#92;exp( c &#92;hbox{dist}(X,A)^2 ) &#92;leq 1 &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) {&#92;Bbb E} &#92;exp( c &#92;hbox{dist}(X,A)^2 ) &#92;leq 1 &#92; &#92; &#92; &#92; &#92; (2)&amp;fg=000000' class='latex' /></p>
<p></a> for a sufficiently small absolute constant <img src='http://s0.wp.com/latex.php?latex=%7Bc+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c &gt; 0}&amp;fg=000000' title='{c &gt; 0}&amp;fg=000000' class='latex' /> (in fact one can take <img src='http://s0.wp.com/latex.php?latex=%7Bc%3D1%2F16%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c=1/16}&amp;fg=000000' title='{c=1/16}&amp;fg=000000' class='latex' />).
</p>
<p>
We prove <a href="#nvac">(2)</a> by an induction on the dimension <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />. The claim is trivial for <img src='http://s0.wp.com/latex.php?latex=%7Bn%3D0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n=0}&amp;fg=000000' title='{n=0}&amp;fg=000000' class='latex' />, so suppose <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Cgeq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n &#92;geq 1}&amp;fg=000000' title='{n &#92;geq 1}&amp;fg=000000' class='latex' /> and the claim has already been proven for <img src='http://s0.wp.com/latex.php?latex=%7Bn-1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n-1}&amp;fg=000000' title='{n-1}&amp;fg=000000' class='latex' />.
</p>
<p>
Let us write <img src='http://s0.wp.com/latex.php?latex=%7BX+%3D+%28X%27%2Cx_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X = (X&#039;,x_n)}&amp;fg=000000' title='{X = (X&#039;,x_n)}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7Bx_n+%3D+%5Cpm+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x_n = &#92;pm 1}&amp;fg=000000' title='{x_n = &#92;pm 1}&amp;fg=000000' class='latex' />. For each <img src='http://s0.wp.com/latex.php?latex=%7Bt+%5Cin+%7B%5Cbf+R%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t &#92;in {&#92;bf R}}&amp;fg=000000' title='{t &#92;in {&#92;bf R}}&amp;fg=000000' class='latex' />, we introduce the slice <img src='http://s0.wp.com/latex.php?latex=%7BA_t+%3A%3D+%5C%7B+x%27+%5Cin+%7B%5Cbf+R%7D%5E%7Bn-1%7D%3A+%28x%27%2Ct%29+%5Cin+A+%5C%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t := &#92;{ x&#039; &#92;in {&#92;bf R}^{n-1}: (x&#039;,t) &#92;in A &#92;}}&amp;fg=000000' title='{A_t := &#92;{ x&#039; &#92;in {&#92;bf R}^{n-1}: (x&#039;,t) &#92;in A &#92;}}&amp;fg=000000' class='latex' />, then <img src='http://s0.wp.com/latex.php?latex=%7BA_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t}&amp;fg=000000' title='{A_t}&amp;fg=000000' class='latex' /> is convex. We now try to bound the left-hand side of <a href="#nvac">(2)</a> in terms of <img src='http://s0.wp.com/latex.php?latex=%7BX%27%2C+A_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X&#039;, A_t}&amp;fg=000000' title='{X&#039;, A_t}&amp;fg=000000' class='latex' /> rather than <img src='http://s0.wp.com/latex.php?latex=%7BX%2C+A%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X, A}&amp;fg=000000' title='{X, A}&amp;fg=000000' class='latex' />. Clearly </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29+%3D+%5Cfrac%7B1%7D%7B2%7D+%5B+%5Cmathop%7B%5Cbf+P%7D%28+X%27+%5Cin+A_%7B-1%7D%29+%2B+%5Cmathop%7B%5Cbf+P%7D%28+X%27+%5Cin+A_%7B%2B1%7D+%29+%5D.+%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) = &#92;frac{1}{2} [ &#92;mathop{&#92;bf P}( X&#039; &#92;in A_{-1}) + &#92;mathop{&#92;bf P}( X&#039; &#92;in A_{+1} ) ]. &amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) = &#92;frac{1}{2} [ &#92;mathop{&#92;bf P}( X&#039; &#92;in A_{-1}) + &#92;mathop{&#92;bf P}( X&#039; &#92;in A_{+1} ) ]. &amp;fg=000000' class='latex' /></p>
<p> By symmetry we may assume that <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathop%7B%5Cbf+P%7D%28+X%27+%5Cin+A_%7B%2B1%7D+%29+%5Cgeq+%5Cmathop%7B%5Cbf+P%7D%28+X%27+%5Cin+A_%7B-1%7D+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathop{&#92;bf P}( X&#039; &#92;in A_{+1} ) &#92;geq &#92;mathop{&#92;bf P}( X&#039; &#92;in A_{-1} )}&amp;fg=000000' title='{&#92;mathop{&#92;bf P}( X&#039; &#92;in A_{+1} ) &#92;geq &#92;mathop{&#92;bf P}( X&#039; &#92;in A_{-1} )}&amp;fg=000000' class='latex' />, thus we may write <a name="av">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28+X%27+%5Cin+A_%7B%5Cpm+1%7D+%29+%3D+p+%281+%5Cpm+q%29+%5C+%5C+%5C+%5C+%5C+%283%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}( X&#039; &#92;in A_{&#92;pm 1} ) = p (1 &#92;pm q) &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}( X&#039; &#92;in A_{&#92;pm 1} ) = p (1 &#92;pm q) &#92; &#92; &#92; &#92; &#92; (3)&amp;fg=000000' class='latex' /></p>
<p></a> where <img src='http://s0.wp.com/latex.php?latex=%7Bp+%3A%3D+%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{p := &#92;mathop{&#92;bf P}(X &#92;in A)}&amp;fg=000000' title='{p := &#92;mathop{&#92;bf P}(X &#92;in A)}&amp;fg=000000' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%7B0+%5Cleq+q+%5Cleq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &#92;leq q &#92;leq 1}&amp;fg=000000' title='{0 &#92;leq q &#92;leq 1}&amp;fg=000000' class='latex' />.</p>
<p>
Now we look at <img src='http://s0.wp.com/latex.php?latex=%7B%5Chbox%7Bdist%7D%28X%2CA%29%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;hbox{dist}(X,A)^2}&amp;fg=000000' title='{&#92;hbox{dist}(X,A)^2}&amp;fg=000000' class='latex' />. For <img src='http://s0.wp.com/latex.php?latex=%7Bt+%3D+%5Cpm+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t = &#92;pm 1}&amp;fg=000000' title='{t = &#92;pm 1}&amp;fg=000000' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=%7BY_t+%5Cin+%7B%5Cbf+R%7D%5E%7Bn-1%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{Y_t &#92;in {&#92;bf R}^{n-1}}&amp;fg=000000' title='{Y_t &#92;in {&#92;bf R}^{n-1}}&amp;fg=000000' class='latex' /> be the closest point of (the closure of) <img src='http://s0.wp.com/latex.php?latex=%7BA_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t}&amp;fg=000000' title='{A_t}&amp;fg=000000' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7BX%27%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X&#039;}&amp;fg=000000' title='{X&#039;}&amp;fg=000000' class='latex' />, thus <a name="xpy">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3BX%27-Y_t%26%23124%3B+%3D+%5Chbox%7Bdist%7D%28X%27%2C+A_t%29.+%5C+%5C+%5C+%5C+%5C+%284%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;X&#039;-Y_t&#124; = &#92;hbox{dist}(X&#039;, A_t). &#92; &#92; &#92; &#92; &#92; (4)&amp;fg=000000' title='&#92;displaystyle  &#124;X&#039;-Y_t&#124; = &#92;hbox{dist}(X&#039;, A_t). &#92; &#92; &#92; &#92; &#92; (4)&amp;fg=000000' class='latex' /></p>
<p></a> Let <img src='http://s0.wp.com/latex.php?latex=%7B0+%5Cleq+%5Clambda+%5Cleq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &#92;leq &#92;lambda &#92;leq 1}&amp;fg=000000' title='{0 &#92;leq &#92;lambda &#92;leq 1}&amp;fg=000000' class='latex' /> be chosen later; then the point <img src='http://s0.wp.com/latex.php?latex=%7B%281-%5Clambda%29+%28Y_%7Bx_n%7D%2C+x_n%29+%2B+%5Clambda+%28Y_%7B-x_n%7D%2C-x_n%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{(1-&#92;lambda) (Y_{x_n}, x_n) + &#92;lambda (Y_{-x_n},-x_n)}&amp;fg=000000' title='{(1-&#92;lambda) (Y_{x_n}, x_n) + &#92;lambda (Y_{-x_n},-x_n)}&amp;fg=000000' class='latex' /> lies in <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> by convexity, and so </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Chbox%7Bdist%7D%28X%2CA%29+%5Cleq+%26%23124%3B%281-%5Clambda%29+%28Y_%7Bx_n%7D%2C+x_n%29+%2B+%5Clambda+%28Y_%7B-x_n%7D%2C-x_n%29+-+%28X%27%2Cx_n%29%26%23124%3B.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;hbox{dist}(X,A) &#92;leq &#124;(1-&#92;lambda) (Y_{x_n}, x_n) + &#92;lambda (Y_{-x_n},-x_n) - (X&#039;,x_n)&#124;.&amp;fg=000000' title='&#92;displaystyle  &#92;hbox{dist}(X,A) &#92;leq &#124;(1-&#92;lambda) (Y_{x_n}, x_n) + &#92;lambda (Y_{-x_n},-x_n) - (X&#039;,x_n)&#124;.&amp;fg=000000' class='latex' /></p>
<p> Squaring this and using Pythagoras, one obtains
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Chbox%7Bdist%7D%28X%2CA%29%5E2+%5Cleq+4+%5Clambda%5E2+%2B+%26%23124%3B%281-%5Clambda%29+%28X%27-Y_%7Bx_n%7D%29+%2B+%5Clambda+%28X%27-Y_%7B-x_n%7D%29%26%23124%3B%5E2.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;hbox{dist}(X,A)^2 &#92;leq 4 &#92;lambda^2 + &#124;(1-&#92;lambda) (X&#039;-Y_{x_n}) + &#92;lambda (X&#039;-Y_{-x_n})&#124;^2.&amp;fg=000000' title='&#92;displaystyle  &#92;hbox{dist}(X,A)^2 &#92;leq 4 &#92;lambda^2 + &#124;(1-&#92;lambda) (X&#039;-Y_{x_n}) + &#92;lambda (X&#039;-Y_{-x_n})&#124;^2.&amp;fg=000000' class='latex' /></p>
<p> As we will shortly be exponentiating the left-hand side, we need to linearise the right-hand side. Accordingly, we will exploit the convexity of the function <img src='http://s0.wp.com/latex.php?latex=%7Bx+%5Cmapsto+%26%23124%3Bx%26%23124%3B%5E2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x &#92;mapsto &#124;x&#124;^2}&amp;fg=000000' title='{x &#92;mapsto &#124;x&#124;^2}&amp;fg=000000' class='latex' /> to bound
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3B%281-%5Clambda%29+%28X-Y_%7Bx_n%7D%29+%2B+%5Clambda+%28X-Y_%7B-x_n%7D%29%26%23124%3B%5E2+%5Cleq+%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;(1-&#92;lambda) (X-Y_{x_n}) + &#92;lambda (X-Y_{-x_n})&#124;^2 &#92;leq &amp;fg=000000' title='&#92;displaystyle  &#124;(1-&#92;lambda) (X-Y_{x_n}) + &#92;lambda (X-Y_{-x_n})&#124;^2 &#92;leq &amp;fg=000000' class='latex' /></p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%281-%5Clambda%29+%26%23124%3BX%27-Y_%7Bx_n%7D%26%23124%3B%5E2+%2B+%5Clambda+%26%23124%3BX%27-Y_%7B-x_n%7D%26%23124%3B%5E2%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  (1-&#92;lambda) &#124;X&#039;-Y_{x_n}&#124;^2 + &#92;lambda &#124;X&#039;-Y_{-x_n}&#124;^2&amp;fg=000000' title='&#92;displaystyle  (1-&#92;lambda) &#124;X&#039;-Y_{x_n}&#124;^2 + &#92;lambda &#124;X&#039;-Y_{-x_n}&#124;^2&amp;fg=000000' class='latex' /></p>
<p> and thus by <a href="#xpy">(4)</a>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Chbox%7Bdist%7D%28X%2CA%29%5E2+%5Cleq+4+%5Clambda%5E2+%2B+%281-%5Clambda%29+%5Chbox%7Bdist%7D%28X%27%2CA_%7Bx_n%7D%29%5E2+%2B+%5Clambda+%5Chbox%7Bdist%7D%28X%27%2CA_%7B-x_n%7D%29%5E2.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;hbox{dist}(X,A)^2 &#92;leq 4 &#92;lambda^2 + (1-&#92;lambda) &#92;hbox{dist}(X&#039;,A_{x_n})^2 + &#92;lambda &#92;hbox{dist}(X&#039;,A_{-x_n})^2.&amp;fg=000000' title='&#92;displaystyle  &#92;hbox{dist}(X,A)^2 &#92;leq 4 &#92;lambda^2 + (1-&#92;lambda) &#92;hbox{dist}(X&#039;,A_{x_n})^2 + &#92;lambda &#92;hbox{dist}(X&#039;,A_{-x_n})^2.&amp;fg=000000' class='latex' /></p>
<p> We exponentiate this and take expectations in <img src='http://s0.wp.com/latex.php?latex=%7BX%27%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X&#039;}&amp;fg=000000' title='{X&#039;}&amp;fg=000000' class='latex' /> (holding <img src='http://s0.wp.com/latex.php?latex=%7Bx_n%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x_n}&amp;fg=000000' title='{x_n}&amp;fg=000000' class='latex' /> fixed for now) to get
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7BX%27%7D+e%5E%7Bc+%5Chbox%7Bdist%7D%28X%2CA%29%5E2%7D+%5Cleq+e%5E%7B4+c+%5Clambda%5E2%7D+%7B%5CBbb+E%7D_%7BX%27%7D+%28e%5E%7Bc+%5Chbox%7Bdist%7D%28X%27%2CA_%7Bx_n%7D%29%5E2%7D%29%5E%7B1-%5Clambda%7D+%28e%5E%7Bc+%5Chbox%7Bdist%7D%28X%27%2CA_%7B-x_n%7D%29%5E2%7D%29%5E%7B%5Clambda%7D.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{X&#039;} e^{c &#92;hbox{dist}(X,A)^2} &#92;leq e^{4 c &#92;lambda^2} {&#92;Bbb E}_{X&#039;} (e^{c &#92;hbox{dist}(X&#039;,A_{x_n})^2})^{1-&#92;lambda} (e^{c &#92;hbox{dist}(X&#039;,A_{-x_n})^2})^{&#92;lambda}.&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{X&#039;} e^{c &#92;hbox{dist}(X,A)^2} &#92;leq e^{4 c &#92;lambda^2} {&#92;Bbb E}_{X&#039;} (e^{c &#92;hbox{dist}(X&#039;,A_{x_n})^2})^{1-&#92;lambda} (e^{c &#92;hbox{dist}(X&#039;,A_{-x_n})^2})^{&#92;lambda}.&amp;fg=000000' class='latex' /></p>
<p> Meanwhile, from the induction hypothesis and <a href="#av">(3)</a> we have
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7BX%27%7D+e%5E%7Bc+%5Chbox%7Bdist%7D%28X%27%2CA_%7Bx_n%7D%29%5E2%7D+%5Cleq+%5Cfrac%7B1%7D%7Bp%281+%2B+x_n+q%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{X&#039;} e^{c &#92;hbox{dist}(X&#039;,A_{x_n})^2} &#92;leq &#92;frac{1}{p(1 + x_n q)}&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{X&#039;} e^{c &#92;hbox{dist}(X&#039;,A_{x_n})^2} &#92;leq &#92;frac{1}{p(1 + x_n q)}&amp;fg=000000' class='latex' /></p>
<p> and similarly for <img src='http://s0.wp.com/latex.php?latex=%7BA_%7B-x_n%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_{-x_n}}&amp;fg=000000' title='{A_{-x_n}}&amp;fg=000000' class='latex' />. By H&#246;lder&#8217;s inequality, we conclude
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7BX%27%7D+e%5E%7Bc+%5Chbox%7Bdist%7D%28X%2CA%29%5E2%7D+%5Cleq+e%5E%7B4+c+%5Clambda%5E2%7D+%5Cfrac%7B1%7D%7Bp+%281+%2B+x_n+q%29%5E%7B1-%5Clambda%7D+%281-x_n+q%29%5E%5Clambda%7D.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{X&#039;} e^{c &#92;hbox{dist}(X,A)^2} &#92;leq e^{4 c &#92;lambda^2} &#92;frac{1}{p (1 + x_n q)^{1-&#92;lambda} (1-x_n q)^&#92;lambda}.&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{X&#039;} e^{c &#92;hbox{dist}(X,A)^2} &#92;leq e^{4 c &#92;lambda^2} &#92;frac{1}{p (1 + x_n q)^{1-&#92;lambda} (1-x_n q)^&#92;lambda}.&amp;fg=000000' class='latex' /></p>
<p> For <img src='http://s0.wp.com/latex.php?latex=%7Bx_n%3D%2B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x_n=+1}&amp;fg=000000' title='{x_n=+1}&amp;fg=000000' class='latex' />, the optimal choice of <img src='http://s0.wp.com/latex.php?latex=%7B%5Clambda%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;lambda}&amp;fg=000000' title='{&#92;lambda}&amp;fg=000000' class='latex' /> here is <img src='http://s0.wp.com/latex.php?latex=%7B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0}&amp;fg=000000' title='{0}&amp;fg=000000' class='latex' />, obtaining
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D_%7BX%27%7D+e%5E%7Bc+%5Chbox%7Bdist%7D%28X%2CA%29%5E2%7D+%3D+%5Cfrac%7B1%7D%7Bp+%281%2Bq%29%7D%3B%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E}_{X&#039;} e^{c &#92;hbox{dist}(X,A)^2} = &#92;frac{1}{p (1+q)};&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E}_{X&#039;} e^{c &#92;hbox{dist}(X,A)^2} = &#92;frac{1}{p (1+q)};&amp;fg=000000' class='latex' /></p>
<p> for <img src='http://s0.wp.com/latex.php?latex=%7Bx_n%3D-1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x_n=-1}&amp;fg=000000' title='{x_n=-1}&amp;fg=000000' class='latex' />, the optimal choice of <img src='http://s0.wp.com/latex.php?latex=%7B%5Clambda%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;lambda}&amp;fg=000000' title='{&#92;lambda}&amp;fg=000000' class='latex' /> is to be determined. Averaging, we obtain
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%7B%5CBbb+E%7D_%7BX%7D+e%5E%7Bc+%5Chbox%7Bdist%7D%28X%2CA%29%5E2%7D+%3D+%5Cfrac%7B1%7D%7B2%7D+%5B+%5Cfrac%7B1%7D%7Bp+%281%2Bq%29%7D+%2B+e%5E%7B4+c+%5Clambda%5E2%7D+%5Cfrac%7B1%7D%7Bp+%281+-+q%29%5E%7B1-%5Clambda%7D+%281+%2B+q%29%5E%5Clambda%7D+%5D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle {&#92;Bbb E}_{X} e^{c &#92;hbox{dist}(X,A)^2} = &#92;frac{1}{2} [ &#92;frac{1}{p (1+q)} + e^{4 c &#92;lambda^2} &#92;frac{1}{p (1 - q)^{1-&#92;lambda} (1 + q)^&#92;lambda} ]&amp;fg=000000' title='&#92;displaystyle {&#92;Bbb E}_{X} e^{c &#92;hbox{dist}(X,A)^2} = &#92;frac{1}{2} [ &#92;frac{1}{p (1+q)} + e^{4 c &#92;lambda^2} &#92;frac{1}{p (1 - q)^{1-&#92;lambda} (1 + q)^&#92;lambda} ]&amp;fg=000000' class='latex' /></p>
<p> so to establish <a href="#nvac">(2)</a>, it suffices to pick <img src='http://s0.wp.com/latex.php?latex=%7B0+%5Cleq+%5Clambda+%5Cleq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{0 &#92;leq &#92;lambda &#92;leq 1}&amp;fg=000000' title='{0 &#92;leq &#92;lambda &#92;leq 1}&amp;fg=000000' class='latex' /> such that
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cfrac%7B1%7D%7B1%2Bq%7D+%2B+e%5E%7B4+c+%5Clambda%5E2%7D+%5Cfrac%7B1%7D%7B%281+-+q%29%5E%7B1-%5Clambda%7D+%281+%2B+q%29%5E%5Clambda%7D+%5Cleq+2.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;frac{1}{1+q} + e^{4 c &#92;lambda^2} &#92;frac{1}{(1 - q)^{1-&#92;lambda} (1 + q)^&#92;lambda} &#92;leq 2.&amp;fg=000000' title='&#92;displaystyle  &#92;frac{1}{1+q} + e^{4 c &#92;lambda^2} &#92;frac{1}{(1 - q)^{1-&#92;lambda} (1 + q)^&#92;lambda} &#92;leq 2.&amp;fg=000000' class='latex' /></p>
<p> If <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' /> is bounded away from zero, then by choosing <img src='http://s0.wp.com/latex.php?latex=%7B%5Clambda%3D1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;lambda=1}&amp;fg=000000' title='{&#92;lambda=1}&amp;fg=000000' class='latex' /> we would obtain the claim if <img src='http://s0.wp.com/latex.php?latex=%7Bc%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c}&amp;fg=000000' title='{c}&amp;fg=000000' class='latex' /> is small enough, so we may take <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' /> to be small. But then a Taylor expansion allows us to conclude if we take <img src='http://s0.wp.com/latex.php?latex=%7B%5Clambda%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;lambda}&amp;fg=000000' title='{&#92;lambda}&amp;fg=000000' class='latex' /> to be a constant multiple of <img src='http://s0.wp.com/latex.php?latex=%7Bq%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{q}&amp;fg=000000' title='{q}&amp;fg=000000' class='latex' />, and again pick <img src='http://s0.wp.com/latex.php?latex=%7Bc%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c}&amp;fg=000000' title='{c}&amp;fg=000000' class='latex' /> to be small enough. The point is that <img src='http://s0.wp.com/latex.php?latex=%7B%5Clambda%3D0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;lambda=0}&amp;fg=000000' title='{&#92;lambda=0}&amp;fg=000000' class='latex' /> already almost works up to errors of <img src='http://s0.wp.com/latex.php?latex=%7BO%28q%5E2%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(q^2)}&amp;fg=000000' title='{O(q^2)}&amp;fg=000000' class='latex' />, and increasing <img src='http://s0.wp.com/latex.php?latex=%7B%5Clambda%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;lambda}&amp;fg=000000' title='{&#92;lambda}&amp;fg=000000' class='latex' /> from zero to a small non-zero quantity will decrease the LHS by about <img src='http://s0.wp.com/latex.php?latex=%7BO%28%5Clambda+q%29+-+O%28c+%5Clambda%5E2%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{O(&#92;lambda q) - O(c &#92;lambda^2)}&amp;fg=000000' title='{O(&#92;lambda q) - O(c &#92;lambda^2)}&amp;fg=000000' class='latex' />. [By optimising everything using first-year calculus, one eventually gets the constant <img src='http://s0.wp.com/latex.php?latex=%7Bc%3D1%2F16%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c=1/16}&amp;fg=000000' title='{c=1/16}&amp;fg=000000' class='latex' /> claimed earlier.]</p>
<blockquote><p><b>Remark 3</b>  Talagrand&#8217;s inequality is in fact far more general than this; it applies to arbitrary products of probability spaces, rather than just to <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' />, and to non-convex <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' />, but the notion of distance needed to define <img src='http://s0.wp.com/latex.php?latex=%7BA_t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A_t}&amp;fg=000000' title='{A_t}&amp;fg=000000' class='latex' /> becomes more complicated; the proof of the inequality, though, is essentially the same. Besides its applicability to convex Lipschitz functions, Talagrand&#8217;s inequality is also very useful for controlling combinatorial Lipschitz functions <img src='http://s0.wp.com/latex.php?latex=%7BF%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F}&amp;fg=000000' title='{F}&amp;fg=000000' class='latex' /> which are &#8220;locally certifiable&#8221; in the sense that whenever <img src='http://s0.wp.com/latex.php?latex=%7BF%28x%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F(x)}&amp;fg=000000' title='{F(x)}&amp;fg=000000' class='latex' /> is larger than some threshold <img src='http://s0.wp.com/latex.php?latex=%7Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t}&amp;fg=000000' title='{t}&amp;fg=000000' class='latex' />, then there exist some bounded number <img src='http://s0.wp.com/latex.php?latex=%7Bf%28t%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(t)}&amp;fg=000000' title='{f(t)}&amp;fg=000000' class='latex' /> of coefficients of <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' /> which &#8220;certify&#8221; this fact (in the sense that <img src='http://s0.wp.com/latex.php?latex=%7BF%28y%29+%5Cgeq+t%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{F(y) &#92;geq t}&amp;fg=000000' title='{F(y) &#92;geq t}&amp;fg=000000' class='latex' /> for any other <img src='http://s0.wp.com/latex.php?latex=%7By%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{y}&amp;fg=000000' title='{y}&amp;fg=000000' class='latex' /> which agrees with <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' /> on these coefficients). See e.g. the <a href="http://www.ams.org/mathscinet-getitem?mr=2437651">text of Alon and Spencer</a> for a more precise statement and some applications. </p></blockquote>
</p>
<p align="center"><b> &#8212;  2. Gaussian concentration  &#8212; </b></p>
<p>
As mentioned earlier, there are analogous results when the uniform distribution on the cube <img src='http://s0.wp.com/latex.php?latex=%7B%5C%7B-1%2C%2B1%5C%7D%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' title='{&#92;{-1,+1&#92;}^n}&amp;fg=000000' class='latex' /> are replaced by other distributions, such as the <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{n}&amp;fg=000000' title='{n}&amp;fg=000000' class='latex' />-dimensional Gaussian distribution. In fact, in this case convexity is not needed:
</p>
<blockquote><p><b>Proposition 3 (Gaussian concentration inequality)</b> <a name="gauss"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7BA%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{A}&amp;fg=000000' title='{A}&amp;fg=000000' class='latex' /> be a measurable set in <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5Cbf+R%7D%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;bf R}^d}&amp;fg=000000' title='{{&#92;bf R}^d}&amp;fg=000000' class='latex' />. Then
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29+%5Cmathop%7B%5Cbf+P%7D%28+X+%5Cnot+%5Cin+A_t+%29+%5Cleq+%5Cexp%28+-+c+t%5E2+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq &#92;exp( - c t^2 )&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq &#92;exp( - c t^2 )&amp;fg=000000' class='latex' /></p>
<p> for all <img src='http://s0.wp.com/latex.php?latex=%7Bt%26%2362%3B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t&gt;0}&amp;fg=000000' title='{t&gt;0}&amp;fg=000000' class='latex' /> and some absolute constant <img src='http://s0.wp.com/latex.php?latex=%7Bc+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c &gt; 0}&amp;fg=000000' title='{c &gt; 0}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7BX+%5Cequiv+N%280%2C1%29%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X &#92;equiv N(0,1)^n}&amp;fg=000000' title='{X &#92;equiv N(0,1)^n}&amp;fg=000000' class='latex' /> is a random Gaussian vector. </p></blockquote>
</p>
<p>
This inequality can be deduced from <a href="http://en.wikipedia.org/wiki/Concentration_of_measure">L&#233;vy&#8217;s classical concentration of measure inequality for the sphere</a> (with the optimal constant), but we will give an alternate proof due to Maurey and Pisier. It suffices to prove the following variant of Proposition <a href="#gauss">3</a>:
</p>
<blockquote><p><b>Proposition 4 (Gaussian concentration inequality for Lipschitz functions)</b> <a name="gauss2"></a> Let <img src='http://s0.wp.com/latex.php?latex=%7Bf%3A+%7B%5Cbf+R%7D%5Ed+%5Crightarrow+%7B%5Cbf+R%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f: {&#92;bf R}^d &#92;rightarrow {&#92;bf R}}&amp;fg=000000' title='{f: {&#92;bf R}^d &#92;rightarrow {&#92;bf R}}&amp;fg=000000' class='latex' /> be a function which is Lipschitz with constant <img src='http://s0.wp.com/latex.php?latex=%7B1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{1}&amp;fg=000000' title='{1}&amp;fg=000000' class='latex' /> (i.e. <img src='http://s0.wp.com/latex.php?latex=%7B%26%23124%3Bf%28x%29-f%28y%29%26%23124%3B+%5Cleq+%26%23124%3Bx-y%26%23124%3B%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#124;f(x)-f(y)&#124; &#92;leq &#124;x-y&#124;}&amp;fg=000000' title='{&#124;f(x)-f(y)&#124; &#92;leq &#124;x-y&#124;}&amp;fg=000000' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=%7Bx%2Cy+%5Cin+%7B%5Cbf+R%7D%5Ed%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x,y &#92;in {&#92;bf R}^d}&amp;fg=000000' title='{x,y &#92;in {&#92;bf R}^d}&amp;fg=000000' class='latex' />. Then for any <img src='http://s0.wp.com/latex.php?latex=%7Bt%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t}&amp;fg=000000' title='{t}&amp;fg=000000' class='latex' /> we have
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+P%7D%28+%26%23124%3Bf%28X%29+-+%7B%5CBbb+E%7D+f%28X%29%26%23124%3B+%5Cgeq+t+%29+%5Cleq+2%5Cexp%28+-+ct%5E2+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb P}( &#124;f(X) - {&#92;Bbb E} f(X)&#124; &#92;geq t ) &#92;leq 2&#92;exp( - ct^2 )&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb P}( &#124;f(X) - {&#92;Bbb E} f(X)&#124; &#92;geq t ) &#92;leq 2&#92;exp( - ct^2 )&amp;fg=000000' class='latex' /></p>
<p> for all <img src='http://s0.wp.com/latex.php?latex=%7Bt%26%2362%3B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t&gt;0}&amp;fg=000000' title='{t&gt;0}&amp;fg=000000' class='latex' /> and some absolute constant <img src='http://s0.wp.com/latex.php?latex=%7Bc%26%2362%3B0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{c&gt;0}&amp;fg=000000' title='{c&gt;0}&amp;fg=000000' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%7BX+%5Cequiv+N%280%2C1%29%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X &#92;equiv N(0,1)^n}&amp;fg=000000' title='{X &#92;equiv N(0,1)^n}&amp;fg=000000' class='latex' /> is a random variable. [Informally, Lipschitz functions of Gaussian variables concentrate as if they were Gaussian themselves; for comparison, Talagrand's inequality implies that <em>convex</em> Lipschitz functions of <em>Bernoulli</em> variables concentrate as if they were Gaussian.] </p></blockquote>
</p>
<p>
Indeed, if one sets <img src='http://s0.wp.com/latex.php?latex=%7Bf%28x%29+%3A%3D+%5Chbox%7Bdist%7D%28x%2CA%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f(x) := &#92;hbox{dist}(x,A)}&amp;fg=000000' title='{f(x) := &#92;hbox{dist}(x,A)}&amp;fg=000000' class='latex' />, and splits into the cases whether <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D+f%28X%29+%5Cgeq+t%2F2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E} f(X) &#92;geq t/2}&amp;fg=000000' title='{{&#92;Bbb E} f(X) &#92;geq t/2}&amp;fg=000000' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D+f%28X%29+%26%2360%3B+t%2F2%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E} f(X) &lt; t/2}&amp;fg=000000' title='{{&#92;Bbb E} f(X) &lt; t/2}&amp;fg=000000' class='latex' />, one obtains either <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+P%7D%28+X+%5Cin+A+%29+%5Cleq+2+%5Cexp%28-ct%5E2%2F4%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb P}( X &#92;in A ) &#92;leq 2 &#92;exp(-ct^2/4)}&amp;fg=000000' title='{{&#92;Bbb P}( X &#92;in A ) &#92;leq 2 &#92;exp(-ct^2/4)}&amp;fg=000000' class='latex' /> (in the former case) or <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+P%7D%28+X+%5Cnot+%5Cin+A_t+%29+%5Cleq+2+%5Cexp%28-ct%5E2%2F4%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb P}( X &#92;not &#92;in A_t ) &#92;leq 2 &#92;exp(-ct^2/4)}&amp;fg=000000' title='{{&#92;Bbb P}( X &#92;not &#92;in A_t ) &#92;leq 2 &#92;exp(-ct^2/4)}&amp;fg=000000' class='latex' />, and so </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29+%5Cmathop%7B%5Cbf+P%7D%28+X+%5Cnot+%5Cin+A_t+%29+%5Cleq+2%5Cexp%28+-+c+t%5E2%2F4+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq 2&#92;exp( - c t^2/4 )&amp;fg=000000' title='&#92;displaystyle  &#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq 2&#92;exp( - c t^2/4 )&amp;fg=000000' class='latex' /></p>
<p> in either case. Also, since <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29+%2B+%5Cmathop%7B%5Cbf+P%7D%28+X+%5Cnot+%5Cin+A_t+%29+%5Cleq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathop{&#92;bf P}(X &#92;in A) + &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq 1}&amp;fg=000000' title='{&#92;mathop{&#92;bf P}(X &#92;in A) + &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq 1}&amp;fg=000000' class='latex' />, we have <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathop%7B%5Cbf+P%7D%28X+%5Cin+A%29+%5Cmathop%7B%5Cbf+P%7D%28+X+%5Cnot+%5Cin+A_t+%29+%5Cleq+1%2F4%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq 1/4}&amp;fg=000000' title='{&#92;mathop{&#92;bf P}(X &#92;in A) &#92;mathop{&#92;bf P}( X &#92;not &#92;in A_t ) &#92;leq 1/4}&amp;fg=000000' class='latex' />. Putting the two bounds together gives the claim.</p>
<p>
Now we prove Proposition <a href="#gauss2">4</a>. By the <a href="http://www.tricki.org/article/Create_an_epsilon_of_room">epsilon regularisation argument</a> we may take <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> to be smooth, and so by the Lipschitz property we have <a name="flip">
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%26%23124%3B%5Cnabla+f%28x%29%26%23124%3B+%5Cleq+1+%5C+%5C+%5C+%5C+%5C+%285%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#124;&#92;nabla f(x)&#124; &#92;leq 1 &#92; &#92; &#92; &#92; &#92; (5)&amp;fg=000000' title='&#92;displaystyle  &#124;&#92;nabla f(x)&#124; &#92;leq 1 &#92; &#92; &#92; &#92; &#92; (5)&amp;fg=000000' class='latex' /></p>
<p></a> for all <img src='http://s0.wp.com/latex.php?latex=%7Bx%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{x}&amp;fg=000000' title='{x}&amp;fg=000000' class='latex' />. By subtracting off the mean we may assume <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D+f+%3D+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E} f = 0}&amp;fg=000000' title='{{&#92;Bbb E} f = 0}&amp;fg=000000' class='latex' />. By replacing <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{f}&amp;fg=000000' title='{f}&amp;fg=000000' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%7B-f%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{-f}&amp;fg=000000' title='{-f}&amp;fg=000000' class='latex' /> if necessary it suffices to control the upper tail probability <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+P%7D%28+f%28X%29+%5Cgeq+t+%29%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb P}( f(X) &#92;geq t )}&amp;fg=000000' title='{{&#92;Bbb P}( f(X) &#92;geq t )}&amp;fg=000000' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%7Bt+%26%2362%3B+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{t &gt; 0}&amp;fg=000000' title='{t &gt; 0}&amp;fg=000000' class='latex' />.
</p>
<p>
We again use the exponential moment method. It suffices to show that </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5Cexp%28+t+f%28X%29+%29+%5Cleq+%5Cexp%28+C+t%5E2+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;exp( t f(X) ) &#92;leq &#92;exp( C t^2 )&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;exp( t f(X) ) &#92;leq &#92;exp( C t^2 )&amp;fg=000000' class='latex' /></p>
<p> for some absolute constant <img src='http://s0.wp.com/latex.php?latex=%7BC%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C}&amp;fg=000000' title='{C}&amp;fg=000000' class='latex' />. </p>
<p>
Now we use a variant of the <a href="http://www.tricki.org/article/Square_and_rearrange">square and rearrange</a> trick. Let <img src='http://s0.wp.com/latex.php?latex=%7BY%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{Y}&amp;fg=000000' title='{Y}&amp;fg=000000' class='latex' /> be an independent copy of <img src='http://s0.wp.com/latex.php?latex=%7BX%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X}&amp;fg=000000' title='{X}&amp;fg=000000' class='latex' />. Since <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D+f%28Y%29+%3D+0%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E} f(Y) = 0}&amp;fg=000000' title='{{&#92;Bbb E} f(Y) = 0}&amp;fg=000000' class='latex' />, we see from <a href="http://en.wikipedia.org/wiki/Jensen&#037;27s_inequality">Jensen&#8217;s inequality</a> that <img src='http://s0.wp.com/latex.php?latex=%7B%7B%5CBbb+E%7D+%5Cexp%28+-+t+f%28Y%29+%29+%5Cgeq+1%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{{&#92;Bbb E} &#92;exp( - t f(Y) ) &#92;geq 1}&amp;fg=000000' title='{{&#92;Bbb E} &#92;exp( - t f(Y) ) &#92;geq 1}&amp;fg=000000' class='latex' />, and so </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5Cexp%28+t+f%28X%29+%29+%5Cleq+%7B%5CBbb+E%7D+%5Cexp%28+t+%28f%28X%29-f%28Y%29%29+%29.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;exp( t f(X) ) &#92;leq {&#92;Bbb E} &#92;exp( t (f(X)-f(Y)) ).&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;exp( t f(X) ) &#92;leq {&#92;Bbb E} &#92;exp( t (f(X)-f(Y)) ).&amp;fg=000000' class='latex' /></p>
<p> With an eye to exploiting <a href="#flip">(5)</a>, one might seek to use the fundamental theorem of calculus to write
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f%28X%29+-+f%28Y%29+%3D+%5Cint_0%5E1+%5Cfrac%7Bd%7D%7Bd%5Clambda%7D+f%28+%281-%5Clambda%29+Y+%2B+%5Clambda+X+%29%5C+d%5Clambda.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f(X) - f(Y) = &#92;int_0^1 &#92;frac{d}{d&#92;lambda} f( (1-&#92;lambda) Y + &#92;lambda X )&#92; d&#92;lambda.&amp;fg=000000' title='&#92;displaystyle  f(X) - f(Y) = &#92;int_0^1 &#92;frac{d}{d&#92;lambda} f( (1-&#92;lambda) Y + &#92;lambda X )&#92; d&#92;lambda.&amp;fg=000000' class='latex' /></p>
<p> But actually it turns out to be smarter to use a circular arc of integration, rather than a line segment:
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++f%28X%29+-+f%28Y%29+%3D+%5Cint_0%5E%7B%5Cpi%2F2%7D+%5Cfrac%7Bd%7D%7Bd%5Ctheta%7D+f%28+Y+%5Ccos+%5Ctheta+%2B+X+%5Csin+%5Ctheta+%29%5C+d%5Ctheta.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  f(X) - f(Y) = &#92;int_0^{&#92;pi/2} &#92;frac{d}{d&#92;theta} f( Y &#92;cos &#92;theta + X &#92;sin &#92;theta )&#92; d&#92;theta.&amp;fg=000000' title='&#92;displaystyle  f(X) - f(Y) = &#92;int_0^{&#92;pi/2} &#92;frac{d}{d&#92;theta} f( Y &#92;cos &#92;theta + X &#92;sin &#92;theta )&#92; d&#92;theta.&amp;fg=000000' class='latex' /></p>
<p> The reason for this is that <img src='http://s0.wp.com/latex.php?latex=%7BX_%5Ctheta+%3A%3D+Y+%5Ccos+%5Ctheta+%2B+X+%5Csin+%5Ctheta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X_&#92;theta := Y &#92;cos &#92;theta + X &#92;sin &#92;theta}&amp;fg=000000' title='{X_&#92;theta := Y &#92;cos &#92;theta + X &#92;sin &#92;theta}&amp;fg=000000' class='latex' /> is another gaussian random variable equivalent to <img src='http://s0.wp.com/latex.php?latex=%7BN%280%2C1%29%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{N(0,1)^n}&amp;fg=000000' title='{N(0,1)^n}&amp;fg=000000' class='latex' />, as is its derivative <img src='http://s0.wp.com/latex.php?latex=%7BX%27_%5Ctheta+%3A%3D+-Y+%5Csin+%5Ctheta+%2B+X+%5Ccos+%5Ctheta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X&#039;_&#92;theta := -Y &#92;sin &#92;theta + X &#92;cos &#92;theta}&amp;fg=000000' title='{X&#039;_&#92;theta := -Y &#92;sin &#92;theta + X &#92;cos &#92;theta}&amp;fg=000000' class='latex' />; furthermore, and crucially, these two random variables are <em>independent</em>.</p>
<p>
To exploit this, we first use Jensen&#8217;s inequality to bound </p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%5Cexp%28+t+%28f%28X%29+-+f%28Y%29%29%29+%5Cleq+%5Cfrac%7B2%7D%7B%5Cpi%7D+%5Cint_0%5E%7B%5Cpi%2F2%7D+%5Cexp%28+%5Cfrac%7B%5Cpi+t%7D%7B2%7D+%5Cfrac%7Bd%7D%7Bd%5Ctheta%7D+f%28+X_%5Ctheta+%29+%29%5C+d%5Ctheta.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  &#92;exp( t (f(X) - f(Y))) &#92;leq &#92;frac{2}{&#92;pi} &#92;int_0^{&#92;pi/2} &#92;exp( &#92;frac{&#92;pi t}{2} &#92;frac{d}{d&#92;theta} f( X_&#92;theta ) )&#92; d&#92;theta.&amp;fg=000000' title='&#92;displaystyle  &#92;exp( t (f(X) - f(Y))) &#92;leq &#92;frac{2}{&#92;pi} &#92;int_0^{&#92;pi/2} &#92;exp( &#92;frac{&#92;pi t}{2} &#92;frac{d}{d&#92;theta} f( X_&#92;theta ) )&#92; d&#92;theta.&amp;fg=000000' class='latex' /></p>
<p> Applying the chain rule and taking expectations, we have
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5Cexp%28+t+%28f%28X%29+-+f%28Y%29%29%29+%5Cleq+%5Cfrac%7B2%7D%7B%5Cpi%7D+%5Cint_0%5E%7B%5Cpi%2F2%7D+%7B%5CBbb+E%7D+%5Cexp%28+%5Cfrac%7B%5Cpi+t%7D%7B2%7D+%5Cnabla+f%28+X_%5Ctheta+%29+%5Ccdot+X%27_%5Ctheta+%29%5C+d%5Ctheta.%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;exp( t (f(X) - f(Y))) &#92;leq &#92;frac{2}{&#92;pi} &#92;int_0^{&#92;pi/2} {&#92;Bbb E} &#92;exp( &#92;frac{&#92;pi t}{2} &#92;nabla f( X_&#92;theta ) &#92;cdot X&#039;_&#92;theta )&#92; d&#92;theta.&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;exp( t (f(X) - f(Y))) &#92;leq &#92;frac{2}{&#92;pi} &#92;int_0^{&#92;pi/2} {&#92;Bbb E} &#92;exp( &#92;frac{&#92;pi t}{2} &#92;nabla f( X_&#92;theta ) &#92;cdot X&#039;_&#92;theta )&#92; d&#92;theta.&amp;fg=000000' class='latex' /></p>
<p> Let us condition <img src='http://s0.wp.com/latex.php?latex=%7BX_%5Ctheta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X_&#92;theta}&amp;fg=000000' title='{X_&#92;theta}&amp;fg=000000' class='latex' /> to be fixed, then <img src='http://s0.wp.com/latex.php?latex=%7BX%27_%5Ctheta+%5Cequiv+N%280%2C1%29%5En%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X&#039;_&#92;theta &#92;equiv N(0,1)^n}&amp;fg=000000' title='{X&#039;_&#92;theta &#92;equiv N(0,1)^n}&amp;fg=000000' class='latex' />; applying <a href="#flip">(5)</a>, we conclude that <img src='http://s0.wp.com/latex.php?latex=%7B%5Cfrac%7B%5Cpi+t%7D%7B2%7D+%5Cnabla+f%28+X_%5Ctheta+%29+%5Ccdot+X%27_%5Ctheta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;frac{&#92;pi t}{2} &#92;nabla f( X_&#92;theta ) &#92;cdot X&#039;_&#92;theta}&amp;fg=000000' title='{&#92;frac{&#92;pi t}{2} &#92;nabla f( X_&#92;theta ) &#92;cdot X&#039;_&#92;theta}&amp;fg=000000' class='latex' /> is normally distributed with standard deviation at most <img src='http://s0.wp.com/latex.php?latex=%7B%5Cfrac%7B%5Cpi+t%7D%7B2%7D%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{&#92;frac{&#92;pi t}{2}}&amp;fg=000000' title='{&#92;frac{&#92;pi t}{2}}&amp;fg=000000' class='latex' />. As such we have
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle++%7B%5CBbb+E%7D+%5Cexp%28+%5Cfrac%7B2t%7D%7B%5Cpi%7D+%5Cnabla+f%28+X_%5Ctheta+%29+%5Ccdot+X%27_%5Ctheta+%29+%5Cleq+%5Cexp%28+C+t%5E2+%29%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;displaystyle  {&#92;Bbb E} &#92;exp( &#92;frac{2t}{&#92;pi} &#92;nabla f( X_&#92;theta ) &#92;cdot X&#039;_&#92;theta ) &#92;leq &#92;exp( C t^2 )&amp;fg=000000' title='&#92;displaystyle  {&#92;Bbb E} &#92;exp( &#92;frac{2t}{&#92;pi} &#92;nabla f( X_&#92;theta ) &#92;cdot X&#039;_&#92;theta ) &#92;leq &#92;exp( C t^2 )&amp;fg=000000' class='latex' /></p>
<p> for some absolute constant <img src='http://s0.wp.com/latex.php?latex=%7BC%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{C}&amp;fg=000000' title='{C}&amp;fg=000000' class='latex' />; integrating out the conditioning on <img src='http://s0.wp.com/latex.php?latex=%7BX_%5Ctheta%7D%26%2338%3Bfg%3D000000&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='{X_&#92;theta}&amp;fg=000000' title='{X_&#92;theta}&amp;fg=000000' class='latex' /> we obtain the claim. </p>
]]></content:encoded>
</item>

</channel>
</rss>
