<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>theorem &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/theorem/</link>
	<description>Feed of posts on WordPress.com tagged "theorem"</description>
	<pubDate>Thu, 31 Dec 2009 03:12:15 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[Idiosyncrasy in “conjectures”]]></title>
<link>http://adonis49.wordpress.com/2009/12/23/idiosyncrasy-in-%e2%80%9cconjectures%e2%80%9d/</link>
<pubDate>Wed, 23 Dec 2009 08:41:28 +0000</pubDate>
<dc:creator>adonis49</dc:creator>
<guid>http://adonis49.wordpress.com/2009/12/23/idiosyncrasy-in-%e2%80%9cconjectures%e2%80%9d/</guid>
<description><![CDATA[Idiosyncrasy in “conjectures”; (Dec. 21, 2009)             Idiosyncrasy or cultural bias related to ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><strong>Idiosyncrasy in “conjectures”; (Dec. 21, 2009)</strong></p>
<p>            Idiosyncrasy or cultural bias related to “common sense” behavior (for example, preferential priorities in choices of values, belief systems, and daily habits) is not restricted among different societies: it can be found within one society, even within what can be defined as “homogeneous restricted communities” ethnically, religiously, common language, gender groups, or professional disciplines. Most disciplines have mushroomed into cults.</p>
<p>            A cult is any organization that creates its own nomenclature and definition of terms to be distinguished from the other cults in order to acquiring recognition as a “professional entity” or independent disciplines that should benefit from laws of special minorities (when mainly it is a matter of generating profit or doing business as usual). These cults want to owe the un-initiated into believing that they have serious well developed methods or excellent comprehension of a restricted area in sciences. The initiated on multidisciplinary knowledge recognize that the methods of any cult are old and even far less precise or developed; that the terms are not new and there are already analogous terms in other disciplines that are more accurate and far better defined.</p>
<p>            Countless experiments have demonstrated various kinds of idiosyncrasies.  This article is oriented toward “cult” kinds of orders, organization, and professional discipline.  My first post is targeting the order of mathematicians; the next article will focus on experiments.</p>
<p>            Mathematics which is used as meaning “sure study” (wisekunde) has no reliable historical documentation. Most of mathematical concepts were written many decades or centuries after they were “floating around” among mathematicians.  Mathematics is confusing with its array of nomenclature. What are the differences among axiom, proposition, lemma, postulate, or conjecture?  What are the differences among the terms, theorem, questions, problems, hypothesis, corollary, and again conjecture?  For example, personally, I feel that axiom is mostly recurrent in geometry, lemma in probability, hypothesis in analytical procedures, and conjecture in algebraic deductive reasoning.</p>
<p>            Hypothesis is in desuetude in mathematics. For example, Newton said “I am not making a hypothesis”: Socrates made fun of this term by explaining how it was understood “I designate hypothesis what people doing geometry use to treating a question.  For example, when asked for their “expert opinion” they reply: “I still cannot confirm but I think that if I have a viable hypothesis for this problem and if it is the following hypothesis… then I think that we may draw a conclusion. If we have another hypothesis then another conclusion is more valid.”  Plato said: “As long as mathematics start from hypothesis instead of facts then we do not think that they have true comprehension since they are not going back to fundamentals”</p>
<p>            Hypothesis is still the main term used in experimental research. Theoretically, an experiment is not meant to accept a hypothesis as true or valid but simply “Not to reject it” if the relationships among the manipulated variables are “statistically significant” to a pre-determined level, usually 5% in random errors. Many pragmatic scientific researchers don’t care about the fine details in theoretical mathematical concepts and tend to adopt a hypothesis that was not rejected as law.  This is one case of idiosyncrasy when the researcher wants badly the “non-rejected” hypothesis to represent is view. Generally, an honest experimenter has to repeat the experiment or encourage someone else to generalize the results by studying more variables.</p>
<p>            Conjecture means (throwing in together) and can be translated as conclusion or deduction; basically, it is an opinion or supposition based on insufficient proofs. In the last century, conjectures were exposed in writing as promptly as possible instead of keeping them floating ideas, concepts, or probable theorems. This new behavior of writing conjectures was given the rationale that “plausible reasoning” is a set of suppositions thrown around as questions mathematicians guess they have answers to them but are unable to demonstrate temporarily.</p>
<p>            The term conjecture has been used so freely in the last decades that Andre Weil warned that “current mathematicians use the term conjecture when they fail after a few attempts to verify a concept, even if the problem is of no importance.”  David Kazhdan ironically warned that this practice of enunciating conjectures might turn out like a 5-year Soviet plan.</p>
<p>            At first, a set of conjectures was meant to be the basic structure for a theorem or precise assertions that were temporarily used in the trading of logical discussions. Thus, conjectures permit the construction of rigorous deductions that are accessible to direct testing of their validity. A conjecture was a “research program” that move ahead in order to foresee the explored domain.</p>
<p>            Consequently, conjecture is kind of extending a name and an address to a set of suppositions and analogies for a concept long before tools and methods are created to approach directly the problem. A “Problem” designates a mental task submitted to the audience or targeted for research or project; usually, the set of problems lead to demonstrating a general theorem. Many problems are in fact conjectures such as the problem of twin primary numbers that consists of proving the existence of an infinity of coupled numbers such that p-q = 2.</p>
<p>            One of the explanations for using freely the term conjecture is the modern facility of mathematicians of discriminating aspects of uncertainty at the theoretical level. It is an acquired habit, an idiosyncrasy. Thus, for a mathematician to state a conjecture he must have solved many particular cases and recognize that a research program is needed to developing special tools for demonstrating the conjecture.  This is a tough restriction in this age where time is of essence among millions of mathematicians competing for prizes.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[The Earnest Inquisition Of Obfuscation Bound By The Theorhetical Declaration Of The Theorem Derived From The Uniformed Chaos Axiom]]></title>
<link>http://fidelgonzales.wordpress.com/2009/12/05/the-earnest-inquisition-of-obfuscation-bound-within-the-theorhetical-declaration-of-the-theorem-derived-from-the-uniformed-chaos-axiom/</link>
<pubDate>Sat, 05 Dec 2009 07:58:47 +0000</pubDate>
<dc:creator>fidelgonzales</dc:creator>
<guid>http://fidelgonzales.wordpress.com/2009/12/05/the-earnest-inquisition-of-obfuscation-bound-within-the-theorhetical-declaration-of-the-theorem-derived-from-the-uniformed-chaos-axiom/</guid>
<description><![CDATA[Merely a hack who has rarely had the patience to sit through elongated oratory or literary dissertat]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Merely a hack who has rarely had the patience to sit through elongated oratory or literary dissertations in my quest to conceptualize and even capture the crux of creation through the language of mathematics and physics, I have found it gravely difficult, during my years of formal education, to await the necessarily standardized absorption by which mundane knowledge is methodically received. Boredom becomes quite captivating and even the most minute segments of data sets are set aside for in depth examination by which to formulate extraordinary principles both simplistic yet paradoxically profound. It seems truth tends to tangle in the arena of paradox (James 1:25).</p>
<p>There ultimately arrived a time by which I had to toss my systematic quest to conquer the principalities of physics to a far more poetically achievable educational genre. I therefore relented from the intensity of such studies into simpler tasks such as English, art and photography. History has likewise served such a capacity but to a lesser extent, since its process is more an abstract yet more tangible life application of the aforementioned principalities, which lends to its own enthusiastic arrangement of frustrations and fascinations. All the while, that which is exhibited within the confines of the Holy Bible has wallowed divinely in my midst throughout such eras, serving as an evermore acknowledged theme by which one in any such endeavor ought to engage with greater consideration, as its power of principalities provides a foundation by which wisdom is exponentially jettisoned to all who earnestly partake. Examine it as hereby stated and the wonders of the universe, whether academic or even spiritual, becoming exceedingly awakened unto your eyes and afforded the confluence of axioms.</p>
<p>Anyhow, I loved science and found great excitement in at least a few of them throughout my junior high and high school years. And later in college, math class, just as science, while it was a love, was likewise a match for my focus. Some facet would always seem to snag my interest in such a way that captivated me, leading me to overthrow the lesson for a gaping reception into the far too fundamentally apparent finiteness of one simplistic element.</p>
<p>Perhaps my greatest issue with the confines of formal education is its formalities of indoctrination, since it teaches merely to know than to learn, and therefore, my zealously fascinating manner of inquisition into easily accepted principalities disrupts well trained technicians of academic dissemination who themselves have very little understanding.</p>
<p>There have been several, perhaps many, instructors who have found my inquiry to be irritating to the point where vehemence erupts from their frustration. Likely, it is that their understanding of such principles bound within their chosen field of study is so light that immersion becomes frightening, to the point by which they attempt to maintain surface tension, yet cannot, for they have yet to achieve perfection.</p>
<p>Anyhow, there have been others who have found my earnest inquiry to be considerably intriguing and have engaged greatly. From them, I have learned much and have been equally challenged to such extents, exponentially jettisoned, just as iron sharpens iron. For when word becomes word, so goes the saber of triumph.</p>
<p>And so there I was. I had failed to turn in several weeks of assignments, stuck and even failing asleep on a series of formulas. While I had figured them is such a learned way, I envisioned yet another way, comparative to the difference of a casual observer who meanders through a garden on his way to his next class and that of a ballerina become the very beauty abound amid her arena. Then, I entered class to turn in homework from several weeks back, and as the class proceeded on several problems the instructor issued to buy him time to grade the evenings homework, he later landed upon mine near the end of class.</p>
<p>Looking up my way in a cross and unsettling manner: &#8220;This is late. And what is this (no question mark for it was not a question). This is not done right. This is unacceptable. Did you even read the book.&#8221;</p>
<p>I walked up. Observed what I knew to be in question. He continued. I watched. And might I mentioned I was also quite rebellious and had considerable disdain for those are fierce for the letter of the law but who have very little consideration for the spirit of the law. He was quite disrespectful in his way and even authoritarian in his astute posture. For this, I found fancy in foiling his stature.</p>
<p>&#8220;Yes. I read the chapter multiple times over.&#8221;</p>
<p>He became exacerbated, particularly since I offered no other explanation at anytime during the long pause for silence.</p>
<p>&#8220;I can&#8217;t have this. I can&#8217;t accept this. This is just wrong.&#8221;</p>
<p>&#8220;No.&#8221; I replied. My retort was accurate. It was right.</p>
<p>&#8220;No what?&#8221;</p>
<p>&#8220;The answer is right. Look upon your cheat sheet. It correctly matches what you have there. Therefore, according to your very own means of judgment, your readily accepted and utterly proclaimed sense of right and wrong, it is right.&#8221;</p>
<p>&#8220;No. It is wrong. You did it all wrong.&#8221;</p>
<p>At this point, the diatribe drew upon the attention of the class, which was acutely focused upon the dance of an arrogant subset of individuals. I stood. The instructor sat half in his seat, while the other half worked to rise in frustration, except for the tactical advantage of an underling who literally stood above him. To escape the sunken seat of the throne at that very moment would have only worked to his disadvantage, divulging the inferiority of his physical position, as a rabbit who uneasily waits behind the bushes as a hunter passes, hoping he who seeks blood has no clue of the thinly veiled vulnerability that resides easily within reach.</p>
<p>&#8220;If you are now saying the answer is wrong, then your theorem is likewise wrong, for it renders the same result. But if you are saying that my methodology is wrong, then you are wrong, for it renders the very same result as does your readily accepted methodology, yet you are unwilling to accept it, since you fear what you fail to consider.&#8221;</p>
<p>Just then, during the prelude to the climax, the clock struck the hour and officially called to a close the class, but even still, the encore was yet to unfold, and therefor kept at bay the bulk of the class.</p>
<p>Now, being that I in certain terms had just turned over the desk from which he dictated, the repercussion was on the verge of volleying grave result. Therefore, an intersession accompanied by a well-announced olive branch was boastfully interjected in such a way that the Little Hero Of Holland held at bay a great mass of water with merely a finger. And likewise amid the moment, the class teetered upon the crux of implosion.</p>
<p>&#8220;Okay. If you find that the methodology does not work according to the stated result, which equals that stated in the book, then, you may rightfully fail me for the remainder of the class. But should the methodology work out correctly, according to the result so stated, then it will be graded as such.&#8221;</p>
<p>Going merely by the memory of some 15 years ago, I recall him officially calling the class to a close by dismissing us.</p>
<p>The next class, as he was handing back papers, perhaps during a test, he quietly called my name to receive my graded paper.</p>
<p>&#8220;I don&#8217;t know how you did it. I have never seen this before. I even brought it to the attention of the department faculty. It seems you have found a new theorem. Good work.&#8221;</p>
<p>I walked away and left it at that. My grade for the class was somewhere around a C- for that class. I don&#8217;t blame the guy. Between my frequent surfing trips to Baja and inconsistent homework, even a C- may have been undeserved.</p>
<p>Perhaps one day I will share just how daunting the study of literature and moreover writing courses were and how one college professor struck a sword through such difficulties, severing my ties with such trying tasks by delivering me unto understanding, as is referenced in 1 Peter 4:8 (NIV), where is said: &#8220;Above all, love each other deeply, because love covers over a multitude of sins.&#8221;</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Three-sided story ...]]></title>
<link>http://grammarcops.wordpress.com/2009/11/23/three-sided-story/</link>
<pubDate>Tue, 24 Nov 2009 04:33:55 +0000</pubDate>
<dc:creator>grammarcops</dc:creator>
<guid>http://grammarcops.wordpress.com/2009/11/23/three-sided-story/</guid>
<description><![CDATA[This post is based on a true story &#8230; really! There was an elderly, widowed 9th grade algebra t]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><img class="alignleft size-full wp-image-2037" title="Triangle lady 1" src="http://grammarcops.wordpress.com/files/2009/11/triangle-lady-1.jpg" alt="" width="235" height="301" /><strong><em>This post is based on a true story &#8230; really!</em></strong></p>
<p><span style="color:#333333;">There was an elderly, widowed 9th grade <strong>algebra</strong> teacher in Alabama, who, as a result of the years and the students she had to put up with, became convinced that she was an <strong>equilateral triangle</strong>.  When she was <strong>cornered</strong>, she always looked for a new <strong>angle</strong>, in her own way of <strong>circular</strong> thinking she could never <strong>measure</strong> up. She treated everyone <strong>equilaterally</strong>, until someone called her a <strong>square</strong>. Although this may have been an <strong>acute</strong> observation, it was not <strong>right</strong>. The <strong>Pythagorean</strong> who made this accusation did not realize that it was actually an <strong>obtuse</strong> <strong>theorem</strong>. The teacher was put on a <strong>plane</strong>, sent away and committed for her <strong>polygonous </strong>belief. Many thought that being institutionalized would <strong>scalene</strong> down her <strong>parallelanoia</strong>, however, it must have been <strong>geometrically</strong> impossible to <strong>solve</strong> that <strong>equation</strong>. To <strong>sum</strong> up &#8230; there is good news and bad news. The good news is that this teacher was cured of believing she was a <strong>triangle</strong>. The bad news: in a <strong>special</strong>, <strong>isoscelestic</strong>, <strong>180°</strong> turn, she now thinks she is trapped inside a <strong>triangle</strong>. Does that mean that she is <strong>trapezoid</strong>?</span></p>
<p><em>Note: Puns have been added to protect the insanely and algebraically innocent.</em></p>
<p><em>Artwork by Lauren.</em></p>
<p><a href="http://twitter.com/home?status=Three-sided story ... http://wp.me/puvTW-wP"><img src="http://grammarcops.wordpress.com/files/2009/06/tt-twitter-big4.png" border="0" alt="Tweet Me from http://grammarcops.wordpress.com" /></a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 13: the fundamental theorem of arithmetic]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/11/21/theorem-13-the-fundamental-theorem-of-arithmetic/</link>
<pubDate>Sat, 21 Nov 2009 22:35:16 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/11/21/theorem-13-the-fundamental-theorem-of-arithmetic/</guid>
<description><![CDATA[I&#8217;ve mentioned the fundamental theorem of arithmetic a few times on this blog, so it&#8217;s p]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I&#8217;ve mentioned the fundamental theorem of arithmetic a few times on this blog, so it&#8217;s probably time for it to be a theorem of the week.  I&#8217;ll state it straightaway.</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (the fundamental theorem of arithmetic</span></strong> Every integer greater than 1 can be expressed as a product of primes, uniquely up to reordering the factors.</p></blockquote>
<p>For example, <img src='http://l.wordpress.com/latex.php?latex=12+%3D+2%5Ctimes+2%5Ctimes+3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='12 = 2\times 2\times 3' title='12 = 2\times 2\times 3' class='latex' />, and while I can also write it as <img src='http://l.wordpress.com/latex.php?latex=12+%3D+2%5Ctimes+3%5Ctimes+2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='12 = 2\times 3\times 2' title='12 = 2\times 3\times 2' class='latex' />, this is just reordering the factors &#8212; it&#8217;s not a genuinely different factorisation.  Prime numbers also have prime factorisations: the prime factorisation of 17 is simply 17.</p>
<p>Note that the fundamental theorem of arithmetic is one good reason why it&#8217;s convenient to define 1 not to be a prime.  If it were prime, we could add as many factors of 1 as we liked to the prime factorisation of a number to get lots of different (but not interestingly different) factorisations.</p>
<p>This is a really important theorem &#8212; that&#8217;s why it&#8217;s called &#8220;fundamental&#8221;!  It tells us two things: existence (there <em>is</em> a prime factorisation), and uniqueness (the prime factorisation is unique).  Both parts are useful in all sorts of places.  The existence part is useful because it tells us that the primes are somehow the &#8220;building blocks&#8221; of the integers, and this helps with lots of things.  The uniqueness part is useful because it allows us to do certain things that would otherwise not be possible.  For example, if we know the prime factorisation of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />, then we know the prime factorisation of <img src='http://l.wordpress.com/latex.php?latex=n%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^2' title='n^2' class='latex' />, safe in the knowledge that <img src='http://l.wordpress.com/latex.php?latex=n%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^2' title='n^2' class='latex' /> can&#8217;t also have some other prime factorisation.</p>
<p><!--more--></p>
<h3>A useful lemma</h3>
<p>Before I tell you about a proof of this theorem, there&#8217;s a preliminary result that we&#8217;ll need.  It&#8217;s jolly useful in all sorts of places.</p>
<blockquote><p><strong><span style="text-decoration:underline;"><a title="Wikipedia definition of a lemma" href="http://en.wikipedia.org/wiki/Lemma_(mathematics)">Lemma</a></span></strong> Let <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> be a prime, and suppose that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> divides the product <img src='http://l.wordpress.com/latex.php?latex=ab&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='ab' title='ab' class='latex' />.  Then <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />.</p></blockquote>
<p>(As usual in mathematics, the &#8220;or&#8221; here is inclusive, in the sense that we could have <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> dividing both <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />.)</p>
<p>This really is a property of primes.  For example, 6 divides <img src='http://l.wordpress.com/latex.php?latex=3%5Ctimes+4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3\times 4' title='3\times 4' class='latex' />, but 6 divides neither 3 nor 4.  (Intuitively, I like to think that this is because we can break up 6 into <img src='http://l.wordpress.com/latex.php?latex=2%5Ctimes+3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2\times 3' title='2\times 3' class='latex' />, and the factors here can divides the factors 3 and 4 separately.)  In fact, this property is sometimes taken as the <em>definition</em> of a prime number.  In that case, one would then show from this definition that a prime number can be divided only by itself and 1.  But since most people are more familiar with the definition of a prime as a number divisible only by itself and 1, I&#8217;m going to stick with that definition and show you how to prove the lemma.  I&#8217;m going to use <a title="Theorem 1: Bezout's theorem" href="http://theoremoftheweek.wordpress.com/2009/07/27/theorem-1-bezouts-theorem/">Bézout&#8217;s theorem</a>, so if you don&#8217;t know that result you might like to read about it now (and then come back here afterwards!).</p>
<p><strong><span style="text-decoration:underline;">Proof of lemma</span></strong> We have a prime <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />, and we know that it divides the product <img src='http://l.wordpress.com/latex.php?latex=ab&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='ab' title='ab' class='latex' />.  Let&#8217;s suppose that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> doesn&#8217;t divide <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />.  Then our goal will be to show that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />.</p>
<p>Now if <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> doesn&#8217;t divide <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> are coprime.  As usual, we try using <a title="Theorem 1: Bezout's theorem" href="http://theoremoftheweek.wordpress.com/2009/07/27/theorem-1-bezouts-theorem/">Bézout&#8217;s theorem</a> to turn this into a positive statement: there are integers <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=ph+%2B+ak+%3D+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='ph + ak = 1' title='ph + ak = 1' class='latex' />.</p>
<p>We know that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=ab&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='ab' title='ab' class='latex' />, so we should try to get something containing <img src='http://l.wordpress.com/latex.php?latex=ab&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='ab' title='ab' class='latex' />.  Let&#8217;s multiply our equation by <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />: we get <img src='http://l.wordpress.com/latex.php?latex=bph+%2B+abk+%3D+b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='bph + abk = b' title='bph + abk = b' class='latex' />.</p>
<p>Now <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> divides each term on the left-hand side, so it must divide the right-hand side.  That is, <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />, as we wanted.</p>
<h3>Proof of the theorem</h3>
<p>Now we are well placed to tackle the proof of our main theorem, the fundamental theorem of arithmetic.  We&#8217;ll do the two parts (existence and uniqueness) separately.</p>
<p><strong><span style="text-decoration:underline;">Existence</span></strong> We need to show that every integer greater than 1 has a prime factorisation.  To do this, we shall use the idea of a <a title="Wikipedia page about minimal counterexamples" href="http://en.wikipedia.org/wiki/Minimal_counterexample">minimal counterexample</a>.  If the result isn&#8217;t true, then there must be a smallest number that doesn&#8217;t have a prime factorisation.  Let&#8217;s call it <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />.</p>
<p>Now <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> can&#8217;t be prime, because if <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> were prime then it would have a prime factorisation: <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> itself.  So <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is composite.  That is, it is a product of two smaller numbers, say <img src='http://l.wordpress.com/latex.php?latex=n%3Dab&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n=ab' title='n=ab' class='latex' />.</p>
<p>Then <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' /> are less than <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is the smallest number not having a prime factorisation, so <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' /> must have prime factorisations.  But then <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> has a prime factorisation: simply multiply those of <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />.</p>
<p>So there can&#8217;t be a counterexample, so the result must be true.</p>
<p><strong><span style="text-decoration:underline;">Uniqueness</span></strong> Now our job is to show that every integer greater than 1 has a <em>unique</em> prime factorisation.</p>
<p>So let&#8217;s suppose that we have a number <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> with two factorisations: <img src='http://l.wordpress.com/latex.php?latex=n+%3D+p_1+%5Cdotsm+p_k+%3D+q_1+%5Cdotsm+q_l&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n = p_1 \dotsm p_k = q_1 \dotsm q_l' title='n = p_1 \dotsm p_k = q_1 \dotsm q_l' class='latex' />.   Here <img src='http://l.wordpress.com/latex.php?latex=p_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_1' title='p_1' class='latex' />, &#8230;, <img src='http://l.wordpress.com/latex.php?latex=p_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_k' title='p_k' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=q_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_1' title='q_1' class='latex' />, &#8230;, <img src='http://l.wordpress.com/latex.php?latex=q_l&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_l' title='q_l' class='latex' /> are primes, not necessarily distinct (we can have <img src='http://l.wordpress.com/latex.php?latex=p_1+%3D+p_3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_1 = p_3' title='p_1 = p_3' class='latex' /> or whatever).  Our aim is to show that in fact these two factorisations are the same. </p>
<p>If any <img src='http://l.wordpress.com/latex.php?latex=p_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_i' title='p_i' class='latex' /> is equal to some <img src='http://l.wordpress.com/latex.php?latex=q_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_j' title='q_j' class='latex' />, then we can cancel those factors.  Let&#8217;s do that wherever possible.  We hope that in fact we  now have no primes left on either side, because then we&#8217;ll know that the two factorisations were the same (they contained exactly the same primes).  So let&#8217;s suppose that isn&#8217;t the case and try to obtain a contradiction.</p>
<p>We have something like <img src='http://l.wordpress.com/latex.php?latex=p_1+%5Cdotsm+p_r+%3D+q_1+%5Cdotsm+q_s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_1 \dotsm p_r = q_1 \dotsm q_s' title='p_1 \dotsm p_r = q_1 \dotsm q_s' class='latex' />, where each <img src='http://l.wordpress.com/latex.php?latex=p_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_i' title='p_i' class='latex' /> is not equal to any <img src='http://l.wordpress.com/latex.php?latex=q_j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_j' title='q_j' class='latex' />.</p>
<p>Now <img src='http://l.wordpress.com/latex.php?latex=q_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_1' title='q_1' class='latex' /> divides the right-hand side, so it must divide the left-hand side too.  Using our lemma repeatedly, we see that means that <img src='http://l.wordpress.com/latex.php?latex=q_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_1' title='q_1' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=p_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_i' title='p_i' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' />.  But <img src='http://l.wordpress.com/latex.php?latex=q_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_1' title='q_1' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=p_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_i' title='p_i' class='latex' /> are both prime, so this says that <img src='http://l.wordpress.com/latex.php?latex=q_1%3Dp_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q_1=p_i' title='q_1=p_i' class='latex' /> &#8212; and we&#8217;ve assumed that not to be the case.</p>
<p>This gives the contradiction we wanted, so the factorisation is indeed unique.</p>
<h3>Further reading</h3>
<p>Of course, Wikipedia has a page about the <a title="Wikipedia page about the fundamental theorem of arithmetic" href="http://en.wikipedia.org/wiki/Fundamental_theorem_of_arithmetic">fundamental theorem of arithmetic</a>.  The theorem should be described in any decent introduction to number theory.  As usual, I shall mention Davenport&#8217;s <em>The Higher Arithmetic</em> at this stage.</p>
<p>It turns out that there are settings (number fields) in which one can do number theory, but where <a title="Wikipedia page about unique factorisation in number fields" href="http://en.wikipedia.org/wiki/Number_field#Unique_factorization_and_class_number">one does not necessarily have unique factorisation</a>.  This leads to a lot of extremely interesting mathematics &#8212; perhaps I shall return to it in a future theorem of the week.  There&#8217;s so much interesting maths out there!</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 12: there are infinitely many primes]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/11/01/theorem-12-there-are-infinitely-many-primes/</link>
<pubDate>Sun, 01 Nov 2009 20:09:47 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/11/01/theorem-12-there-are-infinitely-many-primes/</guid>
<description><![CDATA[As you might have already gathered from this blog, the prime numbers are pretty key in mathematics, ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>As you might have already gathered from this blog, the <a title="Glossary entry for prime numbers" href="http://theoremoftheweek.wordpress.com/glossary/#Prime_number">prime numbers</a> are pretty key in mathematics, and in particular in number theory (roughly speaking, the study of the whole numbers).  (And since I&#8217;m a number theorist, I think they&#8217;re pretty cool!)  The fundamental theorem of arithmetic (which says that every positive integer has a <a title="Glossary entry for prime factorisation" href="http://theoremoftheweek.wordpress.com/glossary/#Prime_factorisation">unique prime factorisation</a>) tells us that they are the building blocks of the integers (if one builds by multiplying).  And Goldbach&#8217;s conjecture (if true) and <a title="Vinogradov's three primes theorem" href="http://theoremoftheweek.wordpress.com/2009/08/10/theorem-3-vinogradovs-three-primes-theorem/">Vinogradov&#8217;s three primes theorem</a> tell us interesting things about how the primes behave when we add them.  But this all leaves some pretty fundamental questions unanswered.  For example, how many primes are there?</p>
<p>It turns out that one can be fairly precise about how many primes there are, thanks to the prime number theorem (which I hope to make a future theorem of the week).  For now, though, I&#8217;d like to discuss a simpler question: are there infinitely many primes?</p>
<p><!--more--></p>
<p>It turns out that there <em>are </em>infinitely many primes.  The Greek mathematician <a title="MacTutor biography of Euclid" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Euclid.html">Euclid</a> gave a famous proof of this fact, and I&#8217;d like to describe that proof here.  But it&#8217;s not the only proof; I&#8217;ll also mention a proof by <a title="MacTutor biography of Euler" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Euler.html">Euler</a> that involves some rather interesting mathematics.  Before that, I&#8217;ll state the theorem in a way that hopefully stands out!</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem</span></strong> There are infinitely many primes.</p></blockquote>
<p>So, how could one go about proving a result like this?  One important thing to notice is that we&#8217;re not trying to give a formula for the primes.  It&#8217;s enough to prove the existence of infinitely many primes; we don&#8217;t have to give any way of listing them, or anything like that.</p>
<h3>Euclid&#8217;s proof</h3>
<p>Euclid&#8217;s first idea was (in the modern way of presentation &#8211; he didn&#8217;t phrase it exactly like this) to use <a title="Glossary entry for proof by contradiction" href="http://theoremoftheweek.wordpress.com/glossary/#Proof_by_contradiction">proof by contradiction</a>.  We are trying to prove that there is not a largest prime, and this is exactly the sort of (negative) statement for which proof by contradiction is so helpful.</p>
<p>So let&#8217;s suppose that there <em>is</em> a largest prime (and we then hope to make some deductions that lead to a contradiction).  So we can write down all the primes, in a finite list.  Say they&#8217;re <img src='http://l.wordpress.com/latex.php?latex=p_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_1' title='p_1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=p_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_2' title='p_2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=p_3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_3' title='p_3' class='latex' />, &#8230;, <img src='http://l.wordpress.com/latex.php?latex=p_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_k' title='p_k' class='latex' />.  So those are all the primes in the world.</p>
<p>Now comes Euclid&#8217;s second good idea, to generate a prime that isn&#8217;t on that list.  He created a new number, <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' />, say, by multiplying together all those primes and adding <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />.  That is, <img src='http://l.wordpress.com/latex.php?latex=N+%3D+p_1+p_2+p_3+%5Cdotsm+p_k+%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N = p_1 p_2 p_3 \dotsm p_k +1' title='N = p_1 p_2 p_3 \dotsm p_k +1' class='latex' />.</p>
<p>What&#8217;s so special about this number?  Well, it is a positive integer and so must have a prime factor.  (This is a fact about positive integers that one has to prove.  One can, for example, use <a title="Glossary entry for proof by induction" href="http://theoremoftheweek.wordpress.com/glossary/#Proof_by_induction">induction</a>.)  But it cannot be divisible by any of the primes <img src='http://l.wordpress.com/latex.php?latex=p_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_i' title='p_i' class='latex' />.  Indeed, <img src='http://l.wordpress.com/latex.php?latex=p_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_i' title='p_i' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=p_1+p_2+p_3+%5Cdotsm+p_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_1 p_2 p_3 \dotsm p_k' title='p_1 p_2 p_3 \dotsm p_k' class='latex' />.  So if <img src='http://l.wordpress.com/latex.php?latex=p_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_i' title='p_i' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' />, then it also divides <img src='http://l.wordpress.com/latex.php?latex=1+%3D+N+-+p_1+p_2+p_3+%5Cdotsm+p_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1 = N - p_1 p_2 p_3 \dotsm p_k' title='1 = N - p_1 p_2 p_3 \dotsm p_k' class='latex' /> &#8212; and that is clearly not possible.</p>
<p>So we have found a prime number (a prime factor of <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' />) that is not any of our primes <img src='http://l.wordpress.com/latex.php?latex=p_1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_1' title='p_1' class='latex' />, &#8230;, <img src='http://l.wordpress.com/latex.php?latex=p_k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_k' title='p_k' class='latex' />.  (Note that I&#8217;m not saying that <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' /> itself is prime, but only that it has a prime factor.)  But that contradicts our assumption that our list contained all prime numbers.  That is the contradiction we were seeking, so our assumption (that there are only finitely many primes) must have been wrong.  That is, there are infinitely many primes.</p>
<p>One can use similar ideas to show that there are infinitely many primes of the form <img src='http://l.wordpress.com/latex.php?latex=4n-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4n-1' title='4n-1' class='latex' />.  One has to work slightly harder, and one has to consider a slightly different number (<img src='http://l.wordpress.com/latex.php?latex=N+%3D+4p_1+p_2+%5Cdotsm+p_k+-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N = 4p_1 p_2 \dotsm p_k -1' title='N = 4p_1 p_2 \dotsm p_k -1' class='latex' /> works well), but the ideas are broadly similar &#8212; you might like to think about the details.</p>
<h3>Euler&#8217;s proof</h3>
<p>I&#8217;m not going to give you all the details of Euler&#8217;s proof, for reasons that may become clear in a moment, but I&#8217;d like to give you the key idea.  Euler considered the sum <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bp%5Ctextrm%7B+prime%7D%7D+%5Cfrac%7B1%7D%7Bp%7D+%3D+%5Cfrac%7B1%7D%7B2%7D+%2B+%5Cfrac%7B1%7D%7B3%7D+%2B+%5Cfrac%7B1%7D%7B5%7D+%2B+%5Cfrac%7B1%7D%7B7%7D+%2B+%5Cfrac%7B1%7D%7B11%7D+%2B+%5Cfrac%7B1%7D%7B13%7D+%2B+%5Cdotsb&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{p\textrm{ prime}} \frac{1}{p} = \frac{1}{2} + \frac{1}{3} + \frac{1}{5} + \frac{1}{7} + \frac{1}{11} + \frac{1}{13} + \dotsb' title='\sum_{p\textrm{ prime}} \frac{1}{p} = \frac{1}{2} + \frac{1}{3} + \frac{1}{5} + \frac{1}{7} + \frac{1}{11} + \frac{1}{13} + \dotsb' class='latex' />.  (That is, it is the sum of the reciprocals of all the primes.)</p>
<p>Euler was able to prove that this sum diverges.  That is, it is infinity.  For comparison, you might like to know that the sum of the reciprocals of the natural numbers, <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bn%5Cgeq+1%7D+%5Cfrac%7B1%7D%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{n\geq 1} \frac{1}{n}' title='\sum_{n\geq 1} \frac{1}{n}' class='latex' />, called the <em><a title="Wikipedia page about the harmonic series" href="http://en.wikipedia.org/wiki/Harmonic_series_(mathematics)">harmonic series</a></em>, diverges, but the <a title="Wikipedia page about the sum of the reciprocals of the squares" href="http://en.wikipedia.org/wiki/Basel_problem">sum of the reciprocals of the squares</a>, <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bn%5Cgeq+1%7D+%5Cfrac%7B1%7D%7Bn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{n\geq 1} \frac{1}{n^2}' title='\sum_{n\geq 1} \frac{1}{n^2}' class='latex' />, converges &#8212; it is <img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7B%5Cpi%5E2%7D%7B6%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{\pi^2}{6}' title='\frac{\pi^2}{6}' class='latex' />.  (The alert reader will notice that once one knows that the sum of the reciprocals of the primes diverges, one immediately knows that the harmonic series diverges, because it&#8217;s bigger.  But it&#8217;s also a lot easier to prove directly that the harmonic series diverges.)  I&#8217;m not going to tell you about how to prove that the sum of the reciprocals of the primes diverges; that&#8217;s the bit of the proof that I&#8217;m skipping.  If you&#8217;re interested, you could look at <a title="Wikipedia page about the divergence of the sum of the reciprocals of the primes" href="http://en.wikipedia.org/wiki/Proof_that_the_sum_of_the_reciprocals_of_the_primes_diverges">Wikipedia</a>.</p>
<p>Why does the divergence of this sum prove that there are infinitely many primes?  Well, suppose that there are only finitely many primes.  Then the sum <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bp%5Ctextrm%7B+prime%7D%7D+%5Cfrac%7B1%7D%7Bp%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{p\textrm{ prime}} \frac{1}{p}' title='\sum_{p\textrm{ prime}} \frac{1}{p}' class='latex' /> is a sum of finitely many things, so it is finite.  But we know that the sum isn&#8217;t finite, so there must be infinitely many primes.</p>
<p>Once one has done the work to show the divergence of the sum of the reciprocals of the primes, the proof that there are infinitely many primes is very easy.  In fact, the divergence of the sum gives more information: it gives some information about how many primes there are.  For example, informally &#8220;there are more primes than squares&#8221;, because the sum of the reciprocals of the squares converges.</p>
<p>These are far from being the only proofs that there are infinitely many primes.  You can read about some others on <a title="Wikipedia page about the number of primes" href="http://en.wikipedia.org/wiki/Prime_number#The_number_of_prime_numbers">Wikipedia</a>, or on <a title="The Prime Pages proofs that there are infinitely many primes" href="http://primes.utm.edu/notes/proofs/infinite/">the Prime Pages</a>, for example.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 11: the pigeonhole principle]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/10/25/theorem-11-the-pigeonhole-principle/</link>
<pubDate>Sun, 25 Oct 2009 21:59:22 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/10/25/theorem-11-the-pigeonhole-principle/</guid>
<description><![CDATA[The pigeonhole principle is a fairly simple idea to understand, and is extremely useful &#8212; math]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>The pigeonhole principle is a fairly simple idea to understand, and is extremely useful &#8212; mathematicians use it all the time.  Despite that, it seems not to be mentioned in most schools (in the UK).  This week, I&#8217;d like to tell you about it, and to give a nice application.  My understanding is that the pigeonhole principle was first stated by <a title="MacTutor biography of Dirichlet" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Dirichlet.html">Dirichlet</a> (although he didn&#8217;t call it that!), who used it to prove a result about <a title="Wikipedia page about Diophantine approximation" href="http://en.wikipedia.org/wiki/Diophantine_approximation">Diophantine approximation</a> &#8212; this is the application that I&#8217;d like to describe here.  The pigeonhole principle is sometimes called Dirichlet&#8217;s box principle.</p>
<p><!--more--></p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (the pigeonhole principle)</span></strong> If one puts <img src='http://l.wordpress.com/latex.php?latex=n%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n+1' title='n+1' class='latex' /> pigeons into <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> pigeonholes, then at least one pigeonhole will contain at least two pigeons.</p>
<p>More generally, if one puts <img src='http://l.wordpress.com/latex.php?latex=kn%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='kn+1' title='kn+1' class='latex' /> pigeons into <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> pigeonholes, then at least one pigeonhole will contain at least <img src='http://l.wordpress.com/latex.php?latex=k%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k+1' title='k+1' class='latex' /> pigeons.</p></blockquote>
<p>It&#8217;s important to note what the theorem does <em>not</em> say.  It might be that just one pigeonhole contains all the pigeons &#8212; they don&#8217;t have to be evenly spread out.  It might be that several pigeonholes contain multiple pigeons.  All the theorem says is that <em>at least one</em> pigeonhole contains <em>at least two</em> (or, in the general case, <img src='http://l.wordpress.com/latex.php?latex=k%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k+1' title='k+1' class='latex' />) pigeons.</p>
<p>I hope that this statement seems fairly obvious to you.  Its power is not that it is a deep and profound result in itself, but rather that it has many applications.  Nonetheless, one still needs to prove it.  Fortunately, that&#8217;s not too difficult &#8212; the hardest thing is possibly accepting that it does indeed need proof!  If you ask yourself <em>why</em> the pigeonhole principle is true, I think that you are very likely to hit upon a proof for yourself.</p>
<p>Indeed, how could it possibly not be true?  Suppose that each pigeonhole contains at most one pigeon (so either it contains one pigeon, or it is empty).  Then there are at most <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> pigeons in total.  So if we have <img src='http://l.wordpress.com/latex.php?latex=n%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n+1' title='n+1' class='latex' /> pigeons, then at least one pigeonhole contains at least two pigeons.</p>
<p>A very similar argument shows the more general result about <img src='http://l.wordpress.com/latex.php?latex=kn%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='kn+1' title='kn+1' class='latex' /> pigeons.</p>
<p>Sometimes it&#8217;s actually easier to use the idea of the proof than to quote the result.  It&#8217;s all the same idea, though.</p>
<p>As I mentioned above, Dirichlet used this principle to prove a result about approximating <a title="Glossary entry for irrational numbers" href="http://theoremoftheweek.wordpress.com/glossary/#Irrational_numbers">irrational numbers</a> by <a title="Glossary entry for rational numbers" href="http://theoremoftheweek.wordpress.com/glossary/#Rational_numbers">rationals</a>.  <a title="The square root of 2 is irrational" href="http://theoremoftheweek.wordpress.com/2009/08/24/theorem-5-root-2-is-irrational/">The square root of 2 is irrational</a>, but is it close to a rational?  That&#8217;s a rather vague question.  We can approximate <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> arbitrarily well by rationals: just take the decimal expansion of <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> and chop it off at some point (ignore all the digits to the right of that point).  That gives a rational (it has a finite decimal expansion), and we can make the error as small as we like by choosing the point at which we chop the decimal.  But in order to get a good approximation using this method, we need a rational with large denominator.</p>
<p>So here&#8217;s a more refined version of the question.  Can we approximate <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> pretty well by a rational with a pretty small denominator?  What if we replace <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> by another irrational?  There&#8217;s a whole area of research on this sort of question, and lots of very interesting results.  Dirichlet&#8217;s theorem is a relatively simple one, and that&#8217;s all that I&#8217;ll mention today.  (I hope to come back to the subject of approximating irrationals by rationals in a future theorem of the week.)</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (Dirichlet)</span></strong> Let <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> be an irrational number, and let <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' /> be a positive integer.  Then there is a rational <img src='http://l.wordpress.com/latex.php?latex=p%2Fq&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p/q' title='p/q' class='latex' /> such that the denominator <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q' title='q' class='latex' /> is between 1 and <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' /> and such that <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3B%5Calpha+-+%5Cfrac%7Bp%7D%7Bq%7D%26%23124%3B+%5Cleq+%5Cfrac%7B1%7D%7BqN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;\alpha - \frac{p}{q}&#124; \leq \frac{1}{qN}' title='&#124;\alpha - \frac{p}{q}&#124; \leq \frac{1}{qN}' class='latex' />.</p></blockquote>
<p>Here&#8217;s the proof.  As you will by now have gathered, the plan is to use the pigeonhole principle.  Another way of expressing the conclusion <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3B%5Calpha+-+%5Cfrac%7Bp%7D%7Bq%7D%26%23124%3B+%5Cleq+%5Cfrac%7B1%7D%7BqN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;\alpha - \frac{p}{q}&#124; \leq \frac{1}{qN}' title='&#124;\alpha - \frac{p}{q}&#124; \leq \frac{1}{qN}' class='latex' /> is to say that <img src='http://l.wordpress.com/latex.php?latex=q%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q\alpha' title='q\alpha' class='latex' /> is within <img src='http://l.wordpress.com/latex.php?latex=1%2FN&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/N' title='1/N' class='latex' /> of an integer (and we then choose <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> to be that integer).  So we&#8217;d like to show that there&#8217;s some <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q' title='q' class='latex' />, with <img src='http://l.wordpress.com/latex.php?latex=1%5Cleq+q%5Cleq+N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1\leq q\leq N' title='1\leq q\leq N' class='latex' />, such that <img src='http://l.wordpress.com/latex.php?latex=q%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q\alpha' title='q\alpha' class='latex' /> is within <img src='http://l.wordpress.com/latex.php?latex=1%2FN&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/N' title='1/N' class='latex' /> of an integer.  We need some pigeonholes and some pigeons.</p>
<p>We&#8217;re going to use the interval from 0 to 1, including 0 but not 1.  Mathematicians have notation for this interval: we write <img src='http://l.wordpress.com/latex.php?latex=%5B0%2C1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[0,1)' title='[0,1)' class='latex' /> for the set of all real numbers less than 1 and at least 0.  Split this interval into <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' /> subintervals, each of length <img src='http://l.wordpress.com/latex.php?latex=1%2FN&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/N' title='1/N' class='latex' />: <img src='http://l.wordpress.com/latex.php?latex=%5B0%2C+%5Cfrac%7B1%7D%7BN%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[0, \frac{1}{N})' title='[0, \frac{1}{N})' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5B%5Cfrac%7B1%7D%7BN%7D%2C+%5Cfrac%7B2%7D%7BN%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[\frac{1}{N}, \frac{2}{N})' title='[\frac{1}{N}, \frac{2}{N})' class='latex' />, &#8230;, <img src='http://l.wordpress.com/latex.php?latex=%5B%5Cfrac%7BN-1%7D%7BN%7D%2C1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[\frac{N-1}{N},1)' title='[\frac{N-1}{N},1)' class='latex' />.  These subintervals will be our pigeonholes.</p>
<p>We&#8217;re going to use the <em>fractional parts </em>of some numbers.  The fractional part of a number <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> is &#8220;the bit after the decimal point&#8221;, so it always lies in the interval <img src='http://l.wordpress.com/latex.php?latex=%5B0%2C1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[0,1)' title='[0,1)' class='latex' />.  We write <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bx+%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{x \}' title='\{x \}' class='latex' /> for the fractional part of <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />.  Our pigeons will be the <img src='http://l.wordpress.com/latex.php?latex=N%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N+1' title='N+1' class='latex' /> numbers <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5Calpha%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{\alpha\}' title='\{\alpha\}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5C%7B2%5Calpha%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{2\alpha\}' title='\{2\alpha\}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5C%7B3%5Calpha%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{3\alpha\}' title='\{3\alpha\}' class='latex' />, &#8230;, <img src='http://l.wordpress.com/latex.php?latex=%5C%7BN%5Calpha%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{N\alpha\}' title='\{N\alpha\}' class='latex' />.</p>
<p>Now the pigeonhole principle tells us that two of these numbers, say <img src='http://l.wordpress.com/latex.php?latex=%5C%7Br%5Calpha%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{r\alpha\}' title='\{r\alpha\}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bs%5Calpha%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{s\alpha\}' title='\{s\alpha\}' class='latex' /> (with <img src='http://l.wordpress.com/latex.php?latex=r+%26%2360%3B+s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r &lt; s' title='r &lt; s' class='latex' />), lie within the same subinterval.  That immediately means that <img src='http://l.wordpress.com/latex.php?latex=%28s-r%29%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(s-r)\alpha' title='(s-r)\alpha' class='latex' /> is within <img src='http://l.wordpress.com/latex.php?latex=1%2FN&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/N' title='1/N' class='latex' /> of an integer.  Now define <img src='http://l.wordpress.com/latex.php?latex=q%3Ds-r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q=s-r' title='q=s-r' class='latex' />.  Then certainly <img src='http://l.wordpress.com/latex.php?latex=1%5Cleq+q%5Cleq+N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1\leq q\leq N' title='1\leq q\leq N' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=q%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q\alpha' title='q\alpha' class='latex' /> is within <img src='http://l.wordpress.com/latex.php?latex=1%2FN&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/N' title='1/N' class='latex' /> of an integer, as we wanted.</p>
<p>As I say, there are more results along these lines, and they are more precise about how well various irrational numbers can be approximated by rationals.  (For example, <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> is actually quite hard to approximate by rationals with small denominators.)  But they&#8217;ll have to wait for a future post!</p>
<p>There are some more applications of the pigeonhole principle on the <a title="Wikipedia page on the pigeonhole principle" href="http://en.wikipedia.org/wiki/Pigeonhole_principle">Wikipedia page</a>.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 10: Lagrange's theorem in group theory]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/10/18/theorem-10-lagranges-theorem-in-group-theory/</link>
<pubDate>Sun, 18 Oct 2009 13:49:31 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/10/18/theorem-10-lagranges-theorem-in-group-theory/</guid>
<description><![CDATA[The term has just started, and so I have been contemplating the exciting mathematics in store for th]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>The term has just started, and so I have been contemplating the exciting mathematics in store for the new first years.  This week I thought I&#8217;d tell you about one such result.</p>
<p>New students often find that some of the mathematics they meet at university is more abstract than the mathematics they studied at school.  Abstraction can be an extremely useful tool.  Broadly speaking, here&#8217;s what happens.  Mathematicians notice several examples of the same behaviour.  They want to explore why: what is it about those situations that leads to the same behaviour?  They try to write down a list of those key properties.  They then define a new object: it&#8217;s anything that has those properties.  (So all of the initial situations should be examples of this new kind of object.)  That should hopefully lead to a better understanding of what&#8217;s going on.  Moreover, if mathematicians can prove something about this object using only the knowledge that the object has those key properties, then they have proved a result about <em>every</em> example of that object &#8212; all in one go!</p>
<p>One good example of this is the notion of a group.  I&#8217;m not going to go through the definition of a group in great detail here, so if you haven&#8217;t come across the concept before then you might like to read <a title="NRICH article on Grouping Transformations" href="http://nrich.maths.org/public/viewer.php?obj_id=5336">this article</a> on NRICH.  Here, I&#8217;ll remind you of the definition of a group and give a few examples.</p>
<p><!--more--></p>
<p>A <em>group</em> is a set <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> and a binary operation <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet' title='\bullet' class='latex' /> that satisfy the group axioms.  So <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet' title='\bullet' class='latex' /> is a map that takes two elements of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> as input, and gives one element of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> as output.  There are three group axioms.</p>
<ol>
<li>The operation <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet' title='\bullet' class='latex' /> is associative.  That is, <img src='http://l.wordpress.com/latex.php?latex=a+%5Cbullet+%28b%5Cbullet+c%29+%3D+%28a+%5Cbullet+b%29+%5Cbullet+c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a \bullet (b\bullet c) = (a \bullet b) \bullet c' title='a \bullet (b\bullet c) = (a \bullet b) \bullet c' class='latex' /> for any <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c' title='c' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />.  (So one can write <img src='http://l.wordpress.com/latex.php?latex=a+%5Cbullet+b+%5Cbullet+c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a \bullet b \bullet c' title='a \bullet b \bullet c' class='latex' /> without any ambiguity.)</li>
<li>There is an identity element in <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />.  That is, there is some <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e' title='e' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> so that <img src='http://l.wordpress.com/latex.php?latex=a+%5Cbullet+e+%3D+a+%3D+e+%5Cbullet+a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a \bullet e = a = e \bullet a' title='a \bullet e = a = e \bullet a' class='latex' /> for any <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />.  (&#8220;Combining with <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='e' title='e' class='latex' /> has no effect.&#8221;)</li>
<li>Each element of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> has an inverse in <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />.  That is, for each <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> there is some element <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> so that <img src='http://l.wordpress.com/latex.php?latex=a+%5Cbullet+b+%3D+e+%3D+b+%5Cbullet+a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a \bullet b = e = b \bullet a' title='a \bullet b = e = b \bullet a' class='latex' />.  (The inverse of <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> is often written as <img src='http://l.wordpress.com/latex.php?latex=a%5E%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^{-1}' title='a^{-1}' class='latex' /> &#8212; this should not be confused with the usual reciprocal <img src='http://l.wordpress.com/latex.php?latex=1%2Fa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/a' title='1/a' class='latex' />!)</li>
</ol>
<p>Here are some examples of groups.</p>
<ol>
<li>The <a title="Glossary entry for integers" href="http://theoremoftheweek.wordpress.com/glossary/#Integers">integers</a> under addition.  (The identity is <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' />, and the inverse of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=-n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-n' title='-n' class='latex' />.)</li>
<li>The positive <a title="Glossary entry for rational numbers" href="http://theoremoftheweek.wordpress.com/glossary/#Rational_numbers">rational numbers</a> under multiplication.  (The identity is <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />, and the inverse of <img src='http://l.wordpress.com/latex.php?latex=a%2Fb&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a/b' title='a/b' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=1%2F%28a%2Fb%29+%3D+b%2Fa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/(a/b) = b/a' title='1/(a/b) = b/a' class='latex' />.)</li>
<li>The symmetries of a square, illustrated on <a title="Wikipedia illustration of the symmetries of a square" href="http://en.wikipedia.org/wiki/Group_(mathematics)#Second_example:_a_symmetry_group">Wikipedia</a>.  The operation is composition of transformations (do one and then the other).  This is not an <a title="Wikipedia page about Abelian groups" href="http://en.wikipedia.org/wiki/Abelian_group">Abelian group</a>: the order of the transformations matters.</li>
<li>The non-zero remainders modulo a prime <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />, under multiplication (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />).  (The identity is <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />.  The existence of inverses is not totally obvious, but is not too hard to prove.  Here&#8217;s a quick description of a proof.  Take a non-zero remainder (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />), say <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />.  Then <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> are coprime (why?), so <a title="Bezout's theorem" href="http://theoremoftheweek.wordpress.com/2009/07/27/theorem-1-bezouts-theorem/">Bézout&#8217;s theorem</a> says that there are integers <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> so that <img src='http://l.wordpress.com/latex.php?latex=ah%2Bpk%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='ah+pk=1' title='ah+pk=1' class='latex' />.  Now <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> is an inverse of <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />) (why?).  There a full proof in this <a title="NRICH article on Modular Arithmetic" href="http://nrich.maths.org/public/viewer.php?obj_id=4350">NRICH article</a>.)</li>
</ol>
<p>This week, I&#8217;d like to give you an example of a theorem about groups, and then show how it immediately gives a theorem about a specific example of a group (example 4 in the list above) that is an interesting theorem in its own right.  I need to define one more concept before I can state this week&#8217;s theorem.</p>
<p>A <em>subgroup</em> <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> of the group <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> is a subset of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> that is also a group.  What does that mean?  Combining two elements of <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> gives an element of <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' />.  The operation is automatically associative, but one has to check that <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> contains the identity, and that if <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> is in <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> then so is the inverse of <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />.  It turns out that to check whether <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> is a subgroup, it is enough to check that for any <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> the element <img src='http://l.wordpress.com/latex.php?latex=a+%5Cbullet+b%5E%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a \bullet b^{-1}' title='a \bullet b^{-1}' class='latex' /> is also in <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' />.  You might like to think about why that check suffices.</p>
<p>Here are some examples of subgroups of those groups I mentioned earlier.</p>
<ol>
<li><img src='http://l.wordpress.com/latex.php?latex=G+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G =' title='G =' class='latex' /> integers, <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet =' title='\bullet =' class='latex' /> addition.  The even numbers form a subgroup, but the odd numbers do not (because the set of odd numbers does not contain the identity, <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' />).</li>
<li><img src='http://l.wordpress.com/latex.php?latex=G+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G =' title='G =' class='latex' /> positive rationals, <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet =' title='\bullet =' class='latex' /> multiplication.  The powers of <img src='http://l.wordpress.com/latex.php?latex=3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3' title='3' class='latex' /> (<img src='http://l.wordpress.com/latex.php?latex=3%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^n' title='3^n' class='latex' /> for any integer <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />) form a subgroup.  The identity is <img src='http://l.wordpress.com/latex.php?latex=1+%3D+3%5E0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1 = 3^0' title='1 = 3^0' class='latex' />, and the inverse of <img src='http://l.wordpress.com/latex.php?latex=3%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^n' title='3^n' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=3%5E%7B-n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^{-n}' title='3^{-n}' class='latex' />, which is also in the set.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=G+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G =' title='G =' class='latex' /> symmetries of a square, <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet =' title='\bullet =' class='latex' /> composition of transformations.  The rotations (including the identity) form a subgroup.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=G+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G =' title='G =' class='latex' /> non-zero remainders modulo a prime <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet =' title='\bullet =' class='latex' /> multiplication (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />).  For example, when <img src='http://l.wordpress.com/latex.php?latex=p%3D7&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p=7' title='p=7' class='latex' />, the non-zero remainders are <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3' title='3' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4' title='4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5' title='5' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6' title='6' class='latex' />.  The set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C+2%2C+4%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{1, 2, 4\}' title='\{1, 2, 4\}' class='latex' /> is a subgroup (check this for yourself!).</li>
</ol>
<p>This week&#8217;s theorem tells us about the size (the <em>order</em>, to use the technical term) of a subgroup of a finite group.  (Examples 1 and 2 above are infinite groups, examples 3 and 4 are finite.)  It&#8217;s called <a title="MacTutor biography of Lagrange" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Lagrange.html">Lagrange</a>&#8217;s theorem, but that doesn&#8217;t identify it uniquely (hence the title of this post)!</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (Lagrange)</span></strong> The order of a subgroup of a finite group divides the order of the group.</p></blockquote>
<p>Let&#8217;s check this on the examples above.</p>
<p>3. <img src='http://l.wordpress.com/latex.php?latex=G+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G =' title='G =' class='latex' /> symmetries of a square, <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet =' title='\bullet =' class='latex' /> composition of transformations, <img src='http://l.wordpress.com/latex.php?latex=H+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H =' title='H =' class='latex' /> {rotations}.  There are 8 symmetries, so <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BG%26%23124%3B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;G&#124;' title='&#124;G&#124;' class='latex' /> (the order of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />) is 8.  There are 4 rotations, so <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BH%26%23124%3B+%3D+4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;H&#124; = 4' title='&#124;H&#124; = 4' class='latex' />.  And 4 does indeed divide 8.</p>
<p>4. <img src='http://l.wordpress.com/latex.php?latex=G+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G =' title='G =' class='latex' /> non-zero remainders (mod 7), <img src='http://l.wordpress.com/latex.php?latex=%5Cbullet+%3D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\bullet =' title='\bullet =' class='latex' /> multiplication, <img src='http://l.wordpress.com/latex.php?latex=H%3D%5C%7B1%2C2%2C4%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H=\{1,2,4\}' title='H=\{1,2,4\}' class='latex' />.  We have <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BG%26%23124%3B%3D6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;G&#124;=6' title='&#124;G&#124;=6' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BH%26%23124%3B%3D3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;H&#124;=3' title='&#124;H&#124;=3' class='latex' />, and 3 does divide 6.</p>
<p>Phew &#8212; it works in these two cases!  (But that&#8217;s hardly a proof&#8230;)</p>
<p>Note that Lagrange&#8217;s theorem does <em>not</em> say that if <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> divides the order of a group <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> has a subgroup of order <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />.  In general, that isn&#8217;t true.</p>
<p>So, how does one go about proving Lagrange&#8217;s theorem?  Here&#8217;s one approach.</p>
<p>We have a finite group <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />, and a subgroup <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> of that group, and we want to show that <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BH%26%23124%3B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;H&#124;' title='&#124;H&#124;' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BG%26%23124%3B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;G&#124;' title='&#124;G&#124;' class='latex' />.</p>
<p>The plan is to show that we can <em>partition</em> the elements of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> into several subsets, each of size <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' />.  (The word &#8220;partition&#8221; indicates that the subsets don&#8217;t overlap, but between them they include every element of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />.  So each element of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> lies in exactly one of the subsets forming the partition.)  That will immediately show that <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BH%26%23124%3B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;H&#124;' title='&#124;H&#124;' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BG%26%23124%3B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;G&#124;' title='&#124;G&#124;' class='latex' />.</p>
<p>So how can we do that?  I think that it seems natural to try to find some way of breaking <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> into subsets that are somehow &#8220;copies&#8221; of <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' />, since a &#8220;copy&#8221; of <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> will certainly have the same size as <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' />.  It turns out that one can do exactly that.  The &#8220;copies&#8221; are called <em>cosets</em>.  The coset <img src='http://l.wordpress.com/latex.php?latex=aH&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='aH' title='aH' class='latex' /> consists of elements of the form <img src='http://l.wordpress.com/latex.php?latex=ah&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='ah' title='ah' class='latex' />, with <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> an element of <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' />.  (Strictly speaking, this is a <em>left coset</em>; <img src='http://l.wordpress.com/latex.php?latex=aH&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='aH' title='aH' class='latex' /> is not, in general, the same as <img src='http://l.wordpress.com/latex.php?latex=Ha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Ha' title='Ha' class='latex' />.  But since I&#8217;m only using left cosets in this post, I&#8217;ll continue to call them cosets.)</p>
<p>We have lots of cosets <img src='http://l.wordpress.com/latex.php?latex=aH&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='aH' title='aH' class='latex' />, one for each element <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />.  But they might overlap, or even be the same.  Let&#8217;s see what happens with one of our examples from earlier: <img src='http://l.wordpress.com/latex.php?latex=G%3D%5C%7B1%2C2%2C3%2C4%2C5%2C6%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G=\{1,2,3,4,5,6\}' title='G=\{1,2,3,4,5,6\}' class='latex' /> under multiplication (mod <img src='http://l.wordpress.com/latex.php?latex=7&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='7' title='7' class='latex' />), and <img src='http://l.wordpress.com/latex.php?latex=H%3D%5C%7B1%2C2%2C4%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H=\{1,2,4\}' title='H=\{1,2,4\}' class='latex' />.  We have six cosets.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=1H%3D%5C%7B1%2C2%2C4%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1H=\{1,2,4\}' title='1H=\{1,2,4\}' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=2H%3D%5C%7B1%2C2%2C4%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2H=\{1,2,4\}' title='2H=\{1,2,4\}' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=3H%3D%5C%7B3%2C5%2C6%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3H=\{3,5,6\}' title='3H=\{3,5,6\}' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=4H%3D%5C%7B1%2C2%2C4%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4H=\{1,2,4\}' title='4H=\{1,2,4\}' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=5H%3D%5C%7B3%2C5%2C6%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5H=\{3,5,6\}' title='5H=\{3,5,6\}' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=6H%3D%5C%7B3%2C5%2C6%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6H=\{3,5,6\}' title='6H=\{3,5,6\}' class='latex' />.</p>
<p>It seems that either two cosets are the same, or they are disjoint (they don&#8217;t overlap at all).  That&#8217;s the key.  We choose one copy of each coset (so we might choose <img src='http://l.wordpress.com/latex.php?latex=2H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2H' title='2H' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=5H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5H' title='5H' class='latex' />, say), and those give the partition we want.  Now there is still some checking to do in order to have a proof of the general result.  One has to show that it is indeed the case that either two cosets are the same or they are disjoint, and one then has to show that picking one copy of each coset does indeed give a partition of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' />.  This is not hard, but this post is getting rather long so I think that I&#8217;ll skip it.</p>
<p>I said that I&#8217;d show that Lagrange&#8217;s theorem gives another interesting theorem as a <a title="Wikipedia page about corollary" href="http://en.wikipedia.org/wiki/Corollary">corollary</a>, so let me tell you about that now.  The theorem is called <a title="MacTutor biography of Fermat" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Fermat.html">Fermat</a>&#8217;s little theorem.  There&#8217;s a lot to say about it, so I plan for it to be a future Theorem of the week in its own right.  For now, I&#8217;ll just introduce it fairly briefly.</p>
<p>In that NRICH article introducing the idea of a group (link at the start of this post), there&#8217;s a proof that every element of a finite group has some power that&#8217;s equal to the identity.  Let&#8217;s test that for our favourite group of non-zero remainders (mod 7), under multiplication.  I&#8217;ll record the first few powers of each element.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=1%5E1%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1^1=1' title='1^1=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=1%5E2%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1^2=1' title='1^2=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=1%5E3%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1^3=1' title='1^3=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=1%5E4%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1^4=1' title='1^4=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=1%5E5%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1^5=1' title='1^5=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=1%5E6%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1^6=1' title='1^6=1' class='latex' />.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=2%5E1%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^1=2' title='2^1=2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=2%5E2%3D4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^2=4' title='2^2=4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=2%5E3%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^3=1' title='2^3=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=2%5E4%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^4=2' title='2^4=2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=2%5E5%3D4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^5=4' title='2^5=4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=2%5E6%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^6=1' title='2^6=1' class='latex' />.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=3%5E1%3D3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^1=3' title='3^1=3' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=3%5E2%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^2=2' title='3^2=2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=3%5E3%3D6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^3=6' title='3^3=6' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=3%5E4%3D4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^4=4' title='3^4=4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=3%5E5%3D5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^5=5' title='3^5=5' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=3%5E6%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3^6=1' title='3^6=1' class='latex' />.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=4%5E1%3D4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4^1=4' title='4^1=4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=4%5E2%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4^2=2' title='4^2=2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=4%5E3%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4^3=1' title='4^3=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=4%5E4%3D4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4^4=4' title='4^4=4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=4%5E5%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4^5=2' title='4^5=2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=4%5E6%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4^6=1' title='4^6=1' class='latex' />.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=5%5E1%3D5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5^1=5' title='5^1=5' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=5%5E2%3D4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5^2=4' title='5^2=4' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=5%5E3%3D6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5^3=6' title='5^3=6' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=5%5E4%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5^4=2' title='5^4=2' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=5%5E5%3D3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5^5=3' title='5^5=3' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=5%5E6%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5^6=1' title='5^6=1' class='latex' />.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=6%5E1%3D6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6^1=6' title='6^1=6' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=6%5E2%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6^2=1' title='6^2=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=6%5E3%3D6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6^3=6' title='6^3=6' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=6%5E4%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6^4=1' title='6^4=1' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=6%5E5%3D6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6^5=6' title='6^5=6' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=6%5E6%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='6^6=1' title='6^6=1' class='latex' />.</p>
<p>There are all sorts of interesting things to notice there, and I encourage you to think about them.  For now, the thing I really want you to notice is that the sixth power of each element is <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />.  Some elements have a smaller power that is also <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> (e.g. <img src='http://l.wordpress.com/latex.php?latex=2%5E3%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2^3=1' title='2^3=1' class='latex' />), some don&#8217;t (e.g. <img src='http://l.wordpress.com/latex.php?latex=3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3' title='3' class='latex' />), but they all agree on the sixth powers.  Fermat&#8217;s little theorem tells us that this is an example of a much more general phenomenon.</p>
<p><strong><span style="text-decoration:underline;">Theorem (Fermat)</span></strong> Let <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' /> be a prime, and let <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> be an integer not divisible by <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />.  Then <img src='http://l.wordpress.com/latex.php?latex=a%5E%7Bp-1%7D%5Cequiv+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^{p-1}\equiv 1' title='a^{p-1}\equiv 1' class='latex' /> (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />).</p>
<p>This theorem is sometimes stated in a slightly different way: <img src='http://l.wordpress.com/latex.php?latex=a%5Ep%5Cequiv+a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^p\equiv a' title='a^p\equiv a' class='latex' /> (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />) for all <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> (not just <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> divisible by <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />).  You might like to think about why the two statements are equivalent (it&#8217;s not hard).</p>
<p>How can we prove this using Lagrange&#8217;s theorem?  Well, we have a group <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G' title='G' class='latex' /> consisting of the non-zero remainders (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />) under multiplication, and <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BG%26%23124%3B%3Dp-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;G&#124;=p-1' title='&#124;G&#124;=p-1' class='latex' />. </p>
<p>We need a subgroup.  Our element <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> <em>generates</em> a subgroup: we take <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' /> to be the powers of <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' />.  (For example, if <img src='http://l.wordpress.com/latex.php?latex=p%3D7&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p=7' title='p=7' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=a%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a=2' title='a=2' class='latex' />, we&#8217;d take <img src='http://l.wordpress.com/latex.php?latex=H%3D%5C%7B1%2C2%2C4%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H=\{1,2,4\}' title='H=\{1,2,4\}' class='latex' />; if <img src='http://l.wordpress.com/latex.php?latex=p%3D7&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p=7' title='p=7' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=a%3D3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a=3' title='a=3' class='latex' />, we&#8217;d take <img src='http://l.wordpress.com/latex.php?latex=H%3D%5C%7B1%2C2%2C3%2C4%2C5%2C6%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H=\{1,2,3,4,5,6\}' title='H=\{1,2,3,4,5,6\}' class='latex' />.)</p>
<p>How big is <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H' title='H' class='latex' />?  Well, we have <img src='http://l.wordpress.com/latex.php?latex=H%3D%5C%7Ba%2Ca%5E2%2Ca%5E3%2C%5Cdotsc%2Ca%5Ek%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='H=\{a,a^2,a^3,\dotsc,a^k\}' title='H=\{a,a^2,a^3,\dotsc,a^k\}' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> is the smallest positive number such that <img src='http://l.wordpress.com/latex.php?latex=a%5Ek%5Cequiv+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^k\equiv 1' title='a^k\equiv 1' class='latex' /> (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />), so <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BH%26%23124%3B%3Dk&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;H&#124;=k' title='&#124;H&#124;=k' class='latex' />.</p>
<p>Now Lagrange&#8217;s theorem tells us that <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BH%26%23124%3B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;H&#124;' title='&#124;H&#124;' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3BG%26%23124%3B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;G&#124;' title='&#124;G&#124;' class='latex' />.  That is, it says that <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> divides <img src='http://l.wordpress.com/latex.php?latex=p-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p-1' title='p-1' class='latex' />.  Say <img src='http://l.wordpress.com/latex.php?latex=p-1%3Dkm&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p-1=km' title='p-1=km' class='latex' />.  Now we can calculate <img src='http://l.wordpress.com/latex.php?latex=a%5E%7Bp-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^{p-1}' title='a^{p-1}' class='latex' />.  We have <img src='http://l.wordpress.com/latex.php?latex=a%5E%7Bp-1%7D%3Da%5E%7Bkm%7D%3D%28a%5Ek%29%5Em%5Cequiv+1%5Em%5Cequiv+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^{p-1}=a^{km}=(a^k)^m\equiv 1^m\equiv 1' title='a^{p-1}=a^{km}=(a^k)^m\equiv 1^m\equiv 1' class='latex' /> (mod <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />) &#8212; just as we wanted!</p>
<p>Fermat&#8217;s little theorem and its generalisation to moduli that aren&#8217;t prime, the Fermat-Euler theorem, are, I think, very elegant in their own right.  They also have important applications, for example to <a title="Wikipedia page on cryptography" href="http://en.wikipedia.org/wiki/Cryptography">cryptography</a> (the <a title="Wikipedia page on the RSA algorithm" href="http://en.wikipedia.org/wiki/RSA">RSA algorithm</a> depends on them).  I&#8217;ll say more about this when they are the main topic of a post.  I&#8217;ll also give some other proofs.</p>
<p>Wikipedia has a page about <a title="Wikipedia page about Lagrange's theorem" href="http://en.wikipedia.org/wiki/Lagrange%27s_theorem_(group_theory)">Lagrange&#8217;s theorem</a>.  The theorem will be covered in any decent first introduction to group theory.  Similarly, Fermat&#8217;s little theorem will be in most introductions to number theory.  There&#8217;s a section about it in <a title="NRICH article on Introductory Number Theory" href="http://nrich.maths.org/public/viewer.php?obj_id=4352">this NRICH article</a>.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Lank hair]]></title>
<link>http://deadlyjelly.wordpress.com/2009/10/12/lank-hair/</link>
<pubDate>Mon, 12 Oct 2009 10:03:17 +0000</pubDate>
<dc:creator>deadlyjelly</dc:creator>
<guid>http://deadlyjelly.wordpress.com/2009/10/12/lank-hair/</guid>
<description><![CDATA[And then there was Proof, during which I was mentally violated by Gwyneth Paltrow. To avoid charges ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>And then there was Proof, during which I was mentally violated by Gwyneth Paltrow.</p>
<p><img class="alignnone size-full wp-image-2144" title="Proof" src="http://deadlyjelly.wordpress.com/files/2009/10/proof.jpg" alt="Proof" width="292" height="389" /></p>
<p>To avoid charges of bias, I will fess up right here. I mean, you can still charge me with bias, but since I will have admitted it, there seems little point, hmm?</p>
<p>Here is the basis of my bias: Gwyneth annoys the crap out of me. There is nobody else whose hair I want to pull more. And Baby CHEESES will someone ever give the woman a carb? She is critically in need of a feed of rice pudding. You can actually see her wasting away on screen.</p>
<p>Gwyneth evidently attended the Tom Cruise School of Acting, whereby as long as you have a minimum freaky charisma allied with plasticity of skin, you can employ one, single gesture to cover all human emotion and pass it off as acting. (In Tom Cruise’s case, this is of course the double-hand point. In Gwyneth’s, a bewildered frown with eyes crossed.)</p>
<p>Director: Cut, cut, CUT! Goddamit! Babydoll, get your sweet ass over here!</p>
<p>Gwyneth: I presume one is referring to me.</p>
<p>Director: Who? Yeah, whatever. Gwynnie honey. Since your features are so pale and indistinct, we need you to – what’s it called again? – oh yeah. We need you to ACT more.</p>
<p>Gwyneth: I can act you know. I won an Oscar for Shakespeare in Love.</p>
<p>Director: What’s that? You might want to speak up while you’re at it. I want you to EMOTE. I want to see the spit, I wanna FEEL the sweat. Set to it, there’s a good girl.</p>
<p>Director (aside to sub director): Stupid bitch. Needs a good feed of rice pudding.</p>
<p>I have no idea how she won an Oscar, although I can only conclude she must have donated many varied and nasty sexual favours to the entire Academy.</p>
<p>Now that’s out of the way: in Proof, Gwyneth plays the petulant, whiny, lank-haired daughter of a brilliant mathematician. She gives up her own studies when dad goes woopdewoohoo and then misplaces his mortal coil. Then her father’s student finds a notebook containing a mind-blowing proof to some theorem, which Gwyneth claims she wrote.</p>
<p>Honestly, it’s a stretch believing Gwyneth is capable of forming coherent sentences, never mind that she is a mathematical genius.</p>
<p>Gwyneth’s character is consumed by the likelihood of following her father into insanity. She appears to be crazy because she fears going crazy. Which is undoubtedly tragic but no more than, say, genocide &#8211; or any number of other things.</p>
<p>Her sister is supposed to be a harridan, but all I felt was sympathy for her. She turns up the day before their father’s funeral try to persuade Gwyneth to wash her hair – I completely understood where she was coming from – and Gwyneth gets all snotty about hair being dead tissue so Jojoba Oil won’t make any difference. Well, I’ve used Jojoba on my own dead tissue and the stuff is a miracle, so that shows you how much she knows.</p>
<p>Jake Gyllenhaal is her father’s former student, the drummer in a rock band of maths geeks. This time I suspended disbelief – hung it by the neck until dead – and then tried swinging it around and juggling it a while, but it was no good: maths geeks are just not that good looking. Trust me. I studied maths, and we were a dull looking bunch of monobrows. The best bit in the movie is the band’s song ‘i’, comprising three minutes of silence.</p>
<p>Disregarding all that, the movie is worth a watch.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 9: Bachet's duplication formula]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/10/06/theorem-9-bachets-duplication-formula/</link>
<pubDate>Tue, 06 Oct 2009 17:33:41 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/10/06/theorem-9-bachets-duplication-formula/</guid>
<description><![CDATA[This week&#8217;s guest author is James Cooper.  Thanks, James! Bachet&#8217;s Equation and Geometry]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>This week&#8217;s guest author is James Cooper.  Thanks, James!</p>
<h3>Bachet&#8217;s Equation and Geometry</h3>
<p>Today&#8217;s blog entry concentrates on Diophantine Equations &#8212; problems posed in terms of whole numbers, and connected problems over the <a title="Definition of rational numbers" href="http://theoremoftheweek.wordpress.com/2009/08/24/theorem-5-root-2-is-irrational/#Rationals">rationals</a>.  Typically these problems are very easy to understand but difficult to solve.  Their solution often involves leaving the safe world of the <a title="Defintion of integers" href="http://theoremoftheweek.wordpress.com/2009/08/24/theorem-5-root-2-is-irrational/#Integers">integers</a> and using tools and techniques from other areas of mathematics before &#8220;projecting&#8221; the answer back into whole numbers.  The example I&#8217;m going to describe today will use ideas from algebra and geometry.</p>
<p>Fix some integer <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c' title='c' class='latex' />.  What are the rational solutions of the equation</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=y%5E2+-+x%5E3+%3D+c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y^2 - x^3 = c' title='y^2 - x^3 = c' class='latex' />?             <img src='http://l.wordpress.com/latex.php?latex=%28%5Cstar%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\star)' title='(\star)' class='latex' /></p>
<p>By rational solution, I allow <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> to be fractions, but not arbitrary <a title="Informal definition of the real numbers" href="http://theoremoftheweek.wordpress.com/2009/08/24/theorem-5-root-2-is-irrational/#Reals">reals</a>.  So we are asking for the difference between a square and a cube to be a certain fixed integer.  This is known as <a title="Biography of Bachet" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Bachet.html">Bachet</a>&#8217;s equation, and we will see that its Geometric interpretation is the key to generating solutions.</p>
<p><!--more--></p>
<p>An amazing property of this equation is the existence of a so-called <em>Duplication Formula</em>.  Discovered by Bachet in 1621, it states that:</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem</span></strong> If <img src='http://l.wordpress.com/latex.php?latex=%28x%2Cy%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x,y)' title='(x,y)' class='latex' /> is a solution to <img src='http://l.wordpress.com/latex.php?latex=%28%5Cstar%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\star)' title='(\star)' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> rational, then</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%28%5Cfrac%7Bx%5E4-8cx%7D%7B4y%5E2%7D%2C%5Cfrac%7B-x%5E6-20cx%5E3%2B8c%5E2%7D%7B8y%5E3%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\frac{x^4-8cx}{4y^2},\frac{-x^6-20cx^3+8c^2}{8y^3})' title='(\frac{x^4-8cx}{4y^2},\frac{-x^6-20cx^3+8c^2}{8y^3})' class='latex' /></p>
<p> is another solution.</p></blockquote>
<p>That is, if we know one solution then we can build another one, by simply applying the duplication formula.</p>
<p>Notice that verification of this formula is easy, we just plug this expression into Bachet&#8217;s equation <img src='http://l.wordpress.com/latex.php?latex=%28%5Cstar%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\star)' title='(\star)' class='latex' /> and check it reduces down to the original statement that <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> satisfy <img src='http://l.wordpress.com/latex.php?latex=%28%5Cstar%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\star)' title='(\star)' class='latex' />.</p>
<p>But how did anyone come up with this formula?  Critically, Bachet&#8217;s equation is not linear (it contains higher powers of variables &#8212; it is not expressible as a combination of <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />s and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' />s &#8212; geometrically it does not correspond to a straight line).  In <a title="Bezout's theorem" href="http://theoremoftheweek.wordpress.com/2009/07/27/theorem-1-bezouts-theorem/">Bezout&#8217;s equation</a> (which is linear), if we have one solution <img src='http://l.wordpress.com/latex.php?latex=%28X%2CY%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(X,Y)' title='(X,Y)' class='latex' /> then we can symmetrically tweak the two variables to generate another solution: if <img src='http://l.wordpress.com/latex.php?latex=aX%2BbY+%3D+c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='aX+bY = c' title='aX+bY = c' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=a%28X%2Bb%29%2B+b%28Y-a%29+%3D+c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a(X+b)+ b(Y-a) = c' title='a(X+b)+ b(Y-a) = c' class='latex' /> and so <img src='http://l.wordpress.com/latex.php?latex=%28X%2Bb%2C+Y-a%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(X+b, Y-a)' title='(X+b, Y-a)' class='latex' /> is also a solution.</p>
<p>It seems that Bachet made a guess, an approximation to another solution, and then improved this guess using an iterative method until he reached the duplication formula, but this does not give any intuition on why this formula should work, and purely as a piece of algebra it remains mysterious.</p>
<h3>Example: <img src='http://l.wordpress.com/latex.php?latex=y%5E2-x%5E3+%3D+-2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y^2-x^3 = -2' title='y^2-x^3 = -2' class='latex' /></h3>
<p>Now let&#8217;s work through an example to see how powerful this formula can be.   It&#8217;s not too hard to spot the first solution to this equation: <img src='http://l.wordpress.com/latex.php?latex=%283%2C5%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(3,5)' title='(3,5)' class='latex' /> is a solution (because <img src='http://l.wordpress.com/latex.php?latex=5%5E2+-+3%5E3+%3D+-2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='5^2 - 3^3 = -2' title='5^2 - 3^3 = -2' class='latex' />).  Then, using the duplication formula,</p>
<ul>
<li><img src='http://l.wordpress.com/latex.php?latex=%283%2C5%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(3,5)' title='(3,5)' class='latex' /> is a solution;</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%28%5Cfrac%7B129%7D%7B10%5E2%7D%2C%5Cfrac%7B-38%5E3%7D%7B10%5E3%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\frac{129}{10^2},\frac{-38^3}{10^3})' title='(\frac{129}{10^2},\frac{-38^3}{10^3})' class='latex' /> is a solution;</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%28%5Cfrac%7B2340922881%7D%7B7660%5E2%7D%2C%5Cfrac%7B113259286337292%7D%7B7660%5E3%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\frac{2340922881}{7660^2},\frac{113259286337292}{7660^3})' title='(\frac{2340922881}{7660^2},\frac{113259286337292}{7660^3})' class='latex' /> is a solution;</li>
<li>&#8230;</li>
</ul>
<p>Already, these solutions are getting very complicated, and it&#8217;s unlikely a super-computer would stumble across such a rational solution, even more unlikely would be Bachet spotting these during his work in the 17th Century.</p>
<p>I hope this convinces you that it is really a very powerful formula, but why does it work?  Events later in the 17th century give us a different way to approach the problem which is far more elegant &#8212; the invention by <a title="Biography of Descartes" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Descartes.html">Descartes</a> (or at least the user-friendly notation) of coordinate geometry.</p>
<p>We are all happy with how Cartesian Geometry allows geometric problems to be solved algebraically &#8212; we have been doing this since we used Pythagoras to calculate distances between points in the plane, but of interest here is how algebraic problems can benefit from Geometric intuition.</p>
<p>For real <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' />, Bachet&#8217;s Equation, <img src='http://l.wordpress.com/latex.php?latex=%28%5Cstar%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\star)' title='(\star)' class='latex' />, corresponds to a curve in the plane, and rational solutions correspond to the points where both coordinates are rational.</p>
<p>We now make a simple construction which explains the duplication formula:</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-300" title="Bachet's equation" src="http://theoremoftheweek.wordpress.com/files/2009/10/bachet_curve1.jpg" alt="Bachet's equation" width="177" height="124" /></p>
<p>For a given solution <img src='http://l.wordpress.com/latex.php?latex=P+%3D+%28x%2Cy%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P = (x,y)' title='P = (x,y)' class='latex' />, draw the tangent to the curve at <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' />.  For <img src='http://l.wordpress.com/latex.php?latex=y%5Cneq+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y\neq 0' title='y\neq 0' class='latex' />, substituting the equation of the tangent, <img src='http://l.wordpress.com/latex.php?latex=Y+%3D+mX+%3D+c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y = mX = c' title='Y = mX = c' class='latex' /> into the cubic (3rd power) curve gives a cubic equation (#) in <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' />.  This leads to 3 (possibly repeated) solutions for the <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' />-coordinate (and corresponding <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y' title='Y' class='latex' />-coordinate) of the point of intersection.</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=Q%3D%28%5Calpha%2C%5Cbeta%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q=(\alpha,\beta)' title='Q=(\alpha,\beta)' class='latex' />. Since the tangent intersects twice at <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' />, we see that the solutions of (#) are <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' />.</p>
<p><strong><span style="text-decoration:underline;">Question</span></strong> What goes wrong for <img src='http://l.wordpress.com/latex.php?latex=y%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y=0' title='y=0' class='latex' />?</p>
<p><strong><span style="text-decoration:underline;">Claim</span></strong>: <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> is also a rational point, and its coordinates are given by the duplication formula.</p>
<p>Define a curve to be rational if all its coefficients are.  We begin by noting that the gradient of the tangent will be rational, and that it passes through the rational point <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' />.  So the tangent is a rational line, and the point <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> is the intersection of a rational curve and a rational line.</p>
<p><strong>Warning:</strong> this is not enough to guarantee that <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' /> itself is rational.  For example, consider the intersection of the Rational Line <img src='http://l.wordpress.com/latex.php?latex=y%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y=2' title='y=2' class='latex' /> and the Rational Curve <img src='http://l.wordpress.com/latex.php?latex=y%3Dx%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y=x^2' title='y=x^2' class='latex' />.  The intersection is not a point with rational coordinates.</p>
<p>After substituting for <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> in the cubic curve (using the equation of the tangent line) we are left with a <em>rational</em> cubic (#) in the <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />-coordinates corresponding to the three points of intersection.</p>
<p>Thus, using the relation between roots and coefficients of a polynomial (sometimes called <a title="MacTutor biography of Viete" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Viete.html">Viète</a>&#8217;s <a title="Wikipedia page about Viete's formula" href="http://en.wikipedia.org/wiki/Viete%27s_formulas">formula</a>), the sum of the three <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />-coordinates is (up to a sign) a quotient of rational coefficients, so the sum of the three <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />-coordinates of intersection is rational.</p>
<p>But two of these solutions (intersecting <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> twice) are rational, so the third (at <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' />) must be rational also.</p>
<p>Substituting this <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />-coordinate into the rational tangent line gives rational <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' />-coordinate for <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q' title='Q' class='latex' />.</p>
<p>This proves the first part of the claim.</p>
<p>You might like to check that this new point recovers the Duplication formula.  Remember to use the trick summing the three solutions of the cubic.  The alternative, using the two solutions we know to get a quadratic factor which we then factorise out of the cubic, can get messy.  Hints below.</p>
<p>In the language of <a title="Wikipedia page about elliptic curves" href="http://en.wikipedia.org/wiki/Elliptic_curve">elliptic curves</a>, given a rational point <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> we are considering the new rational point <img src='http://l.wordpress.com/latex.php?latex=-2P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-2P' title='-2P' class='latex' />.   This process never repeats itself (and so infinitely many rational points may be generated in this way).  If <img src='http://l.wordpress.com/latex.php?latex=%28-2%29%5Es+P+%3D+%28-2%29%5Et+P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(-2)^s P = (-2)^t P' title='(-2)^s P = (-2)^t P' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s' title='s' class='latex' />, $latex t$, then, subtracting, <img src='http://l.wordpress.com/latex.php?latex=nP%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='nP=0' title='nP=0' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> (where <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='0' title='0' class='latex' /> denotes the <em>point at infinity</em>).  Such <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> are known as <em>torsion elements</em>.  The <a title="Wikipedia page about the Nagell-Lutz theorem" href="http://en.wikipedia.org/wiki/Nagell-lutz_theorem">Nagell-Lutz theorem</a> tells us that torsion points with rational coordinates must then have integral coordinates.  We can now use <a title="Glossary entry for unique factorisation" href="http://theoremoftheweek.wordpress.com/glossary/#Prime_factorisation">unique factorisation</a> of natural numbers to perform &#8220;local calculations&#8221; (considering remainders modulo different primes <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p' title='p' class='latex' />) to show there are no torsion points with integral coordinates on the curve. </p>
<p>We do have torsion amongst points with real coordinates, though.  For <img src='http://l.wordpress.com/latex.php?latex=y%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y=0' title='y=0' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=P%3D%28-%5Csqrt%5B3%5D%7Bc%7D%2C0%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P=(-\sqrt[3]{c},0)' title='P=(-\sqrt[3]{c},0)' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> is of order 2: <img src='http://l.wordpress.com/latex.php?latex=2P%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2P=0' title='2P=0' class='latex' />.  Also, for <img src='http://l.wordpress.com/latex.php?latex=x%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x=0' title='x=0' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=P%3D%280%2C+%5Cpm+%5Csqrt%7Bc%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P=(0, \pm \sqrt{c})' title='P=(0, \pm \sqrt{c})' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> is of order 3: <img src='http://l.wordpress.com/latex.php?latex=3P%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3P=0' title='3P=0' class='latex' />.  Reinterpreting as <img src='http://l.wordpress.com/latex.php?latex=-2P%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-2P=0' title='-2P=0' class='latex' /> (<img src='http://l.wordpress.com/latex.php?latex=-2P+%3D+P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-2P = P' title='-2P = P' class='latex' />) respectively, can you explain these results geometrically?</p>
<p>We have shown that if there is a single rational solution, then in fact there are infinitely many.  But there are other related questions to think about.</p>
<ol>
<li>Are there any rational solutions at all?  Or could the curve meander around all the rational points of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{R}^2' title='\mathbb{R}^2' class='latex' />, as the <a title="MacTutor biography of Fermat" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Fermat.html">Fermat</a> curve <img src='http://l.wordpress.com/latex.php?latex=x%5E3%2By%5E3+%3D+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^3+y^3 = 1' title='x^3+y^3 = 1' class='latex' /> does?  (<a title="Wikipedia page about Fermat's last theorem" href="http://en.wikipedia.org/wiki/Fermat%27s_last_theorem">Fermat&#8217;s last theorem</a> says that the only rational points on the curve <img src='http://l.wordpress.com/latex.php?latex=x%5En+%2B+y%5En+%3D+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^n + y^n = 1' title='x^n + y^n = 1' class='latex' /> are <img src='http://l.wordpress.com/latex.php?latex=%28%5Cpm+1%2C+0%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(\pm 1, 0)' title='(\pm 1, 0)' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%280%2C%5Cpm+1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(0,\pm 1)' title='(0,\pm 1)' class='latex' /> for even <img src='http://l.wordpress.com/latex.php?latex=n+%5Cgeq+4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n \geq 4' title='n \geq 4' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%280%2C1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(0,1)' title='(0,1)' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%281%2C0%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(1,0)' title='(1,0)' class='latex' /> for odd <img src='http://l.wordpress.com/latex.php?latex=n+%5Cgeq+3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n \geq 3' title='n \geq 3' class='latex' />.</li>
<li>What about working over the integers, are there any integer solutions for a given <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c' title='c' class='latex' />?</li>
<li>If there <em>is</em> a solution, are there infinitely many?  (Notice that the duplication formula only returns a rational solution even if you start with integers.)</li>
</ol>
<p>These questions are explored in the excellent book <a title="Rational Points on Elliptic Curves on Google books" href="http://books.google.co.uk/books?id=mAJei2-JcE4C&#38;printsec=frontcover&#38;dq=silverman+tate+elliptic+curves#v=onepage&#38;q=&#38;f=false"><em>Rational Points on Elliptic Curves</em></a> by Silverman and Tate, published by Springer UTM in 1992.  (The relevant part is available on Google books.)</p>
<h3>Hints</h3>
<ol>
<li>For rational point <img src='http://l.wordpress.com/latex.php?latex=P%3D+%28x%2Cy%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P= (x,y)' title='P= (x,y)' class='latex' /> on curve <img src='http://l.wordpress.com/latex.php?latex=y%5E2+%3D+c%2Bx%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y^2 = c+x^3' title='y^2 = c+x^3' class='latex' />, gradient at <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7Bdy%7D%7Bdx%7D+%3D+%5Cfrac%7B3x%5E2%7D%7B2y%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{dy}{dx} = \frac{3x^2}{2y}' title='\frac{dy}{dx} = \frac{3x^2}{2y}' class='latex' />.</li>
<li>Equation of tangent <img src='http://l.wordpress.com/latex.php?latex=Y+%3D+X.%5Cfrac%7B3x%5E2%7D%7B2y%7D+%2B+%28y-%5Cfrac%7B3x%5E3%7D%7B2y%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y = X.\frac{3x^2}{2y} + (y-\frac{3x^3}{2y})' title='Y = X.\frac{3x^2}{2y} + (y-\frac{3x^3}{2y})' class='latex' />.</li>
<li>Substituting, <img src='http://l.wordpress.com/latex.php?latex=Y%5E2%3DX%5E2.%5Cfrac%7B9x%5E4%7D%7B4y%5E2%7D%2BX.%5Cfrac%7B3x%5E2%7D%7By%7D.+%28y-%5Cfrac%7B3x%5E3%7D%7B2y%7D%29+%2B+%28y%5E2-3x%5E3%2B%5Cfrac%7B9x%5E6%7D%7B4y%5E2%7D%29%3Dc%2BX%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y^2=X^2.\frac{9x^4}{4y^2}+X.\frac{3x^2}{y}. (y-\frac{3x^3}{2y}) + (y^2-3x^3+\frac{9x^6}{4y^2})=c+X^3' title='Y^2=X^2.\frac{9x^4}{4y^2}+X.\frac{3x^2}{y}. (y-\frac{3x^3}{2y}) + (y^2-3x^3+\frac{9x^6}{4y^2})=c+X^3' class='latex' />.</li>
<li>Intersects at <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P' title='P' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=Q%3D+%28%5Calpha%2C%5Cbeta%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Q= (\alpha,\beta)' title='Q= (\alpha,\beta)' class='latex' />, thus sum of roots:</li>
</ol>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=x%2Bx%2B%5Calpha%3D%5Cfrac%7B9x%5E4%7D%7B4y%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x+x+\alpha=\frac{9x^4}{4y^2}' title='x+x+\alpha=\frac{9x^4}{4y^2}' class='latex' /></p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Calpha+%3D+%5Cfrac%7Bx%5E4-8xc%7D%7B4y%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha = \frac{x^4-8xc}{4y^2}' title='\alpha = \frac{x^4-8xc}{4y^2}' class='latex' />.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 8: Eulerian circuits]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/09/29/theorem-8-eulerian-circuits/</link>
<pubDate>Tue, 29 Sep 2009 21:02:19 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/09/29/theorem-8-eulerian-circuits/</guid>
<description><![CDATA[On several occasions over the last few weeks, I have read that Immanuel Kant was from the Prussian c]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>On several occasions over the last few weeks, I have read that <a title="Wikipedia page about Immanuel Kant" href="http://en.wikipedia.org/wiki/Immanuel_Kant">Immanuel Kant</a> was from the Prussian city of <a title="Wikipedia page about Konigsberg" href="http://en.wikipedia.org/wiki/K%C3%B6nigsberg">Königsberg</a> (now Kaliningrad in Russia).  As a mathematician, this is not the first thing that comes to mind when I think of Königsberg. Instead, I think of the famous seven bridges (no longer all there), and the mathematical problem that they inspired.  Let me tell you about that problem, and about some of the interesting mathematics that came out of it.</p>
<p>Like many cities, Königsberg was situated on a river.  The city included an island in that river, and so there were several bridges, between the island and the banks of the river.  There&#8217;s a picture of the city and its bridges on <a title="Wikipedia page about the seven bridges of Konigsberg" href="http://en.wikipedia.org/wiki/Seven_Bridges_of_K%C3%B6nigsberg">Wikipedia</a>.</p>
<p>So what was the problem?  Well, could a resident of Königsberg have gone for a Sunday afternoon stroll in such a way that he would walk over each bridge exactly once? That is, he had to cross every bridge (no doubling back!), but he wasn&#8217;t allowed to walk across any bridge more than once. (No ferries allowed either!)  Oh, and our friendly Königsberger wanted to end up at home, where he started.</p>
<p>The problem was first solved by the great Swiss mathematician <a title="MacTutor biography of Euler" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Euler.html">Leonhard Euler</a>.  What did he do?  His first step was to strip away the extraneous information so that he could focus on the important things.  Today, we are familiar with his idea, thanks to the <a title="Wikipedia page about the tube map" href="http://en.wikipedia.org/wiki/Tube_map">map of the London underground</a>, but of course that came much later than Euler.  The idea is the same, though: what matters is not geographical locations, but which places are joined to which other places.  We can represent lumps of land (the island and the banks of the river) by blobs (<em>vertices</em>, singular <em>vertex</em>), and bridges by lines connecting those blobs (<em>edges</em>).  Here&#8217;s what Königsberg looks like when represented in this way (by a <em>graph</em>).</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-243" title="Konigsberg" src="http://theoremoftheweek.wordpress.com/files/2009/09/konigsberg.jpg" alt="Konigsberg" width="355" height="96" /></p>
<p>Now the problem is to decide whether we can draw this network of edges without taking the pen off the paper (and without drawing any edge more than once).  You might like to try this before reading on.</p>
<p><!--more--></p>
<p>So, did you manage it?  I bet you didn&#8217;t!  But can you <em>prove</em> that it&#8217;s not possible?  That&#8217;s what Euler did.</p>
<p>What goes wrong when you try to draw it?  I think that the problem is probably that you find yourself stuck at a vertex with no way out (but with an edge still to draw).  Is that inevitable?  Let&#8217;s think about what happens at some vertex A (not the starting vertex).  We go in to it.  Then we go out again.  Then, perhaps a bit later, we go in.  Then &#8230;  In fact, every time we go in to it, we want to leave it again.  So we need an even number of edges at that vertex.  A similar argument shows that we need an even number of edges at the start/finish vertex too.  (The number of edges at a vertex is called the <em>degree</em> of that vertex.  We&#8217;ve just said that we need all the vertices to have even degree.)</p>
<p>What happens in Königsberg?  Well, there are three vertices with degree 3, and one with degree 5.  In particular, they all have odd degree.  So there can&#8217;t possibly be a path of the sort we want (an <em>Eulerian cycle</em>)!</p>
<p>That&#8217;s the Königsberg problem pretty much wrapped up.  As mathematicians, though, we&#8217;d like to know more.  We&#8217;d like to generalise.  Here, we had quite an easy way to show that the walk wasn&#8217;t possible (we just checked each vertex to see whether it had even degree, and as soon as we found one that didn&#8217;t we knew that there couldn&#8217;t be an Eulerian cycle).  What would happen for a different graph (configuration of lumps of land and bridges)?  If all the vertices have even degree, does that mean that an Eulerian cycle exists, or just that we can&#8217;t use this method to rule out the existence of one?  Well, let me tell you what this week&#8217;s theorem says.</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem</span></strong> A connected graph has an Eulerian cycle if and only if every vertex has even degree.</p></blockquote>
<p>Remember that the degree of a vertex is the number of edges coming out of it.  A <em>connected graph</em> is one in which it&#8217;s possible to get from any vertex to any other vertex: there are no isolated components.  A graph that isn&#8217;t connected can&#8217;t possibly have an Eulerian cycle, because there are two vertices with no path between them (two bits of land that aren&#8217;t connected by any route).  So we have to make sure that we deal only with connected graphs.  But that&#8217;s not a big problem, because any graph that isn&#8217;t connected is still made up of connected components, so we can just think about those instead.</p>
<p>We&#8217;ve already see the justification of one direction of this result.  Let&#8217;s remind ourselves.  If there&#8217;s an Eulerian cycle, then we have to go out of each vertex as many times as we go into it.  So each vertex must have even degree.</p>
<p>What about the other direction?  That&#8217;s a bit more interesting, because it&#8217;s not quite as easy to prove (although it&#8217;s not terribly difficult).   One proof is by <a title="Glossary entry for proof by induction" href="http://theoremoftheweek.wordpress.com/glossary/#Proof_by_induction">induction</a> on the number of edges in the graph.  That is, one shows that it&#8217;s true when there are no edges (not very difficult!), and then one shows that if it&#8217;s true for all graphs with <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='E' title='E' class='latex' /> edges, then it must also be true for all graphs with <img src='http://l.wordpress.com/latex.php?latex=E%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='E+1' title='E+1' class='latex' /> edges.  In the interests of keeping this post to a reasonable length, I&#8217;m not going to go into the details, but you&#8217;re welcome to ask below if you want me to say some more.</p>
<p>Euler&#8217;s work is often seen as the start of the branch of mathematics called <a title="Wikipedia page about graph theory" href="http://en.wikipedia.org/wiki/Graph_theory">graph theory</a>, which is a good source of interesting theorems and questions (although it took a while for it to get started after Euler&#8217;s work).   These days, it also has some applications, for example to computer science and the theory of networks (such as the internet).  That said, as a pure mathematician my main interest is in the maths for its own sake!  The famous <a title="Wikipedia page about the four colour theorem" href="http://en.wikipedia.org/wiki/Four-color_theorem">four colour theorem</a> (perhaps a future theorem of the week?) belongs to graph theory, to give just one more example.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Fermat's Last Theorem]]></title>
<link>http://randomabsurdity.wordpress.com/2009/09/09/fermats-last-theorem/</link>
<pubDate>Wed, 09 Sep 2009 22:47:55 +0000</pubDate>
<dc:creator>Aaron</dc:creator>
<guid>http://randomabsurdity.wordpress.com/2009/09/09/fermats-last-theorem/</guid>
<description><![CDATA[Fermat&#8217;s Last Theorem is a very interesting theory. It states that no three positive integers ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Fermat&#8217;s Last Theorem is a very interesting theory. It states that no three positive integers a, b, and c can satisfy the equation a<sup>n</sup> + b<sup>n</sup> = c<sup>n</sup> for any integer value of n greater than two. The interesting thing about this is Pierre de Fermat discovered it in 1637. But it wasn&#8217;t proven until 1995. It went over <strong>300 years</strong> without being proven. And yet it&#8217;s true!</p>
<p>It was Finally solved by Andrew Wiles, but it took him 7 years, and the proof is over 100 pages long.</p>
<p>How Fermat could have figured it out but has eluded several thousand over the past 3 years is still a mystery.</p>
<p>For more info, go to <a href="http://en.wikipedia.org/wiki/Fermat%27s_last_theorem">http://en.wikipedia.org/wiki/Fermat%27s_last_theorem  </a> it has everything you could want to know about it.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 7: the Banach-Tarski paradox]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/09/07/theorem-7-the-banach-tarski-paradox/</link>
<pubDate>Mon, 07 Sep 2009 20:03:29 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/09/07/theorem-7-the-banach-tarski-paradox/</guid>
<description><![CDATA[Let&#8217;s try a thought experiment.  Take a (theoretical) snooker ball.  Is it possible to break i]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Let&#8217;s try a thought experiment.  Take a (theoretical) <a title="Wikipedia page about snooker" href="http://en.wikipedia.org/wiki/Snooker">snooker</a> ball.  Is it possible to break it up into one hundred parts in such a way that the parts can be put together to form two snooker balls?  The balls have to be solid, with no holes, the pieces may not be stretched or squashed, and the new balls must be the same size as the original one (or it would make for a bad game of snooker!).</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-235" title="Snooker balls" src="http://theoremoftheweek.wordpress.com/files/2009/09/snooker_ball_4.jpg" alt="Snooker balls" width="355" height="96" /></p>
<p>Obviously, it&#8217;s not possible, right?</p>
<p>Well&#8230;</p>
<p><!--more--></p>
<p>Why is it obviously not possible?  Let&#8217;s examine our intuition.  Here&#8217;s an argument that seems compelling to me.</p>
<p>When we split the ball into several pieces, we don&#8217;t change the total mass of the pieces: it&#8217;s still the mass of the original ball.  (This is one advantage of a theoretical snooker ball, where we can break it without losing any splinters on the way!)  When we rearrange those pieces, in any way at all, we don&#8217;t change the total mass.  But the total mass of two snooker balls is (obviously) twice the mass of a single snooker ball.  Since cutting the balls and rearranging pieces can&#8217;t increase the mass, we can&#8217;t make two balls from one.</p>
<p>Are you convinced?</p>
<h3>Casting doubt</h3>
<p>Why did I cast doubts on this earlier?  I&#8217;m allowing us to break up the snooker ball in any way we like (as long as it&#8217;s into one hundred pieces).  In particular, we&#8217;re allowed to do things that aren&#8217;t possible in practice.  For example, one piece could consist of a single point, if we wanted.  What implications does that have for the argument I gave above?  Well, what&#8217;s the mass of a single point?</p>
<p>If a single point has a positive mass, then a snooker ball (which consists of infinitely many points) has infinite mass.  Since a snooker ball plainly does not have infinite mass, we&#8217;d better decide that a single point has zero mass.  Did you expect that?  It suggests to me that perhaps some interesting things can happen when we break up a snooker ball mathematically, rather than with a chisel and hammer.  But it&#8217;s going to get stranger yet&#8230;</p>
<p>As mathematicians, it&#8217;s starting to look a bit concerning.  We have an intuitive idea of what it means to talk about the mass of an object, but do we have a precise, mathematical definition?  Can we assign a mass to any subset of a snooker ball?</p>
<h3>A proper definition of mass</h3>
<p>It turns out that it is possible to come up with a precise definition of mass.  (I hasten to add that this is mass as viewed by a pure mathematician.  I make no claims that this is relevant to real physical objects!)  This leads to an area of maths called measure theory.  We need an object called a <a title="Wikipedia page about measures" href="http://en.wikipedia.org/wiki/Measure_(mathematics)">measure</a>.  As with many mathematical definitions, the definition of a measure is that it is a particular type of object (in this case, a function) that satisfies certain conditions (the properties that we expect any reasonable notion of mass to satisfy).</p>
<p>We need a function: the function takes a subset of three-dimensional space (a chunk of our snooker ball), and gives as output the mass of that set, which should be a non-negative real number.  (In general, it could also be infinity, but our snooker ball definitely has finite mass!)  What properties do we want this function to have?  I&#8217;m not going to get into the precise details here, but let me mention two ideas.</p>
<p>One is that the mass of the empty set (the set with no members) is 0.  Well, any reasonable definition of mass ought to insist that the mass of nothing is nothing!</p>
<p>The other is that if we take two disjoint objects (they have no points in common), then the mass of both objects together is the sum of the masses of the individual objects.  For example, earlier we said that the mass of two snooker balls is twice the mass of an individual ball.  (In fact, the relevant property of a measure is more general, but I don&#8217;t want to be side-tracked.)</p>
<p>Right, so we <em>can</em> define a proper notion of mass.  So what&#8217;s the problem?  I rather glossed over the <a title="Glossary entry for function" href="http://theoremoftheweek.wordpress.com/glossary/#Function">domain of the function</a>.  That is, I didn&#8217;t tell you <em>which</em> sets have masses.  The measure that mathematicians use as a model of ordinary mass is the <a title="MacTutor biography of Lebesgue" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Lebesgue.html">Lebesgue</a> <a title="Wikipedia page about Lebesgue measure" href="http://en.wikipedia.org/wiki/Lebesgue_measure">measure</a>.  It turns out that no matter how hard one tries<a title="Footnote" href="#Footnote">*</a>, there are subsets that don&#8217;t have a Lebesgue measure: it&#8217;s not possible to give them a mass!  There&#8217;s a direct proof of this (rather extraordinary) fact, but one can also deduce it from this week&#8217;s theorem &#8212; so I&#8217;d better tell you what that result is.</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (the <a title="MacTutor biography of Banach" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Banach.html">Banach</a>-<a title="MacTutor biography of Tarski" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Tarski.html">Tarski</a> paradox)</span></strong>  It is possible to break a (mathematical) snooker ball into one hundred pieces in such a way that those pieces can be reassembled to form two snooker balls.</p></blockquote>
<p>(I&#8217;ve been rather generous in writing &#8220;one hundred&#8221; here: a somewhat smaller number would do.  The point is really that it&#8217;s a small finite number.)</p>
<h3>The existence of non-measurable sets</h3>
<p> Our earlier argument (the one at the start) shows that if the pieces of the snooker ball are always measurable (have masses), then this wouldn&#8217;t work, so there must be a non-measurable set!</p>
<p>(It&#8217;s easier to prove the existence of a non-measurable set than it is to prove the Banach-Tarski paradox, but if one knows about the &#8220;paradox&#8221; then the existence of a non-measurable set follows directly.)</p>
<p><a id="Footnote">*</a> I&#8217;m assuming the <a title="Wikipedia page about the axiom of choice" href="http://en.wikipedia.org/wiki/Axiom_of_choice">Axiom of Choice</a> in this post.  It&#8217;s a particular assumption that mathematicians may or may not choose to make in their work.  It&#8217;s not always relevant, but there are some results that really depend on it, so it can be a good idea to mention when one is using it.  The moral of this post is that it can lead to some apparently bizarre consequences (despite sounding very plausible)!</p>
<h3>Further reading</h3>
<p>Here&#8217;s the Wikipedia page about the <a title="Wikipedia page about the Banach-Tarski paradox" href="http://en.wikipedia.org/wiki/Banach-Tarski_paradox">Banach-Tarski paradox</a>.  The classic book on the subject is <em>The Banach-Tarski Paradox</em> by Stan Wagon, published by Cambridge University Press in the series <em>Encyclopedia of mathematics and its applications</em>.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 6: the Binomial Theorem]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/09/03/theorem-6-the-binomial-theorem/</link>
<pubDate>Thu, 03 Sep 2009 20:24:58 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/09/03/theorem-6-the-binomial-theorem/</guid>
<description><![CDATA[Have you ever noticed a similarity between the first few powers of 11 and the first few rows of Pasc]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Have you ever noticed a similarity between the first few powers of 11 and the first few rows of <a title="MacTutor biography of Pascal" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Pascal.html">Pascal</a>&#8217;s <a title="Glossary entry for Pascal's triangle" href="http://theoremoftheweek.wordpress.com/glossary/#Pascal's_triangle">triangle</a>?  I&#8217;ll write them out, so that you can compare them.  Here are the first few powers of 11.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=11%5E0+%3D+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^0 = 1' title='11^0 = 1' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=11%5E1+%3D+11&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^1 = 11' title='11^1 = 11' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=11%5E2+%3D+121&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^2 = 121' title='11^2 = 121' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=11%5E3+%3D+1331&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^3 = 1331' title='11^3 = 1331' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=11%5E4+%3D+14641&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^4 = 14641' title='11^4 = 14641' class='latex' />.</p>
<p>And here are the first few rows of Pascal&#8217;s triangle.</p>
<p>    1</p>
<p>   1 1</p>
<p>  1 2 1</p>
<p> 1 3 3 1</p>
<p>1 4 6 4 1</p>
<p>Notice anything interesting?  I&#8217;d like to explore the link in this post.</p>
<p><!--more--></p>
<p>Perhaps the first thing is to see whether the pattern continues.  The next row of Pascal&#8217;s triangle is</p>
<p>1   5   10   10   5   1,</p>
<p>and <img src='http://l.wordpress.com/latex.php?latex=11%5E5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^5' title='11^5' class='latex' /> is 161051.  So something seems to go wrong.  Let&#8217;s try to understand this, because there does seem to be something curious going on here.  It would be great if we could understand both why the first few rows are the same, and why the sixth rows are different.</p>
<h3>Understanding the links</h3>
<p>How do you calculate the powers of 11?  Here&#8217;s how I found <img src='http://l.wordpress.com/latex.php?latex=11%5E4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^4' title='11^4' class='latex' />, having found that <img src='http://l.wordpress.com/latex.php?latex=11%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^3' title='11^3' class='latex' /> is 1331.  I want to multiply 1331 by 11.  That&#8217;s the same as multiplying 1331 by 10 (which is easy), and then adding 1331.  (If you do it using long multiplication, that&#8217;s exactly what you&#8217;re doing.)  So I want to add 13310 and 1331.</p>
<p>   13310</p>
<p><span style="text-decoration:underline;">+   1331</span></p>
<p>   14641</p>
<p>I&#8217;m adding 1331 to &#8220;1331 shifted one place to the left with a 0 after it&#8221;.  So a digit in the answer is the sum of two adjacent digits in 1331 (where we can imagine a 0 before and after 1331: 013310).  But that&#8217;s exactly how Pascal&#8217;s triangle is generated: each digit is the sum of the two adjacent digits above it!</p>
<p>That appears to explain the link, but why does the connection break down when we get to the sixth row?  Let&#8217;s see what the calculation of <img src='http://l.wordpress.com/latex.php?latex=11%5E5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^5' title='11^5' class='latex' /> looks like.  We know that <img src='http://l.wordpress.com/latex.php?latex=11%5E4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='11^4' title='11^4' class='latex' /> is 14641, so we want to add 10 times that (146410) to 14641.</p>
<p>   146410</p>
<p><span style="text-decoration:underline;">+   14641</span></p>
<p>   161051</p>
<p>Ah.  So the problem occurs because of &#8220;carrying&#8221;: <img src='http://l.wordpress.com/latex.php?latex=4+%2B+6&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4 + 6' title='4 + 6' class='latex' /> becomes &#8220;0 carry 1&#8243;, rather than 10.  So there&#8217;s a sense in which it feels as though the pattern <em>would</em> continue, if it weren&#8217;t for this pesky business of place value!</p>
<p>What next?  I&#8217;d like to show you how this relates to something a bit more general.</p>
<h3>A more general result</h3>
<p>When we looked at powers of 11, we decomposed 11 as <img src='http://l.wordpress.com/latex.php?latex=10+%2B+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='10 + 1' title='10 + 1' class='latex' />, and that helped us to understand what was going on.  I&#8217;d like to think now about powers of the more general expression <img src='http://l.wordpress.com/latex.php?latex=x+%2B+y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x + y' title='x + y' class='latex' />.  Let&#8217;s write out the first few powers.  (I&#8217;m going to skip the intermediate calculations that I might have used to get these &#8212; you might like to do the calculations to convince yourself.)</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E0+%3D+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^0 = 1' title='(x+y)^0 = 1' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E1+%3D+x%2By&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^1 = x+y' title='(x+y)^1 = x+y' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E2+%3D+x%5E2+%2B+2xy+%2B+y%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^2 = x^2 + 2xy + y^2' title='(x+y)^2 = x^2 + 2xy + y^2' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E3+%3D+x%5E3+%2B+3x%5E2+y+%2B+3x+y%5E2+%2B+y%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^3 = x^3 + 3x^2 y + 3x y^2 + y^3' title='(x+y)^3 = x^3 + 3x^2 y + 3x y^2 + y^3' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E4+%3D+x%5E4+%2B+4x%5E3+y+%2B+6x%5E2+y%5E2+%2B+4x+y%5E3+%2B+y%5E4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^4 = x^4 + 4x^3 y + 6x^2 y^2 + 4x y^3 + y^4' title='(x+y)^4 = x^4 + 4x^3 y + 6x^2 y^2 + 4x y^3 + y^4' class='latex' />.</p>
<p>Notice anything about the numbers appearing here?  (They&#8217;re called the <em>coefficients</em>.  For example, the coefficient of <img src='http://l.wordpress.com/latex.php?latex=xy&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='xy' title='xy' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=x%5E2+%2B+2xy+%2B+y%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^2 + 2xy + y^2' title='x^2 + 2xy + y^2' class='latex' /> is 2, and the coefficient of <img src='http://l.wordpress.com/latex.php?latex=x%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^2' title='x^2' class='latex' /> is 1.)  They&#8217;re exactly our old friends from Pascal&#8217;s triangle again!</p>
<p>Can we explain this?  The good news is that it&#8217;s just the same sort of idea as last time.  Let&#8217;s think about finding <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^4' title='(x+y)^4' class='latex' />, once we know that <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E3+%3D+x%5E3+%2B+3x%5E2+y+%2B+3x+y%5E2+%2B+y%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^3 = x^3 + 3x^2 y + 3x y^2 + y^3' title='(x+y)^3 = x^3 + 3x^2 y + 3x y^2 + y^3' class='latex' />.  We need to multiply <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^3' title='(x+y)^3' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=x%2By&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x+y' title='x+y' class='latex' />.  That is, we need to multiply <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^3' title='(x+y)^3' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />, and add it to <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^3' title='(x+y)^3' class='latex' /> multiplied by <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' />.  It&#8217;s perhaps not quite as obvious how to write it out this time, but I&#8217;ll have a go.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E3+%5Ctimes+x+%3D+%28x%5E3+%2B+3x%5E2+y+%2B+3x+y%5E2+%2B+y%5E3%29x+%3D+x%5E4+%2B+3x%5E3+y+%2B+3x%5E2+y%5E2+%2B+x+y%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^3 \times x = (x^3 + 3x^2 y + 3x y^2 + y^3)x = x^4 + 3x^3 y + 3x^2 y^2 + x y^3' title='(x+y)^3 \times x = (x^3 + 3x^2 y + 3x y^2 + y^3)x = x^4 + 3x^3 y + 3x^2 y^2 + x y^3' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E3+%5Ctimes+y+%3D+%28x%5E3+%2B+3x%5E2+y+%2B+3x+y%5E2+%2B+y%5E3%29y+%3D+x%5E3+y+%2B+3x%5E2+y%5E2+%2B+3x+y%5E3+%2B+y%5E4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^3 \times y = (x^3 + 3x^2 y + 3x y^2 + y^3)y = x^3 y + 3x^2 y^2 + 3x y^3 + y^4' title='(x+y)^3 \times y = (x^3 + 3x^2 y + 3x y^2 + y^3)y = x^3 y + 3x^2 y^2 + 3x y^3 + y^4' class='latex' />.</p>
<p>Now we need to add these.  I&#8217;ll try to align them helpfully to make that easier (although I&#8217;ve struggled to get LaTeX to cooperate, so it&#8217;s not as pretty as I&#8217;d like!).</p>
<p>   <img src='http://l.wordpress.com/latex.php?latex=x%5E4+%2B+3x%5E3+y+%2B+3x%5E2+y%5E2+%2B+xy%5E3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^4 + 3x^3 y + 3x^2 y^2 + xy^3' title='x^4 + 3x^3 y + 3x^2 y^2 + xy^3' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cunderline%7B%2B%5C+%5C+%5C+%5C+%5C+%5C+%5C+%5C+%5C+x%5E3+y+%2B+3x%5E2+y%5E2+%2B+3x+y%5E3+%2B+y%5E4%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\underline{+\ \ \ \ \ \ \ \ \ x^3 y + 3x^2 y^2 + 3x y^3 + y^4}' title='\underline{+\ \ \ \ \ \ \ \ \ x^3 y + 3x^2 y^2 + 3x y^3 + y^4}' class='latex' /></p>
<p>   <img src='http://l.wordpress.com/latex.php?latex=x%5E4+%2B+4x%5E3+y+%2B+6x%5E2+y%5E2+%2B+4xy%5E3+%2B+y%5E4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^4 + 4x^3 y + 6x^2 y^2 + 4xy^3 + y^4' title='x^4 + 4x^3 y + 6x^2 y^2 + 4xy^3 + y^4' class='latex' /></p>
<p>It&#8217;s just the same idea as before: adding pairs of adjacent coefficients!</p>
<p>Let me tell you what the binomial theorem says (since <a title="Wikipedia page about the Modern Major-General's Song" href="http://en.wikipedia.org/wiki/Modern_Major_General%27s_Song">Major-General Stanley</a>, however modern, will not).</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (the binomial theorem)</span></strong> For any real numbers <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' />, and any non-negative whole number <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />, the coefficients of the various terms in <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^n' title='(x+y)^n' class='latex' /> are the numbers of the <img src='http://l.wordpress.com/latex.php?latex=%28n%2B1%29%5E%7B%5Ctextrm%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(n+1)^{\textrm{th}}' title='(n+1)^{\textrm{th}}' class='latex' /> row of Pascal&#8217;s triangle.  More precisely, the terms are of the form <img src='http://l.wordpress.com/latex.php?latex=%5Cbinom%7Bn%7D%7Br%7D+x%5Er+y%5E%7Bn-r%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\binom{n}{r} x^r y^{n-r}' title='\binom{n}{r} x^r y^{n-r}' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> is a whole number between 0 and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> (inclusive), and <img src='http://l.wordpress.com/latex.php?latex=%5Cbinom%7Bn%7D%7Br%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\binom{n}{r}' title='\binom{n}{r}' class='latex' /> is the <img src='http://l.wordpress.com/latex.php?latex=%28r%2B1%29%5E%7B%5Ctextrm%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(r+1)^{\textrm{th}}' title='(r+1)^{\textrm{th}}' class='latex' /> entry in the <img src='http://l.wordpress.com/latex.php?latex=%28n%2B1%29%5E%7B%5Ctextrm%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(n+1)^{\textrm{th}}' title='(n+1)^{\textrm{th}}' class='latex' /> row of Pascal&#8217;s triangle.</p></blockquote>
<p>The numbers <img src='http://l.wordpress.com/latex.php?latex=%5Cbinom%7Bn%7D%7Br%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\binom{n}{r}' title='\binom{n}{r}' class='latex' /> have a special name: they are called the <em>binomial coefficients</em>!  They have many interesting properties.  For example, <img src='http://l.wordpress.com/latex.php?latex=%5Cbinom%7Bn%7D%7Br%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\binom{n}{r}' title='\binom{n}{r}' class='latex' /> is the number of ways to choose <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> objects from a collection of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> objects (where the order of the <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> objects doesn&#8217;t matter).  In how many ways can you choose three of the <a title="Wikipedia page about Snow White" href="http://en.wikipedia.org/wiki/Snow_White">seven dwarfs</a>?  Picking Sleepy, Happy and Sneezy is the same as picking Happy, Sneezy and Sleepy: that&#8217;s what I mean when I say that the order doesn&#8217;t matter.  The answer is that there are <img src='http://l.wordpress.com/latex.php?latex=%5Cbinom+%7B7%7D%7B3%7D+%3D+35&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\binom {7}{3} = 35' title='\binom {7}{3} = 35' class='latex' /> ways.  This is why <img src='http://l.wordpress.com/latex.php?latex=%5Cbinom%7Bn%7D%7Br%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\binom{n}{r}' title='\binom{n}{r}' class='latex' /> is read as &#8220;<img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> choose <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' />&#8220;.</p>
<p>I could say lots more about the binomial coefficients and their interesting properties (including a formula that means that one doesn&#8217;t have to write out Pascal&#8217;s triangle each time!), but that would make this post too long, so I&#8217;ll resist temptation and give some suggestions for further reading instead.</p>
<p>It turns out that there&#8217;s a very nice justification of the binomial theorem using the property that <img src='http://l.wordpress.com/latex.php?latex=%5Cbinom%7Bn%7D%7Br%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\binom{n}{r}' title='\binom{n}{r}' class='latex' /> is the number of ways to choose <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> objects from <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> (where order is unimportant).  I&#8217;ll let you think about what that justification is, but here&#8217;s a (quite large) hint.  Think about <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^n' title='(x+y)^n' class='latex' /> as <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%28x%2By%29%5Cdotsm+%28x%2By%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)(x+y)\dotsm (x+y)' title='(x+y)(x+y)\dotsm (x+y)' class='latex' />, with <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> bracketed factors.  Each term in the answer involves an <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> or a <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y' title='y' class='latex' /> from each bracket.  How many times will you get <img src='http://l.wordpress.com/latex.php?latex=x%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^n' title='x^n' class='latex' />?  What about <img src='http://l.wordpress.com/latex.php?latex=x%5Er+y%5E%7Bn-r%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^r y^{n-r}' title='x^r y^{n-r}' class='latex' />?</p>
<p>It turns out that it&#8217;s possible to give a more general version of the binomial theorem (sometimes attributed to <a title="MacTutor biography of Newton" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Newton.html">Newton</a>), in which the exponent <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> doesn&#8217;t have to be a non-negative integer.  Although this version is both useful and interesting, I shan&#8217;t go into it here.</p>
<p>Why have I chosen to write about the binomial theorem?  The theorem itself is jolly useful: I can tell you what the coefficient of <img src='http://l.wordpress.com/latex.php?latex=x%5E%7B51%7D+y%5E%7B49%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x^{51} y^{49}' title='x^{51} y^{49}' class='latex' /> is in the expansion of <img src='http://l.wordpress.com/latex.php?latex=%28x%2By%29%5E%7B100%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(x+y)^{100}' title='(x+y)^{100}' class='latex' />, without having to multiply out the whole thing.  But for me perhaps even more interesting are the links with Pascal&#8217;s triangle and numbers of combinations, and the various properties of the binomial coefficients.</p>
<h3>Further reading</h3>
<p>Wikipedia has a fairly extensive discussion of the <a title="Wikipedia page about the binomial theorem" href="http://en.wikipedia.org/wiki/Binomial_theorem">binomial theorem</a>, including other proofs and the more general version I mentioned above.  It also has quite a lot about <a title="Wikipedia page about Pascal's triangle" href="http://en.wikipedia.org/wiki/Pascal%27s_triangle">Pascal&#8217;s triangle</a>.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[I'm Finally Back! Also: On the Nature of the Average of Infinite Sums]]></title>
<link>http://ethanmendel.wordpress.com/2009/08/27/im-finally-back-also-on-the-nature-of-the-average-of-infinite-sums/</link>
<pubDate>Thu, 27 Aug 2009 16:19:15 +0000</pubDate>
<dc:creator>Ethan Mendel</dc:creator>
<guid>http://ethanmendel.wordpress.com/2009/08/27/im-finally-back-also-on-the-nature-of-the-average-of-infinite-sums/</guid>
<description><![CDATA[It&#8217;s been quite awhile, but this is the problem of having a life. There&#8217;s really no excu]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>It&#8217;s been quite awhile, but this is the problem of having a life. There&#8217;s really no excuse for not being here for this long, and I apologize for it.</p>
<p>Also I&#8217;m loath not to put up another blog post after reading this http://www.xkcd.com/621/</p>
<p>Makes me feel particularly exciting&#8230;</p>
<p>Ok not really.</p>
<p>But onwards! I figured today we should go through a little thought experiment:</p>
<p>Let&#8217;s ask the question, &#8220;What is the average of infinitely many things?&#8221;</p>
<p>Now the classic definition of an average looks like this:</p>
<p>Given a sequence:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=a_n+%5Cequiv+%5C%7Ba_o%2C+a_1%2C+a_2%2C+...+a_%7Bn-1%7D%2C+a_n%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_n \equiv \{a_o, a_1, a_2, ... a_{n-1}, a_n\}' title='a_n \equiv \{a_o, a_1, a_2, ... a_{n-1}, a_n\}' class='latex' /></p>
<p>The average of that sequence is the series of the sequence divided by the number of terms:</p>
<p>Let Av represent the average of a sequence.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=Av+%5Cequiv+%5Cfrac%7B%5Csum_%7Bi%3D0%7D%5En+a_n%7D%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Av \equiv \frac{\sum_{i=0}^n a_n}{n}' title='Av \equiv \frac{\sum_{i=0}^n a_n}{n}' class='latex' /></p>
<p>However, if we have an infinite sequence, then the series:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%3D0%7D%5E%7Bn%7D+a_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{i=0}^{n} a_n' title='\sum_{i=0}^{n} a_n' class='latex' /></p>
<p>Can either converge (that is, the series has a finite value) or it diverges (the series has a value of either positive or negative infinity). Note that in this case, <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> must either be equal to <img src='http://l.wordpress.com/latex.php?latex=%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\infty' title='\infty' class='latex' /> or the step <img src='http://l.wordpress.com/latex.php?latex=%5CDelta+i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta i' title='\Delta i' class='latex' /> must be infinitesimally small.</p>
<p>Noting that these are two distinct and special cases, it is clear that we will have to analyze these separately.</p>
<p><span style="text-decoration:underline;">Convergent Series</span></p>
<p>For any convergent series:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%3D0%7D%5E%7Bn%7D+a_n+%3D+c+%3A+c+%26%2360%3B+%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{i=0}^{n} a_n = c : c &lt; \infty' title='\sum_{i=0}^{n} a_n = c : c &lt; \infty' class='latex' /></p>
<p>Recall:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=Av+%5Cequiv+%5Cfrac%7B%5Csum_%7Bi%3D0%7D%5En+a_n%7D%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Av \equiv \frac{\sum_{i=0}^n a_n}{n}' title='Av \equiv \frac{\sum_{i=0}^n a_n}{n}' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Ctherefore+Av+%3D+%5Cfrac%7B%5Csum_%7Bi%3D0%7D%5En+a_n%7D%7Bn%7D+%3D+%5Cfrac%7Bc%7D%7Bn%7D+%3D+%5Cfrac%7Bc%7D%7B%5Cinfty%7D+%3D+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\therefore Av = \frac{\sum_{i=0}^n a_n}{n} = \frac{c}{n} = \frac{c}{\infty} = 0' title='\therefore Av = \frac{\sum_{i=0}^n a_n}{n} = \frac{c}{n} = \frac{c}{\infty} = 0' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Csquare&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\square' title='\square' class='latex' /></p>
<p>Therefore by the classical definition of an average, the sum of a convergent infinite series is zero. Note that <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> here is determined to be zero because the number of terms in the sequence is infinite.</p>
<p><span style="text-decoration:underline;">Divergent Series</span></p>
<p>For any divergent series:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%3D0%7D%5En+a_n+%3D+%5Cpm+%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{i=0}^n a_n = \pm \infty' title='\sum_{i=0}^n a_n = \pm \infty' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Ctherefore+Av+%3D+%5Cfrac%7B%5Csum_%7Bi%3D0%7D%5En+a_n%7D%7Bn%7D+%3D+%5Cfrac%7Bc%7D%7Bn%7D+%3D+%5Cfrac%7B%5Cpm+%5Cinfty%7D%7B%5Cinfty%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\therefore Av = \frac{\sum_{i=0}^n a_n}{n} = \frac{c}{n} = \frac{\pm \infty}{\infty}' title='\therefore Av = \frac{\sum_{i=0}^n a_n}{n} = \frac{c}{n} = \frac{\pm \infty}{\infty}' class='latex' /></p>
<p>Which is undefined since positive or negative infinity could converge to anywhere.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Csquare&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\square' title='\square' class='latex' /></p>
<p>Therefore under the classical definition, the average of an infinite series is either zero or undefined.</p>
<p><span style="text-decoration:underline;">Negation of the Convergent Sum Proof</span></p>
<p>In order to negate the proofs above, we need to find a situation that is unnecessarily true in those proofs. The easiest way to do that is to consider what a series actually is. In this case a series is the <em>sum of infinitely many things</em>! Those of you who know calculus should be perking up your ears by now. This is the conceptual definition of an integral.</p>
<p>Now of course integrals apply to functions, so in order to apply the integral in place of summation notation the following assumption has to be made:</p>
<p>The values of the sequence:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=a_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_n' title='a_n' class='latex' /></p>
<p>Are given by the function:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=f%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(x)' title='f(x)' class='latex' /></p>
<p>For any value <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' /> within a specific range denoted by <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' />, if the step <img src='http://l.wordpress.com/latex.php?latex=%5CDelta+i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Delta i' title='\Delta i' class='latex' /> in the summation notation is infinitely small.</p>
<p>Recall the formula for the average value of a function:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=Av+%3D+%5Cfrac%7B1%7D%7Bj-i%7D+%2A+%5Cint_%7Bi%7D%5Ej+f%28x%29+dx&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Av = \frac{1}{j-i} * \int_{i}^j f(x) dx' title='Av = \frac{1}{j-i} * \int_{i}^j f(x) dx' class='latex' /></p>
<p>Since we have assumed that the values of the sequence $a_n$ are given by the function <img src='http://l.wordpress.com/latex.php?latex=f%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(x)' title='f(x)' class='latex' /> we can replace the summation notation with the integral:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%3D0%7D%5E%7Bn%7D+a_n+%3D+%5Cint_%7Bi%7D%5E%7Bj%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_{i=0}^{n} a_n = \int_{i}^{j}' title='\sum_{i=0}^{n} a_n = \int_{i}^{j}' class='latex' /></p>
<p>Now if <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> are both finite, then it is possible to attain a meaningful average value if the integral of the function also converges:</p>
<p>Given:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cint_%7Bi%7D%5E%7Bj%7D%3Dc+%3A+c+%26%2360%3B+%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\int_{i}^{j}=c : c &lt; \infty' title='\int_{i}^{j}=c : c &lt; \infty' class='latex' /></p>
<p>Recall:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=Av+%3D+%5Cfrac%7B1%7D%7Bj-i%7D+%2A+%5Cint_%7Bi%7D%5Ej+f%28x%29+dx&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Av = \frac{1}{j-i} * \int_{i}^j f(x) dx' title='Av = \frac{1}{j-i} * \int_{i}^j f(x) dx' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Ctherefore&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\therefore' title='\therefore' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=Av+%3D+%5Cfrac%7B1%7D%7Bj-i%7D+%2A+c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Av = \frac{1}{j-i} * c' title='Av = \frac{1}{j-i} * c' class='latex' /></p>
<p>Observe that if <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' /> are both finite, then <img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7B1%7D%7Bj-i%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{1}{j-i}' title='\frac{1}{j-i}' class='latex' /> must also be finite:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7B1%7D%7Bj-i%7D%3Dk+%3A+k%26%2360%3B%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{1}{j-i}=k : k&lt;\infty' title='\frac{1}{j-i}=k : k&lt;\infty' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Ctherefore&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\therefore' title='\therefore' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=Av+%3D+k+%2A+c+%3A+c%26%2360%3B%5Cinfty+and+k%26%2360%3B%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Av = k * c : c&lt;\infty and k&lt;\infty' title='Av = k * c : c&lt;\infty and k&lt;\infty' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Ctherefore&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\therefore' title='\therefore' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=k+%2A+c+%26%2360%3B+%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k * c &lt; \infty' title='k * c &lt; \infty' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=Av+%26%2360%3B+%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Av &lt; \infty' title='Av &lt; \infty' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Csquare&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\square' title='\square' class='latex' /></p>
<p>Since the values used in this case are all finite (excluding the number of terms utilized), the final average is also finite. I also did not disprove that the average value of whatever function is being utilized is not zero, but rather since <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='i' title='i' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='j' title='j' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%5Cint_%7Bi%7D%5E%7Bj%7D+f%28x%29+dx&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\int_{i}^{j} f(x) dx' title='\int_{i}^{j} f(x) dx' class='latex' /> are all finite values and there is no division of a finite value by infinity the average value <em>does not have to be zero</em>. Therefore the conclusion reached here contradicts the conclusion above, and proves that infinite sums can have meaningful average values, but only on the condition that the range within which the infinite sum exists is finite. That is to say: averages as defined by the definitions utilized can only operate on either a finite number of values or a finite range within which an infinite number of terms exists.</p>
<p><span style="text-decoration:underline;">The Set as Represented by a Function with Relation to Averages</span></p>
<p>Interestingly enough, we should note that for ANY set consisting of real numbers, there is an equivalent function <img src='http://l.wordpress.com/latex.php?latex=f%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(x)' title='f(x)' class='latex' /> such that the average of that function is the true average of that set.</p>
<p>That is to say:</p>
<p>Given a set <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> such that:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=S+%3D+%5C%7BS_0%2C+S_1%2C+S_2%2C...S_%7Bn-1%7D%2C+S_n%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S = \{S_0, S_1, S_2,...S_{n-1}, S_n\}' title='S = \{S_0, S_1, S_2,...S_{n-1}, S_n\}' class='latex' /></p>
<p>Where <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is the number of terms in that set.</p>
<p>There exists a function such that:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7B1%7D%7Bj-i%7D%2A%5Cint_%7Bi%7D%5E%7Bj%7D+f%28x%29+dx+%3D+%5Cfrac%7B%5Csum%7Bk%3D0%7D%5E%7Bn%7D+S_n%7D%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{1}{j-i}*\int_{i}^{j} f(x) dx = \frac{\sum{k=0}^{n} S_n}{n}' title='\frac{1}{j-i}*\int_{i}^{j} f(x) dx = \frac{\sum{k=0}^{n} S_n}{n}' class='latex' /></p>
<p>In order for this to be the case note that:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=f%28c%29+%3D+K&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(c) = K' title='f(c) = K' class='latex' /></p>
<p>Such that</p>
<p><img src='http://l.wordpress.com/latex.php?latex=K+%5Cin+S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K \in S' title='K \in S' class='latex' /></p>
<p>In order for this to be true then the set <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> must follow the function <img src='http://l.wordpress.com/latex.php?latex=f%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(x)' title='f(x)' class='latex' />, implying that they both either converge or diverge.</p>
<p>To show that this is true, let us create a new set <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y' title='Y' class='latex' /> such that:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=Y+%3D+%5C%7BY_0%2C+Y_1%2C+Y_2%2C...Y_%7Bq-1%7D%2CY_%7Bq%7D%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y = \{Y_0, Y_1, Y_2,...Y_{q-1},Y_{q}\}' title='Y = \{Y_0, Y_1, Y_2,...Y_{q-1},Y_{q}\}' class='latex' /></p>
<p>Where the set <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y' title='Y' class='latex' /> consists of the all the solutions to the function <img src='http://l.wordpress.com/latex.php?latex=f%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(x)' title='f(x)' class='latex' />.</p>
<p>In order for the set <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> to follow the function $f(x)$, then both it and the set <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y' title='Y' class='latex' /> need to pass the Limit Comparison Test.</p>
<p>However note that we cannot simply apply the limit comparison test normally because the cardinality of the set <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y' title='Y' class='latex' /> is much higher than the cardinality of the set <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' />. That is to say that the set <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y' title='Y' class='latex' /> is much larger than set <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> in the number of terms.</p>
<p>To resolve this, note that the set <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> has certain elements that will appear in the set <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y' title='Y' class='latex' /> with a certain regularity. That is to say that:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=S_n+%3D+Y_%7Bq+%2B+%5Cbeta+%2A+%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_n = Y_{q + \beta * \alpha}' title='S_n = Y_{q + \beta * \alpha}' class='latex' /></p>
<p>Where <img src='http://l.wordpress.com/latex.php?latex=alpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='alpha' title='alpha' class='latex' /> is a constant value and <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\beta' title='\beta' class='latex' /> represents the current iteration. Since <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> also denotes the current iteration, I could have used it instead of <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\beta' title='\beta' class='latex' /> but separating the variable and then defining it makes things more clear I think.</p>
<p>If we rewrite the set <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y' title='Y' class='latex' /> so that it reads:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=Y+%3D+%5C%7BY_0%2C+Y_1%2C+Y_2%2C...Y_%7Bx-1%7D%2C+Y_%7Bx%7D%5C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='Y = \{Y_0, Y_1, Y_2,...Y_{x-1}, Y_{x}\}' title='Y = \{Y_0, Y_1, Y_2,...Y_{x-1}, Y_{x}\}' class='latex' /></p>
<p>Where x is the actual x value utilized, then we can say that:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=q+%3D+x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q = x' title='q = x' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Ctherefore+n+%3D+x+%2B+%5Cbeta+%2A+%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\therefore n = x + \beta * \alpha' title='\therefore n = x + \beta * \alpha' class='latex' /></p>
<p>Where</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Calpha+%3D+%5CDelta+x&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha = \Delta x' title='\alpha = \Delta x' class='latex' /></p>
<p>Then, by the limit comparison test:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cfrac%7BS_n%7D%7BY_%7Bx+%2B+n+%2A+%5CDelta+x%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\frac{S_n}{Y_{x + n * \Delta x}}' title='\frac{S_n}{Y_{x + n * \Delta x}}' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Clim_%7Bn+%5Crightarrow+%5Cinfty%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\lim_{n \rightarrow \infty}' title='\lim_{n \rightarrow \infty}' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cbecause+S_n+%3D+Y_%7Bq+%2B+%5Cbeta+%2A+%5Calpha%7D+s.t.+%5Cbeta+%3D+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\because S_n = Y_{q + \beta * \alpha} s.t. \beta = n' title='\because S_n = Y_{q + \beta * \alpha} s.t. \beta = n' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Csquare&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\square' title='\square' class='latex' /></p>
<p>Therefore the set follows the given function <img src='http://l.wordpress.com/latex.php?latex=f%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(x)' title='f(x)' class='latex' /> and therefore the average applies, but only in the case where <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is a constant for any given value of <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q' title='q' class='latex' />. That is to say that the values of the set $S_n$ follow the function with a regular interval rather than being simply random values from that function. Therefore it is actually possible to take the average value of a set by using the method of the average value of a function on an interval.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 5: the square root of 2 is irrational]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/08/24/theorem-5-root-2-is-irrational/</link>
<pubDate>Mon, 24 Aug 2009 19:29:01 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/08/24/theorem-5-root-2-is-irrational/</guid>
<description><![CDATA[One of the aspects of mathematics that I find extraordinary and exciting is that it is possible to p]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>One of the aspects of mathematics that I find extraordinary and exciting is that it is possible to <em>prove</em> that some things are impossible.  Not just difficult, but genuinely impossible.  I hope at some point to write about one famous example of this, the insolubility of the quintic by radicals (including what that phrase means!), but this week I thought that I&#8217;d write about another famous example, the irrationality of the square root of 2.  These are both examples of theorems that say that something is impossible &#8212; but the latter is a bit easier than the former!</p>
<p><!--more--></p>
<h3>Types of numbers</h3>
<p>Let&#8217;s start by talking about different types of number.  (If you know about this already, please feel free to skip to the next section.)<br />
<a id="Naturals"></a><br />
Small children start by learning the counting numbers: 1, 2, 3, &#8230; (and perhaps 0).  Mathematicians call these the <em>natural numbers</em>, and represent the set of natural numbers by the symbol <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' />.  I am being deliberately vague about whether 0 is a natural number: opinions differ, and it depends on the context.<br />
<a id="Integers"></a><br />
When children get a bit older (and mathematicians more sophisticated &#8212; I think that the order in which children meet these ideas mirrors the historical development), they discover that various things become easier if they are allowed negative numbers too.  The whole numbers (positive, negative and 0) are called <em>integers</em>, and the set of integers is represented by <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Z}' title='\mathbb{Z}' class='latex' />.  (The letter comes from the German word &#8220;Zahlen&#8221;, meaning &#8220;numbers&#8221;.)<br />
<a id="Rationals"></a><br />
Another type of number that children meet quite early on is the fraction.  (It&#8217;s important to be able to distinguish between half a cake and a whole cake!)  Mathematicians call these the <em>rational numbers</em>, or the <em>rationals</em>.  They are the numbers that can be written as the ratio of two integers.  That is, the rationals are the numbers <img src='http://l.wordpress.com/latex.php?latex=m%2Fn&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m/n' title='m/n' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> are integers, and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is not 0.  The set of rationals is denoted by <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Q}' title='\mathbb{Q}' class='latex' /> (for quotient).<br />
<a id="Reals"></a><br />
The other set of numbers that I want to mention here is the set of real numbers, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />.  One way to think of the real numbers is as all decimals.  If you imagine the number line, with the integers marked, the rationals occupy some of the points between (and including) the integers, but the real numbers are <em>all</em> of the points on this line.</p>
<p>I&#8217;ve described the sets in this order because they are nested sets.  For example, every natural number is also an integer: the natural numbers form a subset of the integers.  The mathematical way to write this is <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D+%5Csubseteq+%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{N} \subseteq \mathbb{Z}' title='\mathbb{N} \subseteq \mathbb{Z}' class='latex' />.  Then the integers form a subset of the rationals, and the rationals form a subset of the real numbers. </p>
<p>One quick warning.  I&#8217;ve often seen students write things along the lines of &#8220;3 isn&#8217;t a fraction&#8221;.   While I know what they mean, 3 <em>is</em> a fraction (a rational number): it&#8217;s 3/1.  It is a lot more convenient to include the integers as rationals than to exclude them in some artificial way.</p>
<h3>The square root of 2</h3>
<p>So, back to the square root of 2.  Tim Gowers has a very nice <a title="Tim Gowers's discussion about the existence of the square root of 2" href="http://www.dpmms.cam.ac.uk/~wtg10/roottwo.html">discussion</a> about the existence of the square root of 2, but I&#8217;ll assume that we&#8217;re willing to believe in its existence for now!  The ancient Greeks were: the diagonal of a square with side length 1 has length <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' />.  (This follows from <a title="MacTutor biography of Pythagoras" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Pythagoras.html">Pythagoras</a>&#8217;s <a title="Wikipedia page about Pythagoras's theorem" href="http://en.wikipedia.org/wiki/Pythagoras%27s_theorem">theorem</a> &#8211; perhaps that will be the theorem of a week in the future!)  Indeed, the <a title="Wikipedia page about the history of irrational numbers" href="http://en.wikipedia.org/wiki/Irrational_number#History">story</a> is that the irrationality of <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> was first discovered by one of the followers of Pythagoras.  Let&#8217;s state the theorem properly (so that I can use the nice WordPress blockquote environment!).</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem</span></strong> The square root of 2 is irrational.</p></blockquote>
<p>This is an old chestnut that can be proved in many ways.  I&#8217;ll describe one version of the most common proof here, since it&#8217;s a very good example of <em><a title="Glossary entry for proof by contradiction" href="http://theoremoftheweek.wordpress.com/glossary/#Proof_by_contradiction">proof by contradiction</a></em>, a useful tool when proving the impossibility of something.</p>
<p>Why do I keep mentioning impossibility in the context of the irrationality of the square root of 2?  Well, how can we tell whether <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> is irrational?  We are interested in whether we can solve the equation</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D%3D%5Cfrac%7Bm%7D%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}=\frac{m}{n}' title='\sqrt{2}=\frac{m}{n}' class='latex' />,</p>
<p>where <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> are integers, and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> is not 0.  The square root of 2 is irrational precisely if it impossible to solve this equation.</p>
<p>How can one go about proving that something is impossible?  It&#8217;s no good just testing values of <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> and discovering that they don&#8217;t work: no matter how many values we test, we can never test all of them, because there are infinitely many possibilities!  So we need a more cunning strategy.</p>
<p>This is where proof by contradiction comes in.  It&#8217;s very difficult to go anywhere when dealing with a negative statement (&#8220;there are no solutions&#8221;).  So instead we suppose that we <em>can</em> solve the equation (that is, that <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> is rational).  That gives us a positive statement that we can manipulate.  Then we make various deductions from that supposition, and come to some absurdity (a contradiction).  But if we arrive at an absurdity, then we must have had a false statement somewhere.  Everything followed legitimately from the supposition, so it must be that the supposition is false.  That is, it cannot be the case that <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> is rational, so it must be irrational.  Let&#8217;s see how the details work.</p>
<p>So, we start by supposing that we have integers <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> (with <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> not 0) such that</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D+%3D+%5Cfrac%7Bm%7D%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2} = \frac{m}{n}' title='\sqrt{2} = \frac{m}{n}' class='latex' />.</p>
<p>I think that this is crying out for us to square both sides, to get rid of the scary square root.  This gives</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=2+%3D+%5Cfrac%7Bm%5E2%7D%7Bn%5E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2 = \frac{m^2}{n^2}' title='2 = \frac{m^2}{n^2}' class='latex' />,</p>
<p>so <img src='http://l.wordpress.com/latex.php?latex=2n%5E2+%3D+m%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2n^2 = m^2' title='2n^2 = m^2' class='latex' />.</p>
<p>Let&#8217;s think about the power of 2 in the <a title="Glossary entry for prime factorisation" href="http://theoremoftheweek.wordpress.com/glossary/#Prime_factorisation">prime factorisation</a> of each side.  (This is assuming the <a title="Wikipedia entry for the fundamental theorem of arithmetic" href="http://en.wikipedia.org/wiki/Fundamental_theorem_of_arithmetic">Fundamental Theorem of Arithmetic</a>, which tells us that talking about &#8220;the&#8221; prime factorisation is a legitimate thing to do.)</p>
<p>What can we say about the power of 2 in the prime factorisation of <img src='http://l.wordpress.com/latex.php?latex=m%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m^2' title='m^2' class='latex' />?  Without knowing <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' />, it might seem that we can&#8217;t say anything.  But in fact there is one thing that we <em>can</em> say.  If the exponent of 2 in the prime factorisation of <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> (so <img src='http://l.wordpress.com/latex.php?latex=m%3D+2%5Ea+b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m= 2^a b' title='m= 2^a b' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' /> is odd), then <img src='http://l.wordpress.com/latex.php?latex=m%5E2+%3D+2%5E%7B2a%7D+b%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m^2 = 2^{2a} b^2' title='m^2 = 2^{2a} b^2' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=b%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b^2' title='b^2' class='latex' /> is odd.  So the exponent of 2 in the prime factorisation of <img src='http://l.wordpress.com/latex.php?latex=m%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m^2' title='m^2' class='latex' /> must be even.</p>
<p>What can we say about the power of 2 in the left-hand side, <img src='http://l.wordpress.com/latex.php?latex=2n%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2n^2' title='2n^2' class='latex' />?  Just as above, the exponent of 2 in the prime factorisation of <img src='http://l.wordpress.com/latex.php?latex=n%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^2' title='n^2' class='latex' /> is even.  So the exponent of 2 in the prime factorisation of <img src='http://l.wordpress.com/latex.php?latex=2n%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2n^2' title='2n^2' class='latex' /> is odd (it&#8217;s one more than that in the prime factorisation of <img src='http://l.wordpress.com/latex.php?latex=n%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n^2' title='n^2' class='latex' />).</p>
<p>Now <img src='http://l.wordpress.com/latex.php?latex=2n%5E2+%3D+m%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2n^2 = m^2' title='2n^2 = m^2' class='latex' />, so they must have the same prime factorisation (by the Fundamental Theorem of Arithmetic).  But we&#8217;ve just established that the powers of 2 are different!  This is the contradiction that we wanted.</p>
<p>Since our deductions have led to a contradiction, our initial assumption must have been false, and so <img src='http://l.wordpress.com/latex.php?latex=%5Csqrt%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sqrt{2}' title='\sqrt{2}' class='latex' /> is indeed irrational.</p>
<h3>Further reading</h3>
<p>Wikipedia presents a <a title="Wikipedia proof that the square root of 2 is irrational" href="http://en.wikipedia.org/wiki/Irrational_number#Square_roots">slightly different version</a> of the above proof, one that avoids using the Fundamental Theorem of Arithmetic.</p>
<p>If you didn&#8217;t read it when I referenced it above, I recommend that you look at Tim Gowers&#8217;s <a title="Tim Gowers's dialogue concerning the existence of the square root of two" href="http://www.dpmms.cam.ac.uk/~wtg10/roottwo.html">dialogue concerning the existence of the square root of two</a>.</p>
<p>There&#8217;s an <a title="NRICH article about proof by contradiction" href="http://nrich.maths.org/public/viewer.php?obj_id=4717">article about proof by contradiction</a> on NRICH.</p>
<p>There are many proofs that the square root of 2 is irrational collected at <a title="Cut the Knot collection of proofs that the square root of 2 is irrational" href="http://www.cut-the-knot.org/proofs/sq_root.shtml">Cut The Knot</a>.  The first time I saw Proof 8&#8221;&#8217; (which was a while back, elsewhere), I laughed out loud!</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Tim's Theorem (The Progressive Time Signature Theorem)]]></title>
<link>http://underaglassmoon.wordpress.com/2009/08/20/tims-theorem-the-progressive-time-signature-theorem/</link>
<pubDate>Thu, 20 Aug 2009 00:03:17 +0000</pubDate>
<dc:creator>underaglassmoon</dc:creator>
<guid>http://underaglassmoon.wordpress.com/2009/08/20/tims-theorem-the-progressive-time-signature-theorem/</guid>
<description><![CDATA[Back in the late 60&#8217;s mathematics hooked up with music(who later became a tramp and gave birth]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Back in the late 60&#8217;s mathematics hooked up with music(who later became a tramp and gave birth to rap, but that&#8217;s another story) and 9 months later progressive music came to be. It was a beautiful age when the concept of musicianship was actually relevant, during this time, the greatest minds of our age tried their best to find out the mathematical implications of music. Unfortunately when disco and punk came about in the 80&#8217;s all research on progressive rock was canceled.</p>
<p>This post(and my findings) are a tribute to the remaining few who are still passionate about the connection between math and music.</p>
<p><strong>The Progressive Time Signature Theorem</strong></p>
<p><img class="aligncenter size-full wp-image-146" title="prog time sig" src="http://underaglassmoon.wordpress.com/files/2009/08/prog-time-sig2.jpg" alt="prog time sig" width="497" height="126" /></p>
<p><strong>Notes:</strong></p>
<p><strong>1.)</strong> p is said to be prime to ensure that the time signature cannot be easily broken into compound time signatures and/or double time. &#8217;sufficiently&#8217; implies the prime should be in an appropriate neighbourhood of 2^n.<br />
<strong>2.) </strong>as n grows the time signature becomes more and more virtuosic, eventually becoming impossible to play as the velocity of notes approaches the speed of light.<br />
<strong>3.)</strong> <strong>&#8216;true progressive form&#8217;</strong> is an idealistic time signature for music of the progressive genre</p>
<p>I may post a proof of this theorem in the future, anyone who is up for giving it a try I would recommend studying the works of Rush, Dream Theater, King Crimson, Genesis, Pink Floyd  and Fates Warning.</p>
<p><strong>What have we learned from this post?</strong></p>
<p>1.) There are applications of mathematics even in music.<br />
2.) Tim really has too much time on his hands, and&#8230;<br />
3.) its pretty obvious why he doesn&#8217;t have a girlfriend.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 4: van der Waerden's theorem]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/08/18/theorem-4-van-der-waerdens-theorem/</link>
<pubDate>Tue, 18 Aug 2009 19:50:31 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/08/18/theorem-4-van-der-waerdens-theorem/</guid>
<description><![CDATA[I used to get cross during English lessons at school.  This happened for various reasons, but one re]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I used to get cross during English lessons at school.  This happened for various reasons, but one regular cause was poetry analysis.  Specifically, I objected to apparently being asked to say &#8220;Ah, yes, <a title="Wikipedia page on alliteration" href="http://en.wikipedia.org/wiki/Alliteration">alliteration</a>&#8221; every time two or more consecutive words began with the same letter or sound, regardless of whether it was reasonable to suppose that the poet was deliberately aiming for an alliterative effect.  It seemed to me that since there is only a very limited range of letters available, there were bound to be some patterns that were just due to coincidence.  Leaving aside my English lessons, was I right?  If I just write words at random, will there necessarily be some patterns in the first letters of those words?  Obviously there won&#8217;t necessarily be consecutive words with the same first letters, but perhaps there will be other patterns.</p>
<p><!--more-->Let&#8217;s start by making the problem simpler (a favourite strategy of mathematicians!).  Let&#8217;s say that we have only two letters, A and B, and let&#8217;s suppose that we have some long string of these letters.  Here are some examples.</p>
<p>BBBBBBBBBBBBBBBBBBBBBBBBBBBBBB          (1)</p>
<p>ABABABABABABABABABABABABABABAB          (2)</p>
<p>ABAAABBAAABAAAABABBBBBBAABABAA          (3)</p>
<p>What patterns can we see here?  Strings 1 and 2 have obvious patterns.  What about string 3?  Here&#8217;s one pattern:</p>
<p><span style="color:#ff0000;">A</span>BAA<span style="color:#ff0000;">A</span>BBA<span style="color:#ff0000;">A</span>ABA<span style="color:#ff0000;">A</span>AAB<span style="color:#ff0000;">A</span>BBBBBBAABABAA</p>
<p>The red letters are evenly spaced.  That&#8217;s the sort of pattern we found in string 2: evenly spaced As (or Bs).  Of course, there were a lot more evenly spaced As in string 2; I had to stop colouring As in string 3 where I did because the next letter at the right spacing is a B.  But nonetheless there is some pattern there.</p>
<p style="text-align:justify;">Incidentally, I wanted to choose a third string that looked pretty &#8220;random&#8221;, but that you would be able to check was &#8220;random&#8221;, to prove that I hadn&#8217;t faked it to have a pattern!  So I looked at the first thirty digits after the decimal point in <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pi' title='\pi' class='latex' />, and wrote A if the digit was odd and B if it was even (at least if I didn&#8217;t make a slip along the way!).</p>
<p>Will there always be some sort of pattern of evenly spaced letters, whatever the sequence is?  How many evenly spaced letters can we expect to find?</p>
<h3>Finding two evenly spaced letters that are the same</h3>
<p>Let&#8217;s start with a relatively simple observation.  Let&#8217;s stick to our two letters, A and B, and let&#8217;s aim to find two evenly spaced letters that are the same.  Well, &#8220;two evenly spaced letters&#8221; really just means &#8220;two letters&#8221;.  So we want to have two letters that are the same.  If we only have two letters in the string, there might not be two letters that are the same (the string could be AB).  But if we have three (or more) letters, each A or B, then we must have two the same.  Hopefully this is fairly obvious to you, but one can be rather formal and say that it follows from the rather wonderfully named <a title="Glossary entry for pigeonhole principle" href="http://theoremoftheweek.wordpress.com/glossary/#Pigeonhole_principle">pigeonhole principle</a>.</p>
<h3>Finding three evenly spaced letters that are the same</h3>
<p>What if we still only have two letters, A and B, but now we want to know that there will be <em>three</em> evenly spaced letters that are the same?  We&#8217;re going to need enough letters in the string, or there might not be these evenly spaced letters (just as when we looked for two letters that are the same we needed at least three letters in the string).  So let&#8217;s assume that we have a long string (I think that at least 350 letters will turn out to be enough for this argument), and then hopefully we&#8217;ll be able to show that it must contain three evenly spaced letters that are the same.</p>
<p>So far, we know that in any block of three letters there must be two that are the same.  How can we use that to help us?</p>
<p>One scenario would be that we have two points that are the same, and then if we hop along to the next letter with the same spacing, it&#8217;s the same letter.  For example, <span style="color:#ff0000;">A</span>BB<span style="color:#ff0000;">A</span>AB<span style="color:#ff0000;">A</span> &#8212; the third red letter is conveniently the same as the first two, and so we have three evenly spaced letters that are the same.</p>
<p>But what if that doesn&#8217;t happen?  That is, we&#8217;ve got two letters that we know are the same (say A), but the third in that sequence (whatever the spacing is) is different (so B).  I&#8217;ll call these As and B &#8220;special&#8221;, to distinguish them from the other letters in the block.  Let&#8217;s represent the block by a diagram.</p>
<p style="text-align:center;"><img class="aligncenter size-thumbnail wp-image-137" title="Block containing A A B" src="http://theoremoftheweek.wordpress.com/files/2009/08/van_der_waerden_18.jpg?w=150" alt="Block containing A A B" width="150" height="31" /></p>
<p>These are three evenly spaced letters in a block of some length.</p>
<p>What would happen if we had two identical blocks like this?</p>
<p style="text-align:center;"><img class="aligncenter size-medium wp-image-140" title="Two identical blocks containing A A B" src="http://theoremoftheweek.wordpress.com/files/2009/08/van_der_waerden_23.jpg?w=300" alt="Two identical blocks containing A A B" width="300" height="26" /></p>
<p>(There might be some letters between the two blocks, but that&#8217;s not important.)</p>
<p>We need to measure some distances here.  Let&#8217;s say the distance between the first two As is <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> (so there&#8217;s an A, then <img src='http://l.wordpress.com/latex.php?latex=d-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d-1' title='d-1' class='latex' /> other letters, then an A).  And let&#8217;s say that the distance from the start of the first block to the start of the second is <img src='http://l.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t' title='t' class='latex' />.</p>
<p style="text-align:center;"><img class="aligncenter size-medium wp-image-144" title="Blocks with distances marked" src="http://theoremoftheweek.wordpress.com/files/2009/08/van_der_waerden_31.jpg?w=300" alt="Blocks with distances marked" width="300" height="118" /></p>
<p>Now we want to look for three evenly spaced letters that are the same.</p>
<p>We&#8217;re not going to get anything by taking both As in one block.  Let&#8217;s think about what happens if we take the first special A in the first block and the second special A in the second block.  What&#8217;s the distance between them?  It&#8217;s <img src='http://l.wordpress.com/latex.php?latex=t%2Bd&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t+d' title='t+d' class='latex' />.</p>
<p style="text-align:center;"><img class="aligncenter size-medium wp-image-145" title="Distance from first A in first block to second A in second block is t+d" src="http://theoremoftheweek.wordpress.com/files/2009/08/van_der_waerden_41.jpg?w=300" alt="Distance from first A in first block to second A in second block is t+d" width="300" height="82" /></p>
<p>What can we say about the letter that&#8217;s <img src='http://l.wordpress.com/latex.php?latex=t%2Bd&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t+d' title='t+d' class='latex' /> letters to the right of the second special A in the second block?  If it&#8217;s an A, then we&#8217;ve got three evenly spaced As, right?</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-132" title="Third letter in sequence is an A" src="http://theoremoftheweek.wordpress.com/files/2009/08/van_der_waerden_5.jpg" alt="Third letter in sequence is an A" width="450" height="80" /></p>
<p>What if it&#8217;s a B?  Then we have three Bs; let&#8217;s see what the spacings are.</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-133" title="Third letter in sequence is a B" src="http://theoremoftheweek.wordpress.com/files/2009/08/van_der_waerden_6.jpg" alt="Third letter in sequence is a B" width="449" height="124" /></p>
<p>Ah, the spacings are both <img src='http://l.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t' title='t' class='latex' /> &#8212; so we have three evenly spaced Bs.</p>
<p>Either way, we win!</p>
<p>Are we finished?  Not quite: we imagined that we had two identical blocks, but we didn&#8217;t prove that this will always be the case.  Let&#8217;s do that now.</p>
<p>We can assume that we&#8217;re looking at blocks of five letters.  Why?  Well, in the first three letters of a block of length five, two must be the same, and then if we hop along by the appropriate amount to find the third point, we&#8217;ll still be inside the five letters of the block.</p>
<p>How many possible strings of five letters are there?  There are two possibilities for the first letter, two for the second, two for the third, two for the fourth, and two for the fifth.  So there are <img src='http://l.wordpress.com/latex.php?latex=2%5Ctimes+2%5Ctimes+2%5Ctimes+2%5Ctimes+2+%3D+32&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2\times 2\times 2\times 2\times 2 = 32' title='2\times 2\times 2\times 2\times 2 = 32' class='latex' /> possible strings.</p>
<p>There are 32 possible blocks, and we want to guarantee that we have two that are the same.  So let&#8217;s take 33 blocks, and that will do the job.  (This is using the pigeonhole principle, just as we did earlier when we argued that in a block of three letters, two must be the same.)  That&#8217;s why I said earlier that 350 letters in the string would be enough.  We can split up the first <img src='http://l.wordpress.com/latex.php?latex=33+%5Ctimes+5+%3D+165&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='33 \times 5 = 165' title='33 \times 5 = 165' class='latex' /> letters into blocks of five letters, and find two blocks that are the same.  Whichever those blocks are, the third point that we generate will still be in the string; that&#8217;s why I allowed more letters.</p>
<h3>Summary of the proof</h3>
<p>Let&#8217;s recap the argument.  We split the first half of our long string (of at least 350 letters) of As and Bs into blocks of length five.  We have at least thirty-three blocks, so two must be the same.  If this repeated block contains three evenly spaced letters that are the same, then we&#8217;re done.  If not, then we have a pattern like</p>
<p style="text-align:center;"><img class="aligncenter size-thumbnail wp-image-146" title="Block containing A A B" src="http://theoremoftheweek.wordpress.com/files/2009/08/van_der_waerden_19.jpg?w=150" alt="Block containing A A B" width="150" height="31" /></p>
<p>in these two identical blocks.  Then we look at the sequence of three evenly spaced letters starting with the first special A in the first block and the second special A in the second block.  (Note that we have so many letters that we really can always find a third letter with the right spacing.)  If the third letter in this sequence is an A, then we&#8217;ve got three evenly spaced As.  If it&#8217;s a B, then we check the distances and find that the special Bs in the blocks and this extra B form a sequence of three evenly spaced Bs.  Either way, we&#8217;re done.</p>
<p>We&#8217;ve proved a special case of a theorem of <a title="MacTutor biography of van der Waerden" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Van_der_Waerden.html">Bartel van der Waerden</a>.  Let me tell you the general theorem.</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (van der Waerden)</span></strong> Imagine that we have an alphabet of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> letters, and fix a positive whole number <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' />.  If we have a sufficiently long string of letters from this alphabet, then there are <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> evenly spaced letters that are the same.</p></blockquote>
<p>(Here, &#8220;sufficiently long&#8221; depends on both <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' />.)</p>
<p>So the theorem tells us that <em>whatever</em> string we have, if it&#8217;s long enough then it must contain evenly spaced letters that are the same!</p>
<p>We had <img src='http://l.wordpress.com/latex.php?latex=k%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k=2' title='k=2' class='latex' /> and proved the cases <img src='http://l.wordpress.com/latex.php?latex=r%3D2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r=2' title='r=2' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=r%3D3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r=3' title='r=3' class='latex' />.  (The case of <img src='http://l.wordpress.com/latex.php?latex=r%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r=1' title='r=1' class='latex' /> is not very interesting!)  The proof of the general theorem uses the same ideas that we used above, using the result for smaller values of <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='r' title='r' class='latex' /> to deduce the next step up.  (This is a rather vague description of <a title="Glossary entry for proof by induction" href="http://theoremoftheweek.wordpress.com/glossary/#Proof_by_induction">proof by induction</a>.)</p>
<p>By the way, finding what &#8220;sufficiently long&#8221; above means is an interesting problem.  There are various bounds known, but they&#8217;re pretty bad.  The 350 I gave above was a huge overestimate for this case: the answer is actually 9!</p>
<p>We could ask what happens if we take an infinite string of letters from a fixed (finite) alphabet.  You might like to think about whether it&#8217;s necessarily possible to find infinitely many evenly spaced letters that are all the same.</p>
<h3>Further reading</h3>
<p>I learnt about van der Waerden&#8217;s theorem in a lecture course in Cambridge by Imre Leader.  I&#8217;m afraid that I don&#8217;t really know what good books there are on the subject.  Feel free to leave recommendations below!  The <a title="Wikipedia page on van der Waerden's theorem" href="http://en.wikipedia.org/wiki/Van_der_Waerden%27s_theorem">Wikipedia page</a> has some information about what bounds are known &#8212; a good example of very large numbers cropping up in actual mathematics!</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Rational Roots Theorem]]></title>
<link>http://istudythings.wordpress.com/2009/08/11/rational-roots-theorem/</link>
<pubDate>Tue, 11 Aug 2009 16:58:31 +0000</pubDate>
<dc:creator>futbolusa</dc:creator>
<guid>http://istudythings.wordpress.com/2009/08/11/rational-roots-theorem/</guid>
<description><![CDATA[I&#8217;m studying for the math subject GRE exam at the moment and I came across the Rational Roots ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I&#8217;m studying for the math subject GRE exam at the moment and I came across the <em>Rational Roots Theorem</em> in the process of going through one of the study guides.  I find this theorem pretty neat, I had essentially forgot about it, and this is why I&#8217;m sharing it with you.  The theorem goes as follows:</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=p%28x%29+%3D+a_%7Bn%7Dx%5E%7Bn%7D+%2B+...+%2B+a_%7B1%7Dx%5E%7B1%7D+%2B+a_%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p(x) = a_{n}x^{n} + ... + a_{1}x^{1} + a_{0}' title='p(x) = a_{n}x^{n} + ... + a_{1}x^{1} + a_{0}' class='latex' /> with the coefficients <img src='http://l.wordpress.com/latex.php?latex=%5C%7Ba_%7Bi%7D%5C%7D_%7Bi%3D0%7D%5E%7Bn%7D%3Ccode%3E%5Cin%3C%2Fcode%3E%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\{a_{i}\}_{i=0}^{n}&lt;code&gt;\in&lt;/code&gt;\mathbb{Z}' title='\{a_{i}\}_{i=0}^{n}&lt;code&gt;\in&lt;/code&gt;\mathbb{Z}' class='latex' />.  If  <img src='http://l.wordpress.com/latex.php?latex=p%28x%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p(x)' title='p(x)' class='latex' /> has <em>any</em> rational roots, then they are of the form <img src='http://l.wordpress.com/latex.php?latex=x+%3D+%5Cfrac%7Bs%7D%7Bt%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x = \frac{s}{t}' title='x = \frac{s}{t}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='s' title='s' class='latex' /> is a factor of <img src='http://l.wordpress.com/latex.php?latex=a_%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_{0}' title='a_{0}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t' title='t' class='latex' /> is a factor of <img src='http://l.wordpress.com/latex.php?latex=a_%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_{n}' title='a_{n}' class='latex' />.</p>
<p>Proof of the above theorem is not included here, but can be found at <a href="http://mathforum.org/library/drmath/view/53226.html" target="_blank">here at Ask Dr. Math</a>.</p>
<p>Here&#8217;s an example of this theorem in use, as found in a GRE book:</p>
<p><em>The equation</em> <img src='http://l.wordpress.com/latex.php?latex=3x%5E%7B4%7D+-+7x%5E%7B3%7D+%2B+5x%5E%7B2%7D+-+7x+%2B+2+%3D+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3x^{4} - 7x^{3} + 5x^{2} - 7x + 2 = 0' title='3x^{4} - 7x^{3} + 5x^{2} - 7x + 2 = 0' class='latex' /> <em>has exactly two rational roots, both of which are positive.  Find the larger of these two roots.</em></p>
<p>Solution:</p>
<p>The coefficients of the above polynomial are integers, thus the theorem we are talking about is applicable.  Also, we know from the stated problem that the polynomial has two rational roots, so using the theorem actual is helpful.  First thing to do is to take a look at <img src='http://l.wordpress.com/latex.php?latex=a_%7B4%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_{4}' title='a_{4}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=a_%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_{0}' title='a_{0}' class='latex' /> in order to enumerate their factors.  The coefficient <img src='http://l.wordpress.com/latex.php?latex=a_%7B4%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_{4}' title='a_{4}' class='latex' /> has the factors <img src='http://l.wordpress.com/latex.php?latex=%5Cpm3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm3' title='\pm3' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpm1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm1' title='\pm1' class='latex' /> and the <img src='http://l.wordpress.com/latex.php?latex=a_%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a_{0}' title='a_{0}' class='latex' /> coefficient has the factors <img src='http://l.wordpress.com/latex.php?latex=%5Cpm2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm2' title='\pm2' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpm1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm1' title='\pm1' class='latex' />.  Thus, the rational roots of the polynomial can be found in the following list:</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cpm1%2C+%5Cpm%5Cfrac%7B1%7D%7B3%7D%2C+%5Cpm2%2C+%5Cpm%5Cfrac%7B2%7D%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\pm1, \pm\frac{1}{3}, \pm2, \pm\frac{2}{3}' title='\pm1, \pm\frac{1}{3}, \pm2, \pm\frac{2}{3}' class='latex' /></p>
<p>Clearly, the problem states that the roots we are interested in are positive, so we can remove have those less than zero and test only the remaining.  In doing so, we see that <img src='http://l.wordpress.com/latex.php?latex=x+%3D+%5Cfrac%7B1%7D%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x = \frac{1}{3}' title='x = \frac{1}{3}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=x+%3D+2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x = 2' title='x = 2' class='latex' /> are the two roots from this list; 2 being the larger means that it is our answer.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 3: Vinogradov's Three Primes Theorem]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/08/10/theorem-3-vinogradovs-three-primes-theorem/</link>
<pubDate>Mon, 10 Aug 2009 20:36:46 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/08/10/theorem-3-vinogradovs-three-primes-theorem/</guid>
<description><![CDATA[I very much doubt that when Christian Goldbach sat down in 1742 to write to Leonhard Euler he had an]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I very much doubt that when <a title="Mactutor biography of Goldbach" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Goldbach.html">Christian Goldbach</a> sat down in 1742 to write to <a title="Mactutor biography of Euler" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Euler.html">Leonhard Euler</a> he had any idea that the resulting correspondence would ensure him mathematical fame.  Goldbach&#8217;s conjecture is one of the famous unsolved problems in mathematics today.  I suspect that one of the main reasons for its fame is that it is easy to state, but apparently very hard.  Just in case you haven&#8217;t come across the conjecture, here&#8217;s a statement of it.  (This is not exactly how Goldbach phrased it; he thought of 1 as a prime number.  But this is a modern equivalent.)</p>
<blockquote><p><strong><span style="text-decoration:underline;">Conjecture (Goldbach)</span></strong> Every even number greater than 2 can be written as the sum of two <a title="Glossary entry for prime number" href="http://theoremoftheweek.wordpress.com/glossary/#Prime_number">primes</a> (a prime plus a prime).</p></blockquote>
<p>For example, <img src='http://l.wordpress.com/latex.php?latex=18+%3D+11%2B7&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='18 = 11+7' title='18 = 11+7' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=92+%3D+89%2B3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='92 = 89+3' title='92 = 89+3' class='latex' />.</p>
<p>Perhaps less well known is that Goldbach made another conjecture about writing numbers as sums of primes.</p>
<p><!--more--></p>
<blockquote><p><strong><span style="text-decoration:underline;">Conjecture (Goldbach)</span></strong> Every odd number greater than 5 can be written as the sum of three primes.</p></blockquote>
<p>For example, <img src='http://l.wordpress.com/latex.php?latex=7+%3D+2+%2B+2+%2B+3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='7 = 2 + 2 + 3' title='7 = 2 + 2 + 3' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=91+%3D+3+%2B+5+%2B+83&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='91 = 3 + 5 + 83' title='91 = 3 + 5 + 83' class='latex' />.</p>
<p>This brings me to this week&#8217;s theorem, which goes a long way towards proving this conjecture.  It is due to the Russian mathematician <a title="MacTutor biography of Vinogradov" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Vinogradov.html">Ivan Vinogradov</a>.</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (Vinogradov)</span></strong> Every sufficiently large odd number can be written as the sum of three primes.</p></blockquote>
<p>When I say &#8220;every sufficiently large odd number&#8221;, I mean that there is some fixed point beyond which every odd number works (but I&#8217;m avoiding telling you what that fixed point is!).</p>
<p>Does that come close to proving Goldbach&#8217;s conjecture?  Yes, I think so.  Goldbach&#8217;s conjecture is that there are no odd numbers greater than 5 that cannot be expressed as the sum of three primes.  Vinogradov&#8217;s theorem tells us that there are only finitely many bad numbers.</p>
<p>In principle, one could set a computer to check the odd numbers not covered by Vinogradov&#8217;s theorem (the ones up to the fixed point).  In practice, to do this requires one first to show that one can take the fixed point to be a number that isn&#8217;t too large (or it would take a computer too long to do the checking).  I believe that there are mathematicians currently working on this, so it may be that before too long we&#8217;ll know for sure that every odd number greater than 5 can be written as the sum of three primes.</p>
<p>Vinogradov was not the first person to make progress on this theorem.  The groundwork was done by <a title="MacTutor biography of Hardy" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Hardy.html">G.H. Hardy</a> and <a title="MacTutor biography of Littlewood" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Littlewood.html">J.E. Littlewood</a> in 1923, when they used their so-called <em>circle method</em> (now often called the <em>Hardy-Littlewood circle method</em>) to show that if the generalised Riemann Hypothesis is true, then every sufficiently large odd integer can be written as the sum of three primes.  (The generalised Riemann Hypothesis gave them some information that they required about the distribution of the primes.)  Then in 1937 Vinogradov came up with an approach that also used the circle method, but it didn&#8217;t require the assumption of the generalised Riemann Hypothesis (which remains unproved to this day).</p>
<p>The circle method has been used for many other problems.  Hardy and <a title="MacTutor biography of Ramanujan" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Ramanujan.html">Srinivasa Ramanujan</a> used it to study the number of ways to write a number <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='N' title='N' class='latex' /> as a sum (the partition function), and Hardy and Littlewood  used it to give a new proof of Waring&#8217;s problem (first proved by <a title="MacTutor biography of Hilbert" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Hilbert.html">David Hilbert</a>), to give just two examples.</p>
<p>I&#8217;ll probably return to the circle method on this blog, since it&#8217;s very relevant to my research interests!  Unfortunately, it seems that the circle method can&#8217;t be used to prove the famous Goldbach conjecture (that every even number greater than 2 can be written as the sum of two primes) &#8212; although of course that doesn&#8217;t mean that the conjecture is false!</p>
<p>You might like to check your understanding by convincing yourself that the second conjecture I&#8217;ve stated above doesn&#8217;t imply the first.  That is, even if we know that odd numbers are sums of three primes, we don&#8217;t automatically know that even numbers are sums of two primes.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 2: the Intermediate Value Theorem]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/08/03/theorem-2-intermediate-value-theorem/</link>
<pubDate>Mon, 03 Aug 2009 20:35:58 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/08/03/theorem-2-intermediate-value-theorem/</guid>
<description><![CDATA[I recently helped to restore an old mechanical clock to working order.  (There wasn&#8217;t much wro]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I recently helped to restore an old mechanical clock to working order.  (There wasn&#8217;t much wrong with it, so this was a pretty easy task!)  Having done this, we then needed to regulate the clock: to get it to run at the correct speed.  With a pendulum clock, one does this by adding or removing weights (often pennies) on the bob, which alters the effective length of the pendulum.  This is how the clock known as <a title="Wikipedia page about Big Ben" href="http://en.wikipedia.org/wiki/Big_ben#Mechanism">Big Ben</a> is regulated, for example.  With the sort of clock I worked on, there is a small knob that one can turn (within some limits).</p>
<p>The person with whom I worked on the clock recently sent me an e-mail to report on his experimentation.  At one end of the knob&#8217;s rotation, the clock gained something like 40 minutes over the course of a week.  At the other end, it lost about an hour a week.  At that point, the mathematician in me applied the Intermediate Value Theorem to deduce that there must be some position of the knob that would make the clock run at the correct speed.  So what is the Intermediate Value Theorem, and would it also apply to a pendulum clock that is adjusted by adding and removing pennies?</p>
<p><!--more--></p>
<p>I think that it&#8217;s intuitively quite clear that if the clock runs fast with the knob at one extreme and slow with the knob at the other extreme, then there must be some position of the knob that makes the clock run at the correct speed.  (I&#8217;m not claiming anything about what that position is, or how to find it &#8212; just that it exists.  In practice, finding it exactly will be rather difficult, and it may be that one has to accept the clock running a few seconds fast or slow.)  How might we go about finding the correct position for the knob?  It might be that some feature of the mechanism and adjusting knob means that it is easy to predict where the correct position should be, at least if you know the effect of having the knob at each extremity.  But I don&#8217;t know about such a feature, so I&#8217;d like some experimental way that doesn&#8217;t require analysing the mechanism.</p>
<p>We know that one extremity makes the clock run fast, and the other slow.  I think that I&#8217;d try halfway between them as a first guess.  Say that makes the clock run fast.  Then I&#8217;d know that the correct position is between that point and the slow extremity &#8212; I&#8217;ve halved the length of the interval that I need to consider.  So then I can do the same thing again: I pick the midpoint of the interval under consideration, and establish whether the clock runs fast or slow.  Then I know that I can restrict my attention to a shorter interval.  I can keep doing this.</p>
<p>If I could do it infinitely many times, I&#8217;d eventually home in on the correct position for the knob.  Unfortunately, I&#8217;m not going to be able to do it infinitely many times in practice &#8212; but if the clock had an idealised knob (one that moves continuously, with every intermediate position possible), and if I could adjust it perfectly, then I&#8217;d be in business.</p>
<p>What happens with a pendulum clock where we add and remove pennies?  There, we can&#8217;t make continuous adjustments: we can only add or remove whole numbers of pennies.  So we can get pretty close to the correct speed, but we probably shan&#8217;t be spot on.  So it seems that continuity is important.</p>
<p>Let&#8217;s be slightly more mathematical about all this.  We have a <a title="Glossary entry for function" href="http://theoremoftheweek.wordpress.com/glossary/#Function">function</a>, <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' />, say, that takes the positions of the knob and maps them to how fast or slow the clock runs.  If the slow end of the knob&#8217;s rotation is denoted by <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and the fast end by <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=f%28a%29%3D-60&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(a)=-60' title='f(a)=-60' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f%28b%29%3D%2B40&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(b)=+40' title='f(b)=+40' class='latex' /> (or whatever the numbers are).  And the argument is that because <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is a continuous function, there&#8217;s some point between <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' /> so that <img src='http://l.wordpress.com/latex.php?latex=f%28c%29%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(c)=0' title='f(c)=0' class='latex' />.  Before I state the theorem formally, here&#8217;s a picture that might help.</p>
<div id="attachment_61" class="wp-caption aligncenter" style="width: 460px"><img class="size-full wp-image-61" title="Intermediate_value_theorem" src="http://theoremoftheweek.wordpress.com/files/2009/08/intermediate_value_theorem1.gif" alt="A continuous function taking a negative value at a and a positive value at b" width="450" height="344" /><p class="wp-caption-text">A continuous function taking a negative value at a and a positive value at b</p></div>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (Intermediate Value Theorem)</span></strong> Let <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> be a function that maps real numbers to real numbers.  Suppose that <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> is continuous on the interval <img src='http://l.wordpress.com/latex.php?latex=%5Ba%2Cb%5D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[a,b]' title='[a,b]' class='latex' />, and that <img src='http://l.wordpress.com/latex.php?latex=f%28a%29+%26%2360%3B+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(a) &lt; 0' title='f(a) &lt; 0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f%28b%29+%26%2362%3B+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(b) &gt; 0' title='f(b) &gt; 0' class='latex' />.  Then there is a point <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c' title='c' class='latex' />, with <img src='http://l.wordpress.com/latex.php?latex=a+%26%2360%3B+c+%26%2360%3B+b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a &lt; c &lt; b' title='a &lt; c &lt; b' class='latex' />, such that <img src='http://l.wordpress.com/latex.php?latex=f%28c%29%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(c)=0' title='f(c)=0' class='latex' />.</p></blockquote>
<p>(The interval <img src='http://l.wordpress.com/latex.php?latex=%5Ba%2Cb%5D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='[a,b]' title='[a,b]' class='latex' /> is the set of all real numbers between <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />, including the end points <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='b' title='b' class='latex' />.)</p>
<p>In the case of my clock, there&#8217;s only going to be one position of the knob that makes the clock run at the correct speed.  But with this general theorem it&#8217;s entirely possible that there&#8217;s more than one value of <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c' title='c' class='latex' /> that gives <img src='http://l.wordpress.com/latex.php?latex=f%28c%29%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(c)=0' title='f(c)=0' class='latex' />.  All the theorem tells us is that there is at least one such value.</p>
<p>How does one prove this theorem?  First, one needs a definition of continuity that makes precise our intuitive understanding &#8212; and I&#8217;m not going into that here (not because it&#8217;s enormously difficult, but it would take a little while).  But then one does just what we did for the clock above: repeated bisection of intervals, to home in on where the point <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c' title='c' class='latex' /> must be.</p>
<p>Here&#8217;s a question you might like to consider.  Suppose that (for some obscure reason) I want my clock to gain 10 minutes per week.  Can I do that?  More mathematically, suppose that I have a continuous function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f' title='f' class='latex' /> so that <img src='http://l.wordpress.com/latex.php?latex=f%28a%29%3D-60&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(a)=-60' title='f(a)=-60' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f%28b%29%3D40&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(b)=40' title='f(b)=40' class='latex' />.  Is there some <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='c' title='c' class='latex' />, with <img src='http://l.wordpress.com/latex.php?latex=a+%26%2360%3B+c+%26%2360%3B+b&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a &lt; c &lt; b' title='a &lt; c &lt; b' class='latex' />, such that <img src='http://l.wordpress.com/latex.php?latex=f%28c%29%3D10&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f(c)=10' title='f(c)=10' class='latex' />?  Can you deduce this directly from the Intermediate Value Theorem?</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Reiniciando a numeração de um ambiente]]></title>
<link>http://meulatex.wordpress.com/2009/07/28/reiniciando-a-numeracao-de-um-ambiente/</link>
<pubDate>Tue, 28 Jul 2009 16:21:29 +0000</pubDate>
<dc:creator>Helena</dc:creator>
<guid>http://meulatex.wordpress.com/2009/07/28/reiniciando-a-numeracao-de-um-ambiente/</guid>
<description><![CDATA[Como reiniciar a cada capítulo / seção / etc. a numeração das subseções / seções / teoremas / defini]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Como reiniciar a cada capítulo / seção / etc. a numeração das subseções / seções / teoremas / definições / etc.</p>
<p><strong>Exemplos:</strong></p>
<p style="padding-left:30px;"><code><br />
\makeatletter \@addtoreset{subsection}{chapter} \makeatother % reseta a<br />
numeração de subseções a cada capítulo<br />
</code><br />
<code>\makeatletter \@addtoreset{deff}{chapter} \makeatother % reseta a numeração de definições a cada capítulo<br />
</code><br />
<code>\makeatletter \@addtoreset{teo}{chapter} \makeatother % reseta a numeração de teoremas a cada capítulo<br />
</code></p>
<p>Note que os ambientes <code>deff</code> e <code>teo</code> não são padrão (foram definidos por mim). Pode-se substituir também <code>chapter</code> por <code>section</code>, para reiniciar a numeração a cada seção, por exemplo.</p>
<p>Esses comandos são úteis para usar antes de um apêndice, quando se quer reiniciar alguma numeração</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Theorem 1: B&eacute;zout's Theorem]]></title>
<link>http://theoremoftheweek.wordpress.com/2009/07/27/theorem-1-bezouts-theorem/</link>
<pubDate>Mon, 27 Jul 2009 20:34:20 +0000</pubDate>
<dc:creator>theoremoftheweek</dc:creator>
<guid>http://theoremoftheweek.wordpress.com/2009/07/27/theorem-1-bezouts-theorem/</guid>
<description><![CDATA[Have you come across those problems about measuring out some amount of water using limited equipment]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Have you come across those problems about measuring out some amount of water using limited equipment?  Here&#8217;s an example.</p>
<p>I have two buckets.  One, when full, contains 3 litres, the other 5 litres.  I have a large quantity of water in the water butt in my garden, and I wish to end up with exactly 17 litres of this water in my pond.  Is this possible, just using these two buckets to transfer water?  (No part-filled buckets allowed!)</p>
<p><!--more--></p>
<p> It doesn&#8217;t take very long to see that it&#8217;s possible.  I fill the 3-litre bucket four times, and the 5-litre bucket once: <img src='http://l.wordpress.com/latex.php?latex=17+%3D+4%5Ctimes+3+%2B+5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='17 = 4\times 3 + 5' title='17 = 4\times 3 + 5' class='latex' />.</p>
<p> What if I only wanted 1 litre in my pond?</p>
<p> That&#8217;s slightly trickier, but not too hard.  I use the 3-litre bucket twice, then take out 5 litres.  In an equation, <img src='http://l.wordpress.com/latex.php?latex=1+%3D+2+%5Ctimes+3+-+5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1 = 2 \times 3 - 5' title='1 = 2 \times 3 - 5' class='latex' />.</p>
<p> What would happen if I started with a 3-litre bucket and a 6-litre bucket and still wanted to get 1 litre in my pond?</p>
<p> After a bit of experimentation, we might notice that the quantities we can make all seem to be multiples of 3.  In fact, we can justify that mathematically.  If we add <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> 3-litre buckets and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> 6-litre buckets (where <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> are allowed to be negative &#8212; we can take water out of the pond), then we get <img src='http://l.wordpress.com/latex.php?latex=3m%2B6n%3D3%28m%2B2n%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3m+6n=3(m+2n)' title='3m+6n=3(m+2n)' class='latex' /> litres in the pond, and that&#8217;s certainly a multiple of 3.  The problem was that the <a title="Glossary entry for highest common factor" href="http://theoremoftheweek.wordpress.com/glossary/#Highest_common_factor">highest common factor</a> of the capacities was 3, and that doesn&#8217;t divide our target amount (1).</p>
<p> What happens if I start with other buckets?  It might be that there&#8217;s a fairly obvious reason why we can&#8217;t make 1 litre: if the highest common factor of the capacities of the two measuring buckets is bigger than 1.  But if I give you two buckets where the highest common factor is 1, will you necessarily be able to get 1 litre from them?</p>
<p>Let&#8217;s translate what we&#8217;d like to do into mathematics.   We are given two positive whole numbers, <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />, say.  Those are the capacities of the two buckets.  We are told that the highest common factor of <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> is 1.  Then we&#8217;d like to know that there are integers <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> so that <img src='http://l.wordpress.com/latex.php?latex=hm%2Bkn%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='hm+kn=1' title='hm+kn=1' class='latex' />.   The equation tells us to transfer <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' />-buckets and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />-buckets of water into the pond.  Of course, a negative number of buckets translates into removing water from the pond, so we may have to be careful about the order in which we do things (we need water in the pond before we can start removing water!).  I&#8217;ll let you convince yourself that if we can solve the equation (find <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />) then it&#8217;s possible.</p>
<p> It turns out that the answer is that it <em>is</em> possible to solve the equation &#8212; that&#8217;s what <a title="MacTutor biography of Bezout" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Bezout.html">Bézout</a>&#8217;s Theorem tells us.  Better still, we can give a constructive proof.  That is, the proof actually gives us a way to find a water-moving strategy that will give 1 litre.</p>
<h3>Finding highest common factors &#8212; Euclid&#8217;s algorithm</h3>
<p> Before we get to the proof, let&#8217;s think about how we can find the highest common factor of two numbers.   One way would be to find and compare the <a title="Glossary entry for prime factorisation" href="http://theoremoftheweek.wordpress.com/glossary/#Prime_factorisation">prime factorisations</a> of the two numbers.  That&#8217;s fine, but finding prime factorisations can be extremely time-consuming if you&#8217;re dealing with large numbers.  Fortunately, there is another way: use <em><a title="MacTutor biography of Euclid" href="http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Euclid.html">Euclid</a>&#8217;s algorithm</em>.  Let&#8217;s see how it works by looking at an example.</p>
<p>Say that we want to find the highest common factor of 121 and 44.  (OK, I know, that&#8217;s easy, because they&#8217;re small numbers.  So please temporarily forget that it&#8217;s easy, and bear with me.  It wouldn&#8217;t be as easy if I were using larger numbers!)  It seems to me to be quite natural to try to divide 121 by 44.  We get</p>
<p><img src='http://l.wordpress.com/latex.php?latex=121+%3D+2%5Ctimes+44+%2B+33&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='121 = 2\times 44 + 33' title='121 = 2\times 44 + 33' class='latex' />.          (1)</p>
<p>So 121 isn&#8217;t divisible by 44.  But if I take a number <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> that divides 121 and 44, then it must also divide the remainder 33 (because <img src='http://l.wordpress.com/latex.php?latex=33+%3D+121+-+2%5Ctimes+44&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='33 = 121 - 2\times 44' title='33 = 121 - 2\times 44' class='latex' />, if you like).  So any common factor of 121 and 44 is also a common factor of 44 and 33, and those are smaller numbers.  So let&#8217;s play the same game again: divide 44 by 33.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=44+%3D+33+%2B+11&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='44 = 33 + 11' title='44 = 33 + 11' class='latex' />.          (2)</p>
<p>Now let&#8217;s try the same thing again: we divide 33 by 11.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=33+%3D+3%5Ctimes+11&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='33 = 3\times 11' title='33 = 3\times 11' class='latex' />.          (3)</p>
<p>We&#8217;d like to use these three equations to show that the highest common factor of 121 and 44 is 11.  Let&#8217;s do this in two stages.  First we&#8217;ll show that 11 really is a common factor of 121 and 44, and then we&#8217;ll show that any common factor of 121 and 44 must divide 11 &#8212; that is, 11 is the <em>highest</em> common factor.  (Of course, this is all very easy with such small numbers, where we could just check directly, but let&#8217;s see how these equations do the work for us.)</p>
<p>Equation (3) tells us that 11 is a factor of 33.  Then from equation (2) we see that, since 11 divides the right-hand side, 11 must also divide the left-hand side, 44.  Now 11 divides 33 and 44, so 11 divides the right-hand side of equation (1), so 11 also divides the left-hand side, namely 121.  We&#8217;ve argued that 11 divides both 121 and 44 &#8212; it&#8217;s a common factor.  That&#8217;s the first stage done.</p>
<p>Now let&#8217;s take any common factor <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> of 121 and 44; we want to show that <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> is a factor of 11.  Equation (1) tells us that <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> is a factor of 33.  Then equation (2) shows that <img src='http://l.wordpress.com/latex.php?latex=d&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='d' title='d' class='latex' /> is a factor of 11.  And that&#8217;s exactly what we wanted.</p>
<p>By working through the equations twice, once from last to first and once from first to second-last, we showed that the highest common factor of 121 and 44 is 11.</p>
<p>Let&#8217;s do another simple example.  We&#8217;ll find the highest common factor of 31 and 14, using this same algorithm.</p>
<p><img src='http://l.wordpress.com/latex.php?latex=31+%3D+2%5Ctimes+14+%2B+3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='31 = 2\times 14 + 3' title='31 = 2\times 14 + 3' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=14+%3D+4%5Ctimes+3+%2B+2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='14 = 4\times 3 + 2' title='14 = 4\times 3 + 2' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=3+%3D+2%2B1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3 = 2+1' title='3 = 2+1' class='latex' /></p>
<p><img src='http://l.wordpress.com/latex.php?latex=2%3D2%5Ctimes+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2=2\times 1' title='2=2\times 1' class='latex' /></p>
<p>Using just the same argument as before (working through the equations twice, once from the last to the first and once from the first to the second-last), we see that the highest common factor is 1.  I&#8217;ll let you think about the details.</p>
<h3>Solving our equation</h3>
<p>How does that help us find whole numbers <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> so that <img src='http://l.wordpress.com/latex.php?latex=31m+%2B+14n+%3D+1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='31m + 14n = 1' title='31m + 14n = 1' class='latex' />?</p>
<p>Well, the third equation tells us that <img src='http://l.wordpress.com/latex.php?latex=1+%3D+3-+2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1 = 3- 2' title='1 = 3- 2' class='latex' />.  We&#8217;ve successfully written 1 as a combination of some integers &#8212; but not the right integers!</p>
<p>The second equation gives <img src='http://l.wordpress.com/latex.php?latex=2+%3D+14+-+4%5Ctimes+3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2 = 14 - 4\times 3' title='2 = 14 - 4\times 3' class='latex' />.  Substituting this in gives <img src='http://l.wordpress.com/latex.php?latex=1+%3D+3-%2814-4%5Ctimes+3%29+%3D+5%5Ctimes+3+-+14&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1 = 3-(14-4\times 3) = 5\times 3 - 14' title='1 = 3-(14-4\times 3) = 5\times 3 - 14' class='latex' />.  That looks slightly more promising: we&#8217;ve written 1 as a combination of 3 and 14, and 14 is one of the numbers we&#8217;re aiming for.  Let&#8217;s get rid of the 3.</p>
<p>The first equation gives <img src='http://l.wordpress.com/latex.php?latex=3%3D31-2%5Ctimes+14&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3=31-2\times 14' title='3=31-2\times 14' class='latex' />.  Substituting this in gives <img src='http://l.wordpress.com/latex.php?latex=1%3D5%2831-2%5Ctimes+14%29-14%3D5%5Ctimes+31-11%5Ctimes+14&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1=5(31-2\times 14)-14=5\times 31-11\times 14' title='1=5(31-2\times 14)-14=5\times 31-11\times 14' class='latex' />.  Aha!  So we can take <img src='http://l.wordpress.com/latex.php?latex=m%3D5&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m=5' title='m=5' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n%3D-11&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n=-11' title='n=-11' class='latex' />, for example.  (There are lots of other solutions too, this is just one example.  But our aim was only to show that there is <em>a</em> solution, not to find them all.)</p>
<p>Let&#8217;s state Bézout&#8217;s Theorem formally.</p>
<blockquote><p><strong><span style="text-decoration:underline;">Theorem (Bézout)</span></strong> Let <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> be integers.  Then there are integers <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='m' title='m' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' /> satisfying the equation</p>
<p><img src='http://l.wordpress.com/latex.php?latex=hm%2Bkn%3D1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='hm+kn=1' title='hm+kn=1' class='latex' /></p>
<p>if and only if the highest common factor of <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> is 1 (they are <em>coprime</em>).</p></blockquote>
<p>How does one prove this?</p>
<p>The &#8220;only if&#8221; direction was the easy one: we noticed earlier that if there&#8217;s a solution to the equation then the highest common factor of <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> must be 1.</p>
<p>And for the &#8220;if&#8221; direction one uses Euclid&#8217;s algorithm, exactly as we did for the particular example just now.  You run Euclid&#8217;s algorithm to find the highest common factor of <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' /> (which we know is 1), and then use the resulting equations to express 1 in terms of <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='k' title='k' class='latex' />.</p>
<p>You might like to try this for some pairs of numbers of your own.  Try some fairly large numbers &#8212; Euclid&#8217;s algorithm is very efficient, and doesn&#8217;t take very many steps.</p>
<h3>Further reading</h3>
<p>I am a fan of Harold Davenport&#8217;s <em>The Higher Arithmetic</em>, which has a very readable discussion of Euclid&#8217;s algorithm (and various things that can be deduced from it) in Chapter 1.  Euclid&#8217;s algorithm and Bézout&#8217;s theorem are discussed in many introductory number theory books, that just happens to be a personal favourite.</p>
<p>There&#8217;s a short series of nice articles about Euclid&#8217;s algorithm and its uses on the NRICH website, starting with <a title="NRICH article about Euclid's algorithm" href="http://nrich.maths.org/public/viewer.php?obj_id=1357">Euclid&#8217;s Algorithm I</a>.</p>
</div>]]></content:encoded>
</item>

</channel>
</rss>
