<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>bayes &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/bayes/</link>
	<description>Feed of posts on WordPress.com tagged "bayes"</description>
	<pubDate>Sun, 29 Nov 2009 14:44:57 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[Screening for terrorists]]></title>
<link>http://sciencehouse.wordpress.com/2009/11/13/screening-for-terrorists/</link>
<pubDate>Fri, 13 Nov 2009 20:31:35 +0000</pubDate>
<dc:creator>Carson Chow</dc:creator>
<guid>http://sciencehouse.wordpress.com/2009/11/13/screening-for-terrorists/</guid>
<description><![CDATA[The recent tragedy at Fort Hood has people griping about missed signals that could have been used to]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>The recent tragedy at Fort Hood has people griping about missed signals that could have been used to prevent the attack.  However, I will argue that is likely to be impossible to ever have a system that can screen out all terrorists without also flagging a lot of innocent people.  The calculation is a simple exercise in probability theory that is often given in first year statistic classes.</p>
<p>Suppose we have a system in place that gives a yes Y or no response of whether or not a person is a terrorist T.  Let P(T) be the probablity that a given person is a terrorist,  P(T&#124;Y) be the probability that a person  is a terrorist given that the test said yes.  Thus P(~T&#124;Y)=1-P(T&#124;Y) is the probability that one is not a terrorist even though the test said so.  Using Bayes theorem we have that</p>
<p style="text-align:center;">P(~T&#124;Y)=P(Y&#124;~T) P(~T)/P(Y)  (*)</p>
<p>where P(Y)=P(Y&#124;T)P(T) + P(Y&#124;~T)P(~T) is the probability of getting a yes result.   Now, the probability of being a terrorist is very low.   Out of the 300 million or so people in the US a small number are probably potential terrorists.  The US military has over a million people on active service.   Hence, the probability of not being a terrorist is very high.</p>
<p>From (*),  we see that in order to have a low probability of flagging an innocent person we need to have  P(Y&#124;~T)P(~T)&#60;&#60; P(Y&#124;T)P(T), or P(Y&#124;~T)&#60;&#60; P(Y&#124;T) P(T)/P(~T).  Since  P(T) is very small, P(T)/P(~T)~ P(T),   so if the true positive probability P(Y&#124;T) was near one (i.e. a test that catches all terrorists), we need the false positive probability P(Y&#124;~T) to be much smaller than the probability of having a terrorist, which means we need a test that gives false positives at a rate of less than 1 in a million.  The problem is that the true positive and false positive probabilities will be correlated.  The more sensitive the test the more likely it is to get a false positive.  So if you set your threshold to be very low so P(Y&#124;T) is very high (i.e. make sure you never miss a terrorist), you&#8217;ll most certainly have P(Y&#124;~T) to also be high.  I doubt you&#8217;ll ever have a test where P(Y&#124;T) is near one while P(Y&#124;~T) is less than one in a million.   So basically, if we want to catch all the terrorists, we&#8217;ll also have to flag a lot of innocent people.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Why Head and Shoulders Has Less Variations For Sale]]></title>
<link>http://msujaws.wordpress.com/2009/11/08/why-head-and-shoulders-has-less-variations-for-sale/</link>
<pubDate>Sun, 08 Nov 2009 04:50:27 +0000</pubDate>
<dc:creator>msujaws</dc:creator>
<guid>http://msujaws.wordpress.com/2009/11/08/why-head-and-shoulders-has-less-variations-for-sale/</guid>
<description><![CDATA[Not too long ago, Proctor &amp; Gamble marketed Head and Shoulders shampoo with twenty-six variation]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><img class="alignright" title="Head and Shoulders" src="http://2.bp.blogspot.com/_XxPbedo13nY/SdPflThvWhI/AAAAAAAACVc/tsB2Chwtuuo/s400/head+and+shoulders.png" alt="" width="211" height="181" />Not too long ago, Proctor &#38; Gamble marketed Head and Shoulders shampoo with twenty-six variations. Sales were good, but not maximal. To increase sales of their shampoo, they decided to <em>reduce</em> the number of variations from twenty-six to fifteen. Result: Sales increased 10% <a href="http://www.columbia.edu/~ss957/articles/Choice_is_Demotivating.pdf">[1]</a>.</p>
<p>But does that make sense? Popular notion would seem that the more choices an individual can make, the happier they will be. Don&#8217;t you like to have the option to choose between a small, medium, large, or extra-large drink? It turns out that the popular notion isn&#8217;t wrong, sometimes it&#8217;s just applied at the wrong times.</p>
<p>When given the option to choose, increasing the number of options from 2 to 6 increases the satisfaction of the individual and reduces the amount of time to make said decision <a href="http://www.columbia.edu/~ss957/articles/Choice_is_Demotivating.pdf">[1]</a>. But at what point does the addition of options have diminishing returns? George Miller, a cognitive psychologist at Princeton in 1956, found that the optimal number of choices are 7, plus or minus 2 <a href="http://en.wikipedia.org/wiki/The_Magical_Number_Seven,_Plus_or_Minus_Two">[2]</a>.</p>
<p>This 7 ± 2 rule has been widely cited, and it may be the reason that many fast food restaurants have pared down the number of value meals that are offered to fit within this range. When the number of options is higher than 9, individuals will feel overwhelmed and burdened by the worry that comes with making a &#8220;wrong&#8221; choice.</p>
<p>Residents of the United States are faced with so many decisions on a daily basis, including decisions that are likely to affect loved ones, that the decision-making process becomes demotivating <a href="http://www.ucpress.edu/books/pages/5572001.php">[3]</a>. Up to this point (2000), there has been research that focused on the 7 ± 2 rule, options ranging from 2 to 6, but no research on options ranging in count from 6 to 24 or 6 to 30.</p>
<p>A study by <a href="http://www.columbia.edu/~ss957/index.shtml">Sheena Iyengar</a> of Columbia University and <a href="http://www.stanford.edu/~lepper/">Mark Lepper</a> of Stanford University specifically looks at the effects of choice and has interesting findings. Iyengar and Lepper performed three studies observing participants initial reaction, satisfaction, and decision-making when presented with extensive and limited selections.</p>
<p>One of their studies focused on the sale of exotic jams. They stationed a sample booth in a gourmet grocery store and offered varying numbers of jam to sample. If a customer approached the sample booth, they received a coupon towards a jam, and purchases of the jam were recorded to determine the effect of the sampling booth.</p>
<p style="text-align:center;">Table 1</p>
<table>
<thead>
<tr style="text-align:left;">
<th>Extensive Selection</th>
<th>Limited Selection</th>
<th>Total</th>
</tr>
</thead>
<tbody>
<tr>
<td>24 flavors</td>
<td>6 flavors</td>
<td></td>
</tr>
<tr>
<td>386 shoppers</td>
<td>368 shoppers</td>
<td>754 shoppers</td>
</tr>
<tr>
<td>242 encountered</td>
<td>260 encountered</td>
<td>502 encountered</td>
</tr>
<tr>
<td>145 stopped/sampled</td>
<td>104 stopped/sampled</td>
<td>249 stopped/sampled</td>
</tr>
<tr>
<td>4 purchased</td>
<td>31 purchased</td>
<td>35 purchased</td>
</tr>
</tbody>
</table>
<p>Those results show the following percentages:</p>
<p style="text-align:center;">Table 2</p>
<table style="width:350px;">
<thead>
<tr style="text-align:left;">
<th>Extensive Selection</th>
<th>Limited Selection</th>
</tr>
</thead>
<tbody>
<tr>
<td>66% encountered</td>
<td>63% encountered</td>
</tr>
<tr>
<td>60% stopped</td>
<td>40% stopped</td>
</tr>
<tr>
<td>3% purchased</td>
<td>30% purchased</td>
</tr>
</tbody>
</table>
<p>These results are very interesting. Although the number of customers that approached the extensive selection (60%) was much higher than the limited selection (40%), these customers did not purchase the jam. In fact, the customers that approached the limited selection were <strong>10 times more likely</strong> to purchase.</p>
<p>Given those percentages, <strong>can we turn the data on its head and determine the effect that the number of flavors presented had, given that the customer was going to purchase the jam?</strong></p>
<p>To answer the question, we can use the naïve Bayes theorem <a href="http://en.wikipedia.org/wiki/Naive_Bayes_classifier">[4]</a>. The naïve Bayes theorem provides a way to find the posterior probability given a pre- and post-condition. We can use the percentages given in Table 2 and apply them to the theorem:</p>
<p><a href="http://en.wikipedia.org/wiki/Naive_Bayes_classifier"><img class="aligncenter" title="Bayes Theorem" src="http://upload.wikimedia.org/math/3/1/7/3174021f44ba0d31f6ede772624c5523.png" alt="" width="350" height="50" /></a></p>
<p>The probabilities for purchasing without sampling were  not given, and thus we will use a weighted average of the purchasers to estimate the probability at 0.095 <em>((4/386) + (31/368))</em>. These numbers are chosen because we will have to assume that regardless of the number of flavors present at the sample booth, these customers were already going to make a purchase. <em>Ideally, we would have in our data set the number of customers who purchased regardless of stopping at the sample booth.</em></p>
<p style="text-align:left;"><em>P<sub>6</sub>(sample&#124;purchase) = (P<sub>6</sub>(purchase&#124;sample) * P<sub>6</sub>(sample))/(P<sub>6</sub>(purchase)) = (0.3 * 0.4)/0.095 = 1.26<br />
P<sub>24</sub>(sample&#124;purchase) = (P<sub>24</sub>(purchase&#124;sample) * P<sub>24</sub>(sample))/(P<sub>24</sub>(purchase)) = (0.03 * 0.6)/0.095 = 0.19</em></p>
<p>Using the naïve Bayes Theorem, we are able to find that when customers were buying jam, they were <strong>663% more likely</strong> to stop and sample jam if the sample booth only contained 6 flavors compared to 24 flavors. The increase in choice may have at first glance brought many more customers to the sample booth, yet the customers that approached the booth when 24 flavors were presented may never have intended on purchasing the jam.</p>
<p>So Miller&#8217;s rule says that 7 ± 2 is the right number of choices, and Iyengar and Lepper make a case that happiness does not increase with choices. Right now, we are encountering <em>too many</em> choices. Most of us would probably choose the same options, so we might as well be happier about our choices. Therefore, if our goal is to make the customer happier, the selection should be limited before they encounter their decision. In the end, we will all be happier.</p>
<p style="text-align:left;">[1] <a href="http://www.columbia.edu/~ss957/articles/Choice_is_Demotivating.pdf">http://www.columbia.edu/~ss957/articles/Choice_is_Demotivating.pdf</a><br />
[2] <a href="http://en.wikipedia.org/wiki/The_Magical_Number_Seven,_Plus_or_Minus_Two">http://en.wikipedia.org/wiki/The_Magical_Number_Seven,_Plus_or_Minus_Two</a><br />
[3] <a href="http://www.ucpress.edu/books/pages/5572001.php">http://www.ucpress.edu/books/pages/5572001.php</a><br />
[4] <a href="http://en.wikipedia.org/wiki/Naive_Bayes_classifier">http://en.wikipedia.org/wiki/Naive_Bayes_classifier</a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Wolpert. SfN meeting 2009 – 2° giorno]]></title>
<link>http://neurophy.wordpress.com/2009/10/19/wolpert-sfn-meeting-2009-%e2%80%93-2%c2%b0-giorno/</link>
<pubDate>Mon, 19 Oct 2009 23:24:48 +0000</pubDate>
<dc:creator>Davide Anchisi</dc:creator>
<guid>http://neurophy.wordpress.com/2009/10/19/wolpert-sfn-meeting-2009-%e2%80%93-2%c2%b0-giorno/</guid>
<description><![CDATA[Il secondo giorno di meeting era il 18 ottobre. Lo so, le date sono sfasate ma: sto scrivendo in rit]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><em>Il secondo giorno di meeting era il 18 ottobre. Lo so, le date sono sfasate ma:</em></p>
<ol>
<li><em>sto scrivendo in ritardo;</em></li>
<li><em>l&#8217;ora dei post è rimasta quella italiana, mentre qui sono 7 ore indietro.</em></li>
</ol>
<p>Altra giornata piena. Bellissimo il seminario di Daniel <strong>Wolpert</strong>: &#8220;<em>Moving in an Uncertain World: Computational Principles of Human Motor Control</em>&#8220;.</p>
<p>Esposizione chiara e divertente dell&#8217;integrazione sensorimotoria. Approccio <em>bayesiano</em>. Mostrati anni di esperimenti.</p>
<p>Tra le tante cose <strong>alcuni punti:</strong></p>
<ul>
<li>Secondo Wolpert capire il movimento significa capire il cervello. Infatti sostiene che l&#8217;unico motivo che abbiamo di avere un sistema nervoso è quello di muoverci. A parte sudare.</li>
<li>La distribuzione di probabilità dei movimenti varia per diversi movimenti. [Una premessa: anche per l'esecuzione dei comandi motori, come per la percezione, c'è incertezza e rumore]. Posso effettuare movimenti diversi per arrivare ad uno stesso scopo. Avrò meno incertezza sul raggiungimento dello scopo se scelgo il movimento con la distribuzione di probabilità più ristretta.</li>
<li>Le informazioni sensoriali continuano ad essere processate anche quando non c&#8217;è più lo stimolo che le ha causate. Questo permette di migliorare una scelta, ma in ritardo, e quindi a cambiarla anche dopo che l&#8217;azione è iniziata.</li>
<li>Se il sistema usa una funzione di costo/beneficio differente prenderà anche decisioni diverse.</li>
</ul>
<p>Su quest&#8217;ultimo punto: mi chiedo se sia possibile utilizzarlo anche a livello comportamentale per migliorare gli allenamenti o la performance. Siccome il sistema cerca di ottimizzare il suo comportamento per minimizzare i costi dell&#8217;effetto della sua azione, se nell&#8217;allenamento do peso di più o penalizzo maggiormente certi certi aspetti piuttosto che altri cambio ciò come il sistema cerca cambiare per ottimizzare il risultato. Detto così è un po&#8217; fumoso, a livello intuitivo ho chiaro il senso, ma devo elaborarlo di più perché possa scriverne esplicitamente.</p>
<p>Un scelta della <strong>bibliografia</strong> citata nel seminario:</p>
<ul>
<li>Weiskrantz L, Elliott J, Darlington C.<em> Preliminary observations on tickling oneself.</em> Nature 1971, 230:598-599.</li>
<li>Shergill SS, Bays PM, Frith CD, Wolpert DM. <em>Two eyes for an eye: the neuroscience of force escalation.</em> Science 2003, 301:187</li>
<li>Shergill SS, Samson G, Bays PM, Frith CD, Wolpert DM. <em>Evidence for sensory prediction deficits in schizophrenia.</em> Am J Psychiatry 2005, 162:2384-6.</li>
</ul>
<p><em>La bibliografia continua, andrò avanti. Ora ho problemi tecnici a recuperarla.</em></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Talk at MBI]]></title>
<link>http://sciencehouse.wordpress.com/2009/10/06/talk-at-mbi/</link>
<pubDate>Wed, 07 Oct 2009 02:52:34 +0000</pubDate>
<dc:creator>Carson Chow</dc:creator>
<guid>http://sciencehouse.wordpress.com/2009/10/06/talk-at-mbi/</guid>
<description><![CDATA[I&#8217;m currently at the Mathematical Biosciences Institute for a workshop on Computational challe]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I&#8217;m currently at the <a href="http://mbi.osu.edu/">Mathematical Biosciences Institute</a> for a workshop on <a href="http://mbi.osu.edu/2009/ctwdescription.html">Computational challenges in integrative biological modeling</a>.  The slides for my talk on using Bayesian methods for parameter estimation and model comparison are<a href="http://sciencehouse.wordpress.com/files/2009/10/mbi_bayes_09_10_07.pdf"> here</a>.</p>
<div id="seminar">
<div>Title: Bayesian approaches for parameter estimation and model evaluation of dynamical systems</div>
</div>
<p>Abstract: Differential equation models are often used to model biological systems. An important and difficult problem is how to estimate parameters and decide which model among possible models is the best. I will argue that Bayesian inference provides a self-consistent framework to do both tasks. In particular, Bayesian parameter estimation provides a natural measure of parameter sensitivity and Bayesian model comparison automatically evaluates models by rewarding fit to the data while penalizing the number of parameters. I will give examples of employing these approaches on ODE and PDE models.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[KODE-panikk og bayesisk gjennomsnitt]]></title>
<link>http://kvalitetssynsing.wordpress.com/2009/08/31/kode-panikk-og-bayesisk-gjennomsnitt/</link>
<pubDate>Mon, 31 Aug 2009 14:33:55 +0000</pubDate>
<dc:creator>kvalitetssynsing</dc:creator>
<guid>http://kvalitetssynsing.wordpress.com/2009/08/31/kode-panikk-og-bayesisk-gjennomsnitt/</guid>
<description><![CDATA[KODE føler seg lurt til å få panikk av en svineforkjølelse-artikkel i New York Times. Jeg kjenner me]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><a href="http://kodeskriver.blogspot.com/2009/08/svineinfluensa.html">KODE føler seg lurt til å få panikk</a> av en svineforkjølelse-artikkel i New York Times. Jeg kjenner meg igjen.</p>
<p>Det er noe som heter at &#8220;ekstraordinære påstander krever ekstraordinære bevis&#8221;, at man ikke skal tro alt man hører, at man alltid må bruke sunt bondevett, at man ikke skal høre på ekstremister, osv. </p>
<p>Dette finnes også i statistikk, og kalles, på engelsk, <a href="http://en.wikipedia.org/wiki/Bayesian_average">Bayesian average</a> (igjen takk til Wikipedia):</p>
<blockquote><p>A Bayesian average is a method of estimating the mean of a population consistent with Bayesian interpretation, where instead of estimating the mean strictly from the available data set, other existing information related to that data set may also be incorporated into the calculation in order to minimize the impact of large deviations within the data set, or to assert a default value when the data set is small.</p></blockquote>
<p>Det høres sikkert komplisert ut, men det blir lettere å forstå med dette eksempelet:</p>
<blockquote><p>For example, in a calculation of an average review score of a book where only two reviews are available, both giving scores of 10, a normal average score would be 10. However, as only two reviews are available, 10 may not represent the true average had more reviews been available. The review site may instead calculate a Bayesian average of this score by adding the average review score of all books in the store to the calculation. For example, by adding five scores of 7 each, the Bayesian average becomes 7.86 instead of 10, which the review site would hope that it will better represent the quality of the book.</p></blockquote>
<p>Det blir subjektivt hvor mye man ønsker å trekke ned gjennomsnittet på denne måten. Det kommer her bl.a. an på hvor objektive/dyktige, og representative, man mener anmelderne er (altså, vekt og relevans).</p>
<p><b>I tilfellet svineinfluensa har KODE konsultert kun én kilde, New York Times, som gir swineflu 10/10 poeng på panikk-skalaen.</b> KODE skulle kanskje ha forstått hvor lite sannsynlig det var at dette var representativt og at han dermed <i>burde få</i> et panikkanfall tilsvarende 10/10 poeng, både ut fra kildens seriøsitet, hva man har kunnet lese andre steder, og ut fra prinsippet at det er særdeles få fenomener som <i>skal</i> gi 10/10 panikkpoeng.</p>
<p>Problemer med å vurdere og gjennomskue falske ekstraordinære påstander dukker gjerne opp når man stoler på veldig få kilder og dermed får et lite datagrunnlag (&#8220;få anmeldere&#8221;). I de aller fleste saker bør man faktisk stole på at verden henger på greip og at vanlige folk &#8211; eller på mer spesialiserte områder i det minste eksperter &#8211; har noenlunde snøring.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Bayes' Theorem and how to Dismiss Paranoia...]]></title>
<link>http://pythonism.wordpress.com/2009/08/30/bayes-theorem-and-how-to-dismiss-paranoia/</link>
<pubDate>Sun, 30 Aug 2009 22:24:32 +0000</pubDate>
<dc:creator>pythonisms</dc:creator>
<guid>http://pythonism.wordpress.com/2009/08/30/bayes-theorem-and-how-to-dismiss-paranoia/</guid>
<description><![CDATA[It is amazing how a very simple mathematical formula can hold such a world of detail and be such a p]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>It is amazing how a very simple mathematical formula can hold such a world of detail and be such a powerful tool in understanding reality.</p>
<p>One such formula is <strong>Bayes&#8217; Theorem</strong></p>
<p>This theorem tells us about the conditional probability of one event given that another event has been observed, what that is really about is the following:</p>
<p>How do we assess the likelihood of an event given a piece of evidence that may or may not have bearing on it? </p>
<p>The idea can be built up quite simply. let&#8217;s think of two events called A and B. The probabilities of each of these happening are represented as P(A) and P(B).</p>
<p>We&#8217;ll immediately make this concrete to explain it best. Let&#8217;s denote as follows</p>
<p><strong>A</strong> : John is guilty of the murder<br />
<strong>B</strong> : John was found at the scene after the dead body was found with blood on his hands</p>
<p>A has yet to be established, but we know B has already happened.</p>
<p>Well Bayes&#8217;s Theorem gives us one possible way to do some calculation about this. Time for notation&#8230;</p>
<p><strong>P(A&#124;B)</strong> is how we write &#8220;The probability of A happening given that B has happened.</p>
<p>it turns out that P(A&#124;B) can be expressed as</p>
<p><strong>P(A&#124;B) = P(A) * ( P(B&#124;A) / P(B) )</strong></p>
<p>So in this case P(A) will be the probability of a freely chosen individual in the sample being guilty of a murder. This is the basic rate when we have absolutely no evidence to narrow it down. In other words P(A) will be the murder rate in whatever country we are in.</p>
<p>P(B) will be the likelihood of a random individual being found with blood on their hands at a crime scene</p>
<p>and P(B&#124;A) is the probability of someone being found with blood on their hands when it is given and therefore certain they have just committed a murder. In other words this is a quantity that is the correlation or degree of connection between murder and having blood on your hands.</p>
<p>To look at our example again, maybe if someone worked at an abbatoir or butcher&#8217;s they might frequently be found with blood on their hands, so it is not self evident that the given evidence guarantees the probability P(A) must be 1. However it does seem likely that this kind of evidence is compelling, therefore we expect the correlation of bloody hands with murders to be quite high. P(B) is the denominator in the right hand term though, so if B is rare  and therefore P(B) small then that enhances our evidence&#8217;s power to convict. P(A) is the starting likelihood of john being a murderer before we found the evidence. So you can see that</p>
<p><strong>P(B&#124;A) / P(B)</strong> is the amount by a factor of which our evidence alters the likelihood of A. If P(B&#124;A) is small then our evidence may not help us that much, unless P(B) is even smaller. If P(B&#124;A) is big, i.e. if the two events of bloody hands and guilt are strongly correlated then the evidence may be helpful. If there&#8217;s slight correlation but B is a common event then once again the evidence may not be much help. You can play around with the figures yourself but its such a beautifully simple formula that it becomes quite easy to visualise.</p>
<p>One of the noted applications of Bayesian Reasoning of this kind is in using healthcare data about the error rates of clinical tests. A scientist online quoted an example of the Mammogram as a test for breast cancer, and to what extent we can expect a positive mammogram test to lead us to believe that the subject has breast cancer. Interestingly the figures work out to show that a small but appreciable false positive rate and a similarly small nonzero false negative rate seriously diminish the validity of the test. Here&#8217;s how it goes:</p>
<p>1% of women at age forty who participate in routine screening have breast cancer.  80% of women with breast cancer will get positive mammographies.  9.6% of women without breast cancer will also get positive mammographies.  A woman in this age group had a positive mammography in a routine screening.  What is the probability that she actually has breast cancer?</p>
<p>quoted from http://yudkowsky.net/rational/bayes</p>
<p>well if A is having cancer and B is testing positive then <strong>P(B&#124;A)</strong> should be the frequency of people who test positive while definitely and verifiably having cancer.</p>
<p>so <strong>P(A),</strong> the <em>incidence of cancer in the group</em>, is clearly 0.01</p>
<p><strong>P(B)</strong> is a little subtler, since it has to include the false positives rate, I make</p>
<p><strong>P(B) = (0.01 * 0.8) + (0.096 * (1- 0.01)) = 0.10304</strong></p>
<p>this is the <em>incidence of positive tests</em>, real illness notwithstanding</p>
<p>and finally</p>
<p><strong>P(B&#124;A) = 0.8</strong></p>
<p>which is 1 minus the <em>false negative rate</em> (number of people with cancer who don&#8217;t test positive)</p>
<p>so here:</p>
<p><strong>P(A&#124;B)</strong> = 0.01 * (0.8 / 0.10304) = <strong>0.07763 = 7.763%</strong></p>
<p>In other words the test is really not a conclusively strong piece of evidence. The catch is that there are so many more women who don&#8217;t have breast cancer that even a relatively small proportion of false positives (people who the test says have cancer but who don&#8217;t in reality) changes the figures quite drastically.</p>
<p>So there we have it ! The great thing about this stuff is that there is such a myriad profusion of examples we can think up. Any area where some basic stats are known can really elucidate and sharpen our understanding of how evidence based reasoning works out in practice. </p>
<p>I have also personally found that going to the trouble of some rule of thumb calculations can even help allay personal fears about  nasty and dangerous events, which duly do prove to be rather unlikely. <em>Maths beats paranoia every time !!</em></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Being wrong may be rational]]></title>
<link>http://sciencehouse.wordpress.com/2009/08/14/being-wrong-may-be-rational/</link>
<pubDate>Fri, 14 Aug 2009 18:33:43 +0000</pubDate>
<dc:creator>Carson Chow</dc:creator>
<guid>http://sciencehouse.wordpress.com/2009/08/14/being-wrong-may-be-rational/</guid>
<description><![CDATA[This past Sunday, economist Paul Krugman was lamenting in a book review of Justin Fox&#8217;s book ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>This past Sunday, economist Paul Krugman was lamenting in a <a href="http://www.nytimes.com/2009/08/09/books/review/Krugman-t.html?scp=1&#38;sq=krugman%20book%20review&#38;st=cse">book review</a> of Justin Fox&#8217;s book &#8220;Myth of the Rational Market&#8221; (which he liked very much) that despite this current financial crisis and previous crises, like the failure of the hedge fund <a href="http://en.wikipedia.org/wiki/Long-Term_Capital_Management">Long Term Capital Management</a>, people still believe in efficient markets as strongly as ever. The <a href="http://en.wikipedia.org/wiki/Efficient-market_hypothesis">efficient market hypothesis </a>is the basis of most of modern finance and assumes that the price of a security is always correct and that you can never beat the market.  So artificial bubbles should never occur.  Krugman wonders what it will take to ever change people&#8217;s minds.</p>
<p>I want to show here that there might be no amount of evidence that will ever change their minds and they can still be perfectly rational in the Bayesian sense.  The argument can also apply to all other controversial topics.  I think it is generally believed in intellectual circles that the reason there is so much disagreement on these issues is that the other side is either stupid, deluded or irrational.   I want to point out that believing in something completely wrong even in the face of overwhelming evidence may arise in perfectly rational beings.  That is not to say that faulty reasoning does not exist and can be dangerous.  It just explains why two perfectly reasonable and intelligent people can disagree so alarmingly.</p>
<p><!--more--></p>
<p>Consider the very simple case of a hypothesis H, like &#8220;the market is not efficient&#8221; and there is some data D, like a financial crisis.  Then from Bayes rule, the probability that the hypothesis is true given the data is <img src='http://l.wordpress.com/latex.php?latex=P%28H%26%23124%3BD%29%3DP%28D+%26%23124%3B+H%29P%28H%29%2FP%28D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='P(H&#124;D)=P(D &#124; H)P(H)/P(D)' title='P(H&#124;D)=P(D &#124; H)P(H)/P(D)' class='latex' />, where P(D&#124;H) is the likelihood function (probability of obtaining the data given the hypothesis is true), P(H) is the prior probability the hypothesis is true, and P(D) is the probability of obtaining the data.  Thus the odds that the hypothesis is true to it being false <img src='http://l.wordpress.com/latex.php?latex=%5Coverline%7BH%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\overline{H}' title='\overline{H}' class='latex' /> is</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+%5Cfrac%7BP%28H%26%23124%3BD%29%7D%7BP%28%5Coverline%7BH%7D%26%23124%3BD%29%7D+%3D+%5Cfrac%7BP%28D%26%23124%3BH%29%7D%7BP%28D%26%23124%3B%5Coverline%7BH%7D%29%7D%5Cfrac%7BP%28H%29%7D%7BP%28%5Coverline%7BH%7D%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle \frac{P(H&#124;D)}{P(\overline{H}&#124;D)} = \frac{P(D&#124;H)}{P(D&#124;\overline{H})}\frac{P(H)}{P(\overline{H})}' title='\displaystyle \frac{P(H&#124;D)}{P(\overline{H}&#124;D)} = \frac{P(D&#124;H)}{P(D&#124;\overline{H})}\frac{P(H)}{P(\overline{H})}' class='latex' /></p>
<p>So, you see that even if two people have identical likelihood functions (i.e. reasoning ability) and have the same data, they can still come to completely different conclusions depending on their priors.  For example, let&#8217;s say two people agree that the odds of a crisis occurring given that efficient markets are false is 100 to 1.  So whatever the prior odds against an efficient market was before, it is now 100 times greater after the crisis.  So for someone who may have believed that efficient markets had even odds of being false now believes it is 100 to 1 that it wrong.  But, someone who originally believed that the odds of an efficient market were false  was one in a million now believes it is one in ten thousand.  Hence, given enough events they will eventually change their minds.  However, suppose there is a person who believed that the probability of an efficient market is false is zero.  Then they are completely unaffected by the data and no amount of data can ever convince them.  If a hypothesis has zero prior support then it can never be validated no matter what the data.</p>
<p>You could argue that a zero prior is faulty to start with but that is not a failure of reasoning.  In fact, it is easy to see how a prior could be zero.  Suppose you are a naive student and you take a class from a <a href="http://en.wikipedia.org/wiki/Eugene_Fama">Fama</a> or a <a href="http://en.wikipedia.org/wiki/Merton_Miller">Miller</a> who implicitly assumes that the market is efficient. You could easily end up believing that it is a law of nature like gravity.  There could even be an amplification effect in that the professor may have some doubt about the idea but for pedagogical or other purposes will not bring it up in her class.   Then the next generation is even more certain and eventually it becomes dogma.</p>
<p>So why are there efficient market doubters?  Well I think that there are probably some neural mechanisms that sets a minimum doubt level in every person.  Some people have complete certainty in their beliefs while others have doubts about everything.  I believe that this doubt level is innate and could be related to genes governing certain ion channels in the brain.  So some of the students will not completely believe in the efficient market.   However, given that doubt level seems to be broadly distributed in the population there must be advantages for maintaining diversity in a population.  A community of pure believers is dangerous (e.g. Jonestown) but one of doubters may end up starving to death because they can never decide on what to do.  A balance of the two may be necessary to get things done but also have a reality check.  This also means that some wrong ideas will only disappear when the zero doubt holders take them to the grave.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Chapter 1: Introduction]]></title>
<link>http://ainotes.wordpress.com/2009/07/22/chapter-1-introduction-part-1/</link>
<pubDate>Wed, 22 Jul 2009 20:27:58 +0000</pubDate>
<dc:creator>ainotes</dc:creator>
<guid>http://ainotes.wordpress.com/2009/07/22/chapter-1-introduction-part-1/</guid>
<description><![CDATA[Approaches to AI The introductory chapter starts with a four-fold categorization of the approaches t]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><h2 style="text-align:justify;"><strong>Approaches to AI</strong></h2>
<p style="text-align:justify;">The introductory chapter starts with a four-fold categorization of the approaches that have been taken in AI:</p>
<p style="text-align:justify;"><strong>1. Thinking like Humans</strong><br />
Cognitive science pretty much sums up this category. The idea is to understand how humans make intelligent decisions (using neuroscience, psychology, etc) and model it on an artificial system.</p>
<p style="text-align:justify;"><strong>2. Acting like Humans</strong><br />
Here the focus is on the results produced by humans. Any approach which has as it&#8217;s aim passing the Turing test can be considered in this category. AI approaches that can be seen as attempts to completely build an &#8216;artificial human&#8217; are considered to be in this group: natural language processing, knowledge representation, automated reasoning, machine learning, vision, robotics, etc.</p>
<p style="text-align:justify;"><strong>3. Thinking Rationally</strong><br />
The logicist tradition occupies this category. Focus: represent knowledge, make logical inferences on it to make decisions.</p>
<p style="text-align:justify;"><strong>4. Acting Rationally</strong><br />
The main approach considered in the book. Any approach that attempts to build an agent whose actions can be considered rational can be considered to be in this category.</p>
<p style="text-align:justify;">For the first two categories of approaches, success would be measured with respect to a human &#8211; which is quite a difficult task. With the latter two approaches, it would be measured to a more well defined rationality.</p>
<p style="text-align:justify;">In my opinion, it is important to notice that the categorization is not absolute and there is considerable overlap; particularly, the last two categories do not have a clear boundary. From one perspective, to &#8220;think rationally&#8221; is to decide how to infer the most rational action to take, and hence, category 3 is included in category 4. On the other hand, if it can be considered that any agent that makes a rational action has implicitly made a rational decision, then category 4 is included in category 3.</p>
<p style="text-align:justify;">The book cites the following example: <em>&#8220;For example, recoiling from a hot stove is a reflex action that is usually more successful than a slower action taken after careful deliberation&#8221;</em>. I disagree with this example, since the decision to recoil is a rational decision, taken after inferring that there is not enough time to engage in more complex decision making and that an action must be taken as quickly as possible. <em>i.e. The example is a case of making logical inferences with temporal logic.</em></p>
<p style="text-align:justify;"><span style="color:#800000;"><strong>Question:</strong> To which category do neural network approaches fall?</span></p>
<h2 style="text-align:justify;"><strong>Philosophy</strong></h2>
<p style="text-align:justify;">Since Aristotle&#8217;s syllogisms there have been attempts to create formal reasoning systems. Philosophy discusses the limits and possiblities of logic, and also of AI.</p>
<p style="text-align:justify;">Some important terms introduced are:<br />
<strong>1. Dualism:</strong> the idea that there is a immaterial part to the universe, that the mind is not a manifestation of the physical brain. (Was this Descartes&#8217; big mistake?)</p>
<p style="text-align:justify;"><strong>2. Materialism:</strong> the physical universe is all that exists.</p>
<p style="text-align:justify;"><strong>3. Empiricism: </strong>knowledge is distilled from one&#8217;s experiences.</p>
<p style="text-align:justify;"><strong>4. Hume&#8217;s principle of induction:</strong> general rules are aquuired by exposure to repeated association between their elements. <span style="color:#800000;">(Is this the same as pattern recognition?)</span></p>
<p style="text-align:justify;"><strong>5. Logical positivism:</strong> all knowledge can be characterised by logical theories, and all meaningful statements of that logic system can be verified or falsified. Developed by the infamous Vienna Circle.</p>
<p style="text-align:justify;"><strong>6. Confirmation Theory:</strong> attempts to understand how to do induction; i.e. how knowledge can be aquired from experience.</p>
<p style="text-align:justify;">One good example of how Philosphy has lended to concrete AI systems is Newell and Simons&#8217; General Problem Solver (GPS). The basic regression planning algorithm used in GPS is none other than the one proposed by Aristotle: consider what the outcome is, and plan backwards from it &#8211; see what actions lead to the outcome, and then what actions lead to those actions, and so on.</p>
<p style="text-align:justify;">Goal based analysis is discussed in Philosophy with respect to the question of understanding the relation between thinking and acting.</p>
<h2 style="text-align:justify;"><strong>Mathematics</strong></h2>
<p style="text-align:justify;">The contribution from mathematics can roughly be categorised into three areas:</p>
<p style="text-align:justify;"><strong>1. Logic</strong><br />
While philosophy gave birth to logic, the mathematical development of logic into a formal system was what enabled it to become a strong tool that can be used for AI.</p>
<p style="text-align:justify;"><strong>2. Computation</strong><br />
<span style="text-decoration:underline;">Decidability</span><br />
This is linked to logic and number theory. Of particular interest is Hilbert&#8217;s decision problem, which questions whether it was possible to find an algorithm to decide the truth value of any logical proposition involving the natural numbers &#8211; i.e. whether there were limits to the power of effective proof procedures.</p>
<p style="text-align:justify;"><span style="text-decoration:underline;">Computability</span><br />
G<em>ö</em>del&#8217;s findings tell that<br />
1) any statement expressed in first order logic that is true can be proved of it&#8217;s truth<br />
2) first order logic is not powerful enough to capture the principle of mathematical induction needed to characterise the natural numbers<br />
3) incompleteness theorem: any language expressive enough to capture the natural numbers fully (i.e. describe all the properties of the natural numbers) has, in that language, statements that are true, but their truth cannot be established by an algorithm. This means that some functions on the natural numbers cannot be computed by an algorithm.</p>
<p style="text-align:justify;">Turing brought the idea of computability &#8211; that is, to find which functions can be computed and which cannot. The Church-Turing thesis presents the idea of a Turing machine that is capable of computing any computable function.</p>
<p style="text-align:justify;"><span style="text-decoration:underline;">Tractability</span><br />
From a practical point of view, tractability is a much more useful concept to study, because even if a function is computable theoretically, it can be practically uncomputable because the resources required increase exponentially with respect to input size. Such problems are said to be intractable.</p>
<p style="text-align:justify;">Although it has not been proven (it is one of the remaining Millenium problems) it is generally assumed that NP-complete problems are intractable.</p>
<p style="text-align:justify;"><strong>3. Probability</strong><br />
The most important contribution from probability is Bayesian analysis, borne out of Baye&#8217;s theorem which shows how probabilities change when new conditions are added to an event. Bayesian analysis forms the basis of most approaches to dealing with uncertainity in AI.</p>
<h2 style="text-align:justify;"><strong>Economics</strong></h2>
<p style="text-align:justify;"><strong>1. Decision Theory</strong><br />
Utility theory investiages how decisions can be made leading to a peferred outcome; utility theory combines with probability results in decision theory, where decisions made under uncertain conditions can be investigated. Here probability captures the decision maker&#8217;s environment, and it is assumed that the decision maker&#8217;s world is not affected by others.</p>
<p style="text-align:justify;"><strong>2. Game Theory</strong><br />
If the decision maker also needs to consider what the other decision makers are doing, then such problems are studied in game theory.</p>
<p style="text-align:justify;"><strong>3. Operations Research</strong><br />
One important contribution from OR to AI is Markovian decision processes, where a number of sequencially made decisions are studied.</p>
<p style="text-align:justify;">Note that decision theory, game theory and operations research are fields that are difficult to classify only as either mathematics or economics, as they are a hybrid of both.</p>
<h2 style="text-align:justify;"><strong>Neuroscience</strong></h2>
<p style="text-align:justify;">Neuroscience can help in discovering how our brains manage to function as intelligent agents. Particularly the invention of fMRI has been very helpful in monitoring brain activity.</p>
<h2 style="text-align:justify;"><strong>Psychology</strong></h2>
<p style="text-align:justify;"><strong>1. Behaviourism</strong><br />
Behaviourism rejects mental introspection and only considers objective results of a psychological experiment. While behaviourism has helped to understand animal (non-human) behaviour, it has been less successful at understanding human behaviour.</p>
<p style="text-align:justify;">When you look at it, this should be expected from behaviourism. Behaviourism treats the intelligent-process of the agent as a black box and looks at only the inputs (percepts from the environment) and outputs (actions of the animal). It is reasonable to assume that the function which takes place inside the black box is simple and animals and much more complex in humans. Thus it would be easy the guess the function for animals, but not for humans.</p>
<p style="text-align:justify;"><strong>2. Cognitive Psychology</strong><br />
Cognitive psychology, in contrast, attempts to understand exactly the functionality that is taking place inside the brain, and does not exclude mental introspection (e.g. examining a persons beliefs and goals). Such mental introspection can be considered to be parts of a virtual representation of the world created in the brain.</p>
<p style="text-align:justify;">Cognitive science, where cognitive psychological models are developed as computed models, thus pay a lot of attention to how the external world can be represented in an intelligent agent.</p>
<h2 style="text-align:justify;"><strong>Control Theory and Cybernetics</strong></h2>
<p style="text-align:justify;">Control theory explores how an automated system can monitor and regulate itself. Cybernetics, from an AI perspective is mainly the application of control theory to computational models of cognition to produce AI.</p>
<p style="text-align:justify;">Due to the different areas of mathematics used in control theory and AI, there is some gap between the two fields. Control theory is built using calculus and matrix algebra, whereas AI (at least traditionally) used logic and computation. The problems of language, vision and planning, that were considered from AI perspectives, fell outside the domain of control theory due to the different mathematical tools used.</p>
<h2 style="text-align:justify;"><strong>Linguistics</strong></h2>
<p style="text-align:justify;">Chomsky introduced the idea of syntactic structures, which could capture the potential of a language so that it could be modeled computationally. This introduced linguistics to AI, resulting in the field of natural language processing. The main obstacle in NLP is understanding the context and subjec matter of the language (which is required due to the ambigious nature of human language), and thus knowledge representation is the field devoted to studying how information about the world can be captured into a computational structure so the information can be utilized by an intelligent agent.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Course on Practical Bayes for Beginners]]></title>
<link>http://ssaqld.wordpress.com/2009/07/22/course-on-practical-bayes-for-beginners/</link>
<pubDate>Wed, 22 Jul 2009 00:18:03 +0000</pubDate>
<dc:creator>ssaqld</dc:creator>
<guid>http://ssaqld.wordpress.com/2009/07/22/course-on-practical-bayes-for-beginners/</guid>
<description><![CDATA[Dear Colleagues, The QLD Branch in conjunction with QUT and the National Office of the Statistical S]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Dear Colleagues,</p>
<p>The QLD Branch in conjunction with QUT and the National Office of the Statistical Society of Australia is running a a Three-day course on Practical Bayes for Beginners, presented by Professor Kerrie Mengersen of the Centre for Data Analysis, Modelling &#38; Computation, QUT.</p>
<p>This will take place at the University of Queensland, Room S429, Hartley Teakle Building #83 University of Queensland, St Lucia, Brisbane on the 26th &#8211; 28th of August, 2009.</p>
<p>Given the trying times of the last year, as a branch and society, we really want to make this a success so I&#8217;m asking you to consider helping in the following ways:</p>
<p>1) Attending the workshop.<br />
2) If not attending passing the link think people you know who might be interested in attending.<br />
3) Promoting the workshop within your organisation.<br />
4) If you familiar with WinBugs and R and can assist as a &#8216;technical tutor&#8217; during the workshop let me know. You will be given a free place on the workshop.</p>
<p>It might be worth passing on to any interested parties, that by attending the workshop as a non-member of the SSAI, you are given membership of the society as part of your registration fee.</p>
<p>More details are available on the SSAI webiste: <a href="https://www.statsoc.org.au/CPD2">https://www.statsoc.org.au/CPD2</a></p>
<p>Kind Regards,</p>
<p>Helen</p>
<p>Helen Johnson, PhD &#124; Biostatistician &#124; Research Methods Group &#124; Institute of Health &#38; Biomedical Innovation &#124; Queensland University of Technology &#124; 60 Musk Ave, Kelvin Grove, Queensland, 4059, Australia &#124; t: +61 7 3138 6053 &#124; f: +61 7 3138 6030 &#124; e <a href="mailto:h.johnson@qut.edu.au">h.johnson@qut.edu.au</a> &#124; w: <span style="text-decoration:underline;">w</span><a title="BLOCKED::BLOCKED::BLOCKED::BLOCKED::http://www.ihbi.qut.edu.au/ BLOCKED::BLOCKED::BLOCKED::http://www.ihbi.qut.edu.au/ BLOCKED::BLOCKED::http://www.ihbi.qut.edu.au/ BLOCKED::http://www.ihbi.qut.edu.au/ http://www.ihbi.qut.edu.au/" href="http://www.ihbi.qut.edu.au/" target="_blank">ww.ihbi.qut.edu.au</a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Bayes is the New Black Presentation]]></title>
<link>http://tomschenkjr.net/2009/07/01/bayes-is-the-new-black-presentation/</link>
<pubDate>Thu, 02 Jul 2009 01:07:41 +0000</pubDate>
<dc:creator>Tom Schenk</dc:creator>
<guid>http://tomschenkjr.net/2009/07/01/bayes-is-the-new-black-presentation/</guid>
<description><![CDATA[From 2007 International Society for the History, Philosophy, and Social Science of Biology in Exeter]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>From 2007 International Society for the History, Philosophy, and Social Science of Biology in Exeter, England.</p>
<p><!-- SlideShare error: doc is missing or has illegal characters /[^-_a-zA-Z0-9]/ --></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Probabilidad de aprobar las OPOSICIONES]]></title>
<link>http://pinae.wordpress.com/2009/05/27/probabilidad-de-aprobar-las-oposiciones/</link>
<pubDate>Wed, 27 May 2009 16:27:28 +0000</pubDate>
<dc:creator>pinamix</dc:creator>
<guid>http://pinae.wordpress.com/2009/05/27/probabilidad-de-aprobar-las-oposiciones/</guid>
<description><![CDATA[LLega el momento de la VERDAD,&#8230;.. El día quince de mayo de 2009 se publicó en el diario de la ]]></description>
<content:encoded><![CDATA[LLega el momento de la VERDAD,&#8230;.. El día quince de mayo de 2009 se publicó en el diario de la ]]></content:encoded>
</item>
<item>
<title><![CDATA[Tempi lunghi]]></title>
<link>http://strategieevolutive.wordpress.com/2009/05/22/tempi-lunghi/</link>
<pubDate>Fri, 22 May 2009 12:11:27 +0000</pubDate>
<dc:creator>Davide</dc:creator>
<guid>http://strategieevolutive.wordpress.com/2009/05/22/tempi-lunghi/</guid>
<description><![CDATA[Non tutte le ciambelle&#8230; Il metodo bayesiano rapresenta un approccio alternativo allo studio st]]></description>
<content:encoded><![CDATA[Non tutte le ciambelle&#8230; Il metodo bayesiano rapresenta un approccio alternativo allo studio st]]></content:encoded>
</item>
<item>
<title><![CDATA[Bertrand's Box : Probabilities in Cards]]></title>
<link>http://math10blog.wordpress.com/2009/04/18/bertrands-box/</link>
<pubDate>Sat, 18 Apr 2009 23:03:14 +0000</pubDate>
<dc:creator>Eric Lightborn</dc:creator>
<guid>http://math10blog.wordpress.com/2009/04/18/bertrands-box/</guid>
<description><![CDATA[Wikipedia calls this a classic paradox of elementary probability theory. Who was this Bertrand? Take]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><a href="http://en.wikipedia.org/wiki/Bertrand%27s_box_paradox" target="_blank">Wikipedia calls this a classic paradox of elementary probability theory.</a></p>
<p>Who was this <a href="http://en.wikipedia.org/wiki/Calcul_des_probabilit%C3%A9s" target="_blank">Bertrand?</a></p>
<p style="text-align:center;"><em><strong>Taken from Wikipedia.org:</strong></em></p>
<h2 style="text-align:center;"><span class="mw-headline">Card version</span></h2>
<p>Suppose you have three cards:</p>
<ul>
<li>a <em>black card</em> that is black on both sides,</li>
<li>a <em>white card</em> that is white on both sides, and</li>
<li>a <em>mixed card</em> that is black on one side and white on the other.</li>
</ul>
<p>You put all of the cards in a hat, pull one out at random, and place it on a table. The side facing up is black. What are the odds that the other side is also black?</p>
<p>The answer is that the other side is black with probability 2/3. However, common intuition suggests a probability of 1/2 because across all the cards, there are 3 white, 3 black. However, many people forget to eliminate the possibility of the &#8220;white card&#8221; in this situation (i.e. the card they flipped CANNOT be the &#8220;white card&#8221; because a black side was turned over).</p>
<p>In a survey of 53 Psychology freshmen taking an introductory probability course, 35 incorrectly responded 1/2; only 3 students correctly responded 2/3.<span class="reference plainlinksneverexpand"><sup><a class="external autonumber" title="http://en.wikipedia.org/wiki/Bertrand%27s_box_paradox#endnote_53" rel="nofollow" href="http://en.wikipedia.org/wiki/Bertrand%27s_box_paradox#endnote_53">[1]</a></sup></span></p>
<h3><span class="mw-headline">Preliminaries</span></h3>
<p>To solve the problem, either formally or informally, we must assign <a title="Probability" href="http://en.wikipedia.org/wiki/Probability">probabilities</a> to the events of drawing each of the six faces of the three cards. These probabilities could conceivably be very different; perhaps the white card is larger than the black card, or the black side of the mixed card is heavier than the white side. The statement of the question does not explicitly address these concerns. The only constraints implied by the <a class="mw-redirect" title="Kolmogorov axioms" href="http://en.wikipedia.org/wiki/Kolmogorov_axioms">Kolmogorov axioms</a> are that the probabilities are all non-negative, and they sum to 1.</p>
<p>The custom in problems when one literally pulls objects from a hat is to assume that all the drawing probabilities are equal. This forces the probability of drawing each side to be 1/6, and so the probability of drawing a given card is 1/3. In particular, the probability of drawing the double-white card is 1/3, and the probability of drawing a different card is 2/3.</p>
<p>In our question, however, you have already selected a card from the hat and it shows a black face. At first glance it appears that there is a 50/50 chance (ie. probability 1/2) that the other side of the card is black, since there are two cards it might be: the black and the mixed. However, this reasoning fails to exploit all of your information; you know not only that the card on the table <em>has</em> a black face, but also that one of its black faces is facing you.</p>
<h3><span class="mw-headline">Solutions:</span></h3>
<h4><span class="editsection"></span><span class="mw-headline">Intuition</span></h4>
<p>Intuition tells you that you are choosing a card at random. However, you are actually choosing a face at random. There are 6 faces, of which 3 faces are white and 3 faces are black. Two of the 3 black faces belong to the same card. The chance of choosing one of those 2 faces is 2/3. Therefore, the chance of flipping the card over and finding another black face is also 2/3. Another way of thinking about it is that the problem is not about the chance that the other side is black, it&#8217;s about the chance that you drew the all black card. If you drew a black face, then it&#8217;s twice as likely that that face belongs to the black card than the mixed card.</p>
<h4><span class="mw-headline">Labels</span></h4>
<p>One solution method is to label the card faces, for example numbers 1 through 6.<span class="reference plainlinksneverexpand"><sup><a class="external autonumber" title="http://en.wikipedia.org/wiki/Bertrand%27s_box_paradox#endnote_Label16" rel="nofollow" href="http://en.wikipedia.org/wiki/Bertrand%27s_box_paradox#endnote_Label16">[2]</a></sup></span> Label the faces of the black card 1 and 2; label the faces of the mixed card 3 (black) and 4 (white); and label the faces of the white card 5 and 6. The observed black face could be 1, 2, or 3, all equally likely; if it is 1 or 2, the other side is black, and if it is 3, the other side is white. The probability that the other side is black is 2/3.</p>
<h4><span class="mw-headline">Bayes&#8217; theorem</span></h4>
<p>Given that the shown face is black, the other face is black if and only if the card is the black card. If the black card is drawn, a black face is shown with probability 1. The total probability of seeing a black face is 1/2; the total probability of drawing the black card is 1/3. By <a title="Bayes' theorem" href="http://en.wikipedia.org/wiki/Bayes%27_theorem">Bayes&#8217; theorem</a>,<span class="reference plainlinksneverexpand"><sup><a class="external autonumber" title="http://en.wikipedia.org/wiki/Bertrand%27s_box_paradox#endnote_Bayes" rel="nofollow" href="http://en.wikipedia.org/wiki/Bertrand%27s_box_paradox#endnote_Bayes">[3]</a></sup></span> the conditional probability of having drawn the black card, given that a black face is showing, is</p>
<dl>
<dd><img class="tex" src="http://upload.wikimedia.org/math/2/c/b/2cb55cec2b0f1d1efe15674df4ccc4d5.png" alt="\frac{1\cdot1/3}{1/2}=2/3." /></dd>
</dl>
<p><a name="Eliminating_the_white_card"></a></p>
<h4><span class="mw-headline">Eliminating the white card</span></h4>
<p>Although the incorrect solution reasons that the white card is eliminated, one can also use that information in a correct solution. Modifying the previous method, given that the white card is not drawn, the probability of seeing a black face is 3/4, and the probability of drawing the black card is 1/2. The conditional probability of having drawn the black card, given that a black face is showing, is</p>
<dl>
<dd><img class="tex" src="http://upload.wikimedia.org/math/b/3/5/b356294f95d594fafdb738f1c13aa233.png" alt="\frac{1/2}{3/4}=2/3." /></dd>
</dl>
<p><a name="Symmetry"></a></p>
<h4><span class="mw-headline">Symmetry</span></h4>
<p>The probability (without considering the individual colors) that the hidden color is the same as the displayed color is clearly 2/3, as this holds <a title="If and only if" href="http://en.wikipedia.org/wiki/If_and_only_if">if and only if</a> the chosen card is black or white, which chooses 2 of the 3 cards. <a title="Symmetry" href="http://en.wikipedia.org/wiki/Symmetry">Symmetry</a> suggests that the probability is <a class="mw-redirect" title="Statistically independent" href="http://en.wikipedia.org/wiki/Statistically_independent">independent</a> of the color chosen. (This <em>can</em> be formalized, but requires more advanced mathematics than yet discussed.)</p>
<h4><span class="mw-headline">Experiment</span></h4>
<p>Using specially constructed cards, the choice can be tested a number of times. By constructing a fraction with the <a class="mw-redirect" title="Denominator" href="http://en.wikipedia.org/wiki/Denominator">denominator</a> being the number of times &#8220;B&#8221; is on top, and the <a title="Numerator" href="http://en.wikipedia.org/wiki/Numerator">numerator</a> being the number of times both sides are &#8220;B&#8221;, the experimenter will <em>probably</em> find the ratio to be near 2/3.</p>
<p>Note the logical fact that the B/B card contributes significantly more (in fact twice) to the number of times &#8220;B&#8221; is on top. With the card B/W there is always a 50% chance W being on top, thus in 50% of the cases card B/W is drawn, card B/W virtually does not count. Conclusively, the cards B/B and B/W are not of equal chances, because in the 50% of the cases B/W is drawn, this card is simply &#8220;disqualified&#8221;.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Stats Seminars at UQ]]></title>
<link>http://ssaqld.wordpress.com/2009/03/24/stats-seminars-at-uq/</link>
<pubDate>Tue, 24 Mar 2009 05:29:40 +0000</pubDate>
<dc:creator>ssaqld</dc:creator>
<guid>http://ssaqld.wordpress.com/2009/03/24/stats-seminars-at-uq/</guid>
<description><![CDATA[Members are advised of the following statistics talks taking place this week at UQ. Thanks to Ian Wo]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Members are advised of the following statistics talks taking place this week at UQ. Thanks to Ian Woods for passing this information on for the benefit of members!</p>
<p>Where: each in 67-641, Priestley Building (67), room 641 (6th floor), University of Queensland, St. Lucia.</p>
<p style="text-align:center;"><strong>A Bayesian Non-Parametric Approach to Estimating Minimum Phase Transfer Functions</strong></p>
<p style="text-align:center;"> (2pm Wednesday 25/3/09)</p>
<p style="text-align:center;">Ross McVinish</p>
<p style="text-align:center;">School of Mathematics and Physics, University of Queensland</p>
<p> In this seminar, I present a Bayesian nonparametric approach to the estimation of a stable, minimum phase transfer function. A minimum phase transfer function has no zeros outside the stability region, which implies that its phase component can be uniquely determined from its magnitude component, a property known as the Bode phase-magnitude relationship. The proposed approach specifies the prior for the magnitude of the transfer function and uses the Bode phase-magnitude relationship to determine the transfer function. The rate at which the posterior distribution concentrates around the true transfer function is derived. Posterior computation using Markov chain Monte Carlo simulation is briefly discussed and the approach illustrated on simulated data. For information about the speaker, see <a href="http://www.maths.uq.edu.au/MASCOS/people.html">http://www.maths.uq.edu.au/MASCOS/people.html</a></p>
<p> </p>
<p style="text-align:center;"><strong>Efficient simulation for the tail probability of a sum of the components of log-elliptical random vectors</strong></p>
<p style="text-align:center;"> (10am Thursday 26/3/09)</p>
<p style="text-align:center;"> Leonardo Rojas Nandayapa (joint work with Soren Asmussen and José Blanchet) Department of Mathematical Sciences, Aarhus University, Denmark</p>
<p style="text-align:left;">Tail probabilities of the sum of the components of a Log-elliptical random vector are considered. These random vectors are obtained by an exponential transformation of random vectors with elliptical distributions &#8211; a large class of multivariate distributions which includes the multivariate normal, the multivariate-t, normal mixtures and generalized hyperbolics among others. After a brief introduction to this class of distributions, a Monte Carlo estimator for the tail probability of the sum is proposed and analyzed. In particular, for the multivariate lognormal case it is shown to have optimal theoretical properties. For information about the speaker, see <a href="http://www.thiele.au.dk/index.php?id=13&#38;tx_thielecv_pi1[showUid]=12&#38;cHash=01337ad499">http://www.thiele.au.dk/index.php?id=13&#38;tx_thielecv_pi1[showUid]=12&#38;cHash=01337ad499</a></p>
<p style="text-align:center;"><strong>Estimating Change-Points in Biological Sequences via the Cross-Entropy Method</strong></p>
<p style="text-align:center;">(2pm Thursday 26/3/09)</p>
<p style="text-align:center;">Gareth Evans School of Mathematics and Physics, University of Queensland</p>
<p style="text-align:left;">The genomes of complex organisms, including the human genome, are known to vary in GC content along their length. That is, they vary in the local proportion of the nucleotides G and C, as opposed to the nucleotides A and T. Changes in GC content are often abrupt, producing well-defined regions. We model DNA sequences as a multiple change-point process in which the sequence is separated into segments by an unknown number of change-points, with each segment supposed to have been generated by a different process. Multiple change-point problems are important in many biological applications, particularly in the analysis of DNA sequences. Multiple change-point problems also arise in segmentation of protein sequences according to hydrophobicity. We use the Cross-Entropy method to estimate the positions of the change-points. Parameters of the process for each segment are approximated with maximum likelihood estimates. Numerical experiments illustrate the effectiveness of the approach. We obtain estimates of the locations of change-points in artificially generated sequences and compare the accuracy of these estimates with those obtained via other methods such as IsoFinder and Markov Chain Monte Carlo. Lastly, we provide examples with real data sets to illustrate the usefulness of our method. For information about the speaker, see <a href="http://www.maths.uq.edu.au/staff_visitors/academic_page.php?id=307">http://www.maths.uq.edu.au/staff_visitors/academic_page.php?id=307</a></p>
<p style="text-align:left;">UQ Statistics seminars web page: <a href="http://www.maths.uq.edu.au/statistics/seminars/UQ%20Statistics%20Seminars.html">http://www.maths.uq.edu.au/statistics/seminars/UQ%20Statistics%20Seminars.html</a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Response Times]]></title>
<link>http://brainsandrobots.wordpress.com/2009/03/19/response-times/</link>
<pubDate>Thu, 19 Mar 2009 07:48:30 +0000</pubDate>
<dc:creator>vicentemalave</dc:creator>
<guid>http://brainsandrobots.wordpress.com/2009/03/19/response-times/</guid>
<description><![CDATA[I&#8217;m also reading R.Duncan Luce&#8217;s classic &#8220;Response Times&#8221; book about reactio]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I&#8217;m also reading R.Duncan Luce&#8217;s classic &#8220;Response Times&#8221; book about reaction times, and doing fancier statistics on them. Seems like an important topic, and I have some projects in mind.</p>
</div>]]></content:encoded>
</item>

</channel>
</rss>
