<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>power-law &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/power-law/</link>
	<description>Feed of posts on WordPress.com tagged "power-law"</description>
	<pubDate>Tue, 01 Dec 2009 19:18:43 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[Networks; The Power of Hubs]]></title>
<link>http://idm09.wordpress.com/2009/11/03/networks-the-power-of-hubs/</link>
<pubDate>Tue, 03 Nov 2009 14:40:09 +0000</pubDate>
<dc:creator>acolucci</dc:creator>
<guid>http://idm09.wordpress.com/2009/11/03/networks-the-power-of-hubs/</guid>
<description><![CDATA[How do some  webpages on the Internet become so ubiquitous that we are rarely ever less than two sho]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>How do some  webpages on the Internet become so ubiquitous that we are rarely ever less than two short links away from viewing that page? How have some webpages become so popular among the hundreds of billions of webpages that are on the Internet currently, not to mention the new ones that pop up almost every second? More importantly how do certain pages become the centre for web activity and lead us to other pages within their network? Albert-Lázló Barabási explored these questions about networks and the organization/disorganization of interconnectivity within his Chapter titled <em>Hubs and Connectors</em> .</p>
<p>Barabási begins the piece by explaining both the power and importance of links within the web. Quite simply the larger number of<!--more--> incoming links that direct visitors to your webpage then the higher the number of visitors you will have, which in turn increases your overall profile on the web. This sets your page apart from others, allowing for greater visibility and secures your place within the hiearchy as a destination site online. The perception of networks, its growth and mechanisms has traditionally followed the random worldview theory of networks as defined by Erdos-Rényi. Within this theory the interior of networks are connected at random, by nodes (this can be thought of as webpages online) and each node within the network is not distinguishable from another, meaning that they uniformly connect to the other nodes within the network the same number of times. (This example is presented in the previous <a href="http://idm09.wordpress.com/2009/11/01/its-a-small-world-after-all/#more-339" target="_blank">post</a>.) Hence the distribution of the links is equal between all the nodes thus abolishing any distinguishable character that one node may have over another. This distinguishable character within a network typically infers that there is “a few highly connected nodes, or hubs.” (58) According to the random network theory hubs could not exist because all nodes share the same amount of links to other nodes. Additionally the total number of nodes is static over time, leaving no room for growth in terms of the number of nodes existing within the network. Typically this theory has been represented by bell curves; in which various nodes are dispersed randomly and all connect to each other an equal amount of times, thus creating a peak within the middle of the graph. Accordingly within this random network theory all webpages would have an equal opportunity to be seen and heard. Barabási observes that this theory of networks is quite simply not true, and holds no ‘real world’ application and/or example.</p>
<p>He contests that the Erdos-Rényi theory of networks, which consist of nodes that are randomly connected with “roughly the same number of incoming links”, is not applicable at all to the hierarchal pattern of popular webpages prevalent within the web today. Barabási states that some webpages such as Amazon.com, Yahoo! etc. have an exceptional amount of incoming links and thus dominate the web through the sheer enormity and range of their incoming links from various networks across the web. Webpages like Amazon inherently become the hubs of the Internet. Additionally Barabási notes that just like actors within Hollywood (as observed within the Kevin Bacon phenomena, in which Bacon is only 2.79 degrees removed from everyone within the Hollywood network) the strength of these Internet hubs do not rely on the sheer size of the network but the range. This means that the links these nodes make to other various distant networks adds range to the network that the hubs possess. This adds variety and brings dissimilar things together. In contrast to hubs, Barabási points out that his own webpage is so insignificant within the web, due to the near infinite range of the Internet network, that the chance of a person viewing it is about forty in a billion. Within this contrast lies the main concerning question of the Internet and it’s network; how do some pages become these hubs of the Internet, in comparison to the majority of pages that exists in near obscurity?</p>
<p>Barabási claims that the theory and research of networks as represented by Erdos-Rényi random network theory fails to answer this question because of it’s fundamental underlying assumptions; <sup>1</sup> Static: the amount of nodes in a Erdos-Rényi network are set, meaning there is no room for growth, which is in exact opposition to the nature of the web itself and <sup>2 </sup>Equality (Random Interconnectivity): each node within the network contains an equal amount of links to the other nodes thus making each node indistinguishable from the last, hence the nodes are linked randomly and are equal (there can be no room for hub webpages such as Google!). Within the random network’s nodes are considered a “characteristic <em>scale</em>[,]<em> </em>embodied by the average node and fixed by the peak degree of distribution.” (pg70)</p>
<p><em> </em>Barabási dismisses this entire method of perceiving networks, and states that the actual distribution, connection and dominance (or lack thereof) certain webpages within the Internet is determined by inverting the two random network assumptions. The new foundations of network theory are ruled by the following;</p>
<p>“<sup>1</sup>Growth: For each give period of time we add a new node to the network. This step underscores the fact that networks are assembled one node at a time</p>
<p><sup>2 </sup>Preferential Attachment: We assume that each new node connects to the existing nodes with two links. The probability that it will choose a given node is proportional to the number of links the chosen node has. That is, given the choice between two nodes, one with twice as may link as the other; it is twice as likely that the new node will connect to the more connected node”.  (pg 86)</p>
<p>These foundations are further represented not in bell curves but by the Italian Economist Pareto’s 80/20 Rule and the power law phenomena it produces. Within a power law ruled network the majority of nodes have only a few links (incoming and outgoing) and these tiny nodes co-exist with a few big hubs that are highly connected with many nodes that traverse various differing networks. The power law exemplifies Pareto’s Rule, which states that 80% of effects are derived only from 20% of causes. The approximate 80/20 rule applied to the web means that “80% of the links on the web point to only 15% of total webpges”. (66)</p>
<p>Pareto’s 80/20 Rule and the power law graph that is supports provides us with a very different perspective on networks, especially according to how Barabási perceives networks in the Internet. The <strong>power law graph</strong> shows a slope that rises and then declines running horizontally parallel out to infinity.  The nodes located with this horizontal section account for the 80% of wepages and the peak of the slope which is located on the left hand side of the graph closest to the vertical line is the 15% of webpages that account for hubs. This unequal distribution of incoming links as presented within the power law graph is a more accurate description of the current state of the Internet. </p>
<p style="text-align:center;"> <img class="aligncenter" title="Power Law Barabási Graph " src="http://www.macs.hw.ac.uk/~pdw/topology/Pictures/S-power.jpg" alt="" width="323" height="265" /></p>
<p> As Barabási describes;</p>
<p>“The power law distribution thus forces us to abandon the idea of scale, or a characteristic node. In a continuous hierarchy there is no single node which we could pick out and claim to be characteristic of all the nodes. There is no intrinsic scale in these networks. This is the reason my research group started to describe networks with power-law degree distribution as scale-free.” (pg70)</p>
<p>This principle that describes the web and its interconnectivity is not a phenomenon distinctive to the web but applies to a variety of situations within human life.</p>
<p>The effect of power laws and the study of its presence is not a new phenomenon at all. In the last few years the existence of the Internet has dramatically changed how some view the various power laws at work within out world. <strong><a href="http://www.wired.com/wired/archive/12.10/tail.html">The Long Tail</a></strong> as it was coined by<a href="http://www.longtail.com/the_long_tail/"> <strong>Chris Anderson</strong> </a>has become a staple of what some believe will be the future for commerce, specifically within the music industry. Anderson felt that the hubs within the music industry were an effect of the old 20<sup>th</sup> century system of hits culture which encompassed production, manufacturing and the distribution of music. These hubs within music could be seen <img class="alignright" title="Long Tail shows head and tail of power law" src="http://www.leftclick.com/images/illustrations/long-tail.png" alt="" width="400" height="300" />as various major record labels, major publishing companies, big name acts, and their hit repertoire. For many years the music industry was ruled by power laws, with only 20% (termed by Anderson as the ‘head’ of the power law) of the products (the term products primarily refers to music recordings like CD’s. LP vinyl etc) accounted for 80%( termed by Anderson as the ‘tail’ of the power law) of the sales.  The 20%, of  the ‘head’, are comprised of the goliaths of the music industry such as Michael Jackson’s classic <a href="http://wiki.answers.com/Q/How_many_copies_of_Michael_Jacksons_thriller_have_sold">‘<em>Thriller’ </em></a>or by more recent releases like Rihanna’s album <em><a href="http://en.wikipedia.org/wiki/Good_Girl_Gone_Bad#Charts_and_certifications">‘A Good Girl Gone Bad’</a></em>. The other 80% of niche genre music (such as instead of the main genre of Rock, you want Acid Rock) was virtually inaccessible, because the justification of producing, manufacturing, distributing and stocking this niche music  would not be meet via mass profits…until now.</p>
<p>Anderson proposed that with the democratization of distribution via Internet retail sites like Amazon, anyone could find and purchase a piece of music that they enjoyed. By riding the tail of the power law to the outer reaches music lovers could find music that was deeper within the niche of music that they always wanted to own, and/or discover new music. Additionally if we are to take a look at Barabási’s notion of networks within the Internet, the hubs of music such as Jay-Z could refer music lovers to other musicians within that rap/hip-hop genre of music like Nas, and the links from the Nas node could lead to other lesser known acts.</p>
<p>The sheer enormity of information and access that the Internet provides would allow for music lovers (or a consumer of any product/service) to be able to find exactly what they want, and be able to obtain that item. Through the <a href="http://www.wired.com/wired/archive/12.10/tail.html"><strong>Long Tail</strong> </a>Anderson argues that we could begin to focus more on the niche products (or in this case music genres) that for so long have been relegated to the sidelines as the hit parade came into town. In fact Anderson argues that in terms of commerce businesses could focus on selling &#8216;alot of a little&#8217;. For example Anderson starts with just exploring the ‘A’ section of music genres and discovers a niche of Afro-Cuban Jazz. Although this isn’t a big genre within the ‘A’ section, in it there are hubs  like artist Tito Puente. You could focus as a retailer on having Tito Puente in addition to other artists within the same genre, like the Buena Vista Social Club. You may only sell a few copies of each artist, but the total of that entire sale of those artists in combination would equate to a profit either equal or similar to that amount if you might have sold if you were just focusing on the ‘head. Accordingly this would mark the end of hits, or is it?</p>
<p>Kanye, Lil&#8217;Wayne, Miley Cyrus, Pink, Kings of Leon etc, are all still selling millions of copies of their albums (or at the very least a solid half a million). These are mass marketed and mass produced artists, and more importantly as hubs of the music industry they still influence alot of other artists that are developing. Although in our preferences we might become very niche, the power and influence of hubs still remains. Hence although the playing field has been leveled, and as consumers we can have access to music like never before, we still need some sort of guidance. Anderson calls this &#8216;filtering&#8217;, which sites like Amazon with its Recommendations technology and Google through its search engine provide by offering a means of shifting through all of the products available.</p>
<p> However, once again he is missing the bigger point of networks and the position of hubs within that network. For many music lovers finding new music is a journey that starts with a hub like John Mayer and then through the connections that artist makes as a hub whether it’s through genre, featured artists on his album or label mates, music lovers had to start at the ‘head’ of the power law and then ride to the ‘tail’ to find that music. If those hubs didn’t exist then discovery would be highly difficult. The  Erdos-Rényi random network theory was disproved because of this very thing;  if all the nodes have scale and there are no distinguishable characteristics making any one node different from another than the same information is passed through that type of network. Within the music industry this would mean that no new music would be able to be found, and by only focusing on the niches the bigger picture of music trends and tastes would be ignored. Hubs need niches and niches need hubs. Anderson’s continued crusade against hubs or ‘hits’ destabilizes the entire network theory; and its applications within music, Internet and life in general. Anderson fails to realize that as within any network; when a node has a choice between linking between two other nodes it will most likely choose the node that has the higher amount of links. The same can be said for music fans and webpages alike; hits are still hits.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[He has Connections... How can I get to know Him!!? Network.]]></title>
<link>http://idm09.wordpress.com/2009/11/03/he-has-connections-how-can-i-get-to-know-him-network/</link>
<pubDate>Tue, 03 Nov 2009 06:22:24 +0000</pubDate>
<dc:creator>eyc223</dc:creator>
<guid>http://idm09.wordpress.com/2009/11/03/he-has-connections-how-can-i-get-to-know-him-network/</guid>
<description><![CDATA[The reading for this week, Linked by Albert-Laszlo Barabasi disproves the idea of randomness in netw]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>The reading for this week, <em>Linked </em>by Albert-Laszlo Barabasi disproves the idea of randomness in networks and champions the idea of hubs and connectors that function to bring together society. The idea behind linked and being connected to others is not new nor is it phenomenal: it is in our blood. This is our history and thus, inevitable. The Bible clears demonstrates this connection as in the beginning, there was Adam, then Eve, then Cain and Able. After that, humanity arises and creates towns to settle and we develop. Thus, we are all connected or linked in some way. If one is agnostic or believes in science, then think of evolution. People evolved as running from animals to hunting them in groups. Once in groups, they became nomads, and then quite surely, someone decided to settle down in a location, therefore forming a town. Thus a hub is created and with it, streets, other towns, etc…</p>
<p>It is divided into 3 chapters. I will discuss each chapter separately by its part and tie it together at the end. They are all connected; rather, they are components of a larger truth.</p>
<p><strong>The Fifth Link</strong>: Malcolm Gladwell’s <em>The Tipping Point</em> notices an interesting phenomenon: “sprinkled among every walk of life…are a handful of people with a truly extraordinary knack of making friends and acquaintances. They are connectors” (55). This brings in the idea of connectors as highly relevant components of our social lives. Another way to define connectors is by calling them hubs. These hubs have a large number of links. One can create the analogy of a large city, such as Los Angeles and the suburbs that surrounds it. Many highways enter and go out of Los Angeles, connecting many different cities along the way. This discovery has basically disproven Erdos-Renyi’s theory of random worldview and Watts and Strogatz’s simple circle network model. Social networks exhibit clusters and hubs, both points that demonstrate a worldly rule that applies.</p>
<p>We can also apply the theory of links and connectors/hubs to cyberspace. The World Wide Web is the ultimate space for freedom, an environment that represents and defines the meaning of space without limits, boundaries, nor rules. Anyone can publish their work online and allow anyone to see it. Problem is, there are billions of websites. The question that arises is visibility. On the Web, the “measure of visibility is the number of links. The more incoming links pointing to your Webpage, the more visible it is” (57). As such, only websites that become hubs are highly visible (Amazon, Google). These are hubs: connectors with enormous links to other nodes.</p>
<p>I can create a website, thus it becomes a node. Google has grown, thus it has become a hub. Without a link, both websites exist and move in different worlds, or just space. The internet is as big as the universe, each node moves on its own, until it makes contact. This is the link. My website can function in its own galaxy as networks tend to form a cluster. Clusters are “nodes that are linked only to nodes in their subculture or genre” (61).  As a cluster, it is easier to find a common connection to a hub. For example, if my website is about knee pains, then I would like to create some link to a popular website such as WebMD. This in turn, can allow me to link with Google. In two links, my website has escaped randomness and can be visible.</p>
<p><strong>The Sixth Link</strong>: Vilfredo Pareto may be a well known Italian economist, but I feel that his thought process behind the 80/20 Rule is sheer brilliance. It is all around us, and yet, he is the first to notice that it applies to the world. This is Simple Genius: understanding a law within the world that is so visible, and yet, invisible to the eye. 80/20 Rule applies to many things, but is generally regarded as 20 own rest of the 80. This also applies to network as well. It can be proven through a mathematical expression called a power law. In contrast to a bell curve, which is a “distribution rather similar to the peaked distribution characterizing random networks,” (67) the power law is by definition a special kind of mathematical relationship between two quantities. The exact definition is as follows: if one quantity is the frequency of an event, the other is the size of the event, then the relationship has a power law distribution when the frequency of the event decreases at a greater rate than the size increases. In layman terms, think of 20% of the population ruling 80% of the world.</p>
<p>Power laws basically functions to prove mathematically the fact that in most “real networks the majority of nodes have only a few links and that these numerous tiny nodes coexist with a few big hubs, nodes with an anomalously high number of links” (70). Again, this relates to Gladwell’s idea of connectors, but proving quantitatively that it exists within the realm of the World Wide Web. It demonstrates that real networks are not random at all, but exist under the power law. The interesting idea that Barabasi proposes is that in the beginning, nodes tend to be chaotic and without any form or order. However, through time, this disorder turned into order through self-organization. Under the theory of phase transitions, real networks demonstrate self creation from disorder into the 80/20 Rule. Barabasi’s idea is highly compelling, as he demonstrates quantitative analysis and by using the Web, a blank space which is like a universe unto itself, to explain a simple law that applies to the world. Everyone is intricately linked, because we are social beings. Networking is only a function of humanity, which always existed, but has now been proven through the power law. A question that arises is why do hubs and links form and how does it form in such a manner that to the naked eye is so random and chaotic, and yet, has order.</p>
<p><strong>The Seventh Link</strong>: Erdos and Renyi’s model of networks rely on two principles. First is the idea that all nodes are fixed and remains unchanged throughout the network’s life. Second is that all nodes are equivalent. However, this is obviously not the case as there exists hubs/connectors, links, and change from disorder to order through self-organization. To understand how it does this, we must understand that the web is constantly growing. It is changing and growing. This is quite self-explanatory. There has been exponential growth of websites on the World Wide Web. As there is growth, we can safely disprove the static nature of Erdos-Renyi’s model of networks. We also do not randomly decide on which websites to link. We choose based upon our knowledge and social upbringing. We prefer certain websites over others because we are comfortable and familiar with that certain product. Thus, rises the concept of hubs. Barabasi brings up the idea of preferential attachment: “when choosing between two pages, one with twice as many links as the other, about twice as many people link to the more connected page. While our individual choices are highly unpredictable, as a group we follow strict patterns” (85). In many sense, this is true. We are all sheep and follow the leader.</p>
<p>In truth, randomness does not really exist unless it is the role of a die. Linking between networks is not random. Though unmentioned as third criteria, I believe that popularity and attractiveness plays an important role in the addition of links and creation of hubs. Webpages that have more links are more likely to be “linked to again” (86). Thus, there exists first person advantage. Older nodes have greater chances to become bigger and eventually rise as a hub. As a senior member within a link, this node has greater links and more nodes want to be linked to that certain node. Barabasi has given compelling evidence that real networks are not random, but constantly evolving and growing, attracting more links through the legitimacy of the power law.</p>
<p>We are living in a complex world and yet, guided by the invisible hand or law that is, in my opinion, inexplicable. The idea of real networks applies not only to websites, but to humanity in large. I believe that Barabasi’s point is that humans are social beings and we are followers. There are different people: some are leaders and some are followers. This difference guides the principle of 80/20 Rule as well as the power law. Barabasi has done something quite remarkable: take a simple, obvious, and yet invisible rule, and proved it scientifically and gave it a name. Kudos.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[New paper: Ecological modeling of paleocommunity food webs]]></title>
<link>http://proopnarine.wordpress.com/2009/10/30/new-paper-ecological-modeling-of-paleocommunity-food-webs/</link>
<pubDate>Fri, 30 Oct 2009 22:40:36 +0000</pubDate>
<dc:creator>proopnarine</dc:creator>
<guid>http://proopnarine.wordpress.com/2009/10/30/new-paper-ecological-modeling-of-paleocommunity-food-webs/</guid>
<description><![CDATA[Roopnarine, P. D. 2009. Ecological modeling of paleocommunity food webs. in G. Dietl and K. Flessa, ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><a href="http://www.flickr.com/photos/57896849@N00/545874636/" title="2_times_diversity_network.png by peterroopnarine, on Flickr"><img src="http://farm2.static.flickr.com/1290/545874636_4118d2e4b5_t.jpg" width="100" height="96" alt="2_times_diversity_network.png" align="left" /></a></p>
<p>Roopnarine, P. D. 2009. Ecological modeling of paleocommunity food webs. in G. Dietl and K. Flessa, eds., Conservation Paleobiology, The Paleontological Society Papers, 15: 195-220.</p>
<p>Find the paper here:<br />
<a href="http://zeus.calacademy.org/roopnarine/Selected_Publications/Roopnarine_09.pdf">http://zeus.calacademy.org/roopnarine/Selected_Publications/Roopnarine_09.pdf</a><br />
or here<br />
<a href="http://zeus.calacademy.org/publications/">http://zeus.calacademy.org/publications/</a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Power law confirmed]]></title>
<link>http://proopnarine.wordpress.com/2009/10/25/power-law-confirmed/</link>
<pubDate>Sun, 25 Oct 2009 03:58:53 +0000</pubDate>
<dc:creator>proopnarine</dc:creator>
<guid>http://proopnarine.wordpress.com/2009/10/25/power-law-confirmed/</guid>
<description><![CDATA[Species-level trophic link distributionOkay, this post just disappeared, so let&#8217;s try again. T]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><div id="attachment_484" class="wp-caption alignleft" style="width: 310px"><img src="http://proopnarine.wordpress.com/files/2009/10/expanded_guild_trophic_link_distrib1.png?w=300" alt="Species-level trophic link distribution" title="expanded_guild_trophic_link_distrib" width="300" height="231" class="size-medium wp-image-484" /><p class="wp-caption-text">Species-level trophic link distribution</p></div>Okay, this post just disappeared, so let&#8217;s try again. The updated and correct coral reef food web comprises 759 species. The incoming trophic link distribution, when expanded to the species level (compared to the guild level in the previous post), is a definite power law distribution. The log-transformed data (see figure) yield a function of <img src='http://l.wordpress.com/latex.php?latex=y+%3D+11196x%5E%7B-1.98%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='y = 11196x^{-1.98}' title='y = 11196x^{-1.98}' class='latex' />, i.e. <img src='http://l.wordpress.com/latex.php?latex=%5Cgamma%3D1.98&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\gamma=1.98' title='\gamma=1.98' class='latex' />. See the earlier coral reef posts to understand why this is significant.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Coral reef trophic levels, &amp; update]]></title>
<link>http://proopnarine.wordpress.com/2009/10/23/coral-reef-trophic-levels-update/</link>
<pubDate>Fri, 23 Oct 2009 19:50:53 +0000</pubDate>
<dc:creator>proopnarine</dc:creator>
<guid>http://proopnarine.wordpress.com/2009/10/23/coral-reef-trophic-levels-update/</guid>
<description><![CDATA[Guild-level trophic link distribution Spent a great week at the Annual Meeting of the Geological Soc]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><div id="attachment_475" class="wp-caption alignleft" style="width: 310px"><img src="http://proopnarine.wordpress.com/files/2009/10/metanetwork_trophic_link_distrib.png?w=300" alt="Guild-level trophic link distribution" title="metanetwork_trophic_link_distrib" width="300" height="231" class="size-medium wp-image-475" /><p class="wp-caption-text">Guild-level trophic link distribution</p></div>
<p>Spent a great week at the Annual Meeting of the <a href="http://geosociety.org/">Geological Society of America</a>. The <a href="http://www.paleosoc.org/">Paleontology Society</a> session on Conservation Paleobiology was a lot of fun, and my students also presented great posters. Now back to the coral reef.</p>
<p>I&#8217;ve been cleaning up the data, because with some much data, errors are bound to creep in. I believe that the current data are now accurate, and the metanetwork statistics are 265 guilds (including primary producers) and 4,651 links. That yields a metanetwork connectance of 0.066. The link distribution should therefore also be different, and indeed it is. The figure shows the no. of links per guild, and the regression plot demonstrates that the distribution is still a power law distribution. The exponent is smaller than previously calculated, (<img src='http://l.wordpress.com/latex.php?latex=%5Cgamma%3D1.54&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\gamma=1.54' title='\gamma=1.54' class='latex' />), but this is the guild-level network and does not reflect species richnesses (yet).</p>
<p><div id="attachment_477" class="wp-caption alignleft" style="width: 310px"><img src="http://proopnarine.wordpress.com/files/2009/10/g_trophic_levels.png?w=300" alt="Trophic level vs. no. of links" title="g_trophic_levels" width="300" height="231" class="size-medium wp-image-477" /><p class="wp-caption-text">Trophic level vs. no. of links</p></div>
<p>The next question that I&#8217;m looking at is the distribution of trophic levels among guilds and species. I therefore calculated trophic level for all guilds. The first figure (scatter plot) plots trophic level against the number of prey or incoming links to each guild. There are two things to notice: First, the variance of trophic levels decreases as the number of links, or diet generality of the guild increases. Second, the decrease in the variance is asymmetric, in that there is a bias against being a generalist of low trophic level. This is obvious if you look at all the empty space being vacated below the data points as no. of links increases. I can think of two non-exclusive explanations for this. If you think about a food chain, consumers toward the top of the chain simply have more prey to select from (on an evolutionary timescale), and therefore there should be a natural increase in the number of generalists as trophic level increases. Also, note that there are also many specialists of high trophic level. Perhaps the ability to exert power over other species, as a predator, combined with the previous statement, explains this observation. Finally, what is the distribution of trophic levels within the community? The second figure is a simple histogram plot of all non-primary consumer guilds (i.e. omnivores and carnivores). The distribution is approximately normal, with a definite central tendency. On average, most guilds in the reef are of similar trophic level! That&#8217;s very interesting. And referring to the previous scatter plot, we know that there is a biased composition in the tails of the distribution, in that the upper tail (higher trophic level) is a mixed composition of specialist to generalist guilds, but the lower tail is basically restricted to low trophic level specialists.</p>
<p><div id="attachment_478" class="wp-caption alignleft" style="width: 310px"><img src="http://proopnarine.wordpress.com/files/2009/10/g_trophic_level_histo.png?w=300" alt="Guild trophic level distribution" title="g_trophic_level_histo" width="300" height="176" class="size-medium wp-image-478" /><p class="wp-caption-text">Guild trophic level distribution</p></div>
<p>Some of you may have noticed that our trophic levels are non-integer numbers. Primary producers all occupy trophic level 1, and primary consumers are trophic level 2. &#8220;Above&#8221; that, trophic level is calculated on the basis of the trophic levels of your prey. Exactly how we do that will remain a secret for now.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[The Mathematics of War]]></title>
<link>http://alertindia.wordpress.com/2009/10/16/the-mathematics-of-war/</link>
<pubDate>Fri, 16 Oct 2009 03:20:30 +0000</pubDate>
<dc:creator>alertindia</dc:creator>
<guid>http://alertindia.wordpress.com/2009/10/16/the-mathematics-of-war/</guid>
<description><![CDATA[In the video below Sean Gourely gives a talk about how the Iraq war and war in general follows the p]]></description>
<content:encoded><![CDATA[In the video below Sean Gourely gives a talk about how the Iraq war and war in general follows the p]]></content:encoded>
</item>
<item>
<title><![CDATA[Coral reef species link distribution]]></title>
<link>http://proopnarine.wordpress.com/2009/10/01/coral-reef-species-link-distribution/</link>
<pubDate>Thu, 01 Oct 2009 22:07:41 +0000</pubDate>
<dc:creator>proopnarine</dc:creator>
<guid>http://proopnarine.wordpress.com/2009/10/01/coral-reef-species-link-distribution/</guid>
<description><![CDATA[Species-level trophic link distribution. The data presented in the previous post examined in-link or]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><div id="attachment_471" class="wp-caption alignleft" style="width: 310px"><img src="http://proopnarine.wordpress.com/files/2009/10/expanded_guild_trophic_link_distrib.png?w=300" alt="Species-level trophic link distribution." title="expanded_guild_trophic_link_distrib" width="300" height="231" class="size-medium wp-image-471" /><p class="wp-caption-text">Species-level trophic link distribution.</p></div>
<p>The data presented in the previous post examined in-link or in-degree distribution at the guild level, i.e. species are aggregated into ecological guilds. A comment on the previous post asked whether we&#8217;ve used any grouping algorithms for guild recognition, and the answer is no, at least not yet (and thanks again for the comment). The current guilds are based primarily on trophic habits and habitat, and other features such as the presence of photo- or chemosymbionts. Guild derived algorithmically would be based on species-level network topology, and ideally, the two would be very similar. Anyway, I noticed the comment when I logged on to post the current results. What I&#8217;ve done is to expand the guild-level network (metanetwork) to the species-level, and then re-examine the trophic link distribution. There is no guarantee that the two distributions should agree. For example, it is quite possible that guilds of high in-degree (lots of prey), though few in number, are very species rich, and hence one would lose the decay distribution at the species level. Conversely, guilds of low in-degree could be tremendously more species rich, and would expand disproportionately, when compared to high in-degree guilds, when expanded into member species. Nevertheless, for this dataset, when guilds are actually expanded from 255 consumer guilds to 704 consumer species, the scale-free nature of the distribution is reinforced. The new function is <i>y</i>=11158<i>x</i>^-1.981, implying a power law exponent very close to 2. Neat.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Coral reef food web II]]></title>
<link>http://proopnarine.wordpress.com/2009/09/30/coral-reef-food-web-ii/</link>
<pubDate>Wed, 30 Sep 2009 22:30:49 +0000</pubDate>
<dc:creator>proopnarine</dc:creator>
<guid>http://proopnarine.wordpress.com/2009/09/30/coral-reef-food-web-ii/</guid>
<description><![CDATA[Trophic link distribution What sort of network is the coral reef food web? In other words, how are t]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><div id="attachment_465" class="wp-caption alignleft" style="width: 310px"><img src="http://proopnarine.wordpress.com/files/2009/09/no_prey_histogram1.png?w=300" alt="Trophic link distribution" title="no_prey_histogram" width="300" height="176" class="size-medium wp-image-465" /><p class="wp-caption-text">Trophic link distribution</p></div>
<p>What sort of network is the coral reef food web? In other words, how are the links or interactions between nodes in a food web distributed? Food webs have been modelled variously as everything from random (Poisson) networks to networks based on exponential, power law or mixed distributions, with or without hierarchical structure. Empirical measures suggest that link distributions in <a href="http://en.wikipedia.org/wiki/Small-world_network">real world</a> food webs follow exponential or power law distributions, perhaps a mixture of both (differentiated by scale). One of my worries with those measures is that they are based on food webs of varying sizes, and more importantly, levels of taxonomic and ecological resolution. So, for example, how much does it matter if your food web covers only a small part of the community&#8217;s taxonomic diversity, or only part of the trophic diversity? What about the level of aggregation of species into more inclusive groups? The high resolution of the coral food web presents an opportunity to address some of these questions, and here&#8217;s the first one: How are trophic in-links distributed at the guild level? Recall that guilds here are groups of species with potentially the same prey and predators. I say potentially, for while we have very specific trophic data for some species, e.g. heavily studied fish, data are less certain for many smaller or less well known species. Still, there are 265 guilds in this dataset, and 4,756 links (see previous post). The histogram is a basic frequency histogram of the number of links per guild. As predicted on the basis of previously studied food webs, the distribution is a (right-skewed) decay distribution, with a greater number of species possessing fewer prey, i.e. being relative specialists, and a few species having a broad repetoire of prey, i.e. relative generalists. The extreme generalists (to the right or tail of the distribution) are all large sharks, the most extreme being the <a href="http://en.wikipedia.org/wiki/Tiger_shark">tiger shark, <i>Galeocerdo cuvier</i></a>. These species range from microscopic, single-celled <a href="http://en.wikipedia.org/wiki/Dinoflagellate">dinoflagellates</a> to large <a href="http://www.fishbase.org/Summary/FamilySummary.cfm?ID=11">carcharhinid sharks</a>!</p>
<p><img src="http://proopnarine.wordpress.com/files/2009/09/guild_trophic_link_distrib.png?w=300" alt="guild_trophic_link_distrib" title="guild_trophic_link_distrib" width="300" height="231" class="alignleft size-medium wp-image-467" />
<p>What type of distribution is this? A simple logarithmic transform of the data is shown in the second figure, and regression of the data yields the following function: <i>y</i> = 17238<i>x</i>^-1.9496 (r-squared=0.95). The significant and extremely good fit of a linear function to the transformed data suggests that the underlying link distribution is a power law distribution of the form <img src='http://l.wordpress.com/latex.php?latex=p%28r%29+%3D+M%5E%7B-%5Cgamma%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p(r) = M^{-\gamma}' title='p(r) = M^{-\gamma}' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=p%28r%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p(r)' title='p(r)' class='latex' /> is the link probability, <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='M' title='M' class='latex' /> is the number of prey available, and <img src='http://l.wordpress.com/latex.php?latex=%5Cgamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\gamma' title='\gamma' class='latex' /> is the power law exponent. An exponent of ~1.95 is tantalizingly close to other empirical measures. Even more exciting, for me at least, is the fact that we have predicted on the basis of <a href="http://zeus.calacademy.org/roopnarine/Selected_Publications/Roopnarine_etal_07a.pdf">previous work</a> that power law exponents that promote resistance or robustness to secondary extinctions should lie in the range 2-2.5. That work was based on terrestrial food webs from the Late Permian, 250+ million years ago!</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Return on conversation]]></title>
<link>http://curiouslypersistent.wordpress.com/2009/09/19/return-on-conversation/</link>
<pubDate>Sat, 19 Sep 2009 13:02:14 +0000</pubDate>
<dc:creator>Simon Kendrick</dc:creator>
<guid>http://curiouslypersistent.wordpress.com/2009/09/19/return-on-conversation/</guid>
<description><![CDATA[EDIT: As has been pointed out, I made a rather embarrassing miscalculation in the original post, whi]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p style="text-align:center;"><img class="aligncenter" title="Chatting on phone next to computer" src="http://farm4.static.flickr.com/3241/2455310444_64f6d67e4f.jpg" alt="" width="400" height="238" /></p>
<p><strong>EDIT: As has been pointed out, I made a rather embarrassing miscalculation in the original post, which made me seriously underestimate the CTR. I evidently need to evaluate my quantitative credentials</strong></p>
<p>My previous post on <a href="http://curiouslypersistent.wordpress.com/2009/09/06/should-we-listen-to-every-conversation/">conversation monitoring</a> was tweeted and retweeted by several individuals. Firstly, I&#8217;m grateful that people both read this blog and are motivated to share something I&#8217;ve written.</p>
<p>However, the additional traffic that this Twitter activity generated has left me wondering how valuable this social activity is to individuals or organisations that look to spread their message through this sphere.</p>
<p>What follows are some rough numbers given that:</p>
<ol>
<li>WordPress.com stats are pretty basic</li>
<li>I&#8217;ve left it two weeks to do the maths, and so follower numbers will have changed</li>
<li>Follower overlap and actual exposures are unknown</li>
</ol>
<p>Nevertheless:</p>
<ul>
<li>To my knowledge, the post was tweeted/retweeted 10 times</li>
<li>Combined number of people following those who linked the post is 10,354 as of today</li>
<li>The post probably got 100 additional hits as a result of Twitter activity</li>
</ul>
<p>A couple of guesstimated calculations:</p>
<ul>
<li>At an absolute level, this represents a click through rate of 1%</li>
<li>If I made the assumption that 5,000 followers are unduplicated (the largest follower count for a retweeter is over 3,000), the CTR changes to 2%</li>
<li>How many of the followers would have seen the tweet? A fifth? That changes the CTR to 10%</li>
</ul>
<p>10% is OK for a CTR, but it isn&#8217;t spectacular. The best ad campaigns with a strong call to action (e.g. competition entry) would achieve that.</p>
<p>The argument is that these 10% are going to be of a much higher quality than random visitors &#8211; they have acted upon a social recommendation and are likely to be engaged and interested in the content.</p>
<p>But that argument should work for the click through itself. If someone you follow and trust is recommending something, shouldn&#8217;t you be more likely to click through than if it were a random link or ad?</p>
<p>There a few issues at play here, which are causing this level of CTR</p>
<ul>
<li>Noise &#8211; Twitter is popular; there are a lot of tweets and links to browse and skim</li>
<li>Ambient intimacy &#8211; often, it is enough for me to know that person X has linked to a post on conversation monitoring by @curiouslyp. I may prefer to browse the remaining tweets rather than click through to this post</li>
<li>Power laws &#8211; if the post on conversation monitoring was by @jowyang or @chrisbrogan I may click through since they are renowned experts. Who is @curiouslyp and what would he know about this topic?</li>
<li>Nature of followers &#8211; my prior post was relevant to the PR community &#8211; very active on Twitter. I suspect posts of a different subject matter are unlikely to be spread and consumed to the same degree</li>
</ul>
<p>It is nice to think that the future is social, and that these networks will power traffic in future. But those perpetuating this &#8211; in my opinion &#8211; myth are those for whom power laws benefit, and who spend an inordinate amount of time on social networks (most likely because it is there job to do so). The average person does not have the time nor inclination to follow through on many, let alone all, posts or links.</p>
<p>So, in my opinion, the return on conversation is pretty minimal. Nevertheless, I did find it interesting to map how my post spread through Twitter via social graphs and, to repeat, I am grateful to the few that took the time to read and pass on my post.</p>
<p><a href="http://curiouslypersistent.wordpress.com/2009/09/19/return-on-conversation/">sk</a></p>
<p><span style="font-size:xx-small;">Image credit: <a href="http://www.flickr.com/photos/ironmonkey480/">http://www.flickr.com/photos/ironmonkey480/</a></span></p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/f7d08d08-791b-43fb-999f-5df4c1943be7/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=f7d08d08-791b-43fb-999f-5df4c1943be7" alt="Reblog this post [with Zemanta]" /></a></div>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Calling Bluffs and Naming Names: Myths about the Internet Topolgy]]></title>
<link>http://jsaia.wordpress.com/2009/09/11/calling-bluffs/</link>
<pubDate>Fri, 11 Sep 2009 21:02:49 +0000</pubDate>
<dc:creator>Jared</dc:creator>
<guid>http://jsaia.wordpress.com/2009/09/11/calling-bluffs/</guid>
<description><![CDATA[Tanya Berger-Wolf sent me a brave little paper recently: Mathematics and the Internet: A Source of E]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><a href="http://compbio.cs.uic.edu/~tanya/">Tanya Berger-Wolf</a> sent me a brave little paper recently: <a href="http://faculty.nps.edu/dlalders/docs/Internet-AMS-Notices-May2009.pdf">Mathematics and the Internet: A Source of Enormous Confusion and Great Potential</a> by Willinger, Alderson and Doyle.  This paper does a post mortem on the popular belief that empirical studies show that the Internet is <a href="http://en.wikipedia.org/wiki/Scale-free_network">scale-free</a>.  Spoiler: They don&#8217;t.</p>
<p>Willinger et al. trace this erroneous belief starting with a paper by <a href="http://lsiit.u-strasbg.fr/Publications/1998/PG98/ccr98-Pansiot.pdf">Pansiot and Grad</a> (who are by no means responsible), that collected data on node degrees &#8220;to get some experimental data on the shape of multicast trees one can actually obtain in [the real] Internet&#8221;.  This data was collected using trace routes, which was reasonable given that its purpose was to determine the shape of multicast trees.  In 1999, <a href="http://www.cs.cmu.edu/~christos/PUBLICATIONS/sigcomm99.ps.gz">Faloutsous, Faloutsous and Faloutsous</a> used the data from Pansiot and Grad for a purpose for which it was not intended: they used it to infer information about the degree distribution for the Internet&#8217;s router-level topology.  Next, Barabasi et al., who were studying the WWW graph, and observing scale-free distributions there, picked up the reports of scale-free distributions for the Internet and added this to their list of networks with such properties.</p>
<p>A paper soon followed by <a href="http://www.barabasilab.com/pubs/CCNR-ALB_Publications/200201-30_RevModernPhys-StatisticalMech/200201-30_RevModernPhys-StatisticalMech.pdf">Barabasi and Albert </a>that described the <a href="http://en.wikipedia.org/wiki/Preferential_attachment">preferential attachment</a> model of network growth, wherein nodes are added one at a time to a network; each node has a constant number of out links; and the destinations of these out links are selected from the current nodes with probability proportional to each current node&#8217;s number of incoming links. Barabasi and Albert showed that this model generates a network with a scale-free distribution, and posited the preferential attachment as a model of the growth of the Internet and the WWW network.  The preferential attachment model had actually been studied over about 75 years by Yule, Luria and Delbruck, and Simon, but the rediscovery by Albert and Barabasi, and the possible connection to modern networks generated a huge amount of excitement and a large number of follow-up papers.  In particular, a slew of follow-up papers specifically studied properties of preferential attachment (PA) networks, i.e. networks generated via the preferential attachment model of growth. It was shown that PA networks are very robust to random deletions, but very fragile to adversarial deletions.  One commonly accepted conclusion was that the Internet, like PA networks, had high degree nodes, that were very centrally located, and whose removal would easily disconnect the network.  This idea was actually celebrated as a nice implication from theoretical network models to the real world.</p>
<p>So what&#8217;s the problem?  Basically the research community made two huge mistakes; mistakes that some people in the community still have not recovered from (in the sense that &#8220;recover from&#8221; means  &#8220;become aware of&#8221;)!</p>
<p><strong>Mistake 1: </strong>Empirical studies do <strong>not</strong> support the claim that the Internet is scale-free</p>
<p>Willinger et al. go through several problems about why traceroutes do not create accurate maps of the Internet.  There are many technical details here that are hard for a theoretician like me to understand.  However, one detail that caught my eye was the fact that when a traceroute encounters an &#8220;opaque layer-2&#8243; cloud, &#8220;it falsely &#8216;discovers&#8217; a high-degree node that is really a logical entity  &#8230; rather than a physical node&#8221;.  This causes traceroutes to report high-degree nodes in the core of the router-level Internet, even when such nodes don&#8217;t exist.  This type of bias is claimed to be even worse than the well-known &#8220;over-sampling of high-degree nodes&#8221; bias that traceroutes also have.</p>
<p><strong>Mistake 2:</strong> Scale-free networks are <strong>not</strong> PA networks</p>
<p>PA Networks are scale-free, but the converse is not true.  In particular, there are scale-free networks that <strong>are</strong> robust to adversarial deletions.  Intuitively, it&#8217;s possible to have a scale-free network where the high degree nodes are not at the &#8220;core&#8221; of the network.  More generally, it&#8217;s possible to have a scale free network with  good <a href="http://en.wikipedia.org/wiki/Expander_graph">expansion</a> properties.  Such networks are robust to adversarial deletions.  In fact, Wilinger et al., suggest that the Internet is more likely to have the high degree nodes at the &#8220;periphery&#8221; of the network since each router can only handle a certain amount of traffic, and more edges are only possible if each edge is handling less traffic.</p>
<p><strong>What Next?</strong></p>
<p>Here the Willinger paper looses steam.  They suggest a first principles approach to Internet modeling, where the network formed is one that tries to optimize a given algorithmic problem over the network.  This is great.  What is not so great is the model they propose, which assumes that the network creation problem is solved in a completely centralized manner.  Much better would be if they had used game theory as a starting point.  After all, the Internet is inherently a massive, distributed enterprise. There is actually some really cool work done in algorithmic game theory on network creation games.  For example, see <a href="http://www.cs.cornell.edu/People/eva/agtchap19.pdf">Chapter 19</a> of The Book, or <a href="http://www.cs.berkeley.edu/~alexf/papers/flmps03.pdf">this paper</a> by Fabrikant et al. (shout out to <a href="http://www.lsi.upc.edu/~maneva/">Elitza Maneva</a> in the et al. for this paper, who was visiting UPC while I was there on sabbatical).  Yes, it&#8217;s true that most of these game theory result have shown that network creation games have small price of anarchy.  However, that does not imply that the network topology you get in a Nash equilibria will be like the topology you get in the socially optimum solution.  I&#8217;d like to see some results on the types of topologies one might expect in an Nash equilibria for network creation games.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[The Measurement of Whispers]]></title>
<link>http://miroslodki.wordpress.com/2009/08/26/the-measurement-of-whispers/</link>
<pubDate>Wed, 26 Aug 2009 22:59:18 +0000</pubDate>
<dc:creator>miroslodki</dc:creator>
<guid>http://miroslodki.wordpress.com/2009/08/26/the-measurement-of-whispers/</guid>
<description><![CDATA[The Measurement of Whispers Prologue*: It has been written that Sir Walter Raleigh wagered Queen Eli]]></description>
<content:encoded><![CDATA[The Measurement of Whispers Prologue*: It has been written that Sir Walter Raleigh wagered Queen Eli]]></content:encoded>
</item>
<item>
<title><![CDATA[The Black Swan]]></title>
<link>http://francescobaldisserri.wordpress.com/2009/08/10/the-black-swan/</link>
<pubDate>Mon, 10 Aug 2009 16:34:06 +0000</pubDate>
<dc:creator>Francesco</dc:creator>
<guid>http://francescobaldisserri.wordpress.com/2009/08/10/the-black-swan/</guid>
<description><![CDATA[Black Swans swimming in Reading, UK Nassim Nicholas Taleb&#8217;s (the author of &#8220;The Black Sw]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><div class="zemanta-img zemanta-action-dragged" style="display:block;margin:1em;">
<div class="wp-caption aligncenter" style="width: 310px"><a href="http://commons.wikipedia.org/wiki/Image:Black_swan_rdg.jpg"><img class=" " title="Black swan swimming in Reading, UK." src="http://upload.wikimedia.org/wikipedia/commons/thumb/6/6d/Black_swan_rdg.jpg/300px-Black_swan_rdg.jpg" alt="Black swan swimming in Reading, UK." width="300" height="225" /></a><p class="wp-caption-text">Black Swans swimming in Reading, UK</p></div>
</div>
<p><a class="zem_slink" title="Nassim Nicholas Taleb" rel="wikipedia" href="http://en.wikipedia.org/wiki/Nassim_Nicholas_Taleb">Nassim Nicholas Taleb</a>&#8217;s (the author of &#8220;The <a class="zem_slink" title="The Black Swan: The Impact of the Highly Improbable" rel="amazon" href="http://www.amazon.com/Black-Swan-Impact-Highly-Improbable/dp/1400063515%3FSubscriptionId%3D0G81C5DAZ03ZR9WH9X82%26tag%3Dzemanta-20%26linkCode%3Dxm2%26camp%3D2025%26creative%3D165953%26creativeASIN%3D1400063515">Black Swan</a>&#8220;) writing style is very provocative, brash and presumptuous, however the argument he presents about &#8220;<a class="zem_slink" title="Black Swan" rel="wikipedia" href="http://en.wikipedia.org/wiki/Black_Swan">Black Swans</a>&#8220;, very unlikely events with deep consequences, is one of the most brilliant I&#8217;ve read so far.</p>
<p>Beyond all his attacks to &#8220;empty suits&#8221; (experts and consultants of a certain field), Nobel prizes and &#8220;fawning MBA students&#8221; (hem hem&#8230;), Taleb clearly shows our limits in understanding the causes which triggered an event and our poor skills in forecasting. Very interesting at this regard are his examples about how we are biased in explaining past events (narrative fallacy) and how we systematically consider evidences backing our theories but ignore facts which disprove our beliefs (we often forget that 1000 evidences don&#8217;t prove a theory but just 1 counter-evidence is enough to disprove it).</p>
<p>The first 2 parts of the book are a sad enumeration of how poor is our critical thought and how we always underestimate our ignorance (great the example of <a class="zem_slink" title="Umberto Eco" rel="wikipedia" href="http://en.wikipedia.org/wiki/Umberto_Eco">Umberto Eco</a>&#8217;s anti-library) yet these parts are very useful in highlighting how we tend to commit the turkey&#8217;s error (the farmer has fed me for the last 100o days and so I&#8217;m sure he will do again tomorrow).</p>
<p>The last part deals instead with what we can do when facing risks and having to take decisions; most of this section is labeled &#8220;optional&#8221; by the author but I believe is a necessary read as well. Having demolished (convincingly) the Bell&#8217;s curve as a universal tool for risk evaluation, Taleb talks about <a class="zem_slink" title="Power law" rel="wikipedia" href="http://en.wikipedia.org/wiki/Power_law">Power Law</a> distributions, with interesting overlaps with <a class="zem_slink" title="Chris Anderson" rel="homepage" href="http://www.thelongtail.com/about.html">Chris Anderson</a>&#8217;s <a class="zem_slink" title="The Long Tail : Why the Future Is Selling Less of More" rel="amazon" href="http://www.amazon.com/Long-Tail-Future-Selling-Less/dp/190521121X%3FSubscriptionId%3D0G81C5DAZ03ZR9WH9X82%26tag%3Dzemanta-20%26linkCode%3Dxm2%26camp%3D2025%26creative%3D165953%26creativeASIN%3D190521121X">Long Tail</a> and <a class="zem_slink" title="Benoît Mandelbrot" rel="wikipedia" href="http://en.wikipedia.org/wiki/Beno%C3%AEt_Mandelbrot">Mandelbrot</a>&#8217;s Geometry.</p>
<p>A book absolutely brilliant, which teaches that what we don&#8217;t know is as important as what we do know, regardless the field: financial risks&#8217; evaluation, historical events, everyday events&#8230;</p>
<h6 class="zemanta-related-title" style="font-size:1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.kottke.org/09/06/antilibraries">Antilibraries</a>(kottke.org)</li>
<li class="zemanta-article-ul-li"><a href="http://www.socialhallucinations.com/2009/04/can-you-see-any-black-swan-in-your-rearview-mirror.html">Can You See Any Black Swan in Your Rear-view Mirror?</a>(socialhallucinations.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.omnivoracious.com/2009/04/ten-against-the-swan-nassim-nicholas-talebs-10-theses.html">Ten Against the Swan: Nassim Nicholas Taleb&#8217;s 10 Theses</a> (omnivoracious.com)</li>
</ul>
<p> </p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/3f4c75bd-f8c9-422d-ba29-f1f62019abdd/"><img class="zemanta-pixie-img" style="float:right;border-style:none;" src="http://img.zemanta.com/reblog_e.png?x-id=3f4c75bd-f8c9-422d-ba29-f1f62019abdd" alt="Reblog this post [with Zemanta]" /></a></div>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Is the Mother of All Black Swans Coming?]]></title>
<link>http://paulbarsch.wordpress.com/2009/08/05/is-the-mother-of-all-black-swans-coming/</link>
<pubDate>Wed, 05 Aug 2009 16:58:00 +0000</pubDate>
<dc:creator>paulbarsch</dc:creator>
<guid>http://paulbarsch.wordpress.com/2009/08/05/is-the-mother-of-all-black-swans-coming/</guid>
<description><![CDATA[Whether you like Nassim Taleb&#8217;s writing style or not, his books and the ideas within are worth]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><img class="alignleft size-full wp-image-238" title="dragon king" src="http://paulbarsch.wordpress.com/files/2009/08/dragon.jpg" alt="dragon king" width="129" height="129" />Whether you like Nassim Taleb&#8217;s writing style or not, his books and the ideas within are worth careful study.</p>
<p>For example, a key theme of Nassim Taleb’s “<a href="http://www.fooledbyrandomness.com/">The Black Swan</a>” is to pay close attention to the possibility of the rare event, the outlier that brings devastating consequence. Taleb forcefully and intelligently argues that these outliers are much more common than we think (1 in 100 year floods happen every 3-5 years) especially when we base our statistical analysis on the assumptions of a normal distribution of data and independence.</p>
<p>However, some interesting research by Didier Sornette at the Swiss Federal Institute of Technology shows that while power laws can explain some of the extreme events that have taken place in history, there are some <strong>extreme outliers</strong> that even defy classic power laws.</p>
<p>A <a href="http://www.technologyreview.com/blog/arxiv/23935/">post in MIT’s Technology Review</a> blog cites the following:</p>
<blockquote><p>“Sornette gives as an example the distribution of city sizes in France, which follows a classic power law, meaning that there are many small cities and only a few large ones. On a log-to-log scale, this distribution gives a straight line&#8211;except for Paris, which is an outlier and many times larger than it ought to be if it were to follow the power law.”</p></blockquote>
<p>The article also mentions that the city of London also follows this same example.</p>
<p>Sornette calls these extreme outliers “Dragon Kings.” A sobering commentary from the article ensues; “(The) seemingly ubiquitous presence of these dragon kings in all kinds of data sets means that extreme events are significantly more likely than power laws suggest.”</p>
<p>This in turn suggests that the Mother of all Black Swans might be unaccounted for in your data set. And if this is the case, does this mean that we cannot predict these extreme events,  and/or that preparation is futile?</p>
<p>I would love to hear your thoughts!</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[scl, sails and surgery]]></title>
<link>http://lamington.wordpress.com/2009/07/23/scl-sails-and-surgery/</link>
<pubDate>Thu, 23 Jul 2009 23:10:32 +0000</pubDate>
<dc:creator>Danny Calegari</dc:creator>
<guid>http://lamington.wordpress.com/2009/07/23/scl-sails-and-surgery/</guid>
<description><![CDATA[I have just uploaded a paper to the arXiv, entitled &#8220;Scl, sails and surgery&#8221;. The paper ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I have just uploaded a paper to the arXiv, entitled &#8220;<a href="http://arxiv.org/abs/0907.3541">Scl, sails and surgery</a>&#8221;. The paper discusses a connection between stable commutator length in free groups and the geometry of sails. This is an interesting example of what sometimes happens in geometry, where a complicated topological problem in low dimensions can be translated into a &#8220;simple&#8221; geometric problem in high dimensions. Other examples include the Veronese embedding in Algebraic geometry (i.e. the embedding of one projective space into another taking a point with homogeneous co-ordinates <img src='http://l.wordpress.com/latex.php?latex=x_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_i' title='x_i' class='latex' /> to the point whose homogeneous co-ordinates are the monomials of some fixed degree in the <img src='http://l.wordpress.com/latex.php?latex=x_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='x_i' title='x_i' class='latex' />), which lets one exhibit any projective variety as an intersection of a Veronese variety (whose geometry is understood very well) with a linear subspace.</p>
<p>In my paper, the fundamental problem is to compute stable commutator length in free groups, and more generally in free products of Abelian groups. Let&#8217;s focus on the case of a group <img src='http://l.wordpress.com/latex.php?latex=G+%3D+A%2AB&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='G = A*B' title='G = A*B' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=A%2CB&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A,B' title='A,B' class='latex' /> are free abelian of finite rank. A <img src='http://l.wordpress.com/latex.php?latex=K%28G%2C1%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K(G,1)' title='K(G,1)' class='latex' /> is just a wedge <img src='http://l.wordpress.com/latex.php?latex=X%3A%3DK_A+%5Cvee+K_B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X:=K_A \vee K_B' title='X:=K_A \vee K_B' class='latex' /> of tori of dimension equal to the ranks of <img src='http://l.wordpress.com/latex.php?latex=A%2CB&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A,B' title='A,B' class='latex' />. Let <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%3A+%5Ccoprod_i+S%5E1+%5Cto+X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma: \coprod_i S^1 \to X' title='\Gamma: \coprod_i S^1 \to X' class='latex' /> be a free homotopy class of <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />-manifold in <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' />, which is homologically trivial. Formally, we can think of <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> as a chain <img src='http://l.wordpress.com/latex.php?latex=%5Csum_i+g_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_i g_i' title='\sum_i g_i' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=B_1%5EH%28G%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='B_1^H(G)' title='B_1^H(G)' class='latex' />, the vector space of group <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />-boundaries, modulo homogenization; i.e. quotiented by the subspace spanned by chains of the form <img src='http://l.wordpress.com/latex.php?latex=g%5En+-+ng&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g^n - ng' title='g^n - ng' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g-hgh%5E%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='g-hgh^{-1}' title='g-hgh^{-1}' class='latex' />. One wants to find the simplest surface <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> mapping to <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X' title='X' class='latex' /> that rationally bounds <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' />. I.e. we want to find a map <img src='http://l.wordpress.com/latex.php?latex=f%3AS+%5Cto+X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='f:S \to X' title='f:S \to X' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Cpartial+f%3A%5Cpartial+S+%5Cto+X&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\partial f:\partial S \to X' title='\partial f:\partial S \to X' class='latex' /> factors through <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' />, and so that the boundary <img src='http://l.wordpress.com/latex.php?latex=%5Cpartial+S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\partial S' title='\partial S' class='latex' /> wraps homologically <img src='http://l.wordpress.com/latex.php?latex=n%28S%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n(S)' title='n(S)' class='latex' /> times around each loop of <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' />, in such a way as to infimize <img src='http://l.wordpress.com/latex.php?latex=-%5Cchi%28S%29%2F2n%28S%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-\chi(S)/2n(S)' title='-\chi(S)/2n(S)' class='latex' />. This infimum, over all maps of all surfaces <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> of all possible genus, is the <em>stable commutator length</em> of the chain <img src='http://l.wordpress.com/latex.php?latex=%5Csum_i+g_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum_i g_i' title='\sum_i g_i' class='latex' />. Computing this quantity for all such finite chains is tantamount to understanding the bounded cohomology of a free group in dimension <img src='http://l.wordpress.com/latex.php?latex=2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='2' title='2' class='latex' />.</p>
<p>Given such a surface <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' />, one can cut it up into simpler pieces, along the preimage of the basepoint <img src='http://l.wordpress.com/latex.php?latex=K_A+%5Ccap+K_B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K_A \cap K_B' title='K_A \cap K_B' class='latex' />. Since <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> is a surface with boundary, these simpler pieces are surfaces with <em>corners</em>. In general, understanding how a surface can be assembled from an abstract collection of surfaces with corners is a hopeless task. When one tries to glue the pieces back together, one runs into trouble at the corners &#8212; how does one decide when a collection of surfaces &#8220;closes up&#8221; around a corner? The wrong decision leads to branch points; moreover, a decision made at one corner will propogate along an edge and lead to constraints on the choices one can make at other corners. This problem arises again and again in low-dimensional topology, and has several different (and not always equivalent) formulations and guises, including -</p>
<ul>
<li>Given an abstract branched surface and a weight on that surface, when is there an unbranched surface carried by the abstract branched surface and realizing the weight?</li>
<li>Given a triangulation of a <img src='http://l.wordpress.com/latex.php?latex=3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='3' title='3' class='latex' />-manifold and a collection of normal surface types in each simplex satisfying the gluing constraints but *not*  necessarily satisfying the quadrilateral condition (i.e. there might be more than one quadrilateral type per simplex), when is there an immersed unbranched normal surface in the manifold realizing the weight?</li>
<li>Given an immersed curve in the plane, when is there an immersion from the disk to the plane whose boundary is the given curve?</li>
<li>Given a polyhedral surface (arising e.g. in computer graphics), how can one choose smooth approximations of the polygonal faces that mesh smoothly at the vertices?</li>
</ul>
<p>I think of all these problems as examples of what I like to call the <em>holonomy problem</em>, since all of them can be reduced, in one way or another, to studying representations of fundamental groups of punctured surfaces into finite groups. The fortunate &#8220;accident&#8221; in this case is that every corner arises by intersecting a cut with a boundary edge of <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' />. Consequently, one never wants to glue more than two pieces up at any corner, and the holonomy problem does not arise. Hence in principle, to understand the surface <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> one just needs to understand the pieces of <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' /> that can arise by cutting, and the ways in which they can be reassembled.</p>
<p>This is still not a complete solution of the problem, since infinitely many kinds of pieces can arise by cutting complicated surfaces <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' />. The <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />-manifold <img src='http://l.wordpress.com/latex.php?latex=%5CGamma&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\Gamma' title='\Gamma' class='latex' /> decomposes into a collection of arcs in the tori <img src='http://l.wordpress.com/latex.php?latex=K_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K_A' title='K_A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=K_B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K_B' title='K_B' class='latex' /> which we denote <img src='http://l.wordpress.com/latex.php?latex=%5Ctau_A%2C%5Ctau_B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau_A,\tau_B' title='\tau_A,\tau_B' class='latex' /> respectively, and the surface <img src='http://l.wordpress.com/latex.php?latex=S+%5Ccap+K_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S \cap K_A' title='S \cap K_A' class='latex' /> (hereafter abbreviated to <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' />) has edges that alternate between elements of <img src='http://l.wordpress.com/latex.php?latex=%5Ctau_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau_A' title='\tau_A' class='latex' />, and edges mapping to <img src='http://l.wordpress.com/latex.php?latex=K_A+%5Ccap+K_B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K_A \cap K_B' title='K_A \cap K_B' class='latex' />. Since <img src='http://l.wordpress.com/latex.php?latex=K_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K_A' title='K_A' class='latex' /> is a torus, handles of <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' /> mapping to <img src='http://l.wordpress.com/latex.php?latex=K_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K_A' title='K_A' class='latex' /> can be compressed, reducing the complexity of <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' />, and thereby <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S' title='S' class='latex' />, so one need only consider <em>planar</em> surfaces <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' />.</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=C_2%28A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C_2(A)' title='C_2(A)' class='latex' /> denote the real vector space with basis the set of ordered pairs <img src='http://l.wordpress.com/latex.php?latex=%28t%2Ct%27%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(t,t&#039;)' title='(t,t&#039;)' class='latex' /> of elements of <img src='http://l.wordpress.com/latex.php?latex=%5Ctau_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau_A' title='\tau_A' class='latex' /> (not necessarily distinct), and <img src='http://l.wordpress.com/latex.php?latex=C_1%28A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C_1(A)' title='C_1(A)' class='latex' /> the real vector space with basis the elements of <img src='http://l.wordpress.com/latex.php?latex=%5Ctau_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\tau_A' title='\tau_A' class='latex' />. A surface <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' /> determines a non-negative integral vector <img src='http://l.wordpress.com/latex.php?latex=v%28S_A%29+%5Cin+C_2%28A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v(S_A) \in C_2(A)' title='v(S_A) \in C_2(A)' class='latex' />, by counting the number of times a given pair of edges <img src='http://l.wordpress.com/latex.php?latex=%28t%2Ct%27%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(t,t&#039;)' title='(t,t&#039;)' class='latex' /> appear in succession on one of the (oriented) boundary components of <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' />. The vector <img src='http://l.wordpress.com/latex.php?latex=v%28S_A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v(S_A)' title='v(S_A)' class='latex' /> satisfies two linear constraints. First, there is a map <img src='http://l.wordpress.com/latex.php?latex=%5Cpartial%3A+C_2%28A%29+%5Cto+C_1%28A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\partial: C_2(A) \to C_1(A)' title='\partial: C_2(A) \to C_1(A)' class='latex' /> defined on a basis vector by <img src='http://l.wordpress.com/latex.php?latex=%5Cpartial%28t%2Ct%27%29+%3D+t+-+t%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\partial(t,t&#039;) = t - t&#039;' title='\partial(t,t&#039;) = t - t&#039;' class='latex' />. The vector <img src='http://l.wordpress.com/latex.php?latex=v%28S_A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v(S_A)' title='v(S_A)' class='latex' /> satisfies <img src='http://l.wordpress.com/latex.php?latex=%5Cpartial+v%28S_A%29+%3D+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\partial v(S_A) = 0' title='\partial v(S_A) = 0' class='latex' />. Second, each element <img src='http://l.wordpress.com/latex.php?latex=t+%5Cin+%5Ctau_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t \in \tau_A' title='t \in \tau_A' class='latex' /> is a based loop in <img src='http://l.wordpress.com/latex.php?latex=K_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='K_A' title='K_A' class='latex' />, and therefore corresponds to an element in the free abelian group <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' />. Define <img src='http://l.wordpress.com/latex.php?latex=h%3AC_2%28A%29+%5Cto+A+%5Cotimes+%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h:C_2(A) \to A \otimes \mathbb{R}' title='h:C_2(A) \to A \otimes \mathbb{R}' class='latex' /> on a basis vector by <img src='http://l.wordpress.com/latex.php?latex=h%28t%2Ct%27%29+%3D+t%2Bt%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h(t,t&#039;) = t+t&#039;' title='h(t,t&#039;) = t+t&#039;' class='latex' /> (warning: the notation obscures the fact that <img src='http://l.wordpress.com/latex.php?latex=%5Cpartial&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\partial' title='\partial' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> map to quite different vector spaces). Then <img src='http://l.wordpress.com/latex.php?latex=h+v%28S_A%29%3D0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h v(S_A)=0' title='h v(S_A)=0' class='latex' />; moreover, a non-negative rational vector <img src='http://l.wordpress.com/latex.php?latex=v+%5Cin+C_2%28A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v \in C_2(A)' title='v \in C_2(A)' class='latex' /> satisfying <img src='http://l.wordpress.com/latex.php?latex=%5Cpartial+v+%3D+h+v+%3D+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\partial v = h v = 0' title='\partial v = h v = 0' class='latex' /> has a multiple of the form <img src='http://l.wordpress.com/latex.php?latex=v%28S_A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v(S_A)' title='v(S_A)' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' /> as above. Denote the subspace of <img src='http://l.wordpress.com/latex.php?latex=C_2%28A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C_2(A)' title='C_2(A)' class='latex' /> consisting of non-negative vectors in the kernel of <img src='http://l.wordpress.com/latex.php?latex=%5Cpartial&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\partial' title='\partial' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_A' title='V_A' class='latex' />. This is a rational polyhedral cone &#8212; i.e. a cone with finitely many extremal rays, each spanned by a rational vector.</p>
<p>Although every integral <img src='http://l.wordpress.com/latex.php?latex=v+%5Cin+V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v \in V_A' title='v \in V_A' class='latex' /> is equal to <img src='http://l.wordpress.com/latex.php?latex=v%28S_A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v(S_A)' title='v(S_A)' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' />, many different <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' /> correspond to a given <img src='http://l.wordpress.com/latex.php?latex=v&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v' title='v' class='latex' />. Moreover, if we are allowed to consider formal weighted sums of surfaces, then even more possibilities. In order to compute stable commutator length, we must determine, for a given vector <img src='http://l.wordpress.com/latex.php?latex=v+%5Cin+V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v \in V_A' title='v \in V_A' class='latex' />, an expression <img src='http://l.wordpress.com/latex.php?latex=v+%3D+%5Csum+t_i+v%28S_i%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v = \sum t_i v(S_i)' title='v = \sum t_i v(S_i)' class='latex' /> where the <img src='http://l.wordpress.com/latex.php?latex=t_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='t_i' title='t_i' class='latex' /> are positive real numbers, which minimizes <img src='http://l.wordpress.com/latex.php?latex=%5Csum+-t_i+%5Cchi_o%28S_i%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\sum -t_i \chi_o(S_i)' title='\sum -t_i \chi_o(S_i)' class='latex' />. Here <img src='http://l.wordpress.com/latex.php?latex=%5Cchi_o%28%5Ccdot%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi_o(\cdot)' title='\chi_o(\cdot)' class='latex' /> denotes <em>orbifold</em> Euler characteristic of a surface with corners; each corner contributes <img src='http://l.wordpress.com/latex.php?latex=-1%2F4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-1/4' title='-1/4' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cchi_o&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi_o' title='\chi_o' class='latex' />. The reason one counts complexity using this modified definition is that the result is additive: <img src='http://l.wordpress.com/latex.php?latex=%5Cchi%28S%29+%3D+%5Cchi_o%28S_A%29+%2B+%5Cchi_o%28S_B%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi(S) = \chi_o(S_A) + \chi_o(S_B)' title='\chi(S) = \chi_o(S_A) + \chi_o(S_B)' class='latex' />. The contribution to <img src='http://l.wordpress.com/latex.php?latex=%5Cchi_o&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi_o' title='\chi_o' class='latex' /> from corners is a linear function on <img src='http://l.wordpress.com/latex.php?latex=V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_A' title='V_A' class='latex' />. Moreover, a component <img src='http://l.wordpress.com/latex.php?latex=S_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_i' title='S_i' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=%5Cchi%28S_i%29+%5Cle+0&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi(S_i) \le 0' title='\chi(S_i) \le 0' class='latex' /> can be covered by a surface of high genus and compressed (increasing <img src='http://l.wordpress.com/latex.php?latex=%5Cchi&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi' title='\chi' class='latex' />); so such a term can always be replaced by a formal sum <img src='http://l.wordpress.com/latex.php?latex=1%2Fn+S_i%27&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1/n S_i&#039;' title='1/n S_i&#039;' class='latex' /> for which <img src='http://l.wordpress.com/latex.php?latex=%5Cchi%28S_i%27%29+%3D+%5Cchi%28S_i%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi(S_i&#039;) = \chi(S_i)' title='\chi(S_i&#039;) = \chi(S_i)' class='latex' />. Thus the only nonlinear contribution to <img src='http://l.wordpress.com/latex.php?latex=%5Cchi_o&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi_o' title='\chi_o' class='latex' /> comes from the surfaces <img src='http://l.wordpress.com/latex.php?latex=S_i&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_i' title='S_i' class='latex' /> whose underlying topological surface is a <em>disk</em>.</p>
<p>Call a vector <img src='http://l.wordpress.com/latex.php?latex=v+%5Cin+V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v \in V_A' title='v \in V_A' class='latex' /> a <em>disk vector</em> if <img src='http://l.wordpress.com/latex.php?latex=v+%3D+v%28S_A%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v = v(S_A)' title='v = v(S_A)' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=S_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='S_A' title='S_A' class='latex' /> is topologically a disk (with corners). It turns out that the set of disk vectors <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BD%7D_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{D}_A' title='\mathcal{D}_A' class='latex' /> has the following simple form: it is equal to the union of the integer lattice points contained in certain of the open faces of <img src='http://l.wordpress.com/latex.php?latex=V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_A' title='V_A' class='latex' /> (those satisfying a combinatorial criterion). Define the <em>sail</em> of <img src='http://l.wordpress.com/latex.php?latex=V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_A' title='V_A' class='latex' /> to be equal to the boundary of the convex hull of the polyhedron <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BD%7D_A+%2B+V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{D}_A + V_A' title='\mathcal{D}_A + V_A' class='latex' /> (where <img src='http://l.wordpress.com/latex.php?latex=%2B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='+' title='+' class='latex' /> here denotes Minkowski sum). The <em>Klein function</em> <img src='http://l.wordpress.com/latex.php?latex=%5Ckappa&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\kappa' title='\kappa' class='latex' /> is the unique continuous function on <img src='http://l.wordpress.com/latex.php?latex=V_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_A' title='V_A' class='latex' />, linear on rays, that is equal to <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' /> exactly on the sail. Then <img src='http://l.wordpress.com/latex.php?latex=%5Cchi_o%28v%29%3A%3D+%5Cmax+%5Csum+t_i%5Cchi_o%28S_i%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi_o(v):= \max \sum t_i\chi_o(S_i)' title='\chi_o(v):= \max \sum t_i\chi_o(S_i)' class='latex' /> over expressions <img src='http://l.wordpress.com/latex.php?latex=v+%3D+%5Csum+t_i+v%28S_i%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='v = \sum t_i v(S_i)' title='v = \sum t_i v(S_i)' class='latex' /> satisfies <img src='http://l.wordpress.com/latex.php?latex=%5Cchi_o%28v%29+%3D+%5Ckappa%28v%29+-+%26%23124%3Bv%26%23124%3B%2F2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\chi_o(v) = \kappa(v) - &#124;v&#124;/2' title='\chi_o(v) = \kappa(v) - &#124;v&#124;/2' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%26%23124%3B%5Ccdot%26%23124%3B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='&#124;\cdot&#124;' title='&#124;\cdot&#124;' class='latex' /> denotes <img src='http://l.wordpress.com/latex.php?latex=L%5E1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='L^1' title='L^1' class='latex' /> norm. To calculate stable commutator length, one minimizes <img src='http://l.wordpress.com/latex.php?latex=-%5Cchi_o%28v%29+-+%5Cchi_o%28v%27%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='-\chi_o(v) - \chi_o(v&#039;)' title='-\chi_o(v) - \chi_o(v&#039;)' class='latex' /> over <img src='http://l.wordpress.com/latex.php?latex=%28v%2Cv%27%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='(v,v&#039;)' title='(v,v&#039;)' class='latex' /> contained in a certain rational polyhedron in <img src='http://l.wordpress.com/latex.php?latex=V_A+%5Ctimes+V_B&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_A \times V_B' title='V_A \times V_B' class='latex' />.</p>
<p>Sails are considered elsewhere by several authors; usually, people take <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BD%7D_A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathcal{D}_A' title='\mathcal{D}_A' class='latex' /> to be the set of all integer vectors except the vertex of the cone, and the sail is therefore the boundary of the convex hull of this (simpler) set. Klein introduced sails as a higher-dimensional generalization of continued fractions: if <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V' title='V' class='latex' /> is a polyhedral cone in two dimensions (i.e. a sector in the plane, normalized so that one edge is the horizontal axis, say), the vertices of the sail are the continued fraction approximations of the boundary slope. Arnold has <a href="http://www.ams.org/mathscinet-getitem?mr=1704965">revived</a> the study of such objects in recent years. They arise in many different interesting contexts, such as numerical analysis (especially diophantine approximation) and algebraic number theory. For example, let <img src='http://l.wordpress.com/latex.php?latex=A+%5Cin+%5Ctext%7BSL%7D%28n%2C%5Cmathbb%7BZ%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A \in \text{SL}(n,\mathbb{Z})' title='A \in \text{SL}(n,\mathbb{Z})' class='latex' /> be a matrix with irreducible characteristic equation, and all eigenvalues real and positive. There is a basis for <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5En&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{R}^n' title='\mathbb{R}^n' class='latex' /> consisting of eigenvalues, spanning a convex cone <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V' title='V' class='latex' />. The cone &#8212; and therefore its sail &#8212; is invariant under <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='A' title='A' class='latex' />; moreover, there is a <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D%5E%7Bn-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Z}^{n-1}' title='\mathbb{Z}^{n-1}' class='latex' /> subgroup of <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7BSL%7D%28n%2C%5Cmathbb%7BZ%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\text{SL}(n,\mathbb{Z})' title='\text{SL}(n,\mathbb{Z})' class='latex' /> consisting of matrices with the same set of eigenvectors; this observation follows from Dirichlet&#8217;s theorem on the units in a number field, and is due to <a href="http://www.ams.org/mathscinet-getitem?mr=0721966">Tsuchihashi</a>. This abelian group acts freely on the sail with quotient a (topological) torus of dimension <img src='http://l.wordpress.com/latex.php?latex=n-1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n-1' title='n-1' class='latex' />, together with a &#8220;canonical&#8221; cell decomposition. This connection between number theory and combinatorics is quite mysterious; for example, Arnold asks: which cell decompositions can arise? This is unknown even in the case <img src='http://l.wordpress.com/latex.php?latex=n%3D3&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n=3' title='n=3' class='latex' />.</p>
<p>The most interesting aspect of this correspondence, between stable commutator length and sails, is that it allows one to introduce <em>parameters</em>. An element in a free group <img src='http://l.wordpress.com/latex.php?latex=F_2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='F_2' title='F_2' class='latex' /> can be expressed as a word in letters <img src='http://l.wordpress.com/latex.php?latex=a%2Cb%2Ca%5E%7B-1%7D%2Cb%5E%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a,b,a^{-1},b^{-1}' title='a,b,a^{-1},b^{-1}' class='latex' />, e.g. <img src='http://l.wordpress.com/latex.php?latex=aab%5E%7B-1%7Db%5E%7B-1%7Da%5E%7B-1%7Da%5E%7B-1%7Da%5E%7B-1%7Dbbbbab%5E%7B-1%7Db%5E%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='aab^{-1}b^{-1}a^{-1}a^{-1}a^{-1}bbbbab^{-1}b^{-1}' title='aab^{-1}b^{-1}a^{-1}a^{-1}a^{-1}bbbbab^{-1}b^{-1}' class='latex' />, which is usually abbreviated with exponential notation, e.g. <img src='http://l.wordpress.com/latex.php?latex=a%5E2b%5E%7B-2%7Da%5E%7B-3%7Db%5E4ab%5E%7B-2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^2b^{-2}a^{-3}b^4ab^{-2}' title='a^2b^{-2}a^{-3}b^4ab^{-2}' class='latex' />. Having introduced this notation, one can think of the exponents as parameters, and study stable commutator length in families of words, e.g. <img src='http://l.wordpress.com/latex.php?latex=a%5E%7B2%2Bp%7Db%5E%7B-2%2Bq%7Da%5E%7B-3-p%7Db%5E%7B4-q%7Dab%5E%7B-2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='a^{2+p}b^{-2+q}a^{-3-p}b^{4-q}ab^{-2}' title='a^{2+p}b^{-2+q}a^{-3-p}b^{4-q}ab^{-2}' class='latex' />. Under the correspondence above, the parameters only affect the coefficients of the linear map <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='h' title='h' class='latex' />, and therefore one obtains families of polyhedral cones <img src='http://l.wordpress.com/latex.php?latex=V_A%28p%2Cq%2C%5Ccdots%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='V_A(p,q,\cdots)' title='V_A(p,q,\cdots)' class='latex' /> whose extremal rays depend linearly on the exponent parameters. This lets one prove many facts about the stable commutator length spectrum in a free group, including:</p>
<p><strong>Theorem: </strong>The image of a nonabelian free group of rank at least <img src='http://l.wordpress.com/latex.php?latex=4&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='4' title='4' class='latex' /> under scl in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%2F%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{R}/\mathbb{Z}' title='\mathbb{R}/\mathbb{Z}' class='latex' /> is precisely <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D%2F%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Q}/\mathbb{Z}' title='\mathbb{Q}/\mathbb{Z}' class='latex' />.</p>
<p>and</p>
<p><strong>Theorem:</strong> For each <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='n' title='n' class='latex' />, the image of the free group <img src='http://l.wordpress.com/latex.php?latex=F_n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='F_n' title='F_n' class='latex' /> under scl contains a well-ordered sequence of values with ordinal type <img src='http://l.wordpress.com/latex.php?latex=%5Comega%5E%7B%5Clfloor+n%2F4+%5Crfloor%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\omega^{\lfloor n/4 \rfloor}' title='\omega^{\lfloor n/4 \rfloor}' class='latex' />. The image of <img src='http://l.wordpress.com/latex.php?latex=F_%5Cinfty&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='F_\infty' title='F_\infty' class='latex' /> contains a well-ordered sequence of values with ordinal type <img src='http://l.wordpress.com/latex.php?latex=%5Comega%5E%5Comega&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\omega^\omega' title='\omega^\omega' class='latex' />.</p>
<p>One can also say things about the precise dependence of scl on parameters in particular families. More conjecturally, one would like to use this correspondence to say something about the <em>statistical</em> distribution of scl in free groups. Experimentally, this distribution appears to obey power laws, in the sense that a given (reduced) fraction <img src='http://l.wordpress.com/latex.php?latex=p%2Fq&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p/q' title='p/q' class='latex' /> appears in certain infinite families of elements with frequency proportional to <img src='http://l.wordpress.com/latex.php?latex=q%5E%7B-%5Cdelta%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='q^{-\delta}' title='q^{-\delta}' class='latex' /> for some power <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\delta' title='\delta' class='latex' /> (which unfortunately depends in a rather opaque way on the family). Such power laws are reminiscent of Arnold tongues in dynamics, one of the best-known examples of <em>phase locking</em> of coupled nonlinear oscillators. Heuristically one expects such power laws to appear in the geometry of &#8220;random&#8221; sails &#8212; this is explained by the fact that the (affine) geometry of a sail depends only on its <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7BSL%7D%28n%2C%5Cmathbb%7BZ%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\text{SL}(n,\mathbb{Z})' title='\text{SL}(n,\mathbb{Z})' class='latex' /> orbit, and the existence of invariant measures on a natural moduli space; see e.g. <a href="http://www.ams.org/mathscinet-getitem?mr=1733869">Kontsevich and Suhov</a>. The simplest example concerns the (<img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='1' title='1' class='latex' />-dimensional) cone spanned by a random integral vector in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\mathbb{Z}^2' title='\mathbb{Z}^2' class='latex' />. The <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7BSL%7D%282%2C%5Cmathbb%7BZ%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\text{SL}(2,\mathbb{Z})' title='\text{SL}(2,\mathbb{Z})' class='latex' /> orbit of such a vector depends only on the gcd of the two co-ordinates. As is easy to see, the probability distribution of the gcd of a random pair of integers <img src='http://l.wordpress.com/latex.php?latex=p%2Cq&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p,q' title='p,q' class='latex' /> obeys a power law: <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bgcd%7D%28p%2Cq%29+%3D+n&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\text{gcd}(p,q) = n' title='\text{gcd}(p,q) = n' class='latex' /> with probability <img src='http://l.wordpress.com/latex.php?latex=%5Czeta%282%29%5E%7B-1%7D%2Fn%5E2&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\zeta(2)^{-1}/n^2' title='\zeta(2)^{-1}/n^2' class='latex' />. The rigorous justification of the power laws observed in the scl spectrum of free groups remains the focus of current research by myself and my students.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Small Worlds]]></title>
<link>http://jsaia.wordpress.com/2009/07/17/small-worlds/</link>
<pubDate>Fri, 17 Jul 2009 17:28:05 +0000</pubDate>
<dc:creator>Jared</dc:creator>
<guid>http://jsaia.wordpress.com/2009/07/17/small-worlds/</guid>
<description><![CDATA[Piere Fraigniaud and George Giakkoupis from the University of Paris at Diderot have a really nice pa]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Piere Fraigniaud and George Giakkoupis from the University of Paris at Diderot have a really nice paper in this upcoming Principles of Distributed Computing (PODC). Their paper titled <a href="http://www.dcl.utoronto.ca/%7Eggiak/papers/podc09-ggiak-full.pdf">&#8220;The Effect of Power-Law Degrees on the Navigability of Small Worlds&#8221;</a> builds on the classic paper by Jon Kleinberg on navigating small world networks.</p>
<p>Jon Kleinberg&#8217;s classic <a href="http://http//www.cs.cornell.edu/home/kleinber/swn.pdf">paper</a> concerns navigation in a grid network where each node, in addition to its local edges, has one additional long range edge. What Kleinberg showed is that provided that, for each node, this long range link covers a distance d with probability proportional to d^2, then a greedy routing algorithm will ensure that any node can reach any other node in the network within no more than about log^2 n hops where n is the number of nodes in the network[1]. Moreover, the exponent in this probability is pretty important, even a slight deviation from an exponent of 2 results in networks that can not be efficiently navigated by greedy algorithms. In this way, Kleinberg was thus one of the first people to describe a type of network that might mimic the social network that allowed quick routing in Stanley Milgram&#8217;s famous <a href="http://en.wikipedia.org/wiki/Six_degrees_of_separation">six degrees of separation</a> experiments.</p>
<p>So what about this new paper by Piere and George? Well for many years the exponent of 2 in log^2 has bothered people. Piere and George show that it is possible to get rid of it with power laws. In particular, they show that if instead of each node having exactly 1 long distance link, the number of long distance links per node follows a certain powerlaw distribution, then greedy routing works in about log n hops. A powerlaw distribution means that the number of nodes with a number of long distance links x is proportional to x^k for some fixed constant k. This is a so called heavy tail distribution which occurs in many natural complex systems. Surprisingly, Piere and George show that the type of powerlaw distribution for which greedy routing works is when k is in the range between 2 and 3, which is very similar to the exponent one observes for degree distributions in many naturally occuring social networks.</p>
<p>As far as I know this is one of the first papers that suggests a nice <span style="font-weight:bold;">functional</span> property of powerlaw distributions. In particular, it shows that powerlaw distributions are more powerful than other distributions in achieving a specific mathematical goal. Are there other algorithmic or mathematical problems that powerlaw distributions are &#8220;good&#8221; for. It looks like a very nice paper.</p>
<p>[1] I&#8217;m using the term about to meet O(log^2 n) or roughly a function that grows like C log^2 n for some fixed constant C.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Todo lo que siempre quisiste saber sobre leyes de potencia]]></title>
<link>http://francisthemulenews.wordpress.com/2009/06/21/todo-lo-que-siempre-quisiste-saber-sobre-leyes-de-potencia/</link>
<pubDate>Sun, 21 Jun 2009 09:14:20 +0000</pubDate>
<dc:creator>emulenews</dc:creator>
<guid>http://francisthemulenews.wordpress.com/2009/06/21/todo-lo-que-siempre-quisiste-saber-sobre-leyes-de-potencia/</guid>
<description><![CDATA[Las leyes de potencia están de moda a la hora de interpretar datos experimentales. Pero las leyes de]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Las leyes de potencia están de moda a la hora de interpretar datos experimentales. Pero las leyes de potencia son un arma de doble filo. No es fácil estimar sus parámetros utilizando un estimador de máxima verosimilitud. Códigos en Matlab y en R que te permiten estimar los parámetros de una ley de potencias así como calcular la bondad de dicho ajuste los puedes encontrar en <a href="http://www.santafe.edu/~aaronc/powerlaws/" target="_blank">esta página web</a>, resultado de un artículo que se publicará en la prestigiosa SIAM Review, &#8220;<a href="http://arxiv.org/abs/0706.1062" target="_blank">Power-law distributions in empirical data</a>,&#8221; de Aaron Clauset, Cosma Rohilla Shalizi, y M. E. J. Newman, disponible en ArXiv desde junio de 2007 (last revised 2 Feb 2009).</p>
<p>El código más interesante <strong>plfit.m</strong> no funciona en la versión 6 de Matlab, requiriendo al menos la versión 7 (los cambios para adaptarlo a la versión 6 no son difíciles de hacer pero hay que hacerlos). Es un código lento pero ni mucho menos tan lento como <strong>plvar.m</strong> que estima el error en los parámetros del ajuste y <strong>plpva.m</strong> que determina el valor p del ajuste mediante un test de Kolmogorov-Smirnov (si p&#60;0.1 la ley de potencias es un pésimo ajuste a los datos). Estos últimos comandos repiten 1000 veces un cálculo que tarda decenas de segundos. Así que hay que tomárselo con mucha tranquilidad.</p>
<p>Mucha gente afirma que las citas de artículos científicos de un investigador siguen una ley de potencia. Uno de los ejemplos del artículo de Clauset et al. para las citas totales en el ISI WOS (Web of Science) a todos los artículos publicados en una serie de años encuentra un valor de p=0,2 que indica cierta evidencia, pero afirma que una ley de potencias truncada (con un corte) obtiene un p=0,87 (un valor muy bueno). ¿Qué pasará con un autor individual? He buscado en el ISI WOS las publicaciones y su número de citas (a día de hoy) de varios investigadores para comprobar si con los programas de Clauset et al. es válida la hipótesis de que siguen una ley de potencias. Los resultados hasta el año 2008 (inclusive) son los siguientes:</p>
<p>- Edward Witten que tiene 271 artículos y un índice h de 120; obtenemos alpha =1,89 +/- 0,38, con xmin =102 +/- 133, lo que claramente nos hace dudar de la validez de la ley de la potencia, de hecho lo confirma un p=0,001.</p>
<p>- Jorge E. Hirsch, inventor del índice h, que tiene 215 artículos y un índice h de 52; obtenemos un alpha=2,11 +/- 0,29, con xmin=36 +/- 20, un un valor p=0,028. Tampoco sigue una ley de potencia.</p>
<p>- Un investigador español senior con 204 artículos y un índice h de 12; obtenemos alpha=2,33 +/- 0,59, con xmin=4 +/-3, y un p=0,001.</p>
<p>- Un investigador español joven con 48 artículos y un índice h de 8; obtenemos alpha=1,83 +/- 0,50, con xmin=3 +/-4, y un p=0,25.</p>
<p>Son solo 4 ejemplos, pero parece claro que las leyes de potencia no describen bien las citas de investigadores.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Some animals can bet at the Stock Exchange while beating the market up to ten times]]></title>
<link>http://chemoton.wordpress.com/2009/06/03/some-animals-can-bet-at-the-stock-exchange-while-beating-the-market-up-to-ten-times/</link>
<pubDate>Wed, 03 Jun 2009 15:44:19 +0000</pubDate>
<dc:creator>Vitorino Ramos</dc:creator>
<guid>http://chemoton.wordpress.com/2009/06/03/some-animals-can-bet-at-the-stock-exchange-while-beating-the-market-up-to-ten-times/</guid>
<description><![CDATA[Last year, at the beginning of October I decided to dedicate my second post on financial markets (I,]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p style="text-align:justify;">Last year, at the beginning of October I decided to dedicate my second post on financial markets (<a href="http://chemoton.wordpress.com/2008/09/25/on-financial-markets/" target="_blank">I</a>, <a href="http://chemoton.wordpress.com/2008/10/09/black-swan-financial-markets-part-ii/" target="_blank">II</a>) to <span style="text-decoration:underline;"><em>Black Swans</em></span>. Swans are beautiful animals, but while white swans are vulgar and omnipresent at every pond, black swans are rare! Meanwhile, 2 days ago (June 1) the <em><a href="http://online.wsj.com/article/SB124380234786770027.html" target="_blank">Wall Street Journal</a></em> comes with this very awkward - and by all means for that precise reason - interesting article written by journalist Scott Patterson, where Mr. Taleb&#8217;s name pops-up again (image below).</p>
<p style="text-align:justify;">Well, &#8230; let&#8217;s face it: you could put your money in the bank and have &#8211; let&#8217;s say &#8211; a 3% revenue at the end of your fiscal year. Or you could apply it to raise a new fancy <em>gourmet</em> restaurant at your local vicinity. Restaurants and local food stores are known to have 5-7% revenues in one year, not to speak on the immense burden they represent as well as for some associated risks &#8211; specially these days. But then you may think &#8211; better than banks, right? <a href="http://chemoton.wordpress.com/2008/10/24/i-made-a-mistake/" target="_blank">Right</a>! Or, just to give you another example on this increasing scale - raising a little bit the risk -, on the other hand you could apply your money in stock markets. Main financial indexes (Dow Jones, NASDAQ, etc) are known to have an annual average revenue of 10-12% (since 1918). Not these days of course, where high volatility and entropy in the markets are installed. Well,  emergent countries like China are raising themselves at 12%/year also. We could go on and on with so many other examples. Some say that Eolic parks could achieve 40%. Normally the cost of one eolic tower is around 1 million euros, which could be paid back after one year producing energy trough wind at normal operating conditions. The rest are maintenance costs, as well as initial investment in terrains, etc. So, what&#8217;s new? Consider this. For moments imagine yourself having 100% in revenues, just last year, at this precise dramatic context. That&#8217;s 10 times what the market does in regular years, 20 times what your favorite restaurant does. Moreover, there is a substantial difference between all these examples. If you keep dropping money at the restaurant (for instance the revenue you have earned in the last year), still liquid revenues will be the same in the next year (unless you open a new dinner room next to the first one, while the awful burden keeps increasing). Some business are static and linear in time while others are exponential. As <em>Alice</em> in the wonderland, you will need to keep running twice as faster in order to be at the same place. Amazing those differences, no? Well, not for those &#8220;lovely&#8221; animal creatures known as <em>Black Swans</em>. According to Patterson, &#8230; <em>Funds run by Universa, which is managed and owned by Mr. Taleb&#8217;s long-time collaborator Mark Spitznagel, last year gained more than 100% thanks to its bearish bets. Universa now runs about $6 billion, up from the $300 million it began with in January 2007</em>. Excerpts from the <em>Wall Street Journal</em> article (<em><span style="text-decoration:underline;">Black Swan Fund Makes a Big Bet on Inflation</span></em>) follow below. So, why the hell I do not feel at all surprised by this?! Really, I am not. Let me just say, I do have my own reasons:</p>
<p style="text-align:justify;"><img class="alignleft size-full wp-image-779" title="Nassim Nicholas Taleb - Black Swan author" src="http://chemoton.wordpress.com/files/2009/06/nassim-nicholas-taleb.gif" alt="Nassim Nicholas Taleb - Black Swan author" width="124" height="212" />  [...] A hedge fund firm that reaped huge rewards betting against the market last year is about to open a fund premised on another wager: that the massive stimulus efforts of global governments will lead to hyperinflation. The firm, Universa Investments L.P., is known for its ties to gloomy investor Nassim Nicholas Taleb, author of the 2007 bestseller &#8220;The Black Swan,&#8221; which describes the impact of extreme events on the world and financial markets.</p>
<p style="text-align:justify;">Funds run by Universa, which is managed and owned by Mr. Taleb&#8217;s long-time collaborator Mark Spitznagel, last year gained more than 100% thanks to its bearish bets. Universa now runs about $6 billion, up from the $300 million it began with in January 2007. Earlier this year, Mr. Spitznagel closed several funds to new investors&#8230;.</p>
<p style="text-align:justify;">Mr. Taleb doesn&#8217;t have an ownership interest in the Santa Monica, Calif., firm, but he has significant investments in it and helps shape its strategies. The term &#8220;black swan,&#8221; which has become a market catchphrase in the last few years, alludes to the once-widespread belief in the West that all swans are white. The notion was proven false when European explorers discovered black swans in Australia. A black-swan event, according to Mr. Taleb, is something that is extreme and highly unexpected. &#8230; [...]</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Uncorrelated Risk and Complex Systems]]></title>
<link>http://julianeverett.wordpress.com/2009/06/02/risk-correlation-complex-systems/</link>
<pubDate>Tue, 02 Jun 2009 12:15:23 +0000</pubDate>
<dc:creator>julianeverett</dc:creator>
<guid>http://julianeverett.wordpress.com/2009/06/02/risk-correlation-complex-systems/</guid>
<description><![CDATA[An interesting article in the Wall Street Journal by the father of modern portfolio theory, Harry Ma]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>An <a href="http://online.wsj.com/article/SB122567428153591981.html">interesting article</a> in the Wall Street Journal by the father of <a href="http://en.wikipedia.org/wiki/Modern_portfolio_theory">modern portfolio theory</a>, <a href="http://en.wikipedia.org/wiki/Harry_Markowitz">Harry Markowitz</a>, discussing the current financial crisis. He emphasises the need for diversification across uncorrelated risk as the key to minimising balanced portfolio risk, and highlights its lack as a major factor behind our present predicament. However to my mind this still begs a question of whether any risk can be truly uncorrelated in complex systems. <a href="http://julianeverett.wordpress.com/2008/05/05/the-power-law/">Power law</a> environments such as financial markets are defined by their interconnectedness, and the presence of positive and negative feedback loops which undermine their predictability. That interconnectedness makes identifying uncorrelated risk exceptionally problematic, especially when such correlations have been hidden inside repackaged derivatives and insurance products.</p>
<p>In systems that are intrinsically unpredictable, no risk management framework can ever tell us which decisions to make: that is essentially unknowable. Instead, good risk strategies should direct us towards minimising our liability, in order to <strong><a href="http://julianeverett.wordpress.com/2009/04/23/on-failure/">minimise the cost of failure</a></strong>. If we consider &#8220;current liability&#8221; within the field of software product development as our investment in features that have yet to ship and prove their business value generation capabilities, then this gives us a clear, objective steer towards frequent iterative releases of minimum marketable featuresets and trying to incur as much cost as possible at the point of value generation (i.e. through Cloud platforms, open source software, etc). I think that is the real reason why agile methodologies have been so much more successful that traditional upfront-planning approaches: they allow organisations to be much more efficient at limiting their technology investment liability.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[From Prioritised Backlog to Componentised ROI Model]]></title>
<link>http://julianeverett.wordpress.com/2009/05/20/componentised-roi-model/</link>
<pubDate>Wed, 20 May 2009 11:17:29 +0000</pubDate>
<dc:creator>julianeverett</dc:creator>
<guid>http://julianeverett.wordpress.com/2009/05/20/componentised-roi-model/</guid>
<description><![CDATA[In the last post we argued for a more rigourous, quantitative approach to featureset valuation over ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>In the <a href="http://julianeverett.wordpress.com/2009/05/05/project-prioritisation-is-evil/">last post</a> we argued for a more rigourous, quantitative approach to featureset valuation over the conventional, implicit and overly blunt mechanisms of product backlog prioritisation. We borrowed a simple valuation equation from decision tree analysis to give us a more powerful tool for both managing risk and determining the optimal exercise point for any MMF:</p>
<p><code>Value = (Estimated Generated Value * Estimated Risk) - Estimated Cost</code></p>
<p>where </p>
<p><code>Estimated Risk = <code>Estimated P</code>roject Risk * <code>Estimated M</code>arket Risk</code></p>
<p>A few comments are worth noting about this equation:<br />
<strong>1.) It contains no time-dependent variable. </strong>The equation simply assumes a standard amortisation period to be agreed with stakeholders (typically 12 to 24 months). Market payback functions and similar are ignored as they introduce complexity and hence risk (as described in more detail below). We are not seeking accuracy per se, but simply enough accuracy to enable us to make the correct implementation decisions.<br />
<strong>2.) It is very simplistic. </strong>Risk management must be <strong>reflexive</strong>: at the most basic level, project risk can be divided into two fundamental groupings:</p>
<ul>
<li>Model-independent risks</li>
<li>Model-related risk</li>
</ul>
<p>The former includes typical factors such as new technologies, staff quality and training, external project dependecies, etc. The latter includes two components: <strong>the inaccuracy of the risk model</strong> and <strong>the incomprehensibility of the risk model</strong>. We start incurring inaccuracy risk as soon as the simplicity of our model is so great that it leads us to make bad decisions or else provides no guidance. MoSCoW prioritisation is a good example of this. On the other hand we start incurring incomprehensibility risk as soon as the risk model is so complex that it is no longer comprehensible by everyone in the delivery team (which will clearly be relative across different teams). The current financial crisis is a large-scale example of a collapse in incomprehensibility risk management. If financial risk models had been reflexive and taken their own complexity into account as a risk factor, then there is no way we would have ended up with situations where cumulative liabilities were only even vaguely understood by financial maths PhDs: if we take a team of twenty people, it is clear that a sophisticated and accurate model that is only understood by one person entails vastly more risk than a simplistic, less accurate model that everyone can follow. We can generalise this in our estimation process as follows:</p>
<p><code>Total Risk = Project Risk * Market Risk * Model Incomprehensibility Risk * Model Inaccuracy Risk</code></p>
<p>or as functions:</p>
<p><code>Total Risk = Risk(Project) * Risk(Market) * Risk(Incomprehensibility(Model)) * Risk(Inaccuracy(Model))</code></p>
<p>Furthermore, given our general human tendency towards overcomplexity for most situations this can be approximated to</p>
<p><code>Total Risk = Risk(Project) * Risk(Market) * Risk(Incomprehensibility(Model))</code></p>
<p><strong>3.) All risk is assigned as a multiplier against Generated Value</strong>, rather than treating delivery risk as an inverse multiplier of Cost. I have had very interesting conversations about this recently with both <a href="http://decision-coach.com/" target="_blank">Chris Matts</a> and some of the product managers I am working with. They have suggested a more accurate valuation might be some variation of:</p>
<p><code>Value = (Estimated Generated Value * Estimated Realisation Risk) - (Estimated Cost / Estimated Delivery Risk)</code></p>
<p>In other words, risks affecting technical delivery should result in a greater risk-adjusted cost rather than a lesser risk-adjusted revenue. This is probably more accurate. However is that level of accuracy necessary? In my opinion at least, no. Firstly it creates a degree of confusion as regards how to differentiate revenue realisation risk and delivery risk: is your marketing campaign launch really manifestly different in risk terms from your software release? If either fails it is going to blow the return on investment model, so I would say fundamentally no. Secondly, I might be wrong but I got the feeling that part of the reticence to accept the simpler equation from our product management was a preference against their revenue forecasts being infected by a thing over which they had no control: namely delivery risk (perhaps a reflection of our general psychological tendency to perceive greater risk in situations where we have no control). However that is a major added benefit in my opinion: it helps break down the traditional divides between &#8220;the business&#8221; and &#8220;IT&#8221;. As the technology staff of Lehman Brothers will now no doubt attest, the only people who aren&#8217;t part of &#8220;the business&#8221; are the people who work for someone else.</p>
<p>For me, this approach creates the missing link between high-level project business cases and the MMF backlog. We start with a high-level return on investment model in the business case, that then gets factored down into componentised return on investement models as part of the MMF valuation process. These ROI components effectively comprise the business level acceptance tests for the business case. The componentised ROI models then drive out the MMF acceptance tests, from which we define our unit tests and then start development. In this way, we complete the chain of red-gree-refactor cycles from the highest level of commercial strategy down to unit testing a few lines of code. The <a href="http://julianeverett.wordpress.com/2008/05/05/the-power-law">scale invariance </a>of this approach I find particularly aestheticly pleasing: it is red-green-refactor for complex systems&#8230;</p>
<p><img class="aligncenter size-full wp-image-70" title="fractal-red-green-refactor" src="http://julianeverett.wordpress.com/files/2009/05/fractal-red-green-refactor.jpg" alt="fractal-red-green-refactor" width="459" height="345" /></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Power Law Revisited]]></title>
<link>http://skylertanner.com/2009/04/30/power-law-revisited/</link>
<pubDate>Thu, 30 Apr 2009 20:44:18 +0000</pubDate>
<dc:creator>Skyler Tanner</dc:creator>
<guid>http://skylertanner.com/2009/04/30/power-law-revisited/</guid>
<description><![CDATA[A really awesome slide When I first stated this law I noted that might add commentary on the notion ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><div class="wp-caption alignnone" style="width: 530px"><img title="Power Law" src="http://blog.ning.com/files/Power%20Law%20Curve.bmp" alt="A really awesome slide" width="520" height="280" /><p class="wp-caption-text">A really awesome slide</p></div>
<p>When I first stated this law I noted that might add commentary on the notion of Power Law Training, specifically the application from Art De Vany. I like the idea, but disagree with the application.<!--more-->To generalize, De Vany is a fan of varied frequency, working out only when you really want to, when you feel good about doing so. He is also a fan of pyramiding weight up while dropping repetitions (15, 8, 4) as a means to fatigue all the fibers in sequence. It all sounds good in theory.</p>
<p>Here are the troubles with this:</p>
<ol>
<li>15 reps isn&#8217;t going to adequately fatigue the Type Ia fibers at all. Slow twitch fibers are most active during aerobic exercise, responding extremely well to work done in the oxidative energy pathway and aren&#8217;t going to be fatigued in 15 reps. To quote Lyle McDonald, &#8221; It takes minutes to fatigue Type I fibers, probably longer, like hours.&#8221;</li>
<li>Intensity, specifically momentary muscular effort, isn&#8217;t the only variable for which the power law can apply.</li>
</ol>
<p>So how can you apply the power law in another way? Here are a few examples:</p>
<ol>
<li>Decrease &#8220;effort&#8221; but increase volume. If my max deadlift is 385, I could do single rep warmup sets until my main workset at 385. The Westside gurus have pointed out that adding more volume to your lower sets might reduce max force output, but you&#8217;re not maxing out. You&#8217;ll move more tonnage for more stimulation and this can be cycled in randomly&#8230;</li>
<li>If you use Chaos Training. The notion that your first &#8220;work&#8221; set becomes the benchmark in your routines and you perform whatever else you want for a given muscle group. As long as the benchmark is improving, you are doing well with the rest of the workout you&#8217;re making up as you go along.</li>
<li>A full Blitz cycle would be taking whatever your normal workload would be and, for a period of no more than 2 weeks, doubling or tripling the workload (both sets and work days). The intent is to push yourself to the brink of overtraining, back off and don&#8217;t train for a week, and you end up overcompensating with more muscle.</li>
<li>A variation for competition would be Pendelay&#8217;s Hormone Manipulation Cycle. Pyramiding up in intensity each week (low, medium, brutally high, brutally high, medium, low, competition) depresses hormones, setting them up for a big rebound for a competition in week 7 of that sequence.</li>
</ol>
<p>The paleo folks (in which I find myself partaking in) are as dogmatic as any other fitness subculture. I might seem like I&#8217;m slaughtering a sacred cow, but there&#8217;s more than one word written about this. Give it a try.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Aufmerksamkeit in der Blogosphäre]]></title>
<link>http://silviawitzig.wordpress.com/2009/04/02/aufmerksamkeit-in-der-blogosphare/</link>
<pubDate>Thu, 02 Apr 2009 08:25:43 +0000</pubDate>
<dc:creator>silviawitzig</dc:creator>
<guid>http://silviawitzig.wordpress.com/2009/04/02/aufmerksamkeit-in-der-blogosphare/</guid>
<description><![CDATA[Um aus der unglaublichen Menge von Blogs hervorzustechen, braucht es Verlinkungen, die regelmässig v]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><img src="http://silviawitzig.wordpress.com/files/2009/04/link-to-me3.gif?w=206" alt="Link to me" title="Link to me" width="206" height="300" class="alignright size-medium wp-image-52" />Um aus der unglaublichen Menge von Blogs hervorzustechen, braucht es Verlinkungen, die regelmässig viele Leser auf die Website eines Blogs bringen.<br />
Die wichtigsten Grundelemente von Blogs sind Blogrolls, Permalinks und Trackback.<br />
Eine Blogroll findet sich auf jedem Blog. In dieser Liste werden Blogs aufgeführt, die vom Autor regelmässig gelesen werden. Blogrolls sind einerseits ein Navigations-Tool und andererseits eine Art der sozialen Anerkennung (<a href="http://alumni.media.mit.edu/~cameron/cv/pubs/04-01.html">Marlow</a>, 2004, S. 3).<br />
Jeder Post erhält eine feste Internetadresse, einen sogenannten Permalink. Damit ist jeder Blog-Beitrag und auch jeder Kommentar eines Blog-Lesers referenzierbar, was das Verlinken auf fremde Blog-Beiträge deutlich vereinfacht (<a href="http://nyblog.de/index.php/tino_seeber_weblogs_gewalt/">Seeber</a>, 2008, S. 19).<br />
<a href="http://sw-guide.de/webdienste-blogging/was-sind-trackbacks/">Trackbacks</a> sind Links zwischen zwei Blog-Posts. Der Autor des zweiten Posts bezieht sich in seinem Text auf den ersten Post. Er kann nun ein Signal an diesen ersten Post senden, welches dort als Kommentar mit Link erscheint. Dadurch können die Leser des Ursprungs-Posts gleich in den Kommentaren sehen, dass andere Blogger über das selbe Thema schrieben.</p>
<p>Diese drei Elemente erklären, wie Blogs vernetzt sind. Welche Blogs nun häufig verlinkt werden und dadurch Aufmerksamkeit erhalten, kann durch die sogenannte Power Law-Verteilung erklärt werden. „We know that power law distributions tend to arise in social systems where many people express their preferences among many options. We also know that as the number of options rise, the curve becomes more extreme“ (<a href="http://www.shirky.com/writings/powerlaw_weblog.html">Shirky</a>, 2003).<br />
In der Blogosphäre hat es nun viele Blogs, wie im letzten Post thematisiert, und es gibt viele Leser, die sich aus diesen Blogs Favoriten auswählen und auf sie verlinken. Man könnte nun eigentlich davon ausgehen, dass die Leser ihre Favoriten frei auswählen und damit die meisten Blogs etwa gleich populär wären. Dem ist jedoch nicht so, denn „people&#8217;s choices do affect one another. If we assume that any blog chosen by one user is more likely, by even a fractional amount, to be chosen by another user, the system changes dramatically. Alice, the first user, chooses her blogs unaffected by anyone else, but Bob has a slightly higher chance of liking Alice&#8217;s blogs than the others. When Bob is done, any blog that both he and Alice like has a higher chance of being picked by Carmen, and so on, with a small number of blogs becoming increasingly likely to be chosen in the future because they were chosen in the past“ (<a href="http://www.shirky.com/writings/powerlaw_weblog.html">Shirky</a>, 2003). Dadurch werden Blogs, die schon von vielen Personen als Favoriten ausgewählt wurden, immer populärer und erhalten viel Aufmerksamkeit. Es kommt zur oben erwähnten Power Law-Verteilung und das heisst, dass ein grosser Anteil der Links auf einen kleinen Anteil der existierenden Blogs eingeht. Diese Blogs werden auch A-Blogs oder A-List Blogs genannt.</p>
<p>Die Blogsuchmaschine <a href="http://technorati.com/">Technorati</a> berechnet einen Autoritätswert, der ebenfalls auf der Verlinkung unter den Blogs beruht. Ausschlaggebend für eine hohe <a href="http://support.technorati.com/faq/topic/71">Technorati Authority</a> ist, wie viele Blogs in den letzten sechs Monaten auf den eigenen Blog verlinkten. Die<a href="http://technorati.com/pop/blogs/"> Top 100 Blogs</a> können bei Technorati angesehen werden. Zur Zeit hat der Blog <a href="http://www.huffingtonpost.com/">The Huffington Post</a> den höchsten Authority-Wert. Dies ist übrigens ein weiteres Beispiel für einen Blog, der dem Bereich Citizen Journalism zugeordnet werden kann.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[La legge dei tag]]></title>
<link>http://motobrowniano.wordpress.com/2009/03/11/la-legge-dei-tag/</link>
<pubDate>Wed, 11 Mar 2009 11:04:49 +0000</pubDate>
<dc:creator>Federico Bo</dc:creator>
<guid>http://motobrowniano.wordpress.com/2009/03/11/la-legge-dei-tag/</guid>
<description><![CDATA[Uno degli aspetti più interessanti della folksonomy, la categorizzazione condivisa  e distribuita tr]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Uno degli aspetti più interessanti della <a href="http://en.wikipedia.org/wiki/Folksonomy" target="_blank"><em>folksonomy</em></a>, la categorizzazione condivisa  e distribuita tramite <em>tag</em>, è che riesce a creare piuttosto rapidamente un set di etichette stabili per ogni contenuto, evidenziato dall&#8217;apparire di una <a href="http://nonlineare.blogspot.com/2008/03/leggi-della-potenza-power-laws.html" target="_blank">legge di potenza</a>: pochi <em>tag</em> sono utilizzati da moltissimi utenti, moltissimi <em>tag </em>sono utilizzati solo da qualche utente. Questo accade a dispetto di diversi  problemi che tale tipo di tecnica comporta: differenze culturali e linguistiche, ambiguità dei termini, acronimi, parole multiple, difficoltà di etichettare dati strutturati come le date, solo per citare i più rilevanti.</p>
<p>Le cause che determinano la stabilizzazione dell&#8217;insieme dei <em>tag</em> e l&#8217;apparire di una <a href="http://en.wikipedia.org/wiki/Power_law" target="_self"><em>power law</em></a> non sono del tutto chiare: i sospetti principali ricadono su un possibile <em>background</em> culturale comune agli utenti e soprattutto sul meccanismo di imitazione, che spinge gli utenti a confermare etichette già presenti. Alcune ricerche si spingono ad ipotizzare che la curva di potenza non apparirebbe se non fosse data agli utenti la possibilità di vedere l&#8217; insieme di <em>tag</em> già definito da altri.</p>
<p><a href="http://arxiv.org/PS_cache/arxiv/pdf/0903/0903.1788v1.pdf" target="_blank">Una ricerca appena pubblicata</a> sembra però smentire tale ipotesi: la legge di potenza emerge anche nel cosiddetto <em>blind tagging</em>, il <em>tagging</em> senza suggerimenti dando più credito alla spiegazione sul<em> background</em> comune degli utenti. Ciò che comunque ancora manca, secondo gli autori della ricerca, è la comprensione dei modelli cognitivi che guidano la capacità di categorizzazione umana, ed in particolare quella tramite il <em>tagging.</em></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Week 5.0: Here comes Shirky (again)!]]></title>
<link>http://johncummins.wordpress.com/2009/02/12/week-50-here-comes-more-shirky/</link>
<pubDate>Thu, 12 Feb 2009 01:25:46 +0000</pubDate>
<dc:creator>vagabondrecon</dc:creator>
<guid>http://johncummins.wordpress.com/2009/02/12/week-50-here-comes-more-shirky/</guid>
<description><![CDATA[Hi everybody, Well I finally finished Shirky and I have to say there were several reasons I enjoyed ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Hi everybody,</p>
<p>Well I finally finished Shirky and I have to say there were several reasons I enjoyed this book immensely.</p>
<ol>
<li>As Jason said to me during a class break, Shirky had no agenda. At least if he did he kept it pretty much to himself and just informed us about the power of organizing.</li>
<li>He relegated Keen to one paragraph on page 209 with an example of Keen complaining about big time professional ad agencies losing money.</li>
<li>He explained the Power law. This is different from Pareto&#8217;s 80/20 law that says, for instance that 20% of the people do 80% of the work. It is more like, in Wikipedia, or another social network, maybe 3 people do 1000 or 10,000 times the other contributors, and yet it all works out. Most people do a below average amount in contributions whilst a few do a huge amount&#8230;but all are needed to make social online groups work. I think many studies could look at this power law.</li>
<li>The cost of organizing groups has dropped to nil with the tools on the net</li>
<li>I like how he explains that the price of failure has dropped to nil as well. It&#8217;s the fact that you can afford to fail that groups can form, or even try to form. Groups like the picture-sharing group from NYC would have had no audience in the past, but now can be seen around the world.</li>
<li>I love the amount of time he spent with meet-up and the fact that meet-up (and groups like it) can actually allow people to connect around common interests (often extremely nichy), geographically in real-life. Hence, the internet can bring virtual groups into actual real-life groups. I experienced this myself locally in the primary election cycle. I actually met up with some folks online and participated in a political parade in Knoxville with people that I met for the first time in real life that day. So, contrary to doomers and gloomers, the internet can foster local geographical community in ways that would have been imposible before the new tools for organization on the net got cheap and attractive.</li>
<li>Shirky is realistic in seeing that the genie is out of the bottle for good and bad and that both good and bad will occur and must be dealt with accordingly.</li>
<li>In the final chapter, Shirky explains the three main components needed to make a group work: the promise, the tool, and the bargain and how these tools must be lined up perfectly in order for an organization or group to &#8220;work&#8221;. It is just good to know!</li>
<li>Oh and also, I don&#8217;t know if Shirky ever said Web 2.0&#8230; and that is a positive at this point, IMNSHO.</li>
</ol>
<p>I thought this book was extremely well written and I will look for more Shirky.  Will you?</p>
<p>John Cummins</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Saturation in the scale-free dependency networks of free software ]]></title>
<link>http://gnowgi.org/2009/02/03/saturation-in-the-scale-free-dependency-networks-of-free-software/</link>
<pubDate>Tue, 03 Feb 2009 14:02:10 +0000</pubDate>
<dc:creator>gnowgi</dc:creator>
<guid>http://gnowgi.org/2009/02/03/saturation-in-the-scale-free-dependency-networks-of-free-software/</guid>
<description><![CDATA[As reported in my previous post on Debian Dependency Maps we started to study the properties of depe]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>As reported in my previous post on <a title="Debian Dependency maps" href="http://gnowgi.org/2008/12/16/debian-dependency-map/" target="_blank">Debian Dependency Maps</a> we started to study the properties of dependency relation and the kind of networks the relation can generate.  One preliminary study we (me along with Arnab K. Ray and Rajiv Nair) posit a  nonlinear model for the global analysis of data pertaining to the semantic network of a complex operating system (free and open-source software). While the distribution of links in the dependency network of this system is scale-free for the intermediate nodes, we found that the richest nodes deviate from this trend, and exhibit a nonlinearity-induced saturation effect. This also distinguishes the two directed networks of incoming and outgoing links from each other. The initial condition for a dynamic model, evolving towards the steady dependency distribution, determines the saturation properties of the mature scale-free network.</p>
<p>Here I give some of the motivations on conducting this study and some conclusions.<br />
The full paper with all the technical details is uploaded at <a title="full paper" href="http://arxiv.org/abs/0901.4904" target="_blank">arxiv.org</a>.</p>
<p>Scale-free distributions in complex networks have been very well studied by now. The ubiquity of scale-free properties is quite noteworthy, and spans across vastly  diverse domains like (to name a few) the World Wide Web and the Internet, the social, ecological, biological and linguistic networks, income and wealth distributions, trade and business networks, and semantic networks.</p>
<p>It should occasion no surprise, therefore, that further developments have led to the discovery of scale-free features in the architecture of computer software as well. A recent<br />
work  has shown that the structure of object-oriented software is a heterogeneous network characterised by a power-law distribution.  More in keeping with the purpose of this present paper, an earlier work on complex networks in software engineering had found evidence of power-law behaviour in the inter-package dependency networks in free and open-source software (FOSS).  It is a matter of common knowledge that when it comes to installing a software package from the  <a title="Debian GNU/Linux" href="http://www.debian.org/" target="_blank">Debian GNU/Linux</a> repository, many other packages — the “dependencies” — are also called for as prerequisites. This leads to a network of these dependencies, and every such package may be treated as a node in a network of dependency relationships. Each dependency relationship connecting any two packages (nodes) is treated as a link (an edge), and every link establishes a relation between a prior package and a posterior package, whereby the functions defined in the<br />
prior package are called in the posterior package. This enables reuse (economy) of functions and eliminates duplicate development. As a result the whole operating system emerges as a coherent and stable semantic network. However, unlike other semantic networks, the network of nodes in the Debian repository is founded on a single relation spanning across all its nodes: <em>Y</em> depends on <em>X</em>; its inverse, <em>X</em> is required for <em>Y</em> .<br />
So, given any particular node, its links (the relations with other nodes) can be of two types — incoming links and outgoing links — as a result of which, there will arise two distinct kinds of directed network.  For the network of incoming links, a newly-reported work  has empirically established the relevance of Zipf’s law and the conditions attendant on it in Debian GNU/Linux distribution. Carrying further along these very lines, the present study purports to analyse and model the finite-size effects in a FOSS network. There is a general appreciation that for any system with a finite size, the power-law trend is not manifested indefinitely, and in the context of the FOSS network, this is a matter that is recognised as one worthy of a more thorough investigation. Deviations from the power-law trend appear for both the heavily-linked and the sparsely-linked nodes. The former case corresponds to the distribution of a disproportionately high number of links connected to a very few special nodes — the so-called “top nodes” (or rich nodes).  The importance of these nodes is, therefore, a self-evident fact.</p>
<p>The data needed for the modelling pertain to the current stable Debian release, Etch (Debian GNU/Linux 4.0).  The respective networks of both the incoming links and the outgoing links span 18630 nodes (software packages).<br />
The study argues for the signiﬁcance of non-linearity and saturation, as regards a quantitative characterisation of the incoming and outgoing distribution in the Debian GNU/Linux network.  One might rightly expect to encounter similar features in other networks.  And indeed, given the possibility that the entire network of software packages in an operating system can be construed to be a semantic (albeit non-autonomous) system, its characteristics can furnish a model that can shed light on much more complex but realistic autonomous semantic and cognitive systems, such as the human society, or<br />
even the human mind.</p>
<p>In the road ahead, the gnowledge lab will conduct a similar study for the dependencies between concepts and activities as and when we obtain sufficient number of nodes at <a title="gnowledge network" href="http://www.gnowledge.org" target="_blank">gnowledge.org</a>.  Currently we have only about 1000 dependency relations.  As more people get to know about the need of establishing dependency relation between concepts, and as and when the portal itself matures with features to attract more users we can study the properties of the resulting knowledge network.</p>
<p>The full paper with all the technical details is uploaded at <a title="Full paper" href="http://arxiv.org/abs/0901.4904" target="_blank">arxiv.org</a>.</p>
</div>]]></content:encoded>
</item>

</channel>
</rss>
