<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>computational-linguistics &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/computational-linguistics/</link>
	<description>Feed of posts on WordPress.com tagged "computational-linguistics"</description>
	<pubDate>Sun, 27 Dec 2009 14:16:08 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[Hearing Impared Software For The Deaf Online]]></title>
<link>http://businessdevelopmentinfo.wordpress.com/2009/12/09/hearing-impared-software-for-the-deaf-online/</link>
<pubDate>Wed, 09 Dec 2009 16:52:40 +0000</pubDate>
<dc:creator>Bruce Oliver - Multiple Stream Income Approach™</dc:creator>
<guid>http://businessdevelopmentinfo.wordpress.com/2009/12/09/hearing-impared-software-for-the-deaf-online/</guid>
<description><![CDATA[Software For The Deaf Online by Benedict Perez (hosted by Bruce Oliver, http://multiplestreamincomea]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><h2>Software For The Deaf Online</h2>
<p><em>by Benedict Perez (hosted by Bruce Oliver, <a href="http://multiplestreamincomeapproach.com">http://multiplestreamincomeapproach.com</a> )</em></p>
<p>Software for the deaf is proliferating in the market either online or offline. Yes there are now many different types of software for the hearing impaired. Manufacturers of these kinds of software saw the need for people hounded by this physical handicap. Thanks to their noble purpose of having to manufacturer software that does not only make lives of the hearing impaired easier but also aides those in learning anything they wish to learn.</p>
<p>Yes, you can now have the wonders brought about by modern technology if you are one of the many individuals having problem with their sense of hearing. Hearing impaired all over the world can now enjoy and learn things other than the traditional braille. They can now have the modern touch to learn and enjoy things sans their sense of hearing by having the perfect software for the deaf that perfectly addresses their needs.</p>
<p>You just need to be extra cautious in having to choose these kinds of software. This is especially so if you so desire to purchase them online. Software for the deaf is now so easy to find. Many of these now abound the market especially in the internet.</p>
<p>While it would be much fast and easy to look and purchase software for the deaf online, it also tags along a lot of risk. Well not the kind of risk you may have in mind. Having it from just about any of the websites you will bump in while browsing for it can be that risky.</p>
<p>Risky in the sense that you could lose money if you will not be extra careful with it. It is therefore advised that you should make sure to have this kind of software only from online vendors known for good quality products that are also proven efficient. You can easily identify these vendors by carefully scrutinizing their products and observing the kind of customer care it has. These are just a few of the many things you need to look out for in purchasing software for the deaf.</p>
<p><em><br />
About the Author:</em></p>
<p>You can find available in the market some <a href="http://www.alleducationalsoftware.com/software-for-the-deaf.html">software for the deaf</a>. This has been intentionally created for those hearing impaired individuals so that they learn things like normal individuals do. You should also try <a href="http://www.alleducationalsoftware.com/learn-to-type.html">learn to type software</a> for both young and adults.</p>
<p><code> </code></p>
<p><code> </code></p>
<p><code> </code></p>
<p><code> </code></p>
<p><code> </code></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Embodiment and Metaphor]]></title>
<link>http://apperceptual.wordpress.com/2009/10/06/embodiment-and-metaphor/</link>
<pubDate>Tue, 06 Oct 2009 19:11:57 +0000</pubDate>
<dc:creator>Peter Turney</dc:creator>
<guid>http://apperceptual.wordpress.com/2009/10/06/embodiment-and-metaphor/</guid>
<description><![CDATA[I recently wrote about Criticisms of Lakoff’s Theory of Metaphor and how some of these criticisms ca]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I recently wrote about <a title="http://apperceptual.wordpress.com/2009/08/23/criticisms-of-lakoffs-theory-of-metaphor/" href="http://apperceptual.wordpress.com/2009/08/23/criticisms-of-lakoffs-theory-of-metaphor/">Criticisms of Lakoff’s Theory of Metaphor</a> and how some of these criticisms can be addressed by integrating the work of <a title="http://apperceptual.wordpress.com/2009/09/24/lakoff-and-gentner/" href="http://apperceptual.wordpress.com/2009/09/24/lakoff-and-gentner/">Lakoff and Genter</a>. Perhaps the most controversial aspect of <a title="http://en.wikipedia.org/wiki/Conceptual_blending#Conceptual_Metaphor_Theory" href="http://en.wikipedia.org/wiki/Conceptual_blending#Conceptual_Metaphor_Theory">conceptual metaphor theory</a> (CMT) is the idea of <a title="http://en.wikipedia.org/wiki/Embodied_cognition" href="http://en.wikipedia.org/wiki/Embodied_cognition">embodied cognition</a>, that metaphor is grounded in the sensory experiences of our bodies. I went looking for experimental evaluations of this idea and found some interesting papers.</p>
<p><!--more--></p>
<p>The Boston Globe recently had a story on this topic:</p>
<blockquote><p><a title="http://www.boston.com/bostonglobe/ideas/articles/2009/09/27/thinking_literally/?page=full" href="http://www.boston.com/bostonglobe/ideas/articles/2009/09/27/thinking_literally/?page=full">Thinking literally: The surprising ways that metaphors shape your world</a><br />
Drake Bennett</p>
<p>Drawing on philosophy and linguistics, cognitive scientists have begun to see the basic metaphors that we use all the time not just as turns of phrase, but as keys to the structure of thought. By taking these everyday metaphors as literally as possible, psychologists are upending traditional ideas of how we learn, reason, and make sense of the world around us.</p></blockquote>
<p>Several researchers are mentioned, but no references are given, so I tracked down some of the references:</p>
<blockquote><p><a title="http://www.sciencemag.org/cgi/content/full/322/5901/606" href="http://www.sciencemag.org/cgi/content/full/322/5901/606">Experiencing Physical Warmth Promotes Interpersonal Warmth</a><br />
Lawrence E. Williams and John A. Bargh</p>
<p>&#8230; we hypothesized that experiences of physical warmth (or coldness) would increase feelings of interpersonal warmth (or coldness), without the person&#8217;s awareness of this influence &#8230;</p>
<p>Contemporary cognitive linguists have advanced similar arguments that people conceptualize their internal, mental worlds by analogy to the physical world (9–13). Applied to the question of how warm objects can produce the same affective states as a &#8220;warm&#8221; person, embodiment theorists have noted how objects and events that produce the same quality of affective response are associated (categorized) together in memory (14).</p>
<p>In summary, experiences of physical temperature per se affect one&#8217;s impressions of and prosocial behavior toward other people, without one&#8217;s awareness of such influences.</p></blockquote>
<blockquote><p><a title="http://www.rotman.utoronto.ca/geoffrey.leonardelli/inpressPS.pdf" href="http://www.rotman.utoronto.ca/geoffrey.leonardelli/inpressPS.pdf">Cold and Lonely: Does Social Exclusion Literally Feel Cold?</a><br />
Chen-Bo Zhong and Geoffrey J. Leonardelli</p>
<p>Metaphors such as icy stare and cold reception depict social exclusion using cold-related concepts; they are not to be taken literally and certainly do not imply reduced temperature. Two experiments, however, revealed that social exclusion literally feels cold.</p></blockquote>
<blockquote><p><a title="http://www.rotman.utoronto.ca/facBios/file/Smell%20of%20Virtue%20Psych%20Sci.pdf" href="http://www.rotman.utoronto.ca/facBios/file/Smell%20of%20Virtue%20Psych%20Sci.pdf">The Smell of Virtue: Clean Scents Promote Reciprocity and Charity</a><br />
Katie Liljenquist, Chen-Bo Zhong, and Adam D. Galinsky</p>
<p>Two experiments demonstrated that clean scents not only motivate clean behavior, but also promote virtuous behavior by increasing the tendency to reciprocate trust and to offer charitable help. Capitalizing on the fact that abstract concepts are often symbolically derived from the concrete environment (Emerson, 1836), our results suggest that olfactory cues can trigger virtuous behaviors that are related to cleanliness at only a symbolic level. The link from cleanliness to virtuous behavior appears to be a nonconscious one: in neither experiment did participants recognize an influence of scent on their behavior, and in Experiment 2, perceived cleanliness did not differ by condition nor correlate with the effects.</p></blockquote>
<blockquote><p><a title="http://www.igroup.org/schubert/papers/jostmann_lakens_schubert_ps_2009.pdf" href="http://www.igroup.org/schubert/papers/jostmann_lakens_schubert_ps_2009.pdf">Weight as an Embodiment of Importance</a><br />
Nils B. Jostmann, Daniël Lakens, Thomas W. Schubert</p>
<p>Four studies show that the abstract concept of importance is grounded in bodily experiences of weight. Participants provided judgments of importance while they held either a heavy or a light clipboard. Holding a heavy clipboard increased judgments of monetary value (Study 1), and made participants consider fair decision-making procedures to be more important (Study 2). It also caused more elaborate thinking as indicated by higher consistency between related judgments (Study 3) and by greater polarization between strong versus weak arguments (Study 4). In line with an embodied perspective on cognition, these findings suggest that, similar to how weight makes people invest more physical effort in dealing with concrete objects, weight also makes people invest more cognitive effort in dealing with abstract issues.</p></blockquote>
<p>Here are the home pages of some people who are doing research in this area:</p>
<ul>
<li><a title="http://bargh.socialpsychology.org/" href="http://bargh.socialpsychology.org/">John A. Bargh</a></li>
<li><a title="http://www.casasanto.com/Site/home.html" href="http://www.casasanto.com/Site/home.html">Daniel Casasanto</a></li>
<li><a title="http://leeds-faculty.colorado.edu/lw/" href="http://leeds-faculty.colorado.edu/lw/">Lawrence Williams</a></li>
<li><a title="http://www.rotman.utoronto.ca/facbios/viewfac.asp?facultyid=chenbo.zhong" href="http://www.rotman.utoronto.ca/facbios/viewfac.asp?facultyid=chenbo.zhong">Chen-Bo Zhong</a></li>
<li><a title="http://home.medewerker.uva.nl/n.b.jostmann/" href="http://home.medewerker.uva.nl/n.b.jostmann/">Nils B. Jostmann</a></li>
</ul>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[THAT'S a STEM career?]]></title>
<link>http://stemology.wordpress.com/2009/09/27/thats-a-stem-career/</link>
<pubDate>Sun, 27 Sep 2009 04:23:22 +0000</pubDate>
<dc:creator>Liz</dc:creator>
<guid>http://stemology.wordpress.com/2009/09/27/thats-a-stem-career/</guid>
<description><![CDATA[Here&#8217;s a career that wouldn&#8217;t occur to most kids: cryptographer. Greg Laden&#8217;s blog]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Here&#8217;s a career that wouldn&#8217;t occur to most kids: cryptographer. Greg Laden&#8217;s <a href="http://scienceblogs.com/gregladen/2009/09/appyhay_anniversaryway_yptogra.php?utm_source=combinedfeed&#38;utm_medium=rss">blog</a> features a link to a CIO (that&#8217;s a trade magazine) <a href="http://www.cio.com.au/article/319119/slideshow_cio_blast_from_past_-_60_years_cryptography" target="_blank">slideshow </a>on the history of computer cryptography.</p>
<p>If you know a middle or high school student interested in this sort of thing, they might also like <a href="http://en.wikipedia.org/wiki/Computational_linguistics" target="_blank">computational linguistics</a> &#8211; and there&#8217;s an annual <a href="http://www.naclo.cs.cmu.edu/" target="_blank">contest </a>they can enter. The contest site also has lots of practice questions, which are a lot of fun to work through for anyone who enjoys codes, puzzles and the like.</p>
<p class="getsocial" style="text-align:left;"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1004.png" alt="" /><a title="Add to Facebook" href="http://www.facebook.com/sharer.php?u=http://stemology.wordpress.com/2009/09/27/thats-a-stem-career" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1014.png" alt="Add to Facebook" /></a><a title="Add to Digg" href="http://digg.com/submit?phase=2&#38;url=http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career&#38;title=THAT%27S%20a%20STEM%20career%3F" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1024.png" alt="Add to Digg" /></a><a title="Add to Del.icio.us" href="http://del.icio.us/post?url=http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career&#38;title=THAT%27S%20a%20STEM%20career%3F" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1034.png" alt="Add to Del.icio.us" /></a><a title="Add to Stumbleupon" href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career&#38;title=THAT%27S%20a%20STEM%20career%3F" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1044.png" alt="Add to Stumbleupon" /></a><a title="Add to Reddit" href="http://reddit.com/submit?url=http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career&#38;title=THAT%27S%20a%20STEM%20career%3F" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1054.png" alt="Add to Reddit" /></a><a title="Add to Blinklist" href="http://www.blinklist.com/index.php?Action=Blink/addblink.php&#38;Description=&#38;Url=http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career&#38;Title=THAT%27S%20a%20STEM%20career%3F" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1064.png" alt="Add to Blinklist" /></a><a title="Add to Twitter" href="http://twitter.com/home/?status=THAT%27S%20a%20STEM%20career%3F+%40+http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1074.png" alt="Add to Twitter" /></a><a title="Add to Technorati" href="http://www.technorati.com/faves?add=http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1084.png" alt="Add to Technorati" /></a><a title="Add to Furl" href="http://www.furl.net/storeIt.jsp?u=http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career&#38;t=THAT%27S%20a%20STEM%20career%3F" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1094.png" alt="Add to Furl" /></a><a title="Add to Newsvine" href="http://www.newsvine.com/_wine/save?u=http%3A%2F%2Fstemology.wordpress.com%2F2009%2F09%2F27%2Fthats-a-stem-career&#38;h=THAT%27S%20a%20STEM%20career%3F" target="_blank"><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1104.png" alt="Add to Newsvine" /></a><img style="border:0;margin:0;padding:0;" src="http://getsocialserver.files.wordpress.com/2009/02/gs1114.png" alt="" /></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Lakoff and Gentner]]></title>
<link>http://apperceptual.wordpress.com/2009/09/24/lakoff-and-gentner/</link>
<pubDate>Thu, 24 Sep 2009 15:26:58 +0000</pubDate>
<dc:creator>Peter Turney</dc:creator>
<guid>http://apperceptual.wordpress.com/2009/09/24/lakoff-and-gentner/</guid>
<description><![CDATA[In response to my previous post, Criticisms of Lakoff’s Theory of Metaphor, Cosma Shalizi suggested ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>In response to my previous post, <a title="http://apperceptual.wordpress.com/2009/08/23/criticisms-of-lakoffs-theory-of-metaphor/" href="http://apperceptual.wordpress.com/2009/08/23/criticisms-of-lakoffs-theory-of-metaphor/">Criticisms of Lakoff’s Theory of Metaphor</a>, <a title="http://www.cscs.umich.edu/~crshalizi/notabene/analogy.html" href="http://www.cscs.umich.edu/~crshalizi/notabene/analogy.html">Cosma Shalizi</a> suggested that I should look at some criticisms from <a title="http://scienceblogs.com/mixingmemory/" href="http://scienceblogs.com/mixingmemory/">Chris</a> and <a title="http://www.psych.nyu.edu/murphy/" href="http://www.psych.nyu.edu/murphy/">Murphy</a>:</p>
<ul>
<li><a title="http://scienceblogs.com/mixingmemory/2007/06/explaining_the_war_of_the_meta.php" href="http://scienceblogs.com/mixingmemory/2007/06/explaining_the_war_of_the_meta.php">Explaining the War of the Metaphors</a>, Chris (2007)</li>
<li><a title="http://scienceblogs.com/mixingmemory/2006/11/idioms_metaphors_and_lakoff_oh.php" href="http://scienceblogs.com/mixingmemory/2006/11/idioms_metaphors_and_lakoff_oh.php">Idioms, Metaphors, and Lakoff, Oh My!</a>, Chris (2006)</li>
<li><a title="http://www.psych.nyu.edu/murphy/MetaRep_96.pdf" href="http://www.psych.nyu.edu/murphy/MetaRep_96.pdf">On Metaphoric Representation</a>, Murphy (1996)</li>
</ul>
<p>Here are my thoughts on these criticisms.</p>
<p><!--more--></p>
<p>This is what I make of <a title="http://www.psych.nyu.edu/murphy/MetaRep_96.pdf" href="http://www.psych.nyu.edu/murphy/MetaRep_96.pdf">Murphy&#8217;s paper</a>: What Murphy calls Structural Similarity (Sections 3.3 and 6) is the right approach to metaphor. Murphy&#8217;s idea of Structural Similarity is that (something like) Gentner&#8217;s <a title="http://en.wikipedia.org/wiki/Structure_mapping_engine#Structure_mapping_theory" href="http://en.wikipedia.org/wiki/Structure_mapping_engine#Structure_mapping_theory">Structure Mapping Theory</a> (SMT) [1] is the underlying mechanism for Lakoff&#8217;s Conceptual Metaphor. I support this view [2], as does Gentner [3,4,5]. The other views Murphy talks about, strong and weak metaphoric representation (Sections 3.1 and 3.2), are <a title="http://en.wikipedia.org/wiki/Strawman" href="http://en.wikipedia.org/wiki/Strawman">strawmen</a>.</p>
<p>However, it seems to me that Murphy does not fully understand Gentner&#8217;s work. In Section 3.3, he says, &#8220;The structural similarity view is not a theory of novel verbal metaphor.&#8221; This is wrong. The structural similarity view is <em>exactly</em> a theory of novel verbal metaphor [3]. Murphy doesn&#8217;t seem to understand that the point of Gentner&#8217;s SMT is to transfer knowledge from one domain to another. Murphy seems to believe that the two domains (source and target) must be completely structured before SMT can be applied, and that SMT does not add any new structure. If this were so, then the mapping process in SMT would be no more than an entertaining exercise, rather than a method to generate new knowledge.</p>
<p>Summary: the bulk of Murphy&#8217;s paper is spent on presenting and refuting strawmen. His proposed solution is the right way to go, but he doesn&#8217;t fully understand his own proposal.</p>
<p style="padding-left:30px;">[1] <a title="http://groups.psych.northwestern.edu/gentner/papers/Gentner83.pdf" href="http://groups.psych.northwestern.edu/gentner/papers/Gentner83.pdf">Structure-mapping: A theoretical framework for analogy</a>, Gentner (1983)<br />
[2] <a title="http://jair.org/papers/paper2693.html" href="http://jair.org/papers/paper2693.html">The latent relation mapping engine</a>, Turney (2008)<br />
[3] <a title="http://www.psych.northwestern.edu/psych/people/faculty/gentner/papers/BowdleGentner05.pdf" href="http://www.psych.northwestern.edu/psych/people/faculty/gentner/papers/BowdleGentner05.pdf">The career of metaphor</a>, Bowdle and Gentner (2005)<br />
[4] <a title="http://www.psych.northwestern.edu/psych/people/faculty/gentner/papers/GentnerA2K01.pdf" href="http://www.psych.northwestern.edu/psych/people/faculty/gentner/papers/GentnerA2K01.pdf">Metaphor is like analogy</a>, Gentner, Bowdle, Wolff, and Boronat (2001)<br />
[5] <a title="http://groups.psych.northwestern.edu/gentner/papers/gentner&#38;Bowdle_2008.pdf" href="http://groups.psych.northwestern.edu/gentner/papers/gentner&#38;Bowdle_2008.pdf">Metaphor as structure-mapping</a>, Gentner and Bowdle (2008)</p>
<p>In <a title="http://scienceblogs.com/mixingmemory/2007/06/explaining_the_war_of_the_meta.php" href="http://scienceblogs.com/mixingmemory/2007/06/explaining_the_war_of_the_meta.php">Explaining the War of the Metaphors</a>, Chris writes:</p>
<blockquote><p>&#8220;I and others think that CMT and blending would benefit greatly from research on analogy. If these theories had a testable model of how mappings between conceptual domains are formed, interpreted, and used &#8212; a model that <a title="http://en.wikipedia.org/wiki/Structure_mapping_engine#Structure_mapping_theory" href="http://en.wikipedia.org/wiki/Structure_mapping_engine#Structure_mapping_theory">structure mapping theory</a> can provide &#8212; it would be possible to make some concrete, experimentally testable predictions about things like inference, object recognition, etc.&#8221;</p></blockquote>
<p>I completely agree that CMT (conceptual metaphor theory) would benefit greatly from research on analogy; in particular, the research of <a title="http://groups.psych.northwestern.edu/gentner/index.htm" href="http://groups.psych.northwestern.edu/gentner/index.htm">Gentner</a> and her colleagues.</p>
<p>In <a title="http://scienceblogs.com/mixingmemory/2006/11/idioms_metaphors_and_lakoff_oh.php" href="http://scienceblogs.com/mixingmemory/2006/11/idioms_metaphors_and_lakoff_oh.php">Idioms, Metaphors, and Lakoff, Oh My!</a>, Chris writes:</p>
<blockquote><p>&#8220;Sometimes I forget that not everyone who happens upon this blog today has been reading it from day one (I mean come on, why haven&#8217;t you?). It surprises me, then, when people tell me they&#8217;ve seen no evidence that George Lakoff and Mark Johnson&#8217;s conceptual metaphor theory is, well, wrong.&#8221;</p></blockquote>
<p>I disagree that Lakoff and Johnson&#8217;s theory is (wholly) wrong. Some details may be wrong, but I think that many of the core ideas are correct and furthermore consistent with Gentner&#8217;s theories and experiments.</p>
<p>It seems to me that Chris is arguing two incompatible claims: (1) Gentner&#8217;s theory is the proper foundation for Lakoff&#8217;s theories. (2) Lakoff is wrong. (If not incompatible, these claims are at least dissonant.)</p>
<p>In <a title="http://groups.psych.northwestern.edu/gentner/papers/gentner&#38;Bowdle_2008.pdf" href="http://groups.psych.northwestern.edu/gentner/papers/gentner&#38;Bowdle_2008.pdf">Metaphor as structure-mapping</a>, Gentner and Bowdle write:</p>
<blockquote><p>&#8220;We have suggested that metaphor is like analogy &#8212; that the basic processes of analogy are at work in metaphor. Specifically, we suggest that structural alignment, inference projection, progressive abstraction, and re-representation are employed in the processing of metaphor and simile. This view can help resolve some tensions in the field: for example, on this view, metaphor both reflects parallels (Murphy, 1996) and creates new similarities (Lakoff, 1990) between the domain compared, via structural alignment and candidate inferences, respectively.&#8221;</p></blockquote>
<p>I agree with Gentner that her view &#8220;can help resolve some tensions in the field&#8221;. It puzzles me that Chris simultaneously agrees with Gentner and yet seems to actively work to increase tensions in the field. Instead of attacking Lakoff, why doesn&#8217;t Chris work to integrate Lakoff&#8217;s and Gentner&#8217;s theories, <a title="http://jair.org/papers/paper2693.html" href="http://jair.org/papers/paper2693.html">as I attempt to do</a>?</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[NLP Resources for Ruby]]></title>
<link>http://mendicantbug.com/2009/09/13/nlp-resources-for-ruby/</link>
<pubDate>Sun, 13 Sep 2009 06:28:02 +0000</pubDate>
<dc:creator>Jason Adams</dc:creator>
<guid>http://mendicantbug.com/2009/09/13/nlp-resources-for-ruby/</guid>
<description><![CDATA[There are quite a few well-known libraries for doing various NLP tasks in Java and Python, such as t]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p style="text-align:justify;">There are quite a few well-known libraries for doing various NLP tasks in Java and Python, such as the Stanford Parser (Java) and the Natural Language Toolkit (Python).  For Ruby, there are a few resources out there, but they are usually derivative or not as mature.  By derivative, I mean they are ports from other languages or extensions using code from another language.  And I&#8217;m responsible for two of them! <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<ul>
<li><span style="background-color:#ffffff;"><a href="http://www.deveiate.org/projects/Linguistics/" target="_blank">Ruby Linguistics</a> &#8211; this is one of the more ambitious projects, but is not as mature as NLTK</span>
<ul>
<li><span style="background-color:#ffffff;">interface for WordNet</span></li>
<li><span style="background-color:#ffffff;">Link grammar parser</span></li>
<li><span style="background-color:#ffffff;">some inflection stuff</span></li>
</ul>
</li>
<li><a href="http://rubyforge.org/projects/stanfordparser/" target="_blank">Stanford Parser</a> interface &#8211; uses a Java bridge to access the Stanford Parser library</li>
<li><a href="http://www.markwatson.com/" target="_blank">Mark Watson</a> has a <a href="http://www.markwatson.com/opensource/rubytagger_0.1.1.zip" target="_blank">part of speech tagger</a> [zip], a <a href="http://www.markwatson.com/opensource/rubyreuters_0.1.zip" target="_blank">text categorizer</a> [zip], and <a href="http://www.markwatson.com/opensource/ruby_read_docs.zip" target="_blank">some text extraction utilities</a> [zip], but I haven&#8217;t tried to use them yet</li>
<li><a href="http://github.com/ealdent/lda-ruby" target="_blank">LDA Ruby Gem</a> &#8211; Ruby port of David Blei&#8217;s lda-c library by yours truly
<ul>
<li>Uses Blei&#8217;s c-code for the actual LDA but I include some wrappers to make using it a bit easier</li>
</ul>
</li>
<li><a href="http://github.com/ealdent/uea-stemmer" target="_blank">UEA Stemmer</a> &#8211; Ruby port (again by yours truly) of a conservative stemmer based on Jenkins and Smith&#8217;s <a href="http://www.uea.ac.uk/cmp/research/graphicsvisionspeech/speech/WordStemming" target="_blank">UEA Stemmer</a></li>
<li><a href="http://rubyforge.org/projects/stemmer/" target="_blank">Stemmer gem</a> &#8211; <a href="http://tartarus.org/~martin/PorterStemmer/" target="_blank">Porter stemmer</a></li>
<li><a href="http://www.locknet.ro/projects/ann-ruby-stemmer" target="_blank">Lingua Stemmer </a>- another stemming library, Porter stemmer</li>
<li><a href="http://www.deveiate.org/projects/Ruby-WordNet/" target="_blank">Ruby WordNet </a>- basically what&#8217;s included in Ruby Linguistics</li>
<li><a href="http://sourceforge.net/projects/raspell/" target="_blank">Raspell</a> &#8211; Ruby interface to Aspell spell checker</li>
</ul>
<p style="text-align:justify;">There are also a number of fledgling or orphaned projects out there purporting to be ports or interfaces for various other libraries like Stanford POS Tagger and Named Entity Recognizer.  Ruby (straight Ruby, not just JRuby) can interface just about any Java library using the <a href="http://rjb.rubyforge.org/" target="_blank">Ruby Java Bridge</a> (RJB).  RJB can be a pain, and I could only initialize it once per run (a second attempt never succeeds), so there are some limitations.  But using it, I was able to easily interface with the Stanford POS tagger.</p>
<p style="text-align:justify;">So while there aren&#8217;t terribly many libraries for NLP tasks in Ruby, the availability of interfacing with Java directly widens the scope quite a bit.  You can also incorporate a c library using extensions.</p>
<p style="text-align:justify;">Naturally, if I missed anything, no matter how small, please let me know.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Sentiment analysis seminar]]></title>
<link>http://syntagmata.wordpress.com/2009/09/09/sentiment-analysis-seminar/</link>
<pubDate>Wed, 09 Sep 2009 19:45:46 +0000</pubDate>
<dc:creator>Asad Sayeed</dc:creator>
<guid>http://syntagmata.wordpress.com/2009/09/09/sentiment-analysis-seminar/</guid>
<description><![CDATA[I will attempt to blog some of things I attend during the semester.  One of them is a weekly seminar]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I will attempt to blog some of things I attend during the semester.  One of them is a weekly seminar on sentiment analysis taught by Philip Resnik.  I am in it right now. This is therefore a liveblog and hence not guaranteed to make sense or be complete. Especially the latter, far from it.</p>
<p><strong>Tim Hawes&#8217; thesis and conversational analysis</strong></p>
<p>The first thing we&#8217;re talking about is Tim Hawes work for his Master&#8217;s degree, which he defended just yesterday.  I attended it yesterday before I had even started this blog.  It was about predicting the outcome of Supreme Court (US) cases from transcripts of oral arguments.  This is particularly interesting today, as Philip just mentioned, as Sonia Sotomayor showed up for work today at SCOTUS for the first time.  I proposed by mailing list that one further means of predicting how individual would vote, even if they rarely say anything on the bench (true of some justices) would be their body of writing and argument prior to confirmation.  Philip proposed the use of a mixture model using prior argument, updated as the justice moves through his/her career.</p>
<p>In the case of legal arguments, we have to make some assumptions.  Hawes&#8217; thesis mentioned a couple of textual assumptions: cohesion and coherence.  That is, we assume that there are topical and other elements that evolve through the text in a consistent way.  There are techniques we can use to measure and segment a document based on these kinds of assumptions, such as TextTiling and lexical chaining.</p>
<p>The distinction between cohesion and coherence: the latter is a semantic value that really must be judged by a person&#8212;it&#8217;s about interpretation.  It is possible to have cohesion without coherence. (We mentioned the word &#8220;zeugma&#8221;, look it up.)</p>
<p>The right place to look for discourse analysis from an NLP point of view is to start with Rhetorical Structure Theory (RST).  Daniel Marcu has written on this topic.</p>
<p>Hawes&#8217; thesis had two kinds of results.  One of these was encoded in &#8220;rose diagrams.&#8221;  These are modified pie charts for which each slice varies in radius as though it were a petal of a rose, and each petal is coloured differently in a gradient of shades.  In this representation, we can visualize a large number of things at once.  In the case of SCOTUS, each justice can be represented as a petal, whose colour represents political leaning, radius agreement, and width the number of follow-up turns at questioning.  It&#8217;s a bit of a complicated representation and hard to describe without a diagram, which I&#8217;m not about to do on the fly.</p>
<p>While this form of visualization is quite complicated by itself, it can be used to make contrasts between types of cases and judicial situations, in which case it produces often very strong and visible contrasts.  Contrasts we can examine are between &#8220;liberal&#8221; vs. &#8220;conservative&#8221;, affirm vs. overturn, plaintiff win vs. lose, and so on.   It turns out that by this method, you can predict Clarence Thomas&#8217; vote (who rarely speaks) to a high degree of accuracy.</p>
<p>We didn&#8217;t seem to get to the other technique Hawes&#8217; used, but we had to change rooms.</p>
<p><strong>Sentiment analysis</strong></p>
<p>We didn&#8217;t end up changing rooms.</p>
<p>This part of the seminar we briefly touch on the basics of sentiment analysis, particular with reference to Bing Liu&#8217;s recent review article.  So we begin with a discussion of the general dimension and challenges of sentiment analysis, such as</p>
<ul>
<li>What do people think of _____?</li>
<li>What features/facets/topics matter?</li>
<li>Mixed and neutral sentiment.</li>
<li>The effect of comparatives.</li>
<li>How opinions change, and what influences this.</li>
<li>Covert opinion/spin vs. overt expressions.</li>
<li>The holders of opinion</li>
<li>Multilingual and cross-language issues.</li>
</ul>
<p>We then had a rather wide-ranging discussion of the different kinds of issues, including a detailed discussion on syntactic relationships within sentences that might relate opinion-holders to sentiments to opinion targets/objects. This discussion was so wide-ranging and yet very compressed that it is hard to represent it in a liveblog, but it covers issues that we will revisit in later sessions.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Criticisms of Lakoff's Theory of Metaphor]]></title>
<link>http://apperceptual.wordpress.com/2009/08/23/criticisms-of-lakoffs-theory-of-metaphor/</link>
<pubDate>Mon, 24 Aug 2009 02:37:02 +0000</pubDate>
<dc:creator>Peter Turney</dc:creator>
<guid>http://apperceptual.wordpress.com/2009/08/23/criticisms-of-lakoffs-theory-of-metaphor/</guid>
<description><![CDATA[Lakoff&#8217;s theory of metaphor has been both highly praised and highly criticized. My own thinkin]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><a title="http://en.wikipedia.org/wiki/George_Lakoff" href="http://en.wikipedia.org/wiki/George_Lakoff">Lakoff&#8217;s</a> theory of metaphor has been both highly praised and highly criticized. My own thinking about how the mind works has been greatly influenced by Lakoff&#8217;s books, yet I also agree with much of what his critics say. I would like to make a case here that his books are worth reading, although much of the criticism is correct.</p>
<p><!--more--></p>
<p>These are the books by Lakoff and various co-authors that I have read:</p>
<ul>
<li><em><a title="http://www.amazon.com/Metaphors-We-Live-George-Lakoff/dp/0226468011/" href="http://www.amazon.com/Metaphors-We-Live-George-Lakoff/dp/0226468011/">Metaphors We Live By</a></em>, Lakoff and <a title="http://en.wikipedia.org/wiki/Mark_Johnson_(professor)" href="http://en.wikipedia.org/wiki/Mark_Johnson_(professor)">Johnson</a></li>
<li><em><a title="http://www.amazon.com/Women-Dangerous-Things-George-Lakoff/dp/0226468046/" href="http://www.amazon.com/Women-Dangerous-Things-George-Lakoff/dp/0226468046/">Women, Fire, and Dangerous Things</a></em>, Lakoff</li>
<li><em><a title="http://www.amazon.com/Where-Mathematics-Comes-Embodied-Brings/dp/0465037712/ " href="http://www.amazon.com/Where-Mathematics-Comes-Embodied-Brings/dp/0465037712/">Where Mathematics Comes From</a></em>, Lakoff and <a title="http://en.wikipedia.org/wiki/Rafael_E._N%C3%BA%C3%B1ez" href="http://en.wikipedia.org/wiki/Rafael_E._N%C3%BA%C3%B1ez">Nuñez</a></li>
<li><em><a title="http://www.amazon.com/Philosophy-Flesh-Embodied-Challenge-Western/dp/0465056741/" href="http://www.amazon.com/Philosophy-Flesh-Embodied-Challenge-Western/dp/0465056741/">Philosophy in the Flesh</a></em>, Lakoff and <a title="http://en.wikipedia.org/wiki/Mark_Johnson_(professor)" href="http://en.wikipedia.org/wiki/Mark_Johnson_(professor)">Johnson</a></li>
<li><em><a title="http://www.amazon.com/More-than-Cool-Reason-Metaphor/dp/0226468127/" href="http://www.amazon.com/More-than-Cool-Reason-Metaphor/dp/0226468127/">More than Cool Reason</a></em>, Lakoff and <a title="http://en.wikipedia.org/wiki/Mark_Turner_(cognitive_scientist)" href="http://en.wikipedia.org/wiki/Mark_Turner_(cognitive_scientist)">Turner</a></li>
</ul>
<p>I enthusiastically recommend all five of these books to anybody who has a strong interest in either cognition or language.</p>
<p>Below are excerpts from four opinions of the work of Lakoff and co-authors. After each excerpt, I give my reaction to the opinion.</p>
<blockquote><p>I <em>dislike</em> Lakoff&#8217;s theory of metaphor, especially when compared to Sperber and Wilson&#8217;s. His work is deeply, purely speculative, which is fine, but he seems not to care at all about experimental controls, or even admit they&#8217;re an issue, which is not. And his manner of writing is most unpleasant, principally because it&#8217;s so grandiose (he&#8217;s overthrowing the whole tradition of western thought) and so dogmatic (e.g., he often writes &#8220;cognitive science has shown that&#8221; when what he means is &#8220;as I have often claimed, and many others vehemently denied&#8221;). I half suspect, given the subjects Lakoff is writing on, that he&#8217;s deliberately positioning himself to be the 21st-century version of Freud, the man who provides a educated non-specialists with a scientific-sounding vocabulary for mental life. But I hasten to add that (a) I really have no evidence for that, and (b) Lakoff is in person polite, affable and well-spoken. Still, it&#8217;s very hard for me to force myself all the way through one of his books. — <a title="http://en.wikipedia.org/wiki/Cosma_Shalizi" href="http://en.wikipedia.org/wiki/Cosma_Shalizi">Cosma Rohilla Shalizi</a>, <a title="http://www.cscs.umich.edu/~crshalizi/notabene/analogy.html" href="http://www.cscs.umich.edu/~crshalizi/notabene/analogy.html">Analogy and Metaphor</a></p></blockquote>
<p>It is true that Lakoff does not seem interested in experimental controls, but other researchers have been working to address this problem. See Chapter 6 of <em>Philosophy in the Flesh</em> for many references. It is also true that Lakoff is often grandiose and dogmatic, and he neglects to cite much related work, such as the work of <a title="http://groups.psych.northwestern.edu/gentner/index.htm" href="http://groups.psych.northwestern.edu/gentner/index.htm">Dedre Gentner</a> and her colleagues. However, his basic claims, stripped of their abrasive presentation, are insightful and fruitful. If you can focus on the ideas and ignore the style of presentation, there is much to learn from Lakoff&#8217;s books.</p>
<blockquote><p>No one has a problem with the idea that the lens of an eye and the lens of a telescope are two instances of the general category &#8220;lens,&#8221; rather than the telescope being a &#8220;metaphor&#8221; for the eye. Nor is there anything metaphorical going on when we refer to &#8220;the genetic code&#8221;: a code by now is an information-theoretic term for a mapping scheme, and it subsumes cryptograms and DNA as special cases. But do cognitive psychologists use the computer as a &#8220;metaphor&#8221; for the mind, or (as I believe) can it be said that the mind <em>literally</em> engages in computation, and that the human mind and commercial digital computers are two exemplars of the category &#8220;computational system&#8221;?</p>
<p>So the ubiquity of metaphor in language does not mean that all thought is grounded in bodily experience, nor that all ideas are merely rival frames rather than verifiable propositions. Conceptual metaphors can be learned and used only if they are analyzed into more abstract elements like &#8220;cause,&#8221; &#8220;goal,&#8221;, and &#8220;change,&#8221; which make up the real currency of thought. And the methodical use of metaphor in science shows that metaphor is a way of adapting language to reality, not the other way around, and that it can capture genuine laws in the world, not just project comfortable images onto it.</p>
<p>[...]</p>
<p>Though metaphors are omnipresent in language, many of them are effectively dead in the minds of today&#8217;s speakers, and the living ones could never be learned, understood, or used as a reasoning tool unless they were built out of more abstract concepts that capture the similarities and differences between the symbol and the symbolized. For this reason, conceptual metaphors do not render truth and objectivity obsolete, nor do they reduce philosophical, legal, and political discourse to a beauty contest between rival frames.</p>
<p>Still, I think that metaphor really is a key to explaining thought and language. The human mind comes equipped with an ability to penetrate the cladding of sensory appearance and discern the abstract construction underneath — not always on demand, and not infallibly, but often enough and insightfully enough to shape the human condition. Our powers of analogy allow us to apply ancient neural structures to newfound subject matter, to discover hidden laws and systems in nature, and not least, to amplify the expressive power of language itself. — <a title="http://en.wikipedia.org/wiki/Steven_Pinker" href="http://en.wikipedia.org/wiki/Steven_Pinker">Steven Pinker</a>, <em><a title="http://www.amazon.com/Stuff-Thought-Language-Window-Nature/dp/0143114247/" href="http://www.amazon.com/Stuff-Thought-Language-Window-Nature/dp/0143114247/">The Stuff of Thought</a></em></p></blockquote>
<p>Pinker agrees that metaphor is important, but he believes that abstraction is more important. He claims that abstraction captures the similarities and differences that are the basis of metaphor. However, Pinker neglects to explain how abstraction works. I believe that metaphor (more precisely, <a title="http://apperceptual.wordpress.com/2007/12/20/readings-in-analogy-making/" href="http://apperceptual.wordpress.com/2007/12/20/readings-in-analogy-making/">analogy-making</a>) is the basis for abstraction. First we observe an analogy, such as the analogy between water waves and sound waves, and then we form an abstraction, <a title="http://en.wikipedia.org/wiki/Wave_theory" href="http://en.wikipedia.org/wiki/Wave_theory">wave theory</a>, which includes both types of waves. Later we add light waves. Abstraction does not subsume metaphor, because analogy precedes abstraction and is the basis for abstraction.</p>
<blockquote><p>Whereas the authors devote 44 pages to Chomsky, they cover all of &#8220;Anglo-American analytic philosophy&#8221; in 29 pages, while lumping together Frege, Russell, Carnap and the Vienna Circle, Quine, Goodman, Davidson, Putnam, Kripke, Montague, and Lewis. In the same chapter, they continue with ordinary language philosophy (Strawson, Austin, and the later Wittgenstein), which they consider to be based on the same metaphors. Yet these philosophers have expressed widely divergent views on the embodiment of mind, the nature of language, and Chomsky&#8217;s theory of autonomous syntax. By drawing finer distinctions, the authors might have claimed some of them as potential allies against Chomsky&#8217;s position. — <a title="http://www.jfsowa.com/" href="http://www.jfsowa.com/">John Sowa</a>, <a title="http://www.jfsowa.com/pubs/lakoff.htm" href="http://www.jfsowa.com/pubs/lakoff.htm">Review of <em>Philosophy in the Flesh</em></a></p></blockquote>
<p>Like Shalizi, Sowa criticizes Lakoff&#8217;s presentation. Yes, Lakoff distorts the work of others and exaggerates the novelty of his own work. Yes, his ideas could be presented more clearly, more fairly, and with better discussion of related work. Nonetheless, if you can ignore these problems with presentation, you will find many interesting ideas in the work of Lakoff and his co-authors.</p>
<blockquote><p>Lakoff and Nuñez have a very interesting general framework for approaching their topic: mathematics arises by the extension of innate human capacities (e.g. subitization) or basic universals of human experience (spatial and motor experience), and the means of extension is cognitive metaphors which preserve the basic inferential structure of the source domain. The first few chapters provide a plausible sounding, perhaps workable account of arithmetic, simple logic and set theory, but one that they should have developed in far more detail (e.g. their account of intersection in terms of container schemas is criminally underexplained).</p>
<p>[...]</p>
<p>In short, they have very interesting ideas, but the mass of technical vagueness and blunders, plus the big strawman that is their &#8220;philosophical&#8221; argument, suggests that they are more interested in passing off as intellectual revolutionaries among the pop-science book audience than in contributing to our understanding of the topic. — Idiosyncrat, <a title="http://www.amazon.com/review/R2U23Q5D5D7MY3/ref=cm_cr_pr_viewpnt#R2U23Q5D5D7MY3" href="http://www.amazon.com/review/R2U23Q5D5D7MY3/ref=cm_cr_pr_viewpnt#R2U23Q5D5D7MY3">Review of <em>Where Mathematics Comes From</em></a></p></blockquote>
<p>The final sentence of this review serves as a good summary: the ideas are very interesting, but the presentation of the ideas is impaired by a desire to seem revolutionary, at the cost of fairness, balance, rigour, and careful discussion of related work. For me, ultimately, ideas trump presentation. Until there is a better source for these ideas, I recommend Lakoff&#8217;s books.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Lazyfeed: the missing link in the evolution of RSS?]]></title>
<link>http://mendicantbug.com/2009/08/01/lazyfeed-the-missing-link-in-the-evolution-of-rss/</link>
<pubDate>Sat, 01 Aug 2009 13:00:09 +0000</pubDate>
<dc:creator>Jason Adams</dc:creator>
<guid>http://mendicantbug.com/2009/08/01/lazyfeed-the-missing-link-in-the-evolution-of-rss/</guid>
<description><![CDATA[When Lazyfeed announced a limited round of beta invites on TechCrunch, I admit, I lusted after them.]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p style="text-align:justify;">When <a href="http://www.lazyfeed.com" target="_blank">Lazyfeed</a> announced a limited round of beta invites on <a href="http://www.techcrunch.com/2009/07/10/lazyfeed-wakes-up-%E2%80%94-and-we-have-invites/" target="_blank">TechCrunch</a>, I admit, I lusted after them.  Only 250?  I wanted to be one!  But alas, I was put on the waiting list.  It&#8217;s a decent marketing strategy for building up some hype.  When I finally did get my invite, I tried them out for about 5 minutes and fell prey to the distractions of the internet.  That was a bad sign, though.  Usually a new service can hold my attention for a little while longer.  So what happened?</p>
<h3>Lazy-wha?</h3>
<p style="text-align:justify;">Lazyfeed is a service that lets you enter topics, blogs, twitter, delicious and flickr accounts to form a live streaming <em>lazyfeed</em>.  You then get live updates in the form of your tags being updated.  Your main screen consists of a bunch of boxes with your topics and then things it guesses are related.</p>
<h3>The hook</h3>
<p style="text-align:justify;">Lazyfeed&#8217;s marketing strategy succeeded again by giving me three invites to hand out to friends.  I offered them on <a class="zem_slink" title="Twitter" rel="homepage" href="http://twitter.com">Twitter</a>, having only one person bite.  So here are the other two invites for the adventurous.  Get em while they&#8217;re hot.  If you manage to take one, please comment that you did so, so that I can at least know who you were and we can save someone else the wasted time.  I&#8217;m just throwing them into the ether like this because I don&#8217;t feel like pushing them on Twitter again.</p>
<blockquote><p><span style="text-decoration:line-through;">NTI1MzMxMjc5ZVhmUTl5cDBiek1R<br />
</span><span style="text-decoration:line-through;"> OTk5MTUwNjczN3JCLklmZHhjMDdV</span></p></blockquote>
<h3>The rub</h3>
<p style="text-align:justify;">Lazyfeed is a lovely service in terms of appearance and ajaxy goodness, but my initial impression is that it ends up being streaming <a class="zem_slink" title="Information overload" rel="wikipedia" href="http://en.wikipedia.org/wiki/Information_overload">information overload</a>.  For one, the topic suggestion feature appears to be fairly naive.  Someone correct me if I&#8217;m wrong, but it looks a bit like document similarity for topics is done purely by one-for-one matching on tags.  Whatever the method, the result of their suggested topics (&#8220;Stuff for Lazy Jason&#8221;) is stuff like the following:</p>
<div id="attachment_1241" class="wp-caption aligncenter" style="width: 510px"><a href="http://ealdent.wordpress.com/files/2009/08/picture-1.png"><img class="size-full wp-image-1241" title="Lazyfeed sample results" src="http://ealdent.wordpress.com/files/2009/08/picture-1.png" alt="Lazyfeed sample results" width="500" height="287" /></a><p class="wp-caption-text">Lazyfeed sample suggested topics</p></div>
<p style="text-align:justify;">Granted, it&#8217;s a hard problem, but those results are pretty bad.  So as I started to write this post lambasting this service, I considered that maybe I was just seeing cold-start problems, and I was being unfair.  So I trained it with some additional feeds and topics that are straight-to-the-point of stuff I&#8217;m interested in, like <em>sigir2009</em>, <em>topicmodeling</em>, <em>recommendersystems</em>, etc.  Tags can contain no spaces, btw, which is why those don&#8217;t.  When I tried using dashes, like I often do on delicious, it gives no results.  I also removed some things that were too general or contained too many spurious results.</p>
<h3>The light</h3>
<p style="text-align:justify;">Things started improving here, and I actually began to understand what the point of Lazyfeed is.  My initial confusion was that &#8220;Stuff for Lazy Jason&#8221; is stuff that I would want to read right now.  Being lazy, I didn&#8217;t expect to have to do work to get those things.  But &#8220;Stuff for Lazy Jason&#8221; is a list of topics it thinks I might be interested in.  Saving any one of those puts it into my lazyfeed, which is in the bar on the left.</p>
<div id="attachment_1242" class="wp-caption aligncenter" style="width: 510px"><a href="http://ealdent.wordpress.com/files/2009/08/picture-2.png"><img class="size-full wp-image-1242" title="My lazyfeed topics" src="http://ealdent.wordpress.com/files/2009/08/picture-2.png" alt="My lazyfeed topics" width="500" height="375" /></a><p class="wp-caption-text">My lazyfeed topics</p></div>
<p style="text-align:justify;">So now what happens is that occasionally it discovers something new related to my interests and it bumps that category to the top of the list and turns it bold again (grayed out topics have been read).  Most of my topics are low traffic, so add something like <em>mariahcarey</em> if you want to see this functionality in action.  Now we&#8217;re getting somewhere.  It has actually started being helpful and has found me some stuff that my Google alerts haven&#8217;t.  Which is weird, and is making me think I need to double check to make sure my Google alerts are working&#8230;</p>
<h3>The end</h3>
<p style="text-align:justify;">My takeaway after using Lazyfeed for nigh on two hours is that it&#8217;s an interesting alternative (or even extension) to <a class="zem_slink" title="RSS" rel="wikipedia" href="http://en.wikipedia.org/wiki/RSS">RSS</a>, but one that still hasn&#8217;t crossed the bridge to the next stage in evolution.  The idea is solid.  Automatically discover stuff in the sea of human knowledge (or human idiocy) and serve it up fresh.  The implementation lacks robust topic detection which is unfortunately going to be necessary unless it is to become another source of information overload rather than a useful stream of relevant information. Relevance is an ephemeral thing, given that your information needs change from day to day.  Lazyfeed makes it pretty easy to get rid of old topics and add new ones, even if some of their suggestions are still wonky.  It&#8217;s an interesting recommender system problem with a lot of potential.</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/403d8746-534f-461c-8872-df09ca6433c9/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_c.png?x-id=403d8746-534f-461c-8872-df09ca6433c9" alt="Reblog this post [with Zemanta]" /></a></div>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Updates to lda-ruby gem]]></title>
<link>http://mendicantbug.com/2009/07/30/updates-to-lda-ruby-gem/</link>
<pubDate>Fri, 31 Jul 2009 02:20:25 +0000</pubDate>
<dc:creator>Jason Adams</dc:creator>
<guid>http://mendicantbug.com/2009/07/30/updates-to-lda-ruby-gem/</guid>
<description><![CDATA[A while back I ported David Blei&#8217;s lda-c code for performing Latent Dirichlet Allocation to Ru]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p style="text-align:justify;"><a href="http://ealdent.wordpress.com/files/2009/07/works-on-my-machine-starburst.jpg"><img class="alignright size-full wp-image-1232" style="margin:3px 4px;" title="works-on-my-machine-starburst" src="http://ealdent.wordpress.com/files/2009/07/works-on-my-machine-starburst.jpg" alt="works-on-my-machine-starburst" width="150" height="145" /></a>A while back <a href="http://mendicantbug.com/2008/11/17/lda-in-ruby/" target="_self">I ported</a> David Blei&#8217;s <a href="http://www.cs.princeton.edu/~blei/lda-c/" target="_blank">lda-c</a> code for performing <a class="zem_slink" title="Latent Dirichlet allocation" rel="wikipedia" href="http://en.wikipedia.org/wiki/Latent_Dirichlet_allocation">Latent Dirichlet Allocation</a> to <a class="zem_slink" title="Ruby (programming language)" rel="homepage" href="http://www.ruby-lang.org/">Ruby</a>.  Basically I just wrapped the C methods in a Ruby class, turned it into <a href="http://github.com/ealdent/lda-ruby" target="_blank">a gem</a>, and called it a day.  The result was a bit ugly and unwieldy, like most research code.  A few months later, <a href="http://github.com/taf2" target="_blank">Todd Fisher</a> came along and discovered a couple bugs and memory leaks in the C code, for which I am very grateful.  I had been toying with the idea of improving the Ruby code, and embarked on a mission to do so.  The result is a hopefully much cleaner <a class="zem_slink" title="RubyGems" rel="homepage" href="http://docs.rubygems.org">gem</a> that can be used right out of the box with little screwing around.</p>
<p style="text-align:justify;">Unfortunately, I did something I&#8217;m ashamed of.  Ruby gems are notorious for breaking backwards compatibility, and I have done just that.  The good news is, your code will almost work, assuming you didn&#8217;t start diving into the Document and Corpus classes too heavily.  If you did, then you will probably experience a lot of breakage.  The result, I hope is a more sensical implementation, however, so maybe you won&#8217;t hate me.  Of course, I could be wrong and my implementation is still crap.  If that&#8217;s the case, please let me know what needs to be improved.</p>
<p>To install the gem:</p>
<p style="padding-left:30px;"><code>gem sources -a http://gems.github.com<br />
sudo gem install ealdent-lda-ruby</code></p>
<p>Enjoy!</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/f6bb550a-b4f1-4af6-bcde-fabdba98ad7a/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_c.png?x-id=f6bb550a-b4f1-4af6-bcde-fabdba98ad7a" alt="Reblog this post [with Zemanta]" /></a></div></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Meaning, Mapping, Panalogy, and Netflix]]></title>
<link>http://apperceptual.wordpress.com/2009/07/24/meaning-mapping-panalogy-and-netflix/</link>
<pubDate>Fri, 24 Jul 2009 15:15:36 +0000</pubDate>
<dc:creator>Peter Turney</dc:creator>
<guid>http://apperceptual.wordpress.com/2009/07/24/meaning-mapping-panalogy-and-netflix/</guid>
<description><![CDATA[&#8220;Never swallow anything whole. We live perforce by half-truths and get along fairly well as lo]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><blockquote><p>&#8220;Never swallow anything whole. We live perforce by half-truths and get along fairly well as long as we do not mistake them for whole-truths, but when we do so mistake them, they raise the devil with us.&#8221; —<em> <a title="http://en.wikipedia.org/wiki/Alfred_north_whitehead" href="http://en.wikipedia.org/wiki/Alfred_north_whitehead">Alfred North Whitehead</a>, <a title="http://www.amazon.com/Dialogues-Alfred-North-Whitehead-Nonpareil/dp/1567921299/" href="http://www.amazon.com/Dialogues-Alfred-North-Whitehead-Nonpareil/dp/1567921299/">Dialogues of Alfred North Whitehead</a></em></p></blockquote>
<p>There are <a title="http://singyourownlullaby.blogspot.com/2009/07/meaning-theories.html" href="http://singyourownlullaby.blogspot.com/2009/07/meaning-theories.html">dozens of theories about meaning</a>, but they share a common element: <a title="http://en.wikipedia.org/wiki/Meaning" href="http://en.wikipedia.org/wiki/Meaning">meaning</a> (<a title="http://en.wikipedia.org/wiki/Semantics" href="http://en.wikipedia.org/wiki/Semantics">semantics</a>) is about <a title="http://en.wikipedia.org/wiki/Conceptual_metaphor" href="http://en.wikipedia.org/wiki/Conceptual_metaphor">mapping</a>. We understand a thing, we give it meaning, by <a title="http://jair.org/papers/paper2693.html" href="http://jair.org/papers/paper2693.html">mapping</a> it to another thing. Furthermore, and this is a crucial point, <em>one mapping is not enough</em>. The more mappings we make, the better we understand. A single mapping only gives us part of the truth.</p>
<p><!--more--></p>
<p>We understand things (words, events, perceptions, people, signs) by relating (connecting, mapping) them to other things. This <a title="http://apperceptual.wordpress.com/2007/12/20/readings-in-analogy-making/" href="http://apperceptual.wordpress.com/2007/12/20/readings-in-analogy-making/">analogy-making</a> is how we understand both high-level ideas and concepts and low-level perceptions:</p>
<blockquote><p>&#8220;We have repeatedly seen how analogies and mappings give rise to secondary meanings that ride on the backs of primary meanings. We have seen that even primary meanings depend on unspoken mappings, and so in the end, we have seen that <em>all meaning is mapping-mediated</em>, which is to say, <em>all meaning comes from analogies</em>.&#8221; [emphasis added] — <em><a title="http://en.wikipedia.org/wiki/Douglas_Hofstadter" href="http://en.wikipedia.org/wiki/Douglas_Hofstadter">Douglas Hofstadter</a>, <a title="http://www.amazon.com/Am-Strange-Loop-Douglas-Hofstadter/dp/B001FA23HM/" href="http://www.amazon.com/Am-Strange-Loop-Douglas-Hofstadter/dp/B001FA23HM/">I am a Strange Loop<br />
</a></em></p></blockquote>
<p>But one analogy is not enough. No analogy is perfect, and we compensate for their imperfections by using multiple analogies, and by <a title="http://en.wikipedia.org/wiki/Conceptual_blending" href="http://en.wikipedia.org/wiki/Conceptual_blending">blending</a> analogies together. <a title="http://en.wikipedia.org/wiki/Marvin_minsky" href="http://en.wikipedia.org/wiki/Marvin_minsky">Marvin Minsky</a> calls this <em>panalogy</em> (parallel analogy):</p>
<blockquote><p>&#8220;If you &#8216;understand&#8217; something in only one way, then you scarcely understand it at all—because when you get stuck, you&#8217;ll have nowhere to go. But if you represent something in several ways, then when you get frustrated enough, you can switch among different points of view, until you find one that works for you!&#8221; — <em><a title="http://en.wikipedia.org/wiki/Marvin_minsky" href="http://en.wikipedia.org/wiki/Marvin_minsky">Marvin Minsky</a>, <a title="http://www.amazon.com/Emotion-Machine-Commonsense-Artificial-Intelligence/dp/0743276647" href="http://www.amazon.com/Emotion-Machine-Commonsense-Artificial-Intelligence/dp/0743276647">The Emotion Machine</a></em></p></blockquote>
<p>If you want to learn more about how we blend analogies, I highly recommend <a title="http://www.amazon.com/Way-We-Think-Conceptual-Complexities/dp/0465087868/" href="http://www.amazon.com/Way-We-Think-Conceptual-Complexities/dp/0465087868/">The Way We Think: Conceptual Blending and the Mind&#8217;s Hidden Complexities</a>, by <a title="http://en.wikipedia.org/wiki/Gilles_Fauconnier" href="http://en.wikipedia.org/wiki/Gilles_Fauconnier">Gilles Fauconnier</a> and <a title="http://en.wikipedia.org/wiki/Mark_Turner_(cognitive_scientist)" href="http://en.wikipedia.org/wiki/Mark_Turner_(cognitive_scientist)">Mark Turner</a>.</p>
<p>Perhaps the most important lesson from the <a title="http://en.wikipedia.org/wiki/Netflix_prize" href="http://en.wikipedia.org/wiki/Netflix_prize">Netflix Prize</a> has been that many models are better than one:</p>
<ul>
<li>&#8220;the biggest lesson for ML from the Netflix contest has been the formidable performance edge of <a title="http://en.wikipedia.org/wiki/Machine_learning_ensemble" href="http://en.wikipedia.org/wiki/Machine_learning_ensemble">ensemble methods</a>&#8221; — <em>John Langford, <a title="http://hunch.net/?p=827" href="http://hunch.net/?p=827">Netflix nearly done</a></em></li>
<li>&#8220;We have learned that ensemble methods are the solution for more accuracy.&#8221; — <em>Daniel Lemire, <a title="http://www.daniel-lemire.com/blog/archives/2009/07/09/after-netflix-what-next/" href="http://www.daniel-lemire.com/blog/archives/2009/07/09/after-netflix-what-next/">After Netflix? What next?</a></em></li>
</ul>
<p>I view this as confirmation of the Panalogy Principle: we don&#8217;t understand anything until we understand it many ways. There are no whole-truths, but we can get by reasonably well with a large number of half-truths.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Simple Text Analysis]]></title>
<link>http://pythonism.wordpress.com/2009/07/20/simple-text-analysis/</link>
<pubDate>Mon, 20 Jul 2009 18:25:57 +0000</pubDate>
<dc:creator>pythonisms</dc:creator>
<guid>http://pythonism.wordpress.com/2009/07/20/simple-text-analysis/</guid>
<description><![CDATA[An easy start to coding and thinking about NLP &#8211; Natural Language Processing or Computational ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>An easy start to coding and thinking about NLP &#8211; Natural Language Processing or Computational Linguistics is to try and obtain things like vocabulary, frequencies of words etc from a text. One grand project is a text fingerprinting system that could derive a numerical index to characterise a given writer&#8217;s style.</p>
<p>One of the questions I posed about this was&#8230; &#8220;exactly how much information can you get from a text without ever having to interpret what it is actually talking about ?&#8221; I have likened this to trying to learn about a given animal by liquidising it completely and then analysing the liquid obtained.</p>
<p>Actual semantic understanding of a text is a genuine Hard Problem of AI. We can&#8217;t go that far yet,  but there are still a lot of useful things simple code can do. Here I wanted to find the number of sentences in a text, I was using Dracula by Bram Stoker.</p>
<p><code>def&#160;sentfull(a):<br />
&#160;&#160;&#160;&#160;"""&#160;takes&#160;a&#160;text&#160;and&#160;returns<br />
&#160;&#160;&#160;&#160;&#160;list&#160;where&#160;sentences&#160;have&#160;been&#160;split<br />
&#160;&#160;&#160;&#160;&#160;&#160;with&#160;full&#160;stops"""<br />
&#160;&#160;&#160;&#160;<br />
&#160;&#160;&#160;&#160;b=open(a)<br />
&#160;&#160;&#160;&#160;c=[]<br />
&#160;&#160;&#160;&#160;for&#160;x&#160;in&#160;b.read().split('.'):<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;c.append(x)<br />
&#160;&#160;&#160;&#160;return&#160;c<br />
</code><code><br />
def&#160;sentques(b):<br />
&#160;&#160;&#160;&#160;"""&#160;takes&#160;list&#160;of&#160;strings&#160;and&#160;returns&#160;<br />
&#160;&#160;&#160;&#160;list&#160;where&#160;sentences&#160;have&#160;been&#160;split&#160;<br />
&#160;&#160;&#160;&#160;with&#160;question&#160;marks"""<br />
&#160;&#160;&#160;&#160;for&#160;a&#160;in&#160;range(len(b)):<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;if&#160;'?'&#160;in&#160;b[a]:<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;c=b[a].split('?')<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;b.remove(b[a])<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;for&#160;x&#160;in&#160;c:<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;b.append(x)<br />
&#160;&#160;&#160;&#160;return&#160;b<br />
</code><code><br />
def&#160;sentexcl(b):<br />
&#160;&#160;&#160;&#160;"""&#160;takes&#160;list&#160;of&#160;strings&#160;and&#160;returns&#160;<br />
&#160;&#160;&#160;&#160;list&#160;where&#160;sentences&#160;have&#160;been&#160;split&#160;<br />
&#160;&#160;&#160;&#160;with&#160;exclamation&#160;marks"""<br />
&#160;&#160;&#160;&#160;for&#160;a&#160;in&#160;range(len(b)):<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;if&#160;'!'&#160;in&#160;b[a]:<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;c=b[a].split('!')<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;b.remove(b[a])<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;for&#160;x&#160;in&#160;c:<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;b.append(x)<br />
&#160;&#160;&#160;&#160;return&#160;b<br />
&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;<br />
</code><code><br />
def&#160;sentence(a):<br />
&#160;&#160;&#160;&#160;"""&#160;takes&#160;list&#160;of&#160;strings&#160;and&#160;returns&#160;list<br />
&#160;&#160;&#160;&#160;&#160;where&#160;sentences&#160;have&#160;been&#160;separated"""<br />
&#160;&#160;&#160;&#160;return&#160;sentexcl(sentques(sentfull(a)))<br />
</code><code><br />
print&#160;(sentence('dracula.txt'))<br />
</code></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Porting the UEA-Lite Stemmer to Ruby]]></title>
<link>http://mendicantbug.com/2009/07/16/porting-the-uea-lite-stemmer-to-ruby/</link>
<pubDate>Thu, 16 Jul 2009 11:00:44 +0000</pubDate>
<dc:creator>Jason Adams</dc:creator>
<guid>http://mendicantbug.com/2009/07/16/porting-the-uea-lite-stemmer-to-ruby/</guid>
<description><![CDATA[A twitter friend (@communicating) tipped me off to the UEA-Lite Stemmer by Marie-Claire Jenkins and ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p style="text-align:justify;">A twitter friend (<a href="http://twitter.com/communicating" target="_blank">@communicating</a>) tipped me off to the <a href="http://www.uea.ac.uk/cmp/research/graphicsvisionspeech/speech/WordStemming" target="_blank">UEA-Lite Stemmer</a> by <a href="http://www.scienceforseo.com/" target="_blank">Marie-Claire Jenkins</a> and Dan J. Smith.  Stemmers are <a class="zem_slink" title="Natural language processing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Natural_language_processing">NLP</a> tools that get rid of inflectional and derivational <a class="zem_slink" title="Affix" rel="wikipedia" href="http://en.wikipedia.org/wiki/Affix">affixes</a> from words.  In English, that usually means getting rid of the plural <em>-s</em>, progressive <em>-ing</em>, and preterite <em>-ed</em>.  Depending on the type of <a class="zem_slink" title="Stemming" rel="wikipedia" href="http://en.wikipedia.org/wiki/Stemming">stemmer</a>, that might also mean getting rid of derivational <a class="zem_slink" title="Suffix" rel="wikipedia" href="http://en.wikipedia.org/wiki/Suffix">suffixes</a> like <em>-ful</em> and <em>-ness</em>.  Sometimes it&#8217;s useful to be able to reduce words like <em>consolation</em> and <em>console</em> to the same root form: <em>consol</em>.  But sometimes that doesn&#8217;t make sense.  If you&#8217;re searching for video game consoles, you don&#8217;t want to find documents about consolation.  In this case, you need a conservative stemmer.</p>
<p style="text-align:justify;">The UEA-Lite Stemmer is a rule-based, conservative stemmer that handles regular words, proper nouns and acronyms.  It was originally written in <a class="zem_slink" title="Perl" rel="homepage" href="http://www.perl.org/">Perl</a>, but had been ported to <a class="zem_slink" title="Java (software platform)" rel="homepage" href="http://java.sun.com">Java</a>.  Since I usually code in Ruby these days, I thought it&#8217;d be nice to make it available to the Ruby community, so I ported it over last night.</p>
<p style="text-align:justify;">The code is open source under the <a class="zem_slink" title="Apache License" rel="wikipedia" href="http://en.wikipedia.org/wiki/Apache_License">Apache 2 License</a> and hosted on <a href="http://github.com/ealdent/uea-stemmer/tree/master" target="_blank">github</a>.  So please check out the code and let me know what you think.  Heck, you can even fork the project and make some improvements yourself if you want.</p>
<p style="text-align:justify;">One direction I&#8217;d like to be able to go is to turn all of the rules into <a href="http://en.wikipedia.org/wiki/Finite_state_transducer" target="_blank">finite state transducers</a>, which can be composed into a single large deterministic finite state transducer.  That would be a lot more efficient (and even fun!), but Ruby lacks a decent FST implementation.</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/ac71b162-56e6-4cbb-b1ab-10ee9055ed7b/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_c.png?x-id=ac71b162-56e6-4cbb-b1ab-10ee9055ed7b" alt="Reblog this post [with Zemanta]" /></a></div>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[OpenCog]]></title>
<link>http://doppelleben.wordpress.com/2009/07/08/opencog/</link>
<pubDate>Wed, 08 Jul 2009 10:20:56 +0000</pubDate>
<dc:creator>Tom</dc:creator>
<guid>http://doppelleben.wordpress.com/2009/07/08/opencog/</guid>
<description><![CDATA[I just discovered this cool blogpost on OpenCog. It is a very interesting blog by someone who has bo]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I just discovered <a href="http://opencog.wordpress.com/2009/01/12/determining-word-senses-from-grammatical-usage/">this cool blogpost</a> on OpenCog. It is a very interesting blog by someone who has both time and talent for nice experiments (and he seems to have the needed hardware as well&#8230;). Although he is not directly in my line of research, which is quantitative sociolinguistics, he touches with my fondness for computational linguistis, statistics and lexical semantics via NLP.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Symbol Grounding and Proportional Analogy]]></title>
<link>http://apperceptual.wordpress.com/2009/06/28/symbol-grounding-and-proportional-analogy/</link>
<pubDate>Sun, 28 Jun 2009 15:33:02 +0000</pubDate>
<dc:creator>Peter Turney</dc:creator>
<guid>http://apperceptual.wordpress.com/2009/06/28/symbol-grounding-and-proportional-analogy/</guid>
<description><![CDATA[If symbols must be grounded in perception, how does this grounding happen? How do we learn to create]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>If <a title="http://apperceptual.wordpress.com/2007/05/04/the-symbol-grounding-problem/" href="http://apperceptual.wordpress.com/2007/05/04/the-symbol-grounding-problem/">symbols must be grounded in perception</a>, how does this <a title="http://apperceptual.wordpress.com/2008/07/13/grounding-meaning-composition-versus-abstraction/" href="http://apperceptual.wordpress.com/2008/07/13/grounding-meaning-composition-versus-abstraction/">grounding happen</a>? How do we learn to create mappings between language and perception? For example, how does the word &#8220;rabbit&#8221; get tied to the perception (visual, tactile, whatever) of a rabbit? AI algorithms for <a title="http://www.jacobsschool.ucsd.edu/news/news_releases/release.sfe?id=650" href="http://www.jacobsschool.ucsd.edu/news/news_releases/release.sfe?id=650">assigning textual labels to photographs</a> are not yet able to approach human performance on this task. The problem is somewhat similar to <a title="http://en.wikipedia.org/wiki/Statistical_machine_translation" href="http://en.wikipedia.org/wiki/Statistical_machine_translation">statistical machine translation</a>, which exploits <a title="http://en.wikipedia.org/wiki/Parallel_corpora" href="http://en.wikipedia.org/wiki/Parallel_corpora">parallel corpora</a> to learn mappings between two different languages, although the difference between text and photographs is more extreme than the difference between any two written languages. Perhaps ideas from statistical machine translation are applicable to symbol grounding. The <a title="http://dx.doi.org/10.1007/s10590-006-9010-x" href="http://dx.doi.org/10.1007/s10590-006-9010-x">translation algorithm</a> of <a title="http://users.info.unicaen.fr/~ylepage/" href="http://users.info.unicaen.fr/~ylepage/">Lepage</a> and <a title="http://www.slt.atr.jp/~edenoual/" href="http://www.slt.atr.jp/~edenoual/">Denoual</a>, based on <a title="http://apperceptual.wordpress.com/2008/12/24/beyond-proportional-analogy/" href="http://apperceptual.wordpress.com/2008/12/24/beyond-proportional-analogy/">proportional analogy</a>, seems particularly appropriate, since it makes minimal assumptions about the structures of the languages.</p>
<p><!--more--></p>
<p>There is a view that the meaning of words must be grounded in perception:</p>
<ul>
<li><a title="http://psych.wisc.edu/glenberg/jml_g&#38;r.html" href="http://psych.wisc.edu/glenberg/jml_g&#38;r.html">Symbol grounding and meaning: A comparison of high-dimensional and embodied theories of meaning</a></li>
<li><a title="http://www.bbsonline.org/documents/a/00/00/04/29/bbs00000429-00/bbs.barsalou.html" href="http://www.bbsonline.org/documents/a/00/00/04/29/bbs00000429-00/bbs.barsalou.html">Perceptual Symbol Systems</a></li>
</ul>
<p>But how do we connect a word with a perception? <a title="http://en.wikipedia.org/wiki/Willard_Van_Orman_Quine" href="http://en.wikipedia.org/wiki/Willard_Van_Orman_Quine">Quine</a> gives the <a title="http://en.wikipedia.org/wiki/Indeterminacy_of_translation" href="http://en.wikipedia.org/wiki/Indeterminacy_of_translation">following example</a>: suppose we hear a person say &#8220;gavagai&#8221; in the presence of a rabbit. How do we know that &#8220;gavagai&#8221; means &#8220;rabbit&#8221;? Other possible interpretations of &#8220;gavagai&#8221; are &#8220;Lo, food&#8221;, &#8220;Let&#8217;s go hunting&#8221;, and &#8220;There will be a storm tonight&#8221;. The possibilities are endless.</p>
<p>It is interesting that Quine links the <a title="http://cogprints.org/3106/" href="http://cogprints.org/3106/">symbol grounding problem</a> to the problem of translation. In Quine&#8217;s thought experiment, a linguist must translate &#8220;gavagai&#8221; into English. Perhaps recent progress with machine translation algorithms is applicable to the symbol grounding problem?</p>
<p><a title="http://hal.archives-ouvertes.fr/docs/00/26/09/94/PDF/purest_ever_lepage.pdf" href="http://hal.archives-ouvertes.fr/docs/00/26/09/94/PDF/purest_ever_lepage.pdf">Lepage and Denoual</a> use proportional analogies to derive translations from <a title="http://en.wikipedia.org/wiki/Parallel_corpora" href="http://en.wikipedia.org/wiki/Parallel_corpora">parallel corpora</a>. A <a title="http://apperceptual.wordpress.com/2008/06/21/the-logic-of-attributional-and-relational-similarity/" href="../2008/06/21/the-logic-of-attributional-and-relational-similarity/">proportional analogy</a> has the form <em>A</em>:<em>B</em>::<em>C</em>:<em>D</em>, meaning “<em>A</em> is to <em>B</em> as <em>C</em> is to <em>D</em>“. For example, quart:volume::mile:distance means “quart is to volume as mile is to distance”. Consider the following proportional analogy:</p>
<p style="padding-left:30px;"><em>A</em> = &#8220;This is a rabbit.&#8221;</p>
<p style="padding-left:30px;"><em>B</em> = &#8220;This is a fox.&#8221;</p>
<p style="padding-left:30px;"><em>C</em> = <img class="size-full wp-image-425 alignnone" title="rabbit" src="http://apperceptual.wordpress.com/files/2009/06/rabbit.jpg" alt="rabbit" width="58" height="58" /></p>
<p style="padding-left:30px;"><em>D</em> = <img class="alignnone size-full wp-image-424" title="fox" src="http://apperceptual.wordpress.com/files/2009/06/fox.jpg" alt="fox" width="79" height="52" /></p>
<p>The analogy <em>A</em>:<em>B</em>::<em>C</em>:<em>D</em> helps us to map &#8220;rabbit&#8221; to the image of the rabbit, whereas <em>A</em> and<em> C</em> alone leave the mapping <a title="http://en.wikipedia.org/wiki/Indeterminacy_of_translation" href="http://en.wikipedia.org/wiki/Indeterminacy_of_translation">indeterminate</a>. Comparing <em>A</em> and <em>B</em>, we see the shared structure &#8220;This is a <em>X</em>.&#8221; and we note that <em>X</em> = &#8220;rabbit&#8221; for <em>A</em> and <em>X</em> = &#8220;fox&#8221; for <em>B</em>. Likewise, comparing <em>C</em> and <em>D</em>, we see the shared backgrounds and note the differing foregrounds. This helps us map the foreground of <em>C</em> to &#8220;rabbit&#8221; and the foreground of <em>D</em> to &#8220;fox&#8221;. Given only <em>A</em> and <em>C</em>, we have no reason to pick out &#8220;rabbit&#8221; from the sentence &#8220;This is a rabbit&#8221; in <em>A</em> and we have no reason to pick out the foreground rabbit from the background grass in<em> C</em>.</p>
<p>The core idea here is that we do not ground symbols in perceptions by noting correlations between symbols and perceptions; rather, we note <em>meta</em> correlations between <em>relations</em> between symbols (e.g., the relation between <em>A</em> and <em>B</em>) and <em>relations</em> between perceptions (e.g., the relation between <em>C</em> and <em>D</em>).</p>
<blockquote><p>The ability to consider differences between differences is important because it lies at the heart of our abilities to solve new problems. This is because these “second-order-differences” are what we use to remind ourselves of other problems we already know how to solve. Sometimes this is called “reasoning by analogy” and is considered to be an exotic or unusual way to solve problems. But in my view, it’s our most ordinary way of doing things.<br />
— <a title="http://apperceptual.wordpress.com/quotes-analogy/" href="http://apperceptual.wordpress.com/quotes-analogy/"><em>Marvin Minsky, The Society of Mind, 1988</em></a></p></blockquote>
<p>Related work: <a href="http://web.media.mit.edu/~dkroy/">Deb Roy</a>.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[direction]]></title>
<link>http://efidetum.wordpress.com/2009/06/27/direction/</link>
<pubDate>Sat, 27 Jun 2009 15:43:01 +0000</pubDate>
<dc:creator>efidetum</dc:creator>
<guid>http://efidetum.wordpress.com/2009/06/27/direction/</guid>
<description><![CDATA[It is common to derive vocabulary from the usage at, for instance, Twitter. This only leads to the d]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>It is common to derive vocabulary from the usage at, for instance, Twitter. This only leads to the defintion of an artificial language, not identical to any kind of English. In the beginning, there should be &#8230;.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[SemLab's Semantic Search]]></title>
<link>http://semlab.wordpress.com/2009/06/11/semlabs-semantic-search/</link>
<pubDate>Thu, 11 Jun 2009 06:46:47 +0000</pubDate>
<dc:creator>bbmv</dc:creator>
<guid>http://semlab.wordpress.com/2009/06/11/semlabs-semantic-search/</guid>
<description><![CDATA[The SemLab semantic search platform is based on our Vicore™ data processing platform which, among ot]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>The SemLab semantic search platform is based on our Vicore™ data processing platform which, among others, is the basis of our leading semantic news analysis solution ViewerPro™.</p>
<p>The purpose of semantic search is to improve standard keyword-based search technologies by adding domain dependent, a-priory knowledge about the semantics of the concepts entailed in the documents that are analysed. This a-priory knowledge is contained in ontologies: formal descriptions of the concepts that exist in the domain of application. In general the ontologies play a pivotal role in a semantic enabled technology and have a profound effect on its overall quality. For a search platform, the common quality parameters Precision and Recall will largely depend on the quality of the ontologies used.</p>
<p>Semlab’s semantic platform fully acknowledges that ontologies are the key to a successful semantic analysis system and therefore support ontology maintenance as a core technology. This includes both expert knowledge expression as well as (supervised) automatic ontology learning from domain-specific documents. In addition the SemLab platform fully supports existing base ontologies and offers their own default domain ontologies for many areas of financial information management, exactly as are used every day by the ViewerPro semantic news analysis platform.</p>
<p>The SemLab semantic platform is in essence a data pipelining platform in the sense that the data sources are fed to the start of the line, various relatively independent subsequent operations are done on the data. The results of these are stored in various databases.</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-157" title="illustration2" src="http://semlab.wordpress.com/files/2009/06/illustration23.jpg" alt="illustration2" width="450" height="224" /></p>
<p>In the case of semantic search, the data sources consist of text documents (e.g. reports, news messages, e-mails) in various formats and the final result is an semantic index, modeling the occurrence of ontology terms in the documents. When a user presents the platform with search query, this query is treated as if it were another document and also expressed in terms of the ontology. Since both the document corpus and the user’s query are expressed within the same domain of discourse, they can be compared and ranked on similarity. Finally, using carefully designed threshold values, the resulting document sections are presented to the user in a GUI or available through an API to other applications. In addition to the retrieval of documents, the domain ontology can also be queried directly. In this way the semantic search platform also serves as a question answering tool, enabling users to quickly retrieve the corporate domain knowledge.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Getting the Gist - Machine Translation on the Web]]></title>
<link>http://angelikaf.wordpress.com/2009/06/01/getting-the-gist-via-mt/</link>
<pubDate>Mon, 01 Jun 2009 12:00:20 +0000</pubDate>
<dc:creator>angelikaf</dc:creator>
<guid>http://angelikaf.wordpress.com/2009/06/01/getting-the-gist-via-mt/</guid>
<description><![CDATA[MT technology has come a long way since the Russian to English Georgetown experiment back in 1954 wh]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>MT technology has come a long way since the Russian to English Georgetown experiment back in 1954 which at the time gave rise to an optimistic prediction of having it all sorted out within some 5 or so years. Progress turned out to be a bit slower than that with the complexities of natural language presenting quite a number of stumbling blocks.</p>
<p>Putting aside just for the moment the need for post-editing of MT output by a human translator to achieve publishable standard, would you agree and have you noticed that the results from online translation tools have improved over the years?</p>
<p>As long as your expectations are properly adjusted and while you have to be aware that not all of what you get may be accurate, these tools do offer anyone the option of at least getting the gist of what something written in a foreign language is about, which adds another dimension to the global availability and sharing of information.</p>
<p>I ran a little test and put one of my blog pages through Babelfish (<a href="http://babelfish.yahoo.com/">http://babelfish.yahoo.com/</a>), Google Language Tools (<a href="http://www.google.com/language_tools">http://www.google.com/language_tools</a>), and the Microsoft Live Translator (<a href="http://translator.live.com/">http://translator.live.com/</a>). Aware that I hadn&#8217;t used anything even remotely approximating controlled language in the source text, it wasn&#8217;t surprising that the German MT output was anything but perfect in all three cases. However, all three renditions manage to give a reasonable idea as to what the page is about.</p>
<p>There are of course some obvious problems with disambiguation (the word &#8220;quotes&#8221; for example causing problems and in some instances translated incorrectly as &#8220;Anführungszeichen&#8221; instead of &#8220;Zitate&#8221; which would have been the right choice in the given context), and the intricacies of German grammar and syntax in particular result in some awkward and grammatically and syntactically wrong constructions that are difficult to make any sense of - the odd time to the extent that the translated phrase actually says the opposite to what the source states.</p>
<p>A quick test using technical specifications for a printer gave pretty good results, and that is due to standard terminology and simpler linguistic structures being used in this type of material.</p>
<p>MT systems are improving at what appears to be a faster pace now, with hybrid solutions combining rule based and statistical approaches as well as other techniques. The main prerequesites for successful application of MT to produce publishable outputs are</p>
<ul>
<li>controlled language used in source material</li>
<li>preparation and tuning of the MT engine (for the specific domain)</li>
<li>setting and agreeing quality expectations for the translated output (to control human post-editing effort)</li>
<li>regularly re-training engine with corrections.</li>
</ul>
<p>MT therefore looks to be most suited for specialised translations (e.g. technical documentation &#8211; user manuals, product specifications). To ensure that quality expectations for the translated output are agreed upfront is key, otherwise little or no benefits will be gained in terms of time, effort and cost compared to human translation. The post-edited final output will have to be accurate and linguistically correct. Understood that this is not synonymous with highest achievable quality standard (which is quite hard to define actually), but how important is stylistic excellence to someone installing a newly acquired piece of software and looking for guidance from the installation manual?</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Milk as Karma]]></title>
<link>http://anileklavya.wordpress.com/2009/05/29/milk-as-karma/</link>
<pubDate>Fri, 29 May 2009 13:45:05 +0000</pubDate>
<dc:creator>anileklavya</dc:creator>
<guid>http://anileklavya.wordpress.com/2009/05/29/milk-as-karma/</guid>
<description><![CDATA[Someone called someone milk Milk as noun or milk as verb? Milk as the subject or milk as the object?]]></description>
<content:encoded><![CDATA[Someone called someone milk Milk as noun or milk as verb? Milk as the subject or milk as the object?]]></content:encoded>
</item>
<item>
<title><![CDATA[How Many Grams?]]></title>
<link>http://anileklavya.wordpress.com/2009/05/22/how-many-grams/</link>
<pubDate>Fri, 22 May 2009 01:46:03 +0000</pubDate>
<dc:creator>anileklavya</dc:creator>
<guid>http://anileklavya.wordpress.com/2009/05/22/how-many-grams/</guid>
<description><![CDATA[There is an automatically (intelligently) generated blog which I have read recently. It appears to b]]></description>
<content:encoded><![CDATA[There is an automatically (intelligently) generated blog which I have read recently. It appears to b]]></content:encoded>
</item>
<item>
<title><![CDATA[First Impressions of Wolfram|Alpha]]></title>
<link>http://mendicantbug.com/2009/05/16/first-impressions-of-wolframalpha/</link>
<pubDate>Sun, 17 May 2009 04:42:46 +0000</pubDate>
<dc:creator>Jason Adams</dc:creator>
<guid>http://mendicantbug.com/2009/05/16/first-impressions-of-wolframalpha/</guid>
<description><![CDATA[Perhaps you&#8217;ve heard of the latest brainchild of the Wunderkind Stephen Wolfram:  Wolfram|Alph]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p style="text-align:justify;">Perhaps you&#8217;ve heard of the latest brainchild of the Wunderkind <a class="zem_slink" title="Stephen Wolfram" rel="wikipedia" href="http://en.wikipedia.org/wiki/Stephen_Wolfram">Stephen Wolfram</a>:  <a href="http://wolframalpha.com" target="_blank">Wolfram&#124;Alpha</a>.  Matthew Hurst nicknamed it <a href="http://datamining.typepad.com/data_mining/2009/05/alphram-and-google.html" target="_blank">Alphram</a> today and I agree that&#8217;s a much better name.   Wolfram&#124;Alpha (W&#124;A henceforth) is not a <a class="zem_slink" title="Web search engine" rel="wikipedia" href="http://en.wikipedia.org/wiki/Web_search_engine">search engine</a>, it&#8217;s a knowledge engine.  It will compete with <a class="zem_slink" title="Google" rel="homepage" href="http://google.com">Google</a> on a slice of traffic that Google really isn&#8217;t all that hot in for now, comparative questioning answering.  When you <a href="http://www.google.com/search?source=ig&#38;hl=en&#38;rlz=1G1GGLQ_ENUS245&#38;=&#38;q=how+does+the+gdp+of+south+africa+compare+to+china%3F&#38;btnG=Google+Search&#38;aq=f&#38;oq=" target="_blank">ask Google</a> something like &#8220;How does the <a class="zem_slink" title="Gross domestic product" rel="wikipedia" href="http://en.wikipedia.org/wiki/Gross_domestic_product">GDP</a> of <a class="zem_slink" title="South Africa" rel="wikipedia" href="http://en.wikipedia.org/wiki/South_Africa">South Africa</a> compare to <a class="zem_slink" title="China" rel="geolocation" href="http://maps.google.com/maps?ll=35.0,105.0&#38;spn=10.0,10.0&#38;q=35.0,105.0%20%28China%29&#38;t=h">China</a>?&#8221; you hope you get back something relevant in the first few results (<em>spoiler alert:  you don&#8217;t</em>).  When you ask that of W&#124;A, you get <a href="http://www.wolframalpha.com/input/?i=how+does+the+gdp+of+south+africa+compare+to+china%3F" target="_blank">exactly what you&#8217;re looking for</a>.  Beautiful.  W&#124;A&#8217;s so-called natural language interface isn&#8217;t perfect, though.  You get a lot of flakiness from it until you start to recognize what works and what doesn&#8217;t.</p>
<p style="text-align:justify;">Now let&#8217;s be honest.  How often do we search for that kind of thing?  Not very often.  I think that&#8217;s partly because Google is notoriously bad at it.  Once we start to get a handle on what W&#124;A is capable of, I think people will start expecting more of their friendly neighborhood search giant.  Google claims to have <a href="http://www.techcrunch.com/2009/05/12/what-is-google-squared-it-is-how-google-will-crush-wolfram-alpha-exclusive-video/" target="_blank">a few tricks</a> up its sleeves, but everything <a href="http://googleblog.blogspot.com/2009/04/adding-search-power-to-public-data.html" target="_blank">I&#8217;ve seen out</a> of Google lately has been such a disappointment I am deeply skeptical.  The new trick is called Google Squared and it returns search results in a spreadsheet format, breaking down the various facets of the things you are searching for.  In the demo, it shows stuff like rollercoaster drop speeds, heights, etc when you search for <a class="zem_slink" title="Roller coaster" rel="wikipedia" href="http://en.wikipedia.org/wiki/Roller_coaster">roller coasters</a>.  You can add to the square and do some pretty nifty stuff.  <a class="zem_slink" title="TechCrunch" rel="homepage" href="http://www.techcrunch.com">TechCrunch</a> claims this will kill W&#124;A.  I think the two could be complementary.  Based on the demo, I expect W&#124;A will return results of a higher calibre, but will miss out on a lot of queries because the knowledge is just missing.  Google Squared appears to be doing something fuzzier and will return results that might be really bad.  So while W&#124;A just says it doesn&#8217;t know, Google Squared will let you pick through the junk to find the gem.  Google Squared is expected to launch later this month in <a class="zem_slink" title="Google Labs" rel="homepage" href="http://labs.google.com">Google Labs</a>.</p>
<p style="text-align:justify;">Many have said that where W&#124;A will really compete is against Wikipedia and I am inclined to agree.  There are plenty of things I go to Wikipedia for now that I probably will switch over to W&#124;A for, like populations of countries, size of Neptune&#8217;s moons, and so on.  Wikipedia still wins for more in-depth knowledge on a topic.  W&#124;A also does some pretty cool stuff when you search for the definition of a word (use a query like &#8220;<a href="http://www.wolframalpha.com/input/?i=word+kitten" target="_blank">word kitten</a>&#8220;).  You learn that kitten comes from <a class="zem_slink" title="Classical Latin" rel="wikipedia" href="http://en.wikipedia.org/wiki/Classical_Latin">Classical Latin</a>, and entered English about 700 years ago.  You can find out a similar thing (and go further in depth for the etymology at least) using <a class="zem_slink" title="The American Heritage Dictionary (based on the New Second College Edition)" rel="amazon" href="http://www.amazon.com/American-Heritage-Dictionary-Second-College/dp/0440201896%3FSubscriptionId%3D0G81C5DAZ03ZR9WH9X82%26tag%3Dthemenbug-20%26linkCode%3Dxm2%26camp%3D2025%26creative%3D165953%26creativeASIN%3D0440201896">the American Heritage dictionary</a> on <a href="http://dictionary.reference.com" target="_blank">dictionary.com</a>, but W&#124;A requires less digging.</p>
<p style="text-align:justify;">And this brings me around to a key point with W&#124;A.  It&#8217;s an awesome factoid answering service.  It does it well and it does it in a pretty way.  Stuff you can find in more depth elsewhere you can get quickly and easily, but only superficially via W&#124;A.  There are links to more information, though, so you don&#8217;t lose much by relying on W&#124;A &#8212; assuming it has knowledge about what you&#8217;re looking for.  You&#8217;re still going to be more likely to hit a brick wall with W&#124;A.</p>
<p style="text-align:justify;">And of course, since Wolfram developed <a class="zem_slink" title="Mathematica" rel="amazon" href="http://www.amazon.com/Mathematica-Stephen-Wolfram/dp/0201193302%3FSubscriptionId%3D0G81C5DAZ03ZR9WH9X82%26tag%3Dthemenbug-20%26linkCode%3Dxm2%26camp%3D2025%26creative%3D165953%26creativeASIN%3D0201193302">Mathematica</a>, W&#124;A is backed by it.  Enter an equation and you get some really handy math info back.  Need to quickly know the derivative of a fairly complicated equation?  Presto.  Probably the most satisfying feeling I got today was from a query similar to &#8220;<a href="http://www.wolframalpha.com/input/?i=what+is+the+area+under+x^4%2B3x^2%2B4+from+1+to+8%3F" target="_blank">what is the area under x^4+3x^2+4 from 1 to 8?</a>&#8220;  Let&#8217;s see you answer that, Google Squared.</p>
<p style="text-align:justify;"><a href="http://ealdent.wordpress.com/files/2009/05/picture-11.png"><img class="aligncenter size-full wp-image-1160" title="Wolfram&#124;Alpha sample results" src="http://ealdent.wordpress.com/files/2009/05/picture-11.png" alt="Wolfram&#124;Alpha sample results" width="500" height="374" /></a></p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/04142ddc-4b5d-4b78-b0ba-2a10d21b90e8/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_c.png?x-id=04142ddc-4b5d-4b78-b0ba-2a10d21b90e8" alt="Reblog this post [with Zemanta]" /></a></div>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Measuring L2 Lexical Growth Using Hypernymic Relationships]]></title>
<link>http://callierlibrary.wordpress.com/2009/05/15/measuring-l2-lexical-growth-using-hypernymic-relationships/</link>
<pubDate>Fri, 15 May 2009 17:57:24 +0000</pubDate>
<dc:creator>Callier Library</dc:creator>
<guid>http://callierlibrary.wordpress.com/2009/05/15/measuring-l2-lexical-growth-using-hypernymic-relationships/</guid>
<description><![CDATA[This study investigated second language (L2) lexical development in the spontaneous speech of six ad]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>This study investigated second language (L2) lexical development in the spontaneous speech of six adult, L2 English learners in a 1-year longitudinal study. One important aspect of lexical development is lexical organization and depth of knowledge. Hypernymic relations, the hierarchical relationships among related words that vary in relation to their semantic specificity (e.g., Golden Retriever vs. dog vs. animal), are an important indicator of both lexical organization and depth of knowledge. Thus, this study used hypernymy values from the WordNet database and a lexical diversity measure to analyze lexical development. Statistical analyses in this study indicated that both hypernymic relations and lexical diversity in L2 learners increase over time. Additionally, lexical diversity and hypernymic values correlated significantly, suggesting that as learners&#8217; lexicons grow, learners have access to a wider range of hypernymy levels. These findings are discussed in relation to developing abstractness in language, extending hypernymic knowledge, and the growth of lexical networks.
</p>
<p>from <a href="http://www3.interscience.wiley.com/journal/122380394/abstract"><em>Language Learning</em></a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Logical Atoms]]></title>
<link>http://apperceptual.wordpress.com/2009/05/14/logical-atoms/</link>
<pubDate>Thu, 14 May 2009 21:16:28 +0000</pubDate>
<dc:creator>Peter Turney</dc:creator>
<guid>http://apperceptual.wordpress.com/2009/05/14/logical-atoms/</guid>
<description><![CDATA[In predicate logic, the concept red ball is represented as a combination of the concepts of red and ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>In <a title="http://en.wikipedia.org/wiki/Predicate_logic" href="http://en.wikipedia.org/wiki/Predicate_logic">predicate logic</a>, the concept <em>red ball</em> is represented as a combination of the concepts of <em>red</em> and <em>ball</em>. We can define the predicate RedBall(<em>x</em>) as (Red(<em>x</em>) &#38; Ball(<em>x</em>)). <a title="http://en.wikipedia.org/wiki/Logical_atomism" href="http://en.wikipedia.org/wiki/Logical_atomism">Logical atomism</a> views the world in terms of <em>compound</em> predicates, such as RedBall(<em>x</em>), that are built up from <em>atomic</em> predicates, such as Red(<em>x</em>) and Ball(<em>x</em>). Good old-fashioned AI (<a title="http://en.wikipedia.org/wiki/GOFAI" href="http://en.wikipedia.org/wiki/GOFAI">GOFAI</a>) research almost always assumes a kind of logical atomism. <a title="http://en.wikipedia.org/wiki/Cyc" href="http://en.wikipedia.org/wiki/Cyc">Cyc</a>, for example, represents knowledge using a form of logical atomism. Even those researchers who reject GOFAI still tend to assume logical atomism. Statistical and <a title="http://en.wikipedia.org/wiki/Connectionism" href="http://en.wikipedia.org/wiki/Connectionism">connectionist</a> models of concepts typically view <em>red ball</em> as a combination of <em>red</em> and <em>ball</em>. I believe that we should turn this view on its head. That is, <em>red ball</em> comes first (is more basic, more primitive); <em>red</em> and <em>ball</em> come later (are more complex, more refined).</p>
<p><!--more--></p>
<p>In <a title="http://en.wikipedia.org/wiki/Latent_semantic_analysis" href="http://en.wikipedia.org/wiki/Latent_semantic_analysis">Latent Semantic Analysis</a> (LSA), we represent the semantics of <em>red</em> and <em>ball</em> with vectors, in which the elements are derived from the frequencies of the terms <em>red</em> and <em>ball</em> in various contexts. For those who are familiar with logic, it is natural to think of representing <em>red ball</em> by some mathematical operation on the vectors for <em>red</em> and <em>ball</em>. For example, we might add the <em>red</em> vector to the <em>ball</em> vector. One problem with this idea is that addition is not sensitive to order; thus <em>house boat</em> and <em>boat house</em> would have the same vector, although they have different meanings. One solution to this problem is a mathematical operation on vectors that is sensitive to order, such as the tensor product or the circular convolution.</p>
<p>Several papers explore this vector combination approach to representing compound predicates:</p>
<ul>
<li>Plate (1991), <a title="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.52.2471" href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.52.2471">Holographic reduced representations: Convolution algebra for compositional distributed representations</a></li>
<li>Plate (1994), <a title="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.48.5527" href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.48.5527">Distributed representations and nested compositional structure</a></li>
<li>Smolensky (1994), <a title="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.33.3070" href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.33.3070">Grammar-based connectionist approaches to language</a></li>
<li>Plate (1995), <a title="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.49.6050" href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.49.6050">Estimating analogical similarity by vector dot-products of holographic reduced representations</a></li>
<li>Wilson, Street, and Halford (1995),<a title="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.45.9478" href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.45.9478"> Solving proportional analogy problems using tensor product networks with random representations</a></li>
<li>Jones and Mewhort (2007), <a title="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.89.3355" href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.89.3355">Representing word meaning and order information in a composite holographic lexicon</a></li>
<li>Widdows (2008), <a title="http://www.puttypeg.com/papers/semantic-vector-products.pdf" href="http://www.puttypeg.com/papers/semantic-vector-products.pdf">Semantic vector products: Some initial investigations</a></li>
</ul>
<p>Plate (1995) describes his approach as follows:</p>
<blockquote><p>Suppose we have distributed representations for the concept &#8220;circle&#8221;, &#8220;triangle&#8221;, &#8220;small&#8221;, and &#8220;large&#8221;. We can represent a small circle by superposing the patterns for &#8220;small&#8221; and &#8220;circle&#8221;. However, when we try to represent a small circle and large triangle we have the problem that the superposition of the four patterns is ambiguous &#8211; the information that small is associated with triangle and large associated with circle is lost &#8211; it could be a large circle and a small triangle. The same problem arises when we try to represent conceptual relations. Suppose we have a predicate representation for &#8220;Spot bit Jane&#8221;: <strong>bite(spot, jane)</strong>. Spot is the agent of this relation, and Jane is the object (or patient). A distributed representation of this relation must be careful to preserve the information about which person is associated with which role (agent or object) so that there is no confusion with &#8220;Jane bit Spot&#8221;. I refer to associations between roles and fillers as <em>role/filler bindings</em>.</p></blockquote>
<p>The assumption here is that we build up the complex structure <strong>bite(spot, jane)</strong> from simple atomic elements <strong>bite</strong>, <strong>spot</strong>, and <strong>jane</strong>. But I suggest that the atomic element is <strong>bite(spot, jane)</strong>, and that we construct <strong>bite</strong>, <strong>spot</strong>, and <strong>jane</strong> from this atomic element.</p>
<p>Suppose we have a <a title="http://apperceptual.wordpress.com/2007/07/24/tensors-for-data-and-text-analysis/" href="http://apperceptual.wordpress.com/2007/07/24/tensors-for-data-and-text-analysis/">third-order tensor</a> of the form <em>pattern</em> × <em>word</em> × <em>word</em>. The predicate <em>P</em>(<em>x</em>,<em>y</em>) is a pattern (&#8220;<em>x</em> bit <em>y</em>&#8220;), <em>x</em> is a word (&#8220;spot&#8221;), <em>y</em> is another word (&#8220;jane&#8221;), and the triple &#60;<em>P</em>(<em>x,y</em>),<em>x,y</em>&#62; is a cell (&#60;&#8221;<em>x</em> bit <em>y</em>&#8220;,&#8221;spot&#8221;,&#8221;jane&#8221;&#62;) in this third-order tensor. The concept &#8220;bite&#8221; corresponds to the slice (a matrix cut out of the tensor) &#60;&#8221;<em>x</em> bit <em>y</em>&#8220;,*,*&#62;, the concept &#8220;spot&#8221; is the slice &#60;*,&#8221;spot&#8221;,*&#62;, and the concept &#8220;jane&#8221; is the slice &#60;*,*,&#8221;jane&#8221;&#62;.</p>
<p>In this representation, the atomic elements (tensor cells; scalars) are whole events (&#8220;I see a red ball&#8221;, &#8220;Spot bit Jane&#8221;), and abstract concepts (&#8220;red&#8221;, &#8220;bite&#8221;) are complicated structures (tensor slices; matrices), composed of these atomic elements. This turns the usual picture upside-down. There is nothing &#8220;simple&#8221; or &#8220;atomic&#8221; about the concept &#8220;circle&#8221;. It is a complex structure, composed of all of our atomic experiences of events that contained some aspect of circularity.</p>
<p>Using a vector combination approach, <a title="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.89.3355" href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.89.3355">Jones and Mewhort (2007)</a> report a score of 57.81% (see page 18) on multiple-choice synonym questions from the TOEFL (Test of English as a Foreign Language). Using the opposite approach (<a title="http://arxiv.org/abs/0711.2023" href="http://arxiv.org/abs/0711.2023">Turney, 2007</a>), a third-order <em>pattern</em> × <em>word</em> × <em>word</em> tensor achieves a score of 83.75% (see page 22).</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Computer Power and Human Reason (Part 2)]]></title>
<link>http://orlandoutland.wordpress.com/2009/04/29/computer-power-and-human-reason-part-2/</link>
<pubDate>Wed, 29 Apr 2009 15:26:21 +0000</pubDate>
<dc:creator>ooutland</dc:creator>
<guid>http://orlandoutland.wordpress.com/2009/04/29/computer-power-and-human-reason-part-2/</guid>
<description><![CDATA[Weizenbaum devotes a good deal of space in the book to a chapter called “How Computers Work,” a usef]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p style="margin:0 0 10pt;" class="MsoNormal"><span style="line-height:115%;font-size:14pt;"><span style="font-family:calibri;"><font size="4">Weizenbaum devotes a good deal of space in the book to a chapter called “How Computers Work,” a useful introduction to programming for anyone who comes into his book with no tech background.<span>&#160;&#160; </span>Some of the most famous (or notorious) quotes from the book stem from Weizenbaum’s description of programming, and programmers.<span>&#160; </span>Discussing high-level programming languages, he notes that most programmers “have no knowledge whatsoever of their computer’s machine language or of the content and structure of the translators that mediate between them and their computers.”<span>&#160; </span>Weizenbaum notes that, rather than being the perfect transcription of mathematical principles, programming is “like any form of writing, more often than not experimental.<span>&#160; </span>One programs, just as one writes, not because one understands, but in order to come to understand.”<span>&#160; </span></font></span></span></p>
<p style="margin:0 0 10pt;" class="MsoNormal"><span style="line-height:115%;font-size:14pt;"><span style="font-family:calibri;"><font size="4">And he devotes a chapter to the “compulsive programmer,” a type we’re all familiar with now:</font></span></span></p>
<p style="margin:0 0 10pt .5in;" class="MsoNormal"><span style="line-height:115%;font-size:14pt;"><span style="font-family:calibri;"><font size="4">Wherever computer centers have become established…bright young men of disheveled appearance, often with sunken glowing eyes, can be seen sitting at computer consoles, their arms tensed and waiting to fire their fingers, already poised to strike, at the buttons and keys on which their attention seems to be as riveted as a gambler’s on the rolling dice…they work until they drop, twenty, thirty hours at a time.<span>&#160; </span>Their food, if they arrange it, is brought to them: coffee, Cokes, sandwiches.<span>&#160; </span>If possible, they sleep on cots near the computer…their rumpled clothes, their unwashed and unshaven faces, and their uncombed hair all testify that they are oblivious to their bodies and to the world in which they move.<span>&#160; </span>They exist, at least when so engaged, only through and for the computers.<span>&#160; </span>These are computer bums, compulsive programmers.<span>&#160; </span>They are an international phenomenon.</font></span></span></p>
<p style="margin:0 0 10pt;" class="MsoNormal"><span style="line-height:115%;font-size:14pt;"><span style="font-family:calibri;"><font size="4">The only thing that has changed in the intervening decades has been the relocation of the console from the university computer center to the dark basement or bedroom.<span>&#160; </span>Weizenbaum notes the difference between the compulsive programmer and the “dedicated, hard-working professional”:<span>&#160; </span>The professional has a problem to solve, uses the tool at hand and moves on, “whereas the compulsive programmer sees the problem mainly as an opportunity to interact with the computer.” <span>&#160;</span>Still, there’s no doubt many discoveries and inventions have come about because of some hacker tinkering endlessly with registries and routines, which the workaday programmer would never have discovered or created.<span>&#160; </span>But Weizenbaum’s argument is valid – to wall one’s self off into a “hard-coded” world where, say, everyone has a PhD in math, engineering or computer science, and design problems which should be handed to “creatives” are left instead to microgranular analysis of preference polls of thousands of Pantone shades, is to deny “the whole man,” to declare that problems which can’t be solved with pure science are not interesting or relevant.</font></span></span></p>
<p style="margin:0 0 10pt;" class="MsoNormal"><span style="line-height:115%;font-size:14pt;"><span style="font-family:calibri;"><font size="4">Moreover, it’s just as dangerous when a Science that is really more of a Humanity, such as psychology, attempts to become “scientific,” to convert human behavior into codifiable rules (see for instance </font></span><a href="http://www.furiousseasons.com/archives/2008/11/dsmv_criticized_in_la_times_oped_for_lack_of_transparency.html"><span style="font-family:calibri;color:#800080;"><font size="4">the creation of new “illnesses” for the DSM-V</font></span></a><span style="font-family:calibri;"><font size="4"> like “apathy disorder” and “parental alienation syndrome” so that all human problems can be assigned a billing code – and a prescription).<span>&#160; </span></font></span></span></p>
<p style="margin:0 0 10pt;" class="MsoNormal"><span style="line-height:115%;font-size:14pt;"><span style="font-family:calibri;"><font size="4">Weizenbaum is openly disdainful of the AI research of his time, using the unpleasantly Stalinist term “artificial intelligentsia” to describe its proponents.<span>&#160; </span>He lays out the now well-known linguistic problems with AI, i.e. “the house blew it” is a statement which the human mind can decode by using context – “the house” is often used to refer to a casino, and if the conversation is about gambling, the listener may glean the meaning even without having heard the phrase before.<span>&#160; </span></font></span></span></p>
<p style="margin:0 0 10pt;" class="MsoNormal"><span style="line-height:115%;font-size:14pt;"><span style="font-family:calibri;"><font size="4">Weizenbaum’s concern is that even a system like ELIZA, which isn’t the greatest conversationalist, provokes a willing suspension of disbelief in many of its users.<span>&#160; </span>Even in our interpersonal dealings, we bring a “working hypothesis” about the other person, usually based on first impressions – i.e., I see a fat old man getting out of a car with a RUSH IS RIGHT sticker, and I fill out my “conceptual framework” on that person with my prediction of what he’s going to think and say.<span>&#160; </span>These “acts of induction” are automatic and probably instinctive, and they come into play whenever we interact with another, whether that other is man or machine.<span>&#160; </span>Weizenbaum seems off the mark when he says that people conversing with ELIZA “cling to the belief that they are being understood” not because of our lifelong habit of infilling detail on new people, but because they don’t understand the science behind it, and that “it is therefore clothed in the magical mantle of Science and all of Science’s well known powers may be attributed to it.”<span>&#160; </span></font></span></span></p>
<p style="margin:0 0 10pt;" class="MsoNormal"><span style="line-height:115%;font-size:14pt;"><span style="font-family:calibri;"><font size="4">Next post: the conclusion of my review.</font></span></span></p>
<p><font size="4" face="Calibri"></font></p>
</div>]]></content:encoded>
</item>

</channel>
</rss>
