<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>speech-recognition &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://en.wordpress.com/tag/speech-recognition/</link>
	<description>Feed of posts on WordPress.com tagged "speech-recognition"</description>
	<pubDate>Sat, 26 Dec 2009 01:32:54 +0000</pubDate>

	<generator>http://en.wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[Speech recognition in Windows 7]]></title>
<link>http://kiransway.wordpress.com/2009/12/22/speech-recognition-in-windows-7/</link>
<pubDate>Tue, 22 Dec 2009 09:38:23 +0000</pubDate>
<dc:creator>Kiran Kumar</dc:creator>
<guid>http://kiransway.wordpress.com/2009/12/22/speech-recognition-in-windows-7/</guid>
<description><![CDATA[Here is a great feature from Windows 7, “Speech Recognition”. The computer will do what ever you say]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Here is a great feature from <span id="IL_AD1" class="IL_AD">Windows</span> 7, “Speech Recognition”. The computer will do what ever you say to it. It will be a great feature to the people who hate to use the keyboard or mouse. Here is how to configure the “Speech Recognition”?</p>
<p><a href="http://myinfoland.com/2009/12/21/speech-recognition-in-windows-7/">Read More&#8230;&#8230;</a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[New NUI]]></title>
<link>http://sharathpatali.wordpress.com/2009/12/20/new-nui/</link>
<pubDate>Sun, 20 Dec 2009 20:49:13 +0000</pubDate>
<dc:creator>Sharath Patali</dc:creator>
<guid>http://sharathpatali.wordpress.com/2009/12/20/new-nui/</guid>
<description><![CDATA[Recently I saw a interesting tweet from Seth Sandler regarding a AS3 speech recognition lib. The dem]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Recently I saw a interesting tweet from Seth Sandler regarding a<a href="http://www.bytearray.org/?p=1151" target="_blank"> AS3 speech recognition</a> lib. The demo was very impressive. So I wondered why not for python ?. So I started looking around for some premade librarys in Python. I found three good ones</p>
<ul>
<li><a href="http://" target="_blank">pyspeech</a></li>
<li><a href="http://dragonfly.googlecode.com/" target="_blank">dragonfly</a></li>
<li><a href="http://cmusphinx.sourceforge.net/wordpress/" target="_blank">Sphinx from CMU</a></li>
</ul>
<p>The first two where windows only solution so they where rejected immediately. The Sphinx library from CMU is very vast library. After some reading I found out that the dwarf version of Sphinx called pocketsphinx is just a plugin for Gstreamer. It was very easy to integrate them and finally integrated them to PyMT. <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  . I will be posting the example app as well as Speech Recognition engine for PyMT will be included in the 0.4 release. Can&#8217;t wait to see how users will use this new capabilities in their apps <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><span style='text-align:center; display: block;'><br />
<object type="application/x-shockwave-flash" width="400" height="300" data="http://www.vimeo.com/moogaloop.swf?clip_id=8296468&amp;server=www.vimeo.com&amp;fullscreen=1&amp;show_title=1&amp;show_byline=0&amp;show_portrait=0&amp;color=01AAEA"><param name="quality" value="best" /><param name="allowfullscreen" value="true" /><param name="scale" value="showAll" /><param name="movie" value="http://www.vimeo.com/moogaloop.swf?clip_id=8296468&amp;server=www.vimeo.com&amp;fullscreen=1&amp;show_title=1&amp;show_byline=0&amp;show_portrait=0&amp;color=01AAEA" /></object><br />
</span></p>
<p>Other then this i&#8217;ve been working on completely rebuilding the animation framework for PyMT 0.4 and its complete now WITH the wiki documentation. Thanks Mathieu for all the advices you gave me in improving this. Heres a video of the app he made with the animation framework.<br />
<span style='text-align:center; display: block;'><br />
<object type="application/x-shockwave-flash" width="400" height="300" data="http://www.vimeo.com/moogaloop.swf?clip_id=8206878&amp;server=www.vimeo.com&amp;fullscreen=1&amp;show_title=1&amp;show_byline=0&amp;show_portrait=0&amp;color=01AAEA"><param name="quality" value="best" /><param name="allowfullscreen" value="true" /><param name="scale" value="showAll" /><param name="movie" value="http://www.vimeo.com/moogaloop.swf?clip_id=8206878&amp;server=www.vimeo.com&amp;fullscreen=1&amp;show_title=1&amp;show_byline=0&amp;show_portrait=0&amp;color=01AAEA" /></object><br />
</span></p>
<p>This one I did.</p>
<p><span style='text-align:center; display: block;'><br />
<object type="application/x-shockwave-flash" width="400" height="300" data="http://www.vimeo.com/moogaloop.swf?clip_id=8060864&amp;server=www.vimeo.com&amp;fullscreen=1&amp;show_title=1&amp;show_byline=0&amp;show_portrait=0&amp;color=01AAEA"><param name="quality" value="best" /><param name="allowfullscreen" value="true" /><param name="scale" value="showAll" /><param name="movie" value="http://www.vimeo.com/moogaloop.swf?clip_id=8060864&amp;server=www.vimeo.com&amp;fullscreen=1&amp;show_title=1&amp;show_byline=0&amp;show_portrait=0&amp;color=01AAEA" /></object><br />
</span></p>
<p>We are trying our best to release a Alpha version before Christmas, lets see how it goes. Lots and lots of changes and feature additions <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Read with Dragon (sort of): Kindle for PC]]></title>
<link>http://hand2mouth.wordpress.com/2009/12/18/read-with-dragon-sort-of-kindle-for-pc/</link>
<pubDate>Sat, 19 Dec 2009 02:48:46 +0000</pubDate>
<dc:creator>hand2mouth</dc:creator>
<guid>http://hand2mouth.wordpress.com/2009/12/18/read-with-dragon-sort-of-kindle-for-pc/</guid>
<description><![CDATA[Although I&#8217;m able to turn pages and make good use of my BookChair, there are times that I woul]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Although I&#8217;m able to turn pages and make good use of my <a title="BookChair" href="http://hand2mouth.wordpress.com/2009/06/01/review-bookchair-just-right-bookholder/">BookChair</a>, there are times that I would rather read on the computer. This is mainly because of neck trouble and trying to learn C++; getting the keyboard out of the way, setting the book on the keyboard table, turning the pages and hoping Dragon doesn&#8217;t pick that up, and glancing up and down from the book to the screen make it a little more cumbersome than I would like. I&#8217;ve been thinking it would be nice to just vocally switch windows; read the book on the screen, then go into Win32Pad to dictate some code.</p>
<p>It turns out that I can potentially do that now using Amazon&#8217;s <a title="Kindle for PC" href="http://www.amazon.com/gp/feature.html/ref=kcp_pc_mkt_lnd?docId=1000426311">Kindle for PC</a> application, which is free (with the exception of the books, of course). I hadn&#8217;t had very high hopes for the application&#8217;s accessibility, considering that Amazon isn&#8217;t great with it (see the lack of captioning for Unbox video products). However, it&#8217;s actually not that bad. (Better with Vocola commands, which I will create shortly.) While all buttons must be accessed by mouse commands, actually reading isn&#8217;t a bad experience at all.</p>
<p><em>C++ without Fear</em> was not available for Kindle, but <em>C++ Primer Plus</em> was, so I downloaded a free sample to test how Kindle PC would work with Dragon. To open a book, mousegrid to the Home button (yay, verbing nouns) and click it if it&#8217;s not already highlighted. Then, by the same method, double click the book icon. The clicking isn&#8217;t as tedious as it sounds because the buttons are large; the grid doesn&#8217;t require much narrowing. To turn pages, say &#8220;Go left&#8221; [previous page] or &#8220;go right&#8221; [next page]. Up and down do the same thing respectively. To turn multiple pages, say &#8220;Go &#60;direction&#62; &#60;number&#62;.&#8221;</p>
<p>Text is very clear, and there is a button to adjust the font size. There are also a Go To and a Bookmark button, but Kindle PC already saves your last position when you exit, so that you can open the book where you left off. Note that I didn&#8217;t say page number &#8212; Kindle doesn&#8217;t use them. That somewhat disorients me, but for something like a programming text I suppose it would keep me from getting overwhelmed. That is, seeing 20% read looks more encouraging than &#8220;Oh god, I have 900 pages left. Dammit I&#8217;m slow!&#8221;.</p>
<p>At the moment you can&#8217;t create notes, unfortunately. (Not that Kindle would be select and say compatible even if you could, but I&#8217;m hoping for something that would allow at least partial dictation, a la Foxit. You also can&#8217;t copy text, for understandable reasons, but that sucks for a programming book. I do wish there could be a PDF style layout option, by which I mean the possibility of an autoscroll feature. Kindle PC is nowhere near perfect, but it is passable.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[VoiceXML , what about it ?]]></title>
<link>http://boneymtom.wordpress.com/2009/12/16/voicexml-what-is-the-story/</link>
<pubDate>Wed, 16 Dec 2009 06:27:09 +0000</pubDate>
<dc:creator>boneymtom</dc:creator>
<guid>http://boneymtom.wordpress.com/2009/12/16/voicexml-what-is-the-story/</guid>
<description><![CDATA[VoiceXML is designed for creating audio dialogues that feature synthesized speech, digitized audio, ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>VoiceXML is designed for creating audio dialogues that feature synthesized speech, digitized audio, recognition of spoken and DTMF key input, recording of spoken input, telephony, and mixed initiative conversations. Its major goal is to bring the advantages of web-based development and content delivery to interactive voice response applications.</p>
<p><strong>VoiceXML</strong> (<strong>VXML</strong>) is the <a title="World Wide Web Consortium" href="http://en.wikipedia.org/wiki/World_Wide_Web_Consortium">W3C</a>&#8217;s standard <a title="XML" href="http://en.wikipedia.org/wiki/XML">XML</a> format for specifying interactive voice dialogues between a human and a computer. It allows voice applications to be developed and deployed in an analogous way to HTML for visual applications. Just as HTML documents are interpreted by a visual web browser, VoiceXML documents are interpreted by a <a title="Voice browser" href="http://en.wikipedia.org/wiki/Voice_browser">voice browser</a>. A common architecture is to deploy banks of voice browsers attached to the Public Switched Telephone Network (<a title="Public Switched Telephone Network" href="http://en.wikipedia.org/wiki/Public_Switched_Telephone_Network">PSTN</a>) so that users can use a telephone to interact with voice applications.</p>
<p>While HTML assumes a graphical web browser with display, keyboard, and mouse, VoiceXML assumes a voice browser with audio output, audio input, and keypad input. Audio input is handled by the voice browser&#8217;s speech recognizer. Audio output consists both of recordings and speech synthesized by the voice browser&#8217;s text-to-speech system.</p>
<p>More info from http://www.vxml.org and http://www.voicexml.org</p>
<div class="wp-caption alignnone" style="width: 490px"><img title="VoiceXML explained" src="http://www.voicexml.org/sites/default/files/intro1_0.jpg" alt="VoiceXML simple diagram" width="480" height="288" /><p class="wp-caption-text">VoiceXML example</p></div>
<p>VoiceXML has the following advantages</p>
<ul>
<li>Leverage of the World-Wide Web and of its capabilities.</li>
<li>Latest developments in computer-based speech recognition and text-to-speech synthesis.</li>
<li>Spread of Internet to everyday life</li>
</ul>
<p>Another important mention is Automatic speech recognition (ASR)systems .</p>
<p>Over the phone, and with no speaker training, a speech recognition system needs to be given a set of speech grammars that tell it what words and phrases it should expect. Recognition over mobile phones in noisy environments, while problematic, can be improved with a new technology called distributed speech recognition, where the early analysis is done on the handset. Advances are also being made in speech synthesis, or text-to-speech (TTS). They can be hard to listen to, and at times even incomprehensible. But newer TTS systems are much more lifelike &#8211; they use a technique called waveform concatenation, in which speech is generated from libraries of pre-recorded waveforms.</p>
<p>Why is VoiceXML important ?</p>
<p>First, the phone is important. There are over 1.5 billion phones in use, far more than there are Internet-connected computers. Phones are easy to use and don&#8217;t need to be booted up. Telephone networks are much more reliable than data networks.</p>
<p>Second, voice is important on the phone. Voice has always been the natural mode of communication for phones. Even though some mobiles have WAP/XHTML browsers, their small screens and keypads make micro browsers hard to use, especially while driving. The i-mode system is more compelling, though shares the same limitations.</p>
<p><strong>A</strong><strong>utomatic speech recognition</strong> or <strong>computer speech recognition</strong> converts spoken words to text. The term &#8220;voice recognition&#8221; is sometimes used to refer to speech recognition where the recognition system is trained to a particular speaker &#8211; as is the case for most desktop recognition software, hence there is an element of <a title="Speaker recognition" href="http://en.wikipedia.org/wiki/Speaker_recognition">speaker recognition</a>, which attempts to identify the person speaking, to better recognize what is being said. Speech recognition is a broad term which means it can recognize almost anybody&#8217;s speech &#8211; such as a call-centre system designed to recognize many voices. Voice recognition is a system trained to a particular user, where it recognizes their speech based on their unique vocal sound.</p>
<p>Both <span style="color:#000000;">acoustic modeling</span> and language modeling are important parts of modern statistically-based speech recognition algorithms.  Modern general-purpose speech recognition systems are generally based on Hidden Markov Models (HMM). These are statistical models which output a sequence of symbols or quantities. One possible reason why HMMs are used in speech recognition is that a speech signal could be viewed as a piecewise stationary signal or a short-time stationary signal. That is, one could assume in a short-time in the range of 10 milliseconds, speech could be approximated as a stationary process. Speech could thus be thought of as a Markov model for many stochastic processes.</p>
<p>If you feel like developing your own application download the <a title="CSU Toolkit" href="http://www.cslu.ogi.edu/toolkit/" target="_blank">CSU Toolkit</a>. It is a comprehensive suite of tools to enable exploration , learning and research into speech and human-computer interaction.</p>
<p>Applications best suited for VoiceXML</p>
<p>Information retrieval(IR) is a good example for VoiceXML. In such an application the audio output tends to be a pre recorded infromation. Voice input can be a few browsing commands or a whole street address , it can be simple as well as quite rich. Let us consider using this IR application for weather information , where people can call a number and use post code as a commands to retrieve the weather forecasts. Unified Messaging is another example where emails can be read over the phone , outgoing email can be recorded . Directory assistance applications also work well with VoiceXML, this is already demonstrated by many telecommunication providers.</p>
<p>What is the future for Voice XML</p>
<p>The W3C is trying to ensure a VoiceXML standard is available to ensure interoperability between different implementations of VoiceXML can execute the same content the same way. There could be standardised audio playback controls . Speaker verification is a major security concern that needs to be addressed to ensure the authenticity of the messages. Support for multimodal markup standards is also a proposed solution.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Speech recognition + WP template = Bloated font!]]></title>
<link>http://cathedralist.wordpress.com/2009/12/08/speech-recognition-wp-template-bloated-font/</link>
<pubDate>Tue, 08 Dec 2009 09:08:42 +0000</pubDate>
<dc:creator>cathedralist</dc:creator>
<guid>http://cathedralist.wordpress.com/2009/12/08/speech-recognition-wp-template-bloated-font/</guid>
<description><![CDATA[It was only after I published my last post that I realized there was something very different about ]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>It was only after I published my last post that I realized there was something very different about the font &#8211; which is supposed to be fixed in this WP template.  So how did it get so humungous?  No, WP isn&#8217;t to blame &#8211;  it was the Windows Speech Recognition program on my new Lenovo  G550 BAQ laptop.  I had been breaking it in, <em>training</em><em> </em>the computer (that makes it sound like some kind of wild beast, don&#8217;t it?) by dictating to it -  and I had what I thought was a bright idea&#8230; I&#8217;d dictate the post!! I had a niggling doubt whether it would work on the WP platform &#8211; perhaps it was configured only to MS Word?  It worked alright &#8211; but with a bloated font as you can see!!</p>
<p>Raw unedited speech recognition looks like garbled gibberish &#8211; no kidding!  The Help section assures me that the software program gets better with practice.  Until then, Speech Recognition  can be considered the twin of the perverse Spell Checker &#8211; you need a firm hand and agile brain to rein both in.  Here is a short verse (that could have well been produced by Speech Recognition  I reckon).</p>
<p>OWED TO A SPELLING CHECKER </p>
<p>I have a spelling checker</p>
<p>It came with my pea sea</p>
<p>It plain lee marks four my revue</p>
<p>Mix steaks eye kin not sea.</p>
<p>Eye have run this poem threw it</p>
<p>I&#8217;m shore your pleased to no</p>
<p>Its letter perfect awl the weigh</p>
<p>My checker told me sew.</p>
<p>           &#8211; Sauce Unknown</p>
<p>After  a bit of research on the Net, I found the <a href="http://www.bios.niu.edu/zar/poem.pdf" target="_blank">original version of  the poem:<em> Candidate for a Pullet Surprise</em> by Jerry Zar</a>.</p>
<p>© Sosha Srinivasan</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Thunderbird 3 RC 2 with Dragon NaturallySpeaking 10]]></title>
<link>http://hand2mouth.wordpress.com/2009/12/02/thunderbird-3-rc-2-with-dragon-naturallyspeaking-10/</link>
<pubDate>Wed, 02 Dec 2009 18:58:21 +0000</pubDate>
<dc:creator>hand2mouth</dc:creator>
<guid>http://hand2mouth.wordpress.com/2009/12/02/thunderbird-3-rc-2-with-dragon-naturallyspeaking-10/</guid>
<description><![CDATA[I am happy to report that nothing really got broken in trying to use Dragon NaturallySpeaking 10 wit]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I am happy to report that nothing really got broken in trying to use Dragon NaturallySpeaking 10 with Mozilla Thunderbird 3 RC 2. The toolbar buttons and menus are still accessible, and my custom buttons are still there, though I had to put them back again. Dragon&#8217;s Thunderbird 2 commands, like &#8220;next/previous message&#8221; and &#8220;write mail,&#8221; still work. Don&#8217;t be alarmed if the File, etc. menu is grayed out &#8212; you can still access it, as well as the submenus.</p>
<p>Please note, however, that you will need to use the Mouse Grid commands and Tab and Enter to get around the box and buttons for Tools &#8212; Options. They seem to have gone the route of Firefox 3, that way. Oh well.</p>
<p>The newest thing about it is the tabbed interface. When you highlight a message and click Read (if you have the button), your message will only open in a new window. But if you say &#8220;press Enter,&#8221; Thunderbird will open the message in a new tab. As with Firefox, you can speak a word in the tab &#8212; like &#8220;Inbox&#8221; &#8212; and Dragon will click the tab. EDIT: I may have spoken too soon. The only tab that Dragon will click is the 1st one, the main one &#8212; Inbox, or Sent, or whatever folder your messages are in. To navigate around any other tabs, you will have to use the &#8220;press Control &#60;number&#62;&#8221; command.</p>
<p>The &#8220;close tab&#8221; command doesn&#8217;t transfer, because Dragon obviously couldn&#8217;t apply that to Thunderbird 2. So you have to say &#8220;press Control W.&#8221; This is where Vocola or another add-on program might come in handy.</p>
<p>Another thing I like about Thunderbird 3 is that there seem to be more buttons included. A few I don&#8217;t remember seeing are Reply to List, Archive, and Restart. One of those may be a custom button I forgot about, but I don&#8217;t think so. There are also better keyboard shortcuts in Thunderbird&#8217;s menus to begin with, so that even if you don&#8217;t have a button, you can make a command if you have an add-on program.</p>
<p>Speaking of buttons, there are now automatically included buttons in each e-mail you open. Rather than putting the Delete, Reply, and other like buttons up in the toolbar, Thunderbird has put them above the message, by the subject and recipient lines. (If you use Skip Trash rather than Delete, you will still have to put the Skip button up in the toolbar. You can&#8217;t put it within the message.)</p>
<p>One thing I do want to mention is that if you&#8217;re using the custom Read button, you want to say &#8220;click to select columns to display,&#8221; and then uncheck &#8220;read.&#8221; If you don&#8217;t do that, then your Read button won&#8217;t work, because Dragon will click the tiny button labeled &#8220;click to sort by Read&#8221; instead of opening your highlighted message.</p>
<p>Another thing I want to mention is that there is still no total select and say support, but you can still dictate reasonably into a message field &#8212; though to be on the safe side, as with anything, you might want to dictate into your word processor or the dictation box first.</p>
<p>I haven&#8217;t tried any more add-ons yet, but will report when I do, and when I&#8217;ve had more of a chance to use the program. But for now it&#8217;s pretty cool. <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Speech Recognation]]></title>
<link>http://bluewarrior.wordpress.com/2009/11/29/speech-recognation/</link>
<pubDate>Sun, 29 Nov 2009 03:41:14 +0000</pubDate>
<dc:creator>bluewarrior</dc:creator>
<guid>http://bluewarrior.wordpress.com/2009/11/29/speech-recognation/</guid>
<description><![CDATA[Speech Recognation adalah suatu sistem mengidentifikasi seseorang dari suaranya. Voice Recognition/V]]></description>
<content:encoded><![CDATA[Speech Recognation adalah suatu sistem mengidentifikasi seseorang dari suaranya. Voice Recognition/V]]></content:encoded>
</item>
<item>
<title><![CDATA[Garmin nüvi 855 4.3-Inch Widescreen Portable GPS Navigator with Speech Recognition]]></title>
<link>http://garminn.wordpress.com/2009/11/26/garmin-nuvi-855-4-3-inch-widescreen-portable-gps-navigator-with-speech-recognition/</link>
<pubDate>Thu, 26 Nov 2009 10:37:30 +0000</pubDate>
<dc:creator>amz0</dc:creator>
<guid>http://garminn.wordpress.com/2009/11/26/garmin-nuvi-855-4-3-inch-widescreen-portable-gps-navigator-with-speech-recognition/</guid>
<description><![CDATA[&nbsp; List Price: $499.99 Price: $199.99 &amp; eligible for FREE Super Saver Shipping on orders ove]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><a href="http://astore.amazon.com/garmin.nuvi.portable.best.gps.navigator-20/detail/B001OMGVII"><img class="alignnone" title="Garmin nüvi 855 4.3-Inch Widescreen Portable GPS Navigator with Speech Recognition" src="http://ecx.images-amazon.com/images/I/51cMJjupwVL._SL210_.jpg" alt="Garmin nüvi 855 4.3-Inch Widescreen Portable GPS Navigator with Speech Recognition" width="210" height="139" /></a></p>
<p>&#160;</p>
<table id="prices">
<tbody>
<tr>
<td>List Price:</td>
<td>$499.99</td>
</tr>
<tr>
<td>Price:</td>
<td>$199.99 &#38; eligible for FREE Super Saver Shipping on orders over $25</td>
</tr>
</tbody>
</table>
<p>&#160;</p>
<h3>Features</h3>
<ul>
<li>Sleek, ultra-slim design fits in a pocket; High-sensitivity GPS receiver for fast satellite lock</li>
<li>Pre-loaded with City Navigtor NT maps for North America, including more than 6-million name-searchable points of interest</li>
<li>Speech recognition&#8211;speak menu options and keep your eyes on the road</li>
</ul>
<p><a href="http://astore.amazon.com/garmin.nuvi.portable.best.gps.navigator-20/detail/B001OMGVII">Read Full Detail&#8230;</a></p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Google brings buzz to captions like never before]]></title>
<link>http://blog.3playmedia.com/2009/11/23/google-brings-buzz-to-captions-like-never-before/</link>
<pubDate>Mon, 23 Nov 2009 17:55:13 +0000</pubDate>
<dc:creator>Josh Miller</dc:creator>
<guid>http://blog.3playmedia.com/2009/11/23/google-brings-buzz-to-captions-like-never-before/</guid>
<description><![CDATA[Just the other day Google announced its intentions to automatically generate closed caption files on]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>Just the other day <a href="http://googleblog.blogspot.com/2009/11/automatic-captions-in-youtube.html">Google announced</a> its intentions to automatically generate closed caption files on a select group of YouTube files.  The story quickly made it to the <a href="http://www.nytimes.com/2009/11/20/technology/internet/20google.html">NY Times</a> and all over the blogosphere, as it rightfully should.  The idea is to eventually rollout the capability across YouTube for all users to test.  With <a href="http://www.youtube.com/t/fact_sheet">20 hours</a> of video being uploaded to YouTube every minute, that&#8217;s a lot of text being created!</p>
<p>At its core, this is a brilliant move by Google to improve <a href="http://erictric.com/online/youtube-auto-captioning-a-great-way-to-search-for-video">YouTube search</a> (and advertising) capabilities.  But Google&#8217;s announcement, largely because it&#8217;s Google, also puts the accessibility issue in front of the entire country for a change.  Captions are mandated for much of television, but they are only beginning to get some attention on the internet, well until now.  Representative Ed Markey, the same Congressman who made the original push for closed captioning on television, introduced <a href="http://www.govtrack.us/congress/bill.xpd?bill=h111-3101">H.R. 3101</a>, the Twenty-first Century Communications and Video Accessibility Act of 2009, during this session of Congress, and it currently has 19 co-sponsors.  This is actually the second attempt at getting a bill passed that would mandate an improved user experience for the hearing impaired.</p>
<p>Thanks to one of the most talked about technology companies of our time, closed captioning is getting attention all over the internet.  Anyone who works with online video is now paying attention to closed captioning.  Not only are we <a href="http://deafness.about.com/b/2009/11/19/way-to-go-google-thanks-for-automatic-captions.htm">empowering the hearing impaired</a>, but in a virtual world that seems to be driven by search and discovery, video can now be made more &#8220;accessible&#8221; than ever.</p>
<p>So for a business that is centered on providing high quality, time synchronized transcripts, what does this announcement mean?</p>
<p>Well, it could mean a lot of things.  First, let&#8217;s look into this new Google service.  Google will deploy the same technology that powers Google Voice across YouTube to enable the creation of text.  This means they will be using automatic speech recognition (ASR) to create the caption files.  Using ASR on audio and video is not a new concept, but it&#8217;s new at this scale.  We&#8217;ve commented on <a href="http://blog.3playmedia.com/2009/05/22/adventures-in-speech-recognition/">our experiences with ASR</a> capbilities in the past.  In fact, we&#8217;ve even played with the <a href="http://blog.3playmedia.com/2009/06/30/accuracy-still-a-problem-for-googles-ears/">very engine</a> that will be front and center for the YouTube initiative.</p>
<p>We&#8217;ve spoken with many people who have tested ASR solutions.  Usually, if they are talking to us, they weren&#8217;t satisfied!  The truth of the matter is that ASR will be good enough for some people, and it won&#8217;t be good enough for others.  80% accuracy (at its best and in studio quality recording conditions) leaves a lot to be desired.  In fact, Google even admits that <a href="http://googleblog.blogspot.com/2009/11/automatic-captions-in-youtube.html">results can be somewhat amusing</a> when they&#8217;re off.  On the search front, the most critical keywords tend to be the most unique and, therefore, least common to be recognized accurately.  Google&#8217;s announcement does not change that, it just makes an ASR solution easier to use and free to consume.  In many cases, Google has likely provided a medium for people who may never have put captions on their video with the ability to do so with very little effort.  Google has also made the search benefits of captions glaringly obvious.</p>
<p>Ultimately, the organizations that require (or believe in) high quality output for captions and search will be willing to pay for cleaned up text.  There are significant benefits to the high-quality approach, whether it be accurate search results or truly legible transcripts.  Branding is also a critical issue for many organizations who add a text component to their video offering.</p>
<p>We at 3Play Media will continue building high quality solutions that make multimedia more accessible for everyone.  More people than ever are aware of the benefits of captions and time-synchronized transcripts now.  We have some new product launches on the way that will build off these very benefits, and we can&#8217;t wait to show the world how their online video experience can be changed forever.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[Closed Captioning for YouTube Brought to You by Google]]></title>
<link>http://nohappymedium.com/2009/11/20/closed-captioning-for-youtube-brought-to-you-by-google/</link>
<pubDate>Fri, 20 Nov 2009 17:00:37 +0000</pubDate>
<dc:creator>The Unhappy Mediator</dc:creator>
<guid>http://nohappymedium.com/2009/11/20/closed-captioning-for-youtube-brought-to-you-by-google/</guid>
<description><![CDATA[The New York Times today reports that In the first major step toward making millions of videos on Yo]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p><a href="http://nohappymedium.wordpress.com/files/2009/11/youtube-grab.jpg"><img class="aligncenter size-medium wp-image-1330" title="youtube grab" src="http://nohappymedium.wordpress.com/files/2009/11/youtube-grab.jpg?w=300" alt="" width="300" height="144" /></a>The New York Times today <a href="http://www.nytimes.com/2009/11/20/technology/internet/20google.html?th&#38;emc=th" target="_blank">reports</a> that</p>
<blockquote><p>In the first major step toward making millions of videos on YouTube accessible to deaf and hearing-impaired people, <a title="More information about Google Inc" href="http://topics.nytimes.com/top/news/business/companies/google_inc/index.html?inline=nyt-org">Google</a> unveiled new technologies on Thursday that will automatically bring text captions to many videos on the site.</p></blockquote>
<p>According to the article, Google&#8217;s speech recognition technology, already used in applications such as its voice-to-text phone message service, initially will be applied to largely educational video channels on the YouTube, such as PBS, National Geographic and university stations. Another version of it will be available to regular users, giving them the option of having YouTube caption their videos for them, auto-generating transcripts from the audio.</p>
<p>While aimed at making online videos more accessible to the aurally-challenged, the technology will also be a revelation for the rest of us who want to bask ever more thoroughly in the glow of humanity&#8217;s radiant stupidity. Did she just say, &#8220;<a href="http://www.youtube.com/watch?v=nda_OSWeyn8" target="_blank">could be a crack head that got hold to the wrong stuff</a>”? Let&#8217;s go to the transcript for confirmation. (Yes. I am getting word that, yes, that is what she said.)</p>
<p>Furthermore, imagine the far out search implications. Forget hoping someone&#8217;s typed out and time-logged choice quotes from <a href="http://www.hulu.com/watch/108356/family-guy-quagmires-baby#s-p1-so-i0" target="_blank">the latest Family Guy</a> (06:24: &#8220;I did some poos, I did some poos, I didn&#8217;t  mean to.&#8221;), or straining to recall in <a href="http://www.youtube.com/watch?v=N8KnA30Pv6c&#38;NR=1" target="_blank">which chapter</a> R. Kelly sings &#8220;I thought your name was Mary/ That&#8217;s what you said at the party/ Man, this is getting scary/ I&#8217;m gonna shoot somebody.&#8221; Uploaded videos will be automatically tagged with text files, unleashing a deluge of previously untap-able search reserves. It&#8217;s the web&#8217;s next logical step, really.</p>
<p>And you thought you spent a lot of time on the computer now.</p>
</div>]]></content:encoded>
</item>
<item>
<title><![CDATA[More buttons for Firefox and Dragon]]></title>
<link>http://hand2mouth.wordpress.com/2009/11/18/more-buttons-for-firefox-and-dragon/</link>
<pubDate>Wed, 18 Nov 2009 15:38:46 +0000</pubDate>
<dc:creator>hand2mouth</dc:creator>
<guid>http://hand2mouth.wordpress.com/2009/11/18/more-buttons-for-firefox-and-dragon/</guid>
<description><![CDATA[I&#8217;ve mentioned the Custom Toolbar Buttons for using Firefox and Thunderbird with Dragon before]]></description>
<content:encoded><![CDATA[<div class='snap_preview'><p>I&#8217;ve mentioned the <a title="Custom Toolbar Buttons" href="http://codefisher.org/toolbar_button/toolbar_button_maker">Custom Toolbar Buttons</a> for using Firefox and Thunderbird with Dragon before, but am plugging it again because there are some navigation buttons I didn&#8217;t notice. These buttons would be especially useful for those who don&#8217;t use add-on command programs, but they may also simply reduce the syllables of an existing command if that&#8217;s important to you.</p>
<p>1. Find Again and Find Previous. This might be helpful for those using typeaheadfind.linksonly rather than Mouseless Browsing. If Firefox focuses on a word with multiple instances, saying &#8220;find again/previous&#8221; is more fluid than &#8220;press Control G&#8221; or &#8220;press Control Shift G.&#8221; However, it will be somewhat slower than the keypress command or a Vocola &#8220;find again&#8221; command since it uses cursor movement instead of keystrokes.</p>
<p>2. Close Other Tabs. Does exactly what it says, leaving only your current tab.</p>
<p>3. Snap Back. Goes to the first site you visited in the current tab. Useful depending on your browsing habits.</p>
</div>]]></content:encoded>
</item>

</channel>
</rss>
