<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Okko in Speech &#187; search engines</title>
	<atom:link href="http://www.okkoblog.com/tag/search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.okkoblog.com</link>
	<description>Working with speech and language technology</description>
	<lastBuildDate>Thu, 29 Sep 2011 12:37:20 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>A More Optimistic Outlook on the Future of Speech</title>
		<link>http://www.okkoblog.com/2010/06/30/a-more-optimistic-outlook-on-the-future-of-speech/</link>
		<comments>http://www.okkoblog.com/2010/06/30/a-more-optimistic-outlook-on-the-future-of-speech/#comments</comments>
		<pubDate>Wed, 30 Jun 2010 09:47:04 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[Siri]]></category>
		<category><![CDATA[usability]]></category>

		<guid isPermaLink="false">http://www.okkoblog.com/?p=187</guid>
		<description><![CDATA[The speech application industry got some critical press in recent months (here are some spirited responses, respectively.) All the more refreshing to come across this New York Times article presenting current work in speech and artificial intelligence. The article highlights broadly what kind of AI applications have moved into the mainstream (or have potential to [...]]]></description>
			<content:encoded><![CDATA[<p>The speech application industry got some <a href="http://robertfortner.posterous.com/the-unrecognized-death-of-speech-recognition" target="_blank">critical</a> <a href="http://www.signalprocessingsociety.org/technical-committees/list/sl-tc/spl-nl/2010-04/suendermann/">press</a> in recent months (here are some <a href="http://robertopieraccini.blogspot.com/2010/05/un-rest-in-peas-unrecognized-life-of.html">spirited</a> <a href="http://languagelog.ldc.upenn.edu/nll/?p=2275">responses</a>, respectively.)</p>
<p>All the more refreshing to come across this New York Times <a href="http://www.nytimes.com/2010/06/25/science/25voice.html">article</a> presenting <a href="http://research.microsoft.com/en-us/um/people/horvitz/">current</a> <a href="http://siri.com/">work</a> in speech and artificial intelligence. The article highlights broadly what kind of AI applications have moved into the mainstream (or have potential to do so). Speech and natural language understanding, the article claims, have gone furthest.</p>
<p>One thing that is generalizable from both criticisms above is that development of speech-enabled applications has stagnated, in various ways<sup>1</sup>. The underlying technology – speech recognition (ASR) – has gone as far as it can. Application designers and developers haven&#8217;t adopted. Dictation has learned to understand doctors and lawyers better, but still struggles with conversational speech.</p>
<p>This point may have to be conceded. In terms of commercial applications however, especially speech-enabled voice (IVR) systems, the root cause for stagnation is not necessarily a failure of AI, rather than a maturing of standards and best-practices. Fulfilling expectations that voice applications, much like websites, behave according to certain rules is much to the advantage of the millions who interact with such systems every day.</p>
<p>What I walk away with from the generalized critical, as well as the Times&#8217; optimistic perspective is that, short of a revolution in underlying technologies (which hardly anyone expects), filling practical, everyday niches is where things can still move forward for speech and language processing.  These niches have certainly not been fully uncovered.</p>
<p>Thoughts?</p>
<hr /><sup>1</sup> Roughly summarized, Robert Fostner: &#8220;development in speech technology has flat-lined since 2001&#8243;; David Suendermann: &#8220;(statistical) engineering methods are more efficient than traditional symbolic linguistic approaches to language processing.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2010/06/30/a-more-optimistic-outlook-on-the-future-of-speech/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>GOOG: We need more data</title>
		<link>http://www.okkoblog.com/2008/01/03/goog-we-need-more-data/</link>
		<comments>http://www.okkoblog.com/2008/01/03/goog-we-need-more-data/#comments</comments>
		<pubDate>Thu, 03 Jan 2008 08:42:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[Brands]]></category>
		<category><![CDATA[Services]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Loquendo]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[Telisma]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=34</guid>
		<description><![CDATA[addthis_url = location.href; addthis_title = document.title; addthis_pub = 'okkobuss'; The old maxim &#8220;I need more data&#8221; should be familiar to anyone who has ever tried to wrestle with language technology issues, attempted speech application tuning or delved into any statistical approach to an AI-related problem. Google moved into the speech world last year with GOOG-411, [...]]]></description>
			<content:encoded><![CDATA[<p><!-- AddThis Bookmark Button BEGIN --><script type="text/javascript"><br />  addthis_url    = location.href;   <br />  addthis_title  = document.title;  <br />  addthis_pub    = 'okkobuss';     <br /></script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12"></script>The old maxim &#8220;I need more data&#8221; should be familiar to anyone who has ever tried to wrestle with language technology issues, attempted speech application tuning or delved into any statistical approach to an AI-related problem.   Google <a href="http://www.google.com/goog411/">moved into the speech world</a> last year with GOOG-411, a speech recognition driven directory assistance application (you say what you are looking for and where, it returns suitable businesses and connects you to the one you want or sends you details in an SMS).<br />Like all (well, most) other Google services, GOOG-411 is free for the end-user.  As such, the basic business model (collect data, turn data into cash) applies.  This was <a href="http://www.infoworld.com/article/07/10/23/Google-wants-your-phonemes_3.html">recently confirmed</a>  in interview by Marissa Mayer, Google&#8217;s VP <span class="mdTitleGen">of Search Products and User Experience:</span><br />
<blockquote></blockquote>
<p><span class="artText"><br />
<blockquote><span style="font-size:85%;">Whether or not free-411 is a profitable business unto itself is yet to be seen. I myself am somewhat skeptical. The reason we really did it is because we need to build a great speech-to-text model &#8230; that we can use for all kinds of different things, including video search.</span></p></blockquote>
<p>Google thus couples statistical AI and its general data-driven approach to everything in a novel way.  In doing so, Google may find itself in a catch-up race with the ilk of <a href="http://www.nuance.com/">Nuance</a>, <a href="http://www.loquendo.com/">Loquendo</a> <a href="http://www-306.ibm.com/software/pervasive/voice_server/ivrgateway.html">IBM</a>, or <a href="http://www.telisma.com/">Telisma</a>, whose stronghold on speech recognition technology comes, in part, from having aggregated speech and language databases through data collection during professional services projects.<br /></span><span class="artText">What&#8217;s new in Google&#8217;s approach, however, is the convergence of the dual role that data plays in AI and in the overall service-driven business model.  Google will presumably not be content to bootstrap a pattern matching engine to sell licenses like the technology companies above.  More interestingly to follow will be the range of services Google can spin using this technology (context sensitive video advertising, audio indexing, IVR hosting) which are more befitting of their overall company strategy.</span><span class="artText"><br />Unsurprisingly, Mayer goes on to claim that Google isn&#8217;t working on ways out of the world of brute-force data-driven algorithms:<br /></span><span class="artText"></span><br />
<blockquote><span style="font-size:85%;"><span class="artText">People should be able to ask questions, and we should understand their meaning, or they should be able to talk                      about things at a conceptual level. &#8230; </span><span class="artText">A lot of people will turn to things like the semantic Web as a possible answer to that. But what we&#8217;re seeing actually is that with a lot of data, you ultimately see things that seem intelligent even though they&#8217;re done through brute force.</span></span></p></blockquote>
<p><span class="artText"></span><span class="artText">User privacy advocates may also have a thought or two on this new dimension of data collection, as Google is beginning to loose the &#8220;conventionally trustworthy&#8221; image it held amongst many over the past years.  Fortunately the ways in which speech data is commonly used to train pattern matching models involves very little in the ways of privacy infringement.</span><span class="artText"><br />Happy data collecting!<br /></span><!-- AddThis Bookmark Button END --></p>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2008/01/03/goog-we-need-more-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Daily News Redux&#8230;</title>
		<link>http://www.okkoblog.com/2007/04/18/daily-news-redux-13/</link>
		<comments>http://www.okkoblog.com/2007/04/18/daily-news-redux-13/#comments</comments>
		<pubDate>Wed, 18 Apr 2007 09:25:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[Fonix]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[TTS]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=18</guid>
		<description><![CDATA[On the WWW today: Some recent news about Google and MS voice search offensive. Epson chip to feature Fonix DECtalk speech synthesis.]]></description>
			<content:encoded><![CDATA[<p>On the WWW today:
<ul>
<li><a href="http://www.rcrnews.com/apps/pbcs.dll/article?AID=/20070409/FREE/70409008/1012">Some</a> <a href="http://arstechnica.com/news.ars/post/20070416-google-microsoft-look-beyond-mobile-search-for-voice-interaction.html">recent</a> <a href="http://www.tech2.com/india/news/general/ms-deal-for-tellme-gets-u.s.-antitrust-ok/5147/0">news</a> about Google and MS voice search offensive.</li>
<li><a href="http://www.tmcnet.com/usubmit/2007/04/17/2513256.htm">Epson chip</a> to feature Fonix DECtalk speech synthesis.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/04/18/daily-news-redux-13/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Daily News Redux&#8230;</title>
		<link>http://www.okkoblog.com/2007/04/17/daily-news-redux-12/</link>
		<comments>http://www.okkoblog.com/2007/04/17/daily-news-redux-12/#comments</comments>
		<pubDate>Tue, 17 Apr 2007 06:04:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[web3.0]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=17</guid>
		<description><![CDATA[On the WWW today: Article providing roundup of some semantic web solutions. InQuira executive to discuss benefits of NLP for search engine optimization.]]></description>
			<content:encoded><![CDATA[<p>On the WWW today:
<ul>
<li><a href="http://www.intelligententerprise.com/channels/applications/showArticle.jhtml?articleID=199001226">Article</a> providing roundup of some semantic web solutions.</li>
<li>InQuira executive to discuss benefits of <a href="http://www.sys-con.com/read/361678.htm">NLP for search engine optimization</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/04/17/daily-news-redux-12/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Daily News Redux&#8230;</title>
		<link>http://www.okkoblog.com/2007/04/16/daily-news-redux-11/</link>
		<comments>http://www.okkoblog.com/2007/04/16/daily-news-redux-11/#comments</comments>
		<pubDate>Mon, 16 Apr 2007 08:17:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=16</guid>
		<description><![CDATA[Today on the WWW: Software Ali Baba parses medical abstracts, generates visual network or terminology using natural language processing. A redux of latent semantic indexing (LSI) for use in search engines.]]></description>
			<content:encoded><![CDATA[<p>Today on the WWW:
<ul>
<li>Software <a href="http://mndoci.com/blog/2007/04/14/ali-baba-mining-pubmed-with-natural-language-processing/">Ali Baba</a> parses medical abstracts, generates visual network or terminology using natural language processing.</li>
<li>A <a href="http://www.for-the-record.biz/for-the-record/latent-semantic-indexing-and-search-engines-optimimization-seo">redux </a>of latent semantic indexing (LSI) for use in search engines.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/04/16/daily-news-redux-11/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Web 3.0 and Natural Language Processing</title>
		<link>http://www.okkoblog.com/2007/04/09/web-3-0-and-natural-language-processing/</link>
		<comments>http://www.okkoblog.com/2007/04/09/web-3-0-and-natural-language-processing/#comments</comments>
		<pubDate>Mon, 09 Apr 2007 06:53:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[Brands]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Services]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[web3.0]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=13</guid>
		<description><![CDATA[Web 3.0 is getting some buzz in the blogosphere. Like Web 2.0, it begs the question that PCMag.com recently ran by its readers: what is it? However this time around things seems a bit easier. Web 2.0 seems to be happy with being vaguely defined (delimited may be a better term) and equally a social [...]]]></description>
			<content:encoded><![CDATA[<p>Web 3.0 is getting <a href="http://scobleizer.com/2007/04/05/i-finally-get-semantic-web/">some</a> <a href="http://yihongs-research.blogspot.com/2007/04/semantic-web-is-closer-to-be-real-isnt.html">buzz</a> <a href="http://www.pelicancrossing.net/netwars/2007/04/whats_in_a_20.html">in</a> <a href="http://billboushka.blogspot.com/2007/04/web-30-is-getting-attention.html">the</a> blogosphere.  Like Web 2.0, it begs the question that PCMag.com <a href="http://www.pcmag.com/article2/0,1759,2102852,00.asp">recently</a> ran by its readers:  what is it?  However this time around things seems a bit easier.</p>
<p>Web 2.0 seems to be happy with being vaguely defined (delimited may be a better term) and equally a social and a technological movement.  Web 3.0 clearly hovers over the idea of the &#8220;Semantic Web&#8221;, a term coined by <a href="http://de.wikipedia.org/wiki/Berners-Lee">Tim Berners-Lee</a>, in which richly <a href="http://de.wikipedia.org/wiki/Resource_Description_Framework">mark-upped</a> hypertext and data allow for novel more meaningful human-machine and machine-machine communication.  <a href="http://www.radarnetworks.com/">Radar Networks</a> (currently in stealth mode) claim to be driving some interesting developments in this direction and are followed closely by those interested.</p>
<p>This has already raised some questions: will content be expensive hand labor or machine boot-strappable, what new privacy policies do we have to live with, how does one separate <a href="http://www.elainevigneault.com/2007/04/08/semantic-web-and-the-future-of-the-internet.html">style and content</a>, what are <a href="http://mukhlason.multiply.com/reviews/item/25">alternatives to RDF</a>.</p>
<p>Sadly, there&#8217;s very little inspiring out there about potential applications.</p>
<p>My question (though not uniquely mine) to add to this:  What role will natural language processing play in this (i.e. how &#8220;semantic&#8221; is this talk of Semantics)?  Semantic content in RDF appears to be little more than a means for one machine to tell another who authored a particular book or what are the postal codes in the greater Boston area.  Semantics to me is as much about intentions (&#8220;Why is web-service A dispensing such information?&#8221;) and interpreting such  information for the purposes of action (&#8220;What can web-service B &#8211; or my browser or I &#8211; do with it?&#8221;).</p>
<p>Perhaps this misses the mark and semantic really isn&#8217;t about natural language.  But there is a weaker, more real form of this &#8220;language and technology&#8221; concern: Insofar as semantics <span style="font-style: italic;">is</span> just information, can it be bootstrapped by a machine (perhaps even linguistically informed rather than statistically)?</p>
<p>Thoughts?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/04/09/web-3-0-and-natural-language-processing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Daily News Redux&#8230;</title>
		<link>http://www.okkoblog.com/2007/04/03/daily-news-redux-7/</link>
		<comments>http://www.okkoblog.com/2007/04/03/daily-news-redux-7/#comments</comments>
		<pubDate>Tue, 03 Apr 2007 13:40:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[Cognition]]></category>
		<category><![CDATA[machine translation]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[TTS]]></category>
		<category><![CDATA[Wizzard]]></category>
		<category><![CDATA[ZoomInfo]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=11</guid>
		<description><![CDATA[Daily News Redux: Some electronic dictionary news. Coolsoft Coolexon electronic dictionary features 60 languages, including translation and TTS accessibility features, is only 3.3MB large. Also in the news, Ultralingua and Collins partner to release electronic dictionaries. Some business automation success stories using speech recognition. Some more semantic search engine-related news featuring ZoomInfo and Cognition. Arab [...]]]></description>
			<content:encoded><![CDATA[<p>Daily News Redux:
<ul>
<li>Some electronic dictionary news.  Coolsoft <a href="http://http//www.prweb.com/releases/2007/04/prweb514946.htm">Coolexon</a> electronic dictionary features 60 languages, including translation and TTS accessibility features, is only 3.3MB large.  Also in the news, <a href="http://www.emediawire.com/releases/2007/4/emw515927.htm">Ultralingua and Collins </a>partner to release electronic dictionaries.</li>
<li>Some business automation <a href="http://www.crmbuyer.com/story/TSU5kk8DwYiu88/New-Voice-Response-System-Aims-to-Ease-Customer-Frustration.xhtml">success</a> <a href="http://www.tmcnet.com/channels/speech-applications-and-solutions/articles/6077-nuance-ctg-automate-warehouse-management-using-speech.htm">stories</a> using speech recognition.</li>
<li>Some more semantic search engine-related news featuring <a href="http://press-releases.techwhack.com/8694/zoominfo/">ZoomInfo </a>and <a href="http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=35805">Cognition</a>.</li>
<li><a href="http://www.albawaba.com/en/countries/UAE/211506">Arab language speech recognition</a> market growing.</li>
<li>Wizzard announces <a href="http://home.businesswire.com/portal/site/google/index.jsp?ndmViewId=news_view&#038;newsId=20070402005757&amp;newsLang=en">2006 financial results</a>.</li>
</ul>
<p>Questions of the day:
<ul>
<li>Web X.0 IEEE <a href="http://linguistlist.org/issues/18/18-969.html#2">workshop</a>.  What role will NLP play?</li>
<li> Are <a href="http://www.pocket-lint.co.uk/news/news.phtml/7217/8241/magellan-maestro-4050-gps-receiver.phtml">GPS</a> <a href="http://news.thomasnet.com/fullstory/513155/3287">navigation</a> systems <a href="http://news.trendaz.com/cgi-bin/readnews2.pl?newsId=904584&amp;lang=EN">driving</a> the TTS market (links randomly chosen from recent navigation system releases)?</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/04/03/daily-news-redux-7/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Daily News Redux&#8230;</title>
		<link>http://www.okkoblog.com/2007/04/03/daily-news-redux-6/</link>
		<comments>http://www.okkoblog.com/2007/04/03/daily-news-redux-6/#comments</comments>
		<pubDate>Tue, 03 Apr 2007 08:42:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[machine translation]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=10</guid>
		<description><![CDATA[On the WWW today: CallMiner announces Eureka product for call center speech analytics and QA. Envox CT Connect 7 VXML/CTI plattform now Avaya telephony compliant Some blogging about the role of symbolic vs brute-force statistics in articificial intelligence, NLP, Google&#8216;s machine translation vision.]]></description>
			<content:encoded><![CDATA[<p>On the WWW today:
<ul>
<li>CallMiner announces <a href="http://http//www.crmmarketplace.com/content/news/article.asp?DocID=%7BC99548AF-2147-4114-9218-4996FE7F2457%7D&#038;Bucket=Current+Headlines&amp;VNETCOOKIE=NO">Eureka product</a> for call center speech analytics and QA.</li>
<li><a href="http://www.tmcnet.com/news/2007/04/02/2458671.htm">Envox CT Connect 7</a> VXML/CTI plattform now Avaya telephony compliant</li>
<li><a href="http://earningmyturns.blogspot.com/2007/04/why-google-ai-vision-is-wrong.html">Some</a> <a href="http://datamining.typepad.com/data_mining/2007/04/ai_language_and.html">blogging</a> about the role of symbolic vs brute-force statistics in articificial intelligence, NLP, <a href="http://today.reuters.com/news/articlenews.aspx?type=technologyNews&amp;storyid=2007-03-28T132638Z_01_N19218815_RTRUKOC_0_US-GOOGLE-TRANSLATE.xml">Google</a>&#8216;s machine translation vision.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/04/03/daily-news-redux-6/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Daily News Redux&#8230;</title>
		<link>http://www.okkoblog.com/2007/03/29/daily-news-redux/</link>
		<comments>http://www.okkoblog.com/2007/03/29/daily-news-redux/#comments</comments>
		<pubDate>Thu, 29 Mar 2007 09:57:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[machine translation]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[TTS]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=5</guid>
		<description><![CDATA[On the WWW today: Article about Google statistical machine translation algorithms, mentions success in Arabic (cf. NIST benchmarks finding Google&#8217;s Arabic/Chinese->English translation most accuracte.) Teragram MyGAD.com search engine launch, employing NLP for improved information retrieval. In related news, a list of top-100 search engines, including more NLP and some audio searches. Article about predicive software [...]]]></description>
			<content:encoded><![CDATA[<p>On the WWW today:
<ul>
<li><a href="http://today.reuters.com/news/articlenews.aspx?type=technologyNews&#038;storyid=2007-03-28T132638Z_01_N19218815_RTRUKOC_0_US-GOOGLE-TRANSLATE.xml">Article </a>about Google statistical machine translation algorithms, mentions success in Arabic (cf. NIST <a href="http://www.poynter.org/article_feedback/article_feedback_list.asp?user=&amp;id=120155">benchmarks </a>finding Google&#8217;s Arabic/Chinese->English translation most accuracte.)</li>
<li>Teragram MyGAD.com search engine <a href="http://sev.prnewswire.com/multimedia-online-internet/20070327/CLTU04927032007-1.html">launch</a>, employing NLP for improved information retrieval.  In  related news, a <a href="http://www.readwriteweb.com/archives/top_100_alternative_search_engines_mar07.php">list </a>of top-100 search engines, including more NLP and some audio searches.</li>
<li>Article about predicive software application for the tourism industry, calls for NLP and other AI techniques such as neural networks.</li>
<li>Nuance unveils <a href="http://www.tmcnet.com/usubmit/2007/03/28/2448688.htm">voice music search</a> application for mobile ASR applications.  In related news, Nuance ships <a href="http://www.tmcnet.com/channels/speech-applications-and-solutions/articles/5974-nuance-improves-speech-output-mobile-applications.htm">improved mobile TTS</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/03/29/daily-news-redux/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Three Observations about Recent Language Technology News</title>
		<link>http://www.okkoblog.com/2007/03/28/three-observations-about-recent-language-technology-news/</link>
		<comments>http://www.okkoblog.com/2007/03/28/three-observations-about-recent-language-technology-news/#comments</comments>
		<pubDate>Wed, 28 Mar 2007 11:50:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[Brands]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Services]]></category>
		<category><![CDATA[Vendors]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[multi-modal]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[open-source]]></category>
		<category><![CDATA[Powerset]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[TTS]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=4</guid>
		<description><![CDATA[To start us off, recent experience has shown three things: Speech (i.e. voice) related news is TTS-dominated, less so by ASR. The company featured most frequently in the news is Nuance. The talk of semantic search engines seems to dominate the NLP news. The success of TTS is largely due to requirements set by mobile [...]]]></description>
			<content:encoded><![CDATA[<p>To start us off, <a href="http://okkobuss.googlepages.com/">recent experience</a> has shown three things:
<ol>
<li>Speech (i.e. voice) related news is TTS-dominated, less so by ASR.</li>
<li>The company featured most frequently in the news is Nuance.</li>
<li>The talk of semantic search engines seems to dominate the NLP news.</li>
</ol>
<p> The success of TTS is largely due to requirements set by mobile and in-car technologies, especially GPS and communications.  The future of ASR in the other hand seems to depend on the dictation market (especially in the healthcare sector) and a growing relevance of network ASR (driven by advancing VoIP, impact of multi-modal applications).</p>
<p>Nuance&#8217;s continued position will depend on the role of &#8220;super players&#8221; IBM and Microsoft and to a lesser degree the role of open-source initiatives, especially on the network/telephony side.</p>
<p>Semantic search engines recently got some media hype with &#8220;Google-Killer&#8221; Powerset, a PARC offspring.  While in its infancy, some believe this development towards semantic web will usher in a Web3.0 revolution.  Of course, soem others believe this has already begun, while yet more just wanna see what happens with all this.</p>
<p>Let&#8217;s see how these trends develop.  Especially multi-modality and semantic searches will be issues to follow closely.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/03/28/three-observations-about-recent-language-technology-news/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

