<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Okko in Speech &#187; Loquendo</title>
	<atom:link href="http://www.okkoblog.com/tag/loquendo/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.okkoblog.com</link>
	<description>Working with speech and language technology</description>
	<lastBuildDate>Tue, 20 Jul 2010 08:09:21 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>GOOG: We need more data</title>
		<link>http://www.okkoblog.com/2008/01/03/goog-we-need-more-data/</link>
		<comments>http://www.okkoblog.com/2008/01/03/goog-we-need-more-data/#comments</comments>
		<pubDate>Thu, 03 Jan 2008 08:42:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[Brands]]></category>
		<category><![CDATA[Services]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Loquendo]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[Telisma]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=34</guid>
		<description><![CDATA[addthis_url = location.href; addthis_title = document.title; addthis_pub = 'okkobuss'; The old maxim &#8220;I need more data&#8221; should be familiar to anyone who has ever tried to wrestle with language technology issues, attempted speech application tuning or delved into any statistical approach to an AI-related problem. Google moved into the speech world last year with GOOG-411, [...]]]></description>
			<content:encoded><![CDATA[<p><!-- AddThis Bookmark Button BEGIN --><script type="text/javascript"><br />  addthis_url    = location.href;   <br />  addthis_title  = document.title;  <br />  addthis_pub    = 'okkobuss';     <br /></script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12"></script>The old maxim &#8220;I need more data&#8221; should be familiar to anyone who has ever tried to wrestle with language technology issues, attempted speech application tuning or delved into any statistical approach to an AI-related problem.   Google <a href="http://www.google.com/goog411/">moved into the speech world</a> last year with GOOG-411, a speech recognition driven directory assistance application (you say what you are looking for and where, it returns suitable businesses and connects you to the one you want or sends you details in an SMS).<br />Like all (well, most) other Google services, GOOG-411 is free for the end-user.  As such, the basic business model (collect data, turn data into cash) applies.  This was <a href="http://www.infoworld.com/article/07/10/23/Google-wants-your-phonemes_3.html">recently confirmed</a>  in interview by Marissa Mayer, Google&#8217;s VP <span class="mdTitleGen">of Search Products and User Experience:</span><br />
<blockquote></blockquote>
<p><span class="artText"><br />
<blockquote><span style="font-size:85%;">Whether or not free-411 is a profitable business unto itself is yet to be seen. I myself am somewhat skeptical. The reason we really did it is because we need to build a great speech-to-text model &#8230; that we can use for all kinds of different things, including video search.</span></p></blockquote>
<p>Google thus couples statistical AI and its general data-driven approach to everything in a novel way.  In doing so, Google may find itself in a catch-up race with the ilk of <a href="http://www.nuance.com/">Nuance</a>, <a href="http://www.loquendo.com/">Loquendo</a> <a href="http://www-306.ibm.com/software/pervasive/voice_server/ivrgateway.html">IBM</a>, or <a href="http://www.telisma.com/">Telisma</a>, whose stronghold on speech recognition technology comes, in part, from having aggregated speech and language databases through data collection during professional services projects.<br /></span><span class="artText">What&#8217;s new in Google&#8217;s approach, however, is the convergence of the dual role that data plays in AI and in the overall service-driven business model.  Google will presumably not be content to bootstrap a pattern matching engine to sell licenses like the technology companies above.  More interestingly to follow will be the range of services Google can spin using this technology (context sensitive video advertising, audio indexing, IVR hosting) which are more befitting of their overall company strategy.</span><span class="artText"><br />Unsurprisingly, Mayer goes on to claim that Google isn&#8217;t working on ways out of the world of brute-force data-driven algorithms:<br /></span><span class="artText"></span><br />
<blockquote><span style="font-size:85%;"><span class="artText">People should be able to ask questions, and we should understand their meaning, or they should be able to talk                      about things at a conceptual level. &#8230; </span><span class="artText">A lot of people will turn to things like the semantic Web as a possible answer to that. But what we&#8217;re seeing actually is that with a lot of data, you ultimately see things that seem intelligent even though they&#8217;re done through brute force.</span></span></p></blockquote>
<p><span class="artText"></span><span class="artText">User privacy advocates may also have a thought or two on this new dimension of data collection, as Google is beginning to loose the &#8220;conventionally trustworthy&#8221; image it held amongst many over the past years.  Fortunately the ways in which speech data is commonly used to train pattern matching models involves very little in the ways of privacy infringement.</span><span class="artText"><br />Happy data collecting!<br /></span><!-- AddThis Bookmark Button END --></p>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2008/01/03/goog-we-need-more-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>News Redux &amp; Building VoiceGlue</title>
		<link>http://www.okkoblog.com/2007/12/04/news-redux-building-voiceglue/</link>
		<comments>http://www.okkoblog.com/2007/12/04/news-redux-building-voiceglue/#comments</comments>
		<pubDate>Tue, 04 Dec 2007 07:12:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[How To]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[across Systems]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[Language Weaver]]></category>
		<category><![CDATA[Loquendo]]></category>
		<category><![CDATA[machine translation]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[open-source]]></category>
		<category><![CDATA[Persay]]></category>
		<category><![CDATA[TTS]]></category>
		<category><![CDATA[Viecore]]></category>
		<category><![CDATA[VoiceGlue]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=33</guid>
		<description><![CDATA[I stumbled across some &#8220;traditional&#8221; news bits this week for speech and language technologies, representing most of the major and a few interesting minor market players . Yahoo is offering some kind of NLP-driven structured search for e-commerce solutions starting next year. A new bundled automatic translation software with automatic learning capabilities was announced by [...]]]></description>
			<content:encoded><![CDATA[<p>I stumbled across some &#8220;traditional&#8221; news bits this week for speech and language technologies, representing most of the major and a few interesting minor market players .  <a href="http://www.yahoo.com/">Yahoo</a> is offering some kind of NLP-driven <a href="http://www.washingtonpost.com/wp-dyn/content/article/2007/11/27/AR2007112700976.html">structured search</a> for e-commerce solutions starting next year.  A <a href="http://startupbeat.com/sub/2007/11/language_weaver_and_across_systems_announce_bundle_id2116.html">new bundled automatic translation software</a> with automatic learning capabilities was announced by <a href="http://www.across.net/en/index.html">across Systems GmbH</a> and <a href="http://www.languageweaver.com/home.asp">Language Weaver</a>.  <a href="http://www.loquendo.com/">Loquendo</a> is sponsoring a <a href="http://www.tmcnet.com/channels/speech-recognition/articles/15636-loquendo-sponsors-next-gen-navigation-event.htm">speech-for-in-car-navigation industry event</a>.  <a href="http://www.persay.com/">Persay</a>, maker of voice authentication software, is shipping solutions securing Planet Payment&#8217;s <a href="http://www.reuters.com/article/pressRelease/idUS06868+29-Nov-2007+PRN20071129">voice-enabled payment processing</a>.  Lastly <a href="http://www.nuance.com/">Nuance</a>, continuing its <a href="http://www.tmcnet.com/channels/speech-recognition/articles/15636-loquendo-sponsors-next-gen-navigation-event.htm">acquisition spree</a>, buys <a href="http://www.viecore.com/home.asp">Viecore</a>, a contact-center integration consulting company, indicating a clear focus on strengthening its traditional speech and telephony market position.</p>
<p>Recently I stumbled across and <a href="http://okkobuss.blogspot.com/2007/11/back-in-saddle-with-msft-goog-and.html">blogged about</a> <a href="http://www.voiceglue.org/">VoiceGlue</a>, an integration of various GPL-licensed pieces of software, providing full IVR capabilities (including rudimentary speech synthesis but not recognition.)  Well, last night, together with <a href="http://www.christophbuente.de/">Christoph</a>, I finally had a stab at it myself.<br />Our test setup involved running Fedora 9 virtualized in Mac OS X.  Our Fedora installation was missing a few pieces of software beyond the indicated prerequisites, but after about an hour everything was under way.<br />The trickiest bit proved to be building various modules required for the XML parser (I presume needed later for VoiceGlue-customized DTMF grammar parser.)  For some reason CPAN&#8217;s console kept conking out on us (claiming inexplicably missing/unbuildable prereqs), so after wrestling with that for some time, we decided to manually build all the modules ourself (hoorah, makefiles).<br />This worked like a charm, though we hit a snag with the Module::Build perl module, which required C_Support, which in turn required another perl module (ExtUtils-CBuilders), not mentioned in any documentation (scant across the board, though that&#8217;s half the fun, isn&#8217;t it).<br />After that, the VoiceGlue installation completed swiftly and all services started running after a minimal bit of configuration.<br />Next week we&#8217;ll be back with some test calls and our first impressions.  In the meanwhile we&#8217;ll keep our eyes peeled for ASR integration (LumenVox/Sphinx), which will make this a truly valuable stab at open sourcing some of the most expensive carrier-grade technology out there.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/12/04/news-redux-building-voiceglue/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Daily News Redux&#8230;</title>
		<link>http://www.okkoblog.com/2007/04/11/daily-news-redux-10/</link>
		<comments>http://www.okkoblog.com/2007/04/11/daily-news-redux-10/#comments</comments>
		<pubDate>Wed, 11 Apr 2007 09:05:00 +0000</pubDate>
		<dc:creator>Okko</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[ASR]]></category>
		<category><![CDATA[embedded]]></category>
		<category><![CDATA[Fonix]]></category>
		<category><![CDATA[Loquendo]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[TTS]]></category>

		<guid isPermaLink="false">http://okkoblog.com/blog/?p=15</guid>
		<description><![CDATA[Today on the WWW: Nuance announces voice search framework, based on directory assistance solutions portfolio. Epson releases speech synthesis chip, powered by Fonix engine, allows mixed output of synthesis and pre-recorded speech. Loquendo text-to-speech gives speech to Activa Multimedia iVAC avatars.]]></description>
			<content:encoded><![CDATA[<p>Today on the WWW:
<ul>
<li>Nuance announces <a href="http://www.huliq.com/18055/nuance-unveils-nuance-voice-search-breakthrough-solutions">voice</a> <a href="http://home.businesswire.com/portal/site/google/index.jsp?ndmViewId=news_view&#038;newsId=20070410005403&amp;newsLang=en">search</a> framework, based on directory assistance solutions portfolio.</li>
<li><span class="blsp-spelling-error" id="SPELLING_ERROR_0">Epson</span> releases <a href="http://pittsburgh.dbusinessnews.com/shownews.php?newsid=114810&amp;type_news=latest">speech synthesis chip</a>, powered by <span class="blsp-spelling-error" id="SPELLING_ERROR_1">Fonix</span> engine, allows mixed output of synthesis and <span class="blsp-spelling-error" id="SPELLING_ERROR_2">pre</span>-recorded speech.</li>
<li><span class="blsp-spelling-error" id="SPELLING_ERROR_3">Loquendo</span> text-to-speech gives speech to <span class="blsp-spelling-error" id="SPELLING_ERROR_4">Activa</span> Multimedia <a href="http://www.tmcnet.com/channels/speech-recognition/articles/6222-loquendo-activa-multimedia-humanize-computer-interactions.htm"><span class="blsp-spelling-error" id="SPELLING_ERROR_5">iVAC</span> avatars</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.okkoblog.com/2007/04/11/daily-news-redux-10/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
