<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for Okko in Speech</title>
	<atom:link href="http://www.okkoblog.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.okkoblog.com</link>
	<description>Working with speech and language technology</description>
	<lastBuildDate>Tue, 20 Jul 2010 09:13:37 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>Comment on This Goes to Eleven by Okko</title>
		<link>http://www.okkoblog.com/2010/07/20/this-goes-to-eleven/comment-page-1/#comment-312</link>
		<dc:creator>Okko</dc:creator>
		<pubDate>Tue, 20 Jul 2010 09:13:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=195#comment-312</guid>
		<description>Thanks David!</description>
		<content:encoded><![CDATA[<p>Thanks David!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on A More Optimistic Outlook on the Future of Speech by nsh</title>
		<link>http://www.okkoblog.com/2010/06/30/a-more-optimistic-outlook-on-the-future-of-speech/comment-page-1/#comment-294</link>
		<dc:creator>nsh</dc:creator>
		<pubDate>Thu, 08 Jul 2010 23:51:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=187#comment-294</guid>
		<description>&gt; What I mean by niches for speech are interesting applications that don’t compete
&gt; with mouse, keyboard or touch screen for user attention.

Exactly, that&#039;s why I consider speech analytics, which runs in parallel with the user&#039;s usual activity and transparently listens to a call, talk or meeting, a more promising technology than IVR or dictation. I even started a poll about that on my blog, but surprisingly it shows that far more readers still think that dictation and command &amp; control are usable.

Another such domain is language learning.

&gt; I would love to hear where you think the next major breakthrough for ASR 
&gt; technology will come from.

Well, it should be another source of information. Not necessarily AI, since I still believe that planes shouldn&#039;t flap wings. It might be the WWW, in which case Google will do that faster than anyone else ;) I took this new source idea from this nice post

http://caterina.net/archive/001211.html</description>
		<content:encoded><![CDATA[<p>&gt; What I mean by niches for speech are interesting applications that don’t compete<br />
&gt; with mouse, keyboard or touch screen for user attention.</p>
<p>Exactly, that&#8217;s why I consider speech analytics, which runs in parallel with the user&#8217;s usual activity and transparently listens to a call, talk or meeting, a more promising technology than IVR or dictation. I even started a poll about that on my blog, but surprisingly it shows that far more readers still think that dictation and command &amp; control are usable.</p>
<p>Another such domain is language learning.</p>
<p>&gt; I would love to hear where you think the next major breakthrough for ASR<br />
&gt; technology will come from.</p>
<p>Well, it should be another source of information. Not necessarily AI, since I still believe that planes shouldn&#8217;t flap wings. It might be the WWW, in which case Google will do that faster than anyone else ;) I took this new source idea from this nice post</p>
<p><a href="http://caterina.net/archive/001211.html" rel="nofollow">http://caterina.net/archive/001211.html</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on A More Optimistic Outlook on the Future of Speech by Okko</title>
		<link>http://www.okkoblog.com/2010/06/30/a-more-optimistic-outlook-on-the-future-of-speech/comment-page-1/#comment-289</link>
		<dc:creator>Okko</dc:creator>
		<pubDate>Mon, 05 Jul 2010 19:05:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=187#comment-289</guid>
		<description>Agree - even good practices are the cause of the stagnation. We know IVR systems are awful to use compared to web pages (or flashy iPhone apps), but by giving their awfulness a certain pattern, we can work around technological limitations.

What I mean by niches for speech are interesting applications that don&#039;t compete with mouse, keyboard or touch screen for user attention. These are battles bound to be lost. Fancy voice interfaces will always come second to more immersive or efficient input methods.

I would love to hear where you think the next major breakthrough for ASR technology will come from.</description>
		<content:encoded><![CDATA[<p>Agree &#8211; even good practices are the cause of the stagnation. We know IVR systems are awful to use compared to web pages (or flashy iPhone apps), but by giving their awfulness a certain pattern, we can work around technological limitations.</p>
<p>What I mean by niches for speech are interesting applications that don&#8217;t compete with mouse, keyboard or touch screen for user attention. These are battles bound to be lost. Fancy voice interfaces will always come second to more immersive or efficient input methods.</p>
<p>I would love to hear where you think the next major breakthrough for ASR technology will come from.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on A More Optimistic Outlook on the Future of Speech by nsh</title>
		<link>http://www.okkoblog.com/2010/06/30/a-more-optimistic-outlook-on-the-future-of-speech/comment-page-1/#comment-279</link>
		<dc:creator>nsh</dc:creator>
		<pubDate>Thu, 01 Jul 2010 00:45:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=187#comment-279</guid>
		<description>I kind of disagree that it&#039;s just a practice issue. The whole experience of bringing up ASR products leads to the conclusion that the technology is not there yet. Users can&#039;t operate with a 90% success rate; most applications require 99.999%.

But I consider this stagnation a delay before a major breakthrough in the technology, so right now is a perfect time to start with ASR and catch the wave that will appear soon.</description>
		<content:encoded><![CDATA[<p>I kind of disagree that it&#8217;s just a practice issue. The whole experience of bringing up ASR products leads to the conclusion that the technology is not there yet. Users can&#8217;t operate with a 90% success rate; most applications require 99.999%.</p>
<p>But I consider this stagnation a delay before a major breakthrough in the technology, so right now is a perfect time to start with ASR and catch the wave that will appear soon.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Quick Voice Prompts with Google Translate TTS Service by Facebook Like Site</title>
		<link>http://www.okkoblog.com/2010/01/12/quick-voice-prompts-with-google-translate-tts-service/comment-page-1/#comment-233</link>
		<dc:creator>Facebook Like Site</dc:creator>
		<pubDate>Sun, 06 Jun 2010 01:09:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=149#comment-233</guid>
		<description>Nice post. Have u heard about the iPad hack? Kinda random but lol why not.</description>
		<content:encoded><![CDATA[<p>Nice post. Have u heard about the iPad hack? Kinda random but lol why not.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on News Redux &amp; Building VoiceGlue by forexs</title>
		<link>http://www.okkoblog.com/2007/12/04/news-redux-building-voiceglue/comment-page-1/#comment-217</link>
		<dc:creator>forexs</dc:creator>
		<pubDate>Thu, 27 May 2010 15:26:28 +0000</pubDate>
		<guid isPermaLink="false">http://okkoblog.com/blog/?p=33#comment-217</guid>
		<description>thanks a lot for this wonderful article</description>
		<content:encoded><![CDATA[<p>thanks a lot for this wonderful article</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on SpinVox, Voice-to-Text and Some Terminology by philippine tv online</title>
		<link>http://www.okkoblog.com/2010/01/18/spinvox-voice-to-text-and-some-terminology/comment-page-1/#comment-22</link>
		<dc:creator>philippine tv online</dc:creator>
		<pubDate>Wed, 17 Feb 2010 19:44:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=156#comment-22</guid>
		<description>I used OS X&#039;s &#039;text to mp3&#039; to do the voice in my YouTube video. So trust me guys, voice things work great, you should use them too! One problem I had was when it tried to pronounce some &#039;fake&#039; words/sounds like &#039;woosh&#039;, but with practice it&#039;s easy :-)</description>
		<content:encoded><![CDATA[<p>I used OS X&#8217;s &#8216;text to mp3&#8217; to do the voice in my YouTube video. So trust me guys, voice things work great, you should use them too! One problem I had was when it tried to pronounce some &#8216;fake&#8217; words/sounds like &#8216;woosh&#8217;, but with practice it&#8217;s easy :-)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on SpinVox, Voice-to-Text and Some Terminology by Okko</title>
		<link>http://www.okkoblog.com/2010/01/18/spinvox-voice-to-text-and-some-terminology/comment-page-1/#comment-19</link>
		<dc:creator>Okko</dc:creator>
		<pubDate>Sun, 31 Jan 2010 19:51:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=156#comment-19</guid>
		<description>Thanks for those links. That study slipped by me.

I&#039;m very curious to see where this is going. Some obvious questions are how accessible and pervasive voice transcription will become. Will there be a healthy developer base (voice technologies always suffer from having a small, somewhat esoteric one)? What about &quot;real&quot; web APIs, leveraging this stuff for mash-ups like smart ad placements in video, 3rd-party calendar plug-ins, etc.?

How to enable real international market penetration is also a big question. English-speaking markets appear to be reaching a good level of maturity. I&#039;ve &lt;a href=&quot;http://www.okkoblog.com/2008/05/05/internationalization-and-speech-technologies/&quot; rel=&quot;nofollow&quot;&gt;previously written&lt;/a&gt; about the fact that there is no long tail in speech and language technology development. The buy-in costs per market/language remain the same, regardless of market size. Capital per market, however, varies greatly. This could be a real show stopper.

Thoughts?</description>
		<content:encoded><![CDATA[<p>Thanks for those links. That study slipped by me.</p>
<p>I&#8217;m very curious to see where this is going. Some obvious questions are how accessible and pervasive voice transcription will become. Will there be a healthy developer base (voice technologies always suffer from having a small, somewhat esoteric one)? What about &#8220;real&#8221; web APIs, leveraging this stuff for mash-ups like smart ad placements in video, 3rd-party calendar plug-ins, etc.?</p>
<p>How to enable real international market penetration is also a big question. English-speaking markets appear to be reaching a good level of maturity. I&#8217;ve <a href="http://www.okkoblog.com/2008/05/05/internationalization-and-speech-technologies/" rel="nofollow">previously written</a> about the fact that there is no long tail in speech and language technology development. The buy-in costs per market/language remain the same, regardless of market size. Capital per market, however, varies greatly. This could be a real show stopper.</p>
<p>Thoughts?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on SpinVox, Voice-to-Text and Some Terminology by James Siminoff</title>
		<link>http://www.okkoblog.com/2010/01/18/spinvox-voice-to-text-and-some-terminology/comment-page-1/#comment-18</link>
		<dc:creator>James Siminoff</dc:creator>
		<pubDate>Sun, 31 Jan 2010 17:23:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=156#comment-18</guid>
		<description>You should check out http://www.techcrunch.com/2010/01/28/phonetag-voice-to-text-86-percent-accurate-google-voice/. You will probably also be interested in the study at http://www.scribd.com/doc/26017529/Accuracy-of-Voicemail-To-text-Services.

I think that this will be a very interesting market to follow in the next few years.</description>
		<content:encoded><![CDATA[<p>You should check out <a href="http://www.techcrunch.com/2010/01/28/phonetag-voice-to-text-86-percent-accurate-google-voice/" rel="nofollow">http://www.techcrunch.com/2010/01/28/phonetag-voice-to-text-86-percent-accurate-google-voice/</a>. You will probably also be interested in the study at <a href="http://www.scribd.com/doc/26017529/Accuracy-of-Voicemail-To-text-Services" rel="nofollow">http://www.scribd.com/doc/26017529/Accuracy-of-Voicemail-To-text-Services</a>.</p>
<p>I think that this will be a very interesting market to follow in the next few years.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Speaking Piano by Okko</title>
		<link>http://www.okkoblog.com/2009/12/31/speaking-piano/comment-page-1/#comment-17</link>
		<dc:creator>Okko</dc:creator>
		<pubDate>Fri, 08 Jan 2010 21:16:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.okkoblog.com/?p=71#comment-17</guid>
		<description>Let&#039;s do it!</description>
		<content:encoded><![CDATA[<p>Let&#8217;s do it!</p>
]]></content:encoded>
	</item>
</channel>
</rss>
