Conference time: I will be spending a couple of days in London and Brighton from September 5th attending Interspeech, SIGDIAL as well as a researcher round-table. Anyone interested in meeting up, feel free to get in touch.
Posts Tagged ‘iPhone’
The iPhone has proved a game-changer in many regards and speech is no exception. Both Google and Yahoo (with vlingo) have deployed mobile speech applications for the iPhone.
Today I came across another sighting of iPhone speech recognition, Vocalia by Creaceed, employing open-source ASR engine Julius for back-end technology. There is no “push to talk” button but a “shake to retry”, which may prove useful when recognition goes awry. The app supports French, English and German for now and costs €2.99. Dictation is not available at this point, though Julius is certainly capable of it from an architecture point of view.
Other speech and language related iPhone apps:,
- Google Mobile – voice search app
- Vlingo – speech-enables your phone
- Pocket – language learning app
- Voice Dial – speech-enabled dialer
- VoiceThis – speech-enabled dialer
- iSpeak – multi-language translator with synthesized output
- A stuttering aid (not yet available)
Has anyone used these extensively? What is your experience with speech on the iPhone?
A no-frills phone with the unlikely name of Zumba Lumba has recently received some attention by the BBC. The phone is said to be top-secret, developed by a defense-aviation company. It does without frills like a camera or an applications platform, but touts some interesting security and computational features, (not only) related to speech technology:
- Cloud computing – the phone uses no local storage for contacts, data.
- Network speech recognition – user input is recognized over the internet. This should avoid hardware intensive local computing for voice input, but requires internet access.
- Voice identification – enhanced security, because the phone will only respond to a single user’s voice.
Some seem to think this is a potential iPhone killer at least in terms of making use of innovative input modalities (though Google already released a speech recognition app for the iPhone.) Others simply thinks it’s a hoax.
Either way, the idea of joining mobile with cloud computing is interesting. Using voice identification for security has its appeal as well, even if it’s unclear whether keeping data in the cloud and sending voice data over the internet is any more secure than simply keeping data on your phone, locally.
Google released a new feature for its Google Mobile iPhone Application yesterday: voice search. Users speak a query and the application returns search results formatted for the iPhone. This is similar to the GOOG411 directory assistance application, which allows users to call a phone number, speak a query and receive information about local listings in voice or SMS formats. However the new application apparently performs recognition locally on the iPhone, meaning it comes bundled with an embedded speech recognition engine.
Aside from GOOG411, during the US presidential Google released Gaudi, a voice indexing technology for video. That makes the iPhone app the third official service the company releases, making use of speech recognition, leaving one guessing when Google’s speech technology becomes available as API, like the Google AJAX Language API for translation and transliteration, rather than bundled as software services. Also, an Android version is probably in the works, one would guess.
All applications are available in US English for now.