Speech SDK

John Lawrence writes about speech2blog:

“Now if the blog page was navigable with the Speech Application SDK speech controls, that would be really cool.“


So I dug out the cool cds & dvds that I hadn't gotten to look at yet from the PDC, and checked out the Speech Application SDK (PDC gave out v1.0b3).  There's also a "Must Have Cool Stuff For Developers" CD (a great hook if ever I heard one), which contains speech enabled versions of the Commerce Starter Kit and the Fitch and Mather Stock app (among other things).

In response to John's comment, it'll be tough to add support in Speech2Blog since it's a Longhorn app, and SASDK will not install on Longhorn (OS Requirements state 2000sp3, 2003, or XPProSP1).  The MS Enterprise Instrumentation Framework is also required (download link in this post ).  The feature looked like adding support for speech navigation to .Text, rather than through my app. From the marketing info: "Multimodal applications use Internet Explorer as the client."

This does sound intriguing, though.  Here are some highlights from the documentation I was given as far as ASP.NET and the SASDK:

 *ASP.NET Speech controls (needs either Telephony Services client or the "Speech Add-in for MSIE")
 *Speech Application Project Wizard
 *Grammar editor
 *Modify pronunciations (with the Semantic Script Editor) 
 *Record Phrases: "you can associate prerecorded .wav files with transcriptions in the Transcription Window" (I wonder if this means we'll finally be able to do voice recognition from .wav files, rather than being forced to use the microphone!) From the website FAQ: "The SASDK supports only 8 kHz, 8 bit, mono, PCM recording."
 *Speech Debugging Console: "Tells you what is recognized and how confident the speech engine is in that recognition."


Link to order/download free copy of the SASDK

No Comments