Prediction: Voice Recognition and Text-to-Speech

June 30th, 2005 by Joe

Voice Recognition is getting pretty good these days… so is Text-to-Speech… but what would happen if you combined them?

Imagine a world where you could have 95% accuracy in voice recognition, and 95% intelligible text-to-speech. We’re pretty close to that now.

Add one other component: I’ll call it Voice Font(tm). Imagine a text-to-speech “font” of your own voice. Where anything you type can be run through a font and made to sound as if you’d said it, just like a Font applied to plain text might make something appear in calligraphy.

Tie it all together: Talk into your phone, your phone converts your voice into text then sends your Voice Font and message to the recipient. The recipient’s phone (computer, whatever) then receives text and converts it into YOUR voice. They respond in kind.

The potential bandwidth savings by sending text only is staggering. We’ve been focusing on how to expand our bandwidth, we can just optimize our bandwith usage.

Posted in Experimentation, Joe, Science, technology

Leave a Comment

Please note: Comment moderation is enabled and may delay your comment. There is no need to resubmit your comment.

About Greener Living thru Technology

JoeLevi.com is the personal web log of Joe Levi -- an ASP.NET Web Developer by trade and by hobby. Joe's love of technology isn't just limited to the web, he's also interested in green and environmentally friendly technology and technological solutions. If it has to do with technology, improving the quality of life, geek humor, tech politics, self-defense, environmental stewardship, or anything related, you'll probably find it at www.JoeLevi.com.

Site statistics:
Average: ~1.3 P/V; Visits: ~3,000; Pageviews: ~3,600; Google PR: 4; TechnoratiAuthority: 17; Technorati Rank: 487,964





Watch the latest videos on YouTube.com