HTML5 may help Web pages talk, listen

The World Wide Web Consortium is investigating integrating speech capabilities into Web standards

Sometime in the near future, users might not only read Web pages but hold conversations with them as well, at least if the work of a new group in the W3C (World Wide Web Consortium) bears fruit.

The W3C is investigating the possibility of incorporating voice recognition and speech synthesis interfaces within Web pages. A new incubator group will file a report a year from now summarizing the feasibility of adding voice and speech features into HTML, the W3C's standard for rendering Web pages.

AT&T, Google, Microsoft and the Mozilla Foundation, among others, all have engineers participating in this effort.

The human voice and the Web are not strangers: Google includes a voice-based Web search app in its Android smartphone operating system and Microsoft promises robust voice-driven features in its upcoming Windows Phone 7.

The HTML Speech Incubator Group is studying the feasibility of developing a standard Web interface for both speech recognition and synthesis, said group chair Dan Burnett, who is also director of speech technologies and standards at voice response system provider Voxeo.

Such an interface could be used across multiple browsers. Using built-in or plug-in voice recognition and speech synthesis engines, browsers could read pages aloud or permit users to audibly fill out Web forms.

While this work may overlap with another voice-based W3C effort, VoiceXML, the two efforts are somewhat different, Burnett said. VoiceXML wouldn't work very well for the Web, given that it was primarily designed for voice-driven applications, such as telephone-based voice response systems, where it is widely used. Like HTML itself, the voice capabilities of HTML would be stateless, meaning they would not require a dedicated session with the user.

Burnett noted that while the report would discuss the feasibility of establishing a set of interfaces, the work of developing the interfaces themselves, should they be warranted, would be taken on by another W3C group, such as the HTML Working Group.

The W3C has been busy with speech technologies on a number of other fronts as well. The organization recently released version 3.0 of VoiceXML. In this new version, the working group added semantic descriptions of the features and organized the functionality into modules.

The W3C also plans to release version 1.1 of SSML (the Speech Synthesis Markup Language), which is often used in conjunction with VoiceXML, shortly. The new version will incorporate Asian languages and give developers more flexibility with voice selection and with the handling of content in unexpected languages.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com
