Microsoft Tries To Make Vox Popular - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.


Microsoft Tries To Make Vox Popular

Voice recognition is a languishing technology that Microsoft apparently sees as a ripe opportunity.

Microsoft is expanding a software effort that, if successful, could have even some of the most ardent Microsoft bashers singing its praises. The company is moving on voice-recognition apps designed to, among other things, lead callers through those thickets of customer-service phone menus.

At a speech-technology conference in New York on Wednesday, Microsoft introduced a second beta version of its .Net Speech Software development kit, a plug-in to its Visual Studio .Net development-tools suite that aims to get more programmers writing speech apps that use Microsoft tools and servers. The company also released a "technical preview," of its .Net Speech Platform, a set of application-building components. The software is due next year.

Microsoft's efforts could expand markets for wireless phone airtime, speech-recognition software, and PDAs--an area in which Microsoft is picking up market share. Microsoft said Oct. 22 it is buying Vicinity Corp., which makes software that delivers maps and directions to cell phones and PDAs, for $96 million in cash.

"Speech recognition has never really taken off the way people thought it would," says Brian Strachman, an analyst at market-research firm In-Stat. "We need a broad base of applications, and Microsoft has a direct line to all the developers out there."

Microsoft's Office suite includes a dictation-taking capability, and the company published an API for building speech-capable Windows apps. But the company's latest effort focuses on selling call-center software. Airlines and banks often maintain separate sets of application software logic for their Web sites and call centers, which is wasteful, says Kai-Fu Lee, a Microsoft VP who managed the company's Beijing research lab in the late '90s. "The Web is where the real-time, accurate data resides." Microsoft hopes the widespread use of its tools will encourage more companies to write apps that allow access to the same data by phone or over the Web.

"The call center has always been the first, best market for speech recognition," analyst Strachman says, since software can provide a quick payback in saved labor. Telephone customer-service, or "interactive voice response," apps have been characterized by dedicated hardware and application-specific protocols for retrieving data. VoiceXML, a markup language backed by AT&T, IBM, Lucent Technologies, and Motorola, has been a way to reuse logic across voice applications.

Microsoft's approach, called Speech Application Language Tags, adds support for "multimodal" apps that can deliver text and graphics to computer and cell-phone screens, in addition to voice feedback. Microsoft and its partners in the Salt Forum, which include Cisco Systems and Intel, submitted Salt 1 to the World Wide Web Consortium, an industry-standards body, in August. There aren't any multimodal applications on the market today, but they could debut next year as more developers adopt Salt. The new beta version of the .Net Speech development kit--a collection of pre-built components that automate common programming tasks--includes the ability to build a usable server-side application and complies with the version of Salt submitted to the W3C. The .Net Speech Platform includes a Salt-enabled version of the Internet Explorer Web browser, software that communicates with call-center answering and routing hardware, and a speech-recognition engine.

Microsoft plans to give away the development kit but hasn't decided how it will package its speech platform. In addition to license revenue, there's also value in seeding the market with speech software, Lee says. "Speech will be part of the PC experience," Lee says. "But it won't come from the PC, because the keyboard and mouse work so well. We want to decrease the users' fear of the technology."

We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Comment  | 
Print  | 
More Insights
2020 State of DevOps Report
2020 State of DevOps Report
Download this report today to learn more about the key tools and technologies being utilized, and how organizations deal with the cultural and process changes that DevOps brings. The report also examines the barriers organizations face, as well as the rewards from DevOps including faster application delivery, higher quality products, and quicker recovery from errors in production.
Think Like a Chief Innovation Officer and Get Work Done
Joao-Pierre S. Ruth, Senior Writer,  10/13/2020
10 Trends Accelerating Edge Computing
Cynthia Harvey, Freelance Journalist, InformationWeek,  10/8/2020
Northwestern Mutual CIO: Riding Out the Pandemic
Jessica Davis, Senior Editor, Enterprise Apps,  10/7/2020
Register for InformationWeek Newsletters
Current Issue
[Special Report] Edge Computing: An IT Platform for the New Enterprise
Edge computing is poised to make a major splash within the next generation of corporate IT architectures. Here's what you need to know!
White Papers
Twitter Feed
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.
Sponsored Video
Flash Poll