NCTI -

National Center for Technology Innovation
Advancing Technology Innovations for All Students

Chuck Rogers, Chief-Evangelist MacSpeech

Tags:, ,

photo: Chuck Rogers smiling

By Eric Morrison

Inquiries:
(603) 251-1477
http://www.macspeech.com


Description of technology: MacSpeech is a speech recognition solution specifically for Macintosh computers.


Intrigued by: Chuck is intrigued by complexities in marketing speech recognition. He frequently refers to core functionality and the glitz and attraction of relatively new functionality by using the analogy of “the steak and the sizzle.” For people with disabilities, Chuck indicates that the ‘steak’ of the product really is comprised of the core dictation, command and control aspects. For much of the rest of the market, the ‘sizzle’ aspects of a relatively new and unusual application are the attraction. Chuck seeks to move more users into a sense of the productive value of the product.


Seeing the world in new ways: Chuck uses the concept of a “third hand” to describe interactions with computer systems – an augmentation through another sensory input of the “two-hand” approach of conventional computing input through the keyboard and mouse or trackball. He emphasizes the productivity and control benefits of adding a voice control dimension for simple and common tasks, for example zooming in and out while using programs like PhotoShop. This could mitigate the need to change modes or tools to permit access to additional functions, increasing efficiency. Chuck believes that increasing demands for productivity and complexity in computing will drive a need for voice interaction with systems.


On design principles and challenges: Ease of use and interaction with speech that mirrors the way people are used to interacting through the keyboard constitute fundamental principles. Additionally, Chuck explains that speech recognition uses base voice models that are effective “amalgams” of various persons’ voices that seek to be compatible with the largest range of predicted male and female voices. Training a speech recognition file involves making individual adjustments to these base files as they respond to each unique speech pattern. Thus, Chuck explains that much of MacSpeech’s resources are targeted at, “going through all of testing that has to be done to make sure that we are collecting all of the necessary information to be able to create a useful voice model.”


On initial development challenges:Chuck related that in MacSpeech’s early days, it was essential to partner with another company that had a speech engine ready to go: a very narrow field. Critical considerations and factors included potential partners who were willing to work with them, were interested in seeking a “sliver” of the Macintosh software market, and would be willing to allow MacSpeech to maintain control over the product and its marketing. Ultimately MacSpeech has utilized its own modified version of a Phillips speech engine.


On users: Chuck categorizes MacSpeech users as, “either power users or just shy of power users.” Using the term in the positive, sophisticated sense, he says, “there’s a pretty big geek factor in the software… or people have a real need.”


On training and use: Human nature and perception, Chuck feels, play a major role in training and use. Changes in user interface in release iterations sometimes result in expansive claims of improved accuracy, even when no changes have been made to the speech engine within that iteration. Chuck feels that this accounts for a substantial amount of user preference of one competing product over another.


Views on the market: Chuck positions persons with disabilities who are interested in speech recognition for access integrally as part of the “community at large,” which also includes consumers in the medical and legal fields. However, he indicates that, “eighty percent of the market are people who want the ‘Star Trek’ factor” – the novelty of software that will convert speech to text on the screen. This realization has direct marketing implications for emphasizing the “sizzle” of the product for some present markets.


Success indicators - making a difference: Chuck puts forward the capacity of speech recognition to prevent the onset of repetitive motion injuries as a major success factor for the product for people who must engage in significant activity with computers. He also believes that those who have existing injuries may experience improvement by avoiding the keyboard and mouse. Chuck’s ultimate test of success is the level of productivity users experience.


On new horizons: Major horizons in speech recognition technology, such as recognition that does not require the user to verbalize punctuation or substantially improved accuracy, are seen as requiring both significant time and gains in processing power. Chuck is looking forward to the day of an imagined “10 gigahertz Macintosh G-7,” but feels this is fairly distant. With future gains in processing power of the magnitude of several times today’s capacities, Chuck sees an additional value for youth with and without disabilities: “You get one other benefit, you lower the threshold at which people can use it. Now all of a sudden, 8 year olds, 10 year olds and 12 year olds can start using it.


Provocative views & quotes: Chuck speaks provocatively of a “pain threshold” that is inherent in complex technology like speech recognition at this stage of development, and which varies across consumers. He says, “I think the assistive technologies market for people with disabilities has a higher pain threshold. They need the technology more, so they have a higher adoption rate after they acquire it.” Ironically, because sales are higher within the general market for people without disabilities, even though their ultimate adoption rates are presently substantially lower, it is necessary to focus marketing efforts on the general public who do not experience disabilities.

Reader Comments

Be the first to leave a comment!

Write a Comment

Take a moment to comment and tell us what you think. Some basic HTML is allowed for formatting.

You must be logged in to post a comment. Click here to login, or you can Register.


Information

Register to join the conversation by commenting, tracking what others have to say, or linking to it from your blog.

Related Resources
Tim Lalor, Software Developer, Ai Squared (ZoomText)
Lessons in Collaboration: How to Make a Partnership Work