
In the vast majority of cases, the sole representative of human factors on
speech system projects is the VUI designer (VUID). Much like the early human
factors pioneers, VUIDs are a heterogeneous group made up of ex-developers, way-
ward linguists, telephony managers, assorted psychologists, and many others with
little or no relevant formal training in speech technology or human factors. There
are currently no formal degree programs specifically in VUI design; however,
Auburn University has an engineering program with a strong VUI emphasis under
the direction of Juan Gilbert. What VUIDs may lack in formal human factors train-
ing, most make up for in their enthusiasm to bring strong user-centered design
practices to speech projects. To get a sense of the VUID community, consider
visiting the VUIDs Yahoo group at
http://tech.groups.yahoo.com/group/vuids
.
Strong support for human factors in speech technology systems has come from
AVIOS, the Applied Voice Input Output Society. AVIOS is a nonprofit membership
organization for speech technology professionals that has been influential for over
20 years. AVIOS often sponsors speech technology conferences and previously had
editorial control of the
International Journal of Speech Technology
, an important
source of rigorous technical and user interface information for speech systems.
More so than in the GUI world, VUIDs need a mix of both technical and
design skills. GUI interactions occur using a mouse, keyboard, or some other pre-
cise input device, and GUI designers can reasonably assume that user input will
be interpreted correctly by the system. In speech-enabled applications, well-
formed user input will be misinterpreted in a significant proportion of cases. Most
current speech recognition algorithms boast near 100 percent recognition accu-
racy, but in practice, error rates of 10 to 20 percent or more at a given recognition
state are not uncommon. Therefore, the VUI designer must understand the limita-
tions of the technology and design using strategies that minimize technology-level
errors. VUIDs also serve as the voice of the user in speech projects, much as other
human factors professionals do in the GUI world. On many projects, the VUI
designer is working alone without any other human factors support, so it is vital
to have firm guidelines for appropriate user-centered design processes, as well
as guerrilla techniques to fall back on when the ideal is not possible. The ability
to justify and evangelize for user-centered design (UCD) processes is vital for
VUIDs who are often the first exposure project members have to human factors.
6.4.1 User-Centered Design for Speech Projects
User-centered design practice for speech technologies is similar but not identical
to established UCD techniques. At a general level, the process is similar: Gather
information from users and the business, design a preliminary interface, test with
real users, modify the design, and iterate. There are different twists at each stage,
however, because of the unique position of speech technologies.
During requirements gathering, VUI designers have unique sources of infor-
mation relevant to the voice channel. Since most speech technology is deployed
6.4 Human Factors Design of the Interface
215