Networked humanoid animation driven by human voice using extensible 3D (X3D), H-Anim and JAVA speech open standards

Download
Author
Apaydin, Ozan
Date
2002-03Advisor
Brutzman, Don
Second Reader
Yun, Xiaoping
Metadata
Show full item recordAbstract
Speech-recognition technology is beginning to be used in automobiles, telephones, personal digital assistants (PDAs), medical records, e-commerce, text dictation and editing. Speech recognition can also be integrated into Virtual Environments (VEs) to create responsive virtual entities. Like the mouse, keyboard, and the trackball, Speech-recognition technology can enhance the control of a computer and improve communication. Dramatically expanding interest in the Internet and VEs has been gated by limited interactivity with human-avatar models. As more users begin interacting with avatars in VEs, designers are prompted to create more realistic, humanlike avatars. This quest for realism needs to go beyond visual aspects to include speechrecognition technology, which can greatly augment the realism of these avatars. This thesis presents design and development of a Voice User Interface (VUI), which maps to a set of behavioral motions for humanoid avatars using Extensible 3D (X3D) graphics, the Virtual Reality Modeling Language (VRML), Humanoid Animation (H-Anim) Standard and Java Speech API. The VUI includes a suitable speech-recognition component for application-command vocabularies. This thesis also demonstrates interchangeability of both avatars and animation behaviors, and creates networked humanoid animation driven by a human voice.
Rights
Copyright is reserved by the copyright ownerCollections
Related items
Showing items related by title, author, creator and subject.
-
Augmenting comprehension of speech in noise with a computer-generated facial avatar and its effect on performance
Swann, William R. (Monterey, California. Naval Postgraduate School, 2010-12);Military operations often occur in noisy environments, which can interfere with effective verbal communication. Previous studies have established the effectiveness of allowing a listener to see the speaker's mouth. This ... -
Developing articulated human models from laser scan data for use as avatars in real time networked virtual environments
Dutton, James Allen (Monterey, California. Naval Postgraduate School, 2001-09);With the continuing gain in computing power, bandwidth, and Internet popularity there is a growing interest in Internet communities. To participate in these communities, people need virtual representations of their bodies, ... -
An investigation of methods of improving the intelligibility of audio-frequency speech in noise
Huddy, Norman Walter (Monterey, California. U.S. Naval Postgraduate School, 1966-10);A discussion of the nature of speech is presented, followed by a review of speech processing to date, with emphasis on the characteristics of speech which must be retained for intelligibility. Methods of measuring speech ...