Reference Hub5
Audio-Visual and Visual-Only Speech and Speaker Recognition: Issues about Theory, System Design, and Implementation

Audio-Visual and Visual-Only Speech and Speaker Recognition: Issues about Theory, System Design, and Implementation

Derek J. Shiell, Louis H. Terry, Petar S. Aleksic, Aggelos K. Katsaggelos
Copyright: © 2009 |Pages: 38
ISBN13: 9781605661865|ISBN10: 1605661864|ISBN13 Softcover: 9781616925338|EISBN13: 9781605661872
DOI: 10.4018/978-1-60566-186-5.ch001
Cite Chapter Cite Chapter

MLA

Shiell, Derek J., et al. "Audio-Visual and Visual-Only Speech and Speaker Recognition: Issues about Theory, System Design, and Implementation." Visual Speech Recognition: Lip Segmentation and Mapping, edited by Alan Wee-Chung Liew and Shilin Wang, IGI Global, 2009, pp. 1-38. https://doi.org/10.4018/978-1-60566-186-5.ch001

APA

Shiell, D. J., Terry, L. H., Aleksic, P. S., & Katsaggelos, A. K. (2009). Audio-Visual and Visual-Only Speech and Speaker Recognition: Issues about Theory, System Design, and Implementation. In A. Liew & S. Wang (Eds.), Visual Speech Recognition: Lip Segmentation and Mapping (pp. 1-38). IGI Global. https://doi.org/10.4018/978-1-60566-186-5.ch001

Chicago

Shiell, Derek J., et al. "Audio-Visual and Visual-Only Speech and Speaker Recognition: Issues about Theory, System Design, and Implementation." In Visual Speech Recognition: Lip Segmentation and Mapping, edited by Alan Wee-Chung Liew and Shilin Wang, 1-38. Hershey, PA: IGI Global, 2009. https://doi.org/10.4018/978-1-60566-186-5.ch001

Export Reference

Mendeley
Favorite

Abstract

The information imbedded in the visual dynamics of speech has the potential to improve the performance of speech and speaker recognition systems. The information carried in the visual speech signal compliments the information in the acoustic speech signal, which is particularly beneficial in adverse acoustic environments. Non-invasive methods using low-cost sensors can be used to obtain acoustic and visual biometric signals, such as a person’s voice and lip movement, with little user cooperation. These types of unobtrusive biometric systems are warranted to promote widespread adoption of biometric technology in today’s society. In this chapter, the authors describe the main components and theory of audio-visual and visual-only speech and speaker recognition systems. Audio-visual corpora are described and a number of speech and speaker recognition systems are reviewed. Finally, various open issues about the system design and implementation, and present future research and development directions in this area are discussed.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.