CLSP Homepage : Workshop Homepage
Workshop 2002
Preworkshop Lecture Sunday, July 20, 2008


Jump To:

Seminar Information
An Overview of Automatic Speaker Recognition: Current and Future Approaches: Douglas Reynolds - 07/10/2002


slides from Douglas Reynolds' lecture (.pdf format)

  • Abstract:

    In this talk we provide an overview of the area of speaker recognition, defining terminology, discussing applications, describing current underlying techniques and providing some indications of performance. We will discuss some of the strengths and weaknesses of current acoustic based approaches to speaker recognition and present some newer approaches aimed at exploiting higher-levels of information conveying speaker identity. Finally we outline some potential future trends in research and development in this area.

     

  • Biography:

    Douglas Reynolds received his B.E.E. degree (with highest honors) in June 1986 and his Ph.D. degree in September 1992 both from the Georgia Institute of Technology. During the summers of 1989 and 1991, he was a summer staff member of the Information Systems Technology Group (formerly known as the Speech Systems Technology Group) at MIT Lincoln Laboratory, where he conducted research in speaker recognition, developing the use of Gaussian mixture models (GMMs) for text-independent speaker representations. Doug joined the Information Systems Technology Group as a Member of Technical Staff in 1992 where he conducted research in the areas of robust speaker recognition (identification and verification), transient classification and robust speech representations for recognition. During this period he invented and developed several widely used techniques in the area of speaker recognition, such as robust modeling with GMMs, application of a universal background model to text-independent recognition tasks, the use of Bayesian adaptation to train and update speaker models, fast scoring techniques for GMM based systems, the development and use of a handset/channel-type detector, and several normalization techniques based on the handset/channel-type detector. These and other ideas have been implemented in the Lincoln speaker recognition system which has won several annual international speaker recognition evaluations conducted by the National Institute of Standards and Technology (NIST). Research into combining robust speaker verification with other forms of authentication for highly secure information and/or account access has resulted in two pending patents. In 1998, Doug was appointed to the position of Senior Member of Technical Staff where he provides technical oversight of the Lincoln speech projects in speaker and language recognition and speech-content based information retrieval. Current research in the area of speaker recognition is focused on application of recognition techniques to multi-speaker speech and improving the performance of recognition systems under mismatched channel conditions. Doug is a member of the IEEE and is currently serving on the IEEE Signal Processing Society's Speech Technical Committee.




The Center for Language and Speech Processing
The Johns Hopkins University
3400 North Charles Street, Barton Hall
Baltimore, MD 21218
*Telephone: (410) 516-4237 *Fax: (410) 516-5050 *E-mail: clsp@clsp.jhu.edu