Khudanpur, Sanjeev

Associate Professor: Department of Electrical and Computer Engineering &
Department of Computer Science (secondary appointment)
Faculty Member: Center for Language and Speech Processing
Faculty Affiliate: Human Language Technology Center of Excellence
E mail: My last-name at jhu dot edu
Telephone: +1.410.516.7024
Fax: +1.410.516.5050
Office Location: 325 Computational Sciences and Engineering Building
3400 North Charles Street
Baltimore, MD 21218
Office Hours: By appointment (Tue-Thu 10:30-12:00)

Biosketch: I received the B.Tech. degree in Electrical Engineering from the Indian Institute of Technology, Bombay, in 1988, and the Ph.D. degree in Electrical and Computer Engineering from the University of Maryland, College Park, in 1997. My doctoral dissertation was supervised by Prof. Prakash Narayan and was titled Model Selection and Universal Data Compression.Since 1996, I have been on the faculty of the Johns Hopkins University. Until June 2001, I was an Associate Research Scientist in the Center for Language and Speech Processing and, from July 2001 to June 2008, an Assistant Professor in the Department of Electrical and Computer Engineering and the Department of Computer Science. I became an Associate Professor in July 2008.

I am also affiliated with the Johns Hopkins University Human Language Technology Center of Excellence.

In Fall 2000, I held a visiting appointment in the Institute for Mathematics and its Applications (IMA), University of Minnesota, Minneapolis, MN. I organized two IMA workshops on the role of mathematics in multimedia – “Mathematical Foundations of Speech Processing and Recognition,” and “Mathematical Foundations of Natural Language Modeling.”

Click here for a full vita or a two-page NSF-style resume.

Teaching: 520.447 Introduction to Information Theory and Coding (Fall semesters)
520.651 Random Signal Analysis (Fall semesters)
520.666 Information Extraction from Speech and Text (Spring semesters)
520.674 Information Theoretic Methods in Statistics (Spring semesters)
Research: I am interested in the application of information theoretic methods to human language technologies such as automatic speech recognition, machine translation and natural language processing. All these technologies make heavy use of statistical models of human language. I am interested in understanding the structure of such models and in estimating their parameters from data.I also organize the CLSP Summer Workshops to advance the greater research agenda of this field.
Publications in Peer-Reviewed Journals:

[1] A. Kanlis, S. Khudanpur and P. Narayan, “Typicality of a Good Rate-Distortion Code,” in Problemy Peredachi Informatsii (Problems of Information Transmission), 32(1):96-103, 1996.

[2] M. Riley, W. Byrne, M. Finke, S. Khudanpur, A. Ljolje, J. McDonough, H. Nock, M. Saraclar, C. Wooters and G. Zavaliagkos, “Stochastic Pronunciation Modeling from Hand-Labelled Phonetic Corpora,” in Speech Communication, 29(2-4):209-224, 1999.

[3] M. Saraclar, H. Nock and S. Khudanpur, “Pronunciation Modeling by Sharing Gaussian Densities Across Phonetic Models,” in Computer Speech and Language, 14(2):137-160, 2000.

[4] S. Khudanpur and J. Wu, “Maximum Entropy Techniques for Exploiting Syntactic, Semantic and Collocational Dependencies in Language Modeling,” in Computer Speech and Language, 14(4):355-372, 2000.

[5] S. Khudanpur and P. Narayan, “Order Estimation for a Special Class of Hidden Markov Sources and Binary Renewal Processes,” in IEEE Transactions on Information Theory, 48(6):1704-1713, 2002.

[6] D. He, D. Oard, J. Wang, J. Luo, D. Demner-Fushman, K. Darwish, P. Resnik, S. Khudanpur, M. Nossal and A. Leuski, “Making MIRACLEs: Interactive Translingual Search for Cebuano and Hindi,” in ACM Transactions on Asian Language Information Processing, 2(3):219-244, 2003.

[7] S. Khudanpur and W. Kim, “Contemporaneous Text as Side Information in Statistical Language Modeling,” in Computer Speech and Language, 18(2):143-162, 2004.

[8] H. Meng, B. Chen, S. Khudanpur, G. Levow, W. Lo, D. Oard, P. Schone, K. Tang, H. Wang and J. Wang, “Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval,” in Computer Speech and Language, 18(2):163-179, 2004.

[9] W. Kim and S. Khudanpur, “Lexical Triggers and Latent Semantic Analysis for Cross-Lingual Language Model Adaptation ,” in ACM Transactions on Asian Language Information Processing, 3(2):94-112, 2004.

[10] M. Saraclar and S. Khudanpur, “Pronunciation Change in Conversational Speech and Its Implications for Automatic Speech Recognition,” in Computer Speech and Language, 18(4):375-395, 2004.

[11] B. Jedynak and S. Khudanpur, “Maximum Likelihood Set for Estimating a Probabilty Mass Function,” in Neural Computation, 17(7):1508-1530, 2005.

[12] D. Karakos, S. Khudanpur, D. Marchette, A. Papamarcou and C. Priebe, “On the Minimization of Concave Information Functionals for Unsupervised Classification via Decision Trees,” in Statistics and Probability Letters, 78(8):975-984, 2008.

[13] Z. Li, C. Callison-Burch, S. Khudanpur and W. Thornton, “Decoding in Joshua: Open Source, Parsing Based Machine Translation,,” in The Pargue Bulletin of Mathematical Linguistics, Special Issue: Open Source Tools for Machine Translation, 91:47-56, 2009.

[14] J. Baker, L. Deng, J. Glass, S. Khudanpur, C. Lee, N. Morgan and D. O’Shaughnessy, “Research Developments and Directions in Speech Recognition and Understanding, Part 1,,” in IEEE Signal Processing Magazine, 26(3):75-80, 2009.

[15] J. Baker, L. Deng, S. Khudanpur, C. Lee, J. Glass, N. Morgan and D. O’Shaughnessy, “Research Developments and Directions in Speech Recognition and Understanding, Part 2,,” in IEEE Signal Processing Magazine, 26(4):78-85, 2009.

[16] C. White, S. Khudanpur and P. Wolfe, “Likelihood-Based Semi-Supervised Model Selection with Applications to Speech Processing,,” in IEEE Journal of Selected Topics in Signal Processing, 4(6):1016-1026, 2010.

[17] B. Varadarajan, S. Khudanpur and T. Tran, “Stepwise Optimal Subspace Pursuit for Improving Sparse Recovery,,” in IEEE Signal Processing Letters, 18(1):27-30, 2011.

Publications in Refereed Conference Proceedings:

These are listed in my full vita.

Edited Books: Mathematical Foundations of Speech and Language Processing,
M. Johnson, S. Khudanpur, M. Ostendorf and R. Rosenfeld (Editors),
IMA Volumes in Mathematics and Its Applications, Vol. 138, Springer-Verlag, New York, Jan 2004. (Table of Contents)
Colleagues and Advisees: Faculty
Chris Callison-Burch, CS
Jason Eisner, CS
Greg Hager, CS
Hynek Hermansky, ECE
Fred Jelinek, ECE
Damianos Karakos, ECE
Carey Priebe, AMS
David Yarowsky, CS

Former Advisees (Degree) and their last known whereabouts

Arnab Ghoshal (PhD), Apple
Woosung Kim (PhD), Nuance Communications
Zhifei Li (PhD), Google
Murat Saraclar (PhD), Bogazici University
Balakrishnan Varadarajan (PhD), Google
Chris White (PhD), Harvard University
Jun Wu (PhD), TencentGeetu Ambwani (MSE), Streamsage (Comcast)
Wei Chen (MSE), CMU LTI
Sourin Das (MSE) Constellation Energy
Yan Huang (MSE), ICSI, Berkeley
Srividya Mohan (MSE), Multimodal Technologies
Binit Mohanty (MSE), Epic Systems
Srihari Reddy (MSE), Yahoo!
Paola Virga (MSE), IBM T. J. Watson Research Center
Ziyuan Wang (MSE), SDL (Language Weaver)
Ali Yazgan, (MSE), SAS (Business Analytics).Current Advisees
Yuan Cao, ECE
Scott Novotney, CS
Ariya Rastrow, ECE
Puyang Xu, ECEAdministrative Staff
Ruth Scally, CLSP
Karen Austin, CLSP
Carl Pupa, CLSP

Personal: I spent three fun-filled weeks in Kenya (and Tanzania) with two of my nieces and their parents – my sister and brother-in-law – in May and June of 2002. We visited lots of national parks and game reserves and took lots of pictures with a simple point-and-shoot camera. Here are a few of them in a 52MB Powerpoint document or a 310MB PDF document. Take a peek if you have a few minutes (or a few hours, depending on your modem-speed!).
I went there again in 2004 and had even more fun. Took some great photos too! I need to find a better way to put them up.

Send comments about this page to Sanjeev Khudanpur

Last modified: Sat Sep 26 07:12:18 EDT 2009

Center for Language and Speech Processing