Yanmin Qian received the B.S. degree from the Department of Electronic and Information Engineering,Huazhong University of Science and Technology, Wuhan, China, in 2007, and the Ph.D. degree from the Department of Electronic Engineering, Tsinghua University, Beijing, China, in 2012. Since 2013, he has been with the Department of Computer Science and Engineering, Shanghai Jiao Tong University (SJTU), Shanghai, China, where he is currently an Associate Professor. From 2015 to 2016, he also worked as an Associate Research in the Speech Group, Cambridge University Engineering Department, Cambridge, U.K. He is a member of IEEE and ISCA, and the key member of Kaldi Develop Group. He has published more than 60 papers on speech and language processing with 1400+ citations, including the top conference: ICASSP, INTERSPEECH and ASRU. His current research interests include the acoustic and language modeling in speech recognition, speaker and language recognition, key word spotting, and multimedia signal processing.

Research interests

  • Speech & Language understanding and human computer interaction
  • Large vocabulary continuous speech recognition
  • Discriminative training of acoustic models
  • Robust speech recognition
  • Multilingual speech recognition and Low-resource speech recognition
  • Deep learning based speech signal processing
  • Multimedia Signal Processing
  • GPU and SOC based fast speech recognition


Current Projects
  • Structured Deep Learning Study for the Robust Speech Recognition in the Heterogeneous Noisy Scenario, supported by the NSFC (NO. 61603252)
  • Shanghai Sailing Program, support by the Shanghai Government, China (No. 16YF1405300)
  • Deep Learning for Noise Robust Speech Recognition, supported by Shanghai Jiao Tong University
  • SMC-Chenxin Young Scholar Award, supported by Shanghai Jiao Tong University
  • The Interdisciplinary Program of Shanghai Jiao Tong University, supported by Shanghai Jiao Tong University (14JCZ03)
  • Joint SJTU-AISpeech Laboratory, supported by AISpeech Corporation
Past Projects
  • Speech Denoising by Deep Neural Network, supported by Baidu Corporation
  • Mixing Digital and English Speech Recognition, support by China Aviation Industry Group (AVIC)
  • Speech Recognition Technology Under the Low-Data-Resource Conditions, supported by an NSFC project and the PhD Research and Innovation Fund of Tsinghua University
  • Kaldi Speech Recognition Toolkit Development and Research
  • Large Vocabulary Continuous Speech Recognition System and Spoken Term Detection System Development and Research, Supported by the China 863 Projects, NSFC Projects and the Projects from China's Ministry of National Defense
  • Multilingual Speech Recognition Research, supported by the Interdisciplinary Fund Support by School of Information Science and Technology in Tsinghua University
  • Speech Recognition SOC System Development Under the Low-Hardware-Resource Condition, The SOC system is applied in the 2008 Olympic mascots, and win the High-tech Olympics Advanced Award


  • IEEE Member, IEEE SPS Member
  • ISCA Member
  • CCF Member
  • TPC Member for InterSpeech, ISCSLP, COCOSDA, ChinaSip
  • Regular reviewer for IEEE/ACM Transactions on Audio, Speech and Language, IEEE Journal of Selected Topics in Signal Processing, IEEE Signal Processing Letter, Neurocomputing, Multimedia Tools and Applications, etc
  • Regular reviewer for International conferences: ICASSP, INTERSPEECH, ASRU, SLT, ISCSLP, ChinaSip, EUSIPCO, COCOSDA, NCMMSC, ICPR, etc
Open-source toolkit
  • The Kaldi Speech Recognition Toolkit:
  • CUED-RNNLM-An open-source toolkit for efficient training and evaluation of recurrent neural network language models:


  • 2015--The First Prize of the MGB Data Recognition Challenge
  • 2015--The Third Prize of the Automatic Speaker Verification Spoofing and Countermeasures Challenge
  • 2015--Shanghai Jiao Tong University SMC-Chenxin Young Scholar Award
  • 2014--The Second Prize of the Fourth Wu Wenjun Artificial Intelligence Science and Technology Award
  • 2013--The Second Excellent Doctoral Dissertation Award in Tsinghua University
  • 2012--Google Grants Award in InterSpeech2012 (Total 4 PhDs around the world)
  • 2012--Tsinghua-JiangZhen Scholarship, First Class(Total 25 students in Tsinghua University)
  • 2011--Tsinghua-JiangZhen Scholarship, First Class(Total 25 students in Tsinghua University)
  • 2010--Excellent PhD Academic Newcomer Award Nomination of Chinese Education Ministry
  • 2010--PhD Research and Innovation Award of Tsinghua University
  • 2009--Interdisciplinary Fund Support by School of Information Science and Technology in Tsinghua University


Yanmin Qian
Institute of Intelligent Human Machine Interaction
Computer Science and Engineering Department
Tel: +86 21 34207008
3-501 SEIEE Building, 800 Dongchuan Road, Minhang District, Shanghai
200240, China