+86 17625641972
上海交通大学电院3号楼225
speaker recognition

About me CV Google Scholar

My name is Shuai Wang, I am a Ph.D. student at SpeechLab, Shanghai Jiao Tong University. Supervised by Prof. Kai Yu and Prof. Yanmin Qian, my research interests lie primarily in speaker recognition and diariazation.

Education

  • 09/2015-Present Shanghai Jiao Tong University, School of Electronic Information and Electrical Engineering
  • 09/2014-07/2015 Shanghai Jiao Tong University, School of Electronic Information and Electrical Engineering
    • Ph.D. Candidate in Software Engineering ( Supervisor: Prof. Fei Hu)
  • 09/2010-07/2014 Northwestern Polytechnical University, School of Software and Microelectronics

SID related Challenges

Internship

Publications

  1. Shuai Wang, Yanmin Qian and Kai Yu. What Does the Speaker Embedding Encode? Interspeech 2017. pdf
  2. Xiaowei Jiang, Shuai Wang, Xu Xiang, Yanmin Qian. Integrating Online i-vector into GMM-UBM for Text-dependent Speaker Verification. APSIPA 2017. pdf
  3. Shuai Wang, Yanmin Qian and Kai Yu. Focal KL-Divergence based Dilated Convolutional Neural Networks for Co-channel Speaker Identification. ICASSP 2018 (IEEE Ganesh N. Ramaswamy Memorial Award) pdf
  4. Zili Huang, Shuai Wang and Yanmin Qian. Joint i-vector with End-to-End system for Short Duration Text-independent speaker verification. ICASSP 2018 pdf
  5. [Shuai Wang, Zili Huang] and Kai Yu. Angular Softmax for Short-Duration Text-independent Speaker Verification. (Joint First Author) Interspeech 2018 (ISCA Travel Grant) pdf
  6. Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang and Dong Yu.  Past Review, Current Progress and Challenges Ahead on Cocktail Party Problem. FITEE
  7. Shuai Wang, Heinrich Dinkel, Yanmin Qian and Kai Yu. Covariance Based Deep Feature for Text-dependent Speaker Verification. IScIDE 2018 pdf
  8. Yexin Yang, Shuai Wang, Man Sun, Yanmin Qian and Kai Yu. Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification. ISCSLP 2018.
  9. Shuai Wang, Zili Huang and Kai Yu. Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. ISCSLP 2018.
  10. Shuai Wang, Yexin Yang, Tianzhe Wang, Yanmin Qian and Kai Yu. Knowledge Distillation for Small Foot-print Deep Speaker Embedding. ICASSP 2019.
  11. Shuai Wang, Johan Rohdin, Lukáš Burget, Oldřich Plchot, Yanmin Qian, Kai Yu and Jan Černocký. On the Usage of Phonetic Information for Text-independent Speaker Embedding Extraction. Interspeech 2019. pdf slides
  12. Zhanghao Wu, Shuai Wang, Yanmin Qian and Kai Yu. Data Augmentation using Variational Autoencoder for Embedding based Speaker Verification. Interspeech 2019. pdf slides
  13. Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian and Kai Yu. Cross-domain replay spoofing attack detection usingdomain adversarial training. Interspeech 2019. (ISCA Travel Grant)
  14. Yexin Yang, Hongji Wang, Heinrich Dinkel, Zhengyang Chen, Shuai Wang, Yanmin Qian and Kai Yu. The SJTU Robust Anti-spoofing System for the ASVspoof 2019 Challenge. Interspeech 2019.
  15. Mireia Diez, Lukáš Burget, Shuai Wang, Johan Rohdin, Jan Černocký. Bayesian HMM based x-vector clustering for Speaker Diarization. Interspeech 2019. pdf
  16. Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian and Kai Yu. Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. arXiv:1906.07317v1 (Accepted by APSIPA ASC 2019)
  17. Yefei Chen, Shuai Wang, Yanmin Qian and Kai Yu. End-to-End Speaker-Dependent Voice Activity Detection. NCMMSC 2019
  18. Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu. Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. IEEE/ACM Transactions on Audio, Speech, and Language Processing
  19. Federico Landini, Shuai Wang, Mireia Diez, Lukáš Burget, Pavel Matějka, Kateřina Žmolíková, Ladislav Mošner, Oldřich Plchot, Ondřej Novotný, Hossein Zeinali, Johan Rohdin. BUT System Description for DIHARD Speech Diarization Challenge 2019 arxiv
  20. Hossein Zeinali, Shuai Wang, Anna Silnova, Pavel Matějka, Oldřich Plchot. BUT System Description to VoxCeleb Speaker Recognition Challenge 2019 arxiv
  21. Shuai Wang, Johan Rohdin, Oldřich Plchot, Lukáš Burget, Kai Yu and Jan Černocký. Investigation of SpecAugment for deep speaker embedding learning. ICASSP 2020
  22. [Yexin Yang, Shuai Wang], Xun Gong, Yanmin Qian and Kai Yu. Text adaptation for speaker verification with speaker-text factorized embeddings. (Joint First Author) ICASSP 2020
  23. Zhengyang Chen, Shuai Wang, Yanmin Qian and Kai Yu. Channel Invariant Speaker Embedding Learning With Joint Multi-task and Adversarial Training. ICASSP 2020
  24. Federico Landini, Shuai Wang, Mireia Diez, Lukáš Burget, Pavel Matějka, Kateřina Žmolíková, Ladislav Mošner, Anna Silnova, Oldřich Plchot, Ondřej Novotný, Hossein Zeinali and Johan Rohdin. BUT System for DIHARD Speech Diarization Challenge 2019. ICASSP 2020
  25. Mireia Diez, Lukáš Burget, Federico Landini, Shuai Wang, Jan Černocký. Optimizing Bayesian HMM based x-vector Clustering for the Second DIHARD Speech Diarization Challenge. ICASSP 2020.