MENTOR TEAM

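We welcome outstanding scholars, researchers, and young scientists from all academic backgrounds. By 2026, Westlake University expects to have 300 assistant, associate, and full professors (including chair professors), 600 research, teaching, technical support, and administrative staff, and 900 postdoctoral researchers.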

Dr. Xiaofei Li (李晓飞)

Xiaofei Li, Ph.D.

School of Engineering

Artificial Intelligence and Data Science

Audio Signal and Information Processing Lab

Contact

Email: lixiaofei@westlake.edu.cn

Website: https://audio.westlake.edu.cn/


Biography


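Xiaofei Li is a native of Handan, Hebei Province. Li received a bachelor's degree from Beijing Institute of Machinery (北京机械工业学院) in 2007 and a Doctor of Science (Ph.D.) degree from Peking University in 2013, supervised by Prof. Hong Liu. From February 2014 to December 2019, Li worked in the Perception team at Inria Grenoble Rhône-Alpes (the French National Institute for Research in Computer Science and Automation), first as a postdoctoral researcher and then as a Starting Research Scientist, collaborating with Dr. Radu Horaud. In March 2020, Li joined Westlake University full-time as an assistant professor and Ph.D. advisor.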


Research Achievements


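Dr. Li's research focuses on audio and speech signal processing and perception in real-world, complex environments. Topics include sound source localization, speech enhancement, speech separation, noise estimation, and audio/speech perception, spanning signal processing, microphone arrays, machine learning/deep learning, and optimization. Dr. Li has published more than 30 papers in leading journals and conferences. The algorithm for localizing and tracking multiple moving speakers reaches the state of the art in the field and has been used in a Samsung Electronics product prototype. The narrow-band deep filtering algorithm addresses the problem of generalizing deep-neural-network speech enhancement across acoustic environments and achieves leading speech enhancement performance.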


Selected Publications


1. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (9), pp. 1365 - 1377, 2019.

2. Xiaofei Li*#, Yutong Ban#, Laurent Girin, Xavier Alameda-Pineda and Radu Horaud. Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environment. IEEE Journal of Selected Topics in Signal Processing, 13 (1), pp. 88 - 103, 2019. (#equal contribution)

3. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (3), pp. 645 - 659, 2019.

4. Xiaofei Li*, Simon Leglaive, Laurent Girin, and Radu Horaud. Audio-noise Power Spectral Density Estimation Using Long Short-term Memory. IEEE Signal Processing Letters, 26 (6), pp. 918 - 922, 2019.

5. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26 (10), pp. 1755 - 1768, 2018.

6. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (10), pp. 1997 - 2012, 2017.

7. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 24 (11), pp. 2171 - 2186, 2016.

8. Xiaofei Li and Hong Liu*. Sound Source Localization for HRI Using FOC-based Time Difference Feature and Spatial Grid Matching. IEEE Transactions on Cybernetics, 43 (4), pp. 1199 - 1212, 2013.

9. Bing Yang, Hong Liu*, Cheng Pang and Xiaofei Li. Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (8), pp. 1241 - 1255, 2019.

10. Israel D. Gebru*, Silèye Ba, Xiaofei Li and Radu Horaud. Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40 (5), pp. 1086 - 1099, 2018.

11. Cheng Pang, Hong Liu*, Jie Zhang and Xiaofei Li. Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (8), pp. 1618 - 1632, 2017.

12. Pingping Wu, Hong Liu*, Xiaofei Li, Ting Fan, and Xuewu Zhang. A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion. IEEE Transactions on Multimedia, 18 (3), pp. 326 - 338, 2016.

13. Xiaofei Li* and Radu Horaud. Multichannel Speech Enhancement based on Time-frequency Masking using Subband Long Short-Term Memory. WASPAA, 2019.

14. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multisource MINT Using the Convolutive Transfer Function. ICASSP, 2018.

15. Xiaofei Li*, Laurent Girin and Radu Horaud. An EM algorithm for audio source separation based on the convolutive transfer function. WASPAA, 2017.

16. Xiaofei Li*, Laurent Girin and Radu Horaud. Audio Source Separation based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization. ICASSP, 2017.

17. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of Relative Transfer Function in the Presence of Stationary Noise Based on Segmental Power Spectral Density Matrix Subtraction. ICASSP, 2015.


Contact Information


Email: lixiaofei@westlake.edu.cn


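The group is recruiting Ph.D. students. Graduates with a bachelor's or master's degree in signal and information processing, pattern recognition, computer application technology, or related fields are welcome to apply. For details on the admissions programs, see
https://www.westlake.edu.cn/admissions_aid/graduate/
The group also has long-term openings for associate researchers, assistant researchers, and postdoctoral researchers. Applicants holding a doctoral degree in audio/speech, multimedia, signal processing, machine learning, or related areas are welcome to apply.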