李晓飞博士 - 人工智能与数据科学 - 工学院 - 导师团队

MENTOR TEAM

导师团队

我们欢迎拥有各种学术背景的杰出学者、研究员和年轻科学家。到 2026 年，西湖大学预计将拥有 300 名助理教授、副教授和正教授（包括讲座教授），600 名研究、教学、技术支持和行政人员以及 900 名博士后研究员。

李晓飞博士

Xiaofei Li, Ph.D.

工学院

人工智能与数据科学

音频信号与信息处理实验室

联系

邮箱: 邮箱: lixiaofei@westlake.edu.cn

网站: https://audio.westlake.edu.cn/

个人简介

李晓飞，河北邯郸人。2007年本科毕业于北京机械工业学院，2013年获北京大学理学博士学位，导师为刘宏教授。2014年2月至2019年12月在法国国家信息与自动化研究所（Inria Grenoble Rhone-Alpes）Perception组工作，历任博士后、Starting Research Scientist，合作导师为Radu Horaud博士。2020年3月全职加入西湖大学任助理教授（博导）。

学术成果

李晓飞博士的研究方向为真实复杂环境下的音频/语音信号处理与感知，包括声源定位、语音增强、语音分离、噪声估计、音频/语音感知等课题，涵盖了信号处理、麦克风阵列、机器学习/深度学习、最优化等研究领域。已发表权威期刊与会议论文30余篇；其多移动说话人定位与跟踪算法达到了本领域前沿水平，已用于三星电子的产品原型；其窄带深度滤波算法解决了深度神经网络语音增强的环境泛化问题，并取得了领先的语音增强性能。

代表论文

1. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (9), pp. 1365 - 1377, 2019.

2. Xiaofei Li*#, Yutong Ban#, Laurent Girin, Xavier Alameda-Pineda and Radu Horaud. Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environment. IEEE Journal of Selected Topics in Signal Processing, 13 (1), pp. 88 - 103, 2019. (#equal contribution)

3. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (3), pp. 645 - 659, 2019.

4. Xiaofei Li*, Simon Leglaive, Laurent Girin, and Radu Horaud. Audio-noise Power Spectral Density Estimation Using Long Short-term Memory. IEEE Signal Processing Letters, 26 (6), pp. 918 - 922, 2019.

5. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26 (10), pp. 1755 - 1768, 2018.

6. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (10), pp. 1997 - 2012, 2017.

7. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 24 (11), pp. 2171 – 2186, 2016.

8. Xiaofei Li and Hong Liu*. Sound Source Localization for HRI Using FOC-based Time Difference Feature and Spatial Grid Matching. IEEE Transactions on Cybernetics, 43 (4), pp. 1199-1212, 2013

9. Bing Yang, Hong Liu*, Cheng Pang and Xiaofei Li. Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27(8), 1241-1255, 2019.

10. Israel D. Gebru*, Silèye Ba, Xiaofei Li and Radu Horaud. Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40 (5), pp. 1086 - 1099, 2018.

11.Cheng Pang, Hong Liu*, Jie Zhang and Xiaofei Li. Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (8), pp. 1618 - 1632, 2017.

12. Pingping Wu, Hong Liu*, Xiaofei Li, Ting Fan, and Xuewu Zhang. A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion. IEEE Transactions on Multimedia, 18 (3), pp. 326 - 338, 2016.

13. Xiaofei Li* and Radu Horaud. Multichannel Speech Enhancement based on Time-frequency Masking using Subband Long Short-Term Memory. WASPAA, 2019.

14. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multisource MINT Using the Convolutive Transfer Function. ICASSP, 2018.

15. Xiaofei Li*, Laurent Girin and Radu Horaud. An EM algorithm for audio source separation based on the convolutive transfer function. WASPAA, 2017.

16. Xiaofei Li*, Laurent Girin and Radu Horaud. Audio Source Separation based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization. ICASSP, 2017.

17. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of Relative Transfer Function in the Presence of Stationary Noise Based on Segmental Power Spectral Density Matrix Subtraction. ICASSP, 2015.

联系方式

电子邮箱：lixiaofei@westlake.edu.cn

课题组正招收博士研究生，欢迎信号与信息处理、模式识别、计算机应用技术等相关专业本科、硕士毕业生申请。具体招生项目请见
https://www.westlake.edu.cn/admissions_aid/graduate/
另长期招聘副研究员、助理研究员、博士后，欢迎音频/语音、多媒体、信号处理、机器学习等相关方向博士学位获得者申请。