个人简介
李晓飞,河北邯郸人。2007年本科毕业于北京机械工业学院,2013年获北京大学理学博士学位,导师为刘宏教授。2014年2月至2019年12月在法国国家信息与自动化研究所(Inria Grenoble Rhone-Alpes)Perception组工作,历任博士后、Starting Research Scientist,合作导师为Radu Horaud博士。2020年3月全职加入西湖大学任助理教授(博导)。
学术成果
李晓飞博士的研究方向为真实复杂环境下的音频/语音信号处理与感知,包括声源定位、语音增强、语音分离、噪声估计、音频/语音感知等课题,涵盖了信号处理、麦克风阵列、机器学习/深度学习、最优化等研究领域。已发表权威期刊与会议论文30余篇;其多移动说话人定位与跟踪算法达到了本领域前沿水平,已用于三星电子的产品原型;其窄带深度滤波算法解决了深度神经网络语音增强的环境泛化问题,并取得了领先的语音增强性能。
代表论文
1. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (9), pp. 1365 - 1377, 2019.
2. Xiaofei Li*#, Yutong Ban#, Laurent Girin, Xavier Alameda-Pineda and Radu Horaud. Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environment. IEEE Journal of Selected Topics in Signal Processing, 13 (1), pp. 88 - 103, 2019. (#equal contribution)
3. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (3), pp. 645 - 659, 2019.
4. Xiaofei Li*, Simon Leglaive, Laurent Girin, and Radu Horaud. Audio-noise Power Spectral Density Estimation Using Long Short-term Memory. IEEE Signal Processing Letters, 26 (6), pp. 918 - 922, 2019.
5. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26 (10), pp. 1755 - 1768, 2018.
6. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (10), pp. 1997 - 2012, 2017.
7. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 24 (11), pp. 2171 – 2186, 2016.
8. Xiaofei Li and Hong Liu*. Sound Source Localization for HRI Using FOC-based Time Difference Feature and Spatial Grid Matching. IEEE Transactions on Cybernetics, 43 (4), pp. 1199-1212, 2013
9. Bing Yang, Hong Liu*, Cheng Pang and Xiaofei Li. Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27(8), 1241-1255, 2019.
10. Israel D. Gebru*, Silèye Ba, Xiaofei Li and Radu Horaud. Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40 (5), pp. 1086 - 1099, 2018.
11.Cheng Pang, Hong Liu*, Jie Zhang and Xiaofei Li. Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (8), pp. 1618 - 1632, 2017.
12. Pingping Wu, Hong Liu*, Xiaofei Li, Ting Fan, and Xuewu Zhang. A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion. IEEE Transactions on Multimedia, 18 (3), pp. 326 - 338, 2016.
13. Xiaofei Li* and Radu Horaud. Multichannel Speech Enhancement based on Time-frequency Masking using Subband Long Short-Term Memory. WASPAA, 2019.
14. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multisource MINT Using the Convolutive Transfer Function. ICASSP, 2018.
15. Xiaofei Li*, Laurent Girin and Radu Horaud. An EM algorithm for audio source separation based on the convolutive transfer function. WASPAA, 2017.
16. Xiaofei Li*, Laurent Girin and Radu Horaud. Audio Source Separation based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization. ICASSP, 2017.
17. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of Relative Transfer Function in the Presence of Stationary Noise Based on Segmental Power Spectral Density Matrix Subtraction. ICASSP, 2015.
联系方式
电子邮箱:lixiaofei@westlake.edu.cn
课题组正招收博士研究生,欢迎信号与信息处理、模式识别、计算机应用技术等相关专业本科、硕士毕业生申请。具体招生项目请见
https://www.westlake.edu.cn/admissions_aid/graduate/
另长期招聘副研究员、助理研究员、博士后,欢迎音频/语音、多媒体、信号处理、机器学习等相关方向博士学位获得者申请。