“Focus on scientific research, contribute more to the development of Westlake University.”
Biography
I work at Westlake University as an assistant Professor since March 2020. Prior to this, I worked in the PERCEPTION team at INRIA Grenoble Rhône-Alpes, France, as a post-doctoral researcher from Feb. 2014 to Jan. 2016, and as a starting research scientist from Feb. 2016 to Dec. 2019, hosted by Dr. Radu Horaud. I did my PhD in Electronics at Peking University, during 2007 to 2013, supervised by Prof. Hong Liu. I received a Bachelor degree in Electronic Information from Beijing Institute of Machinery in 2007.
History
2020
Assistant Professor, Westlake University, China
2016
Starting Researcher, Inria, France
2014
Post-doc, Inria, France
2013
Ph.D. degree, Peking University, China
Research
My field of expertise is acoustic/audio/speech signal processing, including the research topics of channel identification/equalization, noise estimation, sound source localization, speech enhancement, speech separation, robust speech recognition, etc. I have two major contributions in the field: the applications of convolutive transfer function to sound source localization and speech dereverberation; narrow-band deep filtering applies deep neural network for signal filtering, more specifically speech denoising.
Representative Publications
1. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (9), pp. 1365 - 1377, 2019.
2. Xiaofei Li*#, Yutong Ban#, Laurent Girin, Xavier Alameda-Pineda and Radu Horaud. Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environment. IEEE Journal of Selected Topics in Signal Processing, 13 (1), pp. 88 - 103, 2019. (#equal contribution)
3. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (3), pp. 645 - 659, 2019.
4. Xiaofei Li*, Simon Leglaive, Laurent Girin, and Radu Horaud. Audio-noise Power Spectral Density Estimation Using Long Short-term Memory. IEEE Signal Processing Letters, 26 (6), pp. 918 - 922, 2019.
5. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26 (10), pp. 1755 - 1768, 2018.
6. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (10), pp. 1997 - 2012, 2017.
7. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 24 (11), pp. 2171 – 2186, 2016.
8. Xiaofei Li and Hong Liu*. Sound Source Localization for HRI Using FOC-based Time Difference Feature and Spatial Grid Matching. IEEE Transactions on Cybernetics, 43 (4), pp. 1199-1212, 2013
9. Bing Yang, Hong Liu*, Cheng Pang and Xiaofei Li. Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27(8), 1241-1255, 2019.
10. Israel D. Gebru*, Silèye Ba, Xiaofei Li and Radu Horaud. Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40 (5), pp. 1086 - 1099, 2018.
11.Cheng Pang, Hong Liu*, Jie Zhang and Xiaofei Li. Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (8), pp. 1618 - 1632, 2017.
12. Pingping Wu, Hong Liu*, Xiaofei Li, Ting Fan, and Xuewu Zhang. A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion. IEEE Transactions on Multimedia, 18 (3), pp. 326 - 338, 2016.
13. Xiaofei Li* and Radu Horaud. Multichannel Speech Enhancement based on Time-frequency Masking using Subband Long Short-Term Memory. WASPAA, 2019.
14. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multisource MINT Using the Convolutive Transfer Function. ICASSP, 2018.
15. Xiaofei Li*, Laurent Girin and Radu Horaud. An EM algorithm for audio source separation based on the convolutive transfer function. WASPAA, 2017.
16. Xiaofei Li*, Laurent Girin and Radu Horaud. Audio Source Separation based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization. ICASSP, 2017.
17. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of Relative Transfer Function in the Presence of Stationary Noise Based on Segmental Power Spectral Density Matrix Subtraction. ICASSP, 2015.
Contact Us
Email: lixiaofei@westlake.edu.cn
Please visit https://audio.westlake.edu.cn/ and https://github.com/Audio-WestlakeU for more information about Dr. Xiaofei Li and his group at Westlake University.