publications

2024

  1. Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond
    Jiahong Li*, Chenda Li*, Yifei Wu, and 1 more author
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
  2. URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
    Wangyou Zhang, Robin Scheibler, Kohei Saijo, and 9 more authors
    In Interspeech 2024, Sep 2024

2023

  1. Target Sound Extraction with Variable Cross-Modality Clues
    Chenda Li, Yao Qian, Zhuo Chen, and 5 more authors
    In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2023
  2. Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
    Chenda Li, Yao Qian, Zhuo Chen, and 5 more authors
    In INTERSPEECH 2023, Aug 2023
  3. Predictive Skim: Contrastive Predictive Coding for Low-Latency Online Speech Separation
    Chenda Li, Yifei Wu, and Yanmin Qian
    In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2023
  4. Robust Audio-Visual ASR with Unified Cross-Modal Attention
    Jiahong Li, Chenda Li, Yifei Wu, and 1 more author
    In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2023
  5. Light-Weight Visualvoice: Neural Network Quantization On Audio Visual Speech Separation
    Yifei Wu, Chenda Li, and Yanmin Qian
    In 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), Jun 2023

2022

  1. Skim: Skipping Memory Lstm for Low-Latency Real-Time Continuous Speech Separation
    Chenda Li, Lei Yang, Weiqin Wang, and 1 more author
    In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022
  2. Dual-Path Modeling With Memory Embedding Model for Continuous Speech Separation
    Chenda Li, Zhuo Chen, and Yanmin Qian
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, May 2022
  3. ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
    Yen-Ju Lu, Xuankai Chang, Chenda Li, and 10 more authors
    In Interspeech 2022, Sep 2022
  4. Towards Low-Distortion Multi-Channel Speech Enhancement: The ESPNET-Se Submission to the L3DAS22 Challenge
    Yen-Ju Lu, Samuele Cornell, Xuankai Chang, and 5 more authors
    In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022
  5. The Sjtu System For Multimodal Information Based Speech Processing Challenge 2021
    Wei Wang, Xun Gong, Yifei Wu, and 5 more authors
    In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022
  6. Time-Domain Audio-Visual Speech Separation on Low Quality Videos
    Yifei Wu, Chenda Li, Jinfeng Bai, and 2 more authors
    In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022

2021

  1. ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration
    Chenda Li*, Jing Shi*, Wangyou Zhang*, and 8 more authors
    In 2021 IEEE Spoken Language Technology Workshop (SLT), Jan 2021
  2. Recent Developments on Espnet Toolkit Boosted By Conformer
    Pengcheng Guo, Florian Boyer, Xuankai Chang, and 12 more authors
    In icassp, Jun 2021
  3. Continuous Speech Separation Using Speaker Inventory for Long Recording
    Cong Han, Yi Luo, Chenda Li, and 8 more authors
    In Proc. Interspeech, Aug 2021
  4. Dual-Path Modeling for Long Recording Speech Separation in Meetings
    Chenda Li, Zhuo Chen, Yi Luo, and 6 more authors
    In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2021
  5. Dual-Path RNN for Long Recording Speech Separation
    Chenda Li, Yi Luo, Cong Han, and 9 more authors
    In 2021 IEEE Spoken Language Technology Workshop (SLT), Jan 2021
  6. Rethinking The Separation Layers In Speech Separation Networks
    Yi Luo, Zhuo Chen, Cong Han, and 3 more authors
    In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2021
  7. The 2020 ESPnet Update: New Features, Broadened Applications, Performance Improvements, and Future Plans
    Shinji Watanabe, Florian Boyer, Xuankai Chang, and 12 more authors
    In 2021 IEEE Data Science and Learning Workshop (DSLW), Jun 2021
  8. Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party
    Yifei Wu, Chenda Li, Song Yang, and 2 more authors
    In Interspeech 2021, Aug 2021
  9. Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions
    Wangyou Zhang, Jing Shi, Chenda Li, and 2 more authors
    In 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2021

2020

  1. Deep Audio-Visual Speech Separation with Attention Mechanism
    Chenda Li, and Yanmin Qian
    In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2020
  2. Listen, Watch and Understand at the Cocktail Party: Audio-Visual-Contextual Speech Separation
    Chenda Li, and Yanmin Qian
    In Interspeech 2020, Oct 2020

2019

  1. Prosody Usage Optimization for Children Speech Recognition with Zero Resource Children Speech
    Chenda Li, and Yanmin Qian
    In Proc. Interspeech 2019, Oct 2019