The ADASP group of Télécom-Paris has 12 papers accepted at the 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025).

This month’s Listen-Lab seminar will feature a conference debrief, where the ADASP group members will share key highlights and their favorite papers from ICASSP 2025.

  1. Zero-shot Musical Stem Retrieval with Joint-Embedding Predictive Architectures
    Alain Riou, Antonin Gagneré, Gaëtan Hadjeres, Stefan Lattner, Geoffroy Peeters
  2. Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning
    Aurian Quelennec, Pierre Chouteau, Geoffroy Peeters, Slim Essid
  3. Perceptual Noise-Masking with Music through Deep Spectral Envelope Shaping
    Clémentine Berger, Roland Badeau, Slim Essid
  4. Multiple Choice Learning for Efficient Speech Separation with Many Speakers
    David Perera, François Derrida, Théo Mariotte, Gaël Richard, Slim Essid
  5. O-EENC-SD: Efficient Online End-to-End Neural Clustering for Speaker Diarization
    Elio Gruttadauria, Mathieu Fontaine, Jonathan Le Roux, Slim Essid
  6. Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges
    Geoffroy Peeters, Zafar Rafii, Magdalena Fuentes, Zhiyao Duan, Emmanouil Benetos, Juhan Nam, Yuki Mitsufuji
  7. A Hybrid Model for Weakly-Supervised Speech Dereverberation
    Louis Bahrman, Mathieu Fontaine, Gaël Richard
  8. F-StrIPE: Fast Structure-Informed Positional Encoding for Symbolic Music Generation
    Manvi Agarwal, Changhong Wang, Gaël Richard
  9. AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
    Samir Sadok, Simon Leglaive, Laurent Girin, Gaël Richard, Xavier Alameda-Pineda
  10. Contrastive Knowledge Distillation for Embedding Refinement in Personalized Speech Enhancement
    Thomas Serre, Mathieu Fontaine, Éric Benhaim, Slim Essid
  11. Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects
    Victor Deng, Changhong Wang, Gaël Richard, Brian McFee
  12. Learning Source Disentanglement in Neural Audio Codec 
    Xiaoyu Bie, Xubo Liu, Gaël Richard

Links: