Laurent GIRIN
Professeur Grenoble-INP
Equipe Cognitive Robotics, Interactive Systems, & Speech Processing
Département Parole et Cognition
ME CONTACTER / CONTACT ME
Mail : laurent.girin@gipsa-lab.grenoble-inp.fr

Domaine Universitaire
BP 46
38402 Saint Martin d'Hères cedex

Bureau B366
Tél.33 (0)4 76 57 45 37
Fax : 33 (0)4 76 57 47 10
PUBLICATIONS RECENTES / RECENT PUBLICATIONS
Les derniéres publications de la collection Gipsa dans HAL

Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders

Mostafa Sadeghi, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud. Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, 28, pp.1788-1800. ⟨ 10.1109/TASLP.2020.3000593 ⟩. ⟨ hal-02364900v3 ⟩

A Recurrent Variational Autoencoder for Speech Enhancement

Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud. A Recurrent Variational Autoencoder for Speech Enhancement. IEEE International Conference on Acoustic Speech and Signal Processing (ICASSP), May 2020, Barcelona, Spain. pp.1-7, ⟨ 10.1109/ICASSP40776.2020.9053164 ⟩. ⟨ hal-02329000v2 ⟩

Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers

Yutong Ban, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud. Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers. IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2019, 42, pp.1-17. ⟨ 10.1109/TPAMI.2019.2953020 ⟩. ⟨ hal-01950866v2 ⟩

Audio-Visual Variational Fusion for Multi-Person Tracking with Robots

Xavier Alameda-Pineda, Soraya Arias, Yutong Ban, Guillaume Delorme, Laurent Girin, et al.. Audio-Visual Variational Fusion for Multi-Person Tracking with Robots. ACMMM 2019 - 27th ACM International Conference on Multimedia, Oct 2019, Nice, France. pp.1059-1061, ⟨ 10.1145/3343031.3350590 ⟩. ⟨ hal-02354514 ⟩

Bayesian time-domain multiple sound source localization for a stochastic machine

Raphael Frisch, Marvin Faix, Jacques Droulez, Laurent Girin, Emmanuel Mazer. Bayesian time-domain multiple sound source localization for a stochastic machine. EUSIPCO 2019 - 27th European Signal Processing Conference, Sep 2019, A Coruna, Spain. pp.1-5, ⟨ 10.23919/EUSIPCO.2019.8902666 ⟩. ⟨ hal-02377220 ⟩

Notes on the use of variational autoencoders for speech and audio spectrogram modeling

Laurent Girin, Fanny Roche, Thomas Hueber, Simon Leglaive. Notes on the use of variational autoencoders for speech and audio spectrogram modeling. DAFx 2019 - 22nd International Conference on Digital Audio Effects, Sep 2019, Birmingham, United Kingdom. pp.1-8. ⟨ hal-02349385 ⟩

Audio-noise Power Spectral Density Estimation Using Long Short-term Memory

Xiaofei Li, Simon Leglaive, Laurent Girin, Radu Horaud. Audio-noise Power Spectral Density Estimation Using Long Short-term Memory. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2019, 26 (6), pp.918-922. ⟨ 10.1109/LSP.2019.2911879 ⟩. ⟨ hal-02100059 ⟩

Autoencoders for music sound modeling : a comparison of linear, shallow, deep, recurrent and variational models

Fanny Roche, Thomas Hueber, Samuel Limier, Laurent Girin. Autoencoders for music sound modeling : a comparison of linear, shallow, deep, recurrent and variational models. SMC 2019 - 16th Sound & Music Computing Conference, May 2019, Malaga, Spain. ⟨ hal-02349406 ⟩

Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering

Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud. Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2019, 27 (9), pp.1365-1377. ⟨ 10.1109/TASLP.2019.2919183 ⟩. ⟨ hal-01969041 ⟩

Speech enhancement with variational autoencoders and alpha-stable distributions

Simon Leglaive, Umut Simsekli, Antoine Liutkus, Laurent Girin, Radu Horaud. Speech enhancement with variational autoencoders and alpha-stable distributions. ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.541-545, ⟨ 10.1109/ICASSP.2019.8682546 ⟩. ⟨ hal-02005106 ⟩

ENCADREMENT DE THESES / PhD THESIS SUPERVISED

Grenoble Images Parole Signal Automatique laboratoire

UMR 5216 CNRS - Grenoble INP - Université Joseph Fourier - Université Stendhal