Learning Source Disentanglement in Neural Audio Codec
Xiaoyu BIE*, Xubo Liu, Gaël Richard
arXiv preprint arXiv:2409.11228
[arXiv] [Project page]
Publications
Here is a selection of recent publications, full list can be found on Google Scholar
* indicates equal contribution.
HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE
Xiaoyu BIE*, Wen Guo*, Simon Leglaive, Laurent Girin, Francesc Moreno-Noguer, Xavier Alameda-Pineda
arXiv preprint arXiv:2204.01565
[arXiv]
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders
Xiaoyu BIE, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2022.
[arXiv] [Paper] [Project page] [Code]
Multi-Person Extreme Motion Prediction
Wen Guo*, Xiaoyu BIE*, Xavier Alameda-Pineda, Francesc Moreno-Noguer
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
[arXiv] [Paper] [Project page] [Dataset] [Code]
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin, Simon Leglaive, Xiaoyu BIE, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda
Foundations and Trends in Machine Learning, 2021, Vol. 15, No. 1-2, pp 1–175.
[arXiv] [Paper] [Project page] [Tutorial] [Code]
A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling
Xiaoyu BIE, Laurent Girin, Simon Leglaive, Thomas Hueber, Xavier Alameda-Pineda
Interspeech, 2021.
[arXiv] [Paper] [Project page] [Code]