Speech Enhancement and Dereverberation with Diffusion-based Generative Models Paper • 2208.05830 • Published Aug 11, 2022 • 3
Improving Speech Enhancement with Multi-Metric Supervision from Learned Quality Assessment Paper • 2506.12260 • Published Jun 13 • 1
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published Apr 24 • 92
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer Paper • 2308.06873 • Published Aug 14, 2023 • 27
DM-Codec: Distilling Multimodal Representations for Speech Tokenization Paper • 2410.15017 • Published Oct 19, 2024 • 2