Multimodal audio
facebook/seamless-m4t-v2-large • Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 46.7k • 913
stepfun-ai/Step-Audio-2-mini • Any-to-Any • 8B • Updated Sep 5 • 2.02k • 234
bosonai/higgs-audio-v2-generation-3B-base • Text-to-Speech • 6B • Updated Jul 28 • 431k • 630

Neural codecs
facebook/encodec_48khz • Feature Extraction • 19.1M • Updated Sep 6, 2023 • 7.63k • 33
facebook/encodec_32khz • Feature Extraction • 59M • Updated Sep 4, 2023 • 84.5k • 19
facebook/encodec_24khz • Feature Extraction • 23.3M • Updated Jul 25, 2023 • 676k • 52
descript/dac_44khz • Feature Extraction • 76.6M • Updated Oct 11, 2024 • 189k • 10