bezzam's Collections
Multimodel audio
Updated Sep 1
facebook/seamless-m4t-v2-large • Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 46.5k • 913
stepfun-ai/Step-Audio-2-mini • Any-to-Any • 8B • Updated Sep 5 • 2.1k • 234
bosonai/higgs-audio-v2-generation-3B-base • Text-to-Speech • 6B • Updated Jul 28 • 419k • 630