Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
samsam55 's Collections
Datasets
Self Improving
Run on CPU Optimizations
Deep Search
World View Creation (out painting 3D)
Computer Use
Coding LLMs
Visual Multi Modal LLM
TTS & Speech to Text
Misc
Agents
3D Models & Modeling

TTS & Speech to Text

updated 12 days ago
Upvote
-

  • Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

    Paper • 2510.03117 • Published 25 days ago • 11

  • ResembleAI/chatterbox

    Text-to-Speech • Updated Sep 23 • 868k • • 1.24k

  • thewh1teagle/phonikud

    0.3B • Updated Aug 24 • 149

  • UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

    Paper • 2510.13344 • Published 13 days ago • 61
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs