SDewittCLathrop3PhD's Collections

SPEECH TO TEXT

updated 25 days ago

  • Running
    222

    Qwen3 ASR Demo

    👀

    Convert audio to text with context and language options


  • Running on Zero
    2.57k

    Whisper

    📉

    Transcribe audio files or YouTube videos into text


  • openai/whisper-large-v3

    Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.17M • 5.03k

  • Running
    45

    Qwen3 Omni Captioner Demo

    🐠

    Generate captions from audio


  • Qwen/Qwen3-Omni-30B-A3B-Captioner

    Any-to-Any • 32B • Updated Sep 22 • 35.3k • 164

  • nvidia/parakeet-tdt-0.6b-v3

    Automatic Speech Recognition • Updated Sep 18 • 219k • 370

  • LiquidAI/LFM2-Audio-1.5B

    Audio-to-Audio • 1B • Updated Sep 19 • 3.08k • 260