Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kwanghee Choi's picture
1

Kwanghee Choi

juice500
21world's profile picture
·
https://kwangheechoi.com
  • juice500ml

AI & ML interests

None yet

Organizations

ESPnet's profile picture Dynamic-SUPERB's profile picture CMU LTI Wav2Gloss Project's profile picture ChangeLing Lab's profile picture

authored 8 papers 2 months ago

Wav2Gloss: Generating Interlinear Glossed Text from Speech

Paper • 2403.13169 • Published Mar 19, 2024

TiDAL: Learning Training Dynamics for Active Learning

Paper • 2210.06788 • Published Oct 13, 2022

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

Paper • 2406.09282 • Published Jun 13, 2024

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Paper • 2409.09506 • Published Sep 14, 2024 • 4

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 5

OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder

Paper • 2507.14129 • Published Jul 18, 2025 • 11

POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

Paper • 2510.24992 • Published Oct 28, 2025 • 4

PRiSM: Benchmarking Phone Realization in Speech Models

Paper • 2601.14046 • Published Jan 20 • 7
authored a paper about 2 years ago

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Paper • 2401.16658 • Published Jan 30, 2024 • 14
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs