MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition • Paper • 2510.04136 • Published 22 days ago
Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach • Paper • 2505.14336 • Published May 20
Adaptive Audio-Visual Speech Recognition via Matryoshka-Based Multimodal LLMs • Paper • 2503.06362 • Published Mar 9