DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Paper • 2512.11558 • Published 20 days ago • 41
Baichuan-M2: Scaling Medical Capability with Large Verifier System Paper • 2509.02208 • Published Sep 2, 2025 • 42
Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization Paper • 2509.09307 • Published Sep 11, 2025 • 6
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis Paper • 2508.13618 • Published Aug 19, 2025 • 18
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation Paper • 2506.18095 • Published Jun 22, 2025 • 66
QFFT, Question-Free Fine-Tuning for Adaptive Reasoning Paper • 2506.12860 • Published Jun 15, 2025 • 18
CoRT: Code-integrated Reasoning within Thinking Paper • 2506.09820 • Published Jun 11, 2025 • 18
Video-R1: Reinforcing Video Reasoning in MLLMs Paper • 2503.21776 • Published Mar 27, 2025 • 79
S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information Paper • 2503.05085 • Published Mar 7, 2025 • 47