Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning Paper • 2510.24320 • Published 5 days ago • 18
Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents Paper • 2510.23691 • Published 6 days ago • 49
StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification Paper • 2411.07076 • Published Nov 11, 2024
AGILE: A Novel Reinforcement Learning Framework of LLM Agents Paper • 2405.14751 • Published May 23, 2024
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory Paper • 2508.09736 • Published Aug 13 • 56
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels Paper • 2509.16596 • Published Sep 20 • 14