LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Paper • 2510.14943 • Published 14 days ago • 37
DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published May 1 • 54 • 8
DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published May 1 • 54
Towards Physically Plausible Video Generation via VLM Planning Paper • 2503.23368 • Published Mar 30 • 40