Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning Paper • 2506.08477 • Published Jun 10 • 4 • 2
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models Paper • 2505.02686 • Published May 5 • 16 • 2
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge Paper • 2412.13670 • Published Dec 18, 2024 • 6 • 2