Reward Hacking in Reasoning Models Collection Do reasoning LLMs actually reason, or do they learn to game the test? IPT detects reward hacking in inductive programming tasks (SLR-Bench). • 4 items • Updated May 18 • 1
ActivationReasoning: Logical Reasoning in Latent Activation Spaces Paper • 2510.18184 • Published Oct 21, 2025 • 2
Scalable Logical Reasoning Collection A collection of scalable logical reasoning tasks • 14 items • Updated May 18 • 2
SLR: An Automated Synthesis Framework for Scalable Logical Reasoning Paper • 2506.15787 • Published Jun 18, 2025 • 3
LlavaGuard Collection This collection contains the original repos of the LlavaGuard releases • 17 items • Updated Mar 2 • 7
Llama 3.2 Collection This collection hosts the transformers and original repos of Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 675
InternVL2.0 Collection Expanding Performance Boundaries of Open-Source MLLM • 13 items • Updated Mar 2 • 89
V-LoL: A Diagnostic Dataset for Visual Logical Learning Paper • 2306.07743 • Published Jun 13, 2023 • 1
LLavaGuard: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment Paper • 2406.05113 • Published Jun 7, 2024 • 3