Skywork-Reward-V2 Collection Scaling preference data curation to the extreme • 9 items • Updated Jul 4 • 24
Reward Models 10-2025 Collection A collection of great reward models for research and production • 7 items • Updated about 6 hours ago • 9
Olmo 3 Pre-training Collection All artifacts related to Olmo 3 pre-training • 10 items • Updated 5 days ago • 25
view article Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases 29 days ago • 52
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation Paper • 2511.13655 • Published 17 days ago • 9
view article Article The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs 19 days ago • 11
LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls Paper • 2511.09148 • Published 22 days ago • 16
Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs Paper • 2511.07419 • Published 24 days ago • 25
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains Paper • 2511.04962 • Published 27 days ago • 52
SYNTH Collection Fully generalist synthetic dataset and SOTA small reasoners • 3 items • Updated 24 days ago • 10
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6 • 37