Are complicated loss functions necessary for teaching LLMs to reason? Paper • 2603.18756 • Published 17 days ago • 1
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 10 days ago • 125
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 21
Global PIQA Collection A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. • 2 items • Updated Feb 2 • 2
Quantifying the Carbon Emissions of Machine Learning Paper • 1910.09700 • Published Oct 21, 2019 • 41
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 26 days ago • 103
Biased Tales: Cultural and Topic Bias in Generating Children's Stories Paper • 2509.07908 • Published Sep 9, 2025 • 1
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards Paper • 2601.06021 • Published Jan 9 • 47
ACE Collection Ai2 Climate Emulator (ACE) is a family of fast ML models that simulate global atmospheric variability over time scales ranging from hours to centuries • 9 items • Updated 22 days ago • 13
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated Mar 2 • 88