Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lewtun
's Collections
β Awesome RL datasets π β
β Long-context post-training π§Ά β
H4
Awesome RLHF
Mistral 7B + UltraChat + Arithmo checkpoints
Hub tools
Gemma RLAIF
β Awesome RL datasets π β
updated
Sep 23, 2025
Upvote
1
ScaleAI/SWE-bench_Pro
Viewer
β’
Updated
Sep 25, 2025
β’
731
β’
11.9k
β’
43
agentica-org/DeepScaleR-Preview-Dataset
Viewer
β’
Updated
Feb 10, 2025
β’
40.3k
β’
8.56k
β’
183
open-r1/DAPO-Math-17k-Processed
Viewer
β’
Updated
Nov 10, 2025
β’
34.8k
β’
5.07k
β’
53
Upvote
1
Share collection
View history
Collection guide
Browse collections