Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
lewtun
's Collections
β Awesome RL datasets π β
β Long-context post-training π§Ά β
H4
Awesome RLHF
Mistral 7B + UltraChat + Arithmo checkpoints
Hub tools
Gemma RLAIF
β Awesome RL datasets π β
updated
Sep 23, 2025
Upvote
1
ScaleAI/SWE-bench_Pro
Benchmark
β’
Updated
24 days ago
β’
731
β’
483k
β’
55
agentica-org/DeepScaleR-Preview-Dataset
Viewer
β’
Updated
Feb 10, 2025
β’
40.3k
β’
7.39k
β’
198
open-r1/DAPO-Math-17k-Processed
Viewer
β’
Updated
Nov 10, 2025
β’
34.8k
β’
5.31k
β’
62
Upvote
1
Share collection
View history
Collection guide
Browse collections