weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition Viewer • Updated 9 days ago • 5k • 12
weqweasdas/from_default_filtered_openr1_with_scores_filtered_05_and_filtered_allwrong Viewer • Updated Sep 18 • 25k • 14
weqweasdas/dapo_and_openr1_can_be_evaluated_by_daporm_deduplicate_with_scores Viewer • Updated Sep 16 • 34.1k • 5
weqweasdas/dapo_and_openr1_can_be_evaluated_by_daporm_deduplicate Viewer • Updated Sep 15 • 34.1k • 5
weqweasdas/test_rm_from_default_filtered_openr_math_verify_scores_and_dapo_scores Viewer • Updated Sep 15 • 93.7k • 3
weqweasdas/test_rm_from_default_filtered_openr_math_verify_scores Viewer • Updated Sep 15 • 93.7k • 7
weqweasdas/from_default_filtered_openr1_with_scores_filtered_0125_but_not_all_wrong Viewer • Updated Sep 13 • 13.3k • 6
weqweasdas/from_default_filtered_openr1_with_scores_filtered_0125 Viewer • Updated Sep 13 • 37.8k • 2