This collection contains curriculum-RLed Olmo models.
SeanWang0027 PRO
SeanWang0027
AI & ML interests
Continual Learning
Recent Activity
published a dataset about 1 hour ago
SeanWang0027/rlve_30b_qwen_1.7b_mixed_20envs_10 updated a dataset about 4 hours ago
SeanWang0027/rlve_30b_qwen_1.7b_mixed_20envs_10 updated a dataset 1 day ago
CL-From-Nothing/RLVE-EvalOrganizations
models 36
SeanWang0027/mixed_sdft_solution_sudoku_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b
Updated • 2
SeanWang0027/dolci-wildchat-think-singleturn
Updated
SeanWang0027/student_prefix_kukurasu_20K_nemotron8b_continual_Q_nemotron-cascade-8b_cutoff2048_epoch_3_mask
8B • Updated • 19
SeanWang0027/student_prefix_kukurasu_20K_nemotron8b_continual_Q_nemotron-cascade-8b_cutoff1024_epoch_3_mask
Updated
SeanWang0027/student_prefix_kukurasu_20K_nemotron8b_continual_Q_nemotron-cascade-8b_cutoff512_epoch_3_mask
8B • Updated • 17
SeanWang0027/sdft_sudoku_minesweeper_kukurasu_Qwen3-1.7B_1_epoch_8192_32_batch_2e-5_lr
2B • Updated • 17
SeanWang0027/student_prefix_kukurasu_20K_qwen3_1-7b_continual_Q_qwen3-1.7b_cutoff2048_epoch_3_mask
2B • Updated • 7
SeanWang0027/student_prefix_kukurasu_20K_qwen3_1-7b_continual_Q_qwen3-1.7b_cutoff1024_epoch_3_mask
2B • Updated • 15
SeanWang0027/student_prefix_kukurasu_20K_qwen3_1-7b_continual_Q_qwen3-1.7b_cutoff512_epoch_3_mask
2B • Updated • 10
SeanWang0027/sdft_minesweeper_kukurasu_Qwen3-1.7B_1_epoch_8192_32_batch_2e-5_lr
2B • Updated • 9
datasets 27
SeanWang0027/rlve_30b_qwen_1.7b_mixed_20envs_10
Viewer • Updated • 16k
SeanWang0027/teacher_prefix_sudoku_10K_sequential_qwen3_4b_thinking_continual_nemotron-cascade-8b
Updated • 27
SeanWang0027/student_prefix_sequential
Viewer • Updated • 3k • 29 • 1
SeanWang0027/RAGEN
Updated • 891
SeanWang0027/mixed_sdft_solution_sequential_minesweeper_kukurasu_qwen3_4b_thinking
Updated • 41
SeanWang0027/teacher_prefix_sudoku_10K_qwen3_4b_thinking_continual_qwen3-1-7b-parquet_qwen3-1.7b_epoch_3
Updated • 30
SeanWang0027/mixed_sdft_solution_kukurasu_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b
Updated • 41
SeanWang0027/mixed_sdft_solution_minesweeper_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b
Updated • 42
SeanWang0027/mixed_sdft_solution_sudoku_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_nemotron8b
Updated • 42
SeanWang0027/mixed_sdft_solution_minesweeper_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_nemotron8b
Updated • 44