yang31210999/Qwen3-4B-Instruct-2507-0809-rank128-lr0.0002-s1k_gptoss20b_high-1k 4B • Updated Aug 9 • 6
yang31210999/Qwen3-4B-Thinking-2507-0809-rank128-lr0.0002-s1k_gptoss20b_low-1k 4B • Updated Aug 9 • 7
yang31210999/Qwen3-4B-Thinking-2507-0809-rank128-lr0.0002-s1k_gptoss20b_high-1k 4B • Updated Aug 9 • 4
yang31210999/Qwen3-4B-Instruct-2507-0809-rank128-lr0.0002-s1k_gptoss20b_low-1k 4B • Updated Aug 9 • 2