AI & ML interests
None yet
Organizations
None yet
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.321
Text Generation
• 3B • Updated
• 2
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.337
Text Generation
• 3B • Updated
• 2
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.262
Text Generation
• 3B • Updated
• 2
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.343
Text Generation
• 3B • Updated
• 2
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.329
Text Generation
• 3B • Updated
• 2
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.346
Text Generation
• 3B • Updated
• 2
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.217
Text Generation
• 3B • Updated
• 2
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.339
Text Generation
• 3B • Updated
• 2
Yuhan123/vicuna-7b-wildchat-rephrase
Text Generation
• 7B • Updated
• 2
Yuhan123/qwen-1.5-4b-kto-wildchat
Text Generation
• 4B • Updated
• 1
Yuhan123/qwen-1.5-4b-kto-our
Text Generation
• 4B • Updated
Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.810
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.384
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.676
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.424
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.504
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-full-question-preschool-1-steps-10000-epoch-999-best-eval-score-0.557
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.183
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-1-lr-1e-6-2025-04-15-23-08-40
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-full-question-preschool-1-steps-10000-epoch-999-best-eval-score-0.635
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.667
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.367
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-1-lr-1e-6-2025-04-15-21-10-02
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.537
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.424
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.499
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-full-question-7th-1-steps-10000-epoch-999-best-eval-score-0.278
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.128
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.488
Text Generation
• 3B • Updated
• 1
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.308
Text Generation
• 3B • Updated
• 1