Project of MoE reward model
AI & ML interests: None defined yet.
Recent Activity
shengyi-qian authored a paper about 2 months ago: DigiData: Training and Evaluating General-Purpose Mobile Control Agents
zhuokai authored a paper about 2 months ago: Scaling Agent Learning via Experience Synthesis
zhuokai authored a paper 2 months ago: From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding
Team members: 6
MoeReward's models: 6
MoeReward/rl_checkpoints • Updated Jun 27, 2025
MoeReward/lora_checkpoint • Updated Mar 30, 2025
MoeReward/reward_lora_qwen_1_5_base • Updated Mar 21, 2025 • 4
MoeReward/reward_qwen_1_5 • 14B • Updated Mar 17, 2025 • 6
MoeReward/reward_lora_qwen_1_5 • Updated Mar 17, 2025 • 3
MoeReward/sft_full_param_qwen_1_5 • 14B • Updated Mar 16, 2025 • 6