YANG ZHOU
Yang-Zhou
AI & ML interests
RLHF and DPO
Recent Activity
upvoted
a
paper
about 2 hours ago
RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation
updated
a dataset
3 months ago
Yang-Zhou/DAPO-Math-17k-Qwen3-235B-A22B-Thinking-2507-rejection-distill
published
a dataset
3 months ago
Yang-Zhou/DAPO-Math-17k-Qwen3-235B-A22B-Thinking-2507-rejection-distill
Organizations
None yet