arxiv:2506.01789
Seungone Kim PRO
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
upvoted a paper 3 days ago: SPICE: Self-Play In Corpus Environments Improves Reasoning
liked a dataset 4 months ago: toloka/u-math
liked a dataset 4 months ago: xw27/scibench