Seungone Kim PRO

seungone

https://seungonekim.github.io/

AI & ML interests

Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment

Recent Activity

upvoted a paper 17 days ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

authored a paper 3 months ago

Measuring Sycophancy of Language Models in Multi-turn Dialogues

authored a paper 3 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Papers 38

arxiv:2511.22173

arxiv:2510.24684

arxiv:2509.21451

arxiv:2508.13141

Spaces 2

My Argilla

Test3

Models 1

seungone/skywork-reward-replicate

Text Classification • 8B • Updated Dec 11, 2024 • 3

Datasets 5

seungone/ablation1_math_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 5.56k • 6

seungone/ablation3_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 24.8k • 6

seungone/ablation2_math_llama3.1_8b_instruct

Viewer • Updated Nov 25, 2024 • 5.99k • 12

seungone/ablation1_code_gpt4o_mini

Viewer • Updated Nov 25, 2024 • 10k • 4

seungone/final-math-claude3.5_sonnet-10000

Viewer • Updated Sep 16, 2024 • 10k • 13 • 1