arxiv:2605.09063
Seungone Kim PRO
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
authored a paper 2 days ago
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation authored a paper 2 days ago
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs