A collection of benchmarks for evaluating LMs or VLMs under multi-turn interaction
Young-Jun Lee
passing2961
AI & ML interests
Social Dialogue System, Multi-Modal Dialogue