MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games
Paper • 2510.15414
MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
Note: This paper has been updated to v2 on arXiv.