LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads? Paper • 2510.09595 • Published Oct 10, 2025 • 2
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published Jul 28, 2025 • 83
AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes Paper • 2506.14728 • Published Jun 17, 2025
On Path to Multimodal Historical Reasoning: HistBench and HistAgent Paper • 2505.20246 • Published May 26, 2025
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution Paper • 2505.20286 • Published May 26, 2025 • 8