AgentDoG Collection A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated 10 days ago • 89
LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads? Paper • 2510.09595 • Published Oct 10, 2025 • 2
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published Jul 28, 2025 • 83
GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models Paper • 2505.10983 • Published May 16, 2025 • 2