TIGER-Lab/MMLU-Pro • Benchmark • Updated • 12.1k • 70.6k • 405
Natural Language Processing, Image Generation
VisCoder2: Building Multi-Language Visualization Coding Agents
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions