Running Agents 68 UncheatableEval π 68 Explore model scaling metrics with interactive tables and plots
Meta-Harness: End-to-End Optimization of Model Harnesses Paper β’ 2603.28052 β’ Published 30 days ago β’ 20
Flash-KMeans: Fast and Memory-Efficient Exact K-Means Paper β’ 2603.09229 β’ Published Mar 10 β’ 82