ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research
Paper • 2606.07591 • Published • 94
None defined yet.
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research
Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision