Hey all โ our ResearchClawBench leaderboard just updated ๐ฅ
We let AI do real science: 40 tasks across 10 disciplines, compared to human papers. Hard example? ๐๏ธ Glacier mass change โ AI must integrate 233 datasets from 35 teams, 4 methods, reproduce 6542ยฑ387 Gt ice loss vs IPCC. No toy problems.
Hey all โ our ResearchClawBench leaderboard just updated ๐ฅ
We let AI do real science: 40 tasks across 10 disciplines, compared to human papers. Hard example? ๐๏ธ Glacier mass change โ AI must integrate 233 datasets from 35 teams, 4 methods, reproduce 6542ยฑ387 Gt ice loss vs IPCC. No toy problems.