Spaces:
Running
Running
What would Alan Turing think of todays LLM's?
#1
by
Whyvette - opened
Just curious
Just curious
What if Turing met ChatGPT?
The first 5 minutes, he'd be thrilled. "Good heavens, it actually holds a conversation!"
The next 5 minutes, he'd be confused. "Wait โ why is it so confident about something it just got completely wrong?"
That's exactly what we found building FINAL Bench.
Today's LLMs ace exams. The problem? They wear the same poker face whether they're right or wrong. In our data, 94.8% of metacognitive failure came from models declaring "I totally know this!" โ and then getting it wrong.
The Turing Test asks: "Can it pass as human?" โ Pass.
FINAL Bench asks: "Does it know what it doesn't know?" โ Fail.
We think Turing would sum it up like this:
"Brilliant speaker. Terrible self-critic."