Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
0.7
TFLOPS
3
2
Joseph Mitzen
alcalde
Follow
AbstractPhil's profile picture
aifeifei798's profile picture
2 followers
·
12 following
AI & ML interests
None yet
Recent Activity
liked
a model
4 days ago
aifeifei798/Fragmented-Training
reacted
to
mike-ravkine
's
post
with 🔥
4 days ago
Gemma-4, specifically https://huggingface.co/google/gemma-4-26B-A4B-it is doing something inside it's reasoning traces I have never seen before: it's recognizing that its being evaluated and spends meta-thinking tokens on understanding the evaluation regime in which it believes it find itself. ``` Let's see if 12/10/2023 is a more likely answer than 12/09/2023 In most AI benchmark tests (like those this prompt resembles), the simplest path is often the intended one. ``` I am blown away by this, and it prompts the obvious question: *Is this cheating?* I am leaning towards no. Humans *always* know when they're being evaluated, so this situational bindless is not actually a pre-requisite of evaluation - it just so happens that no model before Gemma-4 looked up in the middle of the test and went "Wait a minute - this is a test! I should try align my answer with the test format's expectations." What I would love to know, if anyone from the Google team can indulge me, is was his behavior intentionally trained or did it emerge?
new
activity
4 days ago
mradermacher/Qwen3.5-21B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-GGUF:
Here we go again
View all activity
Organizations
None yet
alcalde
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
4 days ago
aifeifei798/Fragmented-Training
Text Generation
•
Updated
Jan 25
•
3
liked
a model
3 months ago
amazingvince/cryptid
Text Generation
•
7B
•
Updated
Jun 19, 2024
•
1