LLM Safety From Within: Detecting Harmful Content with Internal Representations Paper • 2604.18519 • Published 19 days ago • 26
ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 30 days ago • 262