Beyond Steering Vector: Flow-based Activation Steering for Inference-Time Intervention Paper • 2605.05892 • Published 8 days ago • 1
How Far Are VLMs from Privacy Awareness in the Physical World? An Empirical Study Paper • 2605.05340 • Published 7 days ago • 1
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue Paper • 2605.05630 • Published 3 days ago • 8
The Trojan Knowledge: Bypassing Commercial LLM Guardrails via Harmless Prompt Weaving and Adaptive Tree Search Paper • 2512.01353 • Published Dec 1, 2025 • 2
Measuring Physical-World Privacy Awareness of Large Language Models: An Evaluation Benchmark Paper • 2510.02356 • Published Sep 27, 2025 • 11
Exploring $\ell_0$ Sparsification for Inference-free Sparse Retrievers Paper • 2504.14839 • Published Apr 21, 2025 • 5