UniGame: Turning a Unified Multimodal Model Into Its Own Adversary • Paper • 2511.19413 • Published Nov 24, 2025 • 20
How Far Are Surgeons from Surgical World Models? A Pilot Study on Zero-shot Surgical Video Generation with Expert Assessment • Paper • 2511.01775 • Published Nov 3, 2025 • 6
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models • Paper • 2510.04618 • Published Oct 6, 2025 • 127
DeepAgent: A General Reasoning Agent with Scalable Toolsets • Paper • 2510.21618 • Published Oct 24, 2025 • 99
Glyph: Scaling Context Windows via Visual-Text Compression • Paper • 2510.17800 • Published Oct 20, 2025 • 67
FlashWorld: High-quality 3D Scene Generation within Seconds • Paper • 2510.13678 • Published Oct 15, 2025 • 72
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features • Paper • 2502.14786 • Published Feb 20, 2025 • 156
Diffusion Transformers with Representation Autoencoders • Paper • 2510.11690 • Published Oct 13, 2025 • 165
Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process • Paper • 2406.18361 • Published Jun 26, 2024 • 1
Are Pixel-Wise Metrics Reliable for Sparse-View Computed Tomography Reconstruction? • Paper • 2506.02093 • Published Jun 2, 2025 • 2