Ministral 3 Collection A collection of edge models with Base, Instruct, and Reasoning variants in three sizes (3B, 8B, and 14B), all with vision capabilities. • 9 items • Updated Dec 2, 2025 • 147
Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale • Running • 20
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents Paper • 2512.23343 • Published 20 days ago • 27
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published 6 days ago • 28
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning Paper • 2601.09088 • Published 4 days ago • 52
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published 26 days ago • 76
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling Paper • 2512.23959 • Published 19 days ago • 103
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 6 days ago • 106
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published 7 days ago • 202
Z Image Turbo Generate stunning images from text descriptions in seconds • Running on Zero • MCP • 1.56k
Fast Inference from Transformers via Speculative Decoding Paper • 2211.17192 • Published Nov 30, 2022 • 10