The Ultra-Scale Playbook 🌌 • 3.72k • The ultimate guide to training LLM on large GPU Clusters
The Smol Training Playbook 📚 • Featured • 3.03k • The secrets to building world-class LLMs
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16, 2025 • 273
MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs Paper • 2505.21327 • Published May 27, 2025 • 83