view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 ⢠92
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper ⢠2508.09834 ⢠Published Aug 13, 2025 ⢠53