InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training • Paper • 2510.15859 • Published Oct 17, 2025
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization • Paper • 2508.05731 • Published Aug 7, 2025
FreedomIntelligence/medical-o1-reasoning-SFT • Updated Apr 22, 2025