Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning • Paper • 2601.20829 • Published Jan 28 • 6
guactastesgood/DeepSeek-R1-Distill-Qwen-1.5B-failure-prefix-conditioning-iteration1 • 2B • Updated 25 days ago • 30
guactastesgood/DeepSeek-R1-Distill-Qwen-1.5B-failure-prefix-conditioning-iteration2 • Text Generation • 2B • Updated 25 days ago • 23
guactastesgood/failure-prefix-conditioned-dataset-iteration-2 • Updated 25 days ago • 1.12k • 26