Interplay-LM-Reasoning

AI & ML interests

None defined yet.

Recent Activity

Clockz updated a model 27 days ago

Interplay-LM-Reasoning/extrapolation_midtrain

Clockz updated a model 28 days ago

Interplay-LM-Reasoning/context_pretrain_2

Clockz published a model 28 days ago

Interplay-LM-Reasoning/context_pretrain_2

View all activity

updated a model 27 days ago

Interplay-LM-Reasoning/extrapolation_midtrain

Updated 27 days ago

updated a model 28 days ago

Interplay-LM-Reasoning/context_pretrain_2

Updated 28 days ago

published a model 28 days ago

Interplay-LM-Reasoning/context_pretrain_2

Updated 28 days ago

updated a model 28 days ago

Interplay-LM-Reasoning/context_pretrain

Updated 28 days ago

updated a model 29 days ago

Interplay-LM-Reasoning/extrapolation_rl

Updated 29 days ago

updated 2 datasets 3 months ago

Interplay-LM-Reasoning/composition

Viewer • Updated Jan 26 • 129M • 546 • 1

Interplay-LM-Reasoning/context

Viewer • Updated Jan 26 • 33.7M • 316 • 2

authored 4 papers 5 months ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28, 2025 • 31

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Simulating Environments with Reasoning Models for Agent Training

Paper • 2511.01824 • Published Nov 3, 2025 • 2

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 40

in Interplay-LM-Reasoning/extrapolation_midtrain 5 months ago

Add pipeline tag, GitHub link, and improved model description

#1 opened 5 months ago by

in Interplay-LM-Reasoning/extrapolation_rl 5 months ago

Improve model card: Add pipeline tag and GitHub link

#1 opened 5 months ago by

published 2 datasets 5 months ago

Interplay-LM-Reasoning/context

Viewer • Updated Jan 26 • 33.7M • 316 • 2

Interplay-LM-Reasoning/composition

Viewer • Updated Jan 26 • 129M • 546 • 1

published 3 models 5 months ago

Interplay-LM-Reasoning/extrapolation_midtrain

Updated 27 days ago

Interplay-LM-Reasoning/context_pretrain

Updated 28 days ago

Interplay-LM-Reasoning/extrapolation_rl

Updated 29 days ago

authored a paper 5 months ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 40

authored a paper 10 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17, 2025 • 39