pankajmathur's picture
Update README.md
7300c06 verified
metadata
license: apache-2.0
base_model:
  - mistralai/Devstral-Small-2507
tags:
  - code
  - sft
  - rl
  - rlvr
  - grpo
language:
  - en
library_name: transformers
datasets:
  - pankajmathur/orca_mini_v1_dataset
  - pankajmathur/OpenThoughts-Agent-v1-SFT-cleaned
  - princeton-nlp/SWE-bench_Verified
  - nvidia/Nemotron-Terminal-Corpus

RenCoder-Devstral-Small-2507

This model is a SFT + RLVR (DPO+GRPO) version of mistralai/Devstral-Small-2507 on muliple agentic coding datasets (SWE-Bench, NVIDIA Terminal Corpus etc).

"Obsessed with building Open Source AGI, So am I ! Let's create together 🚀 https://www.linkedin.com/in/pankajam"

Model Details

Usage

License

This model inherits the Apache 2.0 license from the base Devstral-Small-2507 model.

Acknowledgements