Edit Models filters
Model Tree
Apps
Inference Providers
Models
22,659
Active filters: grpo
Lazarus-Ai/ReAligned-Qwen3.5-35B-A3B
Text Generation • 35B • Updated • 37 • 3
alireza7/GrepSeek-Qwen3.5-9B-GRPO
Text Generation • 9B • Updated • 179 • 3
Lazarus-Ai/ReAligned-Qwen3.5-27B-GGUF
Text Generation • 27B • Updated • 1.44k • 3
Lazarus-Ai/ReAligned-Qwen3.5-27B
Text Generation • 27B • Updated • 32 • 2
mudasir13cs/qwen25-vl-3b-floorplan-grpo
Image-to-Text • Updated • 408 • 5
Lazarus-Ai/ReAligned-Qwen3.5-35B-A3B-GGUF
Text Generation • 35B • Updated • 1.41k • 2
olaverse/MIST-Mini-8B-Thinking
Text Generation • 8B • Updated • 170 • • 2
Lazarus-Ai/ReAligned-Qwen3.5-0.8B
Text Generation • 0.9B • Updated • 49 • 1
Lazarus-Ai/ReAligned-Qwen3.5-2B
Text Generation • 2B • Updated • 24 • 1
Lazarus-Ai/ReAligned-Qwen3.5-4B
Text Generation • 5B • Updated • 90 • 1
Lazarus-Ai/ReAligned-Qwen3.5-9B
Text Generation • 9B • Updated • 25 • 1
zosmaai/Qwen3.5-0.8B-GRPO-Math
Text Generation • 0.8B • Updated • 21 • 1
Ayansk11/FinSenti-Qwen3-4B
Text Generation • 4B • Updated • 268 • • 1
zhaohq/PureRL-7B-v5-main
Text Generation • 8B • Updated • 22 • 1
ParaVT/ParaVT-8B
Video-Text-to-Text • 9B • Updated • 406 • 4
mradermacher/PureRL-7B-v5-main-GGUF
8B • Updated • 592 • 2
mradermacher/ParaVT-8B-GGUF
8B • Updated • 606 • 1
MeiGen-AI/GenEvolve
Image-Text-to-Text • 9B • Updated • 158 • 6
Lazarus-Ai/ReAligned-Qwen3.5-0.8B-GGUF
Text Generation • 0.8B • Updated • 638 • 1
Lazarus-Ai/ReAligned-Qwen3.5-2B-GGUF
Text Generation • 2B • Updated • 719 • 1
Lazarus-Ai/ReAligned-Qwen3.5-4B-GGUF
Text Generation • 4B • Updated • 699 • 1
Lazarus-Ai/ReAligned-Qwen3.5-9B-GGUF
Text Generation • 9B • Updated • 1.63k • 1
kridaydave/Qwen-1.5B-LFGRPO-OPTIM
Text Generation • Updated • 45 • 1
mradermacher/Atomight-V2.1-0.5B-Inference-GGUF
Text Generation • 0.5B • Updated • 906 • 1
mradermacher/Atomight-V2.1-0.5B-Inference-i1-GGUF
Text Generation • 0.5B • Updated • 3.27k • 1
EvanOLeary/laguna-xs2-dense-k8-cuda-grpo
Text Generation • 3B • Updated • 105 • 1
gabriel-xiong/qwen3-8b-grpo-v2-epoch2
8B • Updated • 1
NovatasticRoScript/Atomight-V2.1-0.5B-Inference
Text Generation • 0.5B • Updated • 720 • • 2
Chun121/Qwen3-4B-RPG-Roleplay-V2
Text Generation • 4B • Updated • 14.6k • 56