Models: 93

Active filters: quant

degirum/test

Updated May 23, 2022

degirum/testmodel1

Updated May 23, 2022

digitous/13B-HyperMantis_GPTQ_4bit-128g

Text Generation • Updated May 24, 2023 • 6 • 12

pszemraj/nougat-small-onnx-quant_avx2

Image-Text-to-Text • Updated Dec 29, 2025 • 13

pszemraj/nougat-base-onnx-quant_avx2

Image-Text-to-Text • Updated Dec 29, 2025 • 14

fhai50032/RolePlayLake-7B-GGUF

7B • Updated Jan 30, 2024 • 46 • 3

oldbridge/latxa-7b-instruct-q8

Text Generation • 7B • Updated Nov 19, 2024 • 9

pszemraj/nougat-small-onnx-quant_avx512_vnni

Image-Text-to-Text • Updated Dec 29, 2025 • 3

RDson/Llama-3-Magenta-Instruct-4x8B-MoE-GGUF

25B • Updated May 17, 2024 • 245 • 1

TroyDoesAI/Codestral-21B-Pruned

Text Generation • 21B • Updated Jun 1, 2024 • 4 • 2

mradermacher/Codestral-21B-Pruned-GGUF

21B • Updated Jun 1, 2024 • 90

mradermacher/Codestral-21B-Pruned-i1-GGUF

21B • Updated Aug 2, 2024 • 572

pszemraj/candle-flanUL2-quantized

Text Generation • 19B • Updated Dec 29, 2025 • 12

byroneverson/gemma-2-27b-it-abliterated-gguf

Text Generation • 27B • Updated Sep 9, 2024 • 88 • 12

QuantFactory/gemma-2-27b-it-abliterated-GGUF

Text Generation • 27B • Updated Nov 6, 2024 • 1.33k • 7

EmperorKronos/gemma-2-27b-it-abliterated-exl2

Text Generation • Updated Sep 30, 2024 • 2

byroneverson/LongWriter-glm4-9b-abliterated-gguf

Text Generation • 9B • Updated Oct 19, 2024 • 48 • 3

aiqwe/FinShibainu

Question Answering • 8B • Updated Dec 18, 2024 • 9 • 4

mradermacher/FinShibainu-GGUF

8B • Updated Dec 18, 2024 • 172 • 1

eaddario/Hammer2.1-7b-GGUF

Text Generation • 8B • Updated Apr 24, 2025 • 670 • 2

eaddario/DeepSeek-R1-Distill-Qwen-7B-GGUF

Text Generation • 8B • Updated Apr 24, 2025 • 575 • 3

eaddario/Watt-Tool-8B-GGUF

Text Generation • 8B • Updated Apr 24, 2025 • 504 • 5

eaddario/DeepSeek-R1-Distill-Llama-8B-GGUF

Text Generation • 8B • Updated Apr 19, 2025 • 445 • 1

shisa-ai/Llama-3.1-Tulu-3-405B-FP8-Dynamic

Text Generation • 406B • Updated Feb 9, 2025 • 10

eaddario/Dolphin3.0-R1-Mistral-24B-GGUF

Text Generation • 24B • Updated May 10, 2025 • 3.02k • 1

eaddario/Llama-Guard-3-8B-GGUF

Text Generation • 8B • Updated Apr 25, 2025 • 1.47k

eaddario/Dolphin3.0-Mistral-24B-GGUF

Text Generation • 24B • Updated May 10, 2025 • 495 • 2

eaddario/Llama-xLAM-2-8b-fc-r-GGUF

Text Generation • 8B • Updated Apr 27, 2025 • 190 • 1

eaddario/Qwen3-8B-GGUF

Text Generation • 8B • Updated Apr 30, 2025 • 326 • 1

eaddario/OLMo-2-1124-7B-Instruct-GGUF

Text Generation • 7B • Updated May 3, 2025 • 183