Active filters: fp8
Qwen/Qwen3-VL-4B-Instruct-FP8 • Image-Text-to-Text • 5B • Updated • 29.2k downloads • 50 likes
Qwen/Qwen3-VL-4B-Thinking-FP8 • Image-Text-to-Text • 5B • Updated • 2.71k downloads • 30 likes
Qwen/Qwen3-VL-32B-Instruct-FP8 • Image-Text-to-Text • Updated • 403k downloads • 43 likes
Qwen/Qwen3-VL-2B-Instruct-FP8 • Image-Text-to-Text • 2B • Updated • 182k downloads • 38 likes
Qwen/Qwen3-VL-2B-Thinking-FP8 • Image-Text-to-Text • 2B • Updated • 1.27k downloads • 26 likes
RedHatAI/granite-4.0-h-small-FP8-dynamic • Text Generation • 32B • Updated • 718 downloads • 2 likes
ai-sage/GigaChat3-10B-A1.8B • Text Generation • 11B • Updated • 3.52k downloads • 64 likes
• Text-to-Image • Updated • 13.9k downloads • 40 likes
XiaomiMiMo/MiMo-V2-Flash-Base • Text Generation • 310B • Updated • 114 downloads • 44 likes
RedHatAI/Ministral-3-14B-Instruct-2512 • 14B • Updated • 332 downloads • 2 likes
sh0ck0r/Llama-3.3-70B-Vulpecula-r1-FP8-Dynamic • Text Generation • 71B • Updated • 5 downloads • 1 like
Ilus-AI/Qwen-Image-2512-FP8 • Text-to-Image • Updated • 58 downloads • 1 like
YifeiDevs/Huihui-Qwen3-8B-abliterated-v2-FP8-Comfy
Ex0bit/MiniMax-M2.5-PRISM-PRO-Tensors • 229B • Updated • 16 downloads • 2 likes
alexliap/Qwen3-VL-Embedding-2B-FP8-DYNAMIC • Feature Extraction • 2B • Updated • 12.5k downloads • 2 likes
mratsim/MiniMax-M2.5-FP8-INT4-AWQ • Text Generation • 39B • Updated • 5.56k downloads • 9 likes
EliasOenal/MiniMax-M2.5-Hybrid-AWQ-W4A16G128-Attn-fp8_e4m3-KV-fp8_e4m3 • Text Generation • 34B • Updated • 517 downloads • 11 likes
RedHatAI/Qwen3.5-397B-A17B-FP8-dynamic • 397B • Updated • 2.87k downloads • 3 likes
prithivMLmods/Qwen-Image-Edit-AIO-FP8 • Image-to-Image • Updated • 232 downloads • 4 likes
unsloth/Qwen3.5-397B-A17B-FP8 • Image-Text-to-Text • 403B • Updated • 1.02k downloads • 2 likes
prithivMLmods/Spatial-SSRL-Qwen3VL-4B-FP8 • Image-Text-to-Text • 4B • Updated • 13 downloads • 1 like
• Text Generation • 229B • Updated • 33 downloads • 1 like
lovedheart/Qwen3.5-26B-A3B-FP8 • Image-Text-to-Text • 28B • Updated • 37 downloads • 1 like
FriendliAI/Meta-Llama-3-8B-Instruct-fp8 • Text Generation • 8B • Updated • 25 downloads • 2 likes
RedHatAI/Meta-Llama-3-8B-Instruct-FP8 • Text Generation • Updated • 6.22k downloads • 24 likes
RedHatAI/Mixtral-8x7B-Instruct-v0.1-AutoFP8 • Text Generation • 47B • Updated • 7 downloads • 3 likes
anyisalin/Meta-Llama-3-8B-Instruct-FP8 • Text Generation • 8B • Updated • 6 downloads
anyisalin/Meta-Llama-3-8B-Instruct-FP8-D • Text Generation • 8B • Updated • 4 downloads
anyisalin/lzlv_70b_fp16_hf-FP8-D • Text Generation • 69B • Updated • 1 download
anyisalin/Meta-Llama-3-70B-Instruct-FP8-D • Text Generation • 71B • Updated • 4 downloads