Open to Work

10 34 22

Sifal KLIOUI

Sifal

https://sifal.social/

AI & ML interests

None yet

Recent Activity

liked a model 25 days ago

deepseek-ai/DeepSeek-V4-Pro

liked a model about 1 month ago

QwQZh/gated_attention

upvoted a paper about 2 months ago

ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models

View all activity

Organizations

upvoted a paper about 2 months ago

ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models

Paper • 2603.19466 • Published Mar 19 • 41

upvoted an article 4 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq

•

Dec 18, 2025

• 124

upvoted a collection 5 months ago

Olmo 3.1

Collection

The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated Dec 23, 2025 • 52

upvoted an article 5 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

qgallouedec

•

Apr 18, 2025

• 72

upvoted 2 articles 6 months ago

Article

Model statistics of the 50 most downloaded entities on Hugging Face

lbourdois

•

Oct 13, 2025

• 40

Article

Entropic Instruction Following: Does Semantic Coherence Help LLMs Follow Instructions?

Sifal

•

Dec 3, 2025

• 1

upvoted an article 8 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 188

upvoted an article 9 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

abidlabs, znation, nouamanetazi, sasha, qgallouedec

•

Jul 29, 2025

• 223

upvoted 6 articles 11 months ago

Article

Creating custom kernels for the AMD MI300

ror, seungrokj

•

Jul 9, 2025

• 54

Article

Efficient MultiModal Data Pipeline

ariG23498, lusxvr, andito, sergiopaniego, pcuenq

•

Jul 8, 2025

• 71

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers

tomaarsen, arthurbresnu

•

Jul 1, 2025

• 138

Article

Transformers backend integration in SGLang

zhyncs, ispobock, lmzheng, JinnP, marcsun13

•

Jun 23, 2025

• 56

Article

🪆 Introduction to Matryoshka Embedding Models

tomaarsen, Xenova, osanseviero

•

Feb 23, 2024

• 208

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb

•

Jun 12, 2025

• 164

upvoted 2 articles 12 months ago

Article

Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻

sasha

•

May 28, 2025

• 22

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 258

upvoted 3 articles about 1 year ago

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

danaaubakirova, Beegbrain, mshukor, m1b, villekuosmanen, cadene, pcuenq

•

May 11, 2025

• 97

Article

~Don't~ Repeat Yourself

patrickvonplaten

•

Apr 5, 2022

• 55

Article

Uncensor any LLM with abliteration

mlabonne

•

Jun 13, 2024

• 855

upvoted a paper about 1 year ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 172

Sifal KLIOUI

AI & ML interests

Recent Activity

Organizations

Sifal's activity

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Gotchas in Tokenizer Behavior Every Developer Should Know

Model statistics of the 50 most downloaded entities on Hugging Face

Entropic Instruction Following: Does Semantic Coherence Help LLMs Follow Instructions?

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Creating custom kernels for the AMD MI300

Efficient MultiModal Data Pipeline

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Transformers backend integration in SGLang

🪆 Introduction to Matryoshka Embedding Models

Learn the Hugging Face Kernel Hub in 5 Minutes

Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻

nanoVLM: The simplest repository to train your VLM in pure PyTorch

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

~Don't~ Repeat Yourself

Uncensor any LLM with abliteration