In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted a paper about 3 hours ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 65

upvoted an article about 3 hours ago

Article

ML Intern Takes Our Post-Training Internship Test

about 3 hours ago • 11

published an article about 3 hours ago

Article

ML Intern Takes Our Post-Training Internship Test

about 3 hours ago • 11

updated a Space about 3 hours ago

ml-intern sandbox

liked a model 1 day ago

openai/privacy-filter

Token Classification • 1B • Updated 1 day ago • 1.89k • 506

updated 3 Spaces 3 days ago

ml-intern sandbox

ml-agent sandbox

Traces Viewer

Explore Hugging Face trace logs in an interactive viewer

updated a dataset 3 days ago

smolagents/post-train-bench-traces

Updated 3 days ago • 77

published a dataset 3 days ago

smolagents/post-train-bench-traces

Updated 3 days ago • 77

liked a Space 3 days ago

Traces Viewer

Explore and visualize trace logs in an interactive web viewer

liked a Space 4 days ago

Defeating the trainer-generator precision mismatch in TRL

Download research PDF (Pro access required)

liked a model 7 days ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated 1 day ago • 718k • 1.31k

upvoted a paper 7 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published 22 days ago • 46

published a model 7 days ago

lewtun/Qwen3-4B-Instruct-2507-SFT

Updated 7 days ago