Trevor Miller

MicrowaveJack

MicrowaveJack

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

liked a Space 2 months ago

nanotron/ultrascale-playbook

liked a Space 2 months ago

HuggingFaceFW/blogpost-fineweb-v1

View all activity

Organizations

None yet

upvoted an article about 2 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

•

liked 3 Spaces 2 months ago

The Ultra-Scale Playbook

🌌

3.61k

The ultimate guide to training LLM on large GPU Clusters

FineWeb: decanting the web for the finest text data at scale

🍷

1.24k

Generate high-quality text data for LLMs using FineWeb

The Smol Training Playbook

📚

2.75k

The secrets to building world-class LLMs

liked a model 3 months ago

microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9, 2025 • 675 • 360

liked a Space about 1 year ago

Qwen2.5 Coder Artifacts

🐢

1.71k

Generate code for applications

liked a model over 1 year ago

BAAI/bge-small-en-v1.5

Feature Extraction • 33.4M • Updated Feb 22, 2024 • 3.09M • • 394

liked 2 datasets over 1 year ago

gretelai/gretel-math-gsm8k-v1

Viewer • Updated Oct 16, 2024 • 24.9k • 384 • 39

TIGER-Lab/SKGInstruct

Preview • Updated Apr 9, 2024 • 160 • 28

liked 4 models over 1 year ago

liked a dataset over 1 year ago

TIGER-Lab/MMLU-Pro

Viewer • Updated Oct 25, 2025 • 12.1k • 69.4k • 403

upvoted a paper over 1 year ago

DynaVis: Dynamically Synthesized UI Widgets for Visualization Editing

Paper • 2401.10880 • Published Jan 19, 2024 • 1

liked a model almost 2 years ago

TheBloke/CodeLlama-70B-hf-GGUF

Text Generation • 69B • Updated Jan 30, 2024 • 977 • 42

updated a collection almost 2 years ago

Research

Collection

1 item • Updated Jan 29, 2024

upvoted a paper almost 2 years ago

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26, 2024 • 73

liked a model almost 2 years ago

mistralai/Mixtral-8x7B-Instruct-v0.1

47B • Updated Jul 24, 2025 • 554k • 4.62k

Trevor Miller

AI & ML interests

Recent Activity

Organizations

MicrowaveJack's activity

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale

The Smol Training Playbook

Qwen2.5 Coder Artifacts