Albert Catalan-Tatjer

aldakata

4 10 15

https://aldakata.github.io/

aldakata

AI & ML interests

Efficiency

Recent Activity

liked a model about 6 hours ago

z-lab/Qwen3.5-27B-DFlash

liked a Space 6 days ago

JonasGeiping/stream-llm-demo

upvoted a collection 7 days ago

Qwen-AgentWorld

View all activity

Organizations

liked a model about 6 hours ago

z-lab/Qwen3.5-27B-DFlash

Text Generation • 2B • Updated 11 days ago • 3.1k • 111

liked a Space 6 days ago

Stream-LLM Live Demo

🌊

Live ten-channel multi-stream LLM demo.

upvoted a collection 7 days ago

Qwen-AgentWorld

Collection

3 items • Updated 7 days ago • 62

liked a model 8 days ago

unsloth/GLM-5.2-GGUF

Text Generation • 754B • Updated 8 days ago • 180k • 486

liked a model 9 days ago

fixed-point-reasoners/fprm

Updated 9 days ago • 2

liked a model about 1 month ago

talkie-lm/talkie-1930-13b-it

Updated Apr 23 • 283

liked a model about 2 months ago

JonasGeiping/stream-qwen3.5-27b

Text Generation • 27B • Updated May 13 • 138 • 22

New activity in JonasGeiping/stream-qwen3-8b about 2 months ago

typo

#1 opened about 2 months ago by

aldakata

liked a model about 2 months ago

JonasGeiping/stream-qwen3-8b

Text Generation • 8B • Updated May 13 • 106 • 6

New activity in allenai/OLMo-2-0425-1B 3 months ago

Main revision

#5 opened 9 months ago by

aldakata

liked 2 datasets 5 months ago

ricdomolm/MATH-500

Viewer • Updated Feb 6, 2025 • 12.5k • 179 • 4

christopher/rosetta-code

Viewer • Updated Sep 24, 2023 • 79k • 656 • 39

upvoted a paper 5 months ago

Olmo 3

Paper • 2512.13961 • Published Dec 15, 2025 • 36

upvoted a collection 5 months ago

Olmo 3

Collection

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 171

liked a model 7 months ago

deepseek-ai/DeepSeek-Math-V2

Text Generation • 685B • Updated Nov 27, 2025 • 581 • 702

liked a model 8 months ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated Dec 17, 2025 • 8.45k • 1.47k

liked 2 Spaces 8 months ago

The Smol Training Playbook

📚

3.22k

The secrets to building world-class LLMs

The Ultra-Scale Playbook

🌌

3.92k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 8 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 356

liked a dataset 8 months ago

bigcode/starcoderdata

Viewer • Updated May 16, 2023 • 207M • 21.4k • 523

Albert Catalan-Tatjer

AI & ML interests

Recent Activity

Organizations

aldakata's activity

Stream-LLM Live Demo

typo

Main revision

The Smol Training Playbook

The Ultra-Scale Playbook

KV Caching Explained: Optimizing Transformer Inference Efficiency