In a Training Loop 🔄


Luc Robert--Villanueva

lucrbrtv

AI & ML interests

None yet

Recent Activity

liked a model about 7 hours ago

dphn/Dolphin-X1-Trinity-Nano-GGUF

upvoted a paper 4 days ago

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

updated a model 5 days ago

lucrbrtv/doom-world-model


Organizations

None yet

upvoted a paper 4 days ago

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Paper • 2412.06781 • Published Dec 9, 2024 • 24

upvoted a paper about 2 months ago

Interleaved Head Attention

Paper • 2602.21371 • Published Feb 24 • 1

upvoted a changelog about 2 months ago

Hugging Face Changelog

Agent Traces on the Hub

Apr 7 • 138

upvoted an article about 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 902

upvoted a paper about 2 months ago

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Paper • 2603.19312 • Published Mar 13 • 46

upvoted a collection about 2 months ago

LeWM

Collection

Official checkpoints and datasets related to LeWM paper. • 9 items • Updated Mar 27 • 40

upvoted a changelog 2 months ago

Hugging Face Changelog

Hugging Face Papers for AI Agents

Mar 18 • 141

upvoted an article 2 months ago

Article

PRX Part 3 — Training a Text-to-Image Model in 24h!

Photoroom • Mar 3 • 64

upvoted 2 papers 2 months ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 105

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published Mar 13 • 44

upvoted an article 3 months ago

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 164

upvoted 2 articles 6 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate • Dec 4, 2025 • 627

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311

upvoted a paper 6 months ago

Rotary Position Embedding for Vision Transformer

Paper • 2403.13298 • Published Mar 20, 2024 • 6

upvoted 6 articles 6 months ago

Article

~Don't~ Repeat Yourself

patrickvonplaten • Apr 5, 2022 • 55

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb • May 21, 2025 • 258

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

codelion • Nov 3, 2025 • 65

Article

Text-to-image Architectural Experiments

Photoroom • Nov 13, 2025 • 57

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

MiniMax-AI • Oct 30, 2025 • 80

Article

We’re open-sourcing our text-to-image model and the process behind it

Photoroom • Nov 12, 2025 • 99
