grib0ed0v (Alexey G)

upvoted an article 2 months ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

qgallouedec

•

Dec 4, 2025

• 69

upvoted a collection about 1 year ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 736

upvoted 9 articles over 1 year ago

Article

The Open Arabic LLM Leaderboard 2

+6

alielfilali01, Manel-Hik, tarickMorty, amztheory, basma-b, rcojocaru, HakimHacid, clefourrier

•

Feb 10, 2025

• 38

Article

Welcome to Inference Providers on the Hub 🔥

+5

burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c

•

Jan 28, 2025

• 495

Article

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

+1

andito, mfarre, merve

•

Jan 23, 2025

• 192

Article

Train 400x faster Static Embedding Models with Sentence Transformers

tomaarsen

•

Jan 15, 2025

• 230

Article

Introducing smolagents: simple agents that write actions in code.

+1

m-ric, merve, thomwolf

•

Dec 31, 2024

• 1.2k

Article

We now support VLMs in smolagents!

+1

m-ric, merve, albertvillanova

•

Jan 24, 2025

• 113

Article

Finally, a Replacement for BERT: Introducing ModernBERT

+13

bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo

•

Dec 19, 2024

• 741

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

mfuntowicz, hlarcher

•

Jan 16, 2025

• 76

Article

Timm ❤️ Transformers: Use any timm model with transformers

+3

ariG23498, rwightman, qubvel-hf, pcuenq, reach-vb

•

Jan 16, 2025

• 55

upvoted 2 collections over 1 year ago

SigLIP

Collection

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Mar 12 • 66

Cultura-Ru-Edu

Collection

Our dataset for enhancing LLM training with educational content in the Russian language. • 2 items • Updated Nov 26, 2024 • 5

upvoted 2 papers over 1 year ago

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 29

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 122

upvoted an article over 1 year ago

Article

Let’s make a generation of amazing image generation models

burtenshaw

•

Nov 26, 2024

• 33

upvoted 4 papers over 1 year ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 57

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 65

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2, 2024 • 13

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Paper • 2410.19609 • Published Oct 25, 2024 • 18

Alexey G

AI & ML interests

Organizations

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

Llama 4

The Open Arabic LLM Leaderboard 2

Welcome to Inference Providers on the Hub 🔥

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

Train 400x faster Static Embedding Models with Sentence Transformers

Introducing smolagents: simple agents that write actions in code.

We now support VLMs in smolagents!

Finally, a Replacement for BERT: Introducing ModernBERT

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Timm ❤️ Transformers: Use any timm model with transformers

SigLIP

Cultura-Ru-Edu

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Attention Is All You Need

Let’s make a generation of amazing image generation models

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Large Language Models Can Self-Improve in Long-context Reasoning

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Alexey G

AI & ML interests

Organizations

grib0ed0v's activity

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

The Open Arabic LLM Leaderboard 2

Welcome to Inference Providers on the Hub 🔥

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

Train 400x faster Static Embedding Models with Sentence Transformers

Introducing smolagents: simple agents that write actions in code.

We now support VLMs in smolagents!

Finally, a Replacement for BERT: Introducing ModernBERT

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Timm ❤️ Transformers: Use any timm model with transformers

Let’s make a generation of amazing image generation models