In a Training Loop 🔄

Asankhaya Sharma

codelion

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and Ellora. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

updated a model about 15 hours ago

mlx-community/gemma-4-31B-it-OptiQ-4bit

published a model about 15 hours ago

mlx-community/gemma-4-31B-it-OptiQ-4bit

updated a model about 16 hours ago

mlx-community/Qwen3.6-35B-A3B-OptiQ-4bit

View all activity

Organizations

upvoted 2 collections 16 days ago

YOLO 26

Collection

5 items • Updated 16 days ago • 2

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 6 days ago • 126

upvoted a changelog 19 days ago

Hugging Face Changelog

Agent Traces on the Hub

19 days ago

• 116

upvoted an article 24 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

25 days ago

•

879

upvoted a collection about 2 months ago

Nano Language Models

Collection

A collection of really small language models pre-trained from scratch with open-data. Ideal for use in experimentation and evaluations. • 3 items • Updated Mar 25 • 1

upvoted an article about 2 months ago

Article

Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens

Mar 6

•

upvoted a collection about 2 months ago

🤏 Smol-Data

Collection

Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated Mar 2 • 12

upvoted a paper 3 months ago

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 225

upvoted an article 3 months ago

Article

Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models

Jan 23

•

upvoted an article 4 months ago

Article

The Optimal Architecture for Small Language Models

Dec 26, 2025

•

120

upvoted a paper 4 months ago

Universal Reasoning Model

Paper • 2512.14693 • Published Dec 16, 2025 • 44

upvoted an article 5 months ago

Article

Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement

Dec 3, 2025

•

upvoted a paper 5 months ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

Paper • 2511.17006 • Published Nov 21, 2025 • 34

upvoted 2 articles 6 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

•

Article

Python Is All You Need? Introducing Dria-Agent-α

Jan 10, 2025

•

upvoted 2 collections 7 months ago

Sutra Pedagogical Datasets

Collection

High-quality synthetic educational datasets designed for LLM pretraining with structured pedagogical content across 9 knowledge domains. • 7 items • Updated Mar 17 • 4

Dhara Foundational Models

Collection

Diffusion Language Models combining deep narrow networks, Canon layers (depthwise causal convolutions), and WSD (Warmup-Stable-Decay) training. • 2 items • Updated Mar 21 • 3

upvoted a paper 7 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513

upvoted an article 7 months ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

Oct 9, 2025

•

upvoted a collection 7 months ago

Mem-Agent

Collection

Small sized agents from Dria trained on interacting with an obsidian-like memory system using python tools. Trained on Qwen3-4B-Thinking-2507. • 4 items • Updated Sep 5, 2025 • 5

Asankhaya Sharma

AI & ML interests

Recent Activity

Organizations

codelion's activity

Agent Traces on the Hub

Welcome Gemma 4: Frontier multimodal intelligence on device

Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens

Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models

The Optimal Architecture for Small Language Models

Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Python Is All You Need? Introducing Dria-Agent-α

mem-agent: Equipping LLM Agents with Memory Using RL