Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alex Li's picture
1 8 20

Alex Li

alexyogo22
ksiabani's profile picture BigDog93's profile picture
·
  • AlexanderYogurt

AI & ML interests

Agents

Organizations

LangChainDatasets's profile picture

upvoted 2 papers 4 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 228
upvoted a paper 5 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 158
upvoted a paper 6 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8, 2025 • 93
upvoted a paper 8 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144
upvoted a collection 8 months ago

Qwen3

Collection
84 items • Updated 4 days ago • 1.53k
upvoted an article 9 months ago
view article
Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

  • +4
Feb 4, 2025
•
122
upvoted an article 11 months ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

  • +1
Jan 28, 2025
•
887
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs