ldwang

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

liked a Space about 11 hours ago

CompVis/stable-diffusion-license

liked a model about 15 hours ago

zai-org/GLM-5.2

upvoted a paper 5 days ago

MiniMax Sparse Attention

View all activity

Organizations

liked a Space about 11 hours ago

License

⚖

1.43k

Convert PDFs to HTML

liked a model about 15 hours ago

zai-org/GLM-5.2

Text Generation • 753B • Updated about 6 hours ago • 666 • • 891

upvoted a paper 5 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 7 days ago • 137

liked a dataset 6 days ago

nvidia/Nemotron-SFT-CUDA-v1

Viewer • Updated 13 days ago • 2.28k • 307 • 5

upvoted a collection 13 days ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 6 days ago • 158

upvoted an article 13 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 163

liked a Space 20 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

187

Building and scaling RL environments for LLM training

liked a dataset about 1 month ago

open-thoughts/AgentTrove

Viewer • Updated May 7 • 1.7M • 4.28k • 186

liked 5 models about 1 month ago

updated a model about 1 month ago

BAAI/OpenSeek-Mid-v1

Text Generation • 11B • Updated May 13 • 20 • 12

liked a model about 1 month ago

BAAI/OpenSeek-Mid-v1

Text Generation • 11B • Updated May 13 • 20 • 12

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-V4-Flash

Text Generation • 158B • Updated 9 days ago • 2.22M • • 1.52k

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 9 days ago • 2.8M • • 4.92k

upvoted a collection 2 months ago

Qwen3.6

Collection

4 items • Updated Apr 22 • 409

liked a model 2 months ago

Qwen/Qwen3.5-9B

Image-Text-to-Text • 10B • Updated Mar 2 • 6.71M • • 1.58k

upvoted a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110