3 9 40

Weijing Huang

waleking

AI & ML interests

Language Models

Recent Activity

upvoted a paper 16 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

upvoted a paper 9 months ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

liked a dataset 10 months ago

OpenStellarTeam/Chinese-SimpleQA

View all activity

Organizations

None yet

upvoted a paper 16 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

upvoted a paper 9 months ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published Apr 7, 2025 • 26

liked 2 datasets 10 months ago

OpenStellarTeam/Chinese-SimpleQA

Viewer • Updated Dec 16, 2024 • 3k • 377 • 35

allenai/olmOCR-mix-0225

Viewer • Updated Feb 25, 2025 • 259k • 912 • 169

upvoted a paper 11 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11, 2025 • 57

liked a dataset 11 months ago

Anthropic/EconomicIndex

Preview • Updated Nov 17, 2025 • 3.34k • 380

upvoted a paper 11 months ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published Feb 5, 2025 • 18

upvoted an article 11 months ago

Article

Replicating DeepSeek R1 for Information Extraction

Jan 31, 2025

•

upvoted a paper 12 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 99

liked a Space 12 months ago

Scaling test-time compute

📈

588

Implement test-time compute scaling for math problems

upvoted an article about 1 year ago

Article

Deriving DPO's Loss

Dec 24, 2024

•

liked a dataset about 1 year ago

m-a-p/MAP-CC

Viewer • Updated Jul 11, 2024 • 1.77B • 2.7k • 76

liked 2 datasets over 1 year ago

Lyte/Reasoner-1o1-v0.3-HQ

Viewer • Updated Sep 18, 2024 • 370 • 44 • 8

pints-ai/Expository-Prose-V1

Viewer • Updated Aug 12, 2024 • 6.67M • 115 • 19

liked a model over 1 year ago

PleIAs/OCRonos-Vintage

Text Generation • 0.1B • Updated Aug 8, 2024 • 93 • 81

liked 4 datasets over 1 year ago