Yimin Wang

99sweetcookie

AI & ML interests

None yet

Recent Activity

published a model 3 days ago

99sweetcookie/qwen3-4b-router

updated a model 3 days ago

99sweetcookie/reasoningshield-r1-llama

published a model 3 days ago

99sweetcookie/reasoningshield-r1-llama

Organizations

published a model 3 days ago

99sweetcookie/qwen3-4b-router

Updated 3 days ago

updated a model 3 days ago

99sweetcookie/reasoningshield-r1-llama

8B • Updated 3 days ago • 24

published a model 3 days ago

99sweetcookie/reasoningshield-r1-llama

8B • Updated 3 days ago • 24

updated a model 3 days ago

99sweetcookie/reasoningshield-stage2-dpo

Text Generation • Updated 3 days ago • 11

published a model 3 days ago

99sweetcookie/reasoningshield-stage2-dpo

Text Generation • Updated 3 days ago • 11

updated a model 3 days ago

99sweetcookie/reasoningshield-stage1-sft

Text Generation • 8B • Updated 3 days ago • 12

published a model 3 days ago

99sweetcookie/reasoningshield-stage1-sft

Text Generation • 8B • Updated 3 days ago • 12

updated a model 4 days ago

99sweetcookie/reasoningshield-dpo-final

Text Generation • 4B • Updated 4 days ago • 20

published a model 4 days ago

99sweetcookie/reasoningshield-dpo-final

Text Generation • 4B • Updated 4 days ago • 20

updated a model 4 days ago

99sweetcookie/reasoningshield-dpo-40

4B • Updated 4 days ago • 10

published a model 4 days ago

99sweetcookie/reasoningshield-dpo-40

4B • Updated 4 days ago • 10

updated a model 4 days ago

99sweetcookie/reasoningshield-dpo-checkpoint30

4B • Updated 4 days ago • 11

published a model 4 days ago

99sweetcookie/reasoningshield-dpo-checkpoint30

4B • Updated 4 days ago • 11

updated a model 4 days ago

99sweetcookie/reasoningshield-sft

Text Generation • 4B • Updated 4 days ago • 10

published a model 4 days ago

99sweetcookie/reasoningshield-sft

Text Generation • 4B • Updated 4 days ago • 10

upvoted a paper about 1 month ago

LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?

Paper • 2510.09595 • Published Oct 10, 2025 • 2

upvoted a paper 6 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28, 2025 • 83

authored a paper 7 months ago

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

Paper • 2506.14728 • Published Jun 17, 2025

authored 2 papers 8 months ago

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Paper • 2505.20246 • Published May 26, 2025

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Paper • 2505.20286 • Published May 26, 2025 • 8
