Sheng Yang

SuunnYang

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a collection about 1 month ago

GLM-4.6V

3 items • Updated Dec 8, 2025 • 47

upvoted a collection 4 months ago

Qwen3-VL

37 items • Updated 9 days ago • 559

upvoted a paper 4 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 84

upvoted 5 papers 5 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 180

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14, 2025 • 144

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 195

upvoted a collection 5 months ago

GLM-4.5

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11, 2025 • 252

upvoted 5 papers 6 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2, 2025 • 56

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 130

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 434

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 250