Hugging Face
Zhenghai Xue
ZhenghaiXue
5 followers · 8 following
AI_Defender
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted a paper · 5 days ago
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation
upvoted a paper · 4 months ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
upvoted a paper · 4 months ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
ZhenghaiXue's activity
liked 3 models · over 1 year ago
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated Mar 27, 2025 • 4.58M • 13.3k
Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B
Text Classification • Updated Aug 29, 2025 • 656 • 53
OpenRLHF/Mistral-7b-PRM-Math-Shepherd
7B • Updated Oct 30, 2024 • 6 • 1