kevinpro's Collections
R-PRM
MAPO: Multilingual Reasoning with Preference Optimization
R-PRM
Updated Mar 31, 2025
R-PRM: Reasoning-Driven Process Reward Modeling
Upvotes: 3
kevinpro/R-PRM-7B-DPO • Text Generation • 8B • Updated Mar 28, 2025 • 8 • 3
R-PRM: Reasoning-Driven Process Reward Modeling • Paper • 2503.21295 • Published Mar 27, 2025
kevinpro/R-PRM • Viewer • Updated Mar 28, 2025 • 594k • 2.12k • 1