Hugging Face
Yang Zhou
nbzy1995
0 followers
·
5 following
nbzy1995
yang-zhou-524b51170
AI & ML interests
Artificial General Intelligence, AI for Science, AI for society
Recent Activity
Updated a model about 2 months ago: nbzy1995/Qwen2-0-5B-GRPO-vllm-trl
Organizations
nbzy1995's models (16)
Sort: Recently updated
nbzy1995/Qwen2-0-5B-GRPO-vllm-trl
Updated
Nov 17, 2025
nbzy1995/Qwen3-VL-4B-Instruct-trl-grpo
Updated
Nov 13, 2025
nbzy1995/Reinforce-Cartpole-v1
Reinforcement Learning
•
Updated
Jun 7, 2025
nbzy1995/dqn_rl_zoo3_atari
Reinforcement Learning
•
Updated
Jun 6, 2025
•
4
nbzy1995/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Jun 4, 2025
nbzy1995/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jun 1, 2025
•
3
nbzy1995/LunarLander-v2-scratch
Reinforcement Learning
•
Updated
May 31, 2025
nbzy1995/poca-SoccerTwos
Reinforcement Learning
•
Updated
May 2, 2025
•
14
nbzy1995/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Apr 22, 2025
•
2
nbzy1995/ppo-PyramidsRND
Reinforcement Learning
•
Updated
Apr 18, 2025
•
11
nbzy1995/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Apr 18, 2025
•
19
nbzy1995/Reinforce-PixelCopter
Reinforcement Learning
•
Updated
Apr 17, 2025
nbzy1995/Taxi-v3
Reinforcement Learning
•
Updated
Apr 5, 2025
nbzy1995/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Apr 5, 2025
nbzy1995/ppo-Huggy
Reinforcement Learning
•
Updated
Mar 14, 2025
•
56
nbzy1995/test
Updated
Mar 13, 2025