MatchaLwc/Qwen2.5-Math-7B-ep1-new-0.6-compress-0.77 • Text Generation • 8B • Updated
MatchaLwc/Qwen2.5-Math-7B-ep1-new-0.6 • Text Generation • 8B • Updated
MatchaLwc/Qwen2.5-Math-7B-ep1-new-1 • Text Generation • 8B • Updated • 1
MatchaLwc/Qwen2.5-Math-7B-ep1-0.9-compressreward • Text Generation • 8B • Updated
MatchaLwc/Qwen2.5-Math-7B-ep1-0.9 • Text Generation • 8B • Updated • 1
MatchaLwc/Qwen2.5-Math-7B-ep1-1 • Text Generation • 8B • Updated • 1
MatchaLwc/Qwen2.5-7B-Instruct-aftermath-ep1-compressed-0.8 • Text Generation • 8B • Updated • 1
MatchaLwc/Qwen2.5-7B-Instruct-aftermath-ep1-compressed • Text Generation • 8B • Updated • 1
MatchaLwc/Qwen2.5-7B-Instruct-aftermath-ep1 • Text Generation • 8B • Updated • 1
MatchaLwc/qwen25-Math-7b-1epoch-0.8 • Text Generation • 8B • Updated • 1
MatchaLwc/qwen25-Math-7b-1epoch • Text Generation • 8B • Updated
MatchaLwc/qwen25-Math-7b-3epoch • Text Generation • 8B • Updated • 1
MatchaLwc/Qwen-2.5-7B-Simple-RL-sequential • Text Generation • 8B • Updated • 1
MatchaLwc/Qwen-2.5-7B-Simple-RL • Text Generation • 8B • Updated
MatchaLwc/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated
MatchaLwc/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B • Updated • 2
MatchaLwc/policy_llama3_policy_0 • 8B • Updated
MatchaLwc/MATH_value_0_llama3 • Updated
MatchaLwc/more_4_wrong_llama2 • Updated
MatchaLwc/more_3_wrong_llama2 • Updated