AI & ML interests
None yet
Organizations
None yet
dyingc/qwen2_5_3b_grpo_gsm8k
3B
•
Updated
dyingc/bert-base-uncased_with_RCE
Feature Extraction
•
Updated
•
3
dyingc/Mistral-7B-instruction-finetuned
Text Generation
•
7B
•
Updated
•
1
dyingc/Mistral-7B-instruction-LoRA
Updated
dyingc/Mistral-CatMacaroni-slerp-uncensored-7B.q8_0.gguf
7B
•
Updated
•
25
•
1
dyingc/Llama-2-7b-chat-hf-bitsandbytes
Text Generation
•
7B
•
Updated
•
2
dyingc/Llama-2-7b-chat-hf-quant8
Text Generation
•
7B
•
Updated
•
2
dyingc/LlamaGuard-7b-quant
Text Generation
•
7B
•
Updated
•
2
dyingc/Llama-2-7b-chat-hf-quant
Text Generation
•
7B
•
Updated
•
2
dyingc/Llama-2-7b-chat-hf-q8
Text Generation
•
7B
•
Updated
•
4
dyingc/llama-2-7b-chat-hf-q4
Text Generation
•
7B
•
Updated
•
3
dyingc/bert-base-uncased-with-custom-code
Fill-Mask
•
Updated
•
1
dyingc/dolly-lora-ddp-test
Updated
dyingc/alpaca7B-lora-vastai
Updated
Reinforcement Learning
•
Updated
dyingc/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
dyingc/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
•
1
Reinforcement Learning
•
Updated
•
2
dyingc/ppo-SnowballTarget
Reinforcement Learning
•
Updated
dyingc/Reinforce-PixelCopter
Reinforcement Learning
•
Updated
dyingc/Reinforce-policy-gradient
Reinforcement Learning
•
Updated
dyingc/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
dyingc/q-FrozenLake-v1-8x8-Slippery
Reinforcement Learning
•
Updated
dyingc/q-FrozenLake-v1-8x8-noSlippery
Reinforcement Learning
•
Updated
dyingc/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
•
10
Reinforcement Learning
•
Updated
•
1