Dipika
AI & ML interests
None yet
Recent Activity
updated a collection 4 days ago
Models in CI
updated a model 4 days ago
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4A16
published a model 4 days ago
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4A16
Organizations
Update chat_template.jinja
1
#4 opened 12 days ago
by
bdellabe
Update chat_template.jinja
#5 opened 12 days ago
by
bdellabe
Can a model with this quantization type be deployed with vLLM on a 4090 GPU?
2
#9 opened 10 days ago
by
David-LR
Delete .eval_results
#2 opened 12 days ago
by
SaylorTwift
Error running with latest Cuda 13 SGLang
❤️ 1
2
#7 opened 9 days ago
by
souvla
Any hope for a dynamic version?
1
#1 opened 9 days ago
by
mirix
How did you manage to make a model trained in FP16 work on NVFP4, making it bigger?
1
#1 opened 15 days ago
by
yangus87
Great quant!!
12
#6 opened 25 days ago
by
tasticleeze
Regarding the correctness of the int4 quantization script
1
#5 opened 25 days ago
by
traphix
config.json ignore-list needs linear_attn patterns — model produces garbage otherwise
❤️ 1
3
#4 opened 28 days ago
by
cghart123
Creation details?
1
#3 opened 28 days ago
by
traphix
Sglang?
1
#2 opened 29 days ago
by
jpsequeira
vllm-openai:cu130-nightly Error
➕ 4
3
#1 opened 29 days ago
by
andynoodles
Update tokenizer_config.json
#3 opened about 1 month ago
by
bdellabe
Update tokenizer_config.json
#4 opened about 1 month ago
by
bdellabe
Update chat_template.jinja
#2 opened about 1 month ago
by
bdellabe
Update chat_template.jinja
#3 opened about 1 month ago
by
bdellabe
Add compressed-tensors tag
#1 opened 2 months ago
by
dsikka
Add compressed-tensors tag
👍 2
1
#9 opened 4 months ago
by
dsikka
Update config.json for sglang
4
#3 opened 6 months ago
by
ayachinenefan