Spaces:
Running
Running
Commit History
cuda: remove linking to cublasLt (llama/14790)
fafaa8b
CUDA: FA support for Deepseek (Ampere or newer) (llama/13306)
507d30c
CUDA: mix virt/real CUDA archs for GGML_NATIVE=OFF (llama/13135)
9fb68a1
build : fix build info on windows (llama/13239)
415b9fc
Diego Devesa
commited on
CUDA: compress mode option and default to size (llama/12029)
4ec988a
Erik Scholz
commited on
CUDA: app option to compile without FlashAttention (llama/12025)
fbc5f16
CUDA: correct the lowest Maxwell supported by CUDA 12 (llama/11984)
6641178
cuda : add ampere to the list of default architectures (llama/11870)
1d19dec
Diego Devesa
commited on