whisper.cpp/ggml/src/ggml-cuda/CMakeLists.txt

Commit History

CUDA cmake: add `-lineinfo` for easier debug (llama/15260)
008e169

am17an committed on

cuda: remove linking to cublasLt (llama/14790)
fafaa8b

yeahdongcn committed on

CUDA: FA support for Deepseek (Ampere or newer) (llama/13306)
507d30c

JohannesGaessler committed on

CUDA: mix virt/real CUDA archs for GGML_NATIVE=OFF (llama/13135)
9fb68a1

JohannesGaessler committed on

build : fix build info on windows (llama/13239)
415b9fc

Diego Devesa committed on

CUDA: compress mode option and default to size (llama/12029)
4ec988a

Erik Scholz committed on

CUDA: app option to compile without FlashAttention (llama/12025)
fbc5f16

JohannesGaessler committed on

CUDA: correct the lowest Maxwell supported by CUDA 12 (llama/11984)
6641178

PureJourney JohannesGaessler committed on

cuda : add ampere to the list of default architectures (llama/11870)
1d19dec

Diego Devesa committed on

CUDA: use mma PTX instructions for FlashAttention (llama/11583)
f328957

JohannesGaessler Diego Devesa committed on

ggml : sync remnants (skip) (#0)
451937f

ggerganov committed on

ggml : sync resolve (skip) (#0)
d4d67dc

ggerganov committed on