whisper.cpp / ggml.c

Commit History

ci : more platforms coverage (#1101)
c4448fa
unverified

alonfaraj Alon Faraj commited on

Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027)"
1e5ddb0
unverified

ggerganov commited on

ggml : sync latest repo (mostly refactoring changes)
d97fd69
unverified

ggerganov commited on

ggml : do not use _GNU_SOURCE gratuitously (#1027)
3a69cdf
unverified

Przemysław Pawełczyk commited on

ggml : sync latest ggml lib
a100c9a
unverified

ggerganov commited on

ggml : update WASM SIMD
84c1cc7
unverified

ggerganov commited on

ggml : sync latest ggml repo
6ee8740
unverified

ggerganov commited on

ggml : add AVX dot products
7e7b11c
unverified

ggerganov commited on

ggml : sync latest ggml
803e1be
unverified

ggerganov commited on

ggml : fix 32-bit ARM build + quantization
87ee234

ggerganov commited on

ggml : sync ggml (clBLAST + tensor names)
f50d3b3
unverified

ggerganov commited on

ggml : fix UB (int << 31)
8253b98
unverified

ggerganov commited on

whisper : add integer quantization support (#540)
a5f8f3c
unverified

ggerganov commited on

ggml : fix WASM build
c3d7603
unverified

ggerganov commited on

ggml : fix 32-bit ARM NEON (#836)
5fa72ca
unverified

ggerganov commited on

ggml : use vzip instead of vuzp for consistency
741db99
unverified

ggerganov commited on

ggml : fix WASM build
ada8c2d
unverified

ggerganov commited on

ggml : sync with ggml repo (warning fixes + asserts)
caf2759
unverified

ggerganov commited on

ggml : sync latest ggml + llama.cpp updates (quantization)
ede1268
unverified

ggerganov commited on

ggml, ci : fix build on whisper.android (ARM_NEON) + add CI (#764)
dedf05b
unverified

jhenhong commited on

ggml : sync latest ggml
7b8292f
unverified

ggerganov commited on

ggml : fix q4_1 dot product types (#759)
984a856
unverified

novag ggerganov commited on

ggml : sync latest changes from ggml and llama.cpp
3bd52ce
unverified

ggerganov commited on

ggml : fix WASM build
70332a0
unverified

ggerganov commited on

ggml : backport llama.cpp updates (close #709)
bf6b4f8
unverified

ggerganov commited on

talk-llama : add new example + sync ggml from llama.cpp (#664)
a8c74e6
unverified

ggerganov commited on

whisper : reduce memory usage during inference (#431)
3aa9e6c
unverified

ggerganov commited on

whisper : PPC64 big-endian support (#398)
239569b
unverified

fitzsim commited on

bench : add memcpy and ggml_mul_mat benchmarks
a660ed9
unverified

ggerganov commited on

ggml : remove obsolete zeroing + comment fixes (#390)
9c35c0d
unverified

ggerganov commited on

ggml : correct behaviour of ggml_vec_sum_f32 (#390)
ffffc6e
unverified

Abitofevrything commited on

ggml : improve vec_dot_f16 unrolling in flash_attn_f16
6e57274
unverified

ggerganov commited on

ggml : fix bug in new soft max computation
c59ce76
unverified

ggerganov commited on

ggml : when using BLAS start only 1 CPU thread
6c4692f
unverified

ggerganov commited on

ggml : fix running tasks with variable number of threads
2078d85
unverified

ggerganov commited on

ggml : unroll ggml_vec_dot_f16 in ggml_compute_forward_flash_attn_f16
f07fecd
unverified

ggerganov commited on

whisper : revert accidental MB change
db991e1
unverified

ggerganov commited on

ggml : speed-up soft max via Accelerate + unroll
fdaf59a
unverified

ggerganov commited on

ggml : use vDSP_sve and vDSP_maxv from Accelerate
ed14a8b
unverified

ggerganov commited on

ggml : make gcc happy (minor)
496acd2
unverified

ggerganov commited on

ggml : add SSE3 and fp16 conversion lookup table (#368)
2c3f7d4
unverified

Abitofevrything ggerganov commited on

whisper : document POWER VSX support
4dbf7ee

Thomas Fitzsimmons commited on

ggml : reorganize POWER9 ppc64le SIMD code
e0a5614

Thomas Fitzsimmons commited on

ggml : change f16 load and store macro arguments
4a68b87

Thomas Fitzsimmons commited on

ggml : add void to argument-less functions
f06f912
unverified

ggerganov commited on

ggml : define MIN / MAX only if not defined (minor)
2117da6
unverified

ggerganov commited on

ggml : improve f16 acceleration for POWER9 ppc64le
f92a260

Thomas Fitzsimmons commited on

ggml : barrier refactor + static functions
7b501c1
unverified

ggerganov commited on

ggml : simplify the SIMD code (#324)
6fe850c
unverified

ggerganov commited on

ggml : use vaddvq_f32 for slightly more efficient reduce
550fbf8
unverified

ggerganov commited on