ci : more platforms coverage (#1101) c4448fa unverified alonfaraj Alon Faraj committed on Jul 16, 2023
Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027)" 1e5ddb0 unverified ggerganov committed on Jul 2, 2023
ggml : sync latest repo (mostly refactoring changes) d97fd69 unverified ggerganov committed on Jul 2, 2023
ggml : do not use _GNU_SOURCE gratuitously (#1027) 3a69cdf unverified Przemysław Pawełczyk committed on Jun 25, 2023
whisper : add integer quantization support (#540) a5f8f3c unverified ggerganov committed on Apr 30, 2023
ggml : use vzip instead of vuzp for consistency 741db99 unverified ggerganov committed on Apr 29, 2023
ggml : sync with ggml repo (warning fixes + asserts) caf2759 unverified ggerganov committed on Apr 29, 2023
ggml : sync latest ggml + llama.cpp updates (quantization) ede1268 unverified ggerganov committed on Apr 29, 2023
ggml, ci : fix build on whisper.android (ARM_NEON) + add CI (#764) dedf05b unverified jhenhong committed on Apr 15, 2023
ggml : sync latest changes from ggml and llama.cpp 3bd52ce unverified ggerganov committed on Apr 13, 2023
talk-llama : add new example + sync ggml from llama.cpp (#664) a8c74e6 unverified ggerganov committed on Mar 27, 2023
whisper : reduce memory usage during inference (#431) 3aa9e6c unverified ggerganov committed on Feb 4, 2023
ggml : remove obsolete zeroing + comment fixes (#390) 9c35c0d unverified ggerganov committed on Jan 8, 2023
ggml : correct behaviour of ggml_vec_sum_f32 (#390) ffffc6e unverified Abitofevrything committed on Jan 8, 2023
ggml : improve vec_dot_f16 unrolling in flash_attn_f16 6e57274 unverified ggerganov committed on Jan 8, 2023
ggml : fix running tasks with variable number of threads 2078d85 unverified ggerganov committed on Jan 7, 2023
ggml : unroll ggml_vec_dot_f16 in ggml_compute_forward_flash_attn_f16 f07fecd unverified ggerganov committed on Jan 7, 2023
ggml : speed-up soft max via Accelerate + unroll fdaf59a unverified ggerganov committed on Jan 7, 2023
ggml : use vDSP_sve and vDSP_maxv from Accelerate ed14a8b unverified ggerganov committed on Jan 7, 2023
ggml : add SSE3 and fp16 conversion lookup table (#368) 2c3f7d4 unverified Abitofevrything, ggerganov committed on Jan 6, 2023
ggml : define MIN / MAX only if not defined (minor) 2117da6 unverified ggerganov committed on Jan 5, 2023
ggml : improve f16 acceleration for POWER9 ppc64le f92a260 Thomas Fitzsimmons committed on Dec 30, 2022
ggml : use vaddvq_f32 for slightly more efficient reduce 550fbf8 unverified ggerganov committed on Dec 23, 2022