Releases: ggml-org/llama.cpp

b7413

16 Dec 00:18
4529c66

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
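For deployment scripts that need to survive the switch, one option is to detect the archive type from the file contents instead of the extension. The sketch below is only an illustration: the function name `extract_release` and the idea of returning the member names are assumptions, not part of any llama.cpp tooling.

```python
import tarfile
import zipfile
from pathlib import Path
from typing import List


def extract_release(archive: Path, dest: Path) -> List[str]:
    """Unpack a release archive into dest, accepting .tar.gz or .zip.

    Hypothetical helper: the format is detected from the file contents,
    so the same script works before and after the format change.
    """
    dest.mkdir(parents=True, exist_ok=True)

    if tarfile.is_tarfile(archive):
        # mode "r:*" lets tarfile pick the compression (gzip here) itself.
        with tarfile.open(archive, mode="r:*") as tf:
            names = tf.getnames()
            # Use the safer "data" extraction filter where the running
            # Python version provides it.
            kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
            tf.extractall(dest, **kwargs)
            return names

    if zipfile.is_zipfile(archive):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
            return zf.namelist()

    raise ValueError(f"unsupported archive format: {archive}")
```

Calling `extract_release(Path("release-archive"), Path("/opt/llama.cpp"))` with either kind of file should then behave the same way; both paths here are placeholders.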

kv-cache: Fix state restore fragmented cache (#17982)

  • kv-cache : fix state restore with fragmented cache (#17527)

Change find_slot to allow non-contiguous allocation during state restore. This fixes the 'failed to find available cells in kv cache' error when restoring state to a fragmented cache.

  • tests : update logic

  • cleanup: tightened state_read_meta sig, added is_contiguous case

  • fix: state_read_meta arg reorder loose ends


Co-authored-by: Georgi Gerganov
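To illustrate why a contiguous-only search can fail on a fragmented cache while a non-contiguous one succeeds, here is a toy model. It is not llama.cpp's `find_slot`: the cache is reduced to a list of booleans, and the name `find_cells` and its `contiguous` parameter are assumptions made for this sketch.

```python
from typing import List, Optional


def find_cells(used: List[bool], n: int, contiguous: bool) -> Optional[List[int]]:
    """Return indices of n free cells, or None if they cannot be found.

    Toy model only: `used[i]` is True when cell i is occupied.
    """
    if n <= 0:
        return []

    if contiguous:
        # Scan for a run of n consecutive free cells.
        run_start, run_len = 0, 0
        for i, occupied in enumerate(used):
            if occupied:
                run_start, run_len = i + 1, 0
            else:
                run_len += 1
                if run_len == n:
                    return list(range(run_start, i + 1))
        return None

    # Non-contiguous: any n free cells will do, wherever they are.
    free = [i for i, occupied in enumerate(used) if not occupied]
    return free[:n] if len(free) >= n else None


# A fragmented cache: three free cells, but no run of three in a row.
cache = [True, False, True, False, False, True]
print(find_cells(cache, 3, contiguous=True))   # None
print(find_cells(cache, 3, contiguous=False))  # [1, 3, 4]
```

In this model the contiguous search reports failure even though enough cells are free, which mirrors the error described above.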

macOS/iOS:

Linux:

Windows:

openEuler:

b7411

15 Dec 19:47
165caaf

metal: use shared buffers on eGPU (#17866)

  • metal: use shared buffers on eGPU

With #15906, I noticed an important regression when using the Metal backend on an eGPU.
This commit restores the previous behavior and adds an option to force its activation.

b7410

15 Dec 18:01
96a181a

mtmd: refactor audio preprocessing (#17978)

  • mtmd: refactor audio preprocessing

  • refactor

Co-authored-by: Tarek

  • wip

  • wip (2)

  • improve constructor

  • fix use_natural_log

  • fix padding for short input

  • clean up

  • remove need_chunking


Co-authored-by: Tarek

b7406

15 Dec 04:15
4aced7a

[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai (#17826)

  • support gpt-oss GPU by OP add-id, mul_mat for mxfp4, swiglu_oai, fix warning

  • fix fault ut case, update ops.md

  • rebase, fix format issue

b7405

15 Dec 04:09
745fa0e

model : add glm-asr support (#17901)

  • [model] add glm-asr support

  • fix format for ci

  • fix convert format for ci

  • update glm_asr convert script & use build_ffn for glm_asr clip & use build_stack for padding and review

  • check root architecture for convert hf script

  • fix conflict with upstream

  • fix convert script for glm asr & format clip-impl

  • format

  • restore hparams text

  • improved conversion


Co-authored-by: Sigbjørn Skjæret

b7404

14 Dec 22:34
5239229

preset: handle negated arg, reverse the meaning if needed (#18041)
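The title is terse, so as one plausible reading only: a preset entry written against a negated flag can be folded into its positive form by flipping the boolean. The sketch below assumes a `no-` prefix convention and uses made-up key names; `normalize_preset_entry` is a hypothetical helper and does not reflect the actual llama.cpp implementation.

```python
from typing import Tuple


def normalize_preset_entry(key: str, value: bool, prefix: str = "no-") -> Tuple[str, bool]:
    """Map a negated boolean key to its positive form with the value flipped.

    Hypothetical helper: keys without the prefix pass through unchanged.
    """
    if key.startswith(prefix) and len(key) > len(prefix):
        # "no-feature = false" means the same as "feature = true".
        return key[len(prefix):], not value
    return key, value


print(normalize_preset_entry("no-feature", False))  # ('feature', True)
print(normalize_preset_entry("feature", True))      # ('feature', True)
```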

b7402

14 Dec 19:48
37f5a10

mtmd: enhance image resizing in llava_uhd (#18014)

b7401

14 Dec 19:19
9e6649e

vulkan: fix mul_mat_vec_iq1_s formatting (#18026)

b7400

14 Dec 18:25
0759b09

graph: add f_attn_temp_offset (#18025)

b7399

14 Dec 13:17
254098a

common : refactor common_sampler + grammar logic changes (#17937)

  • common : refactor common_sampler + grammar logic changes

  • tests : increase max_tokens to get needed response

  • batched : fix uninitialized samplers
