Skip to content

Releases: ggml-org/llama.cpp

b7399

14 Dec 13:17
254098a

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

common : refactor common_sampler + grammar logic changes (#17937)

  • common : refactor common_sampler + grammar logic changes

  • tests : increase max_tokens to get needed response

  • batched : fix uninitialized samplers

macOS/iOS:

Linux:

Windows:

openEuler:

b7398

14 Dec 13:08
3238b14

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

vulkan: Fix data race/hang in scalar/cm1 flash attention (#17887)

macOS/iOS:

Linux:

Windows:

openEuler:

b7397

14 Dec 12:36
4722671

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

vulkan: improve mul_mat_vec_iq1_s speed (#17874)

macOS/iOS:

Linux:

Windows:

openEuler:

b7394

14 Dec 11:22
609a2d0

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

models : fix YaRN regression + consolidate logic (#18006)

  • models : fix YaRN regression + consolidate logic

  • cont : fix the fix

  • cont : remove header

  • cont : add header

macOS/iOS:

Linux:

Windows:

openEuler:

b7393

14 Dec 10:13

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

ggml : arm repack fix build

macOS/iOS:

Linux:

Windows:

openEuler:

b7388

13 Dec 23:46
4ed2bae

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

server-models.cpp: add missing (#18000)

Fixes: #17999

macOS/iOS:

Linux:

Windows:

openEuler:

b7387

13 Dec 21:33
5266379

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

llama_context: synchronize before reallocating output buffer (#17974)

macOS/iOS:

Linux:

Windows:

openEuler:

b7386

13 Dec 21:25
4d5ae24

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

arg: fix common_params_parse not accepting negated arg (#17991)

macOS/iOS:

Linux:

Windows:

openEuler:

b7385

13 Dec 20:54
66ba512

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

cmake: correct scope - link ws2_32 for MinGW/w64devkit builds in cpp-httplib (#17972)

  • fix - w64devkit build

  • fix - w64devkit build private scope

macOS/iOS:

Linux:

Windows:

openEuler:

b7384

13 Dec 19:59
36255a2

Choose a tag to compare

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

vulkan: support get_rows for i32 (#17941)

macOS/iOS:

Linux:

Windows:

openEuler: