Pull requests: ggml-org/llama.cpp
vulkan: Implement set_tensor_async and the event interfaces
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18047
opened Dec 15, 2025 by
jeffbolznv
convert : keep file part order from model index
python
python script changes
#18043
opened Dec 14, 2025 by
CISC
[Speculative decoding] feat: add EAGLE3 speculative decoding support
examples
ggml
changes relating to the ggml tensor library for machine learning
model
Model specific
python
python script changes
#18039
opened Dec 14, 2025 by
ichbinhandsome
•
Draft
Extend run-org-model.py
examples
python
python script changes
#18034
opened Dec 14, 2025 by
pwilkin
vulkan: use 4 rows for scalar FA large tile size
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18033
opened Dec 14, 2025 by
jeffbolznv
Vulkan: some improvement on mul_mat_iq2_xs
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18031
opened Dec 14, 2025 by
lovedheart
feat: add --moe-n-expert flag for MoE expert count override
#18029
opened Dec 14, 2025 by
pestopoppa
4 tasks done
added note for old Intel hardware pre sycl
documentation
Improvements or additions to documentation
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#18017
opened Dec 14, 2025 by
alosslessdev
on opencl added note for pre SYCL Intel hardware
documentation
Improvements or additions to documentation
OpenCL
Issues specific to the OpenCL backend
#18016
opened Dec 14, 2025 by
alosslessdev
Add message for pre-RDNA AMD GPU support via opencl
documentation
Improvements or additions to documentation
#18015
opened Dec 14, 2025 by
alosslessdev
webui: fix chat screen shadow width
examples
server
#18010
opened Dec 13, 2025 by
polydecay
CLI: fixed adding cli and completion into docker containers, improved docs
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
#18003
opened Dec 13, 2025 by
andrew-aladev
Clarify that steps also apply to linux
documentation
Improvements or additions to documentation
#18002
opened Dec 13, 2025 by
alosslessdev
arg: clarify auto kvu/np being set on server
examples
server
#17997
opened Dec 13, 2025 by
ngxson
Optimization: Qwen3 next autoregressive pass
model
Model specific
#17996
opened Dec 13, 2025 by
pwilkin
CLI: fixed dead links to tools/main for cli and completion, fixed code owners
documentation
Improvements or additions to documentation
examples
#17993
opened Dec 13, 2025 by
andrew-aladev
HIP: Refactor mma for RDNA and CDNA
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17990
opened Dec 13, 2025 by
zhang-hui-yulo
•
Draft
1 task
kv-cache: Fix state restore fragmented cache
testing
Everything test related
#17982
opened Dec 13, 2025 by
ssweens