Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

model : add ASR support for LFM2-Audio-1.5B (conformer)
#18106 opened Dec 16, 2025 by ngxson Loading…
model: fix LFM2 missing tensors
#18105 opened Dec 16, 2025 by ngxson Loading…
llama-fit-params: force disable mlock
#18103 opened Dec 16, 2025 by JohannesGaessler Loading…
ggml-cuda: Delta-Net linear attention for Qwen3-Next
#18102 opened Dec 16, 2025 by hauhaut Loading…
ggml : use WARP_SIZE/2 for argmax reduction offset ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#18092 opened Dec 16, 2025 by Aadeshveer Loading…
ggml: migrate work_data to stack allocation ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#18083 opened Dec 16, 2025 by GermanAizek Loading…
vulkan/cuda: fix topk_moe with exp_probs_b ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related Vulkan Issues specific to the Vulkan backend
#18071 opened Dec 15, 2025 by jeffbolznv Loading…
vulkan: support GGML_UNARY_OP_XIELU ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#18062 opened Dec 15, 2025 by jeffbolznv Loading…
vulkan: in graph_optimize, try to group ADD operations ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#18060 opened Dec 15, 2025 by jeffbolznv Loading…
CLI: llama-cli and llama-completion cosmetics devops improvements to build systems and github actions documentation Improvements or additions to documentation python python script changes script Script related SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#18053 opened Dec 15, 2025 by andrew-aladev Loading…
vulkan: Implement set_tensor_async and the event interfaces ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#18047 opened Dec 15, 2025 by jeffbolznv Loading…
chat-parser: handle whitespace around JSON in tool call parsing testing Everything test related
#18044 opened Dec 15, 2025 by ochafik Draft
convert : keep file part order from model index python python script changes
#18043 opened Dec 14, 2025 by CISC Loading…
[Speculative decoding] feat: add EAGLE3 speculative decoding support examples ggml changes relating to the ggml tensor library for machine learning model Model specific python python script changes
#18039 opened Dec 14, 2025 by ichbinhandsome Draft
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.