llama-cpp-pydist 0.13.0__py3-none-any.whl → 0.14.0__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- llama_cpp/binaries/{llama-b6916-bin-win-cpu-x64.zip → llama-b6957-bin-win-cpu-x64.zip} +0 -0
- {llama_cpp_pydist-0.13.0.dist-info → llama_cpp_pydist-0.14.0.dist-info}/METADATA +31 -1
- {llama_cpp_pydist-0.13.0.dist-info → llama_cpp_pydist-0.14.0.dist-info}/RECORD +136 -128
- vendor_llama_cpp_pydist/llama.cpp/.devops/s390x.Dockerfile +4 -1
- vendor_llama_cpp_pydist/llama.cpp/.github/labeler.yml +4 -0
- vendor_llama_cpp_pydist/llama.cpp/.github/workflows/build-linux-cross.yml +37 -37
- vendor_llama_cpp_pydist/llama.cpp/.github/workflows/release.yml +2 -2
- vendor_llama_cpp_pydist/llama.cpp/README.md +3 -4
- vendor_llama_cpp_pydist/llama.cpp/common/arg.cpp +14 -0
- vendor_llama_cpp_pydist/llama.cpp/common/chat.cpp +17 -2
- vendor_llama_cpp_pydist/llama.cpp/common/common.h +6 -0
- vendor_llama_cpp_pydist/llama.cpp/convert_hf_to_gguf.py +143 -0
- vendor_llama_cpp_pydist/llama.cpp/docs/backend/OPENCL.md +25 -3
- vendor_llama_cpp_pydist/llama.cpp/docs/docker.md +3 -3
- vendor_llama_cpp_pydist/llama.cpp/docs/ops/CUDA.csv +1620 -0
- vendor_llama_cpp_pydist/llama.cpp/docs/ops.md +5 -5
- vendor_llama_cpp_pydist/llama.cpp/ggml/include/ggml.h +1 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/CMakeLists.txt +6 -3
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cpu/CMakeLists.txt +10 -3
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cpu/arch/loongarch/quants.c +4 -5
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cpu/arch/s390/cpu-feats.cpp +50 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cpu/ggml-cpu-impl.h +3 -1
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cpu/ops.cpp +57 -8
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cpu/repack.cpp +25 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cpu/simd-mappings.h +25 -25
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cuda/fattn-tile.cu +4 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cuda/fattn-tile.cuh +29 -2
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cuda/fattn.cu +3 -2
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cuda/ggml-cuda.cu +33 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cuda/template-instances/fattn-tile-instance-dkq72-dv72.cu +5 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cuda/template-instances/generate_cu_files.py +3 -1
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cuda/unary.cu +32 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-cuda/unary.cuh +8 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-hexagon/htp/ops-utils.h +17 -17
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-metal/ggml-metal-device.m +1 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-metal/ggml-metal.metal +8 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-opencl/ggml-opencl.cpp +6 -0
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-opencl/kernels/rope.cl +50 -24
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-sycl/repeat_back.cpp +45 -25
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp +274 -406
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-webgpu/ggml-webgpu.cpp +65 -23
- vendor_llama_cpp_pydist/llama.cpp/ggml/src/ggml-webgpu/wgsl-shaders/{set_rows.wgsl → set_rows.tmpl.wgsl} +38 -7
- vendor_llama_cpp_pydist/llama.cpp/gguf-py/gguf/constants.py +21 -0
- vendor_llama_cpp_pydist/llama.cpp/gguf-py/gguf/tensor_mapping.py +2 -0
- vendor_llama_cpp_pydist/llama.cpp/include/llama.h +4 -3
- vendor_llama_cpp_pydist/llama.cpp/scripts/bench-models.sh +74 -0
- vendor_llama_cpp_pydist/llama.cpp/scripts/sync-ggml.last +1 -1
- vendor_llama_cpp_pydist/llama.cpp/src/CMakeLists.txt +1 -0
- vendor_llama_cpp_pydist/llama.cpp/src/llama-arch.cpp +18 -0
- vendor_llama_cpp_pydist/llama.cpp/src/llama-arch.h +1 -0
- vendor_llama_cpp_pydist/llama.cpp/src/llama-chat.cpp +32 -0
- vendor_llama_cpp_pydist/llama.cpp/src/llama-chat.h +1 -0
- vendor_llama_cpp_pydist/llama.cpp/src/llama-context.cpp +27 -10
- vendor_llama_cpp_pydist/llama.cpp/src/llama-context.h +5 -5
- vendor_llama_cpp_pydist/llama.cpp/src/llama-cparams.h +1 -0
- vendor_llama_cpp_pydist/llama.cpp/src/llama-model.cpp +62 -10
- vendor_llama_cpp_pydist/llama.cpp/src/models/gemma2-iswa.cpp +113 -113
- vendor_llama_cpp_pydist/llama.cpp/src/models/gemma3-iswa.cpp +116 -116
- vendor_llama_cpp_pydist/llama.cpp/src/models/glm4-moe.cpp +139 -139
- vendor_llama_cpp_pydist/llama.cpp/src/models/grok.cpp +133 -134
- vendor_llama_cpp_pydist/llama.cpp/src/models/hunyuan-dense.cpp +113 -113
- vendor_llama_cpp_pydist/llama.cpp/src/models/hunyuan-moe.cpp +135 -135
- vendor_llama_cpp_pydist/llama.cpp/src/models/internlm2.cpp +97 -98
- vendor_llama_cpp_pydist/llama.cpp/src/models/jais.cpp +70 -70
- vendor_llama_cpp_pydist/llama.cpp/src/models/jamba.cpp +97 -98
- vendor_llama_cpp_pydist/llama.cpp/src/models/llada-moe.cpp +113 -114
- vendor_llama_cpp_pydist/llama.cpp/src/models/llada.cpp +71 -73
- vendor_llama_cpp_pydist/llama.cpp/src/models/llama.cpp +128 -129
- vendor_llama_cpp_pydist/llama.cpp/src/models/minicpm3.cpp +177 -178
- vendor_llama_cpp_pydist/llama.cpp/src/models/models.h +4 -0
- vendor_llama_cpp_pydist/llama.cpp/src/models/nemotron.cpp +99 -99
- vendor_llama_cpp_pydist/llama.cpp/src/models/neo-bert.cpp +97 -97
- vendor_llama_cpp_pydist/llama.cpp/src/models/olmo.cpp +98 -98
- vendor_llama_cpp_pydist/llama.cpp/src/models/olmo2.cpp +122 -123
- vendor_llama_cpp_pydist/llama.cpp/src/models/olmoe.cpp +107 -107
- vendor_llama_cpp_pydist/llama.cpp/src/models/openai-moe-iswa.cpp +106 -106
- vendor_llama_cpp_pydist/llama.cpp/src/models/openelm.cpp +105 -105
- vendor_llama_cpp_pydist/llama.cpp/src/models/orion.cpp +113 -113
- vendor_llama_cpp_pydist/llama.cpp/src/models/pangu-embedded.cpp +121 -0
- vendor_llama_cpp_pydist/llama.cpp/src/models/phi3.cpp +128 -129
- vendor_llama_cpp_pydist/llama.cpp/src/models/plamo.cpp +100 -100
- vendor_llama_cpp_pydist/llama.cpp/src/models/plm.cpp +154 -154
- vendor_llama_cpp_pydist/llama.cpp/src/models/qwen2.cpp +108 -109
- vendor_llama_cpp_pydist/llama.cpp/src/models/qwen2moe.cpp +127 -127
- vendor_llama_cpp_pydist/llama.cpp/src/models/qwen2vl.cpp +107 -107
- vendor_llama_cpp_pydist/llama.cpp/src/models/qwen3.cpp +107 -107
- vendor_llama_cpp_pydist/llama.cpp/src/models/qwen3moe.cpp +115 -115
- vendor_llama_cpp_pydist/llama.cpp/src/models/refact.cpp +76 -76
- vendor_llama_cpp_pydist/llama.cpp/src/models/seed-oss.cpp +100 -100
- vendor_llama_cpp_pydist/llama.cpp/src/models/smallthinker.cpp +92 -92
- vendor_llama_cpp_pydist/llama.cpp/src/models/smollm3.cpp +100 -100
- vendor_llama_cpp_pydist/llama.cpp/src/models/starcoder.cpp +81 -81
- vendor_llama_cpp_pydist/llama.cpp/src/models/starcoder2.cpp +93 -93
- vendor_llama_cpp_pydist/llama.cpp/src/models/t5-dec.cpp +135 -135
- vendor_llama_cpp_pydist/llama.cpp/src/models/t5-enc.cpp +77 -77
- vendor_llama_cpp_pydist/llama.cpp/src/models/wavtokenizer-dec.cpp +106 -106
- vendor_llama_cpp_pydist/llama.cpp/src/models/xverse.cpp +89 -89
- vendor_llama_cpp_pydist/llama.cpp/tests/test-backend-ops.cpp +21 -2
- vendor_llama_cpp_pydist/llama.cpp/tests/test-thread-safety.cpp +8 -1
- vendor_llama_cpp_pydist/llama.cpp/tools/batched-bench/batched-bench.cpp +0 -2
- vendor_llama_cpp_pydist/llama.cpp/tools/mtmd/clip-impl.h +2 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/mtmd/clip.cpp +239 -47
- vendor_llama_cpp_pydist/llama.cpp/tools/mtmd/clip.h +10 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/mtmd/mtmd-cli.cpp +7 -4
- vendor_llama_cpp_pydist/llama.cpp/tools/mtmd/mtmd.cpp +29 -14
- vendor_llama_cpp_pydist/llama.cpp/tools/mtmd/mtmd.h +5 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/README.md +1 -1
- vendor_llama_cpp_pydist/llama.cpp/tools/server/public/index.html.gz +0 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/server.cpp +89 -21
- vendor_llama_cpp_pydist/llama.cpp/tools/server/tests/unit/test_chat_completion.py +4 -4
- vendor_llama_cpp_pydist/llama.cpp/tools/server/tests/unit/test_completion.py +31 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/tests/unit/test_infill.py +2 -2
- vendor_llama_cpp_pydist/llama.cpp/tools/server/tests/utils.py +3 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/utils.hpp +2 -1
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/package-lock.json +361 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/package.json +1 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/components/app/chat/ChatAttachments/ChatAttachmentPreviewDialog.svelte +9 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/components/app/chat/ChatMessages/ChatMessageAssistant.svelte +2 -2
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/components/app/misc/MarkdownContent.svelte +3 -12
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/constants/latex-protection.ts +35 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/services/chat.ts +16 -4
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/stores/chat.svelte.ts +52 -2
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/stores/server.svelte.ts +113 -64
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/types/api.d.ts +1 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/types/settings.d.ts +1 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/utils/latex-protection.test.ts +355 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/lib/utils/latex-protection.ts +267 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/stories/fixtures/math-formulas.ts +65 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/src/styles/katex-custom.scss +13 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/svelte.config.js +3 -0
- vendor_llama_cpp_pydist/llama.cpp/tools/server/webui/vite.config.ts +28 -3
- vendor_llama_cpp_pydist/llama.cpp/vendor/minja/chat-template.hpp +9 -2
- vendor_llama_cpp_pydist/llama.cpp/vendor/minja/minja.hpp +5 -7
- {llama_cpp_pydist-0.13.0.dist-info → llama_cpp_pydist-0.14.0.dist-info}/WHEEL +0 -0
- {llama_cpp_pydist-0.13.0.dist-info → llama_cpp_pydist-0.14.0.dist-info}/licenses/LICENSE +0 -0
- {llama_cpp_pydist-0.13.0.dist-info → llama_cpp_pydist-0.14.0.dist-info}/top_level.txt +0 -0
|
Binary file
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.4
|
|
2
2
|
Name: llama_cpp_pydist
|
|
3
|
-
Version: 0.
|
|
3
|
+
Version: 0.14.0
|
|
4
4
|
Summary: A Python package for Llama CPP.
|
|
5
5
|
Home-page: https://github.com/shamitv/llama_cpp
|
|
6
6
|
Author: Shamit Verma
|
|
@@ -146,6 +146,36 @@ For instructions on how to build the package from source, update the `llama.cpp`
|
|
|
146
146
|
|
|
147
147
|
# Changelog
|
|
148
148
|
|
|
149
|
+
## 2025-11-05: Update to llama.cpp b6957
|
|
150
|
+
|
|
151
|
+
- b6919 (b6919) – 2025-11-01 – https://github.com/ggml-org/llama.cpp/releases/tag/b6919
|
|
152
|
+
- b6920 (b6920) – 2025-11-02 – https://github.com/ggml-org/llama.cpp/releases/tag/b6920
|
|
153
|
+
- b6922 (b6922) – 2025-11-02 – https://github.com/ggml-org/llama.cpp/releases/tag/b6922
|
|
154
|
+
- b6923 (b6923) – 2025-11-02 – https://github.com/ggml-org/llama.cpp/releases/tag/b6923
|
|
155
|
+
- b6924 (b6924) – 2025-11-02 – https://github.com/ggml-org/llama.cpp/releases/tag/b6924
|
|
156
|
+
- b6927 (b6927) – 2025-11-02 – https://github.com/ggml-org/llama.cpp/releases/tag/b6927
|
|
157
|
+
- b6929 (b6929) – 2025-11-02 – https://github.com/ggml-org/llama.cpp/releases/tag/b6929
|
|
158
|
+
- b6931 (b6931) – 2025-11-03 – https://github.com/ggml-org/llama.cpp/releases/tag/b6931
|
|
159
|
+
- b6932 (b6932) – 2025-11-03 – https://github.com/ggml-org/llama.cpp/releases/tag/b6932
|
|
160
|
+
- b6933 (b6933) – 2025-11-03 – https://github.com/ggml-org/llama.cpp/releases/tag/b6933
|
|
161
|
+
- b6934 (b6934) – 2025-11-03 – https://github.com/ggml-org/llama.cpp/releases/tag/b6934
|
|
162
|
+
- b6935 (b6935) – 2025-11-03 – https://github.com/ggml-org/llama.cpp/releases/tag/b6935
|
|
163
|
+
- b6936 (b6936) – 2025-11-03 – https://github.com/ggml-org/llama.cpp/releases/tag/b6936
|
|
164
|
+
- b6937 (b6937) – 2025-11-03 – https://github.com/ggml-org/llama.cpp/releases/tag/b6937
|
|
165
|
+
- b6940 (b6940) – 2025-11-03 – https://github.com/ggml-org/llama.cpp/releases/tag/b6940
|
|
166
|
+
- b6941 (b6941) – 2025-11-04 – https://github.com/ggml-org/llama.cpp/releases/tag/b6941
|
|
167
|
+
- b6942 (b6942) – 2025-11-04 – https://github.com/ggml-org/llama.cpp/releases/tag/b6942
|
|
168
|
+
- b6943 (b6943) – 2025-11-04 – https://github.com/ggml-org/llama.cpp/releases/tag/b6943
|
|
169
|
+
- b6945 (b6945) – 2025-11-04 – https://github.com/ggml-org/llama.cpp/releases/tag/b6945
|
|
170
|
+
- b6947 (b6947) – 2025-11-04 – https://github.com/ggml-org/llama.cpp/releases/tag/b6947
|
|
171
|
+
- b6948 (b6948) – 2025-11-04 – https://github.com/ggml-org/llama.cpp/releases/tag/b6948
|
|
172
|
+
- b6949 (b6949) – 2025-11-04 – https://github.com/ggml-org/llama.cpp/releases/tag/b6949
|
|
173
|
+
- b6953 (b6953) – 2025-11-05 – https://github.com/ggml-org/llama.cpp/releases/tag/b6953
|
|
174
|
+
- b6954 (b6954) – 2025-11-05 – https://github.com/ggml-org/llama.cpp/releases/tag/b6954
|
|
175
|
+
- b6955 (b6955) – 2025-11-05 – https://github.com/ggml-org/llama.cpp/releases/tag/b6955
|
|
176
|
+
- b6957 (b6957) – 2025-11-05 – https://github.com/ggml-org/llama.cpp/releases/tag/b6957
|
|
177
|
+
|
|
178
|
+
|
|
149
179
|
## 2025-11-01: Update to llama.cpp b6916
|
|
150
180
|
|
|
151
181
|
- b6904 (b6904) – 2025-10-31 – https://github.com/ggml-org/llama.cpp/releases/tag/b6904
|