llama_cpp 0.17.9 → 0.18.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- checksums.yaml +4 -4
- data/CHANGELOG.md +17 -0
- data/LICENSE.txt +1 -1
- data/README.md +8 -29
- data/ext/llama_cpp/extconf.rb +0 -3
- data/ext/llama_cpp/llama_cpp.c +5157 -0
- data/ext/llama_cpp/llama_cpp.h +0 -5
- data/lib/llama_cpp/version.rb +3 -3
- data/lib/llama_cpp.rb +38 -83
- data/sig/llama_cpp.rbs +3 -59
- metadata +4 -12
- data/examples/README.md +0 -92
- data/examples/chat.rb +0 -198
- data/examples/embedding.rb +0 -42
- data/examples/prompt_jp.txt +0 -8
- data/examples/simple.rb +0 -96
- data/ext/llama_cpp/llama_cpp.cpp +0 -3761
checksums.yaml
CHANGED

```diff
@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz:
-  data.tar.gz:
+  metadata.gz: 0a9263eee75a3d91907c711565799fd25820d8d2b0f0ae9818a24a0798b49bf4
+  data.tar.gz: 9f05051e8972baea44c4bd33f63190d5da8a62156c419fc02ec6b96ea3545c31
 SHA512:
-  metadata.gz:
-  data.tar.gz:
+  metadata.gz: 2543f1022462d32694649f2226d097537672bd6921af4fe6948687e00ef007b2e0425110abc7a3240d5dceb25131a3641bc9614f0e1117f3a7a40dcc55b23190
+  data.tar.gz: 0c54ea7e7617e99f52b0f005e8f871d5f69423e92d350bc93562ba56b48ff85a88ee6c3f01e1e6a71a0782a1bcf212f4900c7b88229a63935349cbdabee95cfc
```
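The new checksums can be checked against a locally fetched copy of the gem. A minimal verification sketch, assuming the gem was fetched with `gem fetch llama_cpp -v 0.18.0` and unpacked with `tar -xf llama_cpp-0.18.0.gem` (a `.gem` file is a tar archive whose standard members include `metadata.gz` and `data.tar.gz`):

```ruby
require 'digest'

# SHA256 values taken from the checksums.yaml diff above.
expected = {
  'metadata.gz' => '0a9263eee75a3d91907c711565799fd25820d8d2b0f0ae9818a24a0798b49bf4',
  'data.tar.gz' => '9f05051e8972baea44c4bd33f63190d5da8a62156c419fc02ec6b96ea3545c31'
}

expected.each do |file, sha256|
  actual = Digest::SHA256.file(file).hexdigest
  puts "#{file}: #{actual == sha256 ? 'OK' : 'MISMATCH'}"
end
```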
data/CHANGELOG.md
CHANGED

```diff
@@ -1,3 +1,20 @@
+## [[0.18.0](https://github.com/yoshoku/llama_cpp.rb/compare/v0.17.10...v0.18.0)] - 2025-02-02
+
+**Breaking Changes**
+All the native extensions code was rewritten in C. The high-level API has been removed and replaced with a simple bindings library.
+The fast update speed of llama.cpp makes it difficult to keep up with the creation of this binding library.
+[As previously noted](https://github.com/yoshoku/llama_cpp.rb/blob/main/CHANGELOG.md#060---2023-09-30),
+the author has given up on continuing to develop this binding library. Thank you for your understanding.
+
+## [[0.17.10](https://github.com/yoshoku/llama_cpp.rb/compare/v0.17.9...v0.17.10)] - 2024-09-07
+
+- Change supported llama.cpp version to b3676.
+- Add `LLAMA_VOCAB_TYPE_RWKV` constant.
+- Add `LLAMA_FTYPE_MOSTLY_TQ1_0` and `LLAMA_FTYPE_MOSTLY_TQ2_0` constants.
+- Change type of n_threads and n_threads_batch from uint32_t to int32 in native extension codes.
+
+Implementation bindings for llama_attach_threadpool and llama_detach_threadpool have been skipped.
+
 ## [[0.17.9](https://github.com/yoshoku/llama_cpp.rb/compare/v0.17.8...v0.17.9)] - 2024-08-31
 
 - Change supported llama.cpp version to b3639.
```
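The breaking change is easiest to see side by side. The sketch below pieces together the removed and added lines from the README diff further down; paths are placeholders and error handling is omitted, so treat it as illustrative rather than complete:

```ruby
# 0.17.x: high-level wrapper objects (removed in 0.18.0).
# model_params and context_params are the old API's params objects,
# constructed elsewhere.
model = LLaMACpp::Model.new(model_path: '/path/to/model.bin', params: model_params)
context = LLaMACpp::Context.new(model: model, params: context_params)
puts LLaMACpp.generate(context, 'Hello, World.')

# 0.18.0: thin bindings that mirror the llama.cpp C API one-to-one.
model = LlamaCpp.llama_model_load_from_file('/path/to/model.gguf',
                                            LlamaCpp::LlamaModelParams.new)
context = LlamaCpp.llama_init_from_model(model, LlamaCpp::LlamaContextParams.new)
```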
data/LICENSE.txt
CHANGED

```diff
@@ -1,6 +1,6 @@
 The MIT License (MIT)
 
-Copyright (c) 2023-
+Copyright (c) 2023-2025 Atsushi Tatsuma
 
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
```
data/README.md
CHANGED

````diff
@@ -57,41 +57,20 @@ An example of Ruby code that generates sentences with the quantization model is
 ```ruby
 require 'llama_cpp'
 
-
-model = LLaMACpp::Model.new(model_path: '/home/user/llama.cpp/models/open_llama_7b/ggml-model-q4_0.bin', params: model_params)
+LlamaCpp.ggml_backend_load_all
 
-
-
-context = LLaMACpp::Context.new(model: model, params: context_params)
+model_params = LlamaCpp::LlamaModelParams.new
+model = LlamaCpp::llama_model_load_from_file('/home/user/llama.cpp/models/open_llama_7b/ggml-model-q4_0.bin', model_params)
 
-
-
-
-## Examples
-There is a sample program in the [examples](https://github.com/yoshoku/llama_cpp.rb/tree/main/examples) directory that allow interactvie communication like ChatGPT.
-
-```sh
-$ git clone https://github.com/yoshoku/llama_cpp.rb.git
-$ cd examples
-$ bundle install
-$ ruby chat.rb --model /home/user/llama.cpp/models/open_llama_7b/ggml-model-q4_0.bin --seed 2023
-...
-User: Who is the originator of the Ruby programming language?
-Bob: The originator of the Ruby programming language is Mr. Yukihiro Matsumoto.
-User:
-```
-
-![]()
+context_params = LlamaCpp::LlamaContextParams.new
+context = LlamaCpp.llama_init_from_model(model, context_params)
 
-
+puts LLaMACpp.generate(context, 'Hello, World.')
 
-
-
-$ ruby chat.rb --model ggml-vicuna-7b-1.1-q4_0.bin --file prompt_jp.txt
+LlamaCpp.llama_free(context)
+LlamaCpp.llama_model_free(model)
 ```
 
-![]()
-
 ## Contributing
 
 Bug reports and pull requests are welcome on GitHub at https://github.com/yoshoku/llama_cpp.rb.
````
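Because the 0.18.0 API mirrors C, contexts and models are freed explicitly rather than left to the garbage collector. Below is a minimal lifecycle sketch using only the calls shown in the diff above; the `begin`/`ensure` wrapping is a suggested pattern rather than part of the README, and the model path is a placeholder:

```ruby
require 'llama_cpp'

LlamaCpp.ggml_backend_load_all

model = LlamaCpp.llama_model_load_from_file('/path/to/model.gguf',
                                            LlamaCpp::LlamaModelParams.new)
begin
  context = LlamaCpp.llama_init_from_model(model, LlamaCpp::LlamaContextParams.new)
  begin
    # tokenization, decoding, and sampling would go here via the
    # corresponding llama_* bindings
  ensure
    LlamaCpp.llama_free(context) # free the context before the model
  end
ensure
  LlamaCpp.llama_model_free(model)
end
```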
data/ext/llama_cpp/extconf.rb
CHANGED

```diff
@@ -2,10 +2,7 @@
 
 require 'mkmf'
 
-abort('libstdc++ is not found.') unless have_library('stdc++')
 abort('libllama is not found.') unless have_library('llama')
 abort('llama.h is not found.') unless have_header('llama.h')
 
-$CXXFLAGS << ' -std=c++11'
-
 create_makefile('llama_cpp/llama_cpp')
```
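With the C++ toolchain checks gone, the extension only needs `libllama` and `llama.h` at build time. If llama.cpp is installed under a non-standard prefix, mkmf's generic `--with-opt-*` options should let the `have_library`/`have_header` checks above find it; a hedged sketch, with the install prefix as a placeholder:

```sh
# assumes llama.cpp was built and installed under /opt/llama.cpp
gem install llama_cpp -- \
  --with-opt-include=/opt/llama.cpp/include \
  --with-opt-lib=/opt/llama.cpp/lib
```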