ollama-ruby 0.7.0 → 0.9.0
- checksums.yaml +4 -4
- data/CHANGES.md +153 -110
- data/README.md +168 -166
- data/bin/ollama_chat +134 -76
- data/docker-compose.yml +1 -1
- data/lib/ollama/documents.rb +2 -1
- data/lib/ollama/handlers/markdown.rb +1 -2
- data/lib/ollama/utils/fetcher.rb +2 -0
- data/lib/ollama/version.rb +1 -1
- data/ollama-ruby.gemspec +4 -4
- data/spec/ollama/client_spec.rb +3 -3
- data/spec/ollama/documents/redis_backed_memory_cache_spec.rb +1 -1
- data/spec/ollama/documents/redis_cache_spec.rb +1 -1
- data/spec/ollama/documents_spec.rb +5 -5
- data/spec/ollama/handlers/markdown_spec.rb +0 -2
- data/spec/ollama/utils/fetcher_spec.rb +1 -1
- metadata +4 -4
data/README.md
CHANGED
````diff
@@ -26,161 +26,23 @@ gem 'ollama-ruby'
 
 to your Gemfile and run `bundle install` in your terminal.
 
-##
-
-### ollama\_chat
-
-This is a chat client that can be used to connect to an ollama server and enter
-a chat conversation with an LLM. It can be called with the following arguments:
-
-```
-Usage: ollama_chat [OPTIONS]
-
-  -f CONFIG      config file to read
-  -u URL         the ollama base url, OLLAMA_URL
-  -m MODEL       the ollama model to chat with, OLLAMA_CHAT_MODEL
-  -s SYSTEM      the system prompt to use as a file, OLLAMA_CHAT_SYSTEM
-  -c CHAT        a saved chat conversation to load
-  -C COLLECTION  name of the collection used in this conversation
-  -D DOCUMENT    load document and add to embeddings collection (multiple)
-  -M             use (empty) MemoryCache for this chat session
-  -E             disable embeddings for this chat session
-  -V             display the current version number and quit
-  -h             this help
-```
-
-The base URL can either be set via the environment variable `OLLAMA_URL` or is
-derived from the environment variable `OLLAMA_HOST`. The default model to
-connect to can be configured via the environment variable `OLLAMA_MODEL`.
-
-The YAML config file in `$XDG_CONFIG_HOME/ollama_chat/config.yml` can be used
-for more complex settings; it looks like this:
-
-```
----
-url: <%= ENV['OLLAMA_URL'] || 'http://%s' % ENV.fetch('OLLAMA_HOST') %>
-model:
-  name: <%= ENV.fetch('OLLAMA_CHAT_MODEL', 'llama3.1') %>
-  options:
-    num_ctx: 8192
-system: <%= ENV.fetch('OLLAMA_CHAT_SYSTEM', 'null') %>
-voice: Samantha
-markdown: true
-embedding:
-  enabled: true
-  model:
-    name: mxbai-embed-large
-    options: {}
-  collection: <%= ENV.fetch('OLLAMA_CHAT_COLLECTION', 'ollama_chat') %>
-  found_texts_size: 4096
-  splitter:
-    name: RecursiveCharacter
-    chunk_size: 1024
-cache: Ollama::Documents::RedisCache
-redis:
-  url: <%= ENV.fetch('REDIS_URL', 'null') %>
-debug: <%= ENV['OLLAMA_CHAT_DEBUG'].to_i == 1 ? true : false %>
-```
-
-If you want to store embeddings persistently, set an environment variable
-`REDIS_URL` or update the `redis.url` setting in your `config.yml` file to
-connect to a Redis server. Without this setup, embeddings will only be stored
-in process memory, which is less durable.
-
-Some settings can be passed as arguments as well, e.g. if you want to choose a
-specific system prompt:
-
-```
-$ ollama_chat -s sherlock.txt
-Model with architecture llama found.
-Connecting to llama3.1@http://ollama.local.net:11434 now…
-Configured system prompt is:
-You are Sherlock Holmes and the user is your new client, Dr. Watson is also in
-the room. You will talk and act in the typical manner of Sherlock Holmes do and
-try to solve the user's case using logic and deduction.
-
-Type /help to display the chat help.
-📨 user:
-Good morning.
-📨 assistant:
-Ah, good morning, my dear fellow! It is a pleasure to make your acquaintance. I
-am Sherlock Holmes, the renowned detective, and this is my trusty sidekick, Dr.
-Watson. Please, have a seat and tell us about the nature of your visit. What
-seems to be the problem that has brought you to our humble abode at 221B Baker
-Street?
-
-(Watson nods in encouragement as he takes notes)
-
-Now, pray tell, what is it that puzzles you, my dear client? A missing item,
-perhaps? Or a mysterious occurrence that requires clarification? The game, as
-they say, is afoot!
-```
-
-This example shows how an image like this can be sent to a vision model for
-analysis:
-
-![cat](spec/assets/kitten.jpg)
-
-```
-$ ollama_chat -m llava-llama3
-Model with architecture llama found.
-Connecting to llava-llama3@http://localhost:11434 now…
-Type /help to display the chat help.
-📸 user> What's on this image? ./spec/assets/kitten.jpg
-📨 assistant:
-The image captures a moment of tranquility featuring a young cat. The cat,
-adorned with gray and white fur marked by black stripes on its face and legs,
-is the central figure in this scene. Its eyes, a striking shade of blue, are
-wide open and directed towards the camera, giving an impression of curiosity or
-alertness.
-
-The cat is comfortably nestled on a red blanket, which contrasts vividly with
-its fur. The blanket, soft and inviting, provides a sense of warmth to the
-image. In the background, partially obscured by the cat's head, is another
-blanket of similar red hue. The repetition of the color adds a sense of harmony
-to the composition.
-
-The cat's position on the right side of the photo creates an interesting
-asymmetry with the camera lens, which occupies the left side of the frame. This
-visual balance enhances the overall composition of the image.
+## Usage
 
-
-solely on the cat and its immediate surroundings. The image does not provide
-any information about the location or setting beyond what has been described.
-The simplicity of the scene allows the viewer to concentrate on the main
-subject - the young, blue-eyed cat.
-```
+In your own software the library can be used as shown in this example:
 
-
+```ruby
+require "ollama"
+include Ollama
 
-
-
-
-
-
-
-/voice( change)            toggle voice output or change the voice
-/list [n]                  list the last n / all conversation exchanges
-/clear                     clear the whole conversation
-/clobber                   clear the conversation and collection
-/pop [n]                   pop the last n exchanges, defaults to 1
-/model                     change the model
-/system                    change system prompt (clears conversation)
-/regenerate                the last answer message
-/collection( clear|change) change (default) collection or clear
-/info                      show information for current session
-/import source             import the source's content
-/summarize [n] source      summarize the source's content in n words
-/embedding                 toggle embedding paused or not
-/embed source              embed the source's content
-/web [n] query             query web search & return n or 1 results
-/save filename             store conversation messages
-/load filename             load conversation messages
-/quit                      to quit
-/help                      to view this help
+ollama = Client.new(base_url: 'http://localhost:11434')
+messages = Message.new(role: 'user', content: 'Why is the sky blue?')
+ollama.chat(model: 'llama3.1', stream: true, messages:, &Print) # or
+print ollama.chat(model: 'llama3.1', stream: true, messages:).lazy.map { |response|
+  response.message.content
+}
 ```
 
-
+## Try out things in ollama\_console
 
 This is an interactive console that can be used to try the different commands
 provided by an `Ollama::Client` instance. For example this command generates a
````
````diff
@@ -197,21 +59,6 @@ Commands: chat,copy,create,delete,embeddings,generate,help,ps,pull,push,show,tag
 > In a small village nestled between two great palm trees 🌳, there lived a
 > brave adventurer named Alex 👦. […]
 
-## Usage
-
-In your own software the library can be used as shown in this example:
-
-```ruby
-require "ollama"
-include Ollama
-
-ollama = Client.new(base_url: 'http://localhost:11434')
-messages = Message.new(role: 'user', content: 'Why is the sky blue?')
-ollama.chat(model: 'llama3.1', stream: true, messages:, &Print) # or
-print ollama.chat(model: 'llama3.1', stream: true, messages:).lazy.map { |response|
-  response.message.content
-}
-```
 
 ## API
 
````
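The `## Usage` snippet removed here (and moved to the top of the new README) drives the client either through a handler such as `Print` or through the lazy response enumerator. As a minimal sketch, the same call can also consume the stream with a plain block, assuming an Ollama server on `localhost:11434` with `llama3.1` pulled:

```ruby
require 'ollama'
include Ollama

ollama   = Client.new(base_url: 'http://localhost:11434')
messages = Message.new(role: 'user', content: 'Why is the sky blue?')

# Each streamed response chunk is yielded to the block, just like the
# objects the Print handler consumes; print the partial message content.
ollama.chat(model: 'llama3.1', stream: true, messages:) do |response|
  print response.message&.content
end
```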
````diff
@@ -463,11 +310,166 @@ If `Ollama::Errors::TimeoutError` is raised, it might help to increase the
 
 For more generic errors an `Ollama::Errors::Error` is raised.
 
+## Other executables
+
+### ollama\_chat
+
+This is a chat client that can be used to connect to an ollama server and enter
+a chat conversation with an LLM. It can be called with the following arguments:
+
+```
+Usage: ollama_chat [OPTIONS]
+
+  -f CONFIG      config file to read
+  -u URL         the ollama base url, OLLAMA_URL
+  -m MODEL       the ollama model to chat with, OLLAMA_CHAT_MODEL
+  -s SYSTEM      the system prompt to use as a file, OLLAMA_CHAT_SYSTEM
+  -c CHAT        a saved chat conversation to load
+  -C COLLECTION  name of the collection used in this conversation
+  -D DOCUMENT    load document and add to embeddings collection (multiple)
+  -M             use (empty) MemoryCache for this chat session
+  -E             disable embeddings for this chat session
+  -V             display the current version number and quit
+  -h             this help
+```
+
+The base URL can either be set via the environment variable `OLLAMA_URL` or is
+derived from the environment variable `OLLAMA_HOST`. The default model to
+connect to can be configured via the environment variable `OLLAMA_MODEL`.
+
+The YAML config file in `$XDG_CONFIG_HOME/ollama_chat/config.yml` can be used
+for more complex settings; it looks like this:
+
+```
+---
+url: <%= ENV['OLLAMA_URL'] || 'http://%s' % ENV.fetch('OLLAMA_HOST') %>
+model:
+  name: <%= ENV.fetch('OLLAMA_CHAT_MODEL', 'llama3.1') %>
+  options:
+    num_ctx: 8192
+system: <%= ENV.fetch('OLLAMA_CHAT_SYSTEM', 'null') %>
+voice: Samantha
+markdown: true
+embedding:
+  enabled: true
+  model:
+    name: mxbai-embed-large
+    options: {}
+  collection: <%= ENV.fetch('OLLAMA_CHAT_COLLECTION', 'ollama_chat') %>
+  found_texts_size: 4096
+  splitter:
+    name: RecursiveCharacter
+    chunk_size: 1024
+cache: Ollama::Documents::RedisCache
+redis:
+  url: <%= ENV.fetch('REDIS_URL', 'null') %>
+debug: <%= ENV['OLLAMA_CHAT_DEBUG'].to_i == 1 ? true : false %>
+```
+
+If you want to store embeddings persistently, set an environment variable
+`REDIS_URL` or update the `redis.url` setting in your `config.yml` file to
+connect to a Redis server. Without this setup, embeddings will only be stored
+in process memory, which is less durable.
+
+Some settings can be passed as arguments as well, e.g. if you want to choose a
+specific system prompt:
+
+```
+$ ollama_chat -s sherlock.txt
+Model with architecture llama found.
+Connecting to llama3.1@http://ollama.local.net:11434 now…
+Configured system prompt is:
+You are Sherlock Holmes and the user is your new client, Dr. Watson is also in
+the room. You will talk and act in the typical manner of Sherlock Holmes do and
+try to solve the user's case using logic and deduction.
+
+Type /help to display the chat help.
+📨 user:
+Good morning.
+📨 assistant:
+Ah, good morning, my dear fellow! It is a pleasure to make your acquaintance. I
+am Sherlock Holmes, the renowned detective, and this is my trusty sidekick, Dr.
+Watson. Please, have a seat and tell us about the nature of your visit. What
+seems to be the problem that has brought you to our humble abode at 221B Baker
+Street?
+
+(Watson nods in encouragement as he takes notes)
+
+Now, pray tell, what is it that puzzles you, my dear client? A missing item,
+perhaps? Or a mysterious occurrence that requires clarification? The game, as
+they say, is afoot!
+```
+
+This example shows how an image like this can be sent to a vision model for
+analysis:
+
+![cat](spec/assets/kitten.jpg)
+
+```
+$ ollama_chat -m llava-llama3
+Model with architecture llama found.
+Connecting to llava-llama3@http://localhost:11434 now…
+Type /help to display the chat help.
+📸 user> What's on this image? ./spec/assets/kitten.jpg
+📨 assistant:
+The image captures a moment of tranquility featuring a young cat. The cat,
+adorned with gray and white fur marked by black stripes on its face and legs,
+is the central figure in this scene. Its eyes, a striking shade of blue, are
+wide open and directed towards the camera, giving an impression of curiosity or
+alertness.
+
+The cat is comfortably nestled on a red blanket, which contrasts vividly with
+its fur. The blanket, soft and inviting, provides a sense of warmth to the
+image. In the background, partially obscured by the cat's head, is another
+blanket of similar red hue. The repetition of the color adds a sense of harmony
+to the composition.
+
+The cat's position on the right side of the photo creates an interesting
+asymmetry with the camera lens, which occupies the left side of the frame. This
+visual balance enhances the overall composition of the image.
+
+There are no discernible texts or other objects in the image. The focus is
+solely on the cat and its immediate surroundings. The image does not provide
+any information about the location or setting beyond what has been described.
+The simplicity of the scene allows the viewer to concentrate on the main
+subject - the young, blue-eyed cat.
+```
+
+The following commands can be given inside the chat, if prefixed by a `/`:
+
+```
+/copy                      to copy last response to clipboard
+/paste                     to paste content
+/markdown                  toggle markdown output
+/stream                    toggle stream output
+/location                  toggle location submission
+/voice( change)            toggle voice output or change the voice
+/list [n]                  list the last n / all conversation exchanges
+/clear                     clear the whole conversation
+/clobber                   clear the conversation and collection
+/pop [n]                   pop the last n exchanges, defaults to 1
+/model                     change the model
+/system                    change system prompt (clears conversation)
+/regenerate                the last answer message
+/collection( clear|change) change (default) collection or clear
+/info                      show information for current session
+/document_policy           pick a scan policy for document references
+/import source             import the source's content
+/summarize [n] source      summarize the source's content in n words
+/embedding                 toggle embedding paused or not
+/embed source              embed the source's content
+/web [n] query             query web search & return n or 1 results
+/save filename             store conversation messages
+/load filename             load conversation messages
+/quit                      to quit
+/help                      to view this help
+```
+
 ## Download
 
 The homepage of this library is located at
 
-* https://github.com/flori/ollama
+* https://github.com/flori/ollama-ruby
 
 ## Author
 
````
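The config file shown in the re-added `### ollama_chat` section is an ERB template that is evaluated before the YAML is parsed, which is how the `<%= ENV.fetch(...) %>` tags pick up environment variables. A minimal stdlib sketch of that mechanism (the chat client wires this into its own config handling, so this is illustrative only):

```ruby
require 'erb'
require 'yaml'

# Locate the config file as described in the README.
path = File.join(ENV.fetch('XDG_CONFIG_HOME', File.expand_path('~/.config')),
                 'ollama_chat', 'config.yml')

rendered = ERB.new(File.read(path)).result # evaluate the <%= ... %> tags first
config   = YAML.safe_load(rendered)        # then parse the resulting YAML
config.dig('model', 'name')                # => "llama3.1" unless OLLAMA_CHAT_MODEL overrides it
```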
data/bin/ollama_chat
CHANGED
```diff
@@ -49,9 +49,10 @@ class OllamaChatConfig
 voice:
   enabled: false
   default: Samantha
-  list: <%= `say -v
+  list: <%= `say -v ? 2>/dev/null`.lines.map { _1[/^(.+?)\s+[a-z]{2}_[a-zA-Z0-9]{2,}/, 1] }.uniq.sort.to_s.force_encoding('ASCII-8BIT') %>
 markdown: true
 stream: true
+document_policy: importing
 embedding:
   enabled: true
   model:
@@ -59,6 +60,7 @@ class OllamaChatConfig
     options: {}
     # Retrieval prompt template:
     prompt: 'Represent this sentence for searching relevant passages: %s'
+  batch_size: 10
   collection: <%= ENV['OLLAMA_CHAT_COLLECTION'] %>
   found_texts_size: 4096
   found_texts_count: null
```
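The rewritten `voice.list` template shells out to macOS's `say -v ?` and keeps the voice name that precedes each locale tag. The parsing step in isolation, run against a canned sample of `say -v ?` output (the two sample lines are made up; the regex is verbatim from the hunk above):

```ruby
# Two sample lines in the format `say -v ?` prints on macOS.
sample = <<~LINES
  Alex                en_US    # Most people recognize me by my voice.
  Samantha            en_US    # Hello, my name is Samantha.
LINES

# Capture everything before the whitespace run that precedes a locale
# tag such as en_US, then deduplicate and sort.
names = sample.lines.map { _1[/^(.+?)\s+[a-z]{2}_[a-zA-Z0-9]{2,}/, 1] }.uniq.sort
names # => ["Alex", "Samantha"]
```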
```diff
@@ -478,54 +480,6 @@ def parse_source(source_io)
   end
 end
 
-def embed_source(source_io, source)
-  $embedding.on? or return parse_source(source_io)
-  puts "Embedding #{italic { source_io&.content_type }} document #{source.to_s.inspect}."
-  text = parse_source(source_io) or return
-  text.downcase!
-  splitter_config = $config.embedding.splitter
-  inputs = nil
-  case splitter_config.name
-  when 'Character'
-    splitter = Ollama::Documents::Splitters::Character.new(
-      chunk_size: splitter_config.chunk_size,
-    )
-    inputs = splitter.split(text)
-  when 'RecursiveCharacter'
-    splitter = Ollama::Documents::Splitters::RecursiveCharacter.new(
-      chunk_size: splitter_config.chunk_size,
-    )
-    inputs = splitter.split(text)
-  when 'Semantic'
-    splitter = Ollama::Documents::Splitters::Semantic.new(
-      ollama:, model: $config.embedding.model.name,
-      chunk_size: splitter_config.chunk_size,
-    )
-    inputs = splitter.split(
-      text,
-      breakpoint: splitter_config.breakpoint.to_sym,
-      percentage: splitter_config.percentage?,
-      percentile: splitter_config.percentile?,
-    )
-    inputs = splitter.split(text)
-  end
-  inputs or return
-  source = source.to_s
-  if source.start_with?(?!)
-    source = Ollama::Utils::Width.truncate(
-      source[1..-1].gsub(/\W+/, ?_),
-      length: 10
-    )
-  end
-  $documents.add(inputs, source:)
-end
-
-def add_image(images, source_io, source)
-  STDERR.puts "Adding #{source_io&.content_type} image #{source.to_s.inspect}."
-  image = Image.for_io(source_io, path: source.to_s)
-  (images << image).uniq!
-end
-
 def http_options(url)
   options = {}
   if ssl_no_verify = $config.ssl_no_verify?
```
```diff
@@ -554,7 +508,7 @@ def fetch_source(source, &block)
     ) do |tmp|
       block.(tmp)
     end
-  when %r(\Afile://(
+  when %r(\Afile://(/\S*)|\A((?:\.\.|[~.]?)/\S*))
     filename = $~.captures.compact.first
     filename = File.expand_path(filename)
     Utils::Fetcher.read(filename) do |tmp|
```
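The widened `when` pattern accepts `file://` URLs as well as absolute, relative, `..`- and `~`-prefixed paths, and the following `$~.captures.compact.first` picks whichever group matched. A quick check with the regex verbatim from the hunk above (the sample sources are hypothetical):

```ruby
FILE_SOURCE = %r(\Afile://(/\S*)|\A((?:\.\.|[~.]?)/\S*))

%w[ file:///tmp/doc.md /tmp/doc.md ./doc.md ../doc.md ~/doc.md https://example.com ].each do |source|
  if FILE_SOURCE.match(source)
    puts "#{source} => #{$~.captures.compact.first}"
  else
    puts "#{source} => no match, handled by another branch"
  end
end
# Only the https URL falls through; every path form yields a filename capture.
```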
```diff
@@ -567,30 +521,90 @@ rescue => e
   STDERR.puts "Cannot fetch source #{source.to_s.inspect}: #{e}\n#{e.backtrace * ?\n}"
 end
 
+def add_image(images, source_io, source)
+  STDERR.puts "Adding #{source_io&.content_type} image #{source.to_s.inspect}."
+  image = Image.for_io(source_io, path: source.to_s)
+  (images << image).uniq!
+end
+
+def import_source(source_io, source)
+  source = source.to_s
+  puts "Importing #{italic { source_io&.content_type }} document #{source.inspect} now."
+  "Imported #{source.inspect}:\n%s\n\n" % parse_source(source_io)
+end
+
 def import(source)
-  puts "Now importing #{source.to_s.inspect}."
   fetch_source(source) do |source_io|
-    content =
-    content.present? or return
+    content = import_source(source_io, source) or return
     source_io.rewind
     content
   end
 end
 
-def
+def summarize_source(source_io, source, words: nil)
+  puts "Summarizing #{italic { source_io&.content_type }} document #{source.inspect} now."
   words = words.to_i
   words < 1 and words = 100
-
-  source_content
-  fetch_source(source) do |source_io|
-    content = parse_source(source_io)
-    content.present? or return
-    source_io.rewind
-    content
-  end
+  source_content = parse_source(source_io)
+  source_content.present? or return
   $config.prompts.summarize % { source_content:, words: }
 end
 
+def summarize(source, words: nil)
+  fetch_source(source) do |source_io|
+    content = summarize_source(source_io, source, words:) or return
+    source_io.rewind
+    content
+  end
+end
+
+def embed_source(source_io, source, count: nil)
+  $embedding.on? or return parse_source(source_io)
+  m = "Embedding #{italic { source_io&.content_type }} document #{source.to_s.inspect}."
+  if count
+    puts '%u. %s' % [ count, m ]
+  else
+    puts m
+  end
+  text = parse_source(source_io) or return
+  text.downcase!
+  splitter_config = $config.embedding.splitter
+  inputs = nil
+  case splitter_config.name
+  when 'Character'
+    splitter = Ollama::Documents::Splitters::Character.new(
+      chunk_size: splitter_config.chunk_size,
+    )
+    inputs = splitter.split(text)
+  when 'RecursiveCharacter'
+    splitter = Ollama::Documents::Splitters::RecursiveCharacter.new(
+      chunk_size: splitter_config.chunk_size,
+    )
+    inputs = splitter.split(text)
+  when 'Semantic'
+    splitter = Ollama::Documents::Splitters::Semantic.new(
+      ollama:, model: $config.embedding.model.name,
+      chunk_size: splitter_config.chunk_size,
+    )
+    inputs = splitter.split(
+      text,
+      breakpoint: splitter_config.breakpoint.to_sym,
+      percentage: splitter_config.percentage?,
+      percentile: splitter_config.percentile?,
+    )
+    inputs = splitter.split(text)
+  end
+  inputs or return
+  source = source.to_s
+  if source.start_with?(?!)
+    source = Ollama::Utils::Width.truncate(
+      source[1..-1].gsub(/\W+/, ?_),
+      length: 10
+    )
+  end
+  $documents.add(inputs, source:, batch_size: $config.embedding.batch_size?)
+end
+
 def embed(source)
   if $embedding.on?
     puts "Now embedding #{source.to_s.inspect}."
```
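The relocated `embed_source` still funnels the parsed text through one of the three splitters before handing the chunks to `$documents.add`, now with the configurable `batch_size`. As a sketch, the chunking step on its own, using the `RecursiveCharacter` splitter and the `chunk_size` from the default config (`corpus.txt` is a hypothetical input file):

```ruby
require 'ollama'

# Split a document into chunks of roughly 1024 characters, as the
# default embedding.splitter configuration requests.
splitter = Ollama::Documents::Splitters::RecursiveCharacter.new(chunk_size: 1024)
inputs   = splitter.split(File.read('corpus.txt'))

# Each chunk then becomes one embedding input, e.g.:
# $documents.add(inputs, source: 'corpus.txt', batch_size: 10)
```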
```diff
@@ -612,7 +626,8 @@ def parse_content(content, images)
   images.clear
   tags = Utils::Tags.new
 
-
+  contents = [ content ]
+  content.scan(%r((?:\.\.|[.~])?/\S+|https?://\S+|#\S+)).each do |source|
     case source
     when /\A#(\S+)/
       tags.add($1, source:)
```
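This `content.scan` pass is what finds document references inside a user message: relative, absolute, `..`- and `~`-prefixed paths, `http(s)` URLs, and `#tags`. The regex by itself, verbatim from the hunk above, applied to a made-up message:

```ruby
SOURCES = %r((?:\.\.|[.~])?/\S+|https?://\S+|#\S+)

"please summarize ./notes.md and https://example.com/page #research".scan(SOURCES)
# => ["./notes.md", "https://example.com/page", "#research"]
```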
```diff
@@ -622,8 +637,15 @@ def parse_content(content, images)
       case source_io&.content_type&.media_type
       when 'image'
         add_image(images, source_io, source)
-      when 'text', 'application'
-
+      when 'text', 'application', nil
+        case $document_policy
+        when 'importing'
+          contents << import_source(source_io, source)
+        when 'embedding'
+          embed_source(source_io, source)
+        when 'summarizing'
+          contents << summarize_source(source_io, source)
+        end
       else
         STDERR.puts(
           "Cannot fetch #{source.to_s.inspect} with content type "\
```
```diff
@@ -633,8 +655,8 @@ def parse_content(content, images)
       end
     end
   end
-
-  return
+  new_content = contents.select(&:present?).compact * "\n\n"
+  return new_content, (tags unless tags.empty?)
 end
 
 def choose_model(cli_model, current_model)
```
```diff
@@ -668,16 +690,44 @@ def choose_collection(current_collection)
   end
 ensure
   puts "Using collection #{bold{$documents.collection}}."
-
+  info
+end
+
+def choose_document_policy
+  policies = %w[ importing embedding summarizing ].sort
+  current  = if policies.index($document_policy)
+               $document_policy
+             elsif policies.index($config.document_policy)
+               $config.document_policy
+             else
+               policies.first
+             end
+  policies.unshift('[EXIT]')
+  policy = Ollama::Utils::Chooser.choose(policies)
+  case policy
+  when nil, '[EXIT]'
+    puts "Exiting chooser."
+    policy = current
+  end
+  $document_policy = policy
+ensure
+  puts "Using document policy #{bold{$document_policy}}."
+  info
 end
 
 def collection_stats
+  list = $documents.collections.sort.map { |c|
+    '  ' + ($documents.collection == c ? bold { c } : c).to_s
+  }.join(?\n)
   puts <<~EOT
-    Collection
+    Current Collection
       Name: #{bold{$documents.collection}}
       Embedding model: #{bold{$embedding_model}}
       #Embeddings: #{$documents.size}
+      #Tags: #{$documents.tags.size}
       Tags: #{$documents.tags}
+    List:
+    #{list}
   EOT
 end
 
```
```diff
@@ -744,6 +794,7 @@ def info
   $markdown.show
   $stream.show
   $location.show
+  puts "Document policy for references in user text: #{bold{$document_policy}}"
   if $voice.on?
     puts "Using voice #{bold{$current_voice}} to speak."
   end
```
```diff
@@ -787,6 +838,7 @@ def display_chat_help
   /regenerate                the last answer message
   /collection( clear|change) change (default) collection or clear
   /info                      show information for current session
+  /document_policy           pick a scan policy for document references
   /import source             import the source's content
   /summarize [n] source      summarize the source's content in n words
   /embedding                 toggle embedding paused or not
```
```diff
@@ -841,10 +893,11 @@ $opts[?V] and version
 base_url = $opts[?u] || $config.url
 $ollama = Client.new(base_url:, debug: $config.debug)
 
-$
-
-
-
+$document_policy = $config.document_policy
+$model = choose_model($opts[?m], $config.model.name)
+options = Options[$config.model.options]
+model_system = pull_model_unless_present($model, options)
+messages = []
 $embedding_enabled.set($config.embedding.enabled && !$opts[?E])
 
 if $opts[?c]
```
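In the reworked startup sequence above, `Options[$config.model.options]` wraps the configured model options hash for the later API calls. A small hedged sketch with the `num_ctx` value from the README's example config:

```ruby
require 'ollama'
include Ollama

# Wrap the model options, as the startup code does with the values
# read from the config file (num_ctx: 8192 in the README example).
options = Options[num_ctx: 8192]
```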
```diff
@@ -889,11 +942,13 @@ if $embedding.on?
     end
   end
   puts "Collection #{bold{collection}}: Adding #{document_list.size} documents…"
+  count = 1
   document_list.each_slice(25) do |docs|
     docs.each do |doc|
       fetch_source(doc) do |doc_io|
-        embed_source(doc_io, doc)
+        embed_source(doc_io, doc, count:)
       end
+      count += 1
     end
   end
 end
```
```diff
@@ -955,7 +1010,7 @@ loop do
     puts "Cleared messages."
     next
   when %r(^/clobber$)
-    if ask?(prompt: 'Are you sure? (y/n) ') =~ /\Ay/i
+    if ask?(prompt: 'Are you sure to clear messages and collection? (y/n) ') =~ /\Ay/i
       clear_messages(messages)
       $documents.clear
       puts "Cleared messages and collection #{bold{$documents.collection}}."
```
```diff
@@ -1020,9 +1075,12 @@ loop do
       choose_collection($documents.collection)
     end
     next
-  when %r(
+  when %r(^/info$)
     info
     next
+  when %r(^/document_policy$)
+    choose_document_policy
+    next
   when %r(^/import\s+(.+))
     parse_content = false
     content = import($1) or next
```