langchainrb 0.7.2 → 0.7.5
- checksums.yaml +4 -4
- data/CHANGELOG.md +10 -0
- data/README.md +158 -320
- data/lib/langchain/agent/agents.md +54 -0
- data/lib/langchain/llm/aws_bedrock.rb +216 -0
- data/lib/langchain/llm/openai.rb +26 -9
- data/lib/langchain/llm/response/aws_titan_response.rb +17 -0
- data/lib/langchain/llm/response/base_response.rb +3 -0
- data/lib/langchain/utils/token_length/ai21_validator.rb +1 -0
- data/lib/langchain/utils/token_length/base_validator.rb +6 -2
- data/lib/langchain/utils/token_length/cohere_validator.rb +1 -0
- data/lib/langchain/utils/token_length/google_palm_validator.rb +1 -0
- data/lib/langchain/utils/token_length/openai_validator.rb +20 -0
- data/lib/langchain/vectorsearch/chroma.rb +3 -1
- data/lib/langchain/vectorsearch/milvus.rb +3 -1
- data/lib/langchain/vectorsearch/pgvector.rb +3 -1
- data/lib/langchain/vectorsearch/pinecone.rb +3 -1
- data/lib/langchain/vectorsearch/qdrant.rb +3 -1
- data/lib/langchain/vectorsearch/weaviate.rb +4 -2
- data/lib/langchain/version.rb +1 -1
- metadata +19 -5
- data/lib/langchain/evals/ragas/critique.rb +0 -62
- data/lib/langchain/evals/ragas/prompts/critique.yml +0 -18
- data/lib/langchain/loader_chunkers/html.rb +0 -27
checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz:
-  data.tar.gz:
+  metadata.gz: f4c388275b83a0e4260f4ae9271f4c164a8d34ea5ea9585916d91e7e9c17c980
+  data.tar.gz: 8daa400de3ed80bb3fb9c53cc19ef4d56f137c2aa157bd268dbda488d0fca432
 SHA512:
-  metadata.gz:
-  data.tar.gz:
+  metadata.gz: 4bae87c050be6a8fa011c1ae5de4b119abac498669f2e63ca1829e11b7b5ecca7610330be670d24fd6cb98c2e2599c593e9922378985efc586d76c124efb865e
+  data.tar.gz: 2a39b084c6a239aeb0de22bfc87629d2f2909b23eabfcf71a835a1f1624d84afe3ea106afdafb8f1fb301b7934d73abc7253c9b8bd3f6c9b170231ebb5af0936
data/CHANGELOG.md CHANGED

@@ -1,5 +1,15 @@
 ## [Unreleased]
 
+## [0.7.5] - 2023-11-13
+- Fixes
+
+## [0.7.4] - 2023-11-10
+- AWS Bedrock is available as an LLM provider. Available models from AI21, Cohere, AWS, and Anthropic.
+
+## [0.7.3] - 2023-11-08
+- LLM response passes through the context in RAG cases
+- Fix gpt-4 token length validation
+
 ## [0.7.2] - 2023-11-02
 - Azure OpenAI LLM support
 
data/README.md CHANGED

@@ -1,6 +1,6 @@
 💎🔗 Langchain.rb
 ---
-⚡ Building applications
+⚡ Building LLM-powered applications in Ruby ⚡
 
 For deep Rails integration see: [langchainrb_rails](https://github.com/andreibondarev/langchainrb_rails) gem.
 
@@ -11,21 +11,24 @@ Available for paid consulting engagements! [Email me](mailto:andrei@sourcelabs.i
 [![Docs](http://img.shields.io/badge/yard-docs-blue.svg)](http://rubydoc.info/gems/langchainrb)
 [![License](https://img.shields.io/badge/license-MIT-green.svg)](https://github.com/andreibondarev/langchainrb/blob/main/LICENSE.txt)
 [![](https://dcbadge.vercel.app/api/server/WDARp7J2n8?compact=true&style=flat)](https://discord.gg/WDARp7J2n8)
+[![X](https://img.shields.io/twitter/url/https/twitter.com/cloudposse.svg?style=social&label=Follow%20%40rushing_andrei)](https://twitter.com/rushing_andrei)
 
-
+## Use Cases
+* Retrieval Augmented Generation (RAG) and vector search
+* Chat bots
+* [AI agents](https://github.com/andreibondarev/langchainrb/tree/main/lib/langchain/agent/agents.md)
 
-##
+## Table of Contents
 
 - [Installation](#installation)
 - [Usage](#usage)
-- [
-- [
-- [
-- [
-- [
-- [Loaders](#loaders-)
-- [Examples](#examples)
+- [Large Language Models (LLMs)](#large-language-models-llms)
+- [Prompt Management](#prompt-management)
+- [Output Parsers](#output-parsers)
+- [Building RAG](#building-retrieval-augment-generation-rag-system)
+- [Building chat bots](#building-chat-bots)
 - [Evaluations](#evaluations-evals)
+- [Examples](#examples)
 - [Logging](#logging)
 - [Development](#development)
 - [Discord](#discord)
@@ -46,264 +49,66 @@ If bundler is not being used to manage dependencies, install the gem by executin
 require "langchain"
 ```
 
-
-
-| Database | Querying | Storage | Schema Management | Backups | Rails Integration |
-| -------- |:------------------:| -------:| -----------------:| -------:| -----------------:|
-| [Chroma](https://trychroma.com/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | WIP | :white_check_mark: |
-| [Hnswlib](https://github.com/nmslib/hnswlib/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | WIP | WIP |
-| [Milvus](https://milvus.io/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | WIP | :white_check_mark: |
-| [Pinecone](https://www.pinecone.io/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | WIP | :white_check_mark: |
-| [Pgvector](https://github.com/pgvector/pgvector) | :white_check_mark: | :white_check_mark: | :white_check_mark: | WIP | :white_check_mark: |
-| [Qdrant](https://qdrant.tech/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | WIP | :white_check_mark: |
-| [Weaviate](https://weaviate.io/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | WIP | :white_check_mark: |
-
-### Using Vector Search Databases 🔍
-
-Choose the LLM provider you'll be using (OpenAI or Cohere) and retrieve the API key.
-
-Add `gem "weaviate-ruby", "~> 0.8.3"` to your Gemfile.
-
-Pick the vector search database you'll be using and instantiate the client:
-```ruby
-client = Langchain::Vectorsearch::Weaviate.new(
-  url: ENV["WEAVIATE_URL"],
-  api_key: ENV["WEAVIATE_API_KEY"],
-  index_name: "",
-  llm: Langchain::LLM::OpenAI.new(api_key: ENV["OPENAI_API_KEY"])
-)
-
-# You can instantiate any other supported vector search database:
-client = Langchain::Vectorsearch::Chroma.new(...) # `gem "chroma-db", "~> 0.6.0"`
-client = Langchain::Vectorsearch::Hnswlib.new(...) # `gem "hnswlib", "~> 0.8.1"`
-client = Langchain::Vectorsearch::Milvus.new(...) # `gem "milvus", "~> 0.9.2"`
-client = Langchain::Vectorsearch::Pinecone.new(...) # `gem "pinecone", "~> 0.1.6"`
-client = Langchain::Vectorsearch::Pgvector.new(...) # `gem "pgvector", "~> 0.2"`
-client = Langchain::Vectorsearch::Qdrant.new(...) # `gem "qdrant-ruby", "~> 0.9.3"`
-```
-
-```ruby
-# Creating the default schema
-client.create_default_schema
-```
-
-```ruby
-# Store plain texts in your vector search database
-client.add_texts(
-  texts: [
-    "Begin by preheating your oven to 375°F (190°C). Prepare four boneless, skinless chicken breasts by cutting a pocket into the side of each breast, being careful not to cut all the way through. Season the chicken with salt and pepper to taste. In a large skillet, melt 2 tablespoons of unsalted butter over medium heat. Add 1 small diced onion and 2 minced garlic cloves, and cook until softened, about 3-4 minutes. Add 8 ounces of fresh spinach and cook until wilted, about 3 minutes. Remove the skillet from heat and let the mixture cool slightly.",
-    "In a bowl, combine the spinach mixture with 4 ounces of softened cream cheese, 1/4 cup of grated Parmesan cheese, 1/4 cup of shredded mozzarella cheese, and 1/4 teaspoon of red pepper flakes. Mix until well combined. Stuff each chicken breast pocket with an equal amount of the spinach mixture. Seal the pocket with a toothpick if necessary. In the same skillet, heat 1 tablespoon of olive oil over medium-high heat. Add the stuffed chicken breasts and sear on each side for 3-4 minutes, or until golden brown."
-  ]
-)
-```
-```ruby
-# Store the contents of your files in your vector search database
-my_pdf = Langchain.root.join("path/to/my.pdf")
-my_text = Langchain.root.join("path/to/my.txt")
-my_docx = Langchain.root.join("path/to/my.docx")
-
-client.add_data(paths: [my_pdf, my_text, my_docx])
-```
-```ruby
-# Retrieve similar documents based on the query string passed in
-client.similarity_search(
-  query:,
-  k: # number of results to be retrieved
-)
-```
-```ruby
-# Retrieve similar documents based on the query string passed in via the [HyDE technique](https://arxiv.org/abs/2212.10496)
-client.similarity_search_with_hyde()
-```
-```ruby
-# Retrieve similar documents based on the embedding passed in
-client.similarity_search_by_vector(
-  embedding:,
-  k: # number of results to be retrieved
-)
-```
-```ruby
-# Q&A-style querying based on the question passed in
-client.ask(
-  question:
-)
-```
-
-## Integrating Vector Search into ActiveRecord models
-```ruby
-class Product < ActiveRecord::Base
-  vectorsearch provider: Langchain::Vectorsearch::Qdrant.new(
-    api_key: ENV["QDRANT_API_KEY"],
-    url: ENV["QDRANT_URL"],
-    index_name: "Products",
-    llm: Langchain::LLM::GooglePalm.new(api_key: ENV["GOOGLE_PALM_API_KEY"])
-  )
+## Large Language Models (LLMs)
+Langchain.rb wraps all supported LLMs in a unified interface, allowing you to easily swap out and test different models.
 
-
-
-
+#### Supported LLMs and features:
+| LLM providers | embed() | complete() | chat() | summarize() | Notes |
+| -------- |:------------------:| :-------: | :-----------------: | :-------: | :----------------- |
+| [OpenAI](https://openai.com/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | ❌ | Including Azure OpenAI |
+| [AI21](https://ai21.com/) | ❌ | :white_check_mark: | ❌ | :white_check_mark: | |
+| [Anthropic](https://www.anthropic.com/) | ❌ | :white_check_mark: | ❌ | ❌ | |
+| [AWS Bedrock](https://aws.amazon.com/bedrock) | :white_check_mark: | :white_check_mark: | ❌ | ❌ | Provides AWS, Cohere, AI21, Anthropic and Stability AI models |
+| [Cohere](https://cohere.com/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: | |
+| [GooglePalm](https://ai.google/discover/palm2/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: | |
+| [HuggingFace](https://huggingface.co/) | :white_check_mark: | ❌ | ❌ | ❌ | |
+| [Ollama](https://ollama.ai/) | :white_check_mark: | :white_check_mark: | ❌ | ❌ | |
+| [Replicate](https://replicate.com/) | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: | |
 
-
-```ruby
-# Retrieve similar products based on the query string passed in
-Product.similarity_search(
-  query:,
-  k: # number of results to be retrieved
-)
-```
-```ruby
-# Q&A-style querying based on the question passed in
-Product.ask(
-  question:
-)
-```
-
-Additional info [here](https://github.com/andreibondarev/langchainrb/blob/main/lib/langchain/active_record/hooks.rb#L10-L38).
-
-### Using Standalone LLMs 🗣️
-
-Add `gem "ruby-openai", "~> 4.0.0"` to your Gemfile.
+#### Using standalone LLMs:
 
 #### OpenAI
-```ruby
-openai = Langchain::LLM::OpenAI.new(api_key: ENV["OPENAI_API_KEY"])
-```
-You can pass additional parameters to the constructor, it will be passed to the OpenAI client:
-```ruby
-openai = Langchain::LLM::OpenAI.new(api_key: ENV["OPENAI_API_KEY"], llm_options: {uri_base: "http://localhost:1234"}) )
-```
-```ruby
-openai.embed(text: "foo bar")
-```
-```ruby
-openai.complete(prompt: "What is the meaning of life?")
-```
-
-##### Open AI Function calls support
-
-Conversation support
-
-```ruby
-chat = Langchain::Conversation.new(llm: openai)
-```
-```ruby
-chat.set_context("You are the climate bot")
-chat.set_functions(functions)
-```
 
-qdrant:
-
-```ruby
-client.llm.functions = functions
-```
-
-#### Azure
 Add `gem "ruby-openai", "~> 5.2.0"` to your Gemfile.
 
 ```ruby
-
-  api_key: ENV["AZURE_API_KEY"],
-  llm_options: {
-    api_type: :azure,
-    api_version: "2023-03-15-preview"
-  },
-  embedding_deployment_url: ENV.fetch("AZURE_EMBEDDING_URI"),
-  chat_deployment_url: ENV.fetch("AZURE_CHAT_URI")
-)
-```
-where `AZURE_EMBEDDING_URI` is e.g. `https://custom-domain.openai.azure.com/openai/deployments/gpt-35-turbo` and `AZURE_CHAT_URI` is e.g. `https://custom-domain.openai.azure.com/openai/deployments/ada-2`
-
-You can pass additional parameters to the constructor, it will be passed to the Azure client:
-```ruby
-azure = Langchain::LLM::Azure.new(
-  api_key: ENV["AZURE_API_KEY"],
-  llm_options: {
-    api_type: :azure,
-    api_version: "2023-03-15-preview",
-    request_timeout: 240 # Optional
-  },
-  embedding_deployment_url: ENV.fetch("AZURE_EMBEDDING_URI"),
-  chat_deployment_url: ENV.fetch("AZURE_CHAT_URI")
-)
-```
-```ruby
-azure.embed(text: "foo bar")
-```
-```ruby
-azure.complete(prompt: "What is the meaning of life?")
-```
-
-#### Cohere
-Add `gem "cohere-ruby", "~> 0.9.6"` to your Gemfile.
-
-```ruby
-cohere = Langchain::LLM::Cohere.new(api_key: ENV["COHERE_API_KEY"])
-```
-```ruby
-cohere.embed(text: "foo bar")
-```
-```ruby
-cohere.complete(prompt: "What is the meaning of life?")
-```
-
-#### HuggingFace
-Add `gem "hugging-face", "~> 0.3.2"` to your Gemfile.
-```ruby
-hugging_face = Langchain::LLM::HuggingFace.new(api_key: ENV["HUGGING_FACE_API_KEY"])
-```
-
-#### Replicate
-Add `gem "replicate-ruby", "~> 0.2.2"` to your Gemfile.
-```ruby
-replicate = Langchain::LLM::Replicate.new(api_key: ENV["REPLICATE_API_KEY"])
+llm = Langchain::LLM::OpenAI.new(api_key: ENV["OPENAI_API_KEY"])
 ```
-
-#### Google PaLM (Pathways Language Model)
-Add `"google_palm_api", "~> 0.1.3"` to your Gemfile.
+You can pass additional parameters to the constructor; they will be passed on to the OpenAI client:
 ```ruby
-
+llm = Langchain::LLM::OpenAI.new(api_key: ENV["OPENAI_API_KEY"], llm_options: { ... })
 ```
 
-
-Add `gem "ai21", "~> 0.2.1"` to your Gemfile.
+Generate vector embeddings:
 ```ruby
-
+llm.embed(text: "foo bar")
 ```
 
-
-Add `gem "anthropic", "~> 0.1.0"` to your Gemfile.
+Generate a text completion:
 ```ruby
-
+llm.complete(prompt: "What is the meaning of life?")
 ```
 
+Generate a chat completion:
 ```ruby
-
+llm.chat(prompt: "Hey! How are you?")
 ```
 
-
+Summarize the text:
 ```ruby
-
+llm.summarize(text: "...")
 ```
 
+You can use any other LLM by invoking the same interface:
 ```ruby
-
-```
-```ruby
-ollama.embed(text: "Hello world!")
+llm = Langchain::LLM::GooglePalm.new(...)
 ```
 
-###
+### Prompt Management
 
 #### Prompt Templates
 
-Create a prompt with
-
-```ruby
-prompt = Langchain::Prompt::PromptTemplate.new(template: "Tell me a {adjective} joke.", input_variables: ["adjective"])
-prompt.format(adjective: "funny") # "Tell me a funny joke."
-```
-
-Create a prompt with multiple input variables:
+Create a prompt with input variables:
 
 ```ruby
 prompt = Langchain::Prompt::PromptTemplate.new(template: "Tell me a {adjective} joke about {content}.", input_variables: ["adjective", "content"])
|
|
384
189
|
prompt.input_variables #=> ["adjective", "content"]
|
385
190
|
```
|
386
191
|
|
387
|
-
|
192
|
+
|
193
|
+
### Output Parsers
|
388
194
|
|
389
195
|
Parse LLM text responses into structured output, such as JSON.
|
390
196
|
|
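The core idea behind the output parsers introduced above — turning an LLM's free-text response into structured data — can be sketched in plain Ruby (a toy stand-in, not the gem's schema-driven `OutputParsers` implementation):

```ruby
require "json"

# Toy output parser: pull the first {...} block out of an LLM-style
# response and parse it as JSON. Illustrative only; real parsers also
# validate against a schema and can ask the LLM to fix malformed output.
def parse_llm_json(response)
  blob = response[/\{.*\}/m] or raise ArgumentError, "no JSON object found"
  JSON.parse(blob)
end

llm_response = 'Sure! Here is your answer: {"name": "Ada", "interests": ["math", "engines"]}'
puts parse_llm_json(llm_response)["name"]
# => Ada
```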
@@ -484,93 +290,147 @@ fix_parser.parse(llm_response)
 
 See [here](https://github.com/andreibondarev/langchainrb/tree/main/examples/create_and_manage_prompt_templates_using_structured_output_parser.rb) for a concrete example
 
-
-
+## Building Retrieval Augment Generation (RAG) system
+RAG is a methodology that helps LLMs generate accurate and up-to-date information.
+A typical RAG workflow follows the 3 steps below:
+1. Relevant knowledge (or data) is retrieved from the knowledge base (typically a vector search DB)
+2. A prompt, containing the retrieved knowledge above, is constructed.
+3. The LLM receives the prompt above to generate a text completion.
+The most common use case for a RAG system is powering Q&A systems where users pose natural language questions and receive answers in natural language.
 
-
+### Vector search databases
+Langchain.rb provides a convenient unified interface on top of supported vectorsearch databases that makes it easy to configure your index, add data, query and retrieve from it.
 
-
+#### Supported vector search databases and features:
 
-
-
-
+| Database | Open-source | Cloud offering |
+| -------- |:------------------:| :------------: |
+| [Chroma](https://trychroma.com/) | :white_check_mark: | :white_check_mark: |
+| [Hnswlib](https://github.com/nmslib/hnswlib/) | :white_check_mark: | ❌ |
+| [Milvus](https://milvus.io/) | :white_check_mark: | :white_check_mark: Zilliz Cloud |
+| [Pinecone](https://www.pinecone.io/) | ❌ | :white_check_mark: |
+| [Pgvector](https://github.com/pgvector/pgvector) | :white_check_mark: | :white_check_mark: |
+| [Qdrant](https://qdrant.tech/) | :white_check_mark: | :white_check_mark: |
+| [Weaviate](https://weaviate.io/) | :white_check_mark: | :white_check_mark: |
 
-
+### Using Vector Search Databases 🔍
 
-
-  llm: openai,
-  tools: [search_tool, calculator]
-)
-```
+Pick the vector search database you'll be using, add the gem dependency and instantiate the client:
 ```ruby
-
-#=> "Approximately 2,945 soccer fields would be needed to cover the distance between NYC and DC in a straight line."
+gem "weaviate-ruby", "~> 0.8.9"
 ```
 
-
+Choose and instantiate the LLM provider you'll be using to generate embeddings:
+```ruby
+llm = Langchain::LLM::OpenAI.new(api_key: ENV["OPENAI_API_KEY"])
+```
 
-
+```ruby
+client = Langchain::Vectorsearch::Weaviate.new(
+  url: ENV["WEAVIATE_URL"],
+  api_key: ENV["WEAVIATE_API_KEY"],
+  index_name: "Documents",
+  llm: llm
+)
+```
 
+You can instantiate any other supported vector search database:
 ```ruby
-
+client = Langchain::Vectorsearch::Chroma.new(...) # `gem "chroma-db", "~> 0.6.0"`
+client = Langchain::Vectorsearch::Hnswlib.new(...) # `gem "hnswlib", "~> 0.8.1"`
+client = Langchain::Vectorsearch::Milvus.new(...) # `gem "milvus", "~> 0.9.2"`
+client = Langchain::Vectorsearch::Pinecone.new(...) # `gem "pinecone", "~> 0.1.6"`
+client = Langchain::Vectorsearch::Pgvector.new(...) # `gem "pgvector", "~> 0.2"`
+client = Langchain::Vectorsearch::Qdrant.new(...) # `gem "qdrant-ruby", "~> 0.9.3"`
+```
 
-
+Create the default schema:
+```ruby
+client.create_default_schema
 ```
+
+Add plain text data to your vector search database:
 ```ruby
-
-
+client.add_texts(
+  texts: [
+    "Begin by preheating your oven to 375°F (190°C). Prepare four boneless, skinless chicken breasts by cutting a pocket into the side of each breast, being careful not to cut all the way through. Season the chicken with salt and pepper to taste. In a large skillet, melt 2 tablespoons of unsalted butter over medium heat. Add 1 small diced onion and 2 minced garlic cloves, and cook until softened, about 3-4 minutes. Add 8 ounces of fresh spinach and cook until wilted, about 3 minutes. Remove the skillet from heat and let the mixture cool slightly.",
+    "In a bowl, combine the spinach mixture with 4 ounces of softened cream cheese, 1/4 cup of grated Parmesan cheese, 1/4 cup of shredded mozzarella cheese, and 1/4 teaspoon of red pepper flakes. Mix until well combined. Stuff each chicken breast pocket with an equal amount of the spinach mixture. Seal the pocket with a toothpick if necessary. In the same skillet, heat 1 tablespoon of olive oil over medium-high heat. Add the stuffed chicken breasts and sear on each side for 3-4 minutes, or until golden brown."
+  ]
+)
 ```
 
-
-
+Or use the file parsers to load, parse and index data into your database:
+```ruby
+my_pdf = Langchain.root.join("path/to/my.pdf")
+my_text = Langchain.root.join("path/to/my.txt")
+my_docx = Langchain.root.join("path/to/my.docx")
 
-
+client.add_data(paths: [my_pdf, my_text, my_docx])
+```
+Supported file formats: docx, html, pdf, text, json, jsonl, csv, xlsx.
 
-
+Retrieve similar documents based on the query string passed in:
+```ruby
+client.similarity_search(
+  query:,
+  k: # number of results to be retrieved
+)
+```
 
-
-
-
-
-| "ruby_code_interpreter" | Interprets Ruby expressions | | `gem "safe_ruby", "~> 1.0.4"` |
-| "google_search" | A wrapper around Google Search | `ENV["SERPAPI_API_KEY"]` (https://serpapi.com/manage-api-key) | `gem "google_search_results", "~> 2.0.0"` |
-| "weather" | Calls Open Weather API to retrieve the current weather | `ENV["OPEN_WEATHER_API_KEY"]` (https://home.openweathermap.org/api_keys) | `gem "open-weather-ruby-client", "~> 0.3.0"` |
-| "wikipedia" | Calls Wikipedia API to retrieve the summary | | `gem "wikipedia-client", "~> 1.17.0"` |
+Retrieve similar documents based on the query string passed in via the [HyDE technique](https://arxiv.org/abs/2212.10496):
+```ruby
+client.similarity_search_with_hyde()
+```
 
-
+Retrieve similar documents based on the embedding passed in:
+```ruby
+client.similarity_search_by_vector(
+  embedding:,
+  k: # number of results to be retrieved
+)
+```
 
-
+RAG-based querying:
+```ruby
+client.ask(
+  question:
+)
+```
 
-
+## Building chat bots
 
-
+### Conversation class
 
+Choose and instantiate the LLM provider you'll be using:
 ```ruby
-Langchain::
+llm = Langchain::LLM::OpenAI.new(api_key: ENV["OPENAI_API_KEY"])
 ```
-
-or
-
+Instantiate the Conversation class:
 ```ruby
-Langchain::
+chat = Langchain::Conversation.new(llm: llm)
 ```
 
-
+(Optional) Set the conversation context:
+```ruby
+chat.set_context("You are a chatbot from the future")
+```
 
+Exchange messages with the LLM:
+```ruby
+chat.message("Tell me about future technologies")
+```
 
-
-
-
-
-
-
-| JSON | Langchain::Processors::JSON | |
-| JSONL | Langchain::Processors::JSONL | |
-| csv | Langchain::Processors::CSV | |
-| xlsx | Langchain::Processors::Xlsx | `gem "roo", "~> 2.10.0"` |
+To stream the chat response:
+```ruby
+chat = Langchain::Conversation.new(llm: llm) do |chunk|
+  print(chunk)
+end
+```
 
-
-
+Open AI Functions support:
+```ruby
+chat.set_functions(functions)
+```
 
 ## Evaluations (Evals)
 The Evaluations module is a collection of tools that can be used to evaluate and track the performance of the output products by LLM and your RAG (Retrieval Augmented Generation) pipelines.
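The three RAG steps the new README text describes (retrieve relevant knowledge, construct a prompt around it, have the LLM complete it) can be shown end-to-end with a toy sketch. Everything here is an illustrative stand-in: the in-memory `DOCS` store and keyword-overlap scoring replace a vector search database, and `MockLLM` replaces a real provider.

```ruby
# Toy end-to-end RAG loop mirroring the README's three steps.
DOCS = [
  "Stuffed chicken breasts bake at 375F for about 25 minutes.",
  "Spinach wilts in roughly 3 minutes over medium heat."
]

# Step 1: retrieve knowledge (keyword overlap standing in for
# embedding similarity against a vector search DB).
def retrieve(question)
  words = question.downcase.scan(/\w+/)
  DOCS.max_by { |doc| (doc.downcase.scan(/\w+/) & words).size }
end

# Step 2: construct a prompt containing the retrieved knowledge.
def build_prompt(question, context)
  "Context:\n#{context}\n\nQuestion: #{question}\nAnswer:"
end

# Step 3: the LLM generates a completion from the prompt.
class MockLLM
  def complete(prompt:)
    "[completion for a #{prompt.bytesize}-byte prompt]"
  end
end

question = "How long does spinach take to wilt?"
prompt = build_prompt(question, retrieve(question))
puts MockLLM.new.complete(prompt: prompt)
```

Swapping the mocks for a real vectorsearch client and LLM is what the `client.ask(question:)` call above bundles into one step.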
@@ -598,13 +458,16 @@ ragas.score(answer: "", question: "", context: "")
 # }
 ```
 
+## Examples
+Additional examples available: [/examples](https://github.com/andreibondarev/langchainrb/tree/main/examples)
+
 ## Logging
 
 LangChain.rb uses standard logging mechanisms and defaults to `:warn` level. Most messages are at info level, but we will add debug or warn statements as needed.
 To show all log messages:
 
 ```ruby
-Langchain.logger.level = :
+Langchain.logger.level = :debug
 ```
 
 ## Development
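Since the README says the gem "uses standard logging mechanisms" with a `:warn` default, the effect of the `Langchain.logger.level = :debug` line above follows ordinary Ruby `Logger` semantics. A stdlib-only sketch of that level filtering (the messages are made up for illustration):

```ruby
require "logger"
require "stringio"

# Capture log output in memory so the filtering is easy to inspect.
out = StringIO.new
logger = Logger.new(out)

logger.level = Logger::WARN            # the gem's default level
logger.info("embedding 3 documents")   # below :warn — filtered out
logger.warn("rate limited, retrying")  # emitted

logger.level = Logger::DEBUG           # "show all log messages"
logger.info("now visible")

puts out.string
```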
@@ -618,31 +481,6 @@ Langchain.logger.level = :info
 ## Discord
 Join us in the [Langchain.rb](https://discord.gg/WDARp7J2n8) Discord server.
 
-## Core Contributors
-[<img style="border-radius:50%" alt="Andrei Bondarev" src="https://avatars.githubusercontent.com/u/541665?v=4" width="80" height="80" class="avatar">](https://twitter.com/rushing_andrei)
-
-## Contributors
-[<img style="border-radius:50%" alt="Alex Chaplinsky" src="https://avatars.githubusercontent.com/u/695947?v=4" width="80" height="80" class="avatar">](https://github.com/alchaplinsky)
-[<img style="border-radius:50%" alt="Josh Nichols" src="https://avatars.githubusercontent.com/u/159?v=4" width="80" height="80" class="avatar">](https://github.com/technicalpickles)
-[<img style="border-radius:50%" alt="Matt Lindsey" src="https://avatars.githubusercontent.com/u/5638339?v=4" width="80" height="80" class="avatar">](https://github.com/mattlindsey)
-[<img style="border-radius:50%" alt="Ricky Chilcott" src="https://avatars.githubusercontent.com/u/445759?v=4" width="80" height="80" class="avatar">](https://github.com/rickychilcott)
-[<img style="border-radius:50%" alt="Moeki Kawakami" src="https://avatars.githubusercontent.com/u/72325947?v=4" width="80" height="80" class="avatar">](https://github.com/moekidev)
-[<img style="border-radius:50%" alt="Jens Stmrs" src="https://avatars.githubusercontent.com/u/3492669?v=4" width="80" height="80" class="avatar">](https://github.com/faustus7)
-[<img style="border-radius:50%" alt="Rafael Figueiredo" src="https://avatars.githubusercontent.com/u/35845775?v=4" width="80" height="80" class="avatar">](https://github.com/rafaelqfigueiredo)
-[<img style="border-radius:50%" alt="Piero Dotti" src="https://avatars.githubusercontent.com/u/5167659?v=4" width="80" height="80" class="avatar">](https://github.com/ProGM)
-[<img style="border-radius:50%" alt="Michał Ciemięga" src="https://avatars.githubusercontent.com/u/389828?v=4" width="80" height="80" class="avatar">](https://github.com/zewelor)
-[<img style="border-radius:50%" alt="Bruno Bornsztein" src="https://avatars.githubusercontent.com/u/3760?v=4" width="80" height="80" class="avatar">](https://github.com/bborn)
-[<img style="border-radius:50%" alt="Tim Williams" src="https://avatars.githubusercontent.com/u/1192351?v=4" width="80" height="80" class="avatar">](https://github.com/timrwilliams)
-[<img style="border-radius:50%" alt="Zhenhang Tung" src="https://avatars.githubusercontent.com/u/8170159?v=4" width="80" height="80" class="avatar">](https://github.com/ZhenhangTung)
-[<img style="border-radius:50%" alt="Hama" src="https://avatars.githubusercontent.com/u/38002468?v=4" width="80" height="80" class="avatar">](https://github.com/akmhmgc)
-[<img style="border-radius:50%" alt="Josh Weir" src="https://avatars.githubusercontent.com/u/10720337?v=4" width="80" height="80" class="avatar">](https://github.com/joshweir)
-[<img style="border-radius:50%" alt="Arthur Hess" src="https://avatars.githubusercontent.com/u/446035?v=4" width="80" height="80" class="avatar">](https://github.com/arthurhess)
-[<img style="border-radius:50%" alt="Jin Shen" src="https://avatars.githubusercontent.com/u/54917718?v=4" width="80" height="80" class="avatar">](https://github.com/jacshen-ebay)
-[<img style="border-radius:50%" alt="Earle Bunao" src="https://avatars.githubusercontent.com/u/4653624?v=4" width="80" height="80" class="avatar">](https://github.com/erbunao)
-[<img style="border-radius:50%" alt="Maël H." src="https://avatars.githubusercontent.com/u/61985678?v=4" width="80" height="80" class="avatar">](https://github.com/mael-ha)
-[<img style="border-radius:50%" alt="Chris O. Adebiyi" src="https://avatars.githubusercontent.com/u/62605573?v=4" width="80" height="80" class="avatar">](https://github.com/oluvvafemi)
-[<img style="border-radius:50%" alt="Aaron Breckenridge" src="https://avatars.githubusercontent.com/u/201360?v=4" width="80" height="80" class="avatar">](https://github.com/breckenedge)
-
 ## Star History
 
 [![Star History Chart](https://api.star-history.com/svg?repos=andreibondarev/langchainrb&type=Date)](https://star-history.com/#andreibondarev/langchainrb&Date)