pygpt-net 2.4.36.post1__py3-none-any.whl → 2.4.37__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
CHANGELOG.md CHANGED
@@ -1,5 +1,13 @@
  # CHANGELOG

+ ## 2.4.37 (2024-11-30)
+
+ - The `Query only` mode in `Uploaded` tab has been renamed to `RAG`.
+ - New options have been added under `Settings -> Files and Attachments`:
+   - `Use history in RAG query`: When enabled, the content of the entire conversation will be used when preparing a query if the mode is set to RAG or Summary.
+   - `RAG limit`: This option is applicable only if `Use history in RAG query` is enabled. It specifies the limit on how many recent entries in the conversation will be used when generating a query for RAG. A value of 0 indicates no limit.
+ - Cache: dynamic parts of the system prompt (from plugins) have been moved to the very end of the prompt stack to enable the use of prompt cache mechanisms in OpenAI.
+
  ## 2.4.36 (2024-11-28)

  - Added a new command-line argument: --workdir="/path/to/workdir" to explicitly set the current working directory.
@@ -33,7 +41,7 @@

  - Added an option checkbox `Auto-index on upload` in the `Attachments` tab:

- **Tip:** To use the `Query only` mode, the file must be indexed in the vector database. This occurs automatically at the time of upload if the `Auto-index on upload` option in the `Attachments` tab is enabled. When uploading large files, such indexing might take a while - therefore, if you are using the `Full context` option, which does not use the index, you can disable the `Auto-index` option to speed up the upload of the attachment. In this case, it will only be indexed when the `Query only` option is called for the first time, and until then, attachment will be available in the form of `Full context` and `Summary`.
+ **Tip:** To use the `RAG` mode, the file must be indexed in the vector database. This occurs automatically at the time of upload if the `Auto-index on upload` option in the `Attachments` tab is enabled. When uploading large files, such indexing might take a while - therefore, if you are using the `Full context` option, which does not use the index, you can disable the `Auto-index` option to speed up the upload of the attachment. In this case, it will only be indexed when the `RAG` option is called for the first time, and until then, attachment will be available in the form of `Full context` and `Summary`.

  - Added context menu options in `Uploaded attachments` tab: `Open`, `Open Source directory` and `Open Storage directory`.

README.md CHANGED
@@ -2,7 +2,7 @@

  [![pygpt](https://snapcraft.io/pygpt/badge.svg)](https://snapcraft.io/pygpt)

- Release: **2.4.36** | build: **2024.11.28** | Python: **>=3.10, <3.12**
+ Release: **2.4.37** | build: **2024.11.30** | Python: **>=3.10, <3.12**

  > Official website: https://pygpt.net | Documentation: https://pygpt.readthedocs.io
  >
@@ -603,11 +603,11 @@ Built-in file loaders:
  - Webpages (crawling any webpage content)
  - YouTube (transcriptions)

- You can configure data loaders in `Settings / LlamaIndex / Data Loaders` by providing list of keyword arguments for specified loaders.
+ You can configure data loaders in `Settings / Indexes (LlamaIndex) / Data Loaders` by providing list of keyword arguments for specified loaders.
  You can also develop and provide your own custom loader and register it within the application.

  LlamaIndex is also integrated with context database - you can use data from database (your context history) as additional context in discussion.
- Options for indexing existing context history or enabling real-time indexing new ones (from database) are available in `Settings / LlamaIndex` section.
+ Options for indexing existing context history or enabling real-time indexing new ones (from database) are available in `Settings / Indexes (LlamaIndex)` section.

  **WARNING:** remember that when indexing content, API calls to the embedding model are used. Each indexing consumes additional tokens. Always control the number of tokens used on the OpenAI page.

@@ -669,7 +669,7 @@ You can set the limit of steps in such a loop by going to `Settings -> Agents an

  You can change the prompt used for evaluating the response in `Settings -> Prompts -> Agent: evaluation prompt in loop`. Here, you can adjust it to suit your needs, for example, by defining more or less critical feedback for the responses received.

- ## Agent (Legacy, Autonomous)
+ ## Agent (Autonomous)

  This is an older version of the Agent mode, still available as legacy. However, it is recommended to use the newer mode: `Agent (LlamaIndex)`.

@@ -817,11 +817,13 @@ The content from the uploaded attachments will be used in the current conversati

  - `Full context`: Provides best results. This mode attaches the entire content of the read file to the user's prompt. This process happens in the background and may require a large number of tokens if you uploaded extensive content.

- - `Query only`: The indexed attachment will only be queried in real-time using LlamaIndex. This operation does not require any additional tokens, but it may not provide access to the full content of the file 1:1.
+ - `RAG`: The indexed attachment will only be queried in real-time using LlamaIndex. This operation does not require any additional tokens, but it may not provide access to the full content of the file 1:1.

  - `Summary`: When queried, an additional query will be generated in the background and executed by a separate model to summarize the content of the attachment and return the required information to the main model. You can change the model used for summarization in the settings under the `Files and attachments` section.

- **Important**: When using `Full context` mode, the entire content of the file is included in the prompt, which can result in high token usage each time. If you want to reduce the number of tokens used, instead use the `Query only` option, which will only query the indexed attachment in the vector database to provide additional context.
+ In the `RAG` and `Summary` mode, you can enable an additional setting by going to `Settings -> Files and attachments -> Use history in RAG query`. This allows for better preparation of queries for RAG. When this option is turned on, the entire conversation context is considered, rather than just the user's last query. This allows for better searching of the index for additional context. In the `RAG limit` option, you can set a limit on how many recent entries in a discussion should be considered (`0 = no limit, default: 3`).
+
+ **Important**: When using `Full context` mode, the entire content of the file is included in the prompt, which can result in high token usage each time. If you want to reduce the number of tokens used, instead use the `RAG` option, which will only query the indexed attachment in the vector database to provide additional context.

  **Images as Additional Context**

@@ -829,7 +831,7 @@ Files such as jpg, png, and similar images are a special case. By default, image

  **Uploading larger files and auto-index**

- To use the `Query only` mode, the file must be indexed in the vector database. This occurs automatically at the time of upload if the `Auto-index on upload` option in the `Attachments` tab is enabled. When uploading large files, such indexing might take a while - therefore, if you are using the `Full context` option, which does not use the index, you can disable the `Auto-index` option to speed up the upload of the attachment. In this case, it will only be indexed when the `Query only` option is called for the first time, and until then, attachment will be available in the form of `Full context` and `Summary`.
+ To use the `RAG` mode, the file must be indexed in the vector database. This occurs automatically at the time of upload if the `Auto-index on upload` option in the `Attachments` tab is enabled. When uploading large files, such indexing might take a while - therefore, if you are using the `Full context` option, which does not use the index, you can disable the `Auto-index` option to speed up the upload of the attachment. In this case, it will only be indexed when the `RAG` option is called for the first time, and until then, attachment will be available in the form of `Full context` and `Summary`.

  ## Downloading files

@@ -2710,6 +2712,16 @@ Config -> Settings...

  - `Directory for file downloads`: Subdirectory for downloaded files, e.g. in Assistants mode, inside "data". Default: "download"

+ - `Verbose mode`: Enabled verbose mode when using attachment as additional context.
+
+ - `Model for querying index`: Model to use for preparing query and querying the index when the RAG option is selected.
+
+ - `Model for attachment content summary`: Model to use when generating a summary for the content of a file when the Summary option is selected.
+
+ - `Use history in RAG query`: When enabled, the content of the entire conversation will be used when preparing a query if mode is RAG or Summary.
+
+ - `RAG limit`: Only if the option `Use history in RAG query` is enabled. Specify the limit of how many recent entries in the conversation will be used when generating a query for RAG. 0 = no limit.
+
  **Context**

  - `Context Threshold`: Sets the number of tokens reserved for the model to respond to the next prompt.
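The `Use history in RAG query` and `RAG limit` options described above can be sketched as follows. This is a hedged illustration: the config keys (`ctx.attachment.rag.history`, `ctx.attachment.rag.history.max_items`) and their defaults are taken from the code changes later in this diff, while `limit_history` and the plain-dict config are stand-ins, not PyGPT APIs:

```python
# A plain dict stands in for window.core.config; keys and defaults
# match those read by prepare_context_history() in this release.
config = {
    "ctx.attachment.rag.history": True,         # `Use history in RAG query`
    "ctx.attachment.rag.history.max_items": 3,  # `RAG limit` (0 = no limit)
}

def limit_history(history: list) -> list:
    """Return the history entries used when preparing the RAG query."""
    if not config.get("ctx.attachment.rag.history", True):
        # history disabled: the query is prepared from the last input only
        return []
    num_items = config.get("ctx.attachment.rag.history.max_items", 3)
    if num_items > 0:
        return history[-num_items:]  # keep only the most recent entries
    return list(history)             # 0 = no limit

conversation = ["q1", "a1", "q2", "a2", "q3"]
recent = limit_history(conversation)  # with the default limit of 3
```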
@@ -3247,7 +3259,7 @@ If you want to only query index (without chat) you can enable `Query index only

  You can create a custom vector store provider or data loader for your data and develop a custom launcher for the application.

- See the section `Extending PyGPT / Adding custom Vector Store provider` for more details.
+ See the section `Extending PyGPT / Adding a custom Vector Store provider` for more details.

  # Updates

@@ -3545,6 +3557,8 @@ Syntax: `event name` - triggered on, `event data` *(data type)*:

  - `AI_NAME` - when preparing an AI name, `data['value']` *(string, name of the AI assistant)*

+ - `AGENT_PROMPT` - on agent prompt in eval mode, `data['value']` *(string, prompt)*
+
  - `AUDIO_INPUT_RECORD_START` - start audio input recording

  - `AUDIO_INPUT_RECORD_STOP` - stop audio input recording
@@ -3603,10 +3617,16 @@ Syntax: `event name` - triggered on, `event data` *(data type)*:

  - `POST_PROMPT` - after preparing a system prompt, `data['value']` *(string, system prompt)*

+ - `POST_PROMPT_ASYNC` - after preparing a system prompt, just before request in async thread, `data['value']` *(string, system prompt)*
+
+ - `POST_PROMPT_END` - after preparing a system prompt, just before request in async thread, at the very end, `data['value']` *(string, system prompt)*
+
  - `PRE_PROMPT` - before preparing a system prompt, `data['value']` *(string, system prompt)*

  - `SYSTEM_PROMPT` - when preparing a system prompt, `data['value']` *(string, system prompt)*

+ - `TOOL_OUTPUT_RENDER` - when rendering extra content from tools from plugins, `data['content']` *(string, content)*
+
  - `UI_ATTACHMENTS` - when the attachment upload elements are rendered, `data['value']` *(bool, show True/False)*

  - `UI_VISION` - when the vision elements are rendered, `data['value']` *(bool, show True/False)*
@@ -3845,6 +3865,14 @@ may consume additional tokens that are not displayed in the main window.

  ## Recent changes:

+ **2.4.37 (2024-11-30)**
+
+ - The `Query only` mode in `Uploaded` tab has been renamed to `RAG`.
+ - New options have been added under `Settings -> Files and Attachments`:
+   - `Use history in RAG query`: When enabled, the content of the entire conversation will be used when preparing a query if the mode is set to RAG or Summary.
+   - `RAG limit`: This option is applicable only if `Use history in RAG query` is enabled. It specifies the limit on how many recent entries in the conversation will be used when generating a query for RAG. A value of 0 indicates no limit.
+ - Cache: dynamic parts of the system prompt (from plugins) have been moved to the very end of the prompt stack to enable the use of prompt cache mechanisms in OpenAI.
+
  **2.4.36 (2024-11-28)**

  - Added a new command-line argument: --workdir="/path/to/workdir" to explicitly set the current working directory.
@@ -3878,7 +3906,7 @@ may consume additional tokens that are not displayed in the main window.

  - Added an option checkbox `Auto-index on upload` in the `Attachments` tab:

- **Tip:** To use the `Query only` mode, the file must be indexed in the vector database. This occurs automatically at the time of upload if the `Auto-index on upload` option in the `Attachments` tab is enabled. When uploading large files, such indexing might take a while - therefore, if you are using the `Full context` option, which does not use the index, you can disable the `Auto-index` option to speed up the upload of the attachment. In this case, it will only be indexed when the `Query only` option is called for the first time, and until then, attachment will be available in the form of `Full context` and `Summary`.
+ **Tip:** To use the `RAG` mode, the file must be indexed in the vector database. This occurs automatically at the time of upload if the `Auto-index on upload` option in the `Attachments` tab is enabled. When uploading large files, such indexing might take a while - therefore, if you are using the `Full context` option, which does not use the index, you can disable the `Auto-index` option to speed up the upload of the attachment. In this case, it will only be indexed when the `RAG` option is called for the first time, and until then, attachment will be available in the form of `Full context` and `Summary`.

  - Added context menu options in `Uploaded attachments` tab: `Open`, `Open Source directory` and `Open Storage directory`.

pygpt_net/CHANGELOG.txt CHANGED
@@ -1,3 +1,11 @@
+ 2.4.37 (2024-11-30)
+
+ - The `Query only` mode in `Uploaded` tab has been renamed to `RAG`.
+ - New options have been added under `Settings -> Files and Attachments`:
+   - `Use history in RAG query`: When enabled, the content of the entire conversation will be used when preparing a query if the mode is set to RAG or Summary.
+   - `RAG limit`: This option is applicable only if `Use history in RAG query` is enabled. It specifies the limit on how many recent entries in the conversation will be used when generating a query for RAG. A value of 0 indicates no limit.
+ - Cache: dynamic parts of the system prompt (from plugins) have been moved to the very end of the prompt stack to enable the use of prompt cache mechanisms in OpenAI.
+
  2.4.36 (2024-11-28)

  - Added a new command-line argument: --workdir="/path/to/workdir" to explicitly set the current working directory.
@@ -31,7 +39,7 @@

  - Added an option checkbox `Auto-index on upload` in the `Attachments` tab:

- Tip: To use the `Query only` mode, the file must be indexed in the vector database. This occurs automatically at the time of upload if the `Auto-index on upload` option in the `Attachments` tab is enabled. When uploading large files, such indexing might take a while - therefore, if you are using the `Full context` option, which does not use the index, you can disable the `Auto-index` option to speed up the upload of the attachment. In this case, it will only be indexed when the `Query only` option is called for the first time, and until then, attachment will be available in the form of `Full context` and `Summary`.
+ Tip: To use the `RAG` mode, the file must be indexed in the vector database. This occurs automatically at the time of upload if the `Auto-index on upload` option in the `Attachments` tab is enabled. When uploading large files, such indexing might take a while - therefore, if you are using the `Full context` option, which does not use the index, you can disable the `Auto-index` option to speed up the upload of the attachment. In this case, it will only be indexed when the `RAG` option is called for the first time, and until then, attachment will be available in the form of `Full context` and `Summary`.

  - Added context menu options in `Uploaded attachments` tab: `Open`, `Open Source directory` and `Open Storage directory`.

pygpt_net/__init__.py CHANGED
@@ -6,15 +6,15 @@
  # GitHub: https://github.com/szczyglis-dev/py-gpt #
  # MIT License #
  # Created By : Marcin Szczygliński #
- # Updated Date: 2024.11.28 01:00:00 #
+ # Updated Date: 2024.11.30 01:00:00 #
  # ================================================== #

  __author__ = "Marcin Szczygliński"
  __copyright__ = "Copyright 2024, Marcin Szczygliński"
  __credits__ = ["Marcin Szczygliński"]
  __license__ = "MIT"
- __version__ = "2.4.36"
- __build__ = "2024.11.28"
+ __version__ = "2.4.37"
+ __build__ = "2024.11.30"
  __maintainer__ = "Marcin Szczygliński"
  __github__ = "https://github.com/szczyglis-dev/py-gpt"
  __website__ = "https://pygpt.net"
@@ -6,7 +6,7 @@
  # GitHub: https://github.com/szczyglis-dev/py-gpt #
  # MIT License #
  # Created By : Marcin Szczygliński #
- # Updated Date: 2024.11.26 04:00:00 #
+ # Updated Date: 2024.11.29 23:00:00 #
  # ================================================== #

  import os
@@ -261,26 +261,22 @@ class Attachment(QObject):
          """
          return self.mode

-     def get_context(self, ctx: CtxItem) -> str:
+     def get_context(self, ctx: CtxItem, history: list) -> str:
          """
          Get additional context for attachment

          :param ctx: CtxItem instance
+         :param history: Context items (history)
          :return: Additional context
          """
-         content = ""
-         meta = ctx.meta
          if self.mode != self.MODE_DISABLED:
              if self.is_verbose():
                  print("\nPreparing additional context...\nContext Mode: {}".format(self.mode))

-             self.window.core.attachments.context.reset()
-             if self.mode == self.MODE_FULL_CONTEXT:
-                 content = self.get_full_context(ctx)
-             elif self.mode == self.MODE_QUERY_CONTEXT:
-                 content = self.get_query_context(meta, str(ctx.input))
-             elif self.mode == self.MODE_QUERY_CONTEXT_SUMMARY:
-                 content = self.get_context_summary(ctx)
+             self.window.core.attachments.context.reset()  # reset used files and urls
+
+             # get additional context from attachments
+             content = self.window.core.attachments.context.get_context(self.mode, ctx, history)

              # append used files and urls to context
              files = self.window.core.attachments.context.get_used_files()
@@ -296,34 +292,6 @@ class Attachment(QObject):
              return "====================================\nADDITIONAL CONTEXT FROM ATTACHMENT(s): {}".format(content)
          return ""

-     def get_full_context(self, ctx: CtxItem) -> str:
-         """
-         Get full context for attachment
-
-         :param ctx: CtxItem instance
-         :return: Full context
-         """
-         return self.window.core.attachments.context.get_context_text(ctx, filename=True)
-
-     def get_query_context(self, meta: CtxMeta, query: str) -> str:
-         """
-         Get query context for attachment
-
-         :param meta: CtxMeta instance
-         :param query: Query string
-         :return: Query context
-         """
-         return self.window.core.attachments.context.query_context(meta, query)
-
-     def get_context_summary(self, ctx: CtxItem) -> str:
-         """
-         Get context summary
-
-         :param ctx: CtxItem instance
-         :return: Context summary
-         """
-         return self.window.core.attachments.context.summary_context(ctx, ctx.input)
-
      def get_uploaded_attachments(self, meta: CtxMeta) -> list:
          """
          Get uploaded attachments for meta
@@ -6,7 +6,7 @@
  # GitHub: https://github.com/szczyglis-dev/py-gpt #
  # MIT License #
  # Created By : Marcin Szczygliński #
- # Updated Date: 2024.11.26 04:00:00 #
+ # Updated Date: 2024.11.29 23:00:00 #
  # ================================================== #

  import copy
@@ -40,7 +40,7 @@ class Context:
          Summarize the text below by extracting the most important information,
          especially those that may help answer the question:

-         `{query}`.
+         `{query}`

          If the answer to the question is not in the text to summarize,
          simply return a summary of the entire content.
@@ -59,25 +59,49 @@

          `{content}`
          """
-
-     def get_all(self, meta: CtxMeta) -> list:
-         """
-         Get all attachments for meta
-
-         :param meta: CtxMeta instance
-         :return: list of attachments
-         """
-         return meta.additional_ctx
-
-     def get_dir(self, meta: CtxMeta) -> str:
-         """
-         Get directory for meta
-
-         :param meta: CtxMeta instance
-         :return: directory path
+         self.rag_prompt = """
+         Prepare a question for the RAG engine (vector database) asking for additional context that can help obtain
+         extra information necessary to answer the user's question. The query should be brief and to the point,
+         so as to be processed as effectively as possible by the RAG engine. Below is the entire conversation
+         of the user with the AI assistant, and at the end the current user's question, for which you need to
+         prepare DIRECT query for the RAG engine for additional context, taking into account the content of the entire
+         discussion and its context. In your response, return only the DIRECT query for additional context,
+         do not return anything else besides it. The response should not contain any phrases other than the query itself:
+
+         # Good RAG query example:
+
+         `What is the capital of France?`
+
+         # Bad RAG query example:
+
+         `Can you tell me the capital of France?`
+
+         # Full conversation:
+
+         `{history}`
+
+         # User question:
+
+         `{query}`
+         """
+
+     def get_context(self, mode: str, ctx: CtxItem, history: list) -> str:
+         """
+         Get context for mode
+
+         :param mode: Context mode
+         :param ctx: CtxItem instance
+         :param history: history
+         :return: context
          """
-         meta_uuid = str(meta.uuid)
-         return os.path.join(self.window.core.config.get_user_dir("ctx_idx"), meta_uuid)
+         content = ""
+         if mode == self.window.controller.chat.attachment.MODE_FULL_CONTEXT:
+             content = self.get_context_text(ctx, filename=True)
+         elif mode == self.window.controller.chat.attachment.MODE_QUERY_CONTEXT:
+             content = self.query_context(ctx, history)
+         elif mode == self.window.controller.chat.attachment.MODE_QUERY_CONTEXT_SUMMARY:
+             content = self.summary_context(ctx, history)
+         return content

      def get_context_text(self, ctx: CtxItem, filename: bool = False) -> str:
          """
@@ -126,15 +150,17 @@
          self.last_used_context = context
          return context

-     def query_context(self, meta: CtxMeta, query: str) -> str:
+     def query_context(self, ctx: CtxItem, history: list) -> str:
          """
          Query the index for context

-         :param meta: CtxMeta instance
-         :param query: query string
+         :param ctx: CtxItem instance
+         :param history: history
          :return: query result
          """
+         meta = ctx.meta
          meta_path = self.get_dir(meta)
+         query = str(ctx.input)
          if not os.path.exists(meta_path) or not os.path.isdir(meta_path):
              return ""
          idx_path = os.path.join(self.get_dir(meta), self.dir_index)
@@ -162,8 +188,21 @@
              self.window.core.ctx.replace(meta)
              self.window.core.ctx.save(meta.id)

+         history_data = self.prepare_context_history(history)
          model, model_item = self.get_selected_model("query")
-         result = self.window.core.idx.chat.query_attachment(query, idx_path, model_item)
+
+         verbose = False
+         if self.is_verbose():
+             verbose = True
+             print("Attachments: using query model: {}".format(model))
+
+         result = self.window.core.idx.chat.query_attachment(
+             query=query,
+             path=idx_path,
+             model=model_item,
+             history=history_data,
+             verbose=verbose,
+         )
          self.last_used_context = result

          if self.is_verbose():
@@ -171,28 +210,12 @@

          return result

-     def get_selected_model(self, mode: str = "summary"):
-         """
-         Get selected model for attachments
-
-         :return: model name, model item
-         """
-         model_item = None
-         model = None
-         if mode == "summary":
-             model = self.window.core.config.get("ctx.attachment.summary.model", "gpt-4o-mini")
-         elif mode == "query":
-             model = self.window.core.config.get("ctx.attachment.query.model", "gpt-4o-mini")
-         if model:
-             model_item = self.window.core.models.get(model)
-         return model, model_item
-
-     def summary_context(self, ctx: CtxItem, query: str) -> str:
+     def summary_context(self, ctx: CtxItem, history: list) -> str:
          """
          Get summary of the context

          :param ctx: CtxItem instance
-         :param query: query string
+         :param history: history
          :return: query result
          """
          model, model_item = self.get_selected_model("summary")
@@ -202,6 +225,7 @@
          if self.is_verbose():
              print("Attachments: using summary model: {}".format(model))

+         query = str(ctx.input)
          content = self.get_context_text(ctx, filename=True)
          prompt = self.summary_prompt.format(
              query=str(query).strip(),
@@ -210,12 +234,14 @@
          if self.is_verbose():
              print("Attachments: summary prompt: {}".format(prompt))

+         history_data = self.prepare_context_history(history)
          ctx = CtxItem()
          bridge_context = BridgeContext(
              ctx=ctx,
              prompt=prompt,
              stream=False,
              model=model_item,
+             history=history_data,
          )
          event = KernelEvent(KernelEvent.CALL, {
              'context': bridge_context,
@@ -228,6 +254,35 @@
              print("Attachments: summary received: {}".format(response))
          return response

+     def prepare_context_history(self, history: list) -> list:
+         """
+         Prepare context history
+
+         :param history: history
+         :return: history data
+         """
+         use_history = self.window.core.config.get("ctx.attachment.rag.history", True)
+         history_data = []
+         if use_history:
+             if self.is_verbose():
+                 print("Attachments: using history for query prepare...")
+
+             # use only last X items from history
+             num_items = self.window.core.config.get("ctx.attachment.rag.history.max_items", 3)
+             history_data = []
+             for item in history:
+                 history_data.append(item)
+
+             # 0 = unlimited
+             if num_items > 0:
+                 if self.is_verbose():
+                     print("Attachments: using last {} items from history...".format(num_items))
+                 if len(history_data) < num_items:
+                     num_items = len(history_data)
+                 history_data = history_data[-num_items:]
+
+         return history_data
+
      def upload(
          self,
          meta: CtxMeta,
@@ -396,6 +451,41 @@
              print("Attachments: indexed. Doc IDs: {}".format(doc_ids))
          return doc_ids

+     def get_all(self, meta: CtxMeta) -> list:
+         """
+         Get all attachments for meta
+
+         :param meta: CtxMeta instance
+         :return: list of attachments
+         """
+         return meta.additional_ctx
+
+     def get_dir(self, meta: CtxMeta) -> str:
+         """
+         Get directory for meta
+
+         :param meta: CtxMeta instance
+         :return: directory path
+         """
+         meta_uuid = str(meta.uuid)
+         return os.path.join(self.window.core.config.get_user_dir("ctx_idx"), meta_uuid)
+
+     def get_selected_model(self, mode: str = "summary"):
+         """
+         Get selected model for attachments
+
+         :return: model name, model item
+         """
+         model_item = None
+         model = None
+         if mode == "summary":
+             model = self.window.core.config.get("ctx.attachment.summary.model", "gpt-4o-mini")
+         elif mode == "query":
+             model = self.window.core.config.get("ctx.attachment.query.model", "gpt-4o-mini")
+         if model:
+             model_item = self.window.core.models.get(model)
+         return model, model_item
+
      def duplicate(self, from_meta_id: int, to_meta_id: int) -> bool:
          """
          Duplicate attachments from one meta to another
@@ -6,7 +6,7 @@
  # GitHub: https://github.com/szczyglis-dev/py-gpt #
  # MIT License #
  # Created By : Marcin Szczygliński #
- # Updated Date: 2024.11.23 21:00:00 #
+ # Updated Date: 2024.11.29 23:00:00 #
  # ================================================== #

  from PySide6.QtCore import QObject, Signal, QRunnable, Slot
@@ -50,6 +50,9 @@ class BridgeWorker(QObject, QRunnable):
          # ADDITIONAL CONTEXT: append additional context from attachments
          self.handle_additional_context()

+         # POST PROMPT END: handle post prompt end event
+         self.handle_post_prompt_end()
+
          # Langchain
          if self.mode == MODE_LANGCHAIN:
              result = self.window.core.chain.call(
@@ -124,6 +127,17 @@
          self.window.dispatch(event)
          self.context.system_prompt = event.data['value']

+     def handle_post_prompt_end(self):
+         """Handle post prompt end event"""
+         event = Event(Event.POST_PROMPT_END, {
+             'mode': self.context.mode,
+             'reply': self.context.ctx.reply,
+             'value': self.context.system_prompt,
+         })
+         event.ctx = self.context.ctx
+         self.window.dispatch(event)
+         self.context.system_prompt = event.data['value']
+
      def handle_additional_context(self):
          """Append additional context"""
          ctx = self.context.ctx
@@ -133,7 +147,7 @@
              return
          if not self.window.controller.chat.attachment.has_context(ctx.meta):
              return
-         ad_context = self.window.controller.chat.attachment.get_context(ctx)
+         ad_context = self.window.controller.chat.attachment.get_context(ctx, self.context.history)
          ad_mode = self.window.controller.chat.attachment.get_mode()
          if ad_context:
              self.context.prompt += "\n\n" + ad_context  # append to input text
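The new `handle_post_prompt_end` hook above follows the same dispatch pattern as the other prompt events: handlers (plugins) may mutate `data['value']` in place, and the possibly modified value is written back to `context.system_prompt`, so dynamic plugin parts land at the very end of the prompt stack. A self-contained sketch of that round-trip, using simplified stand-ins for `Event` and the dispatcher rather than the real PyGPT classes:

```python
class Event:
    # simplified stand-in for pygpt_net's Event class
    POST_PROMPT_END = "post.prompt.end"

    def __init__(self, name: str, data: dict):
        self.name = name
        self.data = data

def dispatch(event: Event, handlers: list) -> None:
    # each handler may mutate event.data in place, as a plugin would
    for handler in handlers:
        handler(event)

def append_dynamic_part(event: Event) -> None:
    # example handler: appends its dynamic prompt part at the very end
    if event.name == Event.POST_PROMPT_END:
        event.data['value'] += "\n[plugin tools available]"

system_prompt = "You are a helpful assistant."
event = Event(Event.POST_PROMPT_END, {'value': system_prompt})
dispatch(event, [append_dynamic_part])
system_prompt = event.data['value']  # write back, as the worker does
```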
@@ -6,7 +6,7 @@
  # GitHub: https://github.com/szczyglis-dev/py-gpt #
  # MIT License #
  # Created By : Marcin Szczygliński #
- # Updated Date: 2024.11.26 19:00:00 #
+ # Updated Date: 2024.11.29 23:00:00 #
  # ================================================== #

  import json
@@ -50,6 +50,7 @@ class Event(BaseEvent):
      PLUGIN_OPTION_GET = "plugin.option.get"
      POST_PROMPT = "post.prompt"
      POST_PROMPT_ASYNC = "post.prompt.async"
+     POST_PROMPT_END = "post.prompt.end"
      PRE_PROMPT = "pre.prompt"
      SYSTEM_PROMPT = "system.prompt"
      TOOL_OUTPUT_RENDER = "tool.output.render"