PyPI - olly-desktop - Versions diffs - 1.2.1__tar.gz - Mend

olly-desktop 1.2.1__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (62) hide show

olly_desktop-1.2.1/.gitignore +45 -0
olly_desktop-1.2.1/.pypiignore +17 -0
olly_desktop-1.2.1/CHANGELOG.md +6 -0
olly_desktop-1.2.1/LICENSE +21 -0
olly_desktop-1.2.1/PKG-INFO +330 -0
olly_desktop-1.2.1/PUBLISHING.md +37 -0
olly_desktop-1.2.1/README.md +271 -0
olly_desktop-1.2.1/build.bat +30 -0
olly_desktop-1.2.1/build.sh +23 -0
olly_desktop-1.2.1/config.py +88 -0
olly_desktop-1.2.1/core/__init__.py +1 -0
olly_desktop-1.2.1/core/agent.py +198 -0
olly_desktop-1.2.1/core/agent_tools.py +548 -0
olly_desktop-1.2.1/core/capture_context.py +47 -0
olly_desktop-1.2.1/core/chat_store.py +154 -0
olly_desktop-1.2.1/core/file_watcher.py +91 -0
olly_desktop-1.2.1/core/image_utils.py +235 -0
olly_desktop-1.2.1/core/log.py +44 -0
olly_desktop-1.2.1/core/mcp_manager.py +309 -0
olly_desktop-1.2.1/core/ollama_client.py +587 -0
olly_desktop-1.2.1/core/ollama_setup.py +235 -0
olly_desktop-1.2.1/core/paths.py +46 -0
olly_desktop-1.2.1/core/platform.py +144 -0
olly_desktop-1.2.1/core/rag_store.py +327 -0
olly_desktop-1.2.1/core/settings.py +233 -0
olly_desktop-1.2.1/core/startup.py +126 -0
olly_desktop-1.2.1/core/text_capture.py +206 -0
olly_desktop-1.2.1/core/text_inject.py +56 -0
olly_desktop-1.2.1/core/window_utils/__init__.py +30 -0
olly_desktop-1.2.1/core/window_utils/unix.py +197 -0
olly_desktop-1.2.1/core/window_utils/win32.py +127 -0
olly_desktop-1.2.1/docs/install-notes.md +28 -0
olly_desktop-1.2.1/installer.iss +112 -0
olly_desktop-1.2.1/launch.bat +41 -0
olly_desktop-1.2.1/launch.sh +34 -0
olly_desktop-1.2.1/launch.vbs +2 -0
olly_desktop-1.2.1/main.py +297 -0
olly_desktop-1.2.1/packaging/linux/ai-assistant.desktop +7 -0
olly_desktop-1.2.1/pyproject.toml +78 -0
olly_desktop-1.2.1/pytest.ini +5 -0
olly_desktop-1.2.1/requirements-build.txt +2 -0
olly_desktop-1.2.1/requirements-dev.txt +3 -0
olly_desktop-1.2.1/requirements-windows.txt +2 -0
olly_desktop-1.2.1/requirements.txt +19 -0
olly_desktop-1.2.1/ui/__init__.py +1 -0
olly_desktop-1.2.1/ui/ai_popup.py +1415 -0
olly_desktop-1.2.1/ui/floating_button.py +254 -0
olly_desktop-1.2.1/ui/markdown_render.py +65 -0
olly_desktop-1.2.1/ui/onboarding_wizard.py +358 -0
olly_desktop-1.2.1/ui/settings_dialog.py +675 -0
olly_desktop-1.2.1/ui/styles/__init__.py +76 -0
olly_desktop-1.2.1/ui/styles/base.py +72 -0
olly_desktop-1.2.1/ui/styles/linux.py +200 -0
olly_desktop-1.2.1/ui/styles/macos.py +250 -0
olly_desktop-1.2.1/ui/styles/theme_detect.py +71 -0
olly_desktop-1.2.1/ui/styles/windows.py +206 -0
olly_desktop-1.2.1/ui/toast.py +49 -0
olly_desktop-1.2.1/ui/tool_confirm.py +72 -0
olly_desktop-1.2.1/ui/tray_icon.py +25 -0
olly_desktop-1.2.1/utils/__init__.py +1 -0
olly_desktop-1.2.1/utils/hotkey_manager.py +97 -0
olly_desktop-1.2.1/utils/hotkey_validate.py +16 -0

olly_desktop-1.2.1/.gitignore ADDED Viewed

@@ -0,0 +1,45 @@
+# Virtual environment
+venv/
+.venv/
+env/
+# Python cache
+__pycache__/
+*.py[cod]
+*.pyo
+*.pyd
+*.pyc
+.pytest_cache/
+# PyInstaller / build output
+build/
+dist/
+*.spec.bak
+*.egg-info/
+main.build/
+main.dist/
+main.onefile-build/
+installer_output/
+# Built binaries (local dev)
+OllamaSetup.exe
+AIAssistant.exe
+AIAssistantSetup.exe
+# IDE
+.vscode/
+.idea/
+# OS
+.DS_Store
+Thumbs.db
+# Logs
+*.log
+# ChromaDB / RAG data
+chroma_db/
+*.db
+# Secrets
+.env

olly_desktop-1.2.1/.pypiignore ADDED Viewed

@@ -0,0 +1,17 @@
+# Build artifacts — never ship these
+main.build/
+main.dist/
+main.onefile-build/
+installer_output/
+*.spec
+*.dmg
+*.exe
+# Dev/test
+tests/
+.github/
+.pytest_cache/
+__pycache__/
+*.pyc
+# Large assets not needed at runtime

olly_desktop-1.2.1/CHANGELOG.md ADDED Viewed

@@ -0,0 +1,6 @@
+# Changelog
+## 1.2.1
+- Add PyPI packaging as `olly-desktop`; launch with `olly` after `pip install olly-desktop`.
+- Single-source version from package metadata; GitHub Actions publish workflow for PyPI trusted publishing.

olly_desktop-1.2.1/LICENSE ADDED Viewed

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2026 tp-0604
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

olly_desktop-1.2.1/PKG-INFO ADDED Viewed

@@ -0,0 +1,330 @@
+Metadata-Version: 2.4
+Name: olly-desktop
+Version: 1.2.1
+Summary: Local-first AI assistant with agent mode, RAG, and MCP support
+Project-URL: Homepage, https://github.com/tp-0604/ai-assistant
+Project-URL: Repository, https://github.com/tp-0604/ai-assistant
+Project-URL: Bug Tracker, https://github.com/tp-0604/ai-assistant/issues
+License: MIT License
+        Copyright (c) 2026 tp-0604
+        Permission is hereby granted, free of charge, to any person obtaining a copy
+        of this software and associated documentation files (the "Software"), to deal
+        in the Software without restriction, including without limitation the rights
+        to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+        copies of the Software, and to permit persons to whom the Software is
+        furnished to do so, subject to the following conditions:
+        The above copyright notice and this permission notice shall be included in all
+        copies or substantial portions of the Software.
+        THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+        IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+        FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+        AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+        LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+        OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+        SOFTWARE.
+License-File: LICENSE
+Keywords: agent,ai,assistant,llm,mcp,ollama,rag
+Classifier: Development Status :: 4 - Beta
+Classifier: Environment :: MacOS X
+Classifier: Environment :: Win32 (MS Windows)
+Classifier: Environment :: X11 Applications :: Qt
+Classifier: Intended Audience :: End Users/Desktop
+Classifier: License :: OSI Approved :: MIT License
+Classifier: Programming Language :: Python :: 3
+Classifier: Programming Language :: Python :: 3.11
+Classifier: Programming Language :: Python :: 3.12
+Classifier: Topic :: Desktop Environment
+Requires-Python: >=3.11
+Requires-Dist: chromadb>=0.4.22
+Requires-Dist: mcp<2.0.0,>=1.9.0
+Requires-Dist: mss>=9.0.0
+Requires-Dist: pillow>=10.2.0
+Requires-Dist: pynput>=1.7.6
+Requires-Dist: pypdf>=4.0.0
+Requires-Dist: pyperclip>=1.8.2
+Requires-Dist: pyqt6>=6.6.0
+Requires-Dist: pytesseract>=0.3.10
+Requires-Dist: python-docx>=1.1.0
+Requires-Dist: pywinauto>=0.6.8; sys_platform == 'win32'
+Requires-Dist: requests>=2.31.0
+Requires-Dist: watchdog>=4.0.0
+Provides-Extra: dev
+Requires-Dist: pyinstaller>=6.0.0; extra == 'dev'
+Requires-Dist: pytest>=8.0; extra == 'dev'
+Description-Content-Type: text/markdown
+# ✦ AI Assistant
+A **cross-platform desktop AI assistant** that lives in your system tray and works on whatever you are already doing — selected text, screenshots, clipboard images, and your own documents. It runs through **[Ollama](https://ollama.com)** so models stay **local by default**, with optional support for remote or cloud Ollama endpoints when you choose.
+![Python](https://img.shields.io/badge/Python-3.10+-blue)
+![PyQt6](https://img.shields.io/badge/UI-PyQt6-green)
+![Ollama](https://img.shields.io/badge/LLM-Ollama-orange)
+![Platforms](https://img.shields.io/badge/Platforms-Windows%20%7C%20macOS%20%7C%20Linux-lightgrey)
+---
+## What makes this different
+Most AI tools today are **browser tabs, IDE plugins, or single-platform utilities** tied to one vendor’s cloud. This project is built around a different idea: **bring a capable assistant to the OS layer**, without replacing your apps or sending everything to a SaaS backend.
+| | Typical cloud assistants (ChatGPT, Copilot, Gemini) | Ollama WebUI / chat apps | **This project** |
+|---|-----------------------------------------------------|--------------------------|------------------|
+| **Where it runs** | Vendor cloud | Local server in a browser tab | Native desktop app (Windows, macOS, Linux) |
+| **How you invoke it** | Switch app, paste, type | Open browser, paste | **Selection action bar** at the cursor, global hotkey, tray |
+| **Context from your work** | Manual copy-paste | Manual copy-paste | Captures selection, **target window**, screenshots |
+| **Your files** | Upload per chat / enterprise connectors | Manual upload or plugins | **Folder RAG** — index a directory, ask from the action bar |
+| **Model choice** | Vendor models only | Any Ollama model | Any Ollama model + quality presets + vision model picker |
+| **Privacy posture** | Data leaves device by default | Stays local if Ollama is local | **Local-first**; you control URL, capture, and offline mode |
+| **Insert back into apps** | Copy manually | Copy manually | **Insert last reply** hotkey into the foreground app |
+### Novel behaviors this project incorporates
+**1. Selection action bar (not a radial menu, not a sidebar)**
+After you select text, a compact toolbar appears near the cursor with one-click intents: Explain, Summarize, Translate, Ask, Screen, and Ask my files. Other tools usually make you open a separate window and paste — here the intent is chosen in context.
+**2. Foreground-aware capture**
+Before the assistant takes focus, it remembers which window and cursor position you were using. Screen capture targets **that** window — not the assistant’s own popup. That avoids the common “screenshot captured my chat window” failure mode.
+**3. System-wide workflow, not app-specific**
+Works in browsers, editors, PDF readers, terminals, and more via OS-level selection and hotkeys — not only inside one host application.
+**4. Local RAG on a folder you own**
+Point at a watch folder (Documents, a project directory, etc.). Files are chunked and embedded with Ollama; **Ask my files** pulls relevant passages into the prompt. No per-file upload dance each session.
+**5. Vision from the desktop**
+Paste images, capture a window, or use the Screen action for LeetCode-style problems, UI mockups, or diagrams — with configurable vision timeouts and image sizing for slow or cloud vision models.
+**6. Native UI per platform**
+PyQt6 with dedicated styling for Windows (Segoe), macOS (translucent / SF Pro), and Linux (GNOME-inspired) — not a generic Electron shell.
+**7. Power-user controls others often hide**
+Quality presets, custom model names, thinking-mode toggle for Qwen3/cloud models, remote Ollama URL detection with adjusted timeouts, chat export, recent chats, tone chips (Shorter / Simpler / Formal), and insert-reply hotkey.
+### What others do that this project does not (by design)
+- **No bundled proprietary model** — you install and choose models via Ollama.
+- **No multi-user cloud sync** — chats live in your app data directory on your machine.
+- **No IDE-only scope** — it is a general desktop assistant, not a code-editor extension.
+- **No always-on cloud** — when Ollama runs locally, inference stays on your PC; cloud use is opt-in via your Ollama URL and model choice.
+---
+## Features
+- **Selection action bar** — Explain, Summarize, Translate, Ask, Screen, Ask my files
+- **Global hotkey** — open assistant with current selection (Alt+S on Windows/Linux, ⌥S on macOS)
+- **Insert reply** — paste the last AI response into any app (Ctrl+Shift+V / ⌘⇧V)
+- **Vision** — paste images, screenshot button, window/screen capture from the action bar
+- **RAG** — index a configurable folder with Chroma + Ollama embeddings
+- **Chat** — streaming, recent chats, export, tone chips, safe markdown code rendering
+- **Settings UI** — tabbed: general, AI & models, hotkeys, files, advanced
+- **First-run wizard** — Ollama setup and model download with progress
+- **Tray integration** — launch at login, new chat, settings, paste screenshot, quit
+- **Platform UI** — Windows, macOS (liquid glass), Linux (GNOME-inspired)
+### Agent mode (beta)
+Opt in via **Settings → AI & models → Enable tools (beta)**. When enabled and your model supports Ollama tool calling (e.g. qwen3, llama3.2), the assistant can run a short read-only tool loop before answering:
+- **Search indexed files** — RAG over your watch folder
+- **List / read files** — only inside the configured watch folder (path-scoped; no arbitrary disk access)
+- **Read clipboard** — current text clipboard
+- **Capture screen** — OCR text from the foreground window (respects the screen-capture setting)
+Read-only tools use the same local-first rules as chat (including offline-only mode). Requires a tool-capable Ollama model; without one, chat falls back to plain streaming.
+### Desktop actions (beta)
+With **Enable tools** and **Allow desktop actions** both on in Settings, the AI can — each behind an **Allow / Deny** dialog — write text files **inside the watch folder only** (`.txt`, `.md`, `.csv`, `.json`, `.log`), paste text where you click after a 3-second countdown, open `http`/`https` links in your browser, and open documents from the watch folder.
+Constraints: text insertion is unavailable on **Wayland**; on **macOS** it needs the same Accessibility permission as hotkeys. The assistant never runs programs, presses arbitrary keys, or moves the mouse.
+### MCP servers (beta)
+Configure **stdio** MCP servers in **Settings → Advanced → MCP servers**. When agent mode and MCP are enabled, tools from connected servers are advertised to the model automatically. Anything the server does **not** mark with a read-only hint triggers an **Allow / Deny** dialog that shows the exact arguments before execution.
+- **Disabled in offline-only mode** — MCP servers are third-party programs that may use the network.
+- **Trust on first connect** — you must explicitly trust a server before it is started.
+- **Requires the server's runtime** — e.g. Node.js for `npx @modelcontextprotocol/server-filesystem …`.
+**Example:** add a filesystem server scoped to a notes folder (`npx -y @modelcontextprotocol/server-filesystem ~/Notes`). Ask the agent to find action items from meeting notes; it can search and read files in that folder, then summarize. If it needs to write `todo.md`, a non-read-only MCP tool shows the confirmation dialog first.
+SSE/HTTP MCP transport and image tool results are not in this release.
+---
+## Privacy & data
+| Default | Your choice |
+|---------|-------------|
+| Ollama at `127.0.0.1` | Point to a remote or cloud Ollama URL in Settings |
+| Chats stored under app data on your PC | Export or delete via the chat menu |
+| Screen capture can be disabled | Toggle in Settings → Advanced |
+| Images stripped from saved chat JSON | Raw prompts with base64 are not persisted |
+When Ollama runs locally, prompts and model output stay on your machine. If you use a cloud model (for example `qwen3-vl:235b-cloud`), inference goes to that endpoint — configure it explicitly in **Settings → AI & models**.
+---
+## Installation
+### Recommended — pip (all platforms, no signing warnings)
+Requires Python 3.11+ and [Ollama](https://ollama.com) running locally.
+```bash
+pip install olly-desktop
+olly
+```
+For an isolated install (recommended if you use Python for other things):
+```bash
+python -m venv ~/.venvs/olly-desktop
+source ~/.venvs/olly-desktop/bin/activate   # Windows: .venvs\olly-desktop\Scripts\activate
+pip install olly-desktop
+olly
+```
+### Binary installers (optional)
+Pre-built `.dmg` (macOS) and `.exe` (Windows) installers are attached to each
+[GitHub release](https://github.com/tp-0604/ai-assistant/releases).
+> **Note:** The binaries are unsigned. macOS will block the `.dmg` at first launch —
+> see [Installation notes](docs/install-notes.md) for the one-time workaround.
+> The pip install above has no such restriction.
+| Platform | File | Notes |
+|----------|------|-------|
+| Windows | `AIAssistantSetup.exe` | Recommended installer |
+| Windows | `AIAssistant.exe` | Portable build |
+| macOS | `AIAssistant.dmg` | Drag to Applications |
+| Linux | `AIAssistant-linux-x64.tar.gz` | Extract and run `AIAssistant/AIAssistant` |
+### Platform notes
+| OS | Permissions / limits |
+|----|----------------------|
+| **Windows** | Antivirus may flag global hooks once — add an exclusion if prompted |
+| **macOS** | Grant **Accessibility** for hotkeys and text capture |
+| **Linux X11** | Best support for global hotkeys and selection capture |
+| **Linux Wayland** | Global hotkeys, selection capture, and agent text insertion may be unavailable — use the tray menu |
+---
+## From source
+```bash
+git clone https://github.com/tp-0604/ai-assistant.git
+cd ai-assistant
+python -m venv venv
+source venv/bin/activate        # Windows: venv\Scripts\activate
+pip install -r requirements.txt
+# Windows only:
+pip install -r requirements-windows.txt
+```
+Install [Ollama](https://ollama.com), then:
+```bash
+# macOS / Linux
+./launch.sh
+# or
+python main.py
+# Windows
+launch.bat
+```
+---
+## Usage
+| Action | How |
+|--------|-----|
+| Selection bar | Select text (drag, or double-click a word if enabled in Settings) |
+| Open assistant | **Alt+S** (Windows/Linux) · **⌥S** (macOS) |
+| Paste image | **Ctrl+V** / **⌘V** in chat input |
+| Insert last reply | **Ctrl+Shift+V** / **⌘⇧V** |
+| Settings | Tray → Settings, or ⋮ in popup |
+| Ask my files | Selection bar → **Files** (enable RAG and pick a folder in Settings) |
+| Screenshot | Tray → Paste screenshot, or Screen in the action bar |
+---
+## Settings
+Stored in the app data directory — editable via **Settings**:
+| OS | Location |
+|----|----------|
+| Windows | `%APPDATA%\AIAssistant\` |
+| macOS | `~/Library/Application Support/AIAssistant/` |
+| Linux | `~/.local/share/AIAssistant/` |
+| Option | Description |
+|--------|-------------|
+| Quality preset | Speed / Balanced / Quality models |
+| AI & models | LLM, vision model, thinking mode, timeouts, Ollama URL |
+| Hotkeys | Global open and insert-reply shortcuts |
+| Ask my files | RAG folder, enable/disable indexing |
+| Theme | Follow system / Dark / Light |
+| Advanced | Screen capture, image limits, system prompt, offline-only |
+---
+## Build & release
+```bash
+# Windows
+build.bat
+# macOS
+./build.sh mac
+# Linux
+./build.sh linux
+```
+Push a version tag to build all platforms and publish to GitHub Releases:
+```bash
+git tag v1.2.0
+git push origin v1.2.0
+```
+CI (`.github/workflows/build.yml`) runs tests, builds Windows/macOS/Linux artifacts, and uploads them to one release page.
+---
+## Project structure
+```
+ai-assistant/
+├── main.py                 # Entry point, tray, services
+├── core/                   # Ollama client, settings, RAG, capture, platform
+├── ui/                     # Popup, action bar, settings, onboarding, markdown
+├── ui/styles/              # windows.py, macos.py, linux.py
+├── utils/                  # Global hotkeys
+├── packaging/              # Linux desktop file
+├── AI_Assistant.*.spec     # PyInstaller specs per OS
+└── tests/
+```
+---
+## Optional: OCR
+Install [Tesseract](https://github.com/tesseract-ocr/tesseract) for text extraction from images when no vision model is available.
+---
+## License
+MIT

olly_desktop-1.2.1/PUBLISHING.md ADDED Viewed

@@ -0,0 +1,37 @@
+# Publishing setup
+## PyPI trusted publishing (one-time)
+1. Create an account at [pypi.org](https://pypi.org) if you don't have one.
+2. Go to pypi.org → Account settings → Publishing → Add a new publisher.
+3. Fill in:
+   - PyPI project name: `olly-desktop`
+   - GitHub owner: `tp-0604`
+   - Repository: `ai-assistant`
+   - Workflow: `publish.yml`
+   - Environment: `pypi`
+4. The PyPI project `olly-desktop` must already exist (create it under your account if needed).
+## Releasing
+1. Bump the version in `pyproject.toml`.
+2. Update `CHANGELOG.md` (one line per notable change).
+3. Push a tag: `git tag v1.2.1 && git push origin v1.2.1`
+4. Go to GitHub → Releases → Draft a new release → choose the tag → **Publish** (not just save as draft).
+5. The `publish.yml` workflow fires and pushes `olly-desktop` to PyPI.
+6. The existing `build.yml` workflow fires and attaches the binary installers.
+## Testing the package locally before releasing
+```bash
+pip install -e .          # editable install from source
+olly                      # console script
+pip install dist/*.whl    # or install the built wheel directly
+```
+After publish:
+```bash
+pip install olly-desktop
+olly
+```