llms-py 2.0.8__tar.gz → 2.0.10__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (51)
  1. {llms_py-2.0.8/llms_py.egg-info → llms_py-2.0.10}/PKG-INFO +124 -39
  2. {llms_py-2.0.8 → llms_py-2.0.10}/README.md +123 -38
  3. llms_py-2.0.10/index.html +80 -0
  4. {llms_py-2.0.8 → llms_py-2.0.10}/llms.json +16 -10
  5. {llms_py-2.0.8 → llms_py-2.0.10}/llms.py +144 -13
  6. {llms_py-2.0.8 → llms_py-2.0.10/llms_py.egg-info}/PKG-INFO +124 -39
  7. {llms_py-2.0.8 → llms_py-2.0.10}/llms_py.egg-info/SOURCES.txt +12 -2
  8. {llms_py-2.0.8 → llms_py-2.0.10}/pyproject.toml +1 -1
  9. {llms_py-2.0.8 → llms_py-2.0.10}/setup.py +16 -6
  10. llms_py-2.0.10/ui/Avatar.mjs +28 -0
  11. llms_py-2.0.10/ui/Brand.mjs +23 -0
  12. {llms_py-2.0.8 → llms_py-2.0.10}/ui/ChatPrompt.mjs +101 -69
  13. {llms_py-2.0.8 → llms_py-2.0.10}/ui/Main.mjs +43 -183
  14. llms_py-2.0.10/ui/ModelSelector.mjs +29 -0
  15. llms_py-2.0.10/ui/ProviderStatus.mjs +105 -0
  16. {llms_py-2.0.8 → llms_py-2.0.10}/ui/Recents.mjs +2 -1
  17. llms_py-2.0.10/ui/SettingsDialog.mjs +374 -0
  18. {llms_py-2.0.8 → llms_py-2.0.10}/ui/Sidebar.mjs +11 -27
  19. llms_py-2.0.10/ui/SignIn.mjs +64 -0
  20. llms_py-2.0.10/ui/SystemPromptEditor.mjs +31 -0
  21. llms_py-2.0.10/ui/SystemPromptSelector.mjs +36 -0
  22. llms_py-2.0.10/ui/Welcome.mjs +8 -0
  23. llms_py-2.0.10/ui/ai.mjs +80 -0
  24. {llms_py-2.0.8 → llms_py-2.0.10}/ui/app.css +76 -10
  25. llms_py-2.0.10/ui/lib/servicestack-vue.mjs +37 -0
  26. {llms_py-2.0.8 → llms_py-2.0.10}/ui/markdown.mjs +9 -2
  27. {llms_py-2.0.8 → llms_py-2.0.10}/ui/tailwind.input.css +13 -4
  28. {llms_py-2.0.8 → llms_py-2.0.10}/ui/threadStore.mjs +2 -2
  29. {llms_py-2.0.8 → llms_py-2.0.10}/ui/typography.css +109 -1
  30. {llms_py-2.0.8 → llms_py-2.0.10}/ui/utils.mjs +8 -2
  31. llms_py-2.0.8/index.html +0 -64
  32. llms_py-2.0.8/ui/lib/servicestack-vue.min.mjs +0 -37
  33. {llms_py-2.0.8 → llms_py-2.0.10}/LICENSE +0 -0
  34. {llms_py-2.0.8 → llms_py-2.0.10}/MANIFEST.in +0 -0
  35. {llms_py-2.0.8 → llms_py-2.0.10}/llms_py.egg-info/dependency_links.txt +0 -0
  36. {llms_py-2.0.8 → llms_py-2.0.10}/llms_py.egg-info/entry_points.txt +0 -0
  37. {llms_py-2.0.8 → llms_py-2.0.10}/llms_py.egg-info/not-zip-safe +0 -0
  38. {llms_py-2.0.8 → llms_py-2.0.10}/llms_py.egg-info/requires.txt +0 -0
  39. {llms_py-2.0.8 → llms_py-2.0.10}/llms_py.egg-info/top_level.txt +0 -0
  40. {llms_py-2.0.8 → llms_py-2.0.10}/requirements.txt +0 -0
  41. {llms_py-2.0.8 → llms_py-2.0.10}/setup.cfg +0 -0
  42. {llms_py-2.0.8 → llms_py-2.0.10}/ui/App.mjs +0 -0
  43. {llms_py-2.0.8 → llms_py-2.0.10}/ui/fav.svg +0 -0
  44. {llms_py-2.0.8 → llms_py-2.0.10}/ui/lib/highlight.min.mjs +0 -0
  45. {llms_py-2.0.8 → llms_py-2.0.10}/ui/lib/idb.min.mjs +0 -0
  46. {llms_py-2.0.8 → llms_py-2.0.10}/ui/lib/marked.min.mjs +0 -0
  47. /llms_py-2.0.8/ui/lib/servicestack-client.min.mjs → /llms_py-2.0.10/ui/lib/servicestack-client.mjs +0 -0
  48. {llms_py-2.0.8 → llms_py-2.0.10}/ui/lib/vue-router.min.mjs +0 -0
  49. {llms_py-2.0.8 → llms_py-2.0.10}/ui/lib/vue.min.mjs +0 -0
  50. {llms_py-2.0.8 → llms_py-2.0.10}/ui/lib/vue.mjs +0 -0
  51. {llms_py-2.0.8 → llms_py-2.0.10}/ui.json +0 -0
{llms_py-2.0.8/llms_py.egg-info → llms_py-2.0.10}/PKG-INFO
@@ -1,6 +1,6 @@
  Metadata-Version: 2.4
  Name: llms-py
- Version: 2.0.8
+ Version: 2.0.10
  Summary: A lightweight CLI tool and OpenAI-compatible server for querying multiple Large Language Model (LLM) providers
  Home-page: https://github.com/ServiceStack/llms
  Author: ServiceStack
@@ -51,7 +51,7 @@ Configure additional providers and models in [llms.json](llms.json)
  ## Features

  - **Lightweight**: Single [llms.py](llms.py) Python file with single `aiohttp` dependency
- - **Multi-Provider Support**: OpenRouter, Ollama, Anthropic, Google, OpenAI, Grok, Groq, Qwen, Mistral
+ - **Multi-Provider Support**: OpenRouter, Ollama, Anthropic, Google, OpenAI, Grok, Groq, Qwen, Z.ai, Mistral
  - **OpenAI-Compatible API**: Works with any client that supports OpenAI's chat completion API
  - **Configuration Management**: Easy provider enable/disable and configuration management
  - **CLI Interface**: Simple command-line interface for quick interactions
@@ -510,7 +510,50 @@ llms --default grok-4

  # Update llms.py to latest version
  llms --update
- ```
+
+ # Pass custom parameters to chat request (URL-encoded)
+ llms --args "temperature=0.7&seed=111" "What is 2+2?"
+
+ # Multiple parameters with different types
+ llms --args "temperature=0.5&max_completion_tokens=50" "Tell me a joke"
+
+ # URL-encoded special characters (stop sequences)
+ llms --args "stop=Two,Words" "Count to 5"
+
+ # Combine with other options
+ llms --system "You are helpful" --args "temperature=0.3" --raw "Hello"
+ ```
+
+ #### Custom Parameters with `--args`
+
+ The `--args` option allows you to pass URL-encoded parameters to customize the chat request sent to LLM providers:
+
+ **Parameter Types:**
+ - **Floats**: `temperature=0.7`, `frequency_penalty=0.2`
+ - **Integers**: `max_completion_tokens=100`
+ - **Booleans**: `store=true`, `verbose=false`, `logprobs=true`
+ - **Strings**: `stop=one`
+ - **Lists**: `stop=two,words`
+
+ **Common Parameters:**
+ - `temperature`: Controls randomness (0.0 to 2.0)
+ - `max_completion_tokens`: Maximum tokens in response
+ - `seed`: For reproducible outputs
+ - `top_p`: Nucleus sampling parameter
+ - `stop`: Stop sequences (URL-encode special chars)
+ - `store`: Whether or not to store the output
+ - `frequency_penalty`: Penalize new tokens based on frequency
+ - `presence_penalty`: Penalize new tokens based on presence
+ - `logprobs`: Include log probabilities in response
+ - `parallel_tool_calls`: Enable parallel tool calls
+ - `prompt_cache_key`: Cache key for prompt
+ - `reasoning_effort`: Reasoning effort (low, medium, high, *minimal, *none, *default)
+ - `safety_identifier`: A string that uniquely identifies each user
+ - `service_tier`: Service tier (free, standard, premium, *default)
+ - `top_logprobs`: Number of top logprobs to return
+ - `verbosity`: Verbosity level (0, 1, 2, 3, *default)
+ - `enable_thinking`: Enable thinking mode (Qwen)
+ - `stream`: Enable streaming responses

  ### Default Model Configuration

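To make the type coercion above concrete, here is a minimal sketch of parsing an `--args` string with only the standard library. The `parse_args` helper and its coercion rules are assumptions for illustration, not the code llms.py actually ships:

```python
# Minimal sketch of URL-encoded --args parsing; hypothetical helper,
# not the parser llms.py actually ships.
from urllib.parse import parse_qsl

def parse_args(args: str) -> dict:
    """Parse e.g. "temperature=0.7&seed=111&stop=two,words" into typed values."""
    params = {}
    for key, value in parse_qsl(args):       # also URL-decodes %-escapes
        if value.lower() in ("true", "false"):          # booleans
            params[key] = value.lower() == "true"
        elif "," in value:                              # lists like stop=two,words
            params[key] = value.split(",")
        else:
            try:
                params[key] = int(value)                # integers
            except ValueError:
                try:
                    params[key] = float(value)          # floats
                except ValueError:
                    params[key] = value                 # plain strings
    # assumption: the typed values are merged into the chat request body
    return params

print(parse_args("temperature=0.7&seed=111&store=true&stop=Two,Words"))
# {'temperature': 0.7, 'seed': 111, 'store': True, 'stop': ['Two', 'Words']}
```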
@@ -558,6 +603,42 @@ llms "Explain quantum computing" | glow

  ## Supported Providers

+ Any OpenAI-compatible provider and its models can be added by configuring them in [llms.json](./llms.json). By default, only AI providers with free tiers are enabled, and they are only "available" once their API key is set.
+
+ You can list the available providers, their models, and which are enabled or disabled with:
+
+ ```bash
+ llms ls
+ ```
+
+ Providers can be enabled or disabled in your `llms.json` file or with:
+
+ ```bash
+ llms --enable <provider>
+ llms --disable <provider>
+ ```
+
+ For a provider to be available, its API key must also be configured, either in your environment variables
+ or directly in your `llms.json`.
+
+ ### Environment Variables
+
+ | Provider        | Variable                  | Description                    | Example      |
+ |-----------------|---------------------------|--------------------------------|--------------|
+ | openrouter_free | `OPENROUTER_FREE_API_KEY` | OpenRouter FREE models API key | `sk-or-...`  |
+ | groq            | `GROQ_API_KEY`            | Groq API key                   | `gsk_...`    |
+ | google_free     | `GOOGLE_FREE_API_KEY`     | Google FREE API key            | `AIza...`    |
+ | codestral       | `CODESTRAL_API_KEY`       | Codestral API key              | `...`        |
+ | ollama          | N/A                       | No API key required            |              |
+ | openrouter      | `OPENROUTER_API_KEY`      | OpenRouter API key             | `sk-or-...`  |
+ | google          | `GOOGLE_API_KEY`          | Google API key                 | `AIza...`    |
+ | anthropic       | `ANTHROPIC_API_KEY`       | Anthropic API key              | `sk-ant-...` |
+ | openai          | `OPENAI_API_KEY`          | OpenAI API key                 | `sk-...`     |
+ | grok            | `GROK_API_KEY`            | Grok (X.AI) API key            | `xai-...`    |
+ | qwen            | `DASHSCOPE_API_KEY`       | Qwen (Alibaba) API key         | `sk-...`     |
+ | z.ai            | `ZAI_API_KEY`             | Z.ai API key                   | `sk-...`     |
+ | mistral         | `MISTRAL_API_KEY`         | Mistral API key                | `...`        |
+
  ### OpenAI
  - **Type**: `OpenAiProvider`
  - **Models**: GPT-5, GPT-5 Codex, GPT-4o, GPT-4o-mini, o3, etc.
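The availability rule described above (a provider must be enabled and, except for ollama, have its API key set) can be pictured with a short sketch. The `is_available` helper and the trimmed-down mapping are illustrative assumptions, not llms.py's actual logic:

```python
# Hypothetical availability check mirroring the table above;
# the real llms.py logic may differ.
import os

ENV_KEYS = {
    "openrouter_free": "OPENROUTER_FREE_API_KEY",
    "groq": "GROQ_API_KEY",
    "google_free": "GOOGLE_FREE_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "openai": "OPENAI_API_KEY",
    "ollama": None,  # local server, no API key required
}

def is_available(provider: str, enabled: bool) -> bool:
    """A provider is available when it is enabled and its API key is set."""
    if not enabled:
        return False
    env_var = ENV_KEYS.get(provider)
    return env_var is None or bool(os.environ.get(env_var))
```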
@@ -588,6 +669,26 @@ export GOOGLE_API_KEY="your-key"
  llms --enable google_free
  ```

+ ### OpenRouter
+ - **Type**: `OpenAiProvider`
+ - **Models**: 100+ models from various providers
+ - **Features**: Access to latest models, free tier available
+
+ ```bash
+ export OPENROUTER_API_KEY="your-key"
+ llms --enable openrouter
+ ```
+
+ ### Grok (X.AI)
+ - **Type**: `OpenAiProvider`
+ - **Models**: Grok-4, Grok-3, Grok-3-mini, Grok-code-fast-1, etc.
+ - **Features**: Real-time information, humor, uncensored responses
+
+ ```bash
+ export GROK_API_KEY="your-key"
+ llms --enable grok
+ ```
+
  ### Groq
  - **Type**: `OpenAiProvider`
  - **Models**: Llama 3.3, Gemma 2, Kimi K2, etc.
@@ -608,44 +709,44 @@ llms --enable groq
  llms --enable ollama
  ```

- ### OpenRouter
+ ### Qwen (Alibaba Cloud)
  - **Type**: `OpenAiProvider`
- - **Models**: 100+ models from various providers
- - **Features**: Access to latest models, free tier available
+ - **Models**: Qwen3-max, Qwen-max, Qwen-plus, Qwen2.5-VL, QwQ-plus, etc.
+ - **Features**: Multilingual, vision models, coding, reasoning, audio processing

  ```bash
- export OPENROUTER_API_KEY="your-key"
- llms --enable openrouter
+ export DASHSCOPE_API_KEY="your-key"
+ llms --enable qwen
  ```

- ### Mistral
+ ### Z.ai
  - **Type**: `OpenAiProvider`
- - **Models**: Mistral Large, Codestral, Pixtral, etc.
- - **Features**: Code generation, multilingual
+ - **Models**: GLM-4.6, GLM-4.5, GLM-4.5-air, GLM-4.5-x, GLM-4.5-airx, GLM-4.5-flash, GLM-4:32b
+ - **Features**: Advanced language models with strong reasoning capabilities

  ```bash
- export MISTRAL_API_KEY="your-key"
- llms --enable mistral
+ export ZAI_API_KEY="your-key"
+ llms --enable z.ai
  ```

- ### Grok (X.AI)
+ ### Mistral
  - **Type**: `OpenAiProvider`
- - **Models**: Grok-4, Grok-3, Grok-3-mini, Grok-code-fast-1, etc.
- - **Features**: Real-time information, humor, uncensored responses
+ - **Models**: Mistral Large, Codestral, Pixtral, etc.
+ - **Features**: Code generation, multilingual

  ```bash
- export GROK_API_KEY="your-key"
- llms --enable grok
+ export MISTRAL_API_KEY="your-key"
+ llms --enable mistral
  ```

- ### Qwen (Alibaba Cloud)
+ ### Codestral
  - **Type**: `OpenAiProvider`
- - **Models**: Qwen3-max, Qwen-max, Qwen-plus, Qwen2.5-VL, QwQ-plus, etc.
- - **Features**: Multilingual, vision models, coding, reasoning, audio processing
+ - **Models**: Codestral
+ - **Features**: Code generation

  ```bash
- export DASHSCOPE_API_KEY="your-key"
- llms --enable qwen
+ export CODESTRAL_API_KEY="your-key"
+ llms --enable codestral
  ```

  ## Model Routing
@@ -654,22 +755,6 @@ The tool automatically routes requests to the first available provider that supp

  Example: If multiple providers support `kimi-k2`, the request will first try OpenRouter (free), then fall back to Groq, then to OpenRouter (paid) if a request fails.

- ## Environment Variables
-
- | Variable | Description | Example |
- |----------|-------------|---------|
- | `LLMS_CONFIG_PATH` | Custom config file path | `/path/to/llms.json` |
- | `OPENAI_API_KEY` | OpenAI API key | `sk-...` |
- | `ANTHROPIC_API_KEY` | Anthropic API key | `sk-ant-...` |
- | `GOOGLE_API_KEY` | Google API key | `AIza...` |
- | `GROQ_API_KEY` | Groq API key | `gsk_...` |
- | `MISTRAL_API_KEY` | Mistral API key | `...` |
- | `OPENROUTER_API_KEY` | OpenRouter API key | `sk-or-...` |
- | `OPENROUTER_FREE_API_KEY` | OpenRouter free tier key | `sk-or-...` |
- | `CODESTRAL_API_KEY` | Codestral API key | `...` |
- | `GROK_API_KEY` | Grok (X.AI) API key | `xai-...` |
- | `DASHSCOPE_API_KEY` | Qwen (Alibaba Cloud) API key | `sk-...` |
-
  ## Configuration Examples

  ### Minimal Configuration
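A schematic sketch of this first-available-provider routing follows. The `route_chat` function, the `provider.chat` call, and the attribute names are assumptions for illustration only:

```python
# Schematic model routing: try each enabled provider that lists the
# model, in configured order, falling back on failure. Illustrative only.
async def route_chat(providers: list, model: str, request: dict) -> dict:
    last_error = None
    for provider in providers:            # e.g. openrouter_free, groq, openrouter
        if not provider.available or model not in provider.models:
            continue
        try:
            return await provider.chat(model, request)  # assumed provider API
        except Exception as e:            # request failed: try the next provider
            last_error = e
    raise RuntimeError(f"No provider could serve {model!r}") from last_error
```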
{llms_py-2.0.8 → llms_py-2.0.10}/README.md

(The README.md diff repeats the PKG-INFO changes above, since PKG-INFO embeds the README; only the hunk offsets differ, shifted by 40 lines.)
llms_py-2.0.10/index.html
@@ -0,0 +1,80 @@
+ <html>
+ <head>
+ <title>llms.py</title>
+ <link rel="stylesheet" href="/ui/typography.css">
+ <link rel="stylesheet" href="/ui/app.css">
+ <link rel="icon" type="image/svg" href="/ui/fav.svg">
+ <style>
+ [type='button'],button[type='submit']{cursor:pointer}
+ [type='checkbox'].switch:checked:hover,
+ [type='checkbox'].switch:checked:focus,
+ [type='checkbox'].switch:checked,
+ [type='checkbox'].switch:focus,
+ [type='checkbox'].switch
+ {
+ border: none;
+ background: none;
+ outline: none;
+ box-shadow: none;
+ cursor: pointer;
+ }
+ </style>
+ </head>
+ <script type="importmap">
+ {
+ "imports": {
+ "vue": "/ui/lib/vue.min.mjs",
+ "vue-router": "/ui/lib/vue-router.min.mjs",
+ "@servicestack/client": "/ui/lib/servicestack-client.mjs",
+ "@servicestack/vue": "/ui/lib/servicestack-vue.mjs",
+ "idb": "/ui/lib/idb.min.mjs",
+ "marked": "/ui/lib/marked.min.mjs",
+ "highlight.js": "/ui/lib/highlight.min.mjs"
+ }
+ }
+ </script>
+ <body>
+ <div id="app"></div>
+ </body>
+ <script type="module">
+ import { createApp, defineAsyncComponent } from 'vue'
+ import { createWebHistory, createRouter } from "vue-router"
+ import ServiceStackVue from "@servicestack/vue"
+ import App from '/ui/App.mjs'
+ import ai from '/ui/ai.mjs'
+ import SettingsDialog from '/ui/SettingsDialog.mjs'
+
+ const { config, models } = await ai.init()
+ const MainComponent = defineAsyncComponent(() => import(ai.base + '/ui/Main.mjs'))
+ const RecentsComponent = defineAsyncComponent(() => import(ai.base + '/ui/Recents.mjs'))
+
+ const Components = {
+ SettingsDialog,
+ }
+
+ const routes = [
+ { path: '/', component: MainComponent },
+ { path: '/c/:id', component: MainComponent },
+ { path: '/recents', component: RecentsComponent },
+ { path: '/:fallback(.*)*', component: MainComponent }
+ ]
+ routes.forEach(r => r.path = ai.base + r.path)
+ const router = createRouter({
+ history: createWebHistory(),
+ routes,
+ })
+ const app = createApp(App, { config, models })
+ app.use(router)
+ app.use(ServiceStackVue)
+ app.provide('ai', ai)
+ app.provide('config', config)
+ app.provide('models', models)
+ Object.keys(Components).forEach(name => {
+ app.component(name, Components[name])
+ })
+
+ window.ai = app.config.globalProperties.$ai = ai
+
+ app.mount('#app')
+ </script>
+ </html>
{llms_py-2.0.8 → llms_py-2.0.10}/llms.json
@@ -9,7 +9,12 @@
  "messages": [
  {
  "role": "user",
- "content": ""
+ "content": [
+ {
+ "type": "text",
+ "text": ""
+ }
+ ]
  }
  ]
  },
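This hunk changes the default message `content` from a plain string to OpenAI-style typed content parts, the shape multimodal requests use. A small sketch of building such a message (the `text_message` helper is hypothetical):

```python
# Sketch: a user message whose content is a list of typed parts rather
# than a bare string, matching the new llms.json default. Hypothetical helper.
def text_message(text: str, image_url: str | None = None) -> dict:
    parts = [{"type": "text", "text": text}]
    if image_url:  # further parts (e.g. images) can sit alongside the text part
        parts.append({"type": "image_url", "image_url": {"url": image_url}})
    return {"role": "user", "content": parts}

print(text_message("Describe this image", "https://example.org/cat.png"))
```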
@@ -90,10 +95,8 @@
  "deepseek-r1:671b": "deepseek/deepseek-r1-0528:free",
  "gemini-2.0-flash": "google/gemini-2.0-flash-exp:free",
  "glm-4.5-air": "z-ai/glm-4.5-air:free",
- "grok-4-fast": "x-ai/grok-4-fast:free",
  "mai-ds-r1": "microsoft/mai-ds-r1:free",
  "llama3.3:70b": "meta-llama/llama-3.3-70b-instruct:free",
- "kimi-k2": "moonshotai/kimi-k2:free",
  "nemotron-nano:9b": "nvidia/nemotron-nano-9b-v2:free",
  "deepseek-r1-distill-llama:70b": "deepseek/deepseek-r1-distill-llama-70b:free",
  "gpt-oss:20b": "openai/gpt-oss-20b:free",
@@ -102,7 +105,6 @@
  "devstral-small": "mistralai/devstral-small-2505:free",
  "venice-uncensored:24b": "cognitivecomputations/dolphin-mistral-24b-venice-edition:free",
  "llama3.3:8b": "meta-llama/llama-3.3-8b-instruct:free",
- "llama3.1:405b": "meta-llama/llama-3.1-405b-instruct:free",
  "kimi-dev:72b": "moonshotai/kimi-dev-72b:free",
  "gemma3:27b": "google/gemma-3-27b-it:free",
  "qwen3-coder": "qwen/qwen3-coder:free",
@@ -171,7 +173,7 @@
  }
  },
  "ollama": {
- "enabled": false,
+ "enabled": true,
  "type": "OllamaProvider",
  "base_url": "http://localhost:11434",
  "models": {},
@@ -389,7 +391,8 @@
  "qwen2.5-vl:7b": "qwen2.5-vl-7b-instruct",
  "qwen2.5-vl:3b": "qwen2.5-vl-3b-instruct",
  "qwen2.5-omni:7b": "qwen2.5-omni-7b"
- }
+ },
+ "enable_thinking": false
  },
  "z.ai": {
  "enabled": false,
@@ -404,7 +407,8 @@
  "glm-4.5-airx": "glm-4.5-airx",
  "glm-4.5-flash": "glm-4.5-flash",
  "glm-4:32b": "glm-4-32b-0414-128k"
- }
+ },
+ "temperature": 0.7
  },
  "mistral": {
  "enabled": false,
@@ -417,20 +421,22 @@
  "devstral-medium": "devstral-medium-2507",
  "codestral:22b": "codestral-latest",
  "mistral-ocr": "mistral-ocr-latest",
- "voxtral-mini": "voxtral-mini-latest",
  "mistral-small3.2:24b": "mistral-small-latest",
  "magistral-small": "magistral-small-latest",
  "devstral-small": "devstral-small-2507",
  "voxtral-small": "voxtral-small-latest",
+ "voxtral-mini": "voxtral-mini-latest",
+ "codestral-embed": "codestral-embed-2505",
+ "mistral-embed": "mistral-embed",
  "mistral-large:123b": "mistral-large-latest",
  "pixtral-large:124b": "pixtral-large-latest",
  "pixtral:12b": "pixtral-12b",
- "mistral-nemo:12b": "mistral-nemo",
+ "mistral-nemo:12b": "open-mistral-nemo",
  "mistral-saba": "mistral-saba-latest",
  "mistral:7b": "open-mistral-7b",
  "mixtral:8x7b": "open-mixtral-8x7b",
  "mixtral:8x22b": "open-mixtral-8x22b",
- "ministral:8b": "ministral-3b-latest",
+ "ministral:8b": "ministral-8b-latest",
  "ministral:3b": "ministral-3b-latest"
  }
  }
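The remaining hunks fix alias-to-model-id mappings, notably `ministral:8b`, which previously pointed at `ministral-3b-latest`. Each provider's `models` map translates a short local alias into the provider's own model id, roughly as sketched below (the `resolve` helper is assumed):

```python
# Sketch of alias resolution against a provider's "models" map from llms.json.
MISTRAL_MODELS = {
    "ministral:8b": "ministral-8b-latest",   # fixed in 2.0.10
    "ministral:3b": "ministral-3b-latest",
    "mistral-nemo:12b": "open-mistral-nemo",
}

def resolve(alias: str, models: dict) -> str:
    """Translate a short alias to the provider's model id; assumed helper."""
    if alias not in models:
        raise KeyError(f"model {alias!r} not offered by this provider")
    return models[alias]

assert resolve("ministral:8b", MISTRAL_MODELS) == "ministral-8b-latest"
```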