ltcai (npm) 2.1.0 → 2.2.0

Files changed (35)

package/README.md +140 -590
package/auto_setup.py +17 -17
package/docs/CHANGELOG.md +45 -0
package/docs/MULTI_AGENT_RUNTIME.md +4 -4
package/docs/PLUGIN_SDK.md +7 -7
package/docs/REALTIME_COLLABORATION.md +6 -6
package/docs/V2_ARCHITECTURE.md +45 -25
package/docs/WORKFLOW_DESIGNER.md +4 -4
package/docs/architecture.md +127 -135
package/docs/kg-schema.md +3 -3
package/docs/public-deploy.md +2 -3
package/knowledge_graph.py +2 -2
package/latticeai/__init__.py +1 -1
package/latticeai/api/models.py +8 -0
package/latticeai/core/config.py +1 -1
package/latticeai/core/graph_curator.py +2 -2
package/latticeai/core/marketplace.py +2 -2
package/latticeai/core/model_compat.py +7 -63
package/latticeai/core/model_resolution.py +1 -1
package/latticeai/core/multi_agent.py +1 -1
package/latticeai/core/plugins.py +1 -1
package/latticeai/core/realtime.py +1 -1
package/latticeai/core/workflow_engine.py +1 -1
package/latticeai/core/workspace_os.py +1 -1
package/latticeai/server_app.py +1 -1
package/latticeai/services/model_catalog.py +105 -153
package/latticeai/services/model_recommendation.py +28 -17
package/latticeai/services/model_runtime.py +2 -2
package/llm_router.py +80 -92
package/ltcai_cli.py +2 -3
package/package.json +1 -1
package/static/chat.html +5 -6
package/static/scripts/chat.js +34 -36
package/static/workspace.html +1 -1
package/telegram_bot.py +1 -1

package/docs/architecture.md CHANGED

@@ -1,156 +1,148 @@
-# Lattice AI — 아키텍처
-## 전체 구조
-```
-┌─────────────────────────────────────────────────────────┐
-│                    클라이언트 레이어                      │
-│  웹 UI (chat.html)  │  VS Code 확장  │  Telegram 봇     │
-└──────────────────────────┬──────────────────────────────┘
-                           │ HTTP / SSE
-┌──────────────────────────▼──────────────────────────────┐
-│               server.py — FastAPI (port 4825)            │
-│                                                          │
-│  /chat  /agent  /models  /tools/*  /mcp/*  /garden      │
-│  /account  /admin  /auth/sso  /knowledge-graph  /graph   │
-└────┬──────────┬──────────┬──────────┬───────────────────┘
-     │          │          │          │
-     ▼          ▼          ▼          ▼
-llm_router  tools.py  knowledge_  p_reinforce
-  .py               graph.py      .py
-     │
-     ├── MLX (mlx_lm / mlx_vlm)   ← Apple Silicon 로컬
-     ├── OpenAI SDK                ← openai / groq / together / openrouter
-     └── Ollama / vLLM REST        ← 로컬 서버 연동
+# Lattice AI Architecture
+Lattice AI v2.2.0 is a local-first **AI Knowledge OS**. The architecture is
+organized around one durable center: the Knowledge Graph. Models, tools,
+agents, workflows, and UI modes are replaceable layers that operate on top of
+the graph.
+## Architecture Goals
+- Keep user knowledge local-first by default.
+- Treat multimodal input as the normal path, not an add-on.
+- Preserve evidence, decisions, files, artifacts, and work history.
+- Keep models replaceable and policy-governed.
+- Explain risk and source facts instead of hiding capability.
+- Keep basic and advanced modes feature-equivalent.
+- Keep admin-only capabilities explicit and auditable.
+## System View
+```mermaid
+flowchart TD
+    User["User files, screenshots, chats, notes, code, work logs"]
+    Ingestion["Multimodal ingestion"]
+    Extract["Entity, relation, evidence extraction"]
+    Graph["Knowledge Graph"]
+    Context["Graph context builder"]
+    Models["Multimodal model runtime"]
+    Agents["Agent runtime and workflows"]
+    Outputs["Advice, analysis, documents, automation"]
+    Admin["Admin policy and audit"]
+    User --> Ingestion
+    Ingestion --> Extract
+    Extract --> Graph
+    Graph --> Context
+    Context --> Models
+    Models --> Agents
+    Agents --> Outputs
+    Admin --> Models
+    Admin --> Graph
 ```
-## 파일별 역할
-| 파일 | 역할 |
-|------|------|
-| `server.py` | FastAPI 앱, 모든 HTTP 엔드포인트, 인증/세션/CORS/rate limit |
-| `ltcai_cli.py` | CLI 엔트리포인트 (`LTCAI` 명령), `doctor` 서브커맨드, uvicorn 실행 |
-| `llm_router.py` | 로컬(MLX/Ollama) ↔ 클라우드(OpenAI/Groq/…) 라우팅, 스트리밍 SSE |
-| `tools.py` | 에이전트 도구 구현: read_file, edit_file, grep, run_command, todo_write/read, 스크린샷 등 |
-| `knowledge_graph.py` | SQLite 지식 그래프 (노드/엣지/청크), Graph RAG 컨텍스트 주입 |
-| `p_reinforce.py` | P-Reinforce 지식 정원 엔진, `~/.ltcai-brain/` 분류 저장 |
-| `telegram_bot.py` | 로컬 AI Telegram 미러 봇 |
-| `codex_telegram_bot.py` | 클라우드 Codex Telegram 봇 (GPT + GitHub 이슈) |
-| `vscode-extension/` | TypeScript VS Code 확장 |
-| `static/` | 웹 UI HTML (chat, account, admin, graph), PWA manifest/SW |
-| `bin/ltcai.js` | npm CLI 엔트리포인트 (Python 환경 자동 부트스트랩) |
-## 데이터 흐름
-### 채팅 요청
+## Durable Core
-```
-브라우저 → POST /chat
-  → server.py: 인증 확인, rate limit
-  → llm_router.py: 모델 선택 (로컬/클라우드)
-  → knowledge_graph.py: Graph RAG 컨텍스트 조회 + 주입
-  → LLM 스트리밍 응답 (SSE)
-  → knowledge_graph.py: 메시지/응답 인제스트
-```
+The Knowledge Graph stores the durable user and organization memory:
-### 에이전트 요청
+- files and document evidence
+- images and screenshots
+- conversations and notes
+- user decisions
+- work history
+- generated artifacts
+- agent and workflow events
-```
-브라우저/VS Code → POST /agent
-  → server.py: 인증 확인, rate limit (6/분)
-  → llm_router.py: Discover→Plan→Implement→Verify 루프 (max 25스텝)
-  → tools.py: read_file / edit_file / grep / run_command / todo_*
-  → 각 스텝 결과 스트리밍
-```
+The LLM is not the product core. It is an execution worker that can be replaced
+when hardware, policy, or user preference changes.
-### 문서 업로드
+## Multimodal Ingestion
-```
-브라우저 → POST /upload
-  → server.py: magic-number 검증, rate limit (12/분)
-  → tools.py: PDF/DOCX/XLSX/PPTX 파싱
-  → knowledge_graph.py: Chunk/Page/Sheet/Slide 노드 인제스트
-  → blob 저장: ~/.ltcai/knowledge_graph_blobs/
-```
+Lattice AI assumes users will provide source material directly. The expected
+input set includes:
-## 데이터 저장소
+- PDF
+- Word
+- Excel
+- PowerPoint
+- images
+- screenshots
+- chat history
+- notes
+- web content
+- code
+- work logs
-```
-~/.ltcai/
-├── users.json                   # 사용자 계정 (scrypt 해시)
-├── sessions.json                # 세션 토큰 (24h TTL)
-├── chat_history.json            # 채팅 히스토리
-├── knowledge_graph.sqlite       # Graph RAG SQLite DB
-├── knowledge_graph_blobs/       # 원본 업로드 파일
-├── mcp_installs.json            # MCP 서버 설치 목록
-└── todos.json                   # 에이전트 TODO 리스트
-~/.ltcai-brain/
-├── INDEX.md
-├── 00_Raw/
-├── 10_Wiki/
-├── 20_Skills/
-├── 30_Projects/
-└── 40_Log/
-```
-## 인증 흐름
-```
-POST /login (username + password)
-  → scrypt 검증
-  → 세션 토큰 생성 (UUID, 24h TTL)
-  → Set-Cookie: session=<token>; HttpOnly; SameSite=Lax
-모든 민감 엔드포인트:
-  → _require_auth(): 쿠키 검증 → User 반환 또는 401
-```
-SSO (OIDC):
-```
-GET /auth/sso/login → 리디렉션 (Entra ID / Okta)
-GET /auth/sso/callback?code=... → 토큰 교환 → 세션 생성
-```
+The architecture must not ask users to convert these to plain text before AI can
+work on them.
-## MCP 연동
+## Model Runtime Policy
-`/mcp/tools` — 에이전트 도구 카탈로그를 MCP 형식으로 노출
-Claude Desktop / Cursor의 MCP 설정에 `http://localhost:4825/mcp` 추가 시 직접 도구 사용 가능.
+Local recommended models must be multimodal. The v2.2 local runtime policy is:
-자세한 내용: [mcp-tools.md](mcp-tools.md)
+- macOS Apple Silicon: MLX-VLM first
+- Windows: llama.cpp multimodal path, with LM Studio as a user-friendly option
+- Linux: llama.cpp or vLLM multimodal path depending on GPU support
+- Ollama: kept as an option, not the default priority
----
+The removed path is the old text-only MLX-LM recommendation route. Low-spec
+machines use smaller or quantized multimodal models.
-## PPT 명세와의 정렬 (2026-05 추가)
+## Model Source Disclosure
-`lattice_ai_full_spec.pptx` (UI 명세서) 에 맞춰 세 가지 보강 모듈이 추가됐다.
-어떤 슬라이드가 어떤 파일에 매핑되는지 한눈에:
+Model catalog entries carry source disclosure fields:
-| PPT 슬라이드 | 의미 | 구현 파일 |
-|--------------|------|-----------|
-| 14 (세 가지 약속) | Cross-platform · Auto-setup · Graph 원칙 | (전체 아키텍처) |
-| 15·19 (크로스플랫폼·디자인 토큰) | 공유 토큰 = 단일 진실 근원 | [`static/css/tokens.css`](../static/css/tokens.css) |
-| 16·17 (자동 환경 매트릭스·5단계) | OS·HW 감지 → 모델 추천 → 설치 → 검증 → 프리셋 | [`auto_setup.py`](../auto_setup.py) |
-| 20·21·22 (KG 노드·엣지·데이터 모델) | 10 NodeType / 12 EdgeType + embedding + confidence | [`kg_schema.py`](../kg_schema.py), [`docs/kg-schema.md`](kg-schema.md) |
-| 24 (통합 아키텍처) | 6 레이어 (UI / Logic / AI Core / KG / Storage / Auto-Setup) | 이 문서 + 위 파일들 |
+1. `source_country`
+2. `source_company`
+3. `execution_method`
+4. `internet_requirement`
+5. `model_name`
-### 신규 모듈 빠른 참조
+These are first-class model facts, not advanced-only metadata.
-```bash
-# 자동 환경 세팅 5단계
-python3 auto_setup.py probe          # ① 시스템 감지
-python3 auto_setup.py recommend      # ② 모델 추천
-python3 auto_setup.py plan           # ③ 설치 계획 (실행 안 함)
-python3 auto_setup.py plan --apply   # ③ 실제 설치 (위험)
-python3 auto_setup.py verify         # ④ 검증
-python3 auto_setup.py preset         # ⑤ 프리셋
-python3 auto_setup.py all            # 전체 흐름
+## Recommendation Flow
-# KG v2 스키마
-python3 kg_schema.py init  ~/.ltcai/kg_v2.db
-python3 kg_schema.py migrate ~/.ltcai/knowledge_graph.db    # legacy → v2
-python3 kg_schema.py stats ~/.ltcai/knowledge_graph.db
+```text
+hardware scan
+  -> CPU/GPU/RAM/disk/OS analysis
+  -> multimodal model shortlist
+  -> same-family old generation removal
+  -> source disclosure
+  -> recommendation reason
+  -> download/install/load/verify
 ```
-전체 명세 ↔ 구현 매핑은 [`spec-vs-impl.md`](spec-vs-impl.md) 참고.
+The current default recommendation family is Gemma 4. Qwen3-VL and Llama 4
+remain current multimodal alternatives.
+## Modes
+Basic mode and advanced mode have the same feature access.
+- Basic mode uses plain language and source facts.
+- Advanced mode adds execution, memory, quantization, and load/unload detail.
+- Admin mode adds actual authority: user management, permissions, audit logs,
+  organization policy, security policy, sensitive-data monitoring, model approval
+  policy, and Private VPC.
+## Main Modules
+| Module | Responsibility |
+| --- | --- |
+| `latticeai/services/model_catalog.py` | Multimodal model catalog, source metadata, aliases |
+| `latticeai/services/model_recommendation.py` | Hardware-aware multimodal recommendation |
+| `latticeai/services/model_runtime.py` | Download, load, server, and runtime orchestration |
+| `llm_router.py` | MLX-VLM and OpenAI-compatible model routing |
+| `knowledge_graph.py` | Graph storage, extraction, local folder graph RAG |
+| `latticeai/core/context_builder.py` | Graph context for generation |
+| `latticeai/core/workspace_os.py` | Workspace state, timeline, snapshots, memory |
+| `latticeai/core/multi_agent.py` | Planner/executor/reviewer/researcher orchestration |
+| `latticeai/core/workflow_engine.py` | Workflow definitions and run history |
+| `latticeai/core/plugins.py` | Plugin manifest, registry, permission boundary |
+| `latticeai/core/security.py` | Local security primitives |
+## Compatibility
+v2.2.0 preserves the additive Workspace OS and API compatibility posture from
+v2.x. Existing graph/workspace data is migrated non-destructively. The release
+does remove current recommendation entries for old or text-only model paths, but
+it does not destructively mutate existing user graph data.
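
Reviewer sketch (not part of the package): the "Model Runtime Policy" section added above can be read as a per-platform priority list. The function below is a minimal illustration of that reading. The name `runtime_priority`, the runtime labels, the rule "prefer vLLM on Linux when a GPU is present", and the fallback for other platforms are all assumptions made for this example; the diff does not show how `model_recommendation.py` actually encodes the policy.

```python
import platform
from typing import List


def runtime_priority(system: str, machine: str, has_gpu: bool = False) -> List[str]:
    """Return candidate local runtimes, most preferred first (illustrative only)."""
    system = system.lower()
    machine = machine.lower()
    if system == "darwin" and machine in ("arm64", "aarch64"):
        # macOS Apple Silicon: MLX-VLM first, Ollama kept as an option.
        return ["mlx-vlm", "ollama"]
    if system == "windows":
        # Windows: llama.cpp multimodal path, LM Studio as the friendlier option.
        return ["llama.cpp", "lm-studio", "ollama"]
    if system == "linux":
        # Assumption: "depending on GPU support" means vLLM only when a GPU exists.
        return ["vllm", "llama.cpp", "ollama"] if has_gpu else ["llama.cpp", "ollama"]
    # Assumption: platforms the document does not mention fall back to llama.cpp.
    return ["llama.cpp", "ollama"]


if __name__ == "__main__":
    print(runtime_priority(platform.system(), platform.machine()))
```

In every branch Ollama sorts last, which matches the statement that it is "kept as an option, not the default priority".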

package/docs/kg-schema.md CHANGED

@@ -56,7 +56,7 @@ Edge {
   weight       float [0..1]    // 관계의 ‘강도’
   confidence   float [0..1]    // 추출/추론의 ‘신뢰도’
   evidence     string[]        // 근거 (메시지/청크 ID 리스트)
-  created_by   string          // extractor:llm-gemma-3-12b | rule:regex | user
+  created_by   string          // extractor:llm-gemma-4-12b | rule:regex | user
   created_at   ISO8601 UTC
 }
 ```
@@ -106,7 +106,7 @@ Edge {
     "weight":     0.82,
     "confidence": 0.91,
     "evidence":   ["chunk:01HX7K…#p3", "chunk:01HX7K…#p11"],
-    "created_by": "extractor:llm-gemma-3-12b"
+    "created_by": "extractor:llm-gemma-4-12b"
   }
 }
 ```
@@ -197,7 +197,7 @@ store.upsert_edge(Edge(
     type=EdgeType.MENTIONS,
     weight=0.82, confidence=0.91,
     evidence=["chunk:01HX7K…#p3"],
-    created_by="extractor:llm-gemma-3-12b",
+    created_by="extractor:llm-gemma-4-12b",
 ))
 # 이웃 탐색
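
Reviewer sketch (not part of the package): the `Edge` fields touched in these hunks carry range constraints (`weight` and `confidence` in `[0..1]`) and a provenance string in `created_by`. The dataclass below is a hypothetical stand-in that shows one way to enforce those ranges. `EdgeSketch` is an invented name, and the real `Edge` class in `kg_schema.py` is not shown in this diff and may validate differently or not at all.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import List


@dataclass
class EdgeSketch:
    """Hypothetical subset of the documented Edge fields."""

    weight: float       # strength of the relation, expected in [0, 1]
    confidence: float   # extraction/inference confidence, expected in [0, 1]
    evidence: List[str] = field(default_factory=list)
    created_by: str = "user"  # e.g. "extractor:llm-gemma-4-12b", "rule:regex", "user"
    created_at: str = field(
        default_factory=lambda: datetime.now(timezone.utc).isoformat()
    )

    def __post_init__(self) -> None:
        # Reject values outside the documented [0..1] range.
        for name in ("weight", "confidence"):
            value = getattr(self, name)
            if not 0.0 <= value <= 1.0:
                raise ValueError(f"{name} must be within [0, 1], got {value}")


edge = EdgeSketch(
    weight=0.82,
    confidence=0.91,
    evidence=["chunk:example#p3"],  # placeholder ID, not taken from the docs
    created_by="extractor:llm-gemma-4-12b",
)
```

Because `created_by` is a free-form string, edges already stored with `extractor:llm-gemma-3-12b` stay valid; the hunks above change only the documented example value.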

package/docs/public-deploy.md CHANGED

@@ -131,7 +131,6 @@ yourdomain.com {
 openai:gpt-4o-mini
 openai:gpt-4o
 openrouter:openai/gpt-4o-mini
-groq:llama-3.1-8b-instant
-groq:llama-3.3-70b-versatile
-together:meta-llama/Llama-3.3-70B-Instruct-Turbo
+openrouter:qwen/qwen3-vl-235b-a22b-instruct
+together:Qwen/Qwen3-VL-32B-Instruct
 ```

package/knowledge_graph.py CHANGED

@@ -523,7 +523,7 @@ def _extract_concepts_rules(text: str, limit: int = 12) -> List[str]:
     2. Multi-word proper nouns (Lattice AI, GPT-4o, Claude Sonnet)
     3. Single capitalized proper nouns not at sentence start (Claude, Python, FastAPI)
     4. Korean compound technical terms (멀티모달, 에이전트, 그래프RAG)
-    5. Hyphenated / versioned identifiers (gpt-4o, mlx-lm, llama-3.3)
+    5. Hyphenated / versioned identifiers (gpt-4o, mlx-vlm, gemma-4)
     """
     text = str(text or "")
     seen: dict = {}  # concept_lower → original form
@@ -586,7 +586,7 @@ def _extract_concepts_rules(text: str, limit: int = 12) -> List[str]:
             if len(m) >= 3 or cnt >= 2:
                 _add(m)
-    # 6. Hyphenated / versioned identifiers (gpt-4o, llama-3.3, mlx-lm)
+    # 6. Hyphenated / versioned identifiers (gpt-4o, gemma-4, mlx-vlm)
     for m in re.findall(r'\b([a-zA-Z][a-zA-Z0-9]*(?:-[a-zA-Z0-9.]+)+)\b', text):
         if len(m) >= 4:
             _add(m)
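
Reviewer sketch (not part of the package): only comments change in this hunk, so the matching behaviour is the same in 2.1.0 and 2.2.0. The snippet below reuses the regular expression and the `len(m) >= 4` filter from the context lines to show what rule 6 picks up. The wrapper `hyphenated_identifiers` and its case-insensitive de-duplication are assumptions standing in for the `_add` helper, whose body is not shown.

```python
import re
from typing import Dict, List

# Pattern copied verbatim from the context line in the hunk above.
IDENT_RE = re.compile(r'\b([a-zA-Z][a-zA-Z0-9]*(?:-[a-zA-Z0-9.]+)+)\b')


def hyphenated_identifiers(text: str) -> List[str]:
    """Collect hyphenated / versioned identifiers, first spelling wins."""
    seen: Dict[str, str] = {}
    for match in IDENT_RE.findall(text):
        if len(match) >= 4:  # same length filter as the original loop
            seen.setdefault(match.lower(), match)
    return list(seen.values())


sample = "Switched from mlx-lm to mlx-vlm, gemma-4 and gpt-4o; a-b is too short."
print(hyphenated_identifiers(sample))
# ['mlx-lm', 'mlx-vlm', 'gemma-4', 'gpt-4o']
```

Old identifiers such as `mlx-lm` and `llama-3.3` are still extracted from user text; the release only updates the examples named in the docstring and comment.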

package/latticeai/__init__.py CHANGED

@@ -1,3 +1,3 @@
 """Lattice AI - modular server package."""
-__version__ = "2.1.0"
+__version__ = "2.2.0"

package/latticeai/api/models.py CHANGED

@@ -100,9 +100,17 @@ def create_models_router(
             base = {
                 "id": item["id"],
                 "name": item["name"],
+                "model_name": item.get("model_name") or item.get("name"),
                 "tag": item["tag"],
                 "size": item["size"],
                 "display_name": item.get("name") or item.get("id"),
+                "modality": item.get("modality") or "multimodal",
+                "source_country": item.get("source_country"),
+                "source_company": item.get("source_company"),
+                "execution_method": item.get("execution_method"),
+                "run_location": item.get("run_location"),
+                "internet_requirement": item.get("internet_requirement"),
+                "source_display_order": item.get("source_display_order"),
             }
             short_id = str(item["id"]).lower()
             aliases = MODEL_ENGINE_ALIASES.get(short_id) or {}
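
Reviewer sketch (not part of the package): the eight added keys use two different fallback styles. `model_name` and `modality` fall back through `or`, while the source-disclosure keys use a plain `.get()` and can therefore be `None`. The helper below copies those expressions into a standalone function to make the difference visible. `disclosure_fields` and the sample catalog entry are hypothetical.

```python
from typing import Any, Dict


def disclosure_fields(item: Dict[str, Any]) -> Dict[str, Any]:
    """Mirror the key expressions added to `base` in the hunk above."""
    return {
        "model_name": item.get("model_name") or item.get("name"),
        "modality": item.get("modality") or "multimodal",
        "source_country": item.get("source_country"),
        "source_company": item.get("source_company"),
        "execution_method": item.get("execution_method"),
        "run_location": item.get("run_location"),
        "internet_requirement": item.get("internet_requirement"),
        "source_display_order": item.get("source_display_order"),
    }


# Hypothetical catalog entry that predates the new disclosure fields.
legacy_entry = {"id": "example", "name": "Example Model", "modality": ""}
fields = disclosure_fields(legacy_entry)
```

Two consequences follow from the expressions as written: an entry with a missing or empty `modality` is reported as `"multimodal"`, and API clients should expect `null` for any source field the catalog does not fill in.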

package/latticeai/core/config.py CHANGED

@@ -131,7 +131,7 @@ class Config:
         admin_emails = [item.strip().lower() for item in _value(env, "LATTICEAI_ADMIN_EMAILS", "").split(",") if item.strip()]
         public_model = _value(env, "LATTICEAI_PUBLIC_MODEL", _value(env, "LATTICEAI_DEFAULT_MODEL", "openai:gpt-4o-mini"))
-        local_model = _value(env, "LATTICEAI_LOCAL_MODEL", "mlx-community/gemma-4-26b-a4b-it-4bit")
+        local_model = _value(env, "LATTICEAI_LOCAL_MODEL", "mlx-community/gemma-4-12b-it-4bit")
         data_dir = Path(_value(env, "LATTICEAI_DATA_DIR", str(Path.home() / ".ltcai")))
         static_dir = Path(_value(env, "LATTICEAI_STATIC_DIR", str(base_dir / "static")))
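
Reviewer sketch (not part of the package): the changed default only matters when `LATTICEAI_LOCAL_MODEL` is unset. The snippet below illustrates that, together with the `LATTICEAI_ADMIN_EMAILS` parsing visible in the context line. The `_value` helper here is a guess at the real one (its body is not in the diff); treating empty strings as "unset" is an assumption.

```python
from typing import List, Mapping


def _value(env: Mapping[str, str], key: str, default: str) -> str:
    """Hypothetical stand-in: return env[key], or default when missing/empty."""
    raw = env.get(key)
    return raw if raw not in (None, "") else default


def local_model(env: Mapping[str, str]) -> str:
    # Default copied from the 2.2.0 side of the hunk above.
    return _value(env, "LATTICEAI_LOCAL_MODEL", "mlx-community/gemma-4-12b-it-4bit")


def admin_emails(env: Mapping[str, str]) -> List[str]:
    # Same comprehension as the context line: split, trim, lowercase, drop blanks.
    raw = _value(env, "LATTICEAI_ADMIN_EMAILS", "")
    return [item.strip().lower() for item in raw.split(",") if item.strip()]
```

Deployments that pin `LATTICEAI_LOCAL_MODEL` explicitly, including to the previous 26b default, would keep their chosen model under this reading.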

package/latticeai/core/graph_curator.py CHANGED

@@ -231,9 +231,9 @@ def extract_topic_candidates(
 DEFAULT_ALIAS_GROUPS: List[List[str]] = [
     ["lattice ai", "latticeai", "래티스 ai", "래티스ai", "내 앱", "내 ai"],
-    ["gpt-oss", "gpt oss", "openai gpt-oss"],
+    ["gemma-4", "gemma 4", "google gemma"],
     ["gemma 4", "gemma4", "google gemma 4"],
-    ["llama 3", "llama3", "meta llama 3"],
+    ["llama 4", "llama4", "meta llama 4", "llama scout"],
 ]
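
Reviewer sketch (not part of the package): after this change the alias `"gemma 4"` appears in two groups, the replaced first Gemma group and the unchanged second one. The check below finds aliases shared between groups. The group list is copied from the 2.2.0 side of the hunk; `overlapping_aliases` is a hypothetical helper, and how the curator itself resolves such overlaps is not visible in this diff.

```python
from collections import defaultdict
from typing import Dict, List

# Copied from the 2.2.0 side of the hunk above.
DEFAULT_ALIAS_GROUPS: List[List[str]] = [
    ["lattice ai", "latticeai", "래티스 ai", "래티스ai", "내 앱", "내 ai"],
    ["gemma-4", "gemma 4", "google gemma"],
    ["gemma 4", "gemma4", "google gemma 4"],
    ["llama 4", "llama4", "meta llama 4", "llama scout"],
]


def overlapping_aliases(groups: List[List[str]]) -> Dict[str, List[int]]:
    """Map each alias that occurs in more than one group to its group indexes."""
    owners: Dict[str, List[int]] = defaultdict(list)
    for index, group in enumerate(groups):
        for alias in group:
            key = alias.strip().lower()
            if index not in owners[key]:
                owners[key].append(index)
    return {alias: idx for alias, idx in owners.items() if len(idx) > 1}


print(overlapping_aliases(DEFAULT_ALIAS_GROUPS))
# {'gemma 4': [1, 2]}
```

If the curator treats each group as one canonical topic, merging the two Gemma groups into a single list may be the intended end state.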

package/latticeai/core/marketplace.py CHANGED

@@ -11,7 +11,7 @@ from copy import deepcopy
 from typing import Any, Dict, List, Optional
-MARKETPLACE_VERSION = "2.1.0"
+MARKETPLACE_VERSION = "2.2.0"
 TEMPLATE_KINDS = ("plugin", "workflow", "agent")
@@ -33,7 +33,7 @@ BUILTIN_TEMPLATES: Dict[str, List[Dict[str, Any]]] = {
                     "id": "plugin-review-action",
                     "name": "Plugin Review Action",
                     "version": "1.0.0",
-                    "lattice_version": ">=2.1.0",
+                    "lattice_version": ">=2.2.0",
                     "permissions": ["read_workspace", "run_skills"],
                     "provides": {"skills": ["review_action"]},
                 }
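
Reviewer sketch (not part of the package): raising `lattice_version` to `">=2.2.0"` means a host still on 2.1.0 would no longer satisfy the template's stated requirement. The comparison below shows that with plain integer tuples. Both function names are hypothetical, only the `>=` operator is handled, and whether the marketplace actually enforces this field is not shown in the diff.

```python
from typing import Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn 'MAJOR.MINOR.PATCH' into a tuple of ints for ordered comparison."""
    return tuple(int(part) for part in text.strip().split("."))


def satisfies_minimum(host_version: str, constraint: str) -> bool:
    """Check a '>=X.Y.Z' constraint; other operators are out of scope here."""
    if not constraint.startswith(">="):
        raise ValueError(f"unsupported constraint: {constraint!r}")
    return parse_version(host_version) >= parse_version(constraint[2:])


print(satisfies_minimum("2.2.0", ">=2.2.0"))  # True
print(satisfies_minimum("2.1.0", ">=2.2.0"))  # False
```

Comparing integer tuples rather than strings keeps `2.10.0` ordered after `2.2.0`.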

package/latticeai/core/model_compat.py CHANGED

@@ -25,18 +25,13 @@ logger = logging.getLogger(__name__)
 # ── Model family detection ────────────────────────────────────────────────────
 FAMILY_PATTERNS: List[Tuple[str, re.Pattern]] = [
-    ("gpt-oss", re.compile(r"gpt[-_]?oss", re.I)),
     ("gemma", re.compile(r"gemma", re.I)),
     ("qwen", re.compile(r"qwen", re.I)),
     ("llama", re.compile(r"\bllama|meta[-_]?llama", re.I)),
-    ("mistral", re.compile(r"mistral|mixtral", re.I)),
-    ("phi", re.compile(r"\bphi[-_]?\d", re.I)),
-    ("deepseek", re.compile(r"deepseek", re.I)),
-    ("yi", re.compile(r"\byi[-_]?\d", re.I)),
     ("claude", re.compile(r"claude", re.I)),
-    ("gpt-4", re.compile(r"gpt[-_]?4", re.I)),
-    ("gpt-3.5", re.compile(r"gpt[-_]?3\.?5", re.I)),
-    ("o1", re.compile(r"\bo1[-_]?", re.I)),
+    ("gpt", re.compile(r"gpt[-_]?(?:4|5)|openai", re.I)),
+    ("gemini", re.compile(r"gemini", re.I)),
+    ("grok", re.compile(r"grok|x[-_]?ai", re.I)),
 ]
@@ -59,20 +54,6 @@ def detect_model_family(model_id: str) -> str:
 DEFAULT_STOP = ["<|im_end|>", "<|endoftext|>", "</s>", "<|user|>", "<|assistant|>"]
 FAMILY_PROFILES: Dict[str, Dict[str, Any]] = {
-    "gpt-oss": {
-        "family": "gpt-oss",
-        "supports_system": True,
-        "supports_vision": False,
-        "chat_template": "gpt_oss",
-        "preferred_engines": ["ollama", "llamacpp", "vllm", "local_mlx"],
-        "temperature": 0.1,
-        "top_p": 0.9,
-        "max_tokens": 2048,
-        "stop_sequences": ["<|im_end|>", "<|end|>", "</s>", "<|user|>", "<|assistant|>"],
-        "disable_draft": True,
-        # trim_after_user_marker는 <|user|>가 살아있어야 동작하므로 strip_role_tokens보다 먼저 실행.
-        "postprocess": ["trim_after_user_marker", "strip_role_tokens"],
-    },
     "gemma": {
         "family": "gemma",
         "supports_system": True,
@@ -89,7 +70,7 @@ FAMILY_PROFILES: Dict[str, Dict[str, Any]] = {
     "qwen": {
         "family": "qwen",
         "supports_system": True,
-        "supports_vision": False,
+        "supports_vision": True,
         "chat_template": "qwen_chatml",
         "preferred_engines": ["ollama", "local_mlx", "vllm"],
         "temperature": 0.2,
@@ -102,7 +83,7 @@ FAMILY_PROFILES: Dict[str, Dict[str, Any]] = {
     "llama": {
         "family": "llama",
         "supports_system": True,
-        "supports_vision": False,
+        "supports_vision": True,
         "chat_template": "tokenizer_default",
         "preferred_engines": ["ollama", "local_mlx", "llamacpp", "vllm"],
         "temperature": 0.2,
@@ -112,45 +93,6 @@ FAMILY_PROFILES: Dict[str, Dict[str, Any]] = {
         "disable_draft": False,
         "postprocess": ["strip_role_tokens"],
     },
-    "mistral": {
-        "family": "mistral",
-        "supports_system": False,
-        "supports_vision": False,
-        "chat_template": "tokenizer_default",
-        "preferred_engines": ["ollama", "local_mlx", "llamacpp"],
-        "temperature": 0.2,
-        "top_p": 0.9,
-        "max_tokens": 4096,
-        "stop_sequences": ["</s>", "[INST]", "[/INST]"],
-        "disable_draft": False,
-        "postprocess": ["strip_role_tokens"],
-    },
-    "phi": {
-        "family": "phi",
-        "supports_system": True,
-        "supports_vision": False,
-        "chat_template": "tokenizer_default",
-        "preferred_engines": ["ollama", "local_mlx"],
-        "temperature": 0.2,
-        "top_p": 0.9,
-        "max_tokens": 2048,
-        "stop_sequences": ["<|end|>", "<|endoftext|>"],
-        "disable_draft": False,
-        "postprocess": ["strip_role_tokens"],
-    },
-    "deepseek": {
-        "family": "deepseek",
-        "supports_system": True,
-        "supports_vision": False,
-        "chat_template": "tokenizer_default",
-        "preferred_engines": ["ollama", "local_mlx", "vllm"],
-        "temperature": 0.2,
-        "top_p": 0.9,
-        "max_tokens": 4096,
-        "stop_sequences": ["<|EOT|>", "</s>"],
-        "disable_draft": False,
-        "postprocess": ["strip_role_tokens"],
-    },
     "unknown": {
         "family": "unknown",
         "supports_system": True,
@@ -316,6 +258,7 @@ class CompatProfile:
     engine: Optional[str]
     family: str
     template: str
+    supports_vision: bool
     stop: List[str]
     temperature: float
     top_p: float
@@ -362,6 +305,7 @@ def ensure_profile(model_id: str, engine: Optional[str] = None) -> CompatProfile
         engine=(engine or "").strip().lower() or None,
         family=base["family"],
         template=base["chat_template"],
+        supports_vision=bool(base.get("supports_vision", False)),
         stop=list(base["stop_sequences"]),
         temperature=float(base["temperature"]),
         top_p=float(base["top_p"]),
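
Reviewer sketch (not part of the package): with the `gpt-oss`, `mistral`, `phi`, `deepseek`, `yi`, `gpt-3.5`, and `o1` patterns removed, model IDs from those families no longer match a dedicated entry. The snippet below copies the 2.2.0 pattern list and applies a first-match scan. The body of `detect_model_family` is not shown in the diff, so `detect_family_sketch` and its first-match-then-`"unknown"` behaviour are assumptions.

```python
import re
from typing import List, Tuple

# Pattern list as it stands after the 2.2.0 change.
FAMILY_PATTERNS: List[Tuple[str, re.Pattern]] = [
    ("gemma", re.compile(r"gemma", re.I)),
    ("qwen", re.compile(r"qwen", re.I)),
    ("llama", re.compile(r"\bllama|meta[-_]?llama", re.I)),
    ("claude", re.compile(r"claude", re.I)),
    ("gpt", re.compile(r"gpt[-_]?(?:4|5)|openai", re.I)),
    ("gemini", re.compile(r"gemini", re.I)),
    ("grok", re.compile(r"grok|x[-_]?ai", re.I)),
]


def detect_family_sketch(model_id: str) -> str:
    """Return the first family whose pattern matches, else 'unknown' (assumed)."""
    for family, pattern in FAMILY_PATTERNS:
        if pattern.search(model_id or ""):
            return family
    return "unknown"


for model_id in (
    "mlx-community/gemma-4-12b-it-4bit",
    "Qwen/Qwen3-VL-32B-Instruct",
    "openai:gpt-4o-mini",
    "gpt-oss-20b",
):
    print(model_id, "->", detect_family_sketch(model_id))
```

Under this reading `gpt-oss-20b` resolves to `"unknown"`, because the new `gpt` pattern requires a `4` or `5` after the prefix or the literal `openai`. Such models would then pick up the `unknown` profile rather than the removed `gpt-oss` settings.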

package/latticeai/core/model_resolution.py CHANGED

@@ -120,7 +120,7 @@ class ModelResolution:
         if not provider:
             provider = engine_hint or "local_mlx"
-        # alias 테이블 (예: {"gpt-oss-20b": {"local_mlx": "mlx-community/...","ollama":"gpt-oss:20b"}})
+        # alias 테이블 (예: {"gemma-4-12b-it-4bit": {"local_mlx": "mlx-community/...", "ollama": "hf.co/..."}})
         resolved_model = model_name
         if engine_aliases:
             aliases = engine_aliases.get(model_name.lower())
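
Reviewer sketch (not part of the package): the comment describes an alias table keyed by a lowercase short model name, with one entry per engine. The lookup below shows how such a table could be consulted. `resolve_alias`, the fallback to the unmodified name, and every identifier in `EXAMPLE_ALIASES` are hypothetical placeholders; the real alias values are elided in the comment and are not reproduced here.

```python
from typing import Dict, Optional

AliasTable = Dict[str, Dict[str, str]]

# Placeholder table in the shape the comment describes; values are invented.
EXAMPLE_ALIASES: AliasTable = {
    "example-model-4bit": {
        "local_mlx": "example-org/example-model-4bit",
        "ollama": "example-model:latest",
    }
}


def resolve_alias(
    model_name: str, provider: str, engine_aliases: Optional[AliasTable]
) -> str:
    """Return the engine-specific ID when an alias exists, else the name as given."""
    aliases = (engine_aliases or {}).get(model_name.lower())
    if aliases and provider in aliases:
        return aliases[provider]
    return model_name


print(resolve_alias("Example-Model-4bit", "ollama", EXAMPLE_ALIASES))
# example-model:latest
```

The `.lower()` call on the lookup key matches the context line above, so table keys need to be stored in lowercase for a hit.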

package/latticeai/core/multi_agent.py CHANGED

@@ -14,7 +14,7 @@ from datetime import datetime
 from typing import Any, Callable, Dict, List, Optional
-MULTI_AGENT_VERSION = "2.1.0"
+MULTI_AGENT_VERSION = "2.2.0"
 AGENT_ROLES = ("researcher", "planner", "executor", "reviewer", "release")
 CORE_PIPELINE = ("planner", "executor", "reviewer")

package/latticeai/core/plugins.py CHANGED

@@ -30,7 +30,7 @@ from pathlib import Path
 from typing import Any, Callable, Dict, List, Optional, Tuple
-PLUGIN_SDK_VERSION = "2.1.0"
+PLUGIN_SDK_VERSION = "2.2.0"
 # Capability-style permissions a plugin can request. Kept deliberately small so
 # the Enterprise seam can layer finer-grained policy on top without changing the

package/latticeai/core/realtime.py CHANGED

@@ -32,7 +32,7 @@ from datetime import datetime
 from typing import Any, AsyncIterator, Dict, List, Optional, Set
-REALTIME_VERSION = "2.1.0"
+REALTIME_VERSION = "2.2.0"
 _FEED_LIMIT = 200
 _QUEUE_MAX = 100

package/latticeai/core/workflow_engine.py CHANGED

@@ -28,7 +28,7 @@ from datetime import datetime
 from typing import Any, Callable, Dict, List, Optional
-WORKFLOW_ENGINE_VERSION = "2.1.0"
+WORKFLOW_ENGINE_VERSION = "2.2.0"
 # The node vocabulary a workflow can be built from. ``trigger`` and ``output``
 # are structural; the rest dispatch to an injected runner of the same family.

package/latticeai/core/workspace_os.py CHANGED

@@ -18,7 +18,7 @@ from pathlib import Path
 from typing import Any, Callable, Dict, Iterable, List, Optional
-WORKSPACE_OS_VERSION = "2.1.0"
+WORKSPACE_OS_VERSION = "2.2.0"
 # Workspace types separate single-user Personal workspaces from shared
 # Organization workspaces. Both keep the same local-first JSON store; the type

package/latticeai/server_app.py CHANGED

@@ -1,6 +1,6 @@
 """
 Lattice AI MLX — Local LLM Bridge Server
-Apple Silicon (M1-M5) 전용 | mlx-lm 기반
+Apple Silicon (M1-M5) 전용 | MLX-VLM 기반
 """
 import asyncio