npm - @humanclaw/humanclaw - Versions diffs - 1.2.8 → 2.0.1 - Mend

@humanclaw/humanclaw 1.2.8 → 2.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5) hide show

package/README.md CHANGED Viewed

@@ -21,7 +21,7 @@
 HumanClaw 是一个碳基节点编排框架。系统将真实人类抽象为 Agent（碳基节点），将现实中的任务派发与结果收集抽象为进程的**挂起（Suspend）**与**恢复（Resume）**。
-核心流程：输入自然语言需求 → 选人 → AI 自动规划（拆任务 + 生成话术 + 设 DDL）→ 确认分发 → 收交付物 → AI 聚合审查。
+核心流程：输入自然语言需求 → 选人/选团队 → AI 自动规划（拆任务 + 生成话术 + 设 DDL）→ 确认分发 → 收交付物 → AI 聚合审查 → 绩效评价。
 ## 核心架构
@@ -31,19 +31,20 @@ HumanClaw 是一个碳基节点编排框架。系统将真实人类抽象为 Age
 │   Master    │     trace_id      │  HumanAgent  │
 │  (老板/PM)  │ ◄──────────────   │  (碳基算力)   │
 │             │   Resume + Result │              │
-└─────┬───────┘                   └──────────────┘
-      │
-      │  AI Review
-      ▼
-┌─────────────┐
-│  LLM 审查   │
-│ (Claude/GPT)│
-└─────────────┘
+└─────┬───────┘                   └──────┬───────┘
+      │                                  │
+      │  AI Review + Eval           Team Context
+      ▼                                  ▼
+┌─────────────┐                   ┌──────────────┐
+│  LLM 审查   │                   │   团队管理    │
+│ + 绩效评价  │                   │ (关系 & 权重) │
+└─────────────┘                   └──────────────┘
 ```
 - **Master 节点**：输入需求，AI 自动拆解为独立子任务，分发给碳基节点
 - **Worker 节点 (HumanAgent)**：接收带 `trace_id` 的独立任务，在碳基世界异步执行
-- **AI 审查**：所有任务完成后，LLM 自动审查交付质量并生成报告
+- **团队管理**：碳基节点按团队组织，每个团队有独立的关系上下文
+- **AI 审查 + 绩效评价**：所有任务完成后，LLM 审查交付质量并生成报告和绩效评分
 ## 快速开始
@@ -83,76 +84,98 @@ humanclaw agent list
 ## API 接口
+### 碳基节点
 | 方法 | 路径 | 说明 |
 |------|------|------|
 | `GET` | `/api/v1/nodes/status` | 碳基算力池状态 |
 | `POST` | `/api/v1/nodes` | 注册碳基节点 |
+| `GET` | `/api/v1/nodes/:id` | 获取节点详情（含团队） |
 | `PATCH` | `/api/v1/nodes/:id/status` | 更新节点状态 |
-| `POST` | `/api/v1/jobs/plan` | AI 智能规划（不分发） |
+| `DELETE` | `/api/v1/nodes/:id` | 删除节点 |
+### 任务编排
+| 方法 | 路径 | 说明 |
+|------|------|------|
+| `POST` | `/api/v1/jobs/plan` | AI 智能规划（支持 `team_id`） |
 | `POST` | `/api/v1/jobs/create` | 创建并分发任务 |
 | `GET` | `/api/v1/jobs/active` | 获取看板数据 |
 | `POST` | `/api/v1/tasks/resume` | 提交交付物，触发恢复 |
 | `POST` | `/api/v1/tasks/reject` | 打回重做 |
 | `POST` | `/api/v1/tasks/simulate` | AI 模拟交付（角色扮演） |
-| `POST` | `/api/v1/jobs/:id/review` | AI 聚合审查交付质量 |
-| `GET` | `/api/v1/config` | 获取 LLM 配置 |
-| `PUT` | `/api/v1/config` | 更新 LLM 配置 |
+| `POST` | `/api/v1/jobs/:id/review` | AI 聚合审查（支持评分体系） |
-### AI 规划示例
+### 团队管理
-```bash
-curl -X POST http://localhost:2026/api/v1/jobs/plan \
-  -H "Content-Type: application/json" \
-  -d '{ "prompt": "完成首页重构" }'
-```
+| 方法 | 路径 | 说明 |
+|------|------|------|
+| `GET` | `/api/v1/teams` | 团队列表（含成员） |
+| `GET` | `/api/v1/teams/:id` | 团队详情 |
+| `POST` | `/api/v1/teams` | 创建团队 |
+| `DELETE` | `/api/v1/teams/:id` | 删除团队 |
+| `POST` | `/api/v1/teams/:id/members` | 添加成员 |
+| `DELETE` | `/api/v1/teams/:id/members/:agent_id` | 移除成员 |
+| `PUT` | `/api/v1/teams/:id/members/:agent_id` | 更新成员团队关系 |
-### 提交交付物
+### 绩效评价
-```bash
-curl -X POST http://localhost:2026/api/v1/tasks/resume \
-  -H "Content-Type: application/json" \
-  -d '{
-    "trace_id": "TK-9527",
-    "result_data": { "text": "https://github.com/org/repo/pull/42" }
-  }'
-```
+| 方法 | 路径 | 说明 |
+|------|------|------|
+| `POST` | `/api/v1/evaluations/generate` | 生成绩效评价 |
+| `GET` | `/api/v1/evaluations/job/:job_id` | 按 Job 查询评价 |
+| `GET` | `/api/v1/evaluations/agent/:agent_id` | 按 Agent 查询评价历史 |
+| `GET` | `/api/v1/evaluations/dashboard` | 绩效看板 |
+### LLM 配置
+| 方法 | 路径 | 说明 |
+|------|------|------|
+| `GET` | `/api/v1/config` | 获取 LLM 配置 |
+| `PUT` | `/api/v1/config` | 更新 LLM 配置 |
 ## Dashboard 看板
 Web 看板包含三个核心视图：
-- **碳基算力池** — 实时查看碳基节点状态（🟢空闲 🟡忙碌 🔴离线 🟣崩溃），一键添加/删除节点
-- **碳基编排大盘** — AI 智能规划 + 任务看板 + 可交互任务卡片（点击直接提交交付/打回）+ 模拟交付 + AI 聚合审查
+- **碳基算力池** — 实时查看碳基节点状态（🟢空闲 🟡忙碌 🔴离线 🟣崩溃），团队管理，一键添加/删除节点
+- **碳基编排大盘** — AI 智能规划（可按团队）+ 任务看板 + 模拟交付 + AI 聚合审查 + 绩效评价
 - **I/O 交付终端** — 输入 trace_id 和交付载荷，触发系统恢复
 ### AI 功能
-- **智能规划** — 输入需求，AI 自动拆任务、匹配碳基节点、生成布置话术、设 DDL（可调）
+- **智能规划** — 输入需求，AI 自动拆任务、匹配碳基节点、生成布置话术、设 DDL（支持按团队规划，注入团队关系上下文）
 - **模拟交付** — 点击按钮，AI 以碳基节点视角角色扮演，根据身份、技能、关系生成模拟交付物
 - **聚合审查** — 全部交付后，AI 审查每个交付物质量（支持 GitHub PR/Commit/Issue URL），生成评分报告
-- **可配置 LLM** — 支持 Claude / OpenAI，可自定义 Base URL 接入私有模型服务（vLLM / Ollama / Azure）
+- **绩效评价** — 支持三种评分体系（阿里 3.75 / SABCD / EM），AI 生成按人按任务的绩效评分和评语
+- **可配置 LLM** — 支持 3 种 API 格式（Anthropic Messages / OpenAI Chat Completions / OpenAI Responses），可自定义 Base URL 接入私有模型服务
+### 可伸缩编辑器
+所有文本编辑区域（任务交付、审查结果、规划话术）均支持拖拽调整大小和全屏展开。
 ### Demo 场景
-Dashboard 内置三个开箱即用的 Demo 场景，一键加载即可体验：
+Dashboard 内置三个开箱即用的 Demo 场景，一键加载碳基节点和团队：
-- **三国蜀汉** 🐉 — 你是刘备，底下有关羽、张飞、赵云、诸葛亮等七员大将
-- **互联网大厂** 💻 — 你是技术总监，管理前端、后端、算法、产品、设计、测试、运维团队
+- **三国蜀汉** 🐉 — 你是刘备，麾下关羽、张飞、赵云、诸葛亮等七员文臣武将
+- **互联网大厂** 💻 — 你是技术总监，管理前端、后端、算法、产品、设计、测试、运维
 - **美国政府** 🇺🇸 — 你是特朗普，指挥 Musk、Rubio、Bessent 等核心内阁
 ## 核心工作流
-1. **镜像封装** — 录入碳基成员信息，构建碳基算力池
-2. **AI 规划** — 输入需求，AI 拆解任务、匹配节点、生成话术和 DDL
+1. **镜像封装** — 录入碳基成员信息，建立团队，构建碳基算力池
+2. **AI 规划** — 输入需求，选择团队/节点，AI 拆解任务、生成话术和 DDL
 3. **确认分发** — 预览规划结果，调整 DDL，确认后一键分发
 4. **异步恢复** — 碳基节点提交交付物（支持 GitHub URL），系统唤醒 Job
 5. **AI 审查** — 所有子任务完成后，LLM 审查交付质量并生成报告
+6. **绩效评价** — 选择评分体系，AI 生成每个碳基节点的绩效评分
 ## 环境变量
 | 变量 | 默认值 | 说明 |
 |------|--------|------|
-| `HUMANCLAW_LLM_PROVIDER` | `claude` | LLM 提供商：`claude` 或 `openai` |
+| `HUMANCLAW_LLM_PROVIDER` | `anthropic` | API 格式：`anthropic` / `openai` / `responses` |
 | `HUMANCLAW_LLM_API_KEY` | - | LLM API Key（使用 AI 功能时必填） |
 | `HUMANCLAW_LLM_MODEL` | 按 provider | 可选覆盖模型名 |
 | `HUMANCLAW_LLM_BASE_URL` | 官方地址 | 自定义 API 地址（私有部署） |
@@ -170,14 +193,21 @@ interface HumanAgent {
   status: AgentStatus;    // IDLE | BUSY | OFFLINE | OOM
 }
-interface HumanTask {
-  trace_id: string;       // TK-9527
-  job_id: string;
-  assignee_id: string;
-  todo_description: string;
-  deadline: string;
-  status: TaskStatus;     // PENDING | DISPATCHED | RESOLVED | OVERDUE
-  result_data: unknown;
+interface Team {
+  team_id: string;        // team_xxxxxxxx
+  name: string;           // "前端组"
+  description: string;
+  members: TeamMember[];  // 含团队关系
+}
+interface Evaluation {
+  eval_id: string;
+  agent_id: string;
+  trace_id: string;
+  rating_system: 'ali' | 'letter' | 'em';
+  rating: string;         // "3.75" / "A" / "EM+"
+  weight: number;
+  comment: string;
 }
 ```
@@ -197,10 +227,10 @@ npm run lint       # 类型检查
 - **Runtime**: Node.js 22+, TypeScript (ESM, strict)
 - **API**: Express v5
 - **Storage**: SQLite (better-sqlite3, WAL mode)
-- **LLM**: Claude / OpenAI（原生 fetch，零依赖）
+- **LLM**: 3 种 API 格式（Anthropic / OpenAI / Responses），原生 fetch，零依赖
 - **CLI**: Commander.js + @clack/prompts
 - **Dashboard**: 内联 HTML（无需构建）
-- **Testing**: Vitest (40 tests)
+- **Testing**: Vitest (68 tests)
 ## License

package/README_EN.md CHANGED Viewed

@@ -21,7 +21,7 @@
 HumanClaw is a carbon-based node orchestration framework. The system abstracts real humans as Agents (carbon-based nodes), models task dispatch and result collection as process **Suspend** and **Resume**.
-Core flow: natural language input → select people → AI auto-plans (breaks down tasks + generates briefings + sets deadlines) → confirm dispatch → collect deliverables → AI aggregated review.
+Core flow: natural language input → select people/team → AI auto-plans (breaks down tasks + generates briefings + sets deadlines) → confirm dispatch → collect deliverables → AI aggregated review → performance evaluation.
 ## Core Architecture
@@ -31,19 +31,20 @@ Core flow: natural language input → select people → AI auto-plans (breaks do
 │   Master    │     trace_id      │  HumanAgent  │
 │  (Boss/PM)  │ ◄──────────────   │  (Carbon CPU)│
 │             │   Resume + Result │              │
-└─────┬───────┘                   └──────────────┘
-      │
-      │  AI Review
-      ▼
-┌─────────────┐
-│  LLM Review │
-│ (Claude/GPT)│
-└─────────────┘
+└─────┬───────┘                   └──────┬───────┘
+      │                                  │
+      │  AI Review + Eval           Team Context
+      ▼                                  ▼
+┌─────────────┐                   ┌──────────────┐
+│  LLM Review │                   │     Team     │
+│ + Perf Eval │                   │  Management  │
+└─────────────┘                   └──────────────┘
 ```
 - **Master Node**: Input requirements, AI auto-breaks them into independent sub-tasks, dispatches to carbon-based nodes
 - **Worker Node (HumanAgent)**: Receives independent tasks with a `trace_id`, executes asynchronously in the carbon-based world
-- **AI Review**: After all tasks complete, LLM reviews deliverable quality and generates a report
+- **Team Management**: Carbon-based nodes organized into teams, each team with its own relationship context
+- **AI Review + Performance Evaluation**: After all tasks complete, LLM reviews deliverable quality and generates reports with performance ratings
 ## Quick Start
@@ -83,76 +84,98 @@ humanclaw agent list
 ## API Endpoints
+### Carbon-Based Nodes
 | Method | Path | Description |
 |--------|------|-------------|
 | `GET` | `/api/v1/nodes/status` | Carbon compute pool status |
 | `POST` | `/api/v1/nodes` | Register carbon-based node |
+| `GET` | `/api/v1/nodes/:id` | Get node details (with teams) |
 | `PATCH` | `/api/v1/nodes/:id/status` | Update node status |
-| `POST` | `/api/v1/jobs/plan` | AI task planning (does not dispatch) |
+| `DELETE` | `/api/v1/nodes/:id` | Delete node |
+### Task Orchestration
+| Method | Path | Description |
+|--------|------|-------------|
+| `POST` | `/api/v1/jobs/plan` | AI smart planning (supports `team_id`) |
 | `POST` | `/api/v1/jobs/create` | Create and dispatch job |
-| `GET` | `/api/v1/jobs/active` | Get active jobs data |
+| `GET` | `/api/v1/jobs/active` | Get dashboard data |
 | `POST` | `/api/v1/tasks/resume` | Submit deliverable, trigger resume |
-| `POST` | `/api/v1/tasks/reject` | Reject and retry |
+| `POST` | `/api/v1/tasks/reject` | Reject and redo |
 | `POST` | `/api/v1/tasks/simulate` | AI simulate delivery (role-play) |
-| `POST` | `/api/v1/jobs/:id/review` | AI aggregated review of deliverables |
-| `GET` | `/api/v1/config` | Get LLM configuration |
-| `PUT` | `/api/v1/config` | Update LLM configuration |
+| `POST` | `/api/v1/jobs/:id/review` | AI aggregated review (supports rating system) |
-### AI Planning Example
+### Team Management
-```bash
-curl -X POST http://localhost:2026/api/v1/jobs/plan \
-  -H "Content-Type: application/json" \
-  -d '{ "prompt": "Rebuild the homepage" }'
-```
+| Method | Path | Description |
+|--------|------|-------------|
+| `GET` | `/api/v1/teams` | Team list (with members) |
+| `GET` | `/api/v1/teams/:id` | Team details |
+| `POST` | `/api/v1/teams` | Create team |
+| `DELETE` | `/api/v1/teams/:id` | Delete team |
+| `POST` | `/api/v1/teams/:id/members` | Add member |
+| `DELETE` | `/api/v1/teams/:id/members/:agent_id` | Remove member |
+| `PUT` | `/api/v1/teams/:id/members/:agent_id` | Update member team relationship |
-### Submit Deliverable
+### Performance Evaluation
-```bash
-curl -X POST http://localhost:2026/api/v1/tasks/resume \
-  -H "Content-Type: application/json" \
-  -d '{
-    "trace_id": "TK-9527",
-    "result_data": { "text": "https://github.com/org/repo/pull/42" }
-  }'
-```
+| Method | Path | Description |
+|--------|------|-------------|
+| `POST` | `/api/v1/evaluations/generate` | Generate performance evaluations |
+| `GET` | `/api/v1/evaluations/job/:job_id` | Query evaluations by Job |
+| `GET` | `/api/v1/evaluations/agent/:agent_id` | Query evaluation history by Agent |
+| `GET` | `/api/v1/evaluations/dashboard` | Performance dashboard |
+### LLM Configuration
+| Method | Path | Description |
+|--------|------|-------------|
+| `GET` | `/api/v1/config` | Get LLM configuration |
+| `PUT` | `/api/v1/config` | Update LLM configuration |
 ## Dashboard
 The web dashboard includes three core views:
-- **Carbon Compute Pool** — Real-time carbon-based node status (🟢Idle 🟡Busy 🔴Offline 🟣OOM), add/remove nodes
-- **Carbon Orchestration Pipeline** — AI planning + task board + interactive task cards (click to submit/reject) + simulate delivery + AI review
+- **Carbon Compute Pool** — Real-time carbon-based node status (🟢Idle 🟡Busy 🔴Offline 🟣OOM), team management, add/remove nodes
+- **Carbon Orchestration Pipeline** — AI smart planning (by team) + task board + simulate delivery + AI aggregated review + performance evaluation
 - **I/O Resolution Terminal** — Input trace_id and payload to trigger system resume
 ### AI Features
-- **Smart Planning** — Input requirements, AI auto-breaks tasks, matches nodes, generates briefings, sets adjustable deadlines
-- **Simulate Delivery** — Click a button, AI role-plays as the worker node based on their identity, skills, and relationship to generate mock deliverables
+- **Smart Planning** — Input requirements, AI auto-breaks tasks, matches nodes, generates briefings, sets deadlines (supports team-based planning with team relationship context injection)
+- **Simulate Delivery** — Click a button, AI role-plays as the worker node based on identity, skills, and relationships to generate mock deliverables
 - **Aggregated Review** — After all deliveries, AI reviews each deliverable (supports GitHub PR/Commit/Issue URLs), generates quality report
-- **Configurable LLM** — Supports Claude / OpenAI, custom Base URL for private deployments (vLLM / Ollama / Azure)
+- **Performance Evaluation** — Three rating systems (Ali 3.75 / SABCD / EM), AI generates per-person per-task performance ratings and comments
+- **Configurable LLM** — Supports 3 API formats (Anthropic Messages / OpenAI Chat Completions / OpenAI Responses), custom Base URL for private model services
+### Resizable Editors
+All text editing areas (task delivery, review results, planning briefings) support drag-to-resize and fullscreen expansion.
 ### Demo Scenarios
-The dashboard includes three built-in demo scenarios for instant hands-on experience:
+The dashboard includes three built-in demo scenarios, one-click to load carbon-based nodes and teams:
 - **Three Kingdoms (Shu Han)** 🐉 — You are Liu Bei, commanding Guan Yu, Zhang Fei, Zhao Yun, Zhuge Liang and more
-- **Tech Company** 💻 — You are the Tech Director, managing frontend, backend, algorithm, product, design, QA, and ops teams
+- **Tech Company** 💻 — You are the Tech Director, managing frontend, backend, algorithm, product, design, QA, and DevOps
 - **US Government** 🇺🇸 — You are Trump, directing Musk, Rubio, Bessent and the core cabinet
 ## Core Workflow
-1. **Agent Encapsulation** — Register human members, build the carbon compute pool
-2. **AI Planning** — Input requirements, AI breaks tasks, matches nodes, generates briefings and deadlines
+1. **Agent Encapsulation** — Register human members, build teams, construct the carbon compute pool
+2. **AI Planning** — Input requirements, select team/nodes, AI breaks tasks, generates briefings and deadlines
 3. **Confirm Dispatch** — Preview plan, adjust deadlines, one-click dispatch
 4. **Async Resume** — Carbon-based nodes submit deliverables (supports GitHub URLs), system wakes up the Job
 5. **AI Review** — When all sub-tasks complete, LLM reviews deliverable quality and generates a report
+6. **Performance Evaluation** — Select rating system, AI generates performance ratings for each carbon-based node
 ## Environment Variables
 | Variable | Default | Description |
 |----------|---------|-------------|
-| `HUMANCLAW_LLM_PROVIDER` | `claude` | LLM provider: `claude` or `openai` |
+| `HUMANCLAW_LLM_PROVIDER` | `anthropic` | API format: `anthropic` / `openai` / `responses` |
 | `HUMANCLAW_LLM_API_KEY` | - | LLM API Key (required for AI features) |
 | `HUMANCLAW_LLM_MODEL` | per provider | Optional model override |
 | `HUMANCLAW_LLM_BASE_URL` | official | Custom API URL (private deployments) |
@@ -170,14 +193,21 @@ interface HumanAgent {
   status: AgentStatus;    // IDLE | BUSY | OFFLINE | OOM
 }
-interface HumanTask {
-  trace_id: string;       // TK-9527
-  job_id: string;
-  assignee_id: string;
-  todo_description: string;
-  deadline: string;
-  status: TaskStatus;     // PENDING | DISPATCHED | RESOLVED | OVERDUE
-  result_data: unknown;
+interface Team {
+  team_id: string;        // team_xxxxxxxx
+  name: string;           // "Frontend Team"
+  description: string;
+  members: TeamMember[];  // with team relationships
+}
+interface Evaluation {
+  eval_id: string;
+  agent_id: string;
+  trace_id: string;
+  rating_system: 'ali' | 'letter' | 'em';
+  rating: string;         // "3.75" / "A" / "EM+"
+  weight: number;
+  comment: string;
 }
 ```
@@ -197,10 +227,10 @@ npm run lint       # Type check
 - **Runtime**: Node.js 22+, TypeScript (ESM, strict)
 - **API**: Express v5
 - **Storage**: SQLite (better-sqlite3, WAL mode)
-- **LLM**: Claude / OpenAI (native fetch, zero dependencies)
+- **LLM**: 3 API formats (Anthropic / OpenAI / Responses), native fetch, zero dependencies
 - **CLI**: Commander.js + @clack/prompts
 - **Dashboard**: Inline HTML (no build step)
-- **Testing**: Vitest (40 tests)
+- **Testing**: Vitest (68 tests)
 ## License