npm - @simplysm/sd-claude - Versions diffs - 13.0.77 → 13.0.80 - Mend

@simplysm/sd-claude 13.0.77 → 13.0.80

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (64)

package/claude/rules/sd-claude-rules.md +4 -63
package/claude/rules/sd-simplysm-usage.md +7 -0
package/claude/sd-session-start.sh +10 -0
package/claude/skills/sd-api-review/SKILL.md +89 -0
package/claude/skills/sd-check/SKILL.md +55 -57
package/claude/skills/sd-commit/SKILL.md +37 -42
package/claude/skills/sd-debug/SKILL.md +75 -265
package/claude/skills/sd-document/SKILL.md +63 -53
package/claude/skills/sd-document/_common.py +94 -0
package/claude/skills/sd-document/extract_docx.py +19 -48
package/claude/skills/sd-document/extract_pdf.py +22 -50
package/claude/skills/sd-document/extract_pptx.py +17 -40
package/claude/skills/sd-document/extract_xlsx.py +19 -40
package/claude/skills/sd-email-analyze/SKILL.md +23 -31
package/claude/skills/sd-email-analyze/email-analyzer.py +79 -65
package/claude/skills/sd-init/SKILL.md +133 -0
package/claude/skills/sd-plan/SKILL.md +69 -120
package/claude/skills/sd-readme/SKILL.md +106 -131
package/claude/skills/sd-review/SKILL.md +38 -155
package/claude/skills/sd-simplify/SKILL.md +59 -0
package/package.json +3 -2
package/README.md +0 -297
package/claude/refs/sd-angular.md +0 -127
package/claude/refs/sd-code-conventions.md +0 -155
package/claude/refs/sd-directories.md +0 -7
package/claude/refs/sd-library-issue.md +0 -7
package/claude/refs/sd-migration.md +0 -7
package/claude/refs/sd-orm-v12.md +0 -81
package/claude/refs/sd-orm.md +0 -23
package/claude/refs/sd-service.md +0 -5
package/claude/refs/sd-simplysm-docs.md +0 -52
package/claude/refs/sd-solid.md +0 -68
package/claude/refs/sd-workflow.md +0 -25
package/claude/rules/sd-refs-linker.md +0 -52
package/claude/sd-statusline.js +0 -296
package/claude/skills/sd-api-name-review/SKILL.md +0 -154
package/claude/skills/sd-brainstorm/SKILL.md +0 -215
package/claude/skills/sd-debug/condition-based-waiting-example.ts +0 -158
package/claude/skills/sd-debug/condition-based-waiting.md +0 -114
package/claude/skills/sd-debug/defense-in-depth.md +0 -128
package/claude/skills/sd-debug/find-polluter.sh +0 -64
package/claude/skills/sd-debug/root-cause-tracing.md +0 -168
package/claude/skills/sd-discuss/SKILL.md +0 -91
package/claude/skills/sd-explore/SKILL.md +0 -118
package/claude/skills/sd-plan-dev/SKILL.md +0 -294
package/claude/skills/sd-plan-dev/code-quality-reviewer-prompt.md +0 -49
package/claude/skills/sd-plan-dev/final-review-prompt.md +0 -50
package/claude/skills/sd-plan-dev/implementer-prompt.md +0 -60
package/claude/skills/sd-plan-dev/spec-reviewer-prompt.md +0 -45
package/claude/skills/sd-review/api-reviewer-prompt.md +0 -75
package/claude/skills/sd-review/code-reviewer-prompt.md +0 -82
package/claude/skills/sd-review/convention-checker-prompt.md +0 -61
package/claude/skills/sd-review/refactoring-analyzer-prompt.md +0 -92
package/claude/skills/sd-skill/SKILL.md +0 -417
package/claude/skills/sd-skill/anthropic-best-practices.md +0 -156
package/claude/skills/sd-skill/cso-guide.md +0 -161
package/claude/skills/sd-skill/examples/CLAUDE_MD_TESTING.md +0 -200
package/claude/skills/sd-skill/persuasion-principles.md +0 -220
package/claude/skills/sd-skill/testing-skills-with-subagents.md +0 -408
package/claude/skills/sd-skill/writing-guide.md +0 -159
package/claude/skills/sd-tdd/SKILL.md +0 -385
package/claude/skills/sd-tdd/testing-anti-patterns.md +0 -317
package/claude/skills/sd-use/SKILL.md +0 -67
package/claude/skills/sd-worktree/SKILL.md +0 -78

package/claude/skills/sd-debug/SKILL.md CHANGED

@@ -1,303 +1,113 @@
 ---
 name: sd-debug
-description: "Use when the user reports a bug, error, or unexpected behavior and asks to fix or debug it. Triggers: error messages, stack traces, test failures, build failures, 'why is this broken', 'fix this', 'debug this', unexpected behavior investigation."
+description: Use when the user asks for "디버그" (debug), "debug", "sd-debug", "오류 분석" (error analysis), "에러 원인" (error cause), "버그 찾기" (find the bug), or similar.
 ---
-# Systematic Debugging
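+# SD Debug: Error Root-Cause Analysis and Fix Planning
-## Overview
+Takes an error message, stack trace, or problem description, analyzes the codebase in depth, diagnoses the root cause, and then draws up a fix plan through the `/sd-plan` process.
-Random fixes waste time and create new bugs. Quick patches mask underlying issues.
+ARGUMENTS: An error message, stack trace, or problem description (optional). If omitted, infer it from the conversation context or ask the user.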
-**Core principle:** ALWAYS find root cause before attempting fixes. Symptom fixes are failure.
-**Violating the letter of this process is violating the spirit of debugging.**
-## The Iron Law
-```
-NO FIXES WITHOUT ROOT CAUSE INVESTIGATION FIRST
-```
-If you haven't completed Phase 1, you cannot propose fixes.
-## When to Use
-Use for ANY technical issue:
-- Test failures
-- Bugs in production
-- Unexpected behavior
-- Performance problems
-- Build failures
-- Integration issues
-**Use this ESPECIALLY when:**
-- Under time pressure (emergencies make guessing tempting)
-- "Just one quick fix" seems obvious
-- You've already tried multiple fixes
-- Previous fix didn't work
-- You don't fully understand the issue
-**Don't skip when:**
-- Issue seems simple (simple bugs have root causes too)
-- You're in a hurry (rushing guarantees rework)
-- Manager wants it fixed NOW (systematic is faster than thrashing)
-## The Four Phases
-You MUST complete each phase before proceeding to the next.
-### Phase 1: Root Cause Investigation
-**BEFORE attempting ANY fix:**
-1. **Read Error Messages Carefully**
-   - Don't skip past errors or warnings
-   - They often contain the exact solution
-   - Read stack traces completely
-   - Note line numbers, file paths, error codes
-2. **Reproduce Consistently**
-   - Can you trigger it reliably?
-   - What are the exact steps?
-   - Does it happen every time?
-   - If not reproducible → gather more data, don't guess
-3. **Read the Source Code Directly**
-   - Read the actual code where the bug manifests
-   - Understand what the code does line by line
-   - Walk through the logic with the failing input mentally
-   - Do NOT run `git diff` or `git log` — diffs show WHAT changed, not WHY it's broken
-   - The bug is in the code as it exists NOW; analyze the code as-is
-4. **Gather Evidence in Multi-Component Systems**
-   **WHEN system has multiple components (CI → build → signing, API → service → database):**
-   **BEFORE proposing fixes, add diagnostic instrumentation:**
-   ```
-   For EACH component boundary:
-     - Log what data enters component
-     - Log what data exits component
-     - Verify environment/config propagation
-     - Check state at each layer
-   Run once to gather evidence showing WHERE it breaks
-   THEN analyze evidence to identify failing component
-   THEN investigate that specific component
-   ```
-   **Example (multi-layer system):**
-   ```bash
-   # Layer 1: Workflow
-   echo "=== Secrets available in workflow: ==="
-   echo "IDENTITY: ${IDENTITY:+SET}${IDENTITY:-UNSET}"
-   # Layer 2: Build script
-   echo "=== Env vars in build script: ==="
-   env | grep IDENTITY || echo "IDENTITY not in environment"
-   # Layer 3: Signing script
-   echo "=== Keychain state: ==="
-   security list-keychains
-   security find-identity -v
-   # Layer 4: Actual signing
-   codesign --sign "$IDENTITY" --verbose=4 "$APP"
-   ```
-   **This reveals:** Which layer fails (secrets → workflow ✓, workflow → build ✗)
-5. **Trace Data Flow**
-   **WHEN error is deep in call stack:**
-   See `root-cause-tracing.md` in this directory for the complete backward tracing technique.
-   **Quick version:**
-   - Where does bad value originate?
-   - What called this with bad value?
-   - Keep tracing up until you find the source
-   - Fix at source, not at symptom
-### Phase 2: Pattern Analysis
-**Find the pattern before fixing:**
-1. **Find Working Examples**
-   - Locate similar working code in same codebase
-   - What works that's similar to what's broken?
-2. **Compare Against References**
-   - If implementing pattern, read reference implementation COMPLETELY
-   - Don't skim - read every line
-   - Understand the pattern fully before applying
-3. **Identify Differences**
-   - What's different between working and broken?
-   - List every difference, however small
-   - Don't assume "that can't matter"
-4. **Understand Dependencies**
-   - What other components does this need?
-   - What settings, config, environment?
-   - What assumptions does it make?
-### Phase 3: Hypothesis and Testing
-**Scientific method:**
-1. **Form Single Hypothesis**
-   - State clearly: "I think X is the root cause because Y"
-   - Write it down
-   - Be specific, not vague
-2. **Test Minimally**
-   - Make the SMALLEST possible change to test hypothesis
-   - One variable at a time
-   - Don't fix multiple things at once
-3. **Verify Before Continuing**
-   - Did it work? Yes → Phase 4
-   - Didn't work? Form NEW hypothesis
-   - DON'T add more fixes on top
-4. **When You Don't Know**
-   - Say "I don't understand X"
-   - Don't pretend to know
-   - Ask for help
-   - Research more
-### Phase 4: Implementation
-**Fix the root cause, not the symptom:**
-1. **Create Failing Test Case**
-   - Simplest possible reproduction
-   - Automated test if possible
-   - One-off test script if no framework
-   - MUST have before fixing
-   - Use the `sd-tdd` skill for writing proper failing tests
-2. **Implement Single Fix**
-   - Address the root cause identified
-   - ONE change at a time
-   - No "while I'm here" improvements
-   - No bundled refactoring
+---
-3. **Verify Fix**
-   - Test passes now?
-   - No other tests broken?
-   - Issue actually resolved?
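+## Step 1: Gather Problem Information
-4. **If Fix Doesn't Work**
+- Gather the problem information in the following order of priority:
+  1. **ARGUMENTS**: The error message, stack trace, or problem description passed when the skill was invoked
+  2. **Current conversation**: If there are no ARGUMENTS, identify the error message, logs, or problem situation from the current conversation context
+  3. **AskUserQuestion**: If neither of the above is enough, ask: "Which problem should I debug? Please provide the error message, stack trace, or a description of the problem."
+- From the gathered problem information, extract the following:
+  - **Error type**: compile error / runtime error / type error / logic error / build error / abnormal behavior, etc.
+  - **Related clues**: keywords to use when exploring the codebase, such as file paths, function names, line numbers, package names, and error codes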
-   ```mermaid
-   flowchart TD
-       A{"Fix failed?"} --> B{"Attempts < 3?"}
-       B -->|yes| C["Phase 1: Re-analyze<br>with new information"]
-       B -->|"no (≥3)"| D["STOP: Question Architecture<br>→ Discuss with user first"]
-   ```
+## Step 2: In-Depth Codebase Analysis
-   **Signs of architectural problem (≥3 failures):**
-   - Each fix reveals new shared state/coupling/problem in different place
-   - Fixes require "massive refactoring" to implement
-   - Each fix creates new symptoms elsewhere
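+Based on the problem information and clues gathered in Step 1, decide for yourself which investigations are needed to identify the root cause, and carry them out. Use the Agent tool (subagent_type: Explore) to explore the codebase, but choose the scope and method of investigation freely according to the nature of the problem.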
-   **Question fundamentals:** Is this pattern sound? Are we sticking with it through inertia? Should we refactor architecture vs. continue fixing symptoms?
+### Analysis Principles
-   This is NOT a failed hypothesis - this is a wrong architecture.
+> **Key point**: Do not propose a solution until you fully understand the root cause. Always follow the order "analyze → understand → solve".
-## Red Flags - STOP and Follow Process
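+1. **No guess-and-fix**: Do not hunt for the cause by trial and error, modifying code and checking the result. Read the code and trace the logic to find the cause.
+2. **No workarounds**: Do not propose fixes that hide the symptom, such as `as` type assertions, `any`, `// @ts-ignore`, hard-coding, or swallowing exceptions (ignoring them after `catch`).
+3. **Intent first**: When a test fails, first compare the intended behavior the test verifies with the actual behavior of the current code. If the behavior change is intentional, update the test; if it is unintended, fix the code. If you cannot determine the intent, ask the user.
+4. **Separate symptom from cause**: The point where the error message appears may not be the cause. Trace backward from the error point to find the actual cause.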
-If you catch yourself thinking:
+### Organizing the Analysis Results
-- "Quick fix for now, investigate later"
-- "Just try changing X and see if it works"
-- "Add multiple changes, run tests"
-- "Skip the test, I'll manually verify"
-- "It's probably X, let me fix that"
-- "I don't fully understand but this might work"
-- "Pattern says X but I'll adapt it differently"
-- "Here are the main problems: [lists fixes without investigation]"
-- Proposing solutions before tracing data flow
-- "Let me check git diff/log to see what changed"
-- **"One more fix attempt" (when already tried 2+)**
-- **Each fix reveals new problem in different place**
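+When the analysis is complete, summarize the following items:
+- **Error location**: The specific code location where the problem occurs (file path:line)
+- **Root cause**: An analysis of why the problem occurs
+- **Impact scope**: The list of files/functions affected by the problem
+- **Solutions**: Possible fixes (each with the files it would modify)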
-**ALL of these mean: STOP. Return to Phase 1.**
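+## Step 3: Consolidate the Diagnosis and Confirm with the User
-**If 3+ fixes failed:** Question the architecture (see Phase 4.5)
+Consolidate the analysis results from Step 2, write a diagnostic report in the format below, and present it to the user: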
-## User Signals You're Doing It Wrong
+```
+## Diagnosis
-**Watch for these redirections:**
+### Problem Summary
+<one-sentence summary of the error/problem>
-- "Is that not happening?" - You assumed without verifying
-- "Will it show us...?" - You should have added evidence gathering
-- "Stop guessing" - You're proposing fixes without understanding
-- "Ultrathink this" - Question fundamentals, not just symptoms
-- "We're stuck?" (frustrated) - Your approach isn't working
+### Root Cause
+<clear, specific explanation of the root cause, including file paths and line numbers>
-**When you see these:** STOP. Return to Phase 1.
+### Impact Scope
+- <affected file/function 1>
+- <affected file/function 2>
-## Common Rationalizations
+### Solutions
-| Excuse                                       | Reality                                                                 |
-| -------------------------------------------- | ----------------------------------------------------------------------- |
-| "Issue is simple, don't need process"        | Simple issues have root causes too. Process is fast for simple bugs.    |
-| "Emergency, no time for process"             | Systematic debugging is FASTER than guess-and-check thrashing.          |
-| "Just try this first, then investigate"      | First fix sets the pattern. Do it right from the start.                 |
-| "I'll write test after confirming fix works" | Untested fixes don't stick. Test first proves it.                       |
-| "Multiple fixes at once saves time"          | Can't isolate what worked. Causes new bugs.                             |
-| "Reference too long, I'll adapt the pattern" | Partial understanding guarantees bugs. Read it completely.              |
-| "I see the problem, let me fix it"           | Seeing symptoms ≠ understanding root cause.                             |
-| "Let me check git diff to see what changed"  | Diff shows WHAT changed, not WHY it's broken. Read the code as-is.      |
-| "One more fix attempt" (after 2+ failures)   | 3+ failures = architectural problem. Question pattern, don't fix again. |
+1. **<title of option 1>**: <description>
+   - Files to modify: <list of file paths>
+   - Pros: ...
+   - Cons: ...
-## Quick Reference
+2. **<title of option 2>**: <description> (if applicable)
+   - Files to modify: <list of file paths>
+   - Pros: ...
+   - Cons: ...
-| Phase                 | Key Activities                                            | Success Criteria            |
-| --------------------- | --------------------------------------------------------- | --------------------------- |
-| **1. Root Cause**     | Read errors, reproduce, read source code, gather evidence | Understand WHAT and WHY     |
-| **2. Pattern**        | Find working examples, compare                            | Identify differences        |
-| **3. Hypothesis**     | Form theory, test minimally                               | Confirmed or new hypothesis |
-| **4. Implementation** | Create test, fix, verify                                  | Bug resolved, tests pass    |
+### Recommended Option
+<the most suitable option and why>
+```
-## When Process Reveals "No Root Cause"
+After printing the diagnostic report, ask the following with AskUserQuestion:
-If systematic investigation reveals issue is truly environmental, timing-dependent, or external:
+```
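+Please review the diagnosis.
+1. The diagnosis is accurate; build the plan with the recommended option
+2. The diagnosis is accurate, but build the plan with a different option (number)
+3. The diagnosis is inaccurate; I will provide more information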
+```
-1. You've completed the process
-2. Document what you investigated
-3. Implement appropriate handling (retry, timeout, error message)
-4. Add monitoring/logging for future investigation
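+- **Option 1**: Proceed to Step 4 based on the recommended option.
+- **Option 2**: Proceed to Step 4 based on the option the user specified.
+- **Option 3**: Incorporate the additional information the user provides and return to Step 2.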
-**But:** 95% of "no root cause" cases are incomplete investigation.
+## Step 4: Build the Fix Plan with sd-plan
-## Supporting Techniques
+Using the diagnosis the user confirmed and the selected option as the task description, invoke `sd-plan` with the Skill tool. Pass the following in args:
-These techniques are part of systematic debugging and available in this directory:
+```
+Draw up a plan to implement the solution from the debugging diagnosis below:
-- **`root-cause-tracing.md`** - Trace bugs backward through call stack to find original trigger
-- **`defense-in-depth.md`** - Add validation at multiple layers after finding root cause
-- **`condition-based-waiting.md`** - Replace arbitrary timeouts with condition polling
+## Problem
+<problem summary from Step 3>
-**Related skills:**
+## Root Cause
+<root cause from Step 3>
-- **sd-tdd** - For creating failing test case (Phase 4, Step 1)
-- **sd-check** - Verify fix worked before claiming success
+## Solution
+<details of the option the user selected>
-## Real-World Impact
+## Files to Modify
+<list of file paths to modify for the solution>
+```
-From debugging sessions:
+## Step 5: Execute the Plan
-- Systematic approach: 15-30 minutes to fix
-- Random fixes approach: 2-3 hours of thrashing
-- First-time fix rate: 95% vs 40%
-- New bugs introduced: Near zero vs common
+Once sd-plan completes and produces a finalized plan, modify the code according to that plan.
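
As a rough illustration of the clue extraction that Step 1 of the new sd-debug skill asks for, the sketch below pulls file paths, line numbers, function names, and error codes out of a pasted stack trace. It is not part of the package: `extract_clues`, `Clues`, and both regular expressions are hypothetical, and they assume Node-style `at fn (path:line:col)` frames and TypeScript `TS####` compiler codes only.

```python
import re
from dataclasses import dataclass, field

# Hypothetical patterns (not from the package). They assume Node-style stack
# frames such as "at fn (path:line:col)" and TypeScript "TS1234" error codes.
_FRAME = re.compile(
    r"^\s*at (?:(?P<func>[\w.$<>]+) \()?(?P<path>[^\s()]+?):(?P<line>\d+):\d+\)?",
    re.MULTILINE,
)
_TS_CODE = re.compile(r"\bTS\d{4,5}\b")


@dataclass
class Clues:
    locations: list[tuple[str, int]] = field(default_factory=list)  # (file path, line)
    functions: list[str] = field(default_factory=list)
    error_codes: list[str] = field(default_factory=list)


def extract_clues(problem_text: str) -> Clues:
    """Collect codebase search keywords from an error message or stack trace."""
    clues = Clues()
    for frame in _FRAME.finditer(problem_text):
        clues.locations.append((frame.group("path"), int(frame.group("line"))))
        if frame.group("func"):
            clues.functions.append(frame.group("func"))
    # Deduplicate error codes while keeping first-seen order.
    clues.error_codes = list(dict.fromkeys(_TS_CODE.findall(problem_text)))
    return clues


# Made-up sample input for illustration.
SAMPLE = """TypeError: Cannot read properties of undefined (reading 'id')
    at getUser (src/services/user.ts:42:17)
    at src/routes/index.ts:10:5
error TS2345: Argument of type 'string' is not assignable"""

clues = extract_clues(SAMPLE)
print(clues.locations)
print(clues.functions)
print(clues.error_codes)
```

The extracted locations only mark where the error surfaced. Under the skill's "separate symptom from cause" principle, they are starting points for tracing backward, not the diagnosis itself.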

package/claude/skills/sd-document/SKILL.md CHANGED

@@ -1,99 +1,109 @@
 ---
 name: sd-document
-description: "Use when the user's request involves .docx, .xlsx, .pptx, or .pdf files. Triggers: document reading/analysis, file content extraction, DOCX/XLSX creation, client document review, data export."
+description: Use when the user's request involves .docx, .xlsx, .pptx, or .pdf files and asks for "document reading/analysis", "file content extraction", "DOCX/XLSX creation", "client document review", or "data export".
 ---
-# Document Processing
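+# SD Document: Reading and Writing Document Files
-## Overview
+Reads or writes document files (.docx/.xlsx/.pptx/.pdf) with Python scripts. When reading, it extracts text and images together with location information, saves the images to files, and then analyzes them with Claude Read.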
-Read and write document files (.docx/.xlsx/.pptx/.pdf).
-Python scripts extract text and images with location information by format, save images to files, and analyze them with Claude Read.
+ARGUMENTS: Document file path (required). Specify the path to a `.docx`, `.xlsx`, `.pptx`, or `.pdf` file.
-## Quick Reference
+---
+## Step 1: Decide the Task Direction
+Extract the file path from ARGUMENTS and determine whether the user's request is a **read** (analyze/extract) or a **write** (create/edit).
+- **Read** → go to Step 2
+- **Write** → go to Step 4
-| Format | Read | Write | Library |
-|--------|------|-------|---------|
-| DOCX | Yes | Yes | `python-docx` |
-| XLSX | Yes | Yes | `openpyxl`, `pandas` |
-| PPTX | Yes | No | `python-pptx` |
-| PDF  | Yes | No | `pdfplumber`, `pypdf` |
+### Support by Format
-Missing packages are auto-installed on first script run.
+| Format | Read | Write | Library |
+|--------|------|-------|---------|
+| DOCX | Yes | Yes | `python-docx` |
+| XLSX | Yes | Yes | `openpyxl`, `pandas` |
+| PPTX | Yes | No | `python-pptx` |
+| PDF  | Yes | No | `pdfplumber`, `pypdf` |
-## Reading (Document Analysis)
+Missing packages are installed automatically the first time a script runs.
-Run extraction scripts by format:
+## Step 2: Read the Document (Run the Extraction Script)
+Run the extraction script that matches the file extension:
 ```bash
-python .claude/skills/sd-document/extract_docx.py <filepath>
-python .claude/skills/sd-document/extract_xlsx.py <filepath>
-python .claude/skills/sd-document/extract_pptx.py <filepath>
-python .claude/skills/sd-document/extract_pdf.py  <filepath>
+python .claude/skills/sd-document/extract_docx.py <filepath>
+python .claude/skills/sd-document/extract_xlsx.py <filepath>
+python .claude/skills/sd-document/extract_pptx.py <filepath>
+python .claude/skills/sd-document/extract_pdf.py  <filepath>
 ```
-### Output
-- **stdout**: Text and location information (Markdown format)
-- **Image files**: Saved to `<filename>_files/` directory
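+### Output
+- **stdout**: Text and location information (Markdown format)
+- **Image files**: Saved to the `<filename>_files/` directory
+### Location Information
+| Format | Location Representation |
+|--------|-------------------------|
+| DOCX | Paragraph flow order (text-image inline) |
+| XLSX | Cell position (A1, B2, etc.) |
+| PPTX | Shape left/top coordinates (inches) + slide number |
+| PDF  | Page number |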
-### Location Information
+## Step 3: Analyze the Extraction Results
-| Format | Location Representation |
-|--------|-------------------------|
-| DOCX | Paragraph flow order (text-image inline) |
-| XLSX | Cell position (A1, B2, etc.) |
-| PPTX | Shape left/top coordinates (inches) + slide number |
-| PDF  | Page number |
+Check the extracted file paths in the Step 2 output and do the following:
-### Image Analysis
-Open extracted image files with Claude **Read** tool for visual analysis.
+1. **Images**: Open each image saved in the `_files/` directory with the **Read** tool and analyze it visually
+2. **Text**: Analyze or summarize the text printed to stdout according to the user's request
-### Scanned PDF (OCR)
-If text extraction is empty, the script outputs OCR instructions.
-Tesseract OCR requires OS-level installation (not auto-installable via pip).
+## Step 4: Write the Document
-## Writing
+Write a Python script to create or edit the document according to the user's request.
 ### DOCX (`python-docx`)
-For mail templates and simple reports.
+For mail templates and simple reports.
 ```python
 from docx import Document
-doc = Document()                          # New document
-# doc = Document("existing.docx")         # Edit existing document
-doc.add_heading("Title", level=1)
-doc.add_paragraph("Body content")
+doc = Document()                          # New document
+# doc = Document("existing.docx")         # Edit existing document
+doc.add_heading("Title", level=1)
+doc.add_paragraph("Body content")
 table = doc.add_table(rows=2, cols=3)
-table.cell(0, 0).text = "Item"
+table.cell(0, 0).text = "Item"
 doc.save("output.docx")
 ```
-Edit existing document: open with `Document("existing.docx")`, replace `paragraph.text`, modify `table.cell().text`.
+Edit an existing document: open it with `Document("existing.docx")`, then replace `paragraph.text` or modify `table.cell().text`.
 ### XLSX (`openpyxl`)
-Focuses on data and formulas. Formatting (colors, borders) not required.
+Focuses on data and formulas. Formatting (colors, borders) is not required.
 ```python
 from openpyxl import Workbook
 wb = Workbook()
 ws = wb.active
-ws["A1"] = "Item"
-ws["B1"] = "Quantity"
-ws.append(["Apple", 10])
-ws.append(["Pear", 20])
+ws["A1"] = "Item"
+ws["B1"] = "Quantity"
+ws.append(["Apple", 10])
+ws.append(["Pear", 20])
 ws["B4"] = "=SUM(B2:B3)"
 wb.save("output.xlsx")
 ```
-Edit existing file: open with `load_workbook("existing.xlsx")` and modify.
-Export pandas DataFrame: `df.to_excel("output.xlsx", index=False)`
+Edit an existing file: open it with `load_workbook("existing.xlsx")` and modify it.
+Export a pandas DataFrame: `df.to_excel("output.xlsx", index=False)`
-## Common Mistakes
+## Common Mistakes
-- **Character encoding**: Scripts have built-in UTF-8 handling; always extract through scripts
-- **Missing images**: After extraction, remember to read images in `_files/` directory
-- **XLSX data_only**: `load_workbook(data_only=True)` removes formulas — use `data_only=False` to preserve them
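+- **Character encoding**: The scripts have built-in UTF-8 handling, so always extract through the scripts
+- **Missing images**: After extraction, be sure to read the images in the `_files/` directory
+- **XLSX data_only**: `load_workbook(data_only=True)` removes formulas; use `data_only=False` to preserve them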

package/claude/skills/sd-document/_common.py ADDED

@@ -0,0 +1,94 @@
+"""Shared utilities for document extraction scripts."""
+import sys
+import io
+import re
+import subprocess
+from pathlib import Path
+def setup_encoding():
+    sys.stdout = io.TextIOWrapper(sys.stdout.buffer, encoding="utf-8", errors="replace")
+    sys.stderr = io.TextIOWrapper(sys.stderr.buffer, encoding="utf-8", errors="replace")
+def ensure_packages(packages: dict[str, str]):
+    for pip_name, import_name in packages.items():
+        try:
+            __import__(import_name)
+        except ImportError:
+            print(f"Installing package: {pip_name}...", file=sys.stderr)
+            subprocess.check_call([sys.executable, "-m", "pip", "install", pip_name],
+                                  stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
+def make_output_paths(file_path: str) -> tuple[Path, Path]:
+    p = Path(file_path)
+    out_dir = p.parent / f"{p.stem}_files"
+    return p, out_dir
+def print_header(file_path: Path):
+    print(f"# {file_path.name}\n")
+_created_dirs: set[Path] = set()
+def save_image(out_dir: Path, img_idx: int, blob: bytes, ext: str) -> Path:
+    if out_dir not in _created_dirs:
+        out_dir.mkdir(parents=True, exist_ok=True)
+        _created_dirs.add(out_dir)
+    img_path = out_dir / f"img_{img_idx:03d}.{ext}"
+    img_path.write_bytes(blob)
+    return img_path
+_CONTENT_TYPE_MAP = {
+    "image/jpeg": "jpg",
+    "image/png": "png",
+    "image/gif": "gif",
+    "image/bmp": "bmp",
+    "image/tiff": "tiff",
+    "image/svg+xml": "svg",
+    "image/webp": "webp",
+    "image/x-emf": "emf",
+    "image/x-wmf": "wmf",
+}
+def ext_from_content_type(content_type: str) -> str:
+    if content_type in _CONTENT_TYPE_MAP:
+        return _CONTENT_TYPE_MAP[content_type]
+    ext = content_type.split("/")[-1]
+    if "+" in ext:
+        ext = ext.split("+")[0]
+    return ext
+def print_image_summary(img_idx: int, out_dir: Path):
+    if img_idx > 0:
+        print(f"---\n{img_idx} image(s) saved: {out_dir}")
+    else:
+        print("---\nNo images")
+def run_cli(extract_fn, usage_name: str, packages: dict[str, str]):
+    if len(sys.argv) < 2:
+        print(f"Usage: python {usage_name} <file>", file=sys.stderr)
+        sys.exit(1)
+    ensure_packages(packages)
+    extract_fn(sys.argv[1])
+def normalize_cell(text) -> str:
+    if text is None:
+        return ""
+    return str(text).strip().replace("\n", " ")
+def parse_heading_level(style_name: str) -> int | None:
+    m = re.match(r"Heading\s*(\d+)", style_name)
+    if m:
+        return int(m.group(1))
+    return None
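
To make the fallback behavior of a few helpers in `_common.py` concrete, the snippet below copies three of them from the diff above and runs them on sample inputs. Two things differ from the package and are assumptions of this sketch: `_CONTENT_TYPE_MAP` is abbreviated to two entries, and the return annotation of `parse_heading_level` is written as `Optional[int]` so the snippet also loads on Python versions older than 3.10. The sample inputs are made up for illustration.

```python
import re
from typing import Optional

# Abbreviated copy of the map in _common.py (the full map has nine entries).
_CONTENT_TYPE_MAP = {
    "image/jpeg": "jpg",
    "image/x-emf": "emf",
}


def ext_from_content_type(content_type: str) -> str:
    if content_type in _CONTENT_TYPE_MAP:
        return _CONTENT_TYPE_MAP[content_type]
    # Fallback: take the subtype and drop any "+suffix" such as "+xml".
    ext = content_type.split("/")[-1]
    if "+" in ext:
        ext = ext.split("+")[0]
    return ext


def normalize_cell(text) -> str:
    # None becomes an empty cell; newlines collapse so a cell stays on one row.
    if text is None:
        return ""
    return str(text).strip().replace("\n", " ")


def parse_heading_level(style_name: str) -> Optional[int]:
    # Matches style names such as "Heading 1" or "Heading2" (case-sensitive).
    m = re.match(r"Heading\s*(\d+)", style_name)
    if m:
        return int(m.group(1))
    return None


print(ext_from_content_type("image/jpeg"))     # jpg (from the map)
print(ext_from_content_type("image/x-icon"))   # x-icon (fallback keeps the subtype)
print(ext_from_content_type("image/svg+xml"))  # svg (fallback path in this abbreviated copy)
print(repr(normalize_cell(" total\n(KRW) ")))  # 'total (KRW)'
print(parse_heading_level("Heading 2"))        # 2
print(parse_heading_level("Title"))            # None
```

The explicit map matters mainly where the subtype is not a usable extension: without the entries, `image/jpeg` would yield `jpeg` and `image/x-emf` would yield `x-emf`.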