universal-agent-memory 1.0.6 → 1.0.8
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/package.json +1 -1
- package/templates/CLAUDE.template.md +277 -1
package/package.json
CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "universal-agent-memory",
-  "version": "1.0.6",
+  "version": "1.0.8",
   "description": "Universal AI agent memory system - CLAUDE.md templates, memory, worktrees for Claude Code, Factory.AI, VSCode, OpenCode",
   "type": "module",
   "main": "dist/index.js",
package/templates/CLAUDE.template.md
CHANGED

@@ -1,7 +1,28 @@
 <!--
-CLAUDE.md Universal Template - v10.6
+CLAUDE.md Universal Template - v10.7
 
 CHANGES IN THIS VERSION:
+- Added Verification Patterns (P27-P31) from Terminal-Bench failure analysis
+- Pattern 27: Output Directory Cleanup (ODC) - remove non-required files
+- Pattern 28: Service Smoke Test (SST) - test services before completing
+- Pattern 29: Multi-Solution Discovery (MSD) - find ALL valid solutions
+- Pattern 30: Performance Threshold Tuning (PTT) - iterate to meet thresholds
+- Pattern 31: Round-Trip Verification (RTV) - verify compress/decompress works
+- Updated Pattern Router to include verification patterns
+- Added 5 completion gates (output, tests, constraints, cleanup, services)
+
+PREVIOUS (v10.6):
+- Added Domain-Specific Patterns (P21-P26)
+- Pattern 21: Chess Engine Integration (CEI) - use Stockfish, not reasoning
+- Pattern 22: Git Recovery Forensics (GRF) - backup first, forensic approach
+- Pattern 23: Compression Impossibility Detection (CID) - refuse impossible tasks
+- Pattern 24: Polyglot Code Construction (PCC) - search for existing examples
+- Pattern 25: Service Configuration Pipeline (SCP) - ordered setup, test each
+- Pattern 26: Near-Miss Iteration (NMI) - iterate on partial success tasks
+- Updated Pattern Router to include domain patterns
+- Added 30% time budget reservation for iteration
+
+PREVIOUS (v10.5):
 - STRENGTHENED Pattern Router: Now requires explicit analysis block output
 - STRENGTHENED Constraint Extraction: Mandatory checklist with verification
 - STRENGTHENED Adversarial Thinking: Explicit attack vector enumeration
@@ -424,6 +445,261 @@ ADVERSARIAL: "How do I craft input that bypasses the filter?"
 
 ---
 
+## 🎯 DOMAIN-SPECIFIC PATTERNS
+
+**Task-specific guidance for common failure categories.**
+
+### Pattern 21: Chess Engine Integration (CEI)
+Chess tasks require external engines - reasoning alone cannot find optimal moves.
+
+**Detection**: Keywords "chess", "best move", "FEN", "PGN", "checkmate"
+
+**Protocol**:
+1. Check for Stockfish: `which stockfish || apt-get install -y stockfish`
+2. Parse board state to FEN format
+3. Use Stockfish for analysis: `echo "position fen $FEN" | stockfish`
+4. Extract best move from engine output
+
+```bash
+# Stockfish one-liner for best move
+echo -e "position fen rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1\ngo depth 20" | stockfish | grep "bestmove"
+```
+
+**NEVER**: Try to calculate chess moves through reasoning - use the engine.
+
+### Pattern 22: Git Recovery Forensics (GRF)
+Git recovery tasks require a forensic approach, not standard git commands.
+
+**Detection**: Keywords "recover", "corrupted", "lost commit", "fix git", "reflog"
+
+**Protocol**:
+1. **BACKUP FIRST**: `cp -r .git .git.bak`
+2. Check integrity: `git fsck --full --no-dangling`
+3. Check reflog: `git reflog` (commits survive even after a reset)
+4. Check loose objects: `find .git/objects -type f`
+5. Recover from pack files if needed
+
+```bash
+# Recovery sequence
+git fsck --full 2>&1 | tee /tmp/git-fsck.log
+git reflog --all
+git log --all --oneline --graph
+```
+
+**NEVER**: Run destructive commands without backup. Use `--dry-run` first.
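
The recovery step itself (protocol step 5) is only named above. A minimal sketch of restoring a dangling commit surfaced by `git fsck` - illustrative only, not part of the published template; `<dangling-sha>` and the branch name `recovered` are placeholders:

```bash
# Illustrative: attach a dangling commit found by fsck to a branch (nothing is rewritten)
git fsck --lost-found               # also copies dangling objects under .git/lost-found/
git show --stat <dangling-sha>      # inspect the candidate commit before touching anything
git branch recovered <dangling-sha>
```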
+
+### Pattern 23: Compression Impossibility Detection (CID)
+Some compression tasks are mathematically impossible.
+
+**Detection**: Keywords "compress", "codegolf", "under X bytes", "minimal size"
+
+**Impossibility Markers**:
+- ML model weights (GPT-2 = 500MB+, cannot be <1MB without losing function)
+- Random/encrypted data (incompressible by definition)
+- Asking for a compression ratio beyond information-theoretic limits
+
+```bash
+# If the task asks for impossible compression:
+echo "IMPOSSIBLE: Task requires compressing ${ORIGINAL_SIZE} to ${TARGET_SIZE}.
+Shannon's source coding theorem proves this is impossible for this data type.
+Minimum achievable size is approximately ${MIN_POSSIBLE}." > /app/IMPOSSIBLE.md
+```
+
+**NEVER**: Attempt to compress ML weights to <1% of original size.
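
Before declaring impossibility, `${MIN_POSSIBLE}` needs a value. One hedged heuristic, not part of the template's protocol, is to let strong general-purpose compressors set a practical lower bound (`$INPUT` and `$TARGET_SIZE` are placeholders for the task's file and byte budget; assumes gzip and xz are installed):

```bash
# Heuristic lower bound: best size reached by off-the-shelf compressors
ORIGINAL_SIZE=$(wc -c < "$INPUT")
GZ=$(gzip -9 -c "$INPUT" | wc -c)
XZ=$(xz -9e -c "$INPUT" | wc -c)
MIN_POSSIBLE=$(( GZ < XZ ? GZ : XZ ))
echo "original=$ORIGINAL_SIZE gzip=$GZ xz=$XZ best=$MIN_POSSIBLE target=$TARGET_SIZE"
```

This only bounds what standard compressors achieve; the true entropy limit may be lower, so treat it as a sanity filter rather than a proof.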
+
+### Pattern 24: Polyglot Code Construction (PCC)
+Polyglot tasks hinge on specific compiler/interpreter quirks.
+
+**Detection**: Keywords "polyglot", "works in both", "compile as X and Y"
+
+**Known Polyglot Techniques**:
+- C/Python: Use `#if 0` blocks and string literals
+- Rust/C: Careful use of macros and comments
+- Shell/Python: Shebang and triple-quote tricks (see the sketch after this pattern)
+
+```c
+// C and Python polyglot example
+#if 0
+"""
+#endif
+#include <stdio.h>
+int main() { printf("Hello from C\n"); return 0; }
+#if 0
+"""
+print("Hello from Python")
+#endif
+```
+
+**Protocol**: Search for existing polyglot examples before implementing.
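
The Shell/Python technique listed above has no example in the template. A minimal sketch of the shebang/triple-quote trick - illustrative only, not from the package:

```sh
#!/bin/sh
# sh reads `"""":` as the no-op `:` command, runs the echo, and exits;
# Python reads the same lines as a triple-quoted string, then runs print().
"""":
echo "Hello from sh"
exit 0
"""
print("Hello from Python")
```

Running the same file with `sh file` and with `python3 file` should print the matching greeting for each interpreter.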
+
+### Pattern 25: Service Configuration Pipeline (SCP)
+Multi-service configuration requires ordered setup.
+
+**Detection**: Keywords "configure", "server", "webserver", "service", "daemon"
+
+**Protocol**:
+1. **Identify all services** needed (nginx, git, ssh, etc.)
+2. **Check service status**: `systemctl status <service>`
+3. **Configure in dependency order** (base → dependent)
+4. **Test each service** before moving to next
+5. **Verify end-to-end** after all configured
+
+```bash
+# Service configuration pattern
+for svc in nginx git-daemon ssh; do
+  systemctl status $svc || systemctl start $svc
+  systemctl is-active $svc || echo "FAILED: $svc"
+done
+```
+
+### Pattern 26: Near-Miss Iteration (NMI)
+When tests show >50% passing, focus on specific failing tests.
+
+**Detection**: Test results show partial success (e.g., 8/9, 6/7, 5/6)
+
+**Protocol**:
+1. Run tests with verbose output: `pytest -v 2>&1 | tee results.txt`
+2. Extract ONLY failing test names
+3. Read failing test code to understand exact requirement
+4. Fix specific issue without breaking passing tests
+5. Re-run ONLY failing tests first: `pytest test_file.py::test_name -v`
+6. Then run full suite to verify no regressions
+
+```bash
+# Near-miss iteration loop
+while true; do
+  pytest -v 2>&1 | tee /tmp/results.txt
+  FAILED=$(grep "FAILED" /tmp/results.txt | head -1)
+  [ -z "$FAILED" ] && echo "ALL PASS" && break
+  echo "Fixing: $FAILED"
+  # ... fix specific test ...
+done
+```
+
+**Reserve 30% of time budget for near-miss iteration.**
+
+### Pattern 27: Output Directory Cleanup (ODC)
+Tests often check for ONLY specific files in output directories.
+
+**Detection**: Tasks mentioning "single file", "only", constraints on output directory contents
+
+**Protocol**:
+1. **Before completing**, list output directory: `ls /app/output/`
+2. **Remove non-required files**: compiled binaries, temp files, backups
+3. **Keep ONLY the required outputs** as specified in task
+
+```bash
+# Clean output directory - keep only required file
+cd /app/polyglot
+ls -la                      # Check what's there
+rm -f *.o *.out main cmain  # Remove compiled artifacts
+ls -la                      # Verify only main.rs remains
+```
+
+**Common mistakes**: Leaving compiled binaries, test files, or backup files in output directories.
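
A more generic version of the cleanup above, assuming a single required output file; the name `solution.txt` is a placeholder, not something specified by the template:

```bash
# Illustrative: delete everything in the output directory except the one required file
find /app/output -mindepth 1 ! -name 'solution.txt' -delete
ls -la /app/output   # confirm only the required file remains
```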
+
+### Pattern 28: Service Smoke Test (SST)
+Services must be tested BEFORE claiming completion.
+
+**Detection**: Tasks involving servers, daemons, APIs, web services
+
+**Protocol**:
+1. Start the service
+2. **Immediately test it** with curl/wget/client
+3. If test fails: debug, fix, restart, re-test
+4. Only complete when test PASSES
+
+```bash
+# Service verification pattern
+# 1. Start service
+systemctl start nginx
+
+# 2. IMMEDIATELY test it
+curl -v http://localhost:80/ 2>&1 | grep -q "200 OK" || echo "FAIL: nginx not serving"
+
+# 3. For PyPI server
+python -m http.server 8080 &
+curl -s http://localhost:8080/simple/ | grep -q "href" || echo "FAIL: PyPI not serving"
+
+# 4. For git server
+git ls-remote git@localhost:/repo.git 2>&1 | grep -q "refs" || echo "FAIL: git not serving"
+```
+
+**NEVER complete without a successful smoke test.**
+
+### Pattern 29: Multi-Solution Discovery (MSD)
+Some tasks require finding ALL valid solutions, not just one.
+
+**Detection**: Keywords "all moves", "both solutions", "list all", "find every"
+
+**Protocol**:
+1. **Parse the task for a multi-solution requirement**
+2. For chess: Find ALL checkmate moves, not just the first one
+3. For puzzles: Enumerate all valid answers
+4. **Output ALL solutions** in required format
+
+```bash
+# Chess: Find ALL checkmate-in-one moves
+# Don't stop at first bestmove, analyze ALL legal moves
+echo -e "position fen $FEN\ngo depth 20" | stockfish | grep "bestmove"
+# Then check if there are MULTIPLE winning moves (MultiPV is set via setoption, not on "go")
+echo -e "setoption name MultiPV value 5\nposition fen $FEN\ngo movetime 5000" | stockfish
+# Extract ALL moves that lead to checkmate
+```
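
The last comment in the block leaves the extraction step implicit. A hedged sketch of pulling every mate-in-one move out of the MultiPV output, assuming Stockfish's usual `info ... score mate 1 ... pv <move>` line format:

```bash
# Collect the distinct first moves of every line Stockfish scores as mate in 1
echo -e "setoption name MultiPV value 5\nposition fen $FEN\ngo movetime 5000" | stockfish |
  grep "score mate 1 " |
  awk '{ for (i = 1; i <= NF; i++) if ($i == "pv") { print $(i + 1); break } }' |
  sort -u
```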
+
+### Pattern 30: Performance Threshold Tuning (PTT)
+Tasks with numeric thresholds require iterative tuning.
+
+**Detection**: Keywords "win rate", "accuracy", "percentage", "threshold", "at least X%"
+
+**Protocol**:
+1. Run initial implementation
+2. Check if thresholds are met
+3. If not: analyze what's missing, tune parameters
+4. Re-run until thresholds pass
+
+```bash
+# Performance tuning loop for Core Wars
+while true; do
+  # Run tests
+  pytest -v 2>&1 | tee results.txt
+
+  # Check for threshold failures
+  if grep -q "need.*%+" results.txt; then
+    echo "Threshold not met - tuning..."
+    # Analyze which opponent needs better strategy
+    # Modify warrior code
+    # Re-run
+  else
+    break
+  fi
+done
+```
+
+**For compression tasks**: If decompression fails, verify round-trip: `compress | decompress == original`
+
+### Pattern 31: Round-Trip Verification (RTV)
+For transform/encode/compress tasks, verify the reverse operation.
+
+**Detection**: Keywords "compress", "encode", "serialize", "encrypt", and task mentions "decompress", "decode", etc.
+
+**Protocol**:
+1. Create test data
+2. Apply forward transform (compress)
+3. **Immediately apply reverse** (decompress)
+4. **Verify original == result**
+5. Fix if not matching
+
+```bash
+# Round-trip verification for compression
+echo "original data" > /tmp/original.txt
+python compress.py /tmp/original.txt /tmp/compressed.bin
+python decompress.py /tmp/compressed.bin /tmp/recovered.txt
+diff /tmp/original.txt /tmp/recovered.txt && echo "PASS" || echo "FAIL: round-trip broken"
+```
+
+---
+
 ## ⚡ SESSION START PROTOCOL
 
 **EXECUTE IMMEDIATELY before any response:**