@laitszkin/apollo-toolkit 3.12.0 → 3.12.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/AGENTS.md +6 -6
- package/CHANGELOG.md +7 -4
- package/README.md +9 -10
- package/analyse-app-logs/scripts/__pycache__/filter_logs_by_time.cpython-312.pyc +0 -0
- package/analyse-app-logs/scripts/__pycache__/log_cli_utils.cpython-312.pyc +0 -0
- package/analyse-app-logs/scripts/__pycache__/search_logs.cpython-312.pyc +0 -0
- package/commit-and-push/SKILL.md +1 -3
- package/docs-to-voice/scripts/__pycache__/docs_to_voice.cpython-312.pyc +0 -0
- package/enhance-existing-features/SKILL.md +21 -37
- package/generate-spec/SKILL.md +7 -10
- package/generate-spec/scripts/__pycache__/create-specscpython-312.pyc +0 -0
- package/init-project-html/SKILL.md +15 -17
- package/iterative-code-performance/SKILL.md +1 -1
- package/iterative-code-quality/SKILL.md +1 -1
- package/katex/scripts/__pycache__/render_katex.cpython-312.pyc +0 -0
- package/maintain-project-constraints/SKILL.md +18 -22
- package/merge-changes-from-local-branches/SKILL.md +23 -34
- package/open-github-issue/scripts/__pycache__/open_github_issue.cpython-312.pyc +0 -0
- package/open-source-pr-workflow/SKILL.md +4 -7
- package/optimise-skill/SKILL.md +8 -8
- package/optimise-skill/references/definition.md +1 -0
- package/optimise-skill/references/example_skill.md +8 -8
- package/package.json +1 -1
- package/read-github-issue/scripts/__pycache__/find_issues.cpython-312.pyc +0 -0
- package/read-github-issue/scripts/__pycache__/read_issue.cpython-312.pyc +0 -0
- package/resolve-review-comments/scripts/__pycache__/review_threads.cpython-312.pyc +0 -0
- package/review-spec-related-changes/SKILL.md +30 -38
- package/ship-github-issue-fix/SKILL.md +2 -2
- package/solve-issues-found-during-review/SKILL.md +8 -43
- package/spec-to-project-html/SKILL.md +2 -2
- package/submission-readiness-check/SKILL.md +3 -19
- package/systematic-debug/SKILL.md +2 -2
- package/text-to-short-video/scripts/__pycache__/enforce_video_aspect_ratio.cpython-312.pyc +0 -0
- package/version-release/SKILL.md +3 -3
- package/discover-edge-cases/CHANGELOG.md +0 -19
- package/discover-edge-cases/LICENSE +0 -21
- package/discover-edge-cases/README.md +0 -87
- package/discover-edge-cases/SKILL.md +0 -32
- package/discover-edge-cases/agents/openai.yaml +0 -4
- package/discover-edge-cases/references/architecture-edge-cases.md +0 -41
- package/discover-edge-cases/references/code-edge-cases.md +0 -46
- package/discover-security-issues/CHANGELOG.md +0 -32
- package/discover-security-issues/LICENSE +0 -21
- package/discover-security-issues/README.md +0 -35
- package/discover-security-issues/SKILL.md +0 -54
- package/discover-security-issues/agents/openai.yaml +0 -4
- package/discover-security-issues/references/agent-attack-catalog.md +0 -117
- package/discover-security-issues/references/common-software-attack-catalog.md +0 -168
- package/discover-security-issues/references/red-team-extreme-scenarios.md +0 -81
- package/discover-security-issues/references/risk-checklist.md +0 -78
- package/discover-security-issues/references/security-test-patterns-agent.md +0 -101
- package/discover-security-issues/references/security-test-patterns-finance.md +0 -88
- package/discover-security-issues/references/test-snippets.md +0 -73
- package/recover-missing-plan/SKILL.md +0 -85
- package/recover-missing-plan/agents/openai.yaml +0 -4
- package/review-change-set/LICENSE +0 -21
- package/review-change-set/README.md +0 -55
- package/review-change-set/SKILL.md +0 -46
- package/review-change-set/agents/openai.yaml +0 -4
- package/review-codebases/LICENSE +0 -21
- package/review-codebases/README.md +0 -69
- package/review-codebases/SKILL.md +0 -46
- package/review-codebases/agents/openai.yaml +0 -4
- package/scheduled-runtime-health-check/LICENSE +0 -21
- package/scheduled-runtime-health-check/README.md +0 -107
- package/scheduled-runtime-health-check/SKILL.md +0 -135
- package/scheduled-runtime-health-check/agents/openai.yaml +0 -4
- package/scheduled-runtime-health-check/references/output-format.md +0 -20
|
@@ -1,135 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: scheduled-runtime-health-check
|
|
3
|
-
description: Use a background terminal to run a user-specified command immediately or in a requested time window, and optionally explain findings from the captured logs after the run. Use when users want timed project execution, bounded runtime checks, or post-run log-based findings.
|
|
4
|
-
---
|
|
5
|
-
|
|
6
|
-
# Scheduled Runtime Health Check
|
|
7
|
-
|
|
8
|
-
## Dependencies
|
|
9
|
-
|
|
10
|
-
- Required: `analyse-app-logs` when the user asks for post-run log findings or when the observed run needs evidence-backed diagnosis.
|
|
11
|
-
- Conditional: `improve-observability` when current logs cannot prove module health or root cause.
|
|
12
|
-
- Optional: `open-github-issue` indirectly through `analyse-app-logs` when confirmed issues should be published.
|
|
13
|
-
- Fallback: If no scheduler or automation capability is available for the requested future start time, stop and report that scheduling could not be created; only run immediately when the user explicitly allows an immediate bounded observation instead of a timed start.
|
|
14
|
-
|
|
15
|
-
## Standards
|
|
16
|
-
|
|
17
|
-
- Evidence: Anchor every conclusion to the requested command, execution window, startup/shutdown timestamps, one canonical run folder or artifact root, captured logs, and concrete runtime signals; when structured artifacts land outside that canonical root because of inherited environment or wrapper drift, prove the mismatch and treat the artifact-path correction as part of the evidence trail.
|
|
18
|
-
- Execution: Collect the run contract, verify the real stop mechanism before launch, choose the highest-fidelity execution mode that matches the user's intent, use a background terminal, optionally update the code only when the user asks, execute the requested command immediately or in the requested window, record the canonical run folder once the process materializes it, capture logs, stop cleanly when bounded, then delegate log review to `analyse-app-logs` only when findings are requested or needed.
|
|
19
|
-
- Quality: Keep scheduling, execution, and shutdown deterministic; separate confirmed findings from hypotheses; and mark each assessed module healthy/degraded/failed/unknown with reasons.
|
|
20
|
-
- Output: Return the run configuration, execution status, log locations, optional code-update result, optional module health by area, confirmed issues, potential issues, observability gaps, and scheduler status when applicable.
|
|
21
|
-
|
|
22
|
-
## Overview
|
|
23
|
-
|
|
24
|
-
Use this skill when the user wants an agent to do work in this shape:
|
|
25
|
-
|
|
26
|
-
- use a background terminal for the whole run
|
|
27
|
-
- execute a specific command such as `npm run dev`, `docker compose up`, or another repo-defined entrypoint
|
|
28
|
-
- optionally update the project before execution when the user explicitly asks
|
|
29
|
-
- optionally run it inside a specific time window
|
|
30
|
-
- optionally wait for the run to finish and then explain findings from the logs
|
|
31
|
-
|
|
32
|
-
Canonical task shape:
|
|
33
|
-
|
|
34
|
-
`Use $scheduled-runtime-health-check to use a background terminal to run <command>.`
|
|
35
|
-
|
|
36
|
-
Optional suffixes:
|
|
37
|
-
|
|
38
|
-
- `Before running, update this project to the latest safe code state.`
|
|
39
|
-
- `Run it in this specific time window: <window>.`
|
|
40
|
-
- `After the run completes, explain your findings from the logs.`
|
|
41
|
-
|
|
42
|
-
This skill is an orchestration layer. It owns the background terminal session, optional code-update step, optional scheduling, bounded runtime, log capture, and optional module-level health summary. It delegates deep log diagnosis to `analyse-app-logs` only when the user asks for findings or the run clearly needs evidence-backed analysis.
|
|
43
|
-
|
|
44
|
-
## Core principles
|
|
45
|
-
|
|
46
|
-
- Prefer one bounded observation window over open-ended monitoring.
|
|
47
|
-
- Use one dedicated background terminal session per requested run so execution and logs stay correlated.
|
|
48
|
-
- Record the canonical run directory, artifact root, or other generated output location as soon as it exists, and use that as the source of truth for later analysis.
|
|
49
|
-
- When a repository exposes both synthetic harnesses and production-like runtime entrypoints, prefer the production-like path for claims about real runtime, market, or operator behavior; use the lower-fidelity harness only when the user explicitly asked for it or when it is the only safe reproduction surface.
|
|
50
|
-
- Treat code update as optional and only perform it when the user explicitly requests it.
|
|
51
|
-
- Treat startup, steady-state, and shutdown as part of the same investigation.
|
|
52
|
-
- Do not call a module healthy unless there is at least one positive signal for it.
|
|
53
|
-
- Separate scheduler failures, boot failures, runtime failures, and shutdown failures.
|
|
54
|
-
- For complex pipelines, identify the last successful stage before attributing the failure to application logic.
|
|
55
|
-
- When the user asks to compare a bounded run with a previous run, compare only runs with the same command or preset, duration, runtime mode, and complete structured artifacts. If the previous run lacks canonical reports, databases, or startup artifacts, mark the runs incomparable and explain the artifact completeness gap instead of inventing performance deltas.
|
|
56
|
-
- When wrappers, copied environments, or report-path variables cause the current run's report or database to land under an older run directory, stop and reconcile the artifact root before analysis: record the intended run root, locate the actual emitted files, move or copy them back only when needed to restore one canonical evidence set, and report the path drift as an observability problem instead of silently mixing runs.
|
|
57
|
-
- If logs cannot support a health judgment, mark the module as `unknown` instead of guessing.
|
|
58
|
-
|
|
59
|
-
## Required workflow
|
|
60
|
-
|
|
61
|
-
1. Define the run contract
|
|
62
|
-
- Confirm or derive the workspace, execution command, optional code-update step, optional schedule, optional duration, readiness signal, log locations, and whether post-run findings are required.
|
|
63
|
-
- Derive commands from trustworthy sources first: `package.json`, `Makefile`, `docker-compose.yml`, `Procfile`, scripts, or project docs.
|
|
64
|
-
- If multiple commands exist for the same workflow, rank them by fidelity and state explicitly which mode you are choosing: production-like runtime, bounded integration harness, or synthetic scenario replay.
|
|
65
|
-
- If no trustworthy execution command or stop method can be found, stop and ask only for the missing command rather than guessing.
|
|
66
|
-
2. Prepare the background terminal run
|
|
67
|
-
- Use a dedicated background terminal session for the whole workflow.
|
|
68
|
-
- Create a dedicated run folder and record timezone, cwd, requested command, terminal session identifier, and any requested start/end boundaries.
|
|
69
|
-
- Capture stdout and stderr from the beginning of the session so the full run stays auditable.
|
|
70
|
-
- Identify and record the exact bounded-stop mechanism before launch: signal path, wrapper script, env var names, CLI flags, PID capture, and any project-specific shutdown helper.
|
|
71
|
-
- Decide in advance what the canonical evidence root will be if the command generates its own run directory, artifact bundle, database, or report file, so later diagnosis does not drift across multiple runs.
|
|
72
|
-
3. Optionally update to the latest safe code state
|
|
73
|
-
- Only do this step when the user explicitly asked to update the project before execution.
|
|
74
|
-
- Prefer the repository's normal safe update path, such as `git pull --ff-only`, or the project's documented sync command if one exists.
|
|
75
|
-
- Record the commit before and after the update.
|
|
76
|
-
- If the worktree is dirty, the branch has no upstream, or the update cannot be done safely, stop and report the exact blocker instead of guessing or forcing a merge.
|
|
77
|
-
4. Choose the execution timing
|
|
78
|
-
- If the user gave a specific time window, schedule or delay the same background-terminal run to start in that window.
|
|
79
|
-
- If no time window was requested, run immediately after setup, or after the optional update step if one was requested.
|
|
80
|
-
- If the user requested a future start time and no reliable scheduler is available, fail closed and report the scheduling limitation instead of starting early.
|
|
81
|
-
5. Run and capture readiness
|
|
82
|
-
- Execute the requested command in the same background terminal.
|
|
83
|
-
- As soon as the command emits or creates its canonical run directory, artifact root, or equivalent output location, record that path and reuse it for every later check.
|
|
84
|
-
- If later structured outputs appear under a different directory than the recorded run root, pause the analysis and reconcile the mismatch before reading metrics or logs from either location.
|
|
85
|
-
- Report the exact runtime mode used in the evidence record so later analysis does not accidentally treat synthetic-harness results as proof about production behavior.
|
|
86
|
-
- Wait for a concrete readiness signal when the command is expected to stay up, such as a health endpoint, listening-port log, worker boot line, or queue-consumer ready message.
|
|
87
|
-
- If readiness never arrives, stop the run, preserve logs, and treat it as a failed startup window.
|
|
88
|
-
6. Observe and stop when bounded
|
|
89
|
-
- If a bounded window or explicit stop time was requested, keep the process running only for that agreed window and then stop it cleanly.
|
|
90
|
-
- Do not rely only on the child command to self-terminate on time; track the wall-clock deadline yourself and enforce the stop sequence when the deadline is reached.
|
|
91
|
-
- Track crashes, restarts, retry storms, timeout bursts, stuck jobs, resource pressure, and repeated warnings during the run.
|
|
92
|
-
- Use the project's normal shutdown path first; if graceful stop fails, escalate deterministically and record the exact stop sequence and timestamps.
|
|
93
|
-
- If the process overruns the agreed window, stop it immediately, mark the run as an overrun, and report whether the cause was a bad wrapper contract, a missing stop hook, or a failed shutdown.
|
|
94
|
-
7. Explain findings from logs when requested
|
|
95
|
-
- If the user asked for findings after completion, wait for the run to finish before analyzing the captured logs.
|
|
96
|
-
- Invoke `analyse-app-logs` on only the captured runtime window.
|
|
97
|
-
- Pass the service or module names, environment, timezone, canonical run folder, relevant log files, and the exact start/end boundaries.
|
|
98
|
-
- When the command produced reports, databases, or other structured artifacts, compare them against the same run's logs before making a health judgment.
|
|
99
|
-
- For follow-up questions about why most business events did not happen, build a stage-by-stage funnel from the canonical artifacts before reading isolated logs: candidate counts, admission/precheck decisions, queue or governor outcomes, skipped/blocked reasons, executed counts, retry/remediation outcomes, and persistence records.
|
|
100
|
-
- For follow-up questions about runtime speed, report latency from structured timestamps when available, separating startup/readiness, queue wait, precheck/final-prepare, submission, confirmation, and end-to-end timings rather than collapsing them into one vague duration.
|
|
101
|
-
- Reuse its confirmed issues, hypotheses, and monitoring improvements instead of rewriting a separate incident workflow.
|
|
102
|
-
8. Produce the final report
|
|
103
|
-
- Always summarize the actual command executed, actual start/end timestamps, execution status, and log locations.
|
|
104
|
-
- Separate bounded execute time from setup, readiness, collection, and shutdown overhead so a 2-minute observation is not misreported as 2 minutes of end-to-end pipeline latency.
|
|
105
|
-
- Include the code-update result only when an update step was requested.
|
|
106
|
-
- When findings were requested, classify each relevant module as `healthy`, `degraded`, `failed`, or `unknown` with concrete evidence and separate observed issues from risks that still need validation.
|
|
107
|
-
|
|
108
|
-
## Scheduling rules
|
|
109
|
-
|
|
110
|
-
- Use the user's locale timezone when configuring scheduled tasks.
|
|
111
|
-
- Name scheduled jobs clearly so the user can recognize start, stop, and analysis ownership.
|
|
112
|
-
- Prefer recurring schedules only when the user explicitly wants repeated checks; otherwise create a one-off bounded run.
|
|
113
|
-
- If the host provides agent automations, use them before inventing project-local scheduling files.
|
|
114
|
-
- If native automation is unavailable, prefer the smallest reliable OS-level scheduling method already present on the machine.
|
|
115
|
-
- If the request depends on a future start time and no reliable scheduling method exists, do not silently convert the request into an immediate run.
|
|
116
|
-
|
|
117
|
-
## Health classification rubric
|
|
118
|
-
|
|
119
|
-
- `healthy`: positive startup signal, expected runtime behavior, and no material degradation in the chosen window.
|
|
120
|
-
- `degraded`: module stays up but shows retries, warnings, latency spikes, partial failures, or other recurring stress signals.
|
|
121
|
-
- `failed`: boot failure, repeated crash, readiness failure, fatal error loop, or inability to perform its core responsibility.
|
|
122
|
-
- `unknown`: logs or probes do not provide enough evidence to justify a health call.
|
|
123
|
-
|
|
124
|
-
Absence of errors alone is not enough for `healthy`.
|
|
125
|
-
|
|
126
|
-
Use `references/output-format.md` for the structured output template including run summary, execution result, module health, confirmed issues, and scheduler status.
|
|
127
|
-
|
|
128
|
-
## Guardrails
|
|
129
|
-
|
|
130
|
-
- Do not let the project continue running past the agreed window unless the user explicitly asks.
|
|
131
|
-
- Do not assume the documented bound is real until the wrapper or script implementation confirms it.
|
|
132
|
-
- Do not perform a code-update step unless the user explicitly asked for it.
|
|
133
|
-
- Do not claim steady-state health from startup-only evidence.
|
|
134
|
-
- Keep the run folder and scheduler metadata so the investigation can be reproduced.
|
|
135
|
-
- If current logs are too weak to judge module health, recommend `improve-observability` instead of stretching the evidence.
|
|
@@ -1,4 +0,0 @@
|
|
|
1
|
-
interface:
|
|
2
|
-
display_name: "Scheduled Runtime Health Check"
|
|
3
|
-
short_description: "Use a background terminal to run a command in a time window and optionally explain findings from logs."
|
|
4
|
-
default_prompt: "Use $scheduled-runtime-health-check to use a background terminal to run the requested command immediately or in the requested time window, optionally update the project first only when the user asks for that, capture logs through the run, and, when findings are requested, delegate bounded post-run log diagnosis to $analyse-app-logs."
|
|
@@ -1,20 +0,0 @@
|
|
|
1
|
-
# Output Format
|
|
2
|
-
|
|
3
|
-
Use this structure in responses:
|
|
4
|
-
|
|
5
|
-
1. Run summary
|
|
6
|
-
- Workspace, command, schedule if any, actual start/end timestamps, bounded execute duration, total wall-clock duration, readiness result, shutdown result if applicable, canonical run folder or artifact root, and log locations.
|
|
7
|
-
2. Execution result
|
|
8
|
-
- Whether the command completed, stayed up for the requested window, or failed early.
|
|
9
|
-
3. Code update result
|
|
10
|
-
- Include only when an update step was requested. Record the update command, before/after commit, or the exact blocker.
|
|
11
|
-
4. Module health
|
|
12
|
-
- Include only when findings were requested or health assessment was part of the task. One entry per module with status (`healthy` / `degraded` / `failed` / `unknown`) and evidence.
|
|
13
|
-
5. Confirmed issues
|
|
14
|
-
- Include only when log analysis was requested. Reuse evidence-backed findings from `analyse-app-logs`.
|
|
15
|
-
6. Potential issues and validation needed
|
|
16
|
-
- Include only when log analysis was requested. Risks that appeared in the run but need more evidence.
|
|
17
|
-
7. Observability gaps
|
|
18
|
-
- Include only when log analysis was requested. Missing logs, metrics, probes, or correlation IDs that blocked diagnosis.
|
|
19
|
-
8. Automation or scheduler status
|
|
20
|
-
- Include only when a future window or scheduler was involved. Record task identifiers, execution status, and whether future cleanup is needed.
|