@checkstack/queue-api 0.2.17 → 0.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md CHANGED
@@ -1,5 +1,230 @@
1
1
  # @checkstack/queue-api
2
2
 
3
+ ## 0.3.0
4
+
5
+ ### Minor Changes
6
+
7
+ - aa89bc5: Replace the bespoke `registerInfrastructureTab()` registry with a standard
8
+ slot-extension contract (`InfrastructureTabsSlot` from
9
+ `@checkstack/infrastructure-common`). Plugins now contribute infrastructure
10
+ tabs via `createSlotExtension`, depending only on the slot owner.
11
+
12
+ The slot system in `@checkstack/frontend-api` gains a second type parameter
13
+ on `createSlot<TContext, TMetadata>` so extensions can declare typed static
14
+ metadata at registration time (label, icon, access rules, ordering for the
15
+ infrastructure tab bar). A new `useSlotExtensions(slot)` hook returns typed
16
+ extensions and subscribes to plugin lifecycle changes.
17
+
18
+ Each tab body now stacks a **Runtime** sub-section (live state, read-only)
19
+ on top of a **Configuration** sub-section (settings, gated by `canUpdate`).
20
+
21
+ **Queue runtime panel.** Surfaces aggregated counts (pending / processing /
22
+ completed / failed) plus three sub-tabs of recent jobs: **Active**, **Recent
23
+ failed** (with the failure message), and **Recent completed** (with
24
+ duration). Job payloads are deliberately not surfaced — they may carry
25
+ secrets and need a separate manage-access gate to be shown.
26
+
27
+ To support this, `Queue<T>` gains a required `listJobs(opts)` method
28
+ returning `JobSummary[]` (no payloads), and `QueueStats` gains a
29
+ `scope: "instance" | "cluster"` field. The in-memory queue keeps rolling
30
+ ring buffers (200 entries) for completed/failed history and tracks active
31
+ jobs by id; BullMQ uses native `getJobs`. `QueueManager.listJobs` aggregates
32
+ across queues and sorts (most-recent-first for terminal states, FIFO for
33
+ active/waiting/delayed).
34
+
35
+ **Cache runtime panel.** Lists the top N entries by size (or by recency) so
36
+ operators can debug a cache filling up. Values are deliberately omitted —
37
+ PII / secret risk. Backends opt in via an optional `listEntries?` method on
38
+ `CacheProvider`; non-supporting backends return `{ supported: false }` and
39
+ the UI renders a "not supported by this backend" hint. The in-memory cache
40
+ implements it using its existing per-entry byte tracking.
41
+
42
+ `CacheStats` also gains `scope: "instance" | "cluster"`.
43
+
44
+ **Multi-instance scope warning.** A new `<InstanceScopeBanner>` component in
45
+ `@checkstack/ui` renders a yellow banner above any runtime panel whose
46
+ backend reports `scope: "instance"` — i.e. in-memory queue or cache running
47
+ in a horizontally scaled deployment. The banner explains the metrics are
48
+ local to the responding replica and recommends switching to a clustered
49
+ backend (Redis-backed queue / cache) for cluster-wide visibility.
50
+
51
+ **Bug fix — stable cache provider proxy.** `CacheManagerImpl.getProvider()`
52
+ now returns a single stable proxy that delegates to whatever provider is
53
+ currently active. Previously, consumers of `createCachedScope` (and any
54
+ direct `cacheManager.getProvider()` caller) captured the active provider
55
+ reference at plugin-init time. After any `setActiveBackend` call — including
56
+ saving the same memory config in the new Cache tab, which reconstructs the
57
+ in-memory cache — those scopes wrote to an orphaned old provider while the
58
+ runtime panel read stats from the new (empty) one, making the runtime panel
59
+ appear to report 0 keys. With the proxy, all consumers share a single stable
60
+ identity and writes always land in the active provider.
61
+
62
+ **Bytes tracking on the in-memory cache.** `InMemoryCache.getStats().sizeBytes`
63
+ now returns a running approximation (UTF-8 bytes of the key plus
64
+ `v8.serialize(value).byteLength`, with a JSON fallback) that's kept in sync
65
+ across all eviction paths. Treat the number as a sanity gauge; it doesn't
66
+ include `Map` per-entry overhead.
67
+
68
+ **Pagination.** Both `Queue<T>.listJobs` and `CacheProvider.listEntries?`
69
+ are offset-paginated. Inputs gain an `offset: number`; outputs change to
70
+ `{ items, total: number | null, hasMore: boolean }`. `total` is nullable
71
+ so backends that can't compute it cheaply still paginate via `hasMore`.
72
+ The UI uses the existing `<Pagination>` component with a 25-row default
73
+ page size. `QueueManager.listJobs` aggregates by over-fetching
74
+ `[0, offset+limit)` per queue, merge-sorting, then slicing the window —
75
+ optimal for the single-queue case, acceptable for the multi-queue case
76
+ within the UI's reasonable page-depth bounds. BullMQ uses native offset
77
+ ranges via `getJobs(types, start, end)` plus `getJobCounts` for `total`.
78
+
79
+ **Pending tab.** The Queue runtime panel exposes a virtual `"pending"`
80
+ state (waiting ∪ delayed, FIFO). It's now the default sub-tab, since
81
+ "what's queued up?" is the most common question. Per-row state is shown
82
+ when viewing the combined list.
83
+
84
+ **Recurring schedules visible under Pending.** Cron- and interval-based
85
+ recurring jobs (e.g. healthchecks) are surfaced under Pending/Delayed
86
+ between fires, with a `nextRunAt` countdown column and a "(recurring)"
87
+ label. `JobSummary` gains optional `nextRunAt: Date` and `recurring:
88
+ boolean` fields. The in-memory queue synthesises these rows from its
89
+ `recurringJobs` registry; BullMQ already materialises the next fire of
90
+ each scheduler as a delayed job and we now surface its trigger time and
91
+ the `repeatJobKey`-derived `recurring` flag.
92
+
93
+ **Bug fix — drop hook emits with no listeners.** `EventBus.emit` no
94
+ longer enqueues a job when zero listeners (distributed or instance-local)
95
+ are registered for the hook. Previously, hooks like
96
+ `core.plugin.initialized` — emitted on every plugin init but subscribed
97
+ to by nothing in the core repo — accumulated one waiting job per emit
98
+ forever. The in-memory queue's `processNext` short-circuits when there
99
+ are zero consumer groups, so its post-loop cleanup never ran for these
100
+ orphaned jobs. The fix drops the emit at the source and logs a debug
101
+ line. Note: in distributed deployments using a Redis-backed queue, this
102
+ means a subscriber on another replica won't receive an event if no
103
+ replica that emits it has a local listener. Plugins needing cross-process
104
+ delivery must register their listener on every replica that should
105
+ receive the hook.
106
+
107
+ **Breaking notes (treated as minor under beta semantics)**:
108
+
109
+ - `@checkstack/infrastructure-common` removes `registerInfrastructureTab`
110
+ and `getInfrastructureTabs`; former callers must register an extension
111
+ into `InfrastructureTabsSlot`.
112
+ - `@checkstack/queue-api`'s `Queue<T>` interface requires the new
113
+ `listJobs(opts)` method returning `ListJobsResult` (paginated). Both
114
+ bundled queue backends (memory, BullMQ) are updated; out-of-tree
115
+ implementations will need to add it.
116
+ - `QueueStats` and `CacheStats` add a required `scope` field.
117
+ - `CacheProvider.listEntries?` (when implemented) now returns
118
+ `ListEntriesResult` instead of `CacheEntrySummary[]`.
119
+ - `JobState` adds a `"pending"` variant.
120
+
121
+ ### Patch Changes
122
+
123
+ - @checkstack/backend-api@0.15.1
124
+
125
+ ## 0.2.18
126
+
127
+ ### Patch Changes
128
+
129
+ - 50e5f5f: Runtime plugin system: install + uninstall plugins from npm, GitHub releases
130
+ (including private GitHub Enterprise instances), or tarball uploads at
131
+ runtime, with multi-package bundles, dependency-derived compatibility checks,
132
+ multi-instance coordination via a Postgres artifact store, and
133
+ single-coordinator destructive cleanup.
134
+
135
+ Highlights:
136
+
137
+ - New `PluginSource` discriminated union and `PluginInstaller` /
138
+ `PluginInstallerRegistry` interfaces in `@checkstack/backend-api`. The
139
+ GitHub variant accepts an optional `apiBaseUrl` so deployments backed by
140
+ GitHub Enterprise can install from `https://ghe.example.com/api/v3`
141
+ instead of `api.github.com`.
142
+ - New `installPackageMetadataSchema` (Zod) in `@checkstack/common` validates
143
+ every plugin's `package.json` at install time. Required fields: `name`,
144
+ `version`, `description`, `author`, `license`, `checkstack.type`,
145
+ `checkstack.pluginId`. Optional: `checkstack.bundle`,
146
+ `checkstack.usageInstructions`, `checkstack.allowInstallScripts`.
147
+ - New `pluginManagerContract` in `@checkstack/pluginmanager-common` with
148
+ `list`, `previewInstall`, `install`, `previewUninstall`, `uninstall`, and
149
+ `events` procedures.
150
+ - New `@checkstack/pluginmanager-frontend` admin UI: installed-plugins list
151
+ with per-row uninstall (typed-confirmation modal, schema/configs/cascade
152
+ toggles), install page with NPM / Tarball Upload / GitHub Release tabs
153
+ (Catalog tab disabled — coming soon), and an events page surfacing the
154
+ install/uninstall audit log.
155
+ - New `bunx @checkstack/scripts plugin-pack` CLI for plugin authors —
156
+ per-package mode produces an npm-shaped tarball; `--bundle` mode produces
157
+ an outer tarball containing every sibling declared in
158
+ `package.json#checkstack.bundle`. Published to npm so external authors
159
+ can `bunx` it directly without a workspace checkout.
160
+ - Compatibility derived from `package.json#dependencies` ranges
161
+ (`semver.satisfies` against the platform's loaded `@checkstack/*`
162
+ versions) — no separate `compatibility` field.
163
+ - Multi-instance: originator persists artifacts + `plugins` rows + broadcasts
164
+ install/uninstall; receiving instances do in-process register/unregister
165
+ only. Destructive ops (drop schema, delete plugin_configs, delete
166
+ artifacts, delete `plugins` rows) run exactly once on the originator.
167
+ - Fresh-instance bootstrap: `loadPlugins()` hydrates any
168
+ `is_uninstallable=true` plugin missing from `node_modules` from the
169
+ artifact store before normal Phase 1 register.
170
+ - New schema: `plugin_artifacts` (tarball storage), `plugin_install_events`
171
+ (audit/error log). `plugins` extended with `version`, `metadata`,
172
+ `source`, `bundle_id`, `is_primary`. Local plugin sync now writes
173
+ `version` from each plugin's `package.json` so the admin UI shows real
174
+ versions instead of `—`.
175
+ - Tarball-upload endpoint (`POST /api/pluginmanager/upload-tarball`) for
176
+ the install UI; access-gated by `pluginmanager.plugin.manage`.
177
+ - Plugin Manager menu link added to the user menu (main grid, alongside
178
+ Profile / Notification Settings / etc.).
179
+
180
+ Cross-cutting changes:
181
+
182
+ - Backend request/response logging now flows through `rootLogger` (winston)
183
+ instead of `hono/logger`. 5xx responses include the response body inline
184
+ so swallowed early-return errors are visible in the log.
185
+ - The `/api/:pluginId/*` dispatcher now logs which core service is missing
186
+ or which `pluginId` had no metadata when it 500s.
187
+ - New `registerCorePluginMetadata` on `PluginManager` for core routers
188
+ (like the plugin manager itself) that need their metadata visible to the
189
+ RPC dispatcher without going through the full plugin lifecycle.
190
+ - ESLint: `unicorn/no-null` is now disabled globally. Drizzle distinguishes
191
+ between `null` (writes a real SQL NULL) and `undefined` (skip the column
192
+ on insert), so treating them as interchangeable produced latent bugs at
193
+ the persistence boundary. The bulk of the patch-bumped packages above
194
+ reflect lint-fix touches that landed when this rule was relaxed.
195
+ - Workspace-wide license normalization to `Elastic-2.0` (matches
196
+ `LICENSE.md`). Every `package.json` in the workspace now declares the
197
+ same SPDX identifier; the patch bumps capture this.
198
+
199
+ Plugin packages (every `plugins/*`): added a `pack` npm script
200
+ (`bunx @checkstack/scripts plugin-pack`), mirrored each plugin's
201
+ `pluginId` from `plugin-metadata.ts` into `package.json#checkstack.pluginId`
202
+ so install-time validation passes, stubbed any missing required metadata
203
+ fields (`description`, `author`, `license`), and added
204
+ `checkstack.bundle` to multi-package plugin primaries (telegram, rcon, ssh,
205
+ jira, queue-bullmq, queue-memory, cache-memory).
206
+
207
+ Breaking changes:
208
+
209
+ - The legacy single-method `PluginInstaller` interface (`install(packageName)`)
210
+ is removed. Callers must use `coreServices.pluginInstallerRegistry`.
211
+ - The old `pluginAdminContract` and `createPluginAdminRouter` are removed.
212
+ Replaced by `pluginManagerContract` in `@checkstack/pluginmanager-common`
213
+ and `createPluginManagerRouter` in `core/backend`.
214
+ - `@checkstack/test-utils-backend` no longer exports
215
+ `createMockPluginInstaller` / `MockPluginInstaller` (the legacy interface
216
+ it shimmed is gone).
217
+
218
+ Note: bumps are limited to `minor` (for packages with new public API
219
+ surface) and `patch` (for downstream consumers, license normalization,
220
+ and lint fixes). No `major` bumps despite the `PluginInstaller` removal —
221
+ the legacy interface had no third-party consumers in the wild before this
222
+ runtime plugin system landed, and the contract surface is the same shape
223
+ modulo the rename.
224
+
225
+ - Updated dependencies [50e5f5f]
226
+ - @checkstack/backend-api@0.15.0
227
+
3
228
  ## 0.2.17
4
229
 
5
230
  ### Patch Changes
package/package.json CHANGED
@@ -1,18 +1,19 @@
1
1
  {
2
2
  "name": "@checkstack/queue-api",
3
- "version": "0.2.17",
3
+ "version": "0.3.0",
4
+ "license": "Elastic-2.0",
4
5
  "checkstack": {
5
6
  "type": "tooling"
6
7
  },
7
8
  "type": "module",
8
9
  "main": "src/index.ts",
9
10
  "dependencies": {
10
- "@checkstack/backend-api": "0.14.0",
11
+ "@checkstack/backend-api": "0.15.0",
11
12
  "zod": "^4.0.0"
12
13
  },
13
14
  "devDependencies": {
14
- "@checkstack/tsconfig": "0.0.5",
15
- "@checkstack/scripts": "0.1.2"
15
+ "@checkstack/tsconfig": "0.0.7",
16
+ "@checkstack/scripts": "0.3.0"
16
17
  },
17
18
  "scripts": {
18
19
  "typecheck": "tsgo -b",
@@ -1,5 +1,11 @@
1
1
  import { z } from "zod";
2
- import type { Queue, QueueStats, RecurringSchedule } from "./queue";
2
+ import type {
3
+ Queue,
4
+ QueueStats,
5
+ RecurringSchedule,
6
+ ListJobsOptions,
7
+ ListJobsResult,
8
+ } from "./queue";
3
9
  import type { Migration, Logger } from "@checkstack/backend-api";
4
10
 
5
11
  export interface QueuePlugin<Config = unknown> {
@@ -89,6 +95,12 @@ export interface QueueManager {
89
95
  */
90
96
  getAggregatedStats(): Promise<QueueStats>;
91
97
 
98
+ /**
99
+ * Aggregate {@link Queue.listJobs} across all queues, sorted into a single
100
+ * paginated list. Used by the Infrastructure runtime panel.
101
+ */
102
+ listJobs(opts: ListJobsOptions): Promise<ListJobsResult>;
103
+
92
104
  /**
93
105
  * List all recurring jobs across all queues.
94
106
  * Used for migration preview.
package/src/queue.ts CHANGED
@@ -152,6 +152,24 @@ export interface Queue<T = unknown> {
152
152
  * Get queue statistics
153
153
  */
154
154
  getStats(): Promise<QueueStats>;
155
+
156
+ /**
157
+ * List jobs in a particular state for the Infrastructure runtime panel,
158
+ * paginated.
159
+ *
160
+ * Implementations MUST:
161
+ * - Return summaries only (no payloads).
162
+ * - Honour `opts.offset` and `opts.limit`.
163
+ * - For `completed` / `failed`: return the most recent first.
164
+ * - For `active` / `waiting` / `delayed`: return the oldest first
165
+ * (FIFO order — what users typically want to see).
166
+ * - Set `total` to a cheap exact count when possible, or `null` when not.
167
+ * - Set `hasMore` correctly so the UI can disable the "next" control.
168
+ *
169
+ * Backends without affordable history (e.g. fire-and-forget transports)
170
+ * may return `{ items: [], total: 0, hasMore: false }` for terminal states.
171
+ */
172
+ listJobs(opts: ListJobsOptions): Promise<ListJobsResult>;
155
173
  }
156
174
 
157
175
  export interface QueueStats {
@@ -163,4 +181,87 @@ export interface QueueStats {
163
181
  * Number of active consumer groups
164
182
  */
165
183
  consumerGroups: number;
184
+ /**
185
+ * Whether these stats reflect the local process only (`"instance"`) or
186
+ * the entire deployment cluster (`"cluster"`). Backends that share state
187
+ * across replicas (e.g. BullMQ on Redis) report `"cluster"`. Backends
188
+ * that hold state in-process (e.g. the in-memory queue) report
189
+ * `"instance"`; in horizontally scaled deployments, each replica returns
190
+ * its own numbers.
191
+ */
192
+ scope: "instance" | "cluster";
193
+ }
194
+
195
+ /**
196
+ * Lifecycle states a job can be in. Mirrors BullMQ for consistency, with
197
+ * one virtual addition: `"pending"` is the union of `"waiting"` and
198
+ * `"delayed"`. Most operators think of "pending work" as both — the
199
+ * Infrastructure UI exposes it as a single tab, and the existing
200
+ * `QueueStats.pending` already aggregates both counts.
201
+ */
202
+ export type JobState =
203
+ | "pending"
204
+ | "waiting"
205
+ | "active"
206
+ | "delayed"
207
+ | "completed"
208
+ | "failed";
209
+
210
+ /**
211
+ * Inspection summary for a single job. Carries no payload — payloads can
212
+ * contain secrets, so they're surfaced (if at all) behind a separate
213
+ * manage-access RPC. The Infrastructure runtime panel uses these summaries
214
+ * for the Active / Failed / Completed sub-tables.
215
+ */
216
+ export interface JobSummary {
217
+ id: string;
218
+ /** Optional job/queue name for display. */
219
+ name?: string;
220
+ state: JobState;
221
+ /** Time the job was first enqueued. */
222
+ enqueuedAt: Date;
223
+ /** Time the job started processing, if applicable. */
224
+ startedAt?: Date;
225
+ /** Time the job finished (completed or failed), if applicable. */
226
+ finishedAt?: Date;
227
+ /** Attempts made so far (1-based on completion, 0 before first run). */
228
+ attempts: number;
229
+ /** Failure message, only set when `state === "failed"`. */
230
+ failedReason?: string;
231
+ /**
232
+ * For recurring schedules and delayed jobs: the absolute time of the
233
+ * next planned execution. Lets the UI render a "Next run" column for
234
+ * pending/delayed rows so cron-scheduled work (e.g. healthchecks) is
235
+ * visible between fires.
236
+ */
237
+ nextRunAt?: Date;
238
+ /**
239
+ * Whether this row represents a recurring schedule (cron / interval)
240
+ * rather than a one-shot enqueue. Used by the UI to label such rows
241
+ * distinctly so operators can tell scheduled work from ad-hoc work.
242
+ */
243
+ recurring?: boolean;
244
+ }
245
+
246
+ /**
247
+ * Options for {@link Queue.listJobs}. Offset-based pagination — backends
248
+ * that natively use cursors (BullMQ, Redis SCAN, etc.) translate internally.
249
+ */
250
+ export interface ListJobsOptions {
251
+ state: JobState;
252
+ /** Zero-based offset into the result set. */
253
+ offset: number;
254
+ /** Maximum summaries to return. Implementations should cap at a reasonable upper bound. */
255
+ limit: number;
256
+ }
257
+
258
+ /**
259
+ * Paginated result for {@link Queue.listJobs}.
260
+ */
261
+ export interface ListJobsResult {
262
+ items: JobSummary[];
263
+ /** Total job count for the requested state, or `null` if the backend can't compute it cheaply. */
264
+ total: number | null;
265
+ /** Whether more items exist past `offset + items.length`. */
266
+ hasMore: boolean;
166
267
  }