@oh-my-pi/pi-ai 13.9.1 → 13.9.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (43)
  1. package/CHANGELOG.md +100 -6
  2. package/package.json +2 -2
  3. package/src/auth-storage.ts +587 -249
  4. package/src/index.ts +2 -2
  5. package/src/model-manager.ts +5 -4
  6. package/src/model-thinking.ts +530 -0
  7. package/src/models.json +5935 -888
  8. package/src/models.ts +2 -17
  9. package/src/prompts/turn-aborted-guidance.md +4 -0
  10. package/src/provider-models/descriptors.ts +16 -6
  11. package/src/provider-models/index.ts +0 -1
  12. package/src/provider-models/openai-compat.ts +108 -25
  13. package/src/providers/amazon-bedrock.ts +30 -69
  14. package/src/providers/anthropic.ts +49 -42
  15. package/src/providers/azure-openai-responses.ts +5 -0
  16. package/src/providers/gitlab-duo.ts +1 -4
  17. package/src/providers/google-vertex.ts +4 -4
  18. package/src/providers/google.ts +4 -4
  19. package/src/providers/kimi.ts +2 -2
  20. package/src/providers/openai-codex/constants.ts +17 -1
  21. package/src/providers/openai-codex/request-transformer.ts +10 -25
  22. package/src/providers/openai-codex-responses.ts +186 -58
  23. package/src/providers/openai-completions.ts +10 -2
  24. package/src/providers/openai-responses.ts +96 -38
  25. package/src/providers/synthetic.ts +2 -2
  26. package/src/providers/transform-messages.ts +17 -9
  27. package/src/stream.ts +41 -129
  28. package/src/types.ts +51 -5
  29. package/src/usage/claude.ts +10 -86
  30. package/src/{providers/google-gemini-cli-usage.ts → usage/gemini.ts} +5 -19
  31. package/src/usage/github-copilot.ts +7 -42
  32. package/src/usage/google-antigravity.ts +4 -22
  33. package/src/usage/kimi.ts +12 -54
  34. package/src/usage/openai-codex.ts +8 -65
  35. package/src/usage/zai.ts +14 -47
  36. package/src/usage.ts +0 -18
  37. package/src/utils/oauth/alibaba-coding-plan.ts +59 -0
  38. package/src/utils/oauth/index.ts +17 -3
  39. package/src/utils/oauth/openai-codex.ts +14 -4
  40. package/src/utils/oauth/opencode.ts +0 -1
  41. package/src/utils/oauth/types.ts +3 -1
  42. package/src/provider-models/model-policies.ts +0 -94
  43. package/src/thinking.ts +0 -85
package/CHANGELOG.md CHANGED
@@ -2,32 +2,126 @@
2
2
 
3
3
  ## [Unreleased]
4
4
 
5
+ ## [13.9.3] - 2026-03-07
6
+ ### Breaking Changes
7
+
8
+ - Changed `reasoning` parameter from `ThinkingLevel | undefined` to `Effort | undefined` in `SimpleStreamOptions`; 'off' is no longer valid (omit the field instead)
9
+ - Removed `supportsXhigh()` function; check `model.thinking?.maxLevel` instead
10
+ - Removed `ThinkingLevel` and `ThinkingEffort` types; use `Effort` enum
11
+ - Removed `getAvailableThinkingLevels()` and `getAvailableThinkingEfforts()` functions
12
+ - Changed `transformRequestBody()` signature to require `Model` parameter as second argument for effort validation
13
+ - Removed `thinking.ts` module export; import from `model-thinking.ts` instead
14
+
15
+ ### Added
16
+
17
+ - Added `incremental` flag to `OpenAIResponsesHistoryPayload` to support building conversation history from multiple assistant messages instead of replacing it
18
+ - Added `dt` flag to `OpenAIResponsesHistoryPayload` for transport-level metadata
19
+ - Added `ThinkingConfig` interface to models for canonical thinking transport metadata with min/max effort levels and provider-specific mode
20
+ - Added `thinking` field to `Model` type containing per-model thinking capabilities used to clamp and map user-facing effort levels
21
+ - Added `Effort` enum (minimal, low, medium, high, xhigh) as canonical user-facing thinking levels replacing `ThinkingLevel`
22
+ - Added `enrichModelThinking()` function to automatically populate thinking metadata on models based on their capabilities
23
+ - Added `mapEffortToAnthropicAdaptiveEffort()` function to map user effort levels to Anthropic adaptive thinking effort
24
+ - Added `mapEffortToGoogleThinkingLevel()` function to map user effort levels to Google thinking levels
25
+ - Added `requireSupportedEffort()` function to validate and clamp effort levels per model, throwing errors for unsupported combinations
26
+ - Added `clampThinkingLevelForModel()` function to clamp thinking levels to model-supported range
27
+ - Added `applyGeneratedModelPolicies()` and `linkSparkPromotionTargets()` exports from model-thinking module
28
+ - Added `serviceTier` option to control OpenAI processing priority and cost (auto, default, flex, scale, priority)
29
+ - Added `providerPayload` field to messages and responses for reconstructing transport-native history
30
+ - Added Gemini usage provider for tracking quota and tier information
31
+ - Added `getCodexAccountId()` utility to extract account ID from Codex JWT tokens
32
+ - Added email extraction from OpenAI Codex OAuth tokens for credential deduplication
33
+
34
+ ### Changed
35
+
36
+ - Changed credential disabling mechanism from boolean `disabled` flag to `disabled_cause` text field for tracking why credentials were disabled
37
+ - Changed `deleteAuthCredential()` and `deleteAuthCredentialsForProvider()` methods to require a `disabledCause` parameter explaining the reason for disabling
38
+ - Changed Gemini model parsing to strip `-preview` suffix for consistent model identification
39
+ - Changed OpenAI Codex websocket error handling to detect fatal connection errors and immediately fall back to SSE without retrying
40
+ - Changed OpenAI Codex to always use websockets v2 protocol (removed v1 support)
41
+ - Changed `reasoning` parameter type from `ThinkingLevel` to `Effort` in `SimpleStreamOptions`, removing 'off' value (callers should omit the field instead)
42
+ - Changed thinking configuration to use model-specific metadata instead of hardcoded provider logic for effort mapping
43
+ - Changed OpenAI Codex request transformer to accept `Model` parameter for effort validation instead of string model ID
44
+ - Changed Anthropic provider to use model thinking metadata for determining adaptive thinking support instead of model ID pattern matching
45
+ - Changed Google Vertex and Google providers to use shorter variable names for thinking config construction
46
+ - Moved thinking-related utilities from `thinking.ts` to new `model-thinking.ts` module with expanded functionality
47
+ - Moved model policy functions from `provider-models/model-policies.ts` to `model-thinking.ts`
48
+ - Moved `googleGeminiCliUsageProvider` from `providers/google-gemini-cli-usage.ts` to `usage/gemini.ts`
49
+ - Changed default OpenAI model from gpt-5.1-codex to gpt-5.4 across all providers
50
+ - Changed `UsageFetchContext` to remove cache and now() dependencies—usage fetchers now use Date.now() directly
51
+ - Removed `resetInMs` field from usage windows; consumers should calculate from `resetsAt` timestamp
52
+ - Changed OpenAI Codex credential ranking to deduplicate by email when accountId matches
53
+ - Improved OpenAI Codex error handling with retryable error detection
54
+
55
+ ### Removed
56
+
57
+ - Removed `thinking.ts` module; use `model-thinking.ts` instead
58
+ - Removed `provider-models/model-policies.ts` module; functionality moved to `model-thinking.ts`
59
+ - Removed `supportsXhigh()` function from models.ts; use model.thinking metadata instead
60
+ - Removed `ThinkingLevel` and `ThinkingEffort` types; use `Effort` enum instead
61
+ - Removed `getAvailableThinkingLevels()` and `getAvailableThinkingEfforts()` functions
62
+ - Removed `model-policies` export from `provider-models/index.ts`
63
+ - Removed hardcoded thinking level clamping logic from OpenAI Codex request transformer; now uses model metadata
64
+ - Removed `UsageCache` and `UsageCacheEntry` interfaces—caching is now handled internally by AuthStorage
65
+ - Removed `google-gemini-cli-usage` export; use new `gemini` usage provider instead
66
+ - Removed `resetInMs` computation from all usage providers
67
+ - Removed cache TTL constants and cache management from usage fetchers (claude, github-copilot, google-antigravity, kimi, openai-codex, zai)
68
+
69
+ ### Fixed
70
+
71
+ - Fixed credential purging to respect disabled credentials when deduplicating by email, preventing re-enablement of intentionally disabled credentials
72
+ - Fixed OpenAI Codex websocket error reporting to include detailed error messages from error events
73
+ - Fixed conversation history reconstruction to support incremental updates from multiple assistant messages while maintaining backward compatibility with full-snapshot payloads
74
+ - Fixed OpenAI Codex to reject unsupported effort levels instead of silently clamping them, providing clear error messages about supported efforts
75
+ - Fixed model cache normalization to properly apply thinking enrichment when loading cached models
76
+ - Fixed dynamic model merging to apply thinking enrichment to merged model results
77
+ - Fixed OpenAI Codex streaming to properly include service_tier in SSE payloads
78
+ - Fixed type safety in OpenAI responses by removing unsafe type casts on image content blocks
79
+ - Fixed credential purging to respect disabled credentials when deduplicating by email
80
+
81
+ ## [13.9.2] - 2026-03-05
82
+
83
+ ### Added
84
+
85
+ - Support for redacted thinking blocks in Anthropic messages, enabling secure handling of encrypted reasoning content
86
+ - Preservation of latest Anthropic thinking blocks and redacted thinking content during message transformation, even when switching between Anthropic models
87
+
88
+ ### Changed
89
+
90
+ - Assistant message content now includes `RedactedThinkingContent` type alongside existing text, thinking, and tool call blocks
91
+ - Message transformation logic now preserves signed thinking blocks and redacted thinking for the latest assistant message in Anthropic conversations
92
+
93
+ ### Fixed
94
+
95
+ - Fixed Unicode normalization to consistently apply `toWellFormed()` to all text content, including thinking blocks, ensuring proper handling of malformed UTF-16 sequences
96
+
5
97
  ## [13.9.1] - 2026-03-05
98
+
6
99
  ### Breaking Changes
7
100
 
8
101
  - Removed `THINKING_LEVELS`, `ALL_THINKING_LEVELS`, `ALL_THINKING_MODES`, `THINKING_MODE_DESCRIPTIONS`, and `THINKING_MODE_LABELS` exports
9
102
  - Renamed `formatThinking()` to `getThinkingMetadata()` with changed return type from string to `ThinkingMetadata` object
10
103
  - Renamed `getAvailableThinkingLevel()` to `getAvailableThinkingLevels()` and added default parameter
11
- - Renamed `getAvailableThinkingEffort()` to `getAvailableThinkingEfforts()` and added default parameter
104
+ - Renamed `getAvailableEffort()` to `getAvailableEfforts()` and added default parameter
12
105
 
13
106
  ### Added
14
107
 
15
108
  - Added `ThinkingMetadata` type to provide structured access to thinking mode information (value, label, description)
16
109
 
17
110
  ## [13.9.0] - 2026-03-05
111
+
18
112
  ### Added
19
113
 
20
- - Exported new thinking module with `ThinkingEffort`, `ThinkingLevel`, and `ThinkingMode` types for managing reasoning effort levels
21
- - Added `getAvailableThinkingEffort()` function to determine supported thinking effort levels based on model capabilities
22
- - Added `parseThinkingEffort()`, `parseThinkingLevel()`, and `parseThinkingMode()` functions for parsing thinking configuration strings
114
+ - Exported new thinking module with `Effort`, `ThinkingLevel`, and `ThinkingMode` types for managing reasoning effort levels
115
+ - Added `getAvailableEffort()` function to determine supported thinking effort levels based on model capabilities
116
+ - Added `parseEffort()`, `parseThinkingLevel()`, and `parseThinkingMode()` functions for parsing thinking configuration strings
23
117
  - Added `THINKING_LEVELS`, `ALL_THINKING_LEVELS`, and `ALL_THINKING_MODES` constants for iterating over available thinking options
24
118
  - Added `THINKING_MODE_DESCRIPTIONS` and `THINKING_MODE_LABELS` for displaying thinking modes in user interfaces
25
119
  - Added `formatThinking()` function to format thinking modes as compact display labels
26
120
 
27
121
  ### Changed
28
122
 
29
- - Refactored thinking level handling to distinguish between `ThinkingEffort` (provider-level, no "off") and `ThinkingLevel` (user-facing, includes "off")
30
- - Updated `ThinkingBudgets` type to use `ThinkingEffort` instead of `ThinkingLevel` for more precise token budget configuration
123
+ - Refactored thinking level handling to distinguish between `Effort` (provider-level, no "off") and `ThinkingLevel` (user-facing, includes "off")
124
+ - Updated `ThinkingBudgets` type to use `Effort` instead of `ThinkingLevel` for more precise token budget configuration
31
125
  - Improved reasoning option handling to explicitly support "off" value for disabling reasoning across all providers
32
126
  - Simplified thinking effort mapping logic by centralizing provider-specific clamping behavior
33
127
 
package/package.json CHANGED
@@ -1,7 +1,7 @@
1
1
  {
2
2
  "type": "module",
3
3
  "name": "@oh-my-pi/pi-ai",
4
- "version": "13.9.1",
4
+ "version": "13.9.3",
5
5
  "description": "Unified LLM API with automatic model discovery and provider configuration",
6
6
  "homepage": "https://github.com/can1357/oh-my-pi",
7
7
  "author": "Can Boluk",
@@ -41,7 +41,7 @@
41
41
  "@aws-sdk/client-bedrock-runtime": "^3",
42
42
  "@bufbuild/protobuf": "^2.11",
43
43
  "@google/genai": "^1.43",
44
- "@oh-my-pi/pi-utils": "13.9.1",
44
+ "@oh-my-pi/pi-utils": "13.9.3",
45
45
  "@sinclair/typebox": "^0.34",
46
46
  "@smithy/node-http-handler": "^4.4",
47
47
  "ajv": "^8.18",