bedrock-wrapper 2.7.2 → 2.9.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/AGENTS.md +123 -0
- package/CHANGELOG.md +101 -4
- package/README.md +54 -32
- package/bedrock-models.js +409 -11
- package/bedrock-wrapper.js +22 -6
- package/package.json +2 -2
- package/specs--completed/llama-4-model-support/PLAN-DRAFT-20260108-130000.md +241 -0
- package/test-converse-output.txt +7477 -0
- package/test-final-output.txt +7577 -0
- package/test-run-output.txt +7629 -0
package/AGENTS.md
ADDED
@@ -0,0 +1,123 @@
+# AGENTS.md
+
+This file provides guidance to AI coding agents like Claude Code (claude.ai/code), Cursor AI, Codex, Gemini CLI, GitHub Copilot, and other AI coding assistants when working with code in this repository.
+
+## Project Purpose
+
+Bedrock Wrapper translates OpenAI-compatible API objects to AWS Bedrock's serverless inference LLMs. It acts as an adapter layer allowing applications using the OpenAI API format to seamlessly call AWS Bedrock models.
+
+## Development Commands
+
+```bash
+npm install           # Install dependencies
+npm run clean         # Clean reinstall (removes node_modules and package-lock.json)
+npm run test          # Test all models with both Invoke and Converse APIs
+npm run test:invoke   # Test with Invoke API only
+npm run test:converse # Test with Converse API only
+npm run test-vision   # Test vision capabilities
+npm run test-stop     # Test stop sequences
+npm run interactive   # Interactive CLI for testing specific models
+```
+
+## Architecture Overview
+
+```
+bedrock-wrapper.js (main entry)
+│
+├── Converse API Path (useConverseAPI: true)
+│   └── Unified format for all models
+│
+└── Invoke API Path (default)
+    └── Model-specific request/response handling
+        │
+        └── bedrock-models.js
+            └── Model configurations registry
+```
+
+### Key Functions in bedrock-wrapper.js
+
+| Function | Line | Purpose |
+|----------|------|---------|
+| `bedrockWrapper()` | ~501 | Main entry point, async generator |
+| `convertToConverseFormat()` | ~86 | OpenAI messages → Converse API format |
+| `processMessagesForInvoke()` | ~168 | Model-specific message processing |
+| `buildInvokePrompt()` | ~234 | Constructs model-specific prompts |
+| `buildInvokeRequest()` | ~300 | Creates model-specific request objects |
+| `executeInvokeAPI()` | ~409 | Handles streaming and non-streaming |
+| `findAwsModelWithId()` | ~763 | Model lookup by name or ID |
+
+### Model Configuration Schema (bedrock-models.js)
+
+Each model entry requires:
+- `modelName`: Consumer-facing name (e.g., "Claude-4-5-Sonnet")
+- `modelId`: AWS Bedrock identifier
+- `vision`: Boolean for image support
+- `messages_api`: Boolean (true = structured messages, false = prompt string)
+- `response_chunk_element`: JSON path for streaming response extraction
+- `response_nonchunk_element`: JSON path for non-streaming response
+
+### Two API Paths
+
+1. **Converse API** (`useConverseAPI: true`): Unified format, handles all models consistently
+2. **Invoke API** (default): Model-specific formatting required
+
+Some models (e.g., DeepSeek-V3.1) have `converse_api_only: true` and automatically use the Converse API.
+
+## Model Family Patterns
+
+| Family | API Type | Special Handling |
+|--------|----------|------------------|
+| Claude | Messages API | Thinking tags: `<think>`, anthropic_version required |
+| Nova | Messages API | Content as array `[{text: content}]`, schemaVersion: "messages-v1" |
+| Llama | Prompt-based | Role tags: `<\|begin_of_text\|>`, `<\|start_header_id\|>` |
+| Mistral | Prompt-based (older) / Messages (v3+) | `[INST]`/`[/INST]` tags for older models |
+| GPT-OSS | Messages API | Reasoning tags: `<reasoning>`, streaming not supported |
+| Qwen | Messages API | Standard messages format |
+| DeepSeek | Messages API | V3.1 requires Converse API only |
+| Gemma | Messages API | Standard messages format with vision |
+| Kimi | Messages API | preserve_reasoning for thinking models |
+
+## Adding a New Model
+
+1. Add entry to `bedrock_models` array in `bedrock-models.js`
+2. For prompt-based models, define all role prefix/suffix tokens
+3. For vision models, set `vision: true` and add `image_support` config
+4. For thinking models, add `thinking` config in `special_request_schema`
+5. Test with `npm run test` to verify both API paths
+
+## Key Implementation Details
+
+### Image Processing
+- Uses Sharp library to resize images to max 2048x2048
+- Converts all formats to JPEG for consistency
+- Handles base64, data URLs, and HTTP URLs
+
+### Thinking Mode
+- Claude: `<think>` tags, budget_tokens in special_request_schema
+- GPT-OSS: `<reasoning>` tags, preserve_reasoning flag
+- Temperature auto-set to 1.0, budget_tokens constrained to 80% of max_tokens
+
+### Stop Sequences
+- Claude: `stop_sequences` (up to 8,191)
+- Nova: `stopSequences` (up to 4)
+- Mistral: `stop` (up to 10)
+- Llama: Not supported by AWS Bedrock
+
+## Environment Setup
+
+Create `.env` file:
+```
+AWS_REGION=us-west-2
+AWS_ACCESS_KEY_ID=your_key
+AWS_SECRET_ACCESS_KEY=your_secret
+LLM_MAX_GEN_TOKENS=1024
+LLM_TEMPERATURE=0.2
+```
+
+## Test Output Files
+
+After running tests, check these files for results:
+- `test-models-output.txt`
+- `test-vision-models-output.txt`
+- `test-stop-sequences-output.txt`
+- `test-converse-api-output.txt`
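The model-entry schema and the `findAwsModelWithId()` lookup described in the AGENTS.md addition above can be sketched as follows. This is a minimal illustration, not the package's real code: the `response_chunk_element` / `response_nonchunk_element` values and the lookup implementation are assumptions based only on the field descriptions.

```javascript
// Hypothetical sketch of a bedrock-models.js registry entry plus a
// name-or-ID lookup, based on the schema fields AGENTS.md lists.
// Field values below are illustrative, not copied from the real registry.
const bedrock_models = [
  {
    modelName: "Claude-4-5-Sonnet",                            // consumer-facing name
    modelId: "us.anthropic.claude-sonnet-4-5-20250929-v1:0",   // AWS Bedrock identifier
    vision: true,                                              // boolean for image support
    messages_api: true,                                        // structured messages (vs. prompt string)
    response_chunk_element: "delta.text",                      // assumed JSON path for streaming chunks
    response_nonchunk_element: "content.0.text",               // assumed JSON path for full responses
  },
];

// AGENTS.md documents findAwsModelWithId() as "model lookup by name or ID";
// a straightforward reading is to match either field.
function findAwsModelWithId(nameOrId) {
  return bedrock_models.find(
    (m) => m.modelName === nameOrId || m.modelId === nameOrId
  );
}
```

Matching on either field lets callers pass whichever identifier they already have, e.g. `findAwsModelWithId("Claude-4-5-Sonnet")` and `findAwsModelWithId("us.anthropic.claude-sonnet-4-5-20250929-v1:0")` resolve to the same entry.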
package/CHANGELOG.md
CHANGED
@@ -1,8 +1,63 @@
 # Changelog
+
 All notable changes to this project will be documented in this file.
 
+## [2.9.0] - 2026-01-08 (Llama 4 Models)
+
+### ✨ Added
+
+- Support for Llama 4 Scout and Maverick models
+  - Llama-4-Scout-17b (vision support, 2K max output tokens)
+  - Llama-4-Maverick-17b (vision support, 2K max output tokens)
+  - First Llama models with multimodal/vision capabilities in this wrapper
+  - Cross-region inference profile IDs (us.meta.llama4-*)
+
+### ⚙️ Technical Details
+
+- **Vision Support**: Both models support image inputs (first Llama models with vision)
+- **API Compatibility**: Both Invoke API and Converse API paths supported
+- **Streaming**: Full streaming and non-streaming support
+- **Stop Sequences**: Not supported (AWS Bedrock limitation for all Llama models)
+
+## [2.8.0] - 2025-12-05 (New Models: Claude Opus 4.5, Gemma, Kimi, MiniMax, Mistral, Nova)
+
+### ✨ Added
+
+- Support for Claude Opus 4.5 models
+  - Claude-4-5-Opus (128K max output tokens, vision support)
+  - Claude-4-5-Opus-Thinking (with extended thinking capabilities)
+- Support for Amazon Nova 2 Lite model
+  - Nova-2-Lite (vision support, 5K max output tokens)
+- Support for Qwen3 Next model
+  - Qwen3-Next-80B-A3B (MoE architecture, 32K max output tokens)
+- Support for new Mistral models (Converse API)
+  - Mistral-Large-3 (675B parameters, vision support, 32K max output tokens)
+  - Ministral-3-3b (vision support, 8K max output tokens)
+  - Ministral-3-8b (vision support, 8K max output tokens)
+  - Ministral-3-14b (vision support, 16K max output tokens)
+  - Magistral-Small-2509 (text-only, 8K max output tokens)
+- Support for Google Gemma 3 models (new provider)
+  - Gemma-3-4b (vision support, 8K max output tokens)
+  - Gemma-3-12b (vision support, 8K max output tokens)
+  - Gemma-3-27b (vision support, 8K max output tokens)
+- Support for Moonshot AI Kimi K2 models (new provider)
+  - Kimi-K2 (1T total parameters, 32B active MoE, 32K max output tokens)
+  - Kimi-K2-Thinking (with reasoning tag preservation)
+- Support for MiniMax M2 model (new provider)
+  - MiniMax-M2 (230B total parameters, 10B active MoE, 32K max output tokens)
+
+### ⚙️ Technical Details
+
+- **New Model Families**: Google Gemma, Moonshot AI Kimi, MiniMax
+- **Vision Support**: All Gemma 3 models, Mistral-Large-3, Ministral 3 series, Nova-2-Lite
+- **Thinking Mode**: Kimi-K2-Thinking uses `preserve_reasoning: true` for reasoning tag preservation
+- **API Compatibility**: All new models use Converse API (`messages_api: true`)
+- **New Mistral Models**: Unlike older Mistral models (Invoke API), new models use Converse API
+
 ## [2.7.0] - 2025-11-18 (DeepSeek & Qwen 3)
+
 ### ✨ Added
+
 - Support for DeepSeek foundation models
   - DeepSeek-R1 (reasoning model with chain-of-thought capabilities, 8K max output tokens)
   - DeepSeek-V3.1 (hybrid thinking mode for complex reasoning, 8K max output tokens, **Converse API only**)
@@ -21,10 +76,12 @@ All notable changes to this project will be documented in this file.
 - Repository-scale code analysis capabilities for Qwen Coder models
 
 ### 🤬 Breaking Changes
+
 - Removed `top_p` parameter from all models as it is not fully supported by AWS Bedrock
   - `temperature` should always be used instead
 
 ### ⚙️ Technical Details
+
 - **Model Configuration**: All new models use messages API format (OpenAI-compatible)
 - **API Compatibility**:
   - Qwen 3 models: Support both Invoke API and Converse API
@@ -32,7 +89,9 @@ All notable changes to this project will be documented in this file.
   - DeepSeek-V3.1: Converse API only (automatically enforced)
 
 ## [2.6.2] - 2025-10-16 (Claude Haiku 4.5)
+
 ### ✨ Added
+
 - Support for Claude Haiku 4.5 models
   - Claude-4-5-Haiku
   - Claude-4-5-Haiku-Thinking
@@ -42,16 +101,21 @@ All notable changes to this project will be documented in this file.
 - Temperature/Top-P mutual exclusion parameter handling for Haiku 4.5 models
 
 ## [2.6.1] - 2025-09-30 (Claude Sonnet 4.5)
+
 ### ✨ Added
+
 - Support for Claude Sonnet 4.5 models
   - Claude-4-5-Sonnet
   - Claude-4-5-Sonnet-Thinking
 
 ## [2.5.0] - 2025-08-12 (Converse API)
+
 ### ✨ Added
+
 - Support for Converse API (streaming and non-streaming)
 
 ### ⚙️ Technical Details
+
 - **Model Configuration**: All models use standard messages API format
 - **API Compatibility**: Supports OpenAI-style requests
 - **Response Processing**: Automatic reasoning tag handling based on model variant
@@ -59,7 +123,9 @@ All notable changes to this project will be documented in this file.
 - **Testing Coverage**: Full integration with existing test suites and interactive example
 
 ## [2.4.5] - 2025-08-06 (GPT-OSS Models)
+
 ### ✨ Added
+
 - Support for OpenAI GPT-OSS models on AWS Bedrock
   - GPT-OSS-120B (120B parameter open weight model)
   - GPT-OSS-20B (20B parameter open weight model)
@@ -72,6 +138,7 @@ All notable changes to this project will be documented in this file.
 - OpenAI-compatible API format with `max_completion_tokens` parameter
 
 ### ⚙️ Technical Details
+
 - **Model Configuration**: All GPT-OSS models use standard messages API format
 - **API Compatibility**: Supports OpenAI-style requests with Apache 2.0 licensed models
 - **Response Processing**: Automatic reasoning tag handling based on model variant
@@ -79,13 +146,17 @@ All notable changes to this project will be documented in this file.
 - **Testing Coverage**: Full integration with existing test suites and interactive example
 
 ## [2.4.4] - 2025-08-05 (Claude 4.1 Opus)
+
 ### ✨ Added
+
 - Support for Claude 4.1 Opus models
   - Claude-4-1-Opus
   - Claude-4-1-Opus-Thinking
 
 ## [2.4.3] - 2025-07-31 (Stop Sequences Fixes)
+
 ### 🛠️ Fixed
+
 - **Critical Discovery**: Removed stop sequences support from Llama models
   - AWS Bedrock does not support stop sequences for Llama models (confirmed via official AWS documentation)
   - Llama models only support: `prompt`, `temperature`, `top_p`, `max_gen_len`, `images`
@@ -95,24 +166,28 @@ All notable changes to this project will be documented in this file.
 - Improved error handling for empty responses when stop sequences trigger early
 
 ### 📝 Updated
+
 - **Documentation corrections**
   - Corrected stop sequences support claims (removed "all models support" language)
   - Added accurate model-specific support matrix with sequence limits
   - Added comprehensive stop sequences support table with AWS documentation references
 - **Model Support Matrix** now clearly documented:
-  - ✅ Claude models: Full support (up to 8,191 sequences)
+  - ✅ Claude models: Full support (up to 8,191 sequences)
   - ✅ Nova models: Full support (up to 4 sequences)
   - ✅ Mistral models: Full support (up to 10 sequences)
   - ❌ Llama models: Not supported (AWS Bedrock limitation)
 
 ### ⚙️ Technical Details
+
 - Based on comprehensive research of official AWS Bedrock documentation
 - All changes maintain full backward compatibility
 - Test results show significant improvements in stop sequences reliability for supported models
 - Added detailed explanations to help users understand AWS Bedrock's actual capabilities
 
 ## [2.4.2] - 2025-07-31 (Stop Sequences Support)
+
 ### ✨ Added
+
 - Stop sequences support for compatible models
   - OpenAI-compatible `stop` and `stop_sequences` parameters
   - Automatic string-to-array conversion for compatibility
@@ -121,6 +196,7 @@ All notable changes to this project will be documented in this file.
 - Comprehensive stop sequences testing and validation with `npm run test-stop`
 
 ### 🛠️ Fixed
+
 - **Critical Discovery**: Removed stop sequences support from Llama models
   - AWS Bedrock does not support stop sequences for Llama models (confirmed via official documentation)
   - Llama models only support: `prompt`, `temperature`, `top_p`, `max_gen_len`, `images`
@@ -129,6 +205,7 @@ All notable changes to this project will be documented in this file.
 - Improved error handling for empty responses when stop sequences trigger early
 
 ### ⚙️ Technical Details
+
 - **Model Support Matrix**:
   - ✅ Claude models: Full support (up to 8,191 sequences)
   - ✅ Nova models: Full support (up to 4 sequences)
@@ -140,7 +217,9 @@ All notable changes to this project will be documented in this file.
 - Added comprehensive documentation in README.md and CLAUDE.md explaining support limitations
 
 ## [2.4.0] - 2025-07-24 (AWS Nova Models)
+
 ### ✨ Added
+
 - Support for AWS Nova models
   - Nova-Pro (300K context, multimodal, 5K output tokens)
   - Nova-Lite (300K context, multimodal, optimized for speed)
@@ -150,7 +229,9 @@ All notable changes to this project will be documented in this file.
 - Automatic content array formatting for Nova message compatibility
 
 ## [2.3.1] - 2025-05-22 (Claude 4 Opus / Sonnet)
+
 ### ✨ Added
+
 - Support for Claude 4 Opus & Claude 4 Sonnet models
   - Claude-4-Opus
   - Claude-4-Opus-Thinking
@@ -158,7 +239,9 @@ All notable changes to this project will be documented in this file.
   - Claude-4-Sonnet-Thinking
 
 ## [2.3.0] - 2025-02-15 (Claude 3.7 & Image Support)
+
 ### ✨ Added
+
 - Support for Claude 3.7 models
   - Claude-3-7-Sonnet
   - Claude-3-7-Sonnet-Thinking
@@ -171,29 +254,37 @@ All notable changes to this project will be documented in this file.
 - Documentation for image support usage
 
 ### 🔄 Changed
+
 - Updated model configuration for image-capable models
 - Improved response handling for multimodal inputs
 
 ## [2.2.0] - 2025-01-01 (Llama 3.3 70b)
+
 ### ✨ Added
+
 - Support for Llama 3.3 70b
 
 ## [2.1.0] - 2024-11-21 (Claude 3.5 Haiku)
+
 ### ✨ Added
+
 - Support for Claude 3.5 Haiku
 
 ## [2.0.0] - 2024-10-31 (Claude Sonnet & Haiku)
+
 ### ✨ Added
+
 - Support for Anthropic Sonnet & Haiku models
   - Claude-3-5-Sonnet-v2
   - Claude-3-5-Sonnet
   - Claude-3-Haiku
 - Interactive example script for testing models
 - Testing script with streaming and non-streaming support for all models
-- Standardize output to be a string via Streamed and non-Streamed responses
+- Standardize output to be a string via Streamed and non-Streamed responses
 > **NOTE:** This is a breaking change for previous non-streaming responses. Existing streaming responses will remain unchanged.
 
 ### 🔄 Changed
+
 - Complete architecture overhaul for better model support
 - Improved message handling with role-based formatting
 - Enhanced error handling and response processing
@@ -201,6 +292,7 @@ All notable changes to this project will be documented in this file.
 - Updated AWS SDK integration
 
 ### ⚙️ Technical Details
+
 - Implemented messages API support for compatible models
 - Added system message handling as separate field where supported
 - Configurable token limits per model
@@ -208,7 +300,9 @@ All notable changes to this project will be documented in this file.
 - Cross-region profile support for certain models
 
 ## [1.3.0] - 2024-07-24 (Llama3.2)
+
 ### ✨ Added
+
 - Support for Llama 3.2 series models
   - Llama-3-2-1b
   - Llama-3-2-3b
@@ -216,18 +310,21 @@ All notable changes to this project will be documented in this file.
   - Llama-3-2-90b
 
 ## [1.1.0] - 2024-07-24 (Llama3.1)
+
 ### ✨ Added
+
 - Support for Llama 3.1 series models
   - Llama-3-1-8b
   - Llama-3-1-70b
 
-
 ## [1.0.14] - 2024-05-06 (Initial Stable Release)
+
 ### ✨ Added
+
 - Initial stable release of Bedrock Wrapper
 - Basic AWS Bedrock integration
 - OpenAI-compatible API object support
-- Basic model support
+- Basic model support
   - Llama-3-8b
   - Llama-3-70b
   - Mistral-7b
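The stop-sequence behavior documented in the changelog above (OpenAI-compatible `stop`/`stop_sequences`, automatic string-to-array conversion, per-family limits, and no support for Llama) can be sketched as follows. The function and constant names are illustrative assumptions; the real handling lives inside `bedrock-wrapper.js`.

```javascript
// Hypothetical sketch of stop-sequence normalization per the changelog:
// Claude accepts up to 8,191 sequences, Nova up to 4, Mistral up to 10,
// and Llama gets none (an AWS Bedrock limitation).
const STOP_LIMITS = { claude: 8191, nova: 4, mistral: 10 };

function normalizeStopSequences(family, stop) {
  if (stop == null) return undefined;
  // Families without an entry (e.g. "llama") do not support stop sequences.
  if (!(family in STOP_LIMITS)) return undefined;
  // Automatic string-to-array conversion for OpenAI compatibility.
  const sequences = Array.isArray(stop) ? stop : [stop];
  // Cap at the per-family limit documented for AWS Bedrock.
  return sequences.slice(0, STOP_LIMITS[family]);
}
```

Capping rather than rejecting oversized arrays matches the wrapper's general approach of silently adapting OpenAI-style input to what each Bedrock model family accepts.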
package/README.md
CHANGED
@@ -122,43 +122,59 @@ Bedrock Wrapper is an npm package that simplifies the integration of existing Op
 
 ### Supported Models
 
-| modelName | AWS Model Id
-
-| Claude-3-5-Haiku | anthropic.claude-3-5-haiku-20241022-v1:0 | ❌ |
-| Claude-3-5-Sonnet | anthropic.claude-3-5-sonnet-20240620-v1:0 | ✅ |
-| Claude-3-5-Sonnet-v2 | anthropic.claude-3-5-sonnet-20241022-v2:0 | ✅ |
-| Claude-3-7-Sonnet | us.anthropic.claude-3-7-sonnet-20250219-v1:0
-| Claude-3-7-Sonnet-Thinking | us.anthropic.claude-3-7-sonnet-20250219-v1:0
-| Claude-3-Haiku | anthropic.claude-3-haiku-20240307-v1:0 | ✅ |
-| Claude-4-Opus | us.anthropic.claude-opus-4-20250514-v1:0
-| Claude-4-Opus-Thinking | us.anthropic.claude-opus-4-20250514-v1:0
-| Claude-4-Sonnet | us.anthropic.claude-sonnet-4-20250514-v1:0
-| Claude-4-Sonnet-Thinking | us.anthropic.claude-sonnet-4-20250514-v1:0
-| Claude-4-1-Opus | us.anthropic.claude-opus-4-1-20250805-v1:0
-| Claude-4-1-Opus-Thinking | us.anthropic.claude-opus-4-1-20250805-v1:0
-| Claude-4-5-Haiku |
-| Claude-4-5-Haiku-Thinking |
-| Claude-4-5-
-| Claude-4-5-
-
-
-
-
-
-
-
-
-
-
-
+| modelName | AWS Model Id | Image |
+|----------------------------|-------------------------------------------------|-------|
+| Claude-3-5-Haiku | us.anthropic.claude-3-5-haiku-20241022-v1:0 | ❌ |
+| Claude-3-5-Sonnet | us.anthropic.claude-3-5-sonnet-20240620-v1:0 | ✅ |
+| Claude-3-5-Sonnet-v2 | us.anthropic.claude-3-5-sonnet-20241022-v2:0 | ✅ |
+| Claude-3-7-Sonnet | us.anthropic.claude-3-7-sonnet-20250219-v1:0 | ✅ |
+| Claude-3-7-Sonnet-Thinking | us.anthropic.claude-3-7-sonnet-20250219-v1:0 | ✅ |
+| Claude-3-Haiku | us.anthropic.claude-3-haiku-20240307-v1:0 | ✅ |
+| Claude-4-Opus | us.anthropic.claude-opus-4-20250514-v1:0 | ✅ |
+| Claude-4-Opus-Thinking | us.anthropic.claude-opus-4-20250514-v1:0 | ✅ |
+| Claude-4-Sonnet | us.anthropic.claude-sonnet-4-20250514-v1:0 | ✅ |
+| Claude-4-Sonnet-Thinking | us.anthropic.claude-sonnet-4-20250514-v1:0 | ✅ |
+| Claude-4-1-Opus | us.anthropic.claude-opus-4-1-20250805-v1:0 | ✅ |
+| Claude-4-1-Opus-Thinking | us.anthropic.claude-opus-4-1-20250805-v1:0 | ✅ |
+| Claude-4-5-Haiku | global.anthropic.claude-haiku-4-5-20251001-v1:0 | ✅ |
+| Claude-4-5-Haiku-Thinking | global.anthropic.claude-haiku-4-5-20251001-v1:0 | ✅ |
+| Claude-4-5-Opus | global.anthropic.claude-opus-4-5-20251101-v1:0 | ✅ |
+| Claude-4-5-Opus-Thinking | global.anthropic.claude-opus-4-5-20251101-v1:0 | ✅ |
+| Claude-4-5-Sonnet | us.anthropic.claude-sonnet-4-5-20250929-v1:0 | ✅ |
+| Claude-4-5-Sonnet-Thinking | us.anthropic.claude-sonnet-4-5-20250929-v1:0 | ✅ |
+| DeepSeek-R1 | us.deepseek.r1-v1:0 | ❌ |
+| DeepSeek-V3.1 | deepseek.v3-v1:0 | ❌ |
+| Gemma-3-4b | google.gemma-3-4b-it | ✅ |
+| Gemma-3-12b | google.gemma-3-12b-it | ✅ |
+| Gemma-3-27b | google.gemma-3-27b-it | ✅ |
+| GPT-OSS-120B | openai.gpt-oss-120b-1:0 | ❌ |
+| GPT-OSS-120B-Thinking | openai.gpt-oss-120b-1:0 | ❌ |
+| GPT-OSS-20B | openai.gpt-oss-20b-1:0 | ❌ |
+| GPT-OSS-20B-Thinking | openai.gpt-oss-20b-1:0 | ❌ |
+| Kimi-K2 | moonshot.kimi-k2-thinking | ❌ |
+| Kimi-K2-Thinking | moonshot.kimi-k2-thinking | ❌ |
+| Llama-3-8b | meta.llama3-8b-instruct-v1:0 | ❌ |
+| Llama-3-70b | meta.llama3-70b-instruct-v1:0 | ❌ |
+| Llama-3-1-8b | us.meta.llama3-1-8b-instruct-v1:0 | ❌ |
+| Llama-3-1-70b | us.meta.llama3-1-70b-instruct-v1:0 | ❌ |
+| Llama-3-1-405b | meta.llama3-1-405b-instruct-v1:0 | ❌ |
 | Llama-3-2-1b | us.meta.llama3-2-1b-instruct-v1:0 | ❌ |
 | Llama-3-2-3b | us.meta.llama3-2-3b-instruct-v1:0 | ❌ |
 | Llama-3-2-11b | us.meta.llama3-2-11b-instruct-v1:0 | ❌ |
 | Llama-3-2-90b | us.meta.llama3-2-90b-instruct-v1:0 | ❌ |
 | Llama-3-3-70b | us.meta.llama3-3-70b-instruct-v1:0 | ❌ |
+| Llama-4-Scout-17b | us.meta.llama4-scout-17b-instruct-v1:0 | ✅ |
+| Llama-4-Maverick-17b | us.meta.llama4-maverick-17b-instruct-v1:0 | ✅ |
+| Magistral-Small-2509 | mistral.magistral-small-2509 | ❌ |
+| MiniMax-M2 | minimax.minimax-m2 | ❌ |
+| Ministral-3-3b | mistral.ministral-3-3b-instruct | ✅ |
+| Ministral-3-8b | mistral.ministral-3-8b-instruct | ✅ |
+| Ministral-3-14b | mistral.ministral-3-14b-instruct | ✅ |
 | Mistral-7b | mistral.mistral-7b-instruct-v0:2 | ❌ |
-| Mixtral-8x7b | mistral.mixtral-8x7b-instruct-v0:1 | ❌ |
 | Mistral-Large | mistral.mistral-large-2402-v1:0 | ❌ |
+| Mistral-Large-3 | mistral.mistral-large-3-675b-instruct | ✅ |
+| Mixtral-8x7b | mistral.mixtral-8x7b-instruct-v0:1 | ❌ |
+| Nova-2-Lite | us.amazon.nova-2-lite-v1:0 | ✅ |
 | Nova-Micro | us.amazon.nova-micro-v1:0 | ❌ |
 | Nova-Lite | us.amazon.nova-lite-v1:0 | ✅ |
 | Nova-Pro | us.amazon.nova-pro-v1:0 | ✅ |
@@ -166,6 +182,7 @@ Bedrock Wrapper is an npm package that simplifies the integration of existing Op
 | Qwen3-235B-A22B-2507 | qwen.qwen3-235b-a22b-2507-v1:0 | ❌ |
 | Qwen3-Coder-30B-A3B | qwen.qwen3-coder-30b-a3b-v1:0 | ❌ |
 | Qwen3-Coder-480B-A35B | qwen.qwen3-coder-480b-a35b-v1:0 | ❌ |
+| Qwen3-Next-80B-A3B | qwen.qwen3-next-80b-a3b | ❌ |
 
 To return the list programmatically you can import and call `listBedrockWrapperSupportedModels`:
 ```javascript
@@ -181,8 +198,9 @@ Please modify the `bedrock_models.js` file and submit a PR 🏆 or create an Iss
 ### Thinking Models
 
 Some models support extended reasoning capabilities through "thinking mode". These models include:
-- **Claude models**: Claude-4-1-Opus-Thinking, Claude-4-Opus-Thinking, Claude-4-5-Sonnet-Thinking, Claude-4-5-Haiku-Thinking, Claude-4-Sonnet-Thinking, Claude-3-7-Sonnet-Thinking
+- **Claude models**: Claude-4-5-Opus-Thinking, Claude-4-1-Opus-Thinking, Claude-4-Opus-Thinking, Claude-4-5-Sonnet-Thinking, Claude-4-5-Haiku-Thinking, Claude-4-Sonnet-Thinking, Claude-3-7-Sonnet-Thinking
 - **GPT-OSS models**: GPT-OSS-120B-Thinking, GPT-OSS-20B-Thinking
+- **Kimi models**: Kimi-K2-Thinking (preserves reasoning tags in output)
 
 To use thinking mode and see the model's reasoning process, set `include_thinking_data: true` in your request:
 
@@ -213,7 +231,7 @@ for await (const chunk of bedrockWrapper(awsCreds, openaiChatCompletionsCreateOb
 
 ### Image Support
 
-For models with image support (Claude 4+ series including Claude 4.5 Sonnet, Claude 4.5 Haiku, Claude 3.7 Sonnet, Claude 3.5 Sonnet, Claude 3 Haiku, Nova Pro,
+For models with image support (Claude 4+ series including Claude 4.5 Opus, Claude 4.5 Sonnet, Claude 4.5 Haiku, Claude 3.7 Sonnet, Claude 3.5 Sonnet, Claude 3 Haiku, Nova Pro, Nova Lite, Nova 2 Lite, Mistral Large 3, Ministral 3 series, Gemma 3 series, and Llama 4 series), you can include images in your messages using the following format (not all models support system prompts):
 
 ```javascript
 messages = [
@@ -280,6 +298,9 @@ const openaiChatCompletionsCreateObject = {
 - ✅ **GPT-OSS models**: Fully supported
 - ✅ **Mistral models**: Fully supported (up to 10 sequences)
 - ✅ **Qwen models**: Fully supported
+- ✅ **Gemma models**: Fully supported
+- ✅ **Kimi models**: Fully supported
+- ✅ **MiniMax models**: Fully supported
 - ❌ **Llama models**: Not supported (AWS Bedrock limitation)
 
 **Features:**
@@ -310,6 +331,7 @@ Some AWS Bedrock models have specific parameter restrictions that are automatica
 #### Claude 4+ Models (Temperature/Top-P Mutual Exclusion)
 
 **Affected Models:**
+- Claude-4-5-Opus & Claude-4-5-Opus-Thinking
 - Claude-4-5-Sonnet & Claude-4-5-Sonnet-Thinking
 - Claude-4-5-Haiku & Claude-4-5-Haiku-Thinking
 - Claude-4-Sonnet & Claude-4-Sonnet-Thinking
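The temperature/top-p mutual exclusion the README describes for Claude 4+ models can be sketched as a small parameter-sanitization step. This is an assumption about how such handling might look, not the package's actual implementation; preferring `temperature` follows the changelog's guidance that `temperature` should always be used instead of `top_p`.

```javascript
// Hypothetical sketch: for model families where temperature and top_p are
// mutually exclusive (Claude 4+ per the README), drop top_p when both are
// supplied, keeping temperature as the changelog recommends.
function sanitizeSamplingParams(params) {
  const { temperature, top_p, ...rest } = params;
  if (temperature !== undefined && top_p !== undefined) {
    return { ...rest, temperature }; // mutual exclusion: prefer temperature
  }
  return { ...params };
}
```

Handling this inside the wrapper means callers can keep sending unmodified OpenAI-style request objects and still get a request AWS Bedrock will accept.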
|