@jsonstudio/rcc 0.89.1189 → 0.89.1348
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +17 -0
- package/configsamples/config.json +426 -0
- package/configsamples/config.reference.json +58 -0
- package/configsamples/provider/crs/config.v1.json +46 -0
- package/configsamples/provider/glm/config.v1.json +81 -0
- package/configsamples/provider/glm-anthropic/config.v1.json +45 -0
- package/configsamples/provider/iflow/config.v1.json +74 -0
- package/configsamples/provider/kimi/config.v1.json +41 -0
- package/configsamples/provider/lmstudio/config.v1.json +101 -0
- package/configsamples/provider/mimo/config.v1.json +35 -0
- package/configsamples/provider/modelscope/config.v1.json +96 -0
- package/configsamples/provider/qwen/config.v1.json +38 -0
- package/configsamples/provider/tab/config.v1.json +50 -0
- package/configsamples/provider/tabglm/config.v1.json +49 -0
- package/dist/build-info.js +2 -2
- package/dist/cli/commands/code.js +12 -6
- package/dist/cli/commands/code.js.map +1 -1
- package/dist/cli/commands/config.d.ts +2 -1
- package/dist/cli/commands/config.js +74 -103
- package/dist/cli/commands/config.js.map +1 -1
- package/dist/cli/commands/examples.js +6 -6
- package/dist/cli/commands/examples.js.map +1 -1
- package/dist/cli/commands/init.d.ts +28 -0
- package/dist/cli/commands/init.js +91 -0
- package/dist/cli/commands/init.js.map +1 -0
- package/dist/cli/commands/port.js +10 -2
- package/dist/cli/commands/port.js.map +1 -1
- package/dist/cli/commands/restart.js +5 -2
- package/dist/cli/commands/restart.js.map +1 -1
- package/dist/cli/commands/start.js +25 -22
- package/dist/cli/commands/start.js.map +1 -1
- package/dist/cli/commands/status.js +1 -0
- package/dist/cli/commands/status.js.map +1 -1
- package/dist/cli/commands/stop.js +1 -0
- package/dist/cli/commands/stop.js.map +1 -1
- package/dist/cli/config/bundled-docs.d.ts +20 -0
- package/dist/cli/config/bundled-docs.js +91 -0
- package/dist/cli/config/bundled-docs.js.map +1 -0
- package/dist/cli/config/init-config.d.ts +36 -0
- package/dist/cli/config/init-config.js +180 -0
- package/dist/cli/config/init-config.js.map +1 -0
- package/dist/cli/config/init-provider-catalog.d.ts +8 -0
- package/dist/cli/config/init-provider-catalog.js +187 -0
- package/dist/cli/config/init-provider-catalog.js.map +1 -0
- package/dist/cli/register/init-command.d.ts +3 -0
- package/dist/cli/register/init-command.js +5 -0
- package/dist/cli/register/init-command.js.map +1 -0
- package/dist/cli.js +28 -3
- package/dist/cli.js.map +1 -1
- package/dist/client/gemini-cli/gemini-cli-protocol-client.js +1 -1
- package/dist/client/gemini-cli/gemini-cli-protocol-client.js.map +1 -1
- package/dist/config/risk-control-config.d.ts +94 -0
- package/dist/config/risk-control-config.js +196 -0
- package/dist/config/risk-control-config.js.map +1 -0
- package/dist/constants/index.d.ts +6 -0
- package/dist/constants/index.js +13 -0
- package/dist/constants/index.js.map +1 -1
- package/dist/docs/daemon-admin-ui.html +2113 -190
- package/dist/index.js +0 -1
- package/dist/index.js.map +1 -1
- package/dist/manager/modules/health/index.d.ts +1 -1
- package/dist/manager/modules/quota/antigravity-quota-manager.d.ts +70 -0
- package/dist/manager/modules/quota/antigravity-quota-manager.js +442 -0
- package/dist/manager/modules/quota/antigravity-quota-manager.js.map +1 -0
- package/dist/manager/modules/quota/index.d.ts +3 -127
- package/dist/manager/modules/quota/index.js +2 -1093
- package/dist/manager/modules/quota/index.js.map +1 -1
- package/dist/manager/modules/quota/provider-key-normalization.d.ts +3 -0
- package/dist/manager/modules/quota/provider-key-normalization.js +155 -0
- package/dist/manager/modules/quota/provider-key-normalization.js.map +1 -0
- package/dist/manager/modules/quota/provider-quota-daemon.cooldown.d.ts +9 -0
- package/dist/manager/modules/quota/provider-quota-daemon.cooldown.js +115 -0
- package/dist/manager/modules/quota/provider-quota-daemon.cooldown.js.map +1 -0
- package/dist/manager/modules/quota/provider-quota-daemon.d.ts +77 -0
- package/dist/manager/modules/quota/provider-quota-daemon.events.d.ts +12 -0
- package/dist/manager/modules/quota/provider-quota-daemon.events.js +237 -0
- package/dist/manager/modules/quota/provider-quota-daemon.events.js.map +1 -0
- package/dist/manager/modules/quota/provider-quota-daemon.js +404 -0
- package/dist/manager/modules/quota/provider-quota-daemon.js.map +1 -0
- package/dist/manager/modules/quota/provider-quota-daemon.model-backoff.d.ts +11 -0
- package/dist/manager/modules/quota/provider-quota-daemon.model-backoff.js +189 -0
- package/dist/manager/modules/quota/provider-quota-daemon.model-backoff.js.map +1 -0
- package/dist/manager/modules/quota/provider-quota-daemon.snapshot.d.ts +8 -0
- package/dist/manager/modules/quota/provider-quota-daemon.snapshot.js +96 -0
- package/dist/manager/modules/quota/provider-quota-daemon.snapshot.js.map +1 -0
- package/dist/manager/modules/quota/provider-quota-daemon.view.d.ts +19 -0
- package/dist/manager/modules/quota/provider-quota-daemon.view.js +37 -0
- package/dist/manager/modules/quota/provider-quota-daemon.view.js.map +1 -0
- package/dist/manager/modules/routing/index.d.ts +1 -0
- package/dist/manager/modules/routing/index.js +11 -25
- package/dist/manager/modules/routing/index.js.map +1 -1
- package/dist/manager/quota/provider-quota-center.d.ts +2 -0
- package/dist/manager/quota/provider-quota-center.js +80 -82
- package/dist/manager/quota/provider-quota-center.js.map +1 -1
- package/dist/modules/llmswitch/bridge.d.ts +16 -18
- package/dist/modules/llmswitch/bridge.js +314 -71
- package/dist/modules/llmswitch/bridge.js.map +1 -1
- package/dist/modules/llmswitch/core-loader.d.ts +4 -2
- package/dist/modules/llmswitch/core-loader.js +32 -20
- package/dist/modules/llmswitch/core-loader.js.map +1 -1
- package/dist/modules/pipeline/utils/colored-logger.js +3 -2
- package/dist/modules/pipeline/utils/colored-logger.js.map +1 -1
- package/dist/modules/pipeline/utils/debug-logger.js +1 -1
- package/dist/modules/pipeline/utils/debug-logger.js.map +1 -1
- package/dist/providers/auth/iflow-cookie-auth.js +0 -2
- package/dist/providers/auth/iflow-cookie-auth.js.map +1 -1
- package/dist/providers/auth/oauth-lifecycle.js +2 -23
- package/dist/providers/auth/oauth-lifecycle.js.map +1 -1
- package/dist/providers/core/config/camoufox-launcher.js +35 -4
- package/dist/providers/core/config/camoufox-launcher.js.map +1 -1
- package/dist/providers/core/runtime/antigravity-quota-client.js +6 -3
- package/dist/providers/core/runtime/antigravity-quota-client.js.map +1 -1
- package/dist/providers/core/runtime/base-provider.d.ts +2 -2
- package/dist/providers/core/runtime/base-provider.js +74 -69
- package/dist/providers/core/runtime/base-provider.js.map +1 -1
- package/dist/providers/core/runtime/gemini-cli-http-provider.js +6 -4
- package/dist/providers/core/runtime/gemini-cli-http-provider.js.map +1 -1
- package/dist/providers/core/runtime/http-request-executor.js +2 -2
- package/dist/providers/core/runtime/http-request-executor.js.map +1 -1
- package/dist/providers/core/runtime/http-transport-provider.d.ts +14 -0
- package/dist/providers/core/runtime/http-transport-provider.js +111 -5
- package/dist/providers/core/runtime/http-transport-provider.js.map +1 -1
- package/dist/providers/core/runtime/provider-error-classifier.js +10 -0
- package/dist/providers/core/runtime/provider-error-classifier.js.map +1 -1
- package/dist/providers/core/runtime/provider-factory.js +7 -5
- package/dist/providers/core/runtime/provider-factory.js.map +1 -1
- package/dist/providers/core/runtime/provider-runtime-metadata.d.ts +6 -0
- package/dist/providers/core/runtime/provider-runtime-metadata.js.map +1 -1
- package/dist/providers/core/runtime/responses-provider.d.ts +1 -7
- package/dist/providers/core/runtime/responses-provider.js +12 -93
- package/dist/providers/core/runtime/responses-provider.js.map +1 -1
- package/dist/providers/core/strategies/oauth-auth-code-flow.js +12 -8
- package/dist/providers/core/strategies/oauth-auth-code-flow.js.map +1 -1
- package/dist/providers/core/utils/http-client.js +16 -3
- package/dist/providers/core/utils/http-client.js.map +1 -1
- package/dist/providers/core/utils/provider-error-logger.d.ts +1 -1
- package/dist/providers/core/utils/provider-error-reporter.d.ts +3 -1
- package/dist/providers/core/utils/provider-error-reporter.js +3 -0
- package/dist/providers/core/utils/provider-error-reporter.js.map +1 -1
- package/dist/providers/core/utils/snapshot-writer.js +1 -4
- package/dist/providers/core/utils/snapshot-writer.js.map +1 -1
- package/dist/providers/mock/mock-provider-runtime.js +57 -27
- package/dist/providers/mock/mock-provider-runtime.js.map +1 -1
- package/dist/scripts/camoufox/launch-auth.mjs +193 -58
- package/dist/server/handlers/handler-utils.js +3 -2
- package/dist/server/handlers/handler-utils.js.map +1 -1
- package/dist/server/runtime/http-server/daemon-admin/auth-handler.d.ts +2 -0
- package/dist/server/runtime/http-server/daemon-admin/auth-handler.js +103 -0
- package/dist/server/runtime/http-server/daemon-admin/auth-handler.js.map +1 -0
- package/dist/server/runtime/http-server/daemon-admin/auth-session.d.ts +5 -0
- package/dist/server/runtime/http-server/daemon-admin/auth-session.js +77 -0
- package/dist/server/runtime/http-server/daemon-admin/auth-session.js.map +1 -0
- package/dist/server/runtime/http-server/daemon-admin/auth-store.d.ts +18 -0
- package/dist/server/runtime/http-server/daemon-admin/auth-store.js +89 -0
- package/dist/server/runtime/http-server/daemon-admin/auth-store.js.map +1 -0
- package/dist/server/runtime/http-server/daemon-admin/credentials-handler.js +1 -2
- package/dist/server/runtime/http-server/daemon-admin/credentials-handler.js.map +1 -1
- package/dist/server/runtime/http-server/daemon-admin/providers-handler.js +226 -24
- package/dist/server/runtime/http-server/daemon-admin/providers-handler.js.map +1 -1
- package/dist/server/runtime/http-server/daemon-admin/quota-handler.js +47 -8
- package/dist/server/runtime/http-server/daemon-admin/quota-handler.js.map +1 -1
- package/dist/server/runtime/http-server/daemon-admin/restart-handler.js +1 -1
- package/dist/server/runtime/http-server/daemon-admin/restart-handler.js.map +1 -1
- package/dist/server/runtime/http-server/daemon-admin/stats-handler.js +1 -1
- package/dist/server/runtime/http-server/daemon-admin/stats-handler.js.map +1 -1
- package/dist/server/runtime/http-server/daemon-admin/status-handler.js +68 -4
- package/dist/server/runtime/http-server/daemon-admin/status-handler.js.map +1 -1
- package/dist/server/runtime/http-server/daemon-admin-routes.d.ts +3 -4
- package/dist/server/runtime/http-server/daemon-admin-routes.js +9 -14
- package/dist/server/runtime/http-server/daemon-admin-routes.js.map +1 -1
- package/dist/server/runtime/http-server/executor-metadata.js +1 -1
- package/dist/server/runtime/http-server/executor-metadata.js.map +1 -1
- package/dist/server/runtime/http-server/executor-response.js +0 -16
- package/dist/server/runtime/http-server/executor-response.js.map +1 -1
- package/dist/server/runtime/http-server/hub-shadow-compare.d.ts +18 -0
- package/dist/server/runtime/http-server/hub-shadow-compare.js +256 -0
- package/dist/server/runtime/http-server/hub-shadow-compare.js.map +1 -0
- package/dist/server/runtime/http-server/index.d.ts +7 -2
- package/dist/server/runtime/http-server/index.js +287 -49
- package/dist/server/runtime/http-server/index.js.map +1 -1
- package/dist/server/runtime/http-server/middleware.js +19 -1
- package/dist/server/runtime/http-server/middleware.js.map +1 -1
- package/dist/server/runtime/http-server/request-executor.js +10 -19
- package/dist/server/runtime/http-server/request-executor.js.map +1 -1
- package/dist/server/runtime/http-server/routes.js +8 -2
- package/dist/server/runtime/http-server/routes.js.map +1 -1
- package/dist/server/runtime/http-server/session-dir.d.ts +2 -0
- package/dist/server/runtime/http-server/session-dir.js +59 -0
- package/dist/server/runtime/http-server/session-dir.js.map +1 -0
- package/dist/server/runtime/http-server/types.d.ts +0 -4
- package/dist/server/utils/utf8-chunk-buffer.js +6 -3
- package/dist/server/utils/utf8-chunk-buffer.js.map +1 -1
- package/dist/server/utils/warmup-storm-tracker.js +1 -1
- package/dist/server/utils/warmup-storm-tracker.js.map +1 -1
- package/dist/server-factory.d.ts +6 -28
- package/dist/server-factory.js +8 -93
- package/dist/server-factory.js.map +1 -1
- package/dist/token-daemon/index.js +2 -2
- package/dist/token-daemon/index.js.map +1 -1
- package/dist/token-daemon/provider-registry.js +0 -1
- package/dist/token-daemon/provider-registry.js.map +1 -1
- package/dist/token-daemon/server-utils.js +8 -9
- package/dist/token-daemon/server-utils.js.map +1 -1
- package/dist/token-daemon/token-utils.js +1 -1
- package/dist/token-daemon/token-utils.js.map +1 -1
- package/dist/tools/semantic-replay.js +2 -2
- package/dist/tools/semantic-replay.js.map +1 -1
- package/dist/tools/stats-request-events.d.ts +1 -1
- package/dist/tools/stats-usage.js +6 -3
- package/dist/tools/stats-usage.js.map +1 -1
- package/dist/utils/errorsamples.d.ts +5 -0
- package/dist/utils/errorsamples.js +27 -0
- package/dist/utils/errorsamples.js.map +1 -0
- package/dist/utils/llms-engine-shadow.d.ts +19 -0
- package/dist/utils/llms-engine-shadow.js +209 -0
- package/dist/utils/llms-engine-shadow.js.map +1 -0
- package/dist/utils/runtime-versions.d.ts +1 -0
- package/dist/utils/runtime-versions.js +39 -0
- package/dist/utils/runtime-versions.js.map +1 -0
- package/docs/ARCHITECTURE.md +402 -0
- package/docs/CODEX_AND_CLAUDE_CODE.md +69 -0
- package/docs/CONFIG_ARCHITECTURE.md +517 -0
- package/docs/ERROR_HANDLING_AUDIT.md +0 -0
- package/docs/GCLI2API_PARITY_GAPS.md +98 -0
- package/docs/INSTALLATION_AND_QUICKSTART.md +74 -0
- package/docs/INSTRUCTION_MARKUP.md +89 -0
- package/docs/MODULE_ENHANCEMENT_SYSTEM.md +666 -0
- package/docs/PORTS.md +36 -0
- package/docs/PROVIDERS_BUILTIN.md +111 -0
- package/docs/PROVIDER_TYPES.md +55 -0
- package/docs/SERVERTOOL_CLOCK_DESIGN.md +233 -0
- package/docs/USAGE_HANDLING_ANALYSIS.md +335 -0
- package/docs/USER_CONFIG_PARSER_CHANGES.md +175 -0
- package/docs/V3_INBOUND_OUTBOUND_DESIGN.md +86 -0
- package/docs/VIRTUAL_ROUTER_PRIORITY_AND_HEALTH.md +125 -0
- package/docs/anthropic-request-golden-samples.md +50 -0
- package/docs/ccr-alignment-enhancetool.md +105 -0
- package/docs/chat-glm-500-analysis.md +79 -0
- package/docs/chat-request-golden-samples.md +42 -0
- package/docs/chat-semantic-expansion-plan.md +82 -0
- package/docs/cli-command-inventory.md +76 -0
- package/docs/codex-samples-replay.md +50 -0
- package/docs/daemon-admin-api-design.md +350 -0
- package/docs/daemon-admin-module-structure.md +169 -0
- package/docs/daemon-admin-ui.html +3394 -0
- package/docs/debug-system-design.md +734 -0
- package/docs/debugging/gemini-sse-root-cause.md +52 -0
- package/docs/debugging/sse_encoding_failure_analysis.md +53 -0
- package/docs/dry-run/README.md +721 -0
- package/docs/error-handling-v2.md +92 -0
- package/docs/exec-command-guard-policy.example.v1.json +42 -0
- package/docs/fixes/gemini-protocol-mapping.md +57 -0
- package/docs/fixes/oauth-portal-timing-fix.md +202 -0
- package/docs/fixes/web-search-hop3-fix.md +265 -0
- package/docs/glm-api-reference.md +390 -0
- package/docs/glm-chat-completions.md +1779 -0
- package/docs/glm-history-inline-images.md +44 -0
- package/docs/golden-ci-library.md +66 -0
- package/docs/lmstudio-dry-run-summary.md +203 -0
- package/docs/lmstudio-tool-calling.md +214 -0
- package/docs/mapping-tables/anthropic-to-openai.json +290 -0
- package/docs/mapping-tables/iflow-to-openai.json +215 -0
- package/docs/mapping-tables/openai-passthrough.json +190 -0
- package/docs/mapping-tables/openai-to-iflow.json +227 -0
- package/docs/monitoring/Design.md +61 -0
- package/docs/multi-token-auth-guide.md +66 -0
- package/docs/oauth-authentication-guide.md +168 -0
- package/docs/oauth-iflow-implementation.md +153 -0
- package/docs/pipeline-routing-report.md +209 -0
- package/docs/plans/manager-daemon/PLAN.md +86 -0
- package/docs/plans/provider-config-v2-plan.md +176 -0
- package/docs/plans/provider-runtime-manager-plan.md +209 -0
- package/docs/plans/transparent-429-failover.md +89 -0
- package/docs/plans/unified-hub-framework-v1.md +245 -0
- package/docs/provider-config-v2-ui-design.md +181 -0
- package/docs/provider-quota-design.md +129 -0
- package/docs/providers/gemini-provider.md +62 -0
- package/docs/providers/lmstudio-v2-migration-report.md +102 -0
- package/docs/providers/provider-composite-design.md +142 -0
- package/docs/providers/provider-composite-testing.md +98 -0
- package/docs/providers/provider-type-only-migration.md +111 -0
- package/docs/rccx-wasm-migration.md +74 -0
- package/docs/refactoring/architecture-comparison-diagram.md +140 -0
- package/docs/refactoring/compatibility-v2-architecture-design.md +738 -0
- package/docs/refactoring/workflow-compatibility-refactoring-design.md +361 -0
- package/docs/reports/routing-classification-report.json +24 -0
- package/docs/reports/routing-classification-report.md +18 -0
- package/docs/reports/thinking-keywords-report.json +19 -0
- package/docs/responses/README.md +156 -0
- package/docs/responses-generic-provider.md +86 -0
- package/docs/responses-passthrough-provider-design.md +202 -0
- package/docs/routing-awrr-health-weighted-round-robin.md +179 -0
- package/docs/routing-instructions.md +393 -0
- package/docs/stop-message-auto.md +225 -0
- package/docs/streaming-flow.html +30 -0
- package/docs/streaming-flow.md +182 -0
- package/docs/token-daemon-preview.html +490 -0
- package/docs/token-refresh-daemon-plan.md +269 -0
- package/docs/transformation-tables/Gemini-FinishReason/345/256/214/346/225/264/350/275/254/346/215/242/350/241/250.json +233 -0
- package/docs/transformation-tables/README.md +225 -0
- package/docs/transformation-tables/claude-code-router-anthropic-to-gemini.json +283 -0
- package/docs/transformation-tables/claude-code-router-anthropic-to-openai.json +208 -0
- package/docs/transformation-tables/claude-code-router-openai-to-anthropic.json +261 -0
- package/docs/transformation-tables/claude-code-router-openai-to-gemini.json +208 -0
- package/docs/transformation-tables/claude-code-router-openai-to-lmstudio.json +182 -0
- package/docs/transformation-tables/claude-code-router-openai-to-ollama.json +250 -0
- package/docs/transformation-tables/claude-code-router-openai-to-textgenwebui.json +295 -0
- package/docs/transformation-tables/claude-code-router-provider-conversions.json +193 -0
- package/docs/transformation-tables//345/256/214/346/225/264/347/232/204/345/267/245/345/205/267/346/211/247/350/241/214/346/265/201/347/250/213/350/275/254/346/215/242/350/241/250.json +299 -0
- package/docs/transformation-tables//345/257/271/350/257/235/345/216/206/345/217/262/347/273/264/346/212/244/345/210/206/346/236/220.md +134 -0
- package/docs/transformation-tables//345/267/245/345/205/267/350/260/203/347/224/250/346/250/241/345/274/217/345/210/206/346/236/220.md +158 -0
- package/docs/transformation-tables//347/212/266/346/200/201/347/256/241/347/220/206/351/234/200/346/261/202/345/210/206/346/236/220.md +175 -0
- package/docs/transformation-tables//351/235/231/346/200/201/350/241/250vs/345/212/250/346/200/201/345/210/206/346/236/220.md +189 -0
- package/docs/transformation-tables//351/235/231/346/200/201/350/241/250/345/207/206/347/241/256/346/200/247/350/257/204/344/274/260.md +179 -0
- package/docs/transformation-tables//351/235/236/346/265/201/345/274/217/345/234/272/346/231/257/345/210/206/346/236/220.md +189 -0
- package/docs/v2-architecture/IMPLEMENTATION-ROADMAP.md +367 -0
- package/docs/v2-architecture/OPTIMIZED-DESIGN.md +827 -0
- package/docs/v2-architecture/PRERUN-CONNECTION-DESIGN.md +716 -0
- package/docs/v2-architecture/README.md +551 -0
- package/docs/verification/modelscope-verify.md +59 -0
- package/docs/web-search-service-design.md +322 -0
- package/package.json +12 -7
- package/scripts/camoufox/launch-auth.mjs +193 -58
- package/scripts/monitor-diff.mjs +126 -0
- package/scripts/pack-mode.mjs +19 -1
- package/scripts/pack-rcc.mjs +63 -0
- package/scripts/unified-hub-shadow-compare.mjs +33 -13
- package/scripts/verify-e2e-toolcall.mjs +115 -26
- package/dist/modules/llmswitch/pipeline-registry.d.ts +0 -57
- package/dist/modules/llmswitch/pipeline-registry.js +0 -229
- package/dist/modules/llmswitch/pipeline-registry.js.map +0 -1
- package/dist/server/RouteCodexServer.d.ts +0 -13
- package/dist/server/RouteCodexServer.js +0 -25
- package/dist/server/RouteCodexServer.js.map +0 -1
- package/dist/v2/conversion/hub/snapshot-recorder.d.ts +0 -12
- package/dist/v2/conversion/hub/snapshot-recorder.js +0 -22
- package/dist/v2/conversion/hub/snapshot-recorder.js.map +0 -1
|
@@ -0,0 +1,390 @@
|
|
|
1
|
+
# GLM API 参考文档
|
|
2
|
+
|
|
3
|
+
> 源文档:https://docs.bigmodel.cn/api-reference/模型-api/对话补全
|
|
4
|
+
> 保存时间:2025年10月29日
|
|
5
|
+
|
|
6
|
+
## 概述
|
|
7
|
+
|
|
8
|
+
GLM(General Language Model)对话补全API支持多种模型,提供文本对话、工具调用、流式输出等功能。支持多模态输入输出,包括文本、图片、音频、视频和文件。
|
|
9
|
+
|
|
10
|
+
## API端点
|
|
11
|
+
|
|
12
|
+
```
|
|
13
|
+
POST https://open.bigmodel.cn/api/paas/v4/chat/completions
|
|
14
|
+
```
|
|
15
|
+
|
|
16
|
+
## 认证
|
|
17
|
+
|
|
18
|
+
```
|
|
19
|
+
Authorization: Bearer <token>
|
|
20
|
+
Content-Type: application/json
|
|
21
|
+
```
|
|
22
|
+
|
|
23
|
+
## 请求参数
|
|
24
|
+
|
|
25
|
+
### 基础调用示例
|
|
26
|
+
|
|
27
|
+
```bash
|
|
28
|
+
curl --request POST \
|
|
29
|
+
--url https://open.bigmodel.cn/api/paas/v4/chat/completions \
|
|
30
|
+
--header 'Authorization: Bearer <token>' \
|
|
31
|
+
--header 'Content-Type: application/json' \
|
|
32
|
+
--data '{
|
|
33
|
+
"model": "glm-4.6",
|
|
34
|
+
"messages": [
|
|
35
|
+
{
|
|
36
|
+
"role": "system",
|
|
37
|
+
"content": "你是一个有用的AI助手。"
|
|
38
|
+
},
|
|
39
|
+
{
|
|
40
|
+
"role": "user",
|
|
41
|
+
"content": "请介绍一下人工智能的发展历程。"
|
|
42
|
+
}
|
|
43
|
+
],
|
|
44
|
+
"temperature": 1,
|
|
45
|
+
"max_tokens": 65536,
|
|
46
|
+
"stream": false
|
|
47
|
+
}'
|
|
48
|
+
```
|
|
49
|
+
|
|
50
|
+
### 参数详情
|
|
51
|
+
|
|
52
|
+
#### model (必需)
|
|
53
|
+
- **类型**: `enum<string>`
|
|
54
|
+
- **默认值**: `glm-4.6`
|
|
55
|
+
- **描述**: 调用的普通对话模型代码
|
|
56
|
+
|
|
57
|
+
**可用选项**:
|
|
58
|
+
- `glm-4.6` - 最新旗舰模型,专为智能体应用打造
|
|
59
|
+
- `glm-4.5` - 复杂推理、超长上下文
|
|
60
|
+
- `glm-4.5-air` - 轻量级版本
|
|
61
|
+
- `glm-4.5-x` - 增强版本
|
|
62
|
+
- `glm-4.5-airx` - 高性能轻量版本
|
|
63
|
+
- `glm-4.5-flash` - 快速响应版本
|
|
64
|
+
- `glm-4-plus` - GLM-4增强版本
|
|
65
|
+
- `glm-4-air-250414` - 2025年4月版本
|
|
66
|
+
- `glm-4-airx` - 高性能版本
|
|
67
|
+
- `glm-4-flashx` - 极速版本
|
|
68
|
+
- `glm-4-flashx-250414` - 2025年4月极速版本
|
|
69
|
+
- `glm-z1-air` - 推理专用轻量版本
|
|
70
|
+
- `glm-z1-airx` - 推理专用高性能版本
|
|
71
|
+
- `glm-z1-flash` - 推理专用快速版本
|
|
72
|
+
- `glm-z1-flashx` - 推理专用极速版本
|
|
73
|
+
|
|
74
|
+
#### messages (必需)
|
|
75
|
+
- **类型**: `(用户消息 · object | 系统消息 · object | 助手消息 · object | 工具消息 · object)[]`
|
|
76
|
+
- **描述**: 对话消息列表,包含完整的上下文信息
|
|
77
|
+
- **最小长度**: 1
|
|
78
|
+
- **注意**: 不能只包含系统消息或助手消息
|
|
79
|
+
|
|
80
|
+
**消息格式**:
|
|
81
|
+
```json
|
|
82
|
+
{
|
|
83
|
+
"role": "user" | "system" | "assistant" | "tool",
|
|
84
|
+
"content": "消息内容"
|
|
85
|
+
}
|
|
86
|
+
```
|
|
87
|
+
|
|
88
|
+
#### stream (可选)
|
|
89
|
+
- **类型**: `boolean`
|
|
90
|
+
- **默认值**: `false`
|
|
91
|
+
- **描述**: 是否启用流式输出模式
|
|
92
|
+
- `false`: 一次性返回完整响应
|
|
93
|
+
- `true`: 通过SSE流式返回内容,结束时返回 `data: [DONE]`
|
|
94
|
+
|
|
95
|
+
#### thinking (可选,GLM-4.5+支持)
|
|
96
|
+
- **类型**: `object`
|
|
97
|
+
- **描述**: 控制大模型是否开启思维链
|
|
98
|
+
|
|
99
|
+
```json
|
|
100
|
+
{
|
|
101
|
+
"thinking": {
|
|
102
|
+
"type": "enabled" | "disabled"
|
|
103
|
+
}
|
|
104
|
+
}
|
|
105
|
+
```
|
|
106
|
+
|
|
107
|
+
#### do_sample (可选)
|
|
108
|
+
- **类型**: `boolean`
|
|
109
|
+
- **默认值**: `true`
|
|
110
|
+
- **描述**: 是否启用采样策略
|
|
111
|
+
- `true`: 使用temperature、top_p等参数进行随机采样
|
|
112
|
+
- `false`: 选择概率最高的词汇,忽略temperature和top_p
|
|
113
|
+
|
|
114
|
+
#### temperature (可选)
|
|
115
|
+
- **类型**: `number`
|
|
116
|
+
- **默认值**:
|
|
117
|
+
- GLM-4.6系列: 1.0
|
|
118
|
+
- GLM-4.5系列: 0.6
|
|
119
|
+
- GLM-Z1系列和GLM-4系列: 0.75
|
|
120
|
+
- **范围**: `0.0 <= x <= 1.0`
|
|
121
|
+
- **描述**: 采样温度,控制输出的随机性和创造性
|
|
122
|
+
|
|
123
|
+
#### top_p (可选)
|
|
124
|
+
- **类型**: `number`
|
|
125
|
+
- **默认值**:
|
|
126
|
+
- GLM-4.6/GLM-4.5系列: 0.95
|
|
127
|
+
- GLM-Z1系列和GLM-4系列: 0.9
|
|
128
|
+
- **范围**: `0 < x <= 1.0`
|
|
129
|
+
- **描述**: 核采样参数,控制候选词汇范围
|
|
130
|
+
|
|
131
|
+
#### max_tokens (可选)
|
|
132
|
+
- **类型**: `integer`
|
|
133
|
+
- **范围**: `1 <= x <= 131072`
|
|
134
|
+
- **描述**: 模型输出的最大token数限制
|
|
135
|
+
- GLM-4.6: 最大128K
|
|
136
|
+
- GLM-4.5: 最大96K
|
|
137
|
+
- GLM-Z1系列: 最大32K
|
|
138
|
+
|
|
139
|
+
#### tool_stream (可选)
|
|
140
|
+
- **类型**: `boolean`
|
|
141
|
+
- **描述**: 是否开启流式响应Function Calls,仅限GLM-4.6支持
|
|
142
|
+
|
|
143
|
+
#### tools (可选)
|
|
144
|
+
- **类型**: `Function Call · object[] | Retrieval · object[] | Web Search · object[] | MCP · object[]`
|
|
145
|
+
- **描述**: 模型可以调用的工具列表
|
|
146
|
+
- **最大数量**: 128个函数
|
|
147
|
+
|
|
148
|
+
**函数工具格式**:
|
|
149
|
+
```json
|
|
150
|
+
{
|
|
151
|
+
"type": "function",
|
|
152
|
+
"function": {
|
|
153
|
+
"name": "函数名称",
|
|
154
|
+
"description": "函数描述",
|
|
155
|
+
"parameters": {
|
|
156
|
+
"type": "object",
|
|
157
|
+
"properties": {},
|
|
158
|
+
"required": ["必需参数"],
|
|
159
|
+
"additionalProperties": true
|
|
160
|
+
}
|
|
161
|
+
}
|
|
162
|
+
}
|
|
163
|
+
```
|
|
164
|
+
|
|
165
|
+
#### tool_choice (可选)
|
|
166
|
+
- **类型**: `enum<string>`
|
|
167
|
+
- **默认值**: `auto`
|
|
168
|
+
- **描述**: 控制模型如何选择工具
|
|
169
|
+
- **可用选项**: `auto`
|
|
170
|
+
|
|
171
|
+
#### stop (可选)
|
|
172
|
+
- **类型**: `string[]`
|
|
173
|
+
- **最大长度**: 1
|
|
174
|
+
- **描述**: 停止词列表,遇到指定字符串时停止生成
|
|
175
|
+
|
|
176
|
+
#### response_format (可选)
|
|
177
|
+
- **类型**: `object`
|
|
178
|
+
- **描述**: 指定模型的响应输出格式
|
|
179
|
+
|
|
180
|
+
```json
|
|
181
|
+
{
|
|
182
|
+
"response_format": {
|
|
183
|
+
"type": "text" | "json_object"
|
|
184
|
+
}
|
|
185
|
+
}
|
|
186
|
+
```
|
|
187
|
+
|
|
188
|
+
#### request_id (可选)
|
|
189
|
+
- **类型**: `string`
|
|
190
|
+
- **描述**: 请求唯一标识符,建议使用UUID格式
|
|
191
|
+
|
|
192
|
+
#### user_id (可选)
|
|
193
|
+
- **类型**: `string`
|
|
194
|
+
- **长度要求**: 6-128个字符
|
|
195
|
+
- **描述**: 终端用户的唯一标识符
|
|
196
|
+
|
|
197
|
+
## 响应格式
|
|
198
|
+
|
|
199
|
+
### 成功响应示例
|
|
200
|
+
|
|
201
|
+
```json
|
|
202
|
+
{
|
|
203
|
+
"id": "<string>",
|
|
204
|
+
"request_id": "<string>",
|
|
205
|
+
"created": 123,
|
|
206
|
+
"model": "<string>",
|
|
207
|
+
"choices": [
|
|
208
|
+
{
|
|
209
|
+
"index": 123,
|
|
210
|
+
"message": {
|
|
211
|
+
"role": "assistant",
|
|
212
|
+
"content": "<string>",
|
|
213
|
+
"reasoning_content": "<string>",
|
|
214
|
+
"audio": {
|
|
215
|
+
"id": "<string>",
|
|
216
|
+
"data": "<string>",
|
|
217
|
+
"expires_at": "<string>"
|
|
218
|
+
},
|
|
219
|
+
"tool_calls": [
|
|
220
|
+
{
|
|
221
|
+
"function": {
|
|
222
|
+
"name": "<string>",
|
|
223
|
+
"arguments": {}
|
|
224
|
+
},
|
|
225
|
+
"mcp": {
|
|
226
|
+
"id": "<string>",
|
|
227
|
+
"type": "mcp_list_tools",
|
|
228
|
+
"server_label": "<string>",
|
|
229
|
+
"error": "<string>",
|
|
230
|
+
"tools": [
|
|
231
|
+
{
|
|
232
|
+
"name": "<string>",
|
|
233
|
+
"description": "<string>",
|
|
234
|
+
"annotations": {},
|
|
235
|
+
"input_schema": {
|
|
236
|
+
"type": "object",
|
|
237
|
+
"properties": {},
|
|
238
|
+
"required": ["<any>"],
|
|
239
|
+
"additionalProperties": true
|
|
240
|
+
}
|
|
241
|
+
}
|
|
242
|
+
],
|
|
243
|
+
"arguments": "<string>",
|
|
244
|
+
"name": "<string>",
|
|
245
|
+
"output": {}
|
|
246
|
+
},
|
|
247
|
+
"id": "<string>",
|
|
248
|
+
"type": "<string>"
|
|
249
|
+
}
|
|
250
|
+
]
|
|
251
|
+
},
|
|
252
|
+
"finish_reason": "<string>"
|
|
253
|
+
}
|
|
254
|
+
],
|
|
255
|
+
"usage": {
|
|
256
|
+
"prompt_tokens": 123,
|
|
257
|
+
"completion_tokens": 123,
|
|
258
|
+
"prompt_tokens_details": {
|
|
259
|
+
"cached_tokens": 123
|
|
260
|
+
},
|
|
261
|
+
"total_tokens": 123
|
|
262
|
+
},
|
|
263
|
+
"video_result": [
|
|
264
|
+
{
|
|
265
|
+
"url": "<string>",
|
|
266
|
+
"cover_image_url": "<string>"
|
|
267
|
+
}
|
|
268
|
+
],
|
|
269
|
+
"web_search": [
|
|
270
|
+
{
|
|
271
|
+
"icon": "<string>",
|
|
272
|
+
"title": "<string>",
|
|
273
|
+
"link": "<string>",
|
|
274
|
+
"media": "<string>",
|
|
275
|
+
"publish_date": "<string>",
|
|
276
|
+
"content": "<string>",
|
|
277
|
+
"refer": "<string>"
|
|
278
|
+
}
|
|
279
|
+
],
|
|
280
|
+
"content_filter": [
|
|
281
|
+
{
|
|
282
|
+
"role": "<string>",
|
|
283
|
+
"level": 123
|
|
284
|
+
}
|
|
285
|
+
]
|
|
286
|
+
}
|
|
287
|
+
```
|
|
288
|
+
|
|
289
|
+
### 响应字段说明
|
|
290
|
+
|
|
291
|
+
#### choices
|
|
292
|
+
- **类型**: `object[]`
|
|
293
|
+
- **描述**: 模型响应列表
|
|
294
|
+
|
|
295
|
+
##### message
|
|
296
|
+
- **role**: 当前对话角色,默认为 `assistant`
|
|
297
|
+
- **content**: 当前对话文本内容
|
|
298
|
+
- 对于GLM-Z1系列模型,可能包含思考过程标签 `<think> </think>`
|
|
299
|
+
- 对于GLM-4.5V系列模型,可能包含文本边界标签 `<|begin_of_box|> <|end_of_box|>`
|
|
300
|
+
- **reasoning_content**: 思维链内容,仅在使用glm-4.5系列、glm-4.1v-thinking系列模型时返回
|
|
301
|
+
- **audio**: 当使用glm-4-voice模型时返回的音频内容
|
|
302
|
+
- **tool_calls**: 生成的应该被调用的函数名称和参数
|
|
303
|
+
|
|
304
|
+
##### finish_reason
|
|
305
|
+
推理终止原因:
|
|
306
|
+
- `stop`: 自然结束或触发stop词
|
|
307
|
+
- `tool_calls`: 模型命中函数
|
|
308
|
+
- `length`: 达到token长度限制
|
|
309
|
+
- `sensitive`: 内容被安全审核接口拦截
|
|
310
|
+
- `network_error`: 模型推理异常
|
|
311
|
+
|
|
312
|
+
#### usage
|
|
313
|
+
- **prompt_tokens**: 用户输入的Token数量
|
|
314
|
+
- **completion_tokens**: 输出的Token数量
|
|
315
|
+
- **total_tokens**: Token总数
|
|
316
|
+
- 对于glm-4-voice模型,1秒音频=12.5 Tokens,向上取整
|
|
317
|
+
|
|
318
|
+
#### video_result
|
|
319
|
+
视频生成结果(当使用视频生成功能时)
|
|
320
|
+
|
|
321
|
+
#### web_search
|
|
322
|
+
返回与网页搜索相关的信息(当使用WebSearchToolSchema时)
|
|
323
|
+
|
|
324
|
+
#### content_filter
|
|
325
|
+
返回内容安全的相关信息
|
|
326
|
+
- **role**: 安全生效环节(assistant, user, history)
|
|
327
|
+
- **level**: 严重程度,0-3,0最严重,3轻微
|
|
328
|
+
|
|
329
|
+
## 流式输出
|
|
330
|
+
|
|
331
|
+
当`stream=true`时,使用Server-Sent Events (SSE)格式返回:
|
|
332
|
+
|
|
333
|
+
```
|
|
334
|
+
data: {"id":"","object":"chat.completion.chunk","created":1234567890,"model":"glm-4","choices":[{"index":0,"delta":{"content":"内容片段"},"finish_reason":null}]}
|
|
335
|
+
|
|
336
|
+
data: [DONE]
|
|
337
|
+
```
|
|
338
|
+
|
|
339
|
+
## 模型特性对比
|
|
340
|
+
|
|
341
|
+
| 模型系列 | 最大上下文 | 最大输出 | 工具调用 | 思考链 | 多模态 |
|
|
342
|
+
|---------|-----------|----------|----------|--------|--------|
|
|
343
|
+
| GLM-4.6 | 128K | 128K | ✅ 完整支持 | ✅ | ✅ |
|
|
344
|
+
| GLM-4.5 | 96K | 96K | ✅ Web搜索+知识库 | ✅ | ✅ |
|
|
345
|
+
| GLM-Z1 | 32K | 32K | ❌ | ✅ `<think>`标签 | ❌ |
|
|
346
|
+
|
|
347
|
+
## 使用建议
|
|
348
|
+
|
|
349
|
+
### 温度参数
|
|
350
|
+
- **创意写作**: temperature=0.8-1.0
|
|
351
|
+
- **代码生成**: temperature=0.2-0.4
|
|
352
|
+
- **事实问答**: temperature=0.1-0.3
|
|
353
|
+
- **翻译任务**: temperature=0.1-0.2
|
|
354
|
+
|
|
355
|
+
### 最大Token设置
|
|
356
|
+
- **短对话**: max_tokens=1024-2048
|
|
357
|
+
- **长文本**: max_tokens=4096-8192
|
|
358
|
+
- **代码生成**: max_tokens=4096-16384
|
|
359
|
+
- **文档总结**: max_tokens=8192-32768
|
|
360
|
+
|
|
361
|
+
### 工具调用最佳实践
|
|
362
|
+
1. 最多支持128个函数
|
|
363
|
+
2. 函数名称只能包含字母、数字、下划线和破折号
|
|
364
|
+
3. 函数名称最大长度64个字符
|
|
365
|
+
4. 参数必须符合JSON Schema规范
|
|
366
|
+
|
|
367
|
+
## 错误处理
|
|
368
|
+
|
|
369
|
+
### 常见错误码
|
|
370
|
+
- **1210**: 工具调用格式错误
|
|
371
|
+
- **1214**: 工具调用配对错误
|
|
372
|
+
- **sensitive**: 内容被安全审核拦截
|
|
373
|
+
- **network_error**: 模型推理异常
|
|
374
|
+
|
|
375
|
+
### 内容过滤
|
|
376
|
+
响应中的`content_filter`字段表示内容安全检查结果:
|
|
377
|
+
- `role=assistant`: 模型推理环节
|
|
378
|
+
- `role=user`: 用户输入环节
|
|
379
|
+
- `role=history`: 历史上下文环节
|
|
380
|
+
|
|
381
|
+
## 相关文档
|
|
382
|
+
|
|
383
|
+
- GLM 兼容性实现现位于 `sharedmodule/llmswitch-core/src/conversion/compat/`(以 `chat:glm` profile 形式提供)
|
|
384
|
+
- [智谱AI官方文档](https://docs.bigmodel.cn/)
|
|
385
|
+
|
|
386
|
+
---
|
|
387
|
+
|
|
388
|
+
**文档版本**: v1.0.0
|
|
389
|
+
**最后更新**: 2025-10-29
|
|
390
|
+
**来源**: 智谱AI官方文档快照
|