@clickzetta/cz-cli-darwin-x64 0.3.80 → 0.3.81

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (201) hide show
  1. package/bin/cz-cli +0 -0
  2. package/package.json +1 -1
  3. package/bin/skills/clickzetta-access-control/LICENSE +0 -16
  4. package/bin/skills/clickzetta-access-control/SKILL.md +0 -243
  5. package/bin/skills/clickzetta-access-control/eval_cases.jsonl +0 -3
  6. package/bin/skills/clickzetta-access-control/references/dynamic-masking.md +0 -86
  7. package/bin/skills/clickzetta-access-control/references/grant-revoke.md +0 -103
  8. package/bin/skills/clickzetta-access-control/references/role-management.md +0 -66
  9. package/bin/skills/clickzetta-access-control/references/user-management.md +0 -61
  10. package/bin/skills/clickzetta-app-python-sdk/LICENSE +0 -16
  11. package/bin/skills/clickzetta-app-python-sdk/SKILL.md +0 -153
  12. package/bin/skills/clickzetta-app-python-sdk/eval_cases.jsonl +0 -12
  13. package/bin/skills/clickzetta-app-python-sdk/references/bulkload.md +0 -196
  14. package/bin/skills/clickzetta-app-python-sdk/references/connector.md +0 -143
  15. package/bin/skills/clickzetta-app-python-sdk/references/realtime.md +0 -122
  16. package/bin/skills/clickzetta-batch-sync-pipeline/LICENSE +0 -16
  17. package/bin/skills/clickzetta-batch-sync-pipeline/SKILL.md +0 -227
  18. package/bin/skills/clickzetta-batch-sync-pipeline/eval_cases.jsonl +0 -5
  19. package/bin/skills/clickzetta-bi-connect/LICENSE +0 -16
  20. package/bin/skills/clickzetta-bi-connect/SKILL.md +0 -176
  21. package/bin/skills/clickzetta-bi-connect/eval_cases.jsonl +0 -5
  22. package/bin/skills/clickzetta-bi-connect/references/bi-tools.md +0 -170
  23. package/bin/skills/clickzetta-cdc-sync-pipeline/LICENSE +0 -16
  24. package/bin/skills/clickzetta-cdc-sync-pipeline/SKILL.md +0 -633
  25. package/bin/skills/clickzetta-cdc-sync-pipeline/eval_cases.jsonl +0 -5
  26. package/bin/skills/clickzetta-data-ingest-pipeline/LICENSE +0 -16
  27. package/bin/skills/clickzetta-data-ingest-pipeline/SKILL.md +0 -237
  28. package/bin/skills/clickzetta-data-ingest-pipeline/eval_cases.jsonl +0 -5
  29. package/bin/skills/clickzetta-data-retention/LICENSE +0 -16
  30. package/bin/skills/clickzetta-data-retention/SKILL.md +0 -160
  31. package/bin/skills/clickzetta-data-retention/eval_cases.jsonl +0 -5
  32. package/bin/skills/clickzetta-data-retention/references/lifecycle-reference.md +0 -175
  33. package/bin/skills/clickzetta-data-science/LICENSE +0 -16
  34. package/bin/skills/clickzetta-data-science/SKILL.md +0 -125
  35. package/bin/skills/clickzetta-data-science/eval_cases.jsonl +0 -12
  36. package/bin/skills/clickzetta-data-science/references/bitmap-profile.md +0 -146
  37. package/bin/skills/clickzetta-data-science/references/data-patterns.md +0 -110
  38. package/bin/skills/clickzetta-data-science/references/setup.md +0 -160
  39. package/bin/skills/clickzetta-data-science/references/stats-functions.md +0 -195
  40. package/bin/skills/clickzetta-data-science/references/write-and-infer.md +0 -122
  41. package/bin/skills/clickzetta-data-science/references/zettapark-api.md +0 -156
  42. package/bin/skills/clickzetta-data-sharing/LICENSE +0 -16
  43. package/bin/skills/clickzetta-data-sharing/SKILL.md +0 -160
  44. package/bin/skills/clickzetta-data-sharing/eval_cases.jsonl +0 -3
  45. package/bin/skills/clickzetta-data-sharing/references/share-ddl.md +0 -134
  46. package/bin/skills/clickzetta-dba-guide/LICENSE +0 -16
  47. package/bin/skills/clickzetta-dba-guide/SKILL.md +0 -542
  48. package/bin/skills/clickzetta-dba-guide/eval_cases.jsonl +0 -3
  49. package/bin/skills/clickzetta-dw-modeling/LICENSE +0 -16
  50. package/bin/skills/clickzetta-dw-modeling/SKILL.md +0 -351
  51. package/bin/skills/clickzetta-dw-modeling/eval_cases.jsonl +0 -4
  52. package/bin/skills/clickzetta-dw-modeling/references/modeling-patterns.md +0 -100
  53. package/bin/skills/clickzetta-dynamic-table/LICENSE +0 -16
  54. package/bin/skills/clickzetta-dynamic-table/SKILL.md +0 -230
  55. package/bin/skills/clickzetta-dynamic-table/best-practices/dimension-table-join-guide.md +0 -253
  56. package/bin/skills/clickzetta-dynamic-table/best-practices/medallion-and-stream-patterns.md +0 -124
  57. package/bin/skills/clickzetta-dynamic-table/best-practices/non-partitioned-merge-into-warning.md +0 -96
  58. package/bin/skills/clickzetta-dynamic-table/best-practices/performance-optimization.md +0 -109
  59. package/bin/skills/clickzetta-dynamic-table/best-practices/scheduling-guide.md +0 -135
  60. package/bin/skills/clickzetta-dynamic-table/dt-creator/SKILL.md +0 -15
  61. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/dt-declaration-strategy.md +0 -185
  62. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/incremental-config-reference.md +0 -427
  63. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/refresh-history-guide.md +0 -260
  64. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/sql-limitations.md +0 -80
  65. package/bin/skills/clickzetta-dynamic-table/dynamic-table-alter/SKILL.md +0 -190
  66. package/bin/skills/clickzetta-dynamic-table/eval_cases.jsonl +0 -5
  67. package/bin/skills/clickzetta-dynamic-table/sql-to-dt/SKILL.md +0 -27
  68. package/bin/skills/clickzetta-dynamic-table/sql-to-dt/references/sql2dt-column-validation-rules.md +0 -118
  69. package/bin/skills/clickzetta-dynamic-table/sql-to-dt/references/sql2dt-conversion-rules.md +0 -225
  70. package/bin/skills/clickzetta-dynamic-table/sql-to-dt/references/sql2dt-placeholder-rules.md +0 -182
  71. package/bin/skills/clickzetta-dynamic-table/sql-to-dt/references/sql2dt-refresh-rules.md +0 -98
  72. package/bin/skills/clickzetta-dynamic-table/sql-to-dt/references/sql2dt-self-reference-rules.md +0 -76
  73. package/bin/skills/clickzetta-dynamic-table/sql-to-dt/references/sql2dt-workflow.md +0 -109
  74. package/bin/skills/clickzetta-external-catalog/LICENSE +0 -16
  75. package/bin/skills/clickzetta-external-catalog/SKILL.md +0 -123
  76. package/bin/skills/clickzetta-external-catalog/eval_cases.jsonl +0 -5
  77. package/bin/skills/clickzetta-external-catalog/references/external-catalog-ddl.md +0 -130
  78. package/bin/skills/clickzetta-external-function/LICENSE +0 -16
  79. package/bin/skills/clickzetta-external-function/SKILL.md +0 -203
  80. package/bin/skills/clickzetta-external-function/eval_cases.jsonl +0 -4
  81. package/bin/skills/clickzetta-external-function/references/external-function-ddl.md +0 -171
  82. package/bin/skills/clickzetta-file-import-pipeline/LICENSE +0 -16
  83. package/bin/skills/clickzetta-file-import-pipeline/SKILL.md +0 -190
  84. package/bin/skills/clickzetta-file-import-pipeline/eval_cases.jsonl +0 -5
  85. package/bin/skills/clickzetta-index-manager/LICENSE +0 -16
  86. package/bin/skills/clickzetta-index-manager/SKILL.md +0 -140
  87. package/bin/skills/clickzetta-index-manager/eval_cases.jsonl +0 -5
  88. package/bin/skills/clickzetta-index-manager/references/bloomfilter-index.md +0 -67
  89. package/bin/skills/clickzetta-index-manager/references/index-management.md +0 -73
  90. package/bin/skills/clickzetta-index-manager/references/inverted-index.md +0 -80
  91. package/bin/skills/clickzetta-index-manager/references/vector-index.md +0 -81
  92. package/bin/skills/clickzetta-java-sdk/LICENSE +0 -16
  93. package/bin/skills/clickzetta-java-sdk/SKILL.md +0 -186
  94. package/bin/skills/clickzetta-java-sdk/eval_cases.jsonl +0 -12
  95. package/bin/skills/clickzetta-java-sdk/references/bulkload.md +0 -163
  96. package/bin/skills/clickzetta-java-sdk/references/realtime.md +0 -212
  97. package/bin/skills/clickzetta-kafka-ingest-pipeline/LICENSE +0 -16
  98. package/bin/skills/clickzetta-kafka-ingest-pipeline/SKILL.md +0 -769
  99. package/bin/skills/clickzetta-kafka-ingest-pipeline/eval_cases.jsonl +0 -5
  100. package/bin/skills/clickzetta-kafka-ingest-pipeline/references/kafka-pipe-syntax.md +0 -324
  101. package/bin/skills/clickzetta-lakehouse-connect/LICENSE +0 -16
  102. package/bin/skills/clickzetta-lakehouse-connect/SKILL.md +0 -218
  103. package/bin/skills/clickzetta-lakehouse-connect/eval_cases.jsonl +0 -3
  104. package/bin/skills/clickzetta-lakehouse-connect/evals/evals.json +0 -35
  105. package/bin/skills/clickzetta-lakehouse-connect/references/config-file.md +0 -435
  106. package/bin/skills/clickzetta-lakehouse-connect/references/jdbc.md +0 -478
  107. package/bin/skills/clickzetta-lakehouse-connect/references/python-sdk.md +0 -225
  108. package/bin/skills/clickzetta-lakehouse-connect/references/sqlalchemy.md +0 -468
  109. package/bin/skills/clickzetta-lakehouse-connect/references/zettapark-session.md +0 -445
  110. package/bin/skills/clickzetta-manage-comments/LICENSE +0 -16
  111. package/bin/skills/clickzetta-manage-comments/SKILL.md +0 -219
  112. package/bin/skills/clickzetta-manage-comments/eval_cases.jsonl +0 -3
  113. package/bin/skills/clickzetta-metadata/LICENSE +0 -16
  114. package/bin/skills/clickzetta-metadata/SKILL.md +0 -502
  115. package/bin/skills/clickzetta-metadata/eval_cases.jsonl +0 -5
  116. package/bin/skills/clickzetta-metadata/references/instance-views-reference.md +0 -276
  117. package/bin/skills/clickzetta-metadata/references/metering-views-reference.md +0 -137
  118. package/bin/skills/clickzetta-metadata/references/show-desc-reference.md +0 -326
  119. package/bin/skills/clickzetta-metadata/references/views-reference.md +0 -271
  120. package/bin/skills/clickzetta-monitoring/LICENSE +0 -16
  121. package/bin/skills/clickzetta-monitoring/SKILL.md +0 -215
  122. package/bin/skills/clickzetta-monitoring/eval_cases.jsonl +0 -5
  123. package/bin/skills/clickzetta-monitoring/references/job-history-analysis.md +0 -97
  124. package/bin/skills/clickzetta-monitoring/references/show-jobs.md +0 -48
  125. package/bin/skills/clickzetta-oss-ingest-pipeline/LICENSE +0 -16
  126. package/bin/skills/clickzetta-oss-ingest-pipeline/SKILL.md +0 -562
  127. package/bin/skills/clickzetta-oss-ingest-pipeline/eval_cases.jsonl +0 -5
  128. package/bin/skills/clickzetta-overview/LICENSE +0 -16
  129. package/bin/skills/clickzetta-overview/SKILL.md +0 -102
  130. package/bin/skills/clickzetta-overview/eval_cases.jsonl +0 -5
  131. package/bin/skills/clickzetta-overview/references/brands-and-endpoints.md +0 -79
  132. package/bin/skills/clickzetta-overview/references/object-model.md +0 -311
  133. package/bin/skills/clickzetta-overview/references/studio-modules.md +0 -173
  134. package/bin/skills/clickzetta-pipeline-review/LICENSE +0 -16
  135. package/bin/skills/clickzetta-pipeline-review/SKILL.md +0 -377
  136. package/bin/skills/clickzetta-query-optimizer/LICENSE +0 -16
  137. package/bin/skills/clickzetta-query-optimizer/SKILL.md +0 -156
  138. package/bin/skills/clickzetta-query-optimizer/eval_cases.jsonl +0 -5
  139. package/bin/skills/clickzetta-query-optimizer/references/explain.md +0 -56
  140. package/bin/skills/clickzetta-query-optimizer/references/hints-and-sortkey.md +0 -78
  141. package/bin/skills/clickzetta-query-optimizer/references/optimize.md +0 -65
  142. package/bin/skills/clickzetta-query-optimizer/references/result-cache.md +0 -49
  143. package/bin/skills/clickzetta-query-optimizer/references/show-jobs.md +0 -42
  144. package/bin/skills/clickzetta-realtime-sync-pipeline/LICENSE +0 -16
  145. package/bin/skills/clickzetta-realtime-sync-pipeline/SKILL.md +0 -323
  146. package/bin/skills/clickzetta-realtime-sync-pipeline/eval_cases.jsonl +0 -5
  147. package/bin/skills/clickzetta-semantic-view/LICENSE +0 -16
  148. package/bin/skills/clickzetta-semantic-view/SKILL.md +0 -207
  149. package/bin/skills/clickzetta-semantic-view/eval_cases.jsonl +0 -12
  150. package/bin/skills/clickzetta-semantic-view/references/semantic-view-reference.md +0 -167
  151. package/bin/skills/clickzetta-spark-flink-connector/LICENSE +0 -16
  152. package/bin/skills/clickzetta-spark-flink-connector/SKILL.md +0 -92
  153. package/bin/skills/clickzetta-spark-flink-connector/eval_cases.jsonl +0 -5
  154. package/bin/skills/clickzetta-spark-flink-connector/references/flink.md +0 -147
  155. package/bin/skills/clickzetta-spark-flink-connector/references/spark.md +0 -132
  156. package/bin/skills/clickzetta-sql-pipeline-manager/LICENSE +0 -16
  157. package/bin/skills/clickzetta-sql-pipeline-manager/SKILL.md +0 -485
  158. package/bin/skills/clickzetta-sql-pipeline-manager/eval_cases.jsonl +0 -12
  159. package/bin/skills/clickzetta-sql-pipeline-manager/evals/evals.json +0 -166
  160. package/bin/skills/clickzetta-sql-pipeline-manager/references/dynamic-table.md +0 -185
  161. package/bin/skills/clickzetta-sql-pipeline-manager/references/materialized-view.md +0 -129
  162. package/bin/skills/clickzetta-sql-pipeline-manager/references/pipe.md +0 -222
  163. package/bin/skills/clickzetta-sql-pipeline-manager/references/table-stream.md +0 -125
  164. package/bin/skills/clickzetta-sql-syntax-guide/LICENSE +0 -16
  165. package/bin/skills/clickzetta-sql-syntax-guide/SKILL.md +0 -249
  166. package/bin/skills/clickzetta-sql-syntax-guide/eval_cases.jsonl +0 -3
  167. package/bin/skills/clickzetta-sql-syntax-guide/references/ddl-reference.md +0 -350
  168. package/bin/skills/clickzetta-sql-syntax-guide/references/dml-reference.md +0 -279
  169. package/bin/skills/clickzetta-sql-syntax-guide/references/dql-reference.md +0 -504
  170. package/bin/skills/clickzetta-sql-syntax-guide/references/functions-reference.md +0 -372
  171. package/bin/skills/clickzetta-sql-syntax-guide/references/migration-databricks.md +0 -260
  172. package/bin/skills/clickzetta-sql-syntax-guide/references/migration-snowflake.md +0 -382
  173. package/bin/skills/clickzetta-sql-syntax-guide/references/vs-snowflake.md +0 -346
  174. package/bin/skills/clickzetta-sql-syntax-guide/references/vs-spark.md +0 -229
  175. package/bin/skills/clickzetta-studio-task-manager/LICENSE +0 -16
  176. package/bin/skills/clickzetta-studio-task-manager/SKILL.md +0 -652
  177. package/bin/skills/clickzetta-table-lineage/LICENSE +0 -16
  178. package/bin/skills/clickzetta-table-lineage/SKILL.md +0 -90
  179. package/bin/skills/clickzetta-table-lineage/eval_cases.jsonl +0 -1
  180. package/bin/skills/clickzetta-table-lineage/references/normalize_func.sql +0 -14
  181. package/bin/skills/clickzetta-table-lineage/references/table_cost.sql +0 -38
  182. package/bin/skills/clickzetta-table-lineage/references/table_lineage_standalone.html +0 -562
  183. package/bin/skills/clickzetta-table-lineage/references/table_relation.sql +0 -25
  184. package/bin/skills/clickzetta-table-stream-pipeline/LICENSE +0 -16
  185. package/bin/skills/clickzetta-table-stream-pipeline/SKILL.md +0 -206
  186. package/bin/skills/clickzetta-table-stream-pipeline/eval_cases.jsonl +0 -5
  187. package/bin/skills/clickzetta-vcluster-manager/LICENSE +0 -16
  188. package/bin/skills/clickzetta-vcluster-manager/SKILL.md +0 -212
  189. package/bin/skills/clickzetta-vcluster-manager/eval_cases.jsonl +0 -5
  190. package/bin/skills/clickzetta-vcluster-manager/references/vc-cache.md +0 -54
  191. package/bin/skills/clickzetta-vcluster-manager/references/vcluster-ddl.md +0 -150
  192. package/bin/skills/clickzetta-volume-manager/LICENSE +0 -16
  193. package/bin/skills/clickzetta-volume-manager/SKILL.md +0 -292
  194. package/bin/skills/clickzetta-volume-manager/eval_cases.jsonl +0 -5
  195. package/bin/skills/clickzetta-volume-manager/references/volume-ddl.md +0 -199
  196. package/bin/skills/clickzetta-zettapark/LICENSE +0 -16
  197. package/bin/skills/clickzetta-zettapark/SKILL.md +0 -248
  198. package/bin/skills/clickzetta-zettapark/eval_cases.jsonl +0 -12
  199. package/bin/skills/clickzetta-zettapark/references/zettapark-api.md +0 -283
  200. package/bin/skills/cz-cli/SKILL.md +0 -311
  201. package/bin/skills/cz-cli/references/profile-setup.md +0 -120
@@ -1,65 +0,0 @@
1
- # OPTIMIZE 命令参考
2
-
3
- > 来源:https://www.yunqi.tech/documents/OPTIMIZE 和 https://www.yunqi.tech/documents/small_file_optimization
4
-
5
- ## 语法
6
-
7
- ```sql
8
- OPTIMIZE table_name
9
- [WHERE predicate]
10
- [OPTIONS('key' = 'value')]
11
- ```
12
-
13
- ## 参数说明
14
-
15
- - `table_name`:格式为 `[schema_name.]table_name`
16
- - `WHERE predicate`:(可选)分区过滤条件,必须包含完整分区列匹配
17
- - 格式:`partition_column = 'value'` 或 `dt='2023-01-01' AND region='us'`
18
- - `OPTIONS`:(可选)控制执行模式
19
-
20
- ## 执行模式
21
-
22
- ### 异步模式(默认)
23
-
24
- 立即返回 Job ID,后台执行,不阻塞当前连接。
25
-
26
- ```sql
27
- -- 默认异步
28
- OPTIMIZE my_schema.orders;
29
-
30
- -- 显式指定异步
31
- OPTIMIZE my_schema.orders OPTIONS('cz.sql.optimize.table.async' = 'true');
32
- ```
33
-
34
- ### 同步模式
35
-
36
- 阻塞直到完成,适合开发测试和小表优化。
37
-
38
- ```sql
39
- OPTIMIZE my_schema.orders OPTIONS('cz.sql.optimize.table.async' = 'false');
40
- ```
41
-
42
- ## 核心功能
43
-
44
- - **小文件合并**:将多个小文件整合为大文件,减少文件元数据开销
45
- - **删除标记清理**:清理 UPDATE/DELETE 产生的删除标记,回收存储空间
46
- - **数据重组**:重新整理数据布局,提升查询性能
47
-
48
- ## 注意事项
49
-
50
- - **只能在通用型计算集群(GENERAL PURPOSE VIRTUAL CLUSTER)运行**,分析型集群不生效
51
- - 后台默认会不定时自动执行文件合并,手动 OPTIMIZE 用于精细控制
52
-
53
- ## DML 写入时自动触发小文件合并
54
-
55
- ```sql
56
- -- 在 DML 执行时同时触发小文件合并
57
- SET cz.sql.compaction.after.commit = true;
58
- INSERT INTO my_table VALUES (...);
59
- ```
60
-
61
- ## 查看分区文件数量
62
-
63
- ```sql
64
- SHOW PARTITIONS EXTENDED table_name;
65
- ```
@@ -1,49 +0,0 @@
1
- # Result Cache(查询结果缓存)参考
2
-
3
- > 来源:https://www.yunqi.tech/documents/result_cache
4
-
5
- ## 概述
6
-
7
- ClickZetta Lakehouse 提供三种缓存:
8
- 1. **查询结果缓存(Result Cache)** — 本文档
9
- 2. 元数据缓存(Metadata Cache)— 工作空间内共享
10
- 3. 虚拟集群本地缓存(Local Disk Cache)— 仅限指定集群
11
-
12
- ## 启用 / 禁用
13
-
14
- ```sql
15
- -- 开启结果缓存(SESSION 级别)
16
- SET cz.sql.enable.shortcut.result.cache = true;
17
-
18
- -- 关闭结果缓存
19
- SET cz.sql.enable.shortcut.result.cache = false;
20
- ```
21
-
22
- > 注意:当前默认未开启,需手动启用。
23
-
24
- ## 缓存复用条件(同时满足才能命中)
25
-
26
- 1. 查询中使用的表数据未发生变更
27
- 2. 查询中不包含视图引用
28
- 3. 新 SQL 与之前执行的 SQL 语法精确匹配
29
- 4. 查询中不包含非确定性函数(如 `CURRENT_TIMESTAMP()`)或 UDF
30
- 5. 之前的 Result Cache 未过期
31
-
32
- ## 过期周期
33
-
34
- - 默认保留 **24 小时**
35
- - 24 小时内有查询复用,则额外延长 24 小时
36
- - 超过 24 小时无复用则清除
37
-
38
- ## 约束与限制
39
-
40
- | 项目 | 限制 |
41
- |---|---|
42
- | 缓存保留周期 | 24 小时 |
43
- | 单工作空间最大缓存作业数 | 10 万 |
44
- | 缓存大小 | 无限制(≤10MB 存内存,>10MB 持久化到对象存储) |
45
- | 不支持 | 含非确定性函数、UDF 的查询 |
46
-
47
- ## 验证是否命中缓存
48
-
49
- 第二次执行相同查询后,在 Job Profile 的执行计划图中查看是否出现 `JOB RESULT REUSE` 标记。命中缓存的查询通常在 15ms 内返回。
@@ -1,42 +0,0 @@
1
- # SHOW JOBS 参考
2
-
3
- > 来源:https://www.yunqi.tech/documents/show-jobs
4
-
5
- ## 语法
6
-
7
- ```sql
8
- SHOW JOBS [IN VCLUSTER vc_name] [LIKE 'pattern'] [WHERE <expr>] [LIMIT num];
9
- ```
10
-
11
- ## 参数说明
12
-
13
- - `IN VCLUSTER vc_name`:(可选)筛选指定计算集群下的作业
14
- - `WHERE <expr>`:(可选)按字段过滤,支持 SHOW JOBS 返回的所有字段
15
- - `LIMIT num`:(可选)限制返回数量,范围 1-10000
16
- - `LIKE 'pattern'`:(可选)按 job_id 模式匹配,支持 `%` 和 `_`
17
-
18
- 默认显示最近 7 天内提交的任务,最多 10000 条。
19
-
20
- ## 示例
21
-
22
- ```sql
23
- -- 查看执行时间超过 2 分钟的作业
24
- SHOW JOBS IN VCLUSTER default_ap WHERE execution_time > interval 2 minute;
25
-
26
- -- 查看指定集群的所有作业
27
- SHOW JOBS IN VCLUSTER default_ap;
28
-
29
- -- 限制返回 100 条
30
- SHOW JOBS LIMIT 100;
31
-
32
- -- 查看指定集群最近 50 条
33
- SHOW JOBS IN VCLUSTER default_ap LIMIT 50;
34
-
35
- -- 按 job_id 模式匹配
36
- SHOW JOBS LIKE 'job_2024%';
37
- ```
38
-
39
- ## 注意事项
40
-
41
- - 只能查看最近 7 天内的作业记录
42
- - 未指定 VCLUSTER 时显示所有集群的作业
@@ -1,16 +0,0 @@
1
- ClickZetta Skills License
2
- © 2026 Yunqi Inc. All rights reserved.
3
- LICENSE: Use of these materials (including all code, prompts, assets, files, and other components of these skills (collectively, "Skills")) is governed by your agreement with ClickZetta for the Service. If no separate agreement exists, use is governed by ClickZetta's Terms of Service (available at: https://yunqi.tech/documents/user-aggrement).
4
- Your applicable agreement is referred to as the "Agreement." "Service" is as defined in the Agreement.
5
- ADDITIONAL RESTRICTIONS: Notwithstanding anything in the Agreement to the contrary, you may not:
6
-
7
- Extract from the Service or retain copies of the Skills outside use with the Service;
8
- Reproduce or copy the Skills, except for temporary copies created automatically during authorized use of the Service;
9
- Create derivative works based on the Skills;
10
- Distribute, sublicense, or transfer the Skills to any third party;
11
- Make, offer to sell, sell, or import any inventions embodied in the Skills; nor,
12
- Reverse engineer, decompile, or disassemble the Skills.
13
-
14
- The receipt, viewing, or possession of the Skills does not convey or imply any license or right beyond those expressly granted above.
15
- Yunqi retains all rights, title, and interest in the Skills, including all copyrights, trademarks, patents, and all other applicable intellectual property rights.
16
- THE SKILLS ARE PROVIDED "AS IS," WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SKILLS OR THE USE OR OTHER DEALINGS IN THE SKILLS.
@@ -1,323 +0,0 @@
1
- ---
2
- name: clickzetta-realtime-sync-pipeline
3
- description: |
4
- 创建和管理 ClickZetta Lakehouse 实时同步任务(单表),将外部数据源的数据实时同步到 Lakehouse。
5
- 支持 Kafka、MySQL、PostgreSQL 等数据源作为来源端,Lakehouse 作为目标端。
6
- 实时同步任务为持续运行的流式任务,无需配置调度策略,提交后即持续运行。
7
- 当用户说"Studio 实时同步"、"realtime sync"、"单表 CDC 同步"、"实时数据同步"、"Kafka 实时同步到 Lakehouse"、
8
- "MySQL 单表实时同步"、"单表实时同步"、"实时数据迁移"时触发。
9
- 包含实时同步任务创建、数据源配置、字段映射(含 JSONPath 计算列)、部署运维等
10
- ClickZetta Studio 特有逻辑。
11
- Keywords: real-time sync, single table, Kafka source, MySQL source, streaming
12
- ---
13
-
14
- # 实时同步(单表)Pipeline 工作流
15
-
16
- ## 向导:收集必要信息
17
-
18
- 开始创建实时同步任务前,优先使用交互式问答工具(如 `question`)收集以下信息并弹出选项菜单;若无此类工具,则用文字一次性列出所有问题:
19
-
20
- ```
21
- question({
22
- questions: [
23
- {
24
- question: "数据源类型?",
25
- options: [
26
- { label: "Kafka", description: "Kafka Topic 实时接入,支持 JSON 消息解析" },
27
- { label: "MySQL / Aurora MySQL", description: "单表 CDC 实时同步" },
28
- { label: "PostgreSQL / Aurora PG", description: "单表 CDC 实时同步" },
29
- { label: "SQL Server", description: "单表 CDC 实时同步" }
30
- ]
31
- },
32
- {
33
- question: "同步粒度?",
34
- options: [
35
- { label: "单表/单 Topic", description: "本 skill 支持,精细化配置" },
36
- { label: "整库/多表", description: "建议改用 clickzetta-cdc-sync-pipeline" }
37
- ]
38
- }
39
- ]
40
- })
41
- ```
42
-
43
- **如果用户已经提供了足够信息,直接进入工作流,不再弹出菜单。**
44
-
45
- ---
46
-
47
- ## 适用场景
48
-
49
- - 将外部数据源的数据实时同步到 Lakehouse(低延迟、持续运行)
50
- - Kafka Topic → Lakehouse 表(支持 JSON 消息解析)
51
- - MySQL / PostgreSQL / SQL Server 等数据库 → Lakehouse 表(CDC 变更捕获)
52
- - 数据时效性要求高,需要秒级或分钟级延迟
53
- - 单张源表/Topic 到单张目标表的实时同步
54
- - 关键词:实时同步、CDC、流式同步、realtime sync、Kafka 实时同步
55
-
56
- ## 与其他同步方式的区别
57
-
58
- | 维度 | 实时同步(本 Skill) | 离线同步 | 多表实时同步 |
59
- |------|---------------------|---------|------------|
60
- | 任务类型 ID | `14`(REALTIME/CDC) | `10` / `291` | `281` |
61
- | 同步粒度 | 单表/单 Topic | 单表/多表 | 整库/多表 |
62
- | 运行模式 | 持续运行(流式) | 周期调度(批量) | 持续运行(流式) |
63
- | 调度策略 | 无需配置,提交即运行 | 需配置 Cron 表达式 | 无需配置,提交即运行 |
64
- | 延迟 | 秒级~分钟级 | 取决于调度周期 | 秒级~分钟级 |
65
- | 适用 Skill | `clickzetta-realtime-sync-pipeline` | `clickzetta-batch-sync-pipeline` | `clickzetta-cdc-sync-pipeline` |
66
-
67
- ## 前置依赖
68
-
69
- - ClickZetta Lakehouse Studio 账户,具备创建同步任务、目标表的权限
70
- - 源端数据源已在 Studio 中配置(Kafka / MySQL / PostgreSQL / SQL Server 等)
71
- - 目标端 Lakehouse 数据源可用
72
- - Sync VCluster 可用(实时同步任务 task_type=14 需要 Sync VCluster)
73
- - **执行环境(满足其一即可,优先使用 cz-cli)**:
74
- - **cz-cli 路径**:已安装 cz-cli(`brew install cz-cli 或参考官方文档安装`),并完成 `cz-cli setup` 配置
75
- - **MCP 路径**:clickzetta-studio-mcp 工具可用(`create_task`、`save_integration_task`、`publish_task`、`list_data_sources`、`LH_show_object_list` 等)
76
-
77
- ## 环境探测(执行前必读)
78
-
79
- 在开始任何操作前,先判断当前执行环境:
80
-
81
- **第一步:检测 cz-cli 是否可用**
82
- ```bash
83
- cz-cli --version
84
- ```
85
- - 若命令存在 → **走 cz-cli 路径**(见本文档末尾"cz-cli 替代路径"章节)
86
- - 若命令不存在 → 继续检测 MCP
87
-
88
- **第二步:检测 MCP 是否可用(仅在 cz-cli 不可用时)**
89
-
90
- 尝试调用 `list_data_sources` 工具查询数据源列表。
91
- - 若工具存在于 tool list → **走 MCP 路径**(本文档默认路径)
92
- - 若工具不存在 → 停止执行,提示用户:
93
- > "当前环境既无 cz-cli 也无 MCP 工具,请安装其中之一后重试。
94
- > cz-cli 安装:`brew install cz-cli 或参考官方文档安装`,然后运行 `cz-cli setup`
95
- > MCP 安装:参考 clickzetta-studio-mcp 配置文档"
96
-
97
- ## 工作流
98
-
99
- ### 步骤 1:确认 Sync VCluster 可用
100
-
101
- ```
102
- 使用 LH_show_object_list(object_type='VCLUSTERS')查看可用虚拟集群。
103
- 筛选 vcluster_type 包含 SYNC 的集群。
104
- 如无可用 Sync VCluster,需先创建后再继续。
105
- ```
106
-
107
- ### 步骤 2:查找可用数据源
108
-
109
- ```
110
- 使用 list_data_sources 查看已配置的数据源列表。
111
- 按类型过滤:
112
- - Kafka: ds_type=2
113
- - MySQL: ds_type=5
114
- - PostgreSQL: ds_type=7
115
- - SQL Server: ds_type=8
116
- 记录源端 datasource_name 和目标端 Lakehouse datasource_name。
117
- ```
118
-
119
- ### 步骤 3:探查源端数据结构(可选)
120
-
121
- ```
122
- 使用 list_namespaces 查看源端数据源的命名空间(数据库/Schema)。
123
- 使用 list_metadata_objects 查看命名空间下的表/Topic 列表。
124
- 使用 get_metadata_detail 查看具体表/Topic 的字段结构。
125
- ```
126
-
127
- ### 步骤 4:创建实时同步任务
128
-
129
- ```
130
- 使用 create_task 创建任务:
131
- - task_type: 14(实时同步)
132
- - task_name: 自定义任务名称(建议包含源和目标信息,如 "rt_sync_kafka_orders")
133
- - data_folder_id: 目标文件夹 ID(可通过 list_folders 获取)
134
-
135
- 记录返回的 task_id 和 studio_url。
136
- ```
137
-
138
- ### 步骤 5:配置同步内容
139
-
140
- ```
141
- 使用 save_integration_task 配置同步:
142
- - task_id: 步骤 4 返回的任务 ID
143
- - source_datasource_name: 源端数据源名称
144
- - source_schema: 源端数据库/Schema(Kafka 场景为 Topic 所在命名空间)
145
- - source_table: 源端表名或 Kafka Topic 名称
146
- - source_ds_type: 源端类型(2=Kafka, 5=MySQL, 7=PostgreSQL, 8=SQL Server)
147
- - sink_datasource_name: 目标 Lakehouse 数据源名称
148
- - sink_schema: 目标 Schema(默认 public)
149
- - sink_table: 目标表名(可选,默认与源表同名)
150
- - sink_ds_type: 1(Lakehouse)
151
- ```
152
-
153
- > **说明**:系统会自动获取源端和目标端的元数据,生成字段映射。如目标表不存在,会自动创建。
154
-
155
- ### 步骤 6:Kafka JSON 消息解析(Kafka 数据源专用)
156
-
157
- 如果 Kafka Topic 的消息格式为 JSON,可在 Studio UI 中通过新增计算列解析嵌套字段:
158
-
159
- - 使用 JSONPath 规则解析 value 字段中的内容
160
- - 示例:`$.id` 提取顶层 id 字段,`$.data.code` 提取嵌套字段
161
- - 默认使用 Kafka Topic 内置字段(key、value、timestamp、partition、offset)进行映射
162
- - 计算列配置需在 Studio UI 中完成(通过 studio_url 打开)
163
-
164
- ### 步骤 7:提交部署
165
-
166
- ```
167
- 实时同步任务不需要配置调度策略(无需调用 save_task_configuration)。
168
- 直接使用 publish_task 提交任务:
169
- - task_id: 任务 ID
170
- - task_version: 当前版本号(通过 get_task_detail 获取)
171
-
172
- 提交后任务即开始持续运行。
173
- ```
174
-
175
- > **重要**:实时同步任务不支持开发状态下的测试运行,提交即为正式部署。
176
-
177
- ### 步骤 8:运维监控
178
-
179
- ```
180
- 提交后在运维中心管理实时同步任务:
181
-
182
- 查看任务状态:get_task_detail
183
- 查看运行记录:list_task_run(注意实时任务为持续运行,不同于离线任务的周期实例)
184
-
185
- Studio UI 中可进行:
186
- - 启动/停止任务
187
- - 查看同步延迟和吞吐量
188
- - 查看错误日志
189
- ```
190
-
191
- ---
192
-
193
- ## 支持的数据源
194
-
195
- ### 来源端
196
-
197
- | 数据源 | ds_type | 说明 |
198
- |--------|---------|------|
199
- | Kafka | 2 | 支持 JSON 消息解析(JSONPath 计算列) |
200
- | MySQL | 5 | CDC 变更捕获 |
201
- | PostgreSQL | 7 | CDC 变更捕获 |
202
- | SQL Server | 8 | CDC 变更捕获 |
203
- | Aurora MySQL | 39 | CDC 变更捕获 |
204
- | Aurora PostgreSQL | 40 | CDC 变更捕获 |
205
- | PolarDB MySQL | 19 | CDC 变更捕获 |
206
- | PolarDB PostgreSQL | 48 | CDC 变更捕获 |
207
-
208
- ### 目标端
209
-
210
- | 数据源 | ds_type |
211
- |--------|---------|
212
- | Lakehouse | 1 |
213
-
214
- ## 故障排除
215
-
216
- | 问题 | 排查方向 |
217
- |------|---------|
218
- | 任务创建失败 | 检查是否有可用的 Sync VCluster(`LH_show_object_list` 查看 VCLUSTERS,筛选 SYNC 类型) |
219
- | 源端连接失败 | 检查数据源配置中的连接信息、网络可达性、账号权限 |
220
- | Kafka 消费无数据 | 检查 Topic 名称是否正确、消费位点设置、Kafka 集群连通性 |
221
- | JSON 解析失败 | 检查 JSONPath 表达式是否正确、消息格式是否为合法 JSON |
222
- | 同步延迟增大 | 检查 Sync VCluster 资源是否充足、源端数据量是否突增 |
223
- | 目标表写入失败 | 检查目标表是否存在、字段类型是否兼容、权限是否充足 |
224
- | 任务异常停止 | 查看执行日志(`list_executions` + `get_execution_log`)排查具体错误 |
225
-
226
- ## 注意事项
227
-
228
- ### 运行模式
229
-
230
- - 实时同步任务为持续运行的流式任务,提交后即开始运行,无需配置调度
231
- - 不支持开发状态下的测试运行
232
- - 停止后需手动重新启动
233
-
234
- ### Sync VCluster 要求
235
-
236
- - 实时同步任务(task_type=14)必须使用 Sync VCluster
237
- - 创建任务前需确认有可用的 Sync VCluster
238
- - 可通过 `LH_show_object_list`(object_type='VCLUSTERS')查看,筛选 vcluster_type 包含 SYNC 的集群
239
-
240
- ### Kafka 数据源特殊说明
241
-
242
- - 支持指定消费起始位点(earliest / latest / 指定 offset)
243
- - JSON 消息可通过 JSONPath 计算列解析嵌套字段
244
- - 默认字段包括:key、value、timestamp、partition、offset
245
-
246
- ### 与多表实时同步的选择
247
-
248
- - 单表实时同步(本 Skill):适合单张表/Topic 的精细化同步
249
- - 多表实时同步(`clickzetta-cdc-sync-pipeline`):适合整库 CDC、多表批量实时同步
250
- - 如需同步整个数据库的所有表,建议使用多表实时同步
251
-
252
- ---
253
-
254
- ## cz-cli 替代路径
255
-
256
- > 仅在 cz-cli 可用且 MCP 不可用时使用本节。步骤编号与上方 MCP 路径对应。
257
- > 所有操作通过 `cz-cli agent run` 委托给内置 agent 完成,agent 内置完整的 Studio MCP 工具访问能力。
258
-
259
- ### 单表实时同步(cz-cli 版)
260
-
261
- **快速路径**:直接创建任务,然后在 Studio UI 配置数据源
262
-
263
- ```bash
264
- # 步骤 1:创建实时同步任务(task_type=14,即 REALTIME/CDC)
265
- cz-cli task create "rt_sync_<table>" --type REALTIME --folder <folder_name>
266
- # 返回 task_id 和 studio_url,在 studio_url 中完成数据源配置和字段映射
267
-
268
- # 步骤 2:配置完成后,发布任务(实时同步无需配置调度,提交即持续运行)
269
- cz-cli task deploy "rt_sync_<table>" -y
270
- ```
271
-
272
- **完整 agent 路径**(需要 agent 完成数据源探查和配置):
273
-
274
- ```bash
275
- # 一键完成:让 agent 完成完整的实时同步任务创建
276
- cz-cli agent run "创建实时同步任务(task_type=14),将数据源 <source_ds_name> 中 <schema>.<table>(或 Kafka topic <topic>)实时同步到 Lakehouse public schema,使用 Sync VCluster,任务名 rt_sync_<table>,放在 <folder_name> 文件夹下" \
277
- --format a2a --dangerously-skip-permissions
278
- ```
279
-
280
- 对于需要精细控制的场景,可拆分步骤:
281
-
282
- ```bash
283
- # 步骤 1:确认 Sync VCluster 可用
284
- cz-cli agent run "列出所有可用的 VCluster,筛选 vcluster_type 包含 SYNC 的集群,确认有可用的 Sync VCluster" \
285
- --format a2a --dangerously-skip-permissions
286
-
287
- # 步骤 2:查找数据源
288
- cz-cli agent run "列出所有已配置的数据源,按类型过滤(Kafka: ds_type=2, MySQL: ds_type=5, PostgreSQL: ds_type=7, SQL Server: ds_type=8),记录源端和目标端 Lakehouse 数据源名称" \
289
- --format a2a --dangerously-skip-permissions
290
-
291
- # 步骤 3(可选):探查源端数据结构
292
- cz-cli agent run "查看数据源 <source_ds_name> 的命名空间列表,以及 <schema> 下的表/Topic 列表和字段结构" \
293
- --format a2a --dangerously-skip-permissions
294
-
295
- # 步骤 4-5:创建并配置实时同步任务
296
- cz-cli agent run "创建实时同步任务(task_type=14),源端 datasource=<source_ds_name>,schema=<schema>,table=<table>(source_ds_type=<type>),目标 Lakehouse public.<table>,任务名 rt_sync_<table>" \
297
- --format a2a --dangerously-skip-permissions
298
-
299
- # 步骤 7:提交部署
300
- cz-cli agent run "提交实时同步任务 rt_sync_<table>,使其开始持续运行" \
301
- --format a2a --dangerously-skip-permissions
302
- ```
303
-
304
- > **注意**:实时同步任务不需要配置调度策略,提交即开始持续运行。Kafka JSON 消息的计算列配置需在 Studio UI 中完成。
305
-
306
- ---
307
-
308
- ### 运维监控(cz-cli 版)
309
-
310
- ```bash
311
- # 查看最近运行记录
312
- cz-cli runs list --task <task_name>
313
-
314
- # 查看运行详情
315
- cz-cli runs detail <run_id>
316
-
317
- # 查看执行日志
318
- cz-cli attempts log <run_id>
319
-
320
- # 下线任务(停止持续运行)
321
- cz-cli task undeploy <task_name> -y
322
- ```
323
-
@@ -1,5 +0,0 @@
1
- {"case_id":"001","type":"should_call","user_input":"怎么用 Studio 创建单表实时同步任务?","expected_skill":"clickzetta-realtime-sync-pipeline","expected_output_contains":["实时同步","task_type","28"]}
2
- {"case_id":"002","type":"should_call","user_input":"Kafka 单个 topic 实时同步到 Lakehouse 表怎么配置?","expected_skill":"clickzetta-realtime-sync-pipeline","expected_output_contains":["Kafka","实时同步"]}
3
- {"case_id":"003","type":"should_call","user_input":"单表实时同步和多表实时同步有什么区别?","expected_skill":"clickzetta-realtime-sync-pipeline","expected_output_contains":["单表","多表","28","281"]}
4
- {"case_id":"004","type":"should_call","user_input":"MySQL 单表 CDC 实时同步到 Lakehouse 怎么做?","expected_skill":"clickzetta-realtime-sync-pipeline","expected_output_contains":["MySQL","实时同步","CDC"]}
5
- {"case_id":"005","type":"should_call","user_input":"实时同步任务需要配置调度策略吗?","expected_skill":"clickzetta-realtime-sync-pipeline","expected_output_contains":["无需配置","持续运行"]}
@@ -1,16 +0,0 @@
1
- ClickZetta Skills License
2
- © 2026 Yunqi Inc. All rights reserved.
3
- LICENSE: Use of these materials (including all code, prompts, assets, files, and other components of these skills (collectively, "Skills")) is governed by your agreement with ClickZetta for the Service. If no separate agreement exists, use is governed by ClickZetta's Terms of Service (available at: https://yunqi.tech/documents/user-aggrement).
4
- Your applicable agreement is referred to as the "Agreement." "Service" is as defined in the Agreement.
5
- ADDITIONAL RESTRICTIONS: Notwithstanding anything in the Agreement to the contrary, you may not:
6
-
7
- Extract from the Service or retain copies of the Skills outside use with the Service;
8
- Reproduce or copy the Skills, except for temporary copies created automatically during authorized use of the Service;
9
- Create derivative works based on the Skills;
10
- Distribute, sublicense, or transfer the Skills to any third party;
11
- Make, offer to sell, sell, or import any inventions embodied in the Skills; nor,
12
- Reverse engineer, decompile, or disassemble the Skills.
13
-
14
- The receipt, viewing, or possession of the Skills does not convey or imply any license or right beyond those expressly granted above.
15
- Yunqi retains all rights, title, and interest in the Skills, including all copyrights, trademarks, patents, and all other applicable intellectual property rights.
16
- THE SKILLS ARE PROVIDED "AS IS," WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SKILLS OR THE USE OR OTHER DEALINGS IN THE SKILLS.