@clickzetta/cz-cli-darwin-arm64 0.3.92 → 0.3.94

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (69) hide show
  1. package/bin/cz-cli +0 -0
  2. package/bin/skills/clickzetta-ai-function/SKILL.md +109 -0
  3. package/bin/skills/clickzetta-ai-function/eval_cases.jsonl +4 -0
  4. package/bin/skills/clickzetta-ai-function/references/ai-function-ddl.md +106 -0
  5. package/bin/skills/clickzetta-batch-sync-pipeline/SKILL.md +124 -124
  6. package/bin/skills/clickzetta-batch-sync-pipeline/eval_cases.jsonl +5 -5
  7. package/bin/skills/clickzetta-bi-connect/SKILL.md +79 -78
  8. package/bin/skills/clickzetta-bi-connect/references/bi-tools.md +56 -56
  9. package/bin/skills/clickzetta-cdc-sync-pipeline/SKILL.md +386 -382
  10. package/bin/skills/clickzetta-cdc-sync-pipeline/eval_cases.jsonl +5 -5
  11. package/bin/skills/clickzetta-data-ingest-pipeline/SKILL.md +73 -212
  12. package/bin/skills/clickzetta-data-science/SKILL.md +57 -56
  13. package/bin/skills/clickzetta-data-science/references/bitmap-profile.md +38 -38
  14. package/bin/skills/clickzetta-data-science/references/data-patterns.md +16 -16
  15. package/bin/skills/clickzetta-data-science/references/setup.md +28 -28
  16. package/bin/skills/clickzetta-data-science/references/stats-functions.md +44 -44
  17. package/bin/skills/clickzetta-data-science/references/write-and-infer.md +22 -22
  18. package/bin/skills/clickzetta-data-science/references/zettapark-api.md +32 -32
  19. package/bin/skills/clickzetta-dw-modeling/SKILL.md +1 -1
  20. package/bin/skills/clickzetta-external-function/SKILL.md +51 -109
  21. package/bin/skills/clickzetta-external-function/eval_cases.jsonl +4 -4
  22. package/bin/skills/clickzetta-external-function/references/external-function-ddl.md +39 -77
  23. package/bin/skills/clickzetta-java-sdk/SKILL.md +49 -48
  24. package/bin/skills/clickzetta-java-sdk/eval_cases.jsonl +12 -12
  25. package/bin/skills/clickzetta-java-sdk/references/bulkload.md +34 -34
  26. package/bin/skills/clickzetta-java-sdk/references/realtime.md +44 -44
  27. package/bin/skills/clickzetta-kafka-ingest-pipeline/SKILL.md +273 -507
  28. package/bin/skills/clickzetta-kafka-ingest-pipeline/references/kafka-pipe-syntax.md +197 -231
  29. package/bin/skills/clickzetta-oss-ingest-pipeline/SKILL.md +231 -304
  30. package/bin/skills/clickzetta-realtime-sync-pipeline/SKILL.md +180 -179
  31. package/bin/skills/clickzetta-realtime-sync-pipeline/eval_cases.jsonl +5 -5
  32. package/bin/skills/clickzetta-semantic-view/SKILL.md +74 -72
  33. package/bin/skills/clickzetta-semantic-view/eval_cases.jsonl +12 -12
  34. package/bin/skills/clickzetta-semantic-view/references/semantic-view-reference.md +75 -75
  35. package/bin/skills/clickzetta-sql-migration/SKILL.md +128 -0
  36. package/bin/skills/clickzetta-sql-migration/eval_cases.jsonl +10 -0
  37. package/bin/skills/clickzetta-sql-migration/references/ddl-reference.md +350 -0
  38. package/bin/skills/clickzetta-sql-migration/references/dml-differences.md +192 -0
  39. package/bin/skills/clickzetta-sql-migration/references/dml-reference.md +279 -0
  40. package/bin/skills/{clickzetta-sql-syntax-guide → clickzetta-sql-migration}/references/dql-reference.md +128 -128
  41. package/bin/skills/clickzetta-sql-migration/references/function-mapping.md +194 -0
  42. package/bin/skills/clickzetta-sql-migration/references/functions-reference.md +372 -0
  43. package/bin/skills/clickzetta-sql-migration/references/implicit-type-conversion.md +143 -0
  44. package/bin/skills/clickzetta-sql-migration/references/migration-databricks.md +260 -0
  45. package/bin/skills/{clickzetta-sql-syntax-guide → clickzetta-sql-migration}/references/migration-snowflake.md +112 -112
  46. package/bin/skills/clickzetta-sql-migration/references/vs-snowflake.md +346 -0
  47. package/bin/skills/clickzetta-sql-migration/references/vs-spark.md +229 -0
  48. package/bin/skills/clickzetta-studio-task-manager/SKILL.md +326 -329
  49. package/bin/skills/clickzetta-table-lineage/SKILL.md +57 -55
  50. package/bin/skills/clickzetta-table-lineage/eval_cases.jsonl +1 -1
  51. package/bin/skills/clickzetta-table-lineage/references/normalize_func.sql +5 -5
  52. package/bin/skills/clickzetta-table-lineage/references/table_cost.sql +6 -6
  53. package/bin/skills/clickzetta-table-lineage/references/table_relation.sql +2 -2
  54. package/bin/skills/clickzetta-volume-manager/SKILL.md +186 -100
  55. package/bin/skills/clickzetta-volume-manager/references/volume-ddl.md +153 -52
  56. package/package.json +1 -1
  57. package/bin/skills/clickzetta-dynamic-table/best-practices/scheduling-guide.md +0 -135
  58. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/dt-declaration-strategy.md +0 -185
  59. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/refresh-history-guide.md +0 -260
  60. package/bin/skills/clickzetta-dynamic-table/dynamic-table-alter/SKILL.md +0 -191
  61. package/bin/skills/clickzetta-sql-syntax-guide/SKILL.md +0 -249
  62. package/bin/skills/clickzetta-sql-syntax-guide/eval_cases.jsonl +0 -3
  63. package/bin/skills/clickzetta-sql-syntax-guide/references/ddl-reference.md +0 -350
  64. package/bin/skills/clickzetta-sql-syntax-guide/references/dml-reference.md +0 -279
  65. package/bin/skills/clickzetta-sql-syntax-guide/references/functions-reference.md +0 -372
  66. package/bin/skills/clickzetta-sql-syntax-guide/references/migration-databricks.md +0 -260
  67. package/bin/skills/clickzetta-sql-syntax-guide/references/vs-snowflake.md +0 -346
  68. package/bin/skills/clickzetta-sql-syntax-guide/references/vs-spark.md +0 -229
  69. /package/bin/skills/{clickzetta-sql-syntax-guide → clickzetta-sql-migration}/LICENSE +0 -0
@@ -1,11 +1,11 @@
1
- # RealtimeStream 实时写入参考
1
+ # RealtimeStream Real-Time Write Reference
2
2
 
3
- > 适合:Kafka 消费写入、高频实时数据接入(秒级可查)、主键表 CDC 写入
3
+ > Best for: Kafka consumption and writes, high-frequency real-time ingestion with second-level queryability, and CDC writes to primary-key tables.
4
4
 
5
- ## Maven 依赖
5
+ ## Maven Dependency
6
6
 
7
7
  ```xml
8
- <!-- 最新版本见 https://central.sonatype.com/artifact/com.clickzetta/clickzetta-java -->
8
+ <!-- See https://central.sonatype.com/artifact/com.clickzetta/clickzetta-java for the latest version. -->
9
9
  <dependency>
10
10
  <groupId>com.clickzetta</groupId>
11
11
  <artifactId>clickzetta-java</artifactId>
@@ -18,23 +18,23 @@
18
18
  </dependency>
19
19
  ```
20
20
 
21
- ## 使用限制
21
+ ## Usage Limits
22
22
 
23
- - 实时写入的数据可以秒级查询
24
- - table stream、dynamic table 需等待约 **1 分钟**才能看到写入数据
25
- - 表结构变更时,需停止任务,变更后约 **90 分钟**重新启动
23
+ - Real-time written data can be queried within seconds.
24
+ - Table Streams and Dynamic Tables need about **1 minute** before they can see the written data.
25
+ - When the table schema changes, stop the task and restart it about **90 minutes** after the schema change.
26
26
 
27
- ## 操作模式
27
+ ## Operation Modes
28
28
 
29
- | 模式 | 适用表 | 可用 Operator |
29
+ | Mode | Target table | Available operators |
30
30
  |---|---|---|
31
- | `RealTimeOperate.APPEND_ONLY` | 普通表 | `Stream.Operator.INSERT` |
32
- | `RealTimeOperate.CDC` | 主键表 | `Stream.Operator.UPSERT`、`Stream.Operator.DELETE_IGNORE` |
31
+ | `RealTimeOperate.APPEND_ONLY` | Regular table | `Stream.Operator.INSERT` |
32
+ | `RealTimeOperate.CDC` | Primary-key table | `Stream.Operator.UPSERT`, `Stream.Operator.DELETE_IGNORE` |
33
33
 
34
- ## 普通表写入(APPEND_ONLY
34
+ ## Write to a Regular Table: APPEND_ONLY
35
35
 
36
36
  ```java
37
- // 推荐:显式参数方式(2.0.0+ 支持,不依赖 URL 解析)
37
+ // Recommended: explicit parameters. Supported in 2.0.0+ and does not depend on URL parsing.
38
38
  ClickZettaClient client = ClickZettaClient.newBuilder()
39
39
  .service("cn-shanghai-alicloud.api.clickzetta.com")
40
40
  .instance("your_instance")
@@ -53,17 +53,17 @@ RealtimeStream stream = client.newRealtimeStreamBuilder()
53
53
  .table("events")
54
54
  .build();
55
55
 
56
- // ⚠️ RealtimeStream 用列名(不是索引)
56
+ // RealtimeStream uses column names, not indexes.
57
57
  Row row = stream.createRow(Stream.Operator.INSERT);
58
58
  row.setValue("id", 1);
59
59
  row.setValue("event", "{\"type\":\"click\"}");
60
60
  stream.apply(row);
61
61
  ```
62
62
 
63
- ## 主键表写入(CDC 模式)
63
+ ## Write to a Primary-Key Table: CDC Mode
64
64
 
65
65
  ```java
66
- // 建表(主键表)
66
+ // Create a primary-key table.
67
67
  // CREATE TABLE orders (`txid` STRING PRIMARY KEY, `amount` DOUBLE, `status` STRING);
68
68
 
69
69
  RealtimeStream stream = client.newRealtimeStreamBuilder()
@@ -73,22 +73,22 @@ RealtimeStream stream = client.newRealtimeStreamBuilder()
73
73
  .table("orders")
74
74
  .build();
75
75
 
76
- // UPSERT:存在则更新,不存在则插入
76
+ // UPSERT: update an existing row or insert a new row.
77
77
  Row row = stream.createRow(Stream.Operator.UPSERT);
78
78
  row.setValue("txid", "order-001");
79
79
  row.setValue("amount", 299.99);
80
80
  row.setValue("status", "paid");
81
81
  stream.apply(row);
82
82
 
83
- // DELETE_IGNORE:删除,目标行不存在时自动忽略
83
+ // DELETE_IGNORE: delete the row and ignore the operation if the target row does not exist.
84
84
  Row delRow = stream.createRow(Stream.Operator.DELETE_IGNORE);
85
85
  delRow.setValue("txid", "order-001");
86
86
  stream.apply(delRow);
87
87
  ```
88
88
 
89
- ## 完整示例:Kafka Lakehouse
89
+ ## Complete Example: Kafka to Lakehouse
90
90
 
91
- ### KafkaReader
91
+ ### KafkaReader Class
92
92
 
93
93
  ```java
94
94
  import org.apache.kafka.clients.consumer.ConsumerConfig;
@@ -119,7 +119,7 @@ public class KafkaReader {
119
119
  }
120
120
  ```
121
121
 
122
- ### Kafka2Lakehouse 主类
122
+ ### Kafka2Lakehouse Main Class
123
123
 
124
124
  ```java
125
125
  import com.clickzetta.client.ClickZettaClient;
@@ -180,33 +180,33 @@ public class Kafka2Lakehouse {
180
180
  }
181
181
  ```
182
182
 
183
- ## 关键 API
183
+ ## Key APIs
184
184
 
185
- | API | 说明 |
185
+ | API | Description |
186
186
  |---|---|
187
- | `realtimeStream.createRow(Stream.Operator.INSERT)` | 普通表插入行 |
188
- | `realtimeStream.createRow(Stream.Operator.UPSERT)` | 主键表 upsert |
189
- | `realtimeStream.createRow(Stream.Operator.DELETE_IGNORE)` | 主键表删除行 |
190
- | `row.setValue(String columnName, Object value)` | 按列名设值(不是索引) |
191
- | `realtimeStream.apply(row)` | 发送行到服务端 |
192
- | `Options.builder().withMutationBufferLinesNum(n)` | 设置缓冲行数(默认 10 |
187
+ | `realtimeStream.createRow(Stream.Operator.INSERT)` | Create an insert row for a regular table. |
188
+ | `realtimeStream.createRow(Stream.Operator.UPSERT)` | Create an upsert row for a primary-key table. |
189
+ | `realtimeStream.createRow(Stream.Operator.DELETE_IGNORE)` | Create a delete row for a primary-key table. |
190
+ | `row.setValue(String columnName, Object value)` | Set a value by column name, not by index. |
191
+ | `realtimeStream.apply(row)` | Send the row to the server. |
192
+ | `Options.builder().withMutationBufferLinesNum(n)` | Set the number of buffered rows. The default is 10. |
193
193
 
194
- ## BulkloadStream vs RealtimeStream 对比
194
+ ## BulkloadStream vs RealtimeStream
195
195
 
196
- | 维度 | BulkloadStream | RealtimeStream |
196
+ | Dimension | BulkloadStream | RealtimeStream |
197
197
  |---|---|---|
198
- | 列设值方式 | `setValue(int index, value)` | `setValue(String name, value)` |
199
- | URL 参数 | `virtualcluster=` | `vcluster=` |
200
- | createRow 参数 | 无参数 | `Stream.Operator.INSERT/UPSERT/DELETE_IGNORE` |
201
- | 适用频率 | 低频(≥5 分钟/批) | 高频(秒级) |
202
- | 数据可见延迟 | close() 后可见 | ~1 分钟后可见 |
203
- | 主键表 | | CDC 模式 |
198
+ | Column value setter | `setValue(int index, value)` | `setValue(String name, value)` |
199
+ | URL parameter | `virtualcluster=` | `vcluster=` |
200
+ | `createRow` argument | No argument | `Stream.Operator.INSERT/UPSERT/DELETE_IGNORE` |
201
+ | Suitable write frequency | Low frequency, >=5 minutes per batch | High frequency, second-level writes |
202
+ | Data visibility latency | Visible after `close()` | Visible after about 1 minute |
203
+ | Primary-key table support | Not supported | Supported in CDC mode |
204
204
 
205
- ## 常见问题
205
+ ## FAQ
206
206
 
207
- | 问题 | 原因 | 解决方案 |
207
+ | Issue | Cause | Solution |
208
208
  |---|---|---|
209
- | 连接失败 | URL 参数名错误 | RealtimeStream `vcluster=`,不是 `virtualcluster=` |
210
- | 列名找不到 | 列名拼写错误 | 列名区分大小写,与建表 DDL 保持一致 |
211
- | 表结构变更后写入失败 | Stream 实例缓存了旧 schema | 停止任务,变更后等约 90 分钟再重启 |
212
- | dynamic table 看不到数据 | 实时写入有 ~1 分钟确认延迟 | 等待 1 分钟后再查询 |
209
+ | Connection fails | Wrong URL parameter name | RealtimeStream uses `vcluster=`, not `virtualcluster=`. |
210
+ | Column name not found | Column name is misspelled | Column names are case-sensitive and must match the table DDL. |
211
+ | Writes fail after a schema change | The old Stream instance cached the old schema | Stop the task and restart it about 90 minutes after the schema change. |
212
+ | Dynamic Table cannot see the data | Real-time writes have about 1 minute of confirmation latency | Query again after about 1 minute. |