@clickzetta/cz-cli-linux-x64 0.3.4 → 0.3.5
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/cz-cli +0 -0
- package/package.json +1 -1
- package/bin/skills/clickzetta-access-control/SKILL.md +0 -243
- package/bin/skills/clickzetta-access-control/references/dynamic-masking.md +0 -86
- package/bin/skills/clickzetta-access-control/references/grant-revoke.md +0 -103
- package/bin/skills/clickzetta-access-control/references/role-management.md +0 -66
- package/bin/skills/clickzetta-access-control/references/user-management.md +0 -61
- package/bin/skills/clickzetta-ai-vector-search/SKILL.md +0 -160
- package/bin/skills/clickzetta-ai-vector-search/references/vector-search.md +0 -155
- package/bin/skills/clickzetta-app-python-sdk/SKILL.md +0 -153
- package/bin/skills/clickzetta-app-python-sdk/references/bulkload.md +0 -196
- package/bin/skills/clickzetta-app-python-sdk/references/connector.md +0 -143
- package/bin/skills/clickzetta-app-python-sdk/references/realtime.md +0 -122
- package/bin/skills/clickzetta-batch-sync-pipeline/SKILL.md +0 -293
- package/bin/skills/clickzetta-bi-connect/SKILL.md +0 -176
- package/bin/skills/clickzetta-bi-connect/references/bi-tools.md +0 -170
- package/bin/skills/clickzetta-cdc-sync-pipeline/SKILL.md +0 -457
- package/bin/skills/clickzetta-concepts/SKILL.md +0 -282
- package/bin/skills/clickzetta-concepts/references/brands-and-endpoints.md +0 -79
- package/bin/skills/clickzetta-concepts/references/object-model.md +0 -311
- package/bin/skills/clickzetta-data-ingest-pipeline/SKILL.md +0 -165
- package/bin/skills/clickzetta-data-lifecycle/SKILL.md +0 -211
- package/bin/skills/clickzetta-data-lifecycle/references/lifecycle-reference.md +0 -175
- package/bin/skills/clickzetta-data-recovery/SKILL.md +0 -215
- package/bin/skills/clickzetta-data-recovery/evals/evals.json +0 -35
- package/bin/skills/clickzetta-data-science/SKILL.md +0 -125
- package/bin/skills/clickzetta-data-science/references/bitmap-profile.md +0 -146
- package/bin/skills/clickzetta-data-science/references/data-patterns.md +0 -110
- package/bin/skills/clickzetta-data-science/references/setup.md +0 -160
- package/bin/skills/clickzetta-data-science/references/stats-functions.md +0 -195
- package/bin/skills/clickzetta-data-science/references/write-and-infer.md +0 -122
- package/bin/skills/clickzetta-data-science/references/zettapark-api.md +0 -156
- package/bin/skills/clickzetta-data-sharing/SKILL.md +0 -160
- package/bin/skills/clickzetta-data-sharing/references/share-ddl.md +0 -134
- package/bin/skills/clickzetta-dba-guide/SKILL.md +0 -540
- package/bin/skills/clickzetta-dw-modeling/SKILL.md +0 -259
- package/bin/skills/clickzetta-dw-modeling/references/modeling-patterns.md +0 -100
- package/bin/skills/clickzetta-dynamic-table/SKILL.md +0 -112
- package/bin/skills/clickzetta-dynamic-table/best-practices/dimension-table-join-guide.md +0 -257
- package/bin/skills/clickzetta-dynamic-table/best-practices/medallion-and-stream-patterns.md +0 -124
- package/bin/skills/clickzetta-dynamic-table/best-practices/non-partitioned-merge-into-warning.md +0 -96
- package/bin/skills/clickzetta-dynamic-table/best-practices/performance-optimization.md +0 -109
- package/bin/skills/clickzetta-dynamic-table/dt-creator/SKILL.md +0 -15
- package/bin/skills/clickzetta-dynamic-table/dt-creator/references/dt-declaration-strategy.md +0 -185
- package/bin/skills/clickzetta-dynamic-table/dt-creator/references/incremental-config-reference.md +0 -429
- package/bin/skills/clickzetta-dynamic-table/dt-creator/references/refresh-history-guide.md +0 -268
- package/bin/skills/clickzetta-dynamic-table/dt-creator/references/sql-limitations.md +0 -80
- package/bin/skills/clickzetta-dynamic-table/dynamic-table-alter/SKILL.md +0 -190
- package/bin/skills/clickzetta-external-catalog/SKILL.md +0 -120
- package/bin/skills/clickzetta-external-catalog/references/external-catalog-ddl.md +0 -130
- package/bin/skills/clickzetta-external-function/SKILL.md +0 -203
- package/bin/skills/clickzetta-external-function/references/external-function-ddl.md +0 -171
- package/bin/skills/clickzetta-file-import-pipeline/SKILL.md +0 -156
- package/bin/skills/clickzetta-index-manager/SKILL.md +0 -140
- package/bin/skills/clickzetta-index-manager/references/bloomfilter-index.md +0 -67
- package/bin/skills/clickzetta-index-manager/references/index-management.md +0 -73
- package/bin/skills/clickzetta-index-manager/references/inverted-index.md +0 -80
- package/bin/skills/clickzetta-index-manager/references/vector-index.md +0 -81
- package/bin/skills/clickzetta-information-schema/SKILL.md +0 -367
- package/bin/skills/clickzetta-information-schema/references/instance-views-reference.md +0 -276
- package/bin/skills/clickzetta-information-schema/references/metering-views-reference.md +0 -137
- package/bin/skills/clickzetta-information-schema/references/views-reference.md +0 -271
- package/bin/skills/clickzetta-java-sdk/SKILL.md +0 -186
- package/bin/skills/clickzetta-java-sdk/references/bulkload.md +0 -163
- package/bin/skills/clickzetta-java-sdk/references/realtime.md +0 -212
- package/bin/skills/clickzetta-kafka-ingest-pipeline/SKILL.md +0 -639
- package/bin/skills/clickzetta-kafka-ingest-pipeline/references/kafka-pipe-syntax.md +0 -324
- package/bin/skills/clickzetta-lakehouse-connect/SKILL.md +0 -218
- package/bin/skills/clickzetta-lakehouse-connect/evals/evals.json +0 -35
- package/bin/skills/clickzetta-lakehouse-connect/references/config-file.md +0 -435
- package/bin/skills/clickzetta-lakehouse-connect/references/jdbc.md +0 -478
- package/bin/skills/clickzetta-lakehouse-connect/references/python-sdk.md +0 -225
- package/bin/skills/clickzetta-lakehouse-connect/references/sqlalchemy.md +0 -468
- package/bin/skills/clickzetta-lakehouse-connect/references/zettapark-session.md +0 -445
- package/bin/skills/clickzetta-manage-comments/SKILL.md +0 -219
- package/bin/skills/clickzetta-metadata-query/SKILL.md +0 -298
- package/bin/skills/clickzetta-metadata-query/references/show-desc-reference.md +0 -326
- package/bin/skills/clickzetta-monitoring/SKILL.md +0 -199
- package/bin/skills/clickzetta-monitoring/references/job-history-analysis.md +0 -97
- package/bin/skills/clickzetta-monitoring/references/show-jobs.md +0 -48
- package/bin/skills/clickzetta-oss-ingest-pipeline/SKILL.md +0 -427
- package/bin/skills/clickzetta-query-optimizer/SKILL.md +0 -156
- package/bin/skills/clickzetta-query-optimizer/references/explain.md +0 -56
- package/bin/skills/clickzetta-query-optimizer/references/hints-and-sortkey.md +0 -78
- package/bin/skills/clickzetta-query-optimizer/references/optimize.md +0 -65
- package/bin/skills/clickzetta-query-optimizer/references/result-cache.md +0 -49
- package/bin/skills/clickzetta-query-optimizer/references/show-jobs.md +0 -42
- package/bin/skills/clickzetta-realtime-sync-pipeline/SKILL.md +0 -197
- package/bin/skills/clickzetta-semantic-view/SKILL.md +0 -207
- package/bin/skills/clickzetta-semantic-view/references/semantic-view-reference.md +0 -167
- package/bin/skills/clickzetta-spark-flink-connector/SKILL.md +0 -92
- package/bin/skills/clickzetta-spark-flink-connector/references/flink.md +0 -147
- package/bin/skills/clickzetta-spark-flink-connector/references/spark.md +0 -132
- package/bin/skills/clickzetta-sql-pipeline-manager/SKILL.md +0 -379
- package/bin/skills/clickzetta-sql-pipeline-manager/evals/evals.json +0 -166
- package/bin/skills/clickzetta-sql-pipeline-manager/references/dynamic-table.md +0 -185
- package/bin/skills/clickzetta-sql-pipeline-manager/references/materialized-view.md +0 -129
- package/bin/skills/clickzetta-sql-pipeline-manager/references/pipe.md +0 -222
- package/bin/skills/clickzetta-sql-pipeline-manager/references/table-stream.md +0 -125
- package/bin/skills/clickzetta-sql-syntax-guide/SKILL.md +0 -172
- package/bin/skills/clickzetta-sql-syntax-guide/references/ddl-reference.md +0 -350
- package/bin/skills/clickzetta-sql-syntax-guide/references/dml-reference.md +0 -279
- package/bin/skills/clickzetta-sql-syntax-guide/references/dql-reference.md +0 -504
- package/bin/skills/clickzetta-sql-syntax-guide/references/functions-reference.md +0 -372
- package/bin/skills/clickzetta-sql-syntax-guide/references/migration-databricks.md +0 -260
- package/bin/skills/clickzetta-sql-syntax-guide/references/migration-snowflake.md +0 -382
- package/bin/skills/clickzetta-sql-syntax-guide/references/vs-snowflake.md +0 -346
- package/bin/skills/clickzetta-sql-syntax-guide/references/vs-spark.md +0 -229
- package/bin/skills/clickzetta-studio-overview/SKILL.md +0 -170
- package/bin/skills/clickzetta-studio-overview/references/studio-modules.md +0 -173
- package/bin/skills/clickzetta-table-stream-pipeline/SKILL.md +0 -206
- package/bin/skills/clickzetta-vcluster-manager/SKILL.md +0 -212
- package/bin/skills/clickzetta-vcluster-manager/references/vc-cache.md +0 -54
- package/bin/skills/clickzetta-vcluster-manager/references/vcluster-ddl.md +0 -150
- package/bin/skills/clickzetta-volume-manager/SKILL.md +0 -292
- package/bin/skills/clickzetta-volume-manager/references/volume-ddl.md +0 -199
- package/bin/skills/clickzetta-zettapark/SKILL.md +0 -248
- package/bin/skills/clickzetta-zettapark/references/zettapark-api.md +0 -283
@@ -1,282 +0,0 @@
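---
name: clickzetta-concepts
description: |
  Introduces the core concepts, object model, and distinctive design of ClickZetta Lakehouse to help users build an accurate mental model.
  Covers: the Account/Instance/Workspace/Schema hierarchy, how a Workspace maps to a Database/Catalog,
  the three VCluster types and CRU billing, the Dynamic Table incremental refresh mechanism, Table Stream CDC,
  the three-layer cache system, continuous ingestion with Pipe, cross-Schema aliases with Synonym, the permission model (RBAC/ACL),
  and the key differences from Snowflake/Databricks.
  Also covers the brand relationship: ClickZetta (technology brand) = 云器 (Yunqi, China) = Singdata (international),
  and the console, API, and JDBC addresses for China (Yunqi) and international (Singdata).
  Triggered when the user asks things like "what is a workspace", "how do Schema and Database relate", "what is a Catalog",
  "difference between an instance and a workspace", "what is a VCluster", "what is a CRU", "difference between managed and external tables",
  "difference between a dynamic table and a materialized view", "what is a Table Stream", "what is a Pipe", "what is a synonym",
  "Lakehouse architecture", "object hierarchy", "permission model", "concept comparison with Snowflake",
  "concept comparison with Databricks", "separation of storage and compute", "CBO incremental computation",
  "what is Yunqi (云器)", "what is Singdata", "how are ClickZetta and Yunqi related",
  "international edition address", "Singdata address", "API address", "console address",
  "what to put in the service parameter", "JDBC address", or "connection address".
  Keywords: concepts, architecture, workspace, schema, instance, VCluster, object model
---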

# ClickZetta Lakehouse Core Concepts

Read [references/object-model.md](references/object-model.md) for the complete object hierarchy and the comparison of differences.
Read [references/brands-and-endpoints.md](references/brands-and-endpoints.md) for the brand relationship (Yunqi/Singdata/ClickZetta) and the service addresses of each environment.

---

## Object Hierarchy Overview

```
Account
└── Instance (service instance)        ← one deployment in a cloud region; resource isolation unit
    └── Workspace                      ← business isolation unit, ≈ Snowflake Database / Databricks Catalog
        ├── Schema                     ← namespace, ≈ traditional Database / Snowflake Schema
        │   ├── Managed Table (internal table): Iceberg · ACID · Time Travel
        │   ├── External Table: Delta/Hudi/Kafka · read-only
        │   ├── View / Dynamic Table / Materialized View
        │   ├── Volume: User/Table/External (OSS/S3/COS)
        │   ├── Table Stream: CDC change capture (ClickZetta-specific)
        │   ├── Pipe: continuous ingestion (Kafka/OSS)
        │   ├── Function / External Function
        │   ├── Index (BloomFilter / inverted / vector)
        │   └── Synonym: cross-Schema alias (ClickZetta-specific)
        ├── Share: cross-account zero-copy sharing
        ├── Connection: Storage/API connections
        └── External Catalog: federated queries over Hive/Iceberg/Databricks
```

---

## Concept Comparison with Other Systems

| ClickZetta | Snowflake | Databricks | Traditional database | Key difference |
|---|---|---|---|---|
| Instance (service instance) | Account | Workspace | Database server | One ClickZetta account can have multiple instances across multiple clouds |
| Workspace | Database | Catalog | Database | Must be specified when connecting |
| Schema | Schema | Schema/Database | Schema | Permission boundary; supports the EXTERNAL type |
| VCluster | Warehouse | SQL Warehouse | N/A | Three types; fine-grained CRU billing |
| Dynamic Table | Dynamic Table | Streaming Table | N/A | CBO-based incremental refresh, not streaming |
| Table Stream | Stream | N/A | N/A | change_tracking must be enabled first |
| Pipe | Pipe | Auto Loader | N/A | Each Pipe maps to its own dedicated Volume |
| Volume | Stage | External Location | N/A | Three subtypes: User/Table/External |
| Share | Share | Delta Sharing | N/A | Zero-copy across instances; no storage charge for the consumer |
| Synonym | N/A | N/A | Synonym | Supports cross-Schema aliases |
| CRU | Credit | DBU | N/A | Unified compute unit across clouds |
| **Studio (built in)** | **Requires third-party tools** | **Requires third-party tools** | **N/A** | **Built-in scheduling, integration, data quality, and catalog** |

> For details on Studio, see the [clickzetta-studio skill](../clickzetta-studio/SKILL.md)

---

## ClickZetta-Specific Concepts in Detail

### 1. CRU (Compute Resource Unit): a unified compute unit across clouds

CRU is ClickZetta's abstraction over IaaS compute resources; **it provides consistent compute capacity across cloud platforms and CPU architectures**.

- Billing unit: CRU × hours (cluster running time)
- A stopped cluster is not billed; the minimum auto-stop interval is 15 seconds
- The old size codes (XS/S/M/L/XL) have been migrated to numbers (1/2/4/8/16 CRU)

```sql
-- Create a 4 CRU general-purpose cluster
CREATE VCLUSTER my_gp TYPE GENERAL SIZE 4;

-- Create an analytics cluster (elastic, 1-4 replicas, 8 CRU per replica)
CREATE VCLUSTER my_ap TYPE ANALYTICS SIZE 8 MIN_INSTANCE 1 MAX_INSTANCE 4;
```
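
The "CRU × hours" billing unit can be illustrated with a small calculation. This is a minimal sketch, not the platform's metering logic: the function name `cru_hours` is hypothetical, and it assumes usage accrues linearly with running time and that each running replica counts at the full cluster size. Rounding and metering granularity are not stated in the text, so none is applied.

```python
# Old size codes and their numeric CRU equivalents, as listed in the text.
LEGACY_SIZE_TO_CRU = {"XS": 1, "S": 2, "M": 4, "L": 8, "XL": 16}


def cru_hours(size_cru: float, running_seconds: float, replicas: int = 1) -> float:
    """Return CRU-hours consumed by a cluster that ran for `running_seconds`.

    Assumptions (not from the documentation): linear accrual over time,
    and every replica is counted at the full `size_cru`.
    """
    if size_cru <= 0 or running_seconds < 0 or replicas < 1:
        raise ValueError("size must be positive, time non-negative, replicas >= 1")
    return size_cru * replicas * running_seconds / 3600.0


# A 4 CRU cluster that ran for 30 minutes.
half_hour_usage = cru_hours(LEGACY_SIZE_TO_CRU["M"], 30 * 60)
```

Multiplying the result by a unit price (not given here) would yield a cost estimate; a stopped cluster contributes zero running seconds and therefore zero CRU-hours.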
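
### 2. Three VCluster types: different clusters for different scenarios

| Type | Elasticity | Size step | Use cases | Special capability |
|---|---|---|---|---|
| General-purpose (GENERAL) | Vertical (resize) | 1 CRU | ETL, batch loading, ad hoc | Automatic small-file compaction (recommended for Dynamic Table) |
| Analytics (ANALYTICS) | Horizontal (1-10 replicas) | 2^n CRU | High-concurrency BI, online queries | Local cache via PRELOAD_TABLES |
| Integration (INTEGRATION) | N/A | 0.25 CRU | Data integration, CDC sync | Small sizes starting at 0.25 CRU |

**Key difference**: Dynamic Table must run on a general-purpose (GP) cluster; analytics (AP) clusters do not support small-file compaction, which leads to file fragmentation.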

### 3. Dynamic Table: declarative incremental computation (not streaming)

Dynamic Table is one of ClickZetta's signature features: **you declare the transformation logic in SQL, and the system decides automatically between incremental and full computation**.

Key differences from Snowflake Dynamic Table:
- ClickZetta uses the **CBO (Cost-Based Optimizer)** to choose adaptively between incremental and full algorithms
- `CREATE OR REPLACE` preserves data and privileges (Snowflake clears the data)
- The minimum refresh interval is **1 minute** (not second-level streaming)
- A **GP cluster** is recommended (AP clusters do not support small-file compaction)

```sql
-- Declarative incremental computation; the system handles the incremental logic automatically
CREATE OR REPLACE DYNAMIC TABLE dws_order_daily
  REFRESH interval 5 MINUTE
  VCLUSTER default_gp
AS
SELECT date_trunc('day', order_time) AS dt,
       SUM(amount) AS total_amount,
       COUNT(*) AS order_cnt
FROM ods_orders
GROUP BY 1;

-- Trigger a refresh manually
REFRESH DYNAMIC TABLE dws_order_daily;
```
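
When such statements are generated from code, the 1-minute minimum refresh interval can be checked before anything is sent to the server. The helper below is a hypothetical sketch: `dynamic_table_ddl` is not part of any ClickZetta SDK, it only renders text in the same shape as the statement above, and it does not quote or escape identifiers, so it should only receive trusted names.

```python
def dynamic_table_ddl(name: str, select_sql: str, interval_minutes: int, vcluster: str) -> str:
    """Render a CREATE OR REPLACE DYNAMIC TABLE statement as a string.

    Hypothetical helper: the clause layout mirrors the example in the text;
    identifiers are interpolated as-is (no quoting or escaping).
    """
    if interval_minutes < 1:
        # The text states a minimum refresh interval of 1 minute.
        raise ValueError("refresh interval must be at least 1 minute")
    body = select_sql.strip().rstrip(";")
    return (
        f"CREATE OR REPLACE DYNAMIC TABLE {name}\n"
        f"  REFRESH interval {interval_minutes} MINUTE\n"
        f"  VCLUSTER {vcluster}\n"
        f"AS\n"
        f"{body};"
    )


ddl = dynamic_table_ddl(
    "dws_order_daily",
    "SELECT date_trunc('day', order_time) AS dt, SUM(amount) AS total_amount "
    "FROM ods_orders GROUP BY 1",
    interval_minutes=5,
    vcluster="default_gp",
)
```

Rejecting sub-minute intervals on the client side keeps the failure close to the configuration that caused it.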

### 4. Table Stream: a CDC change-capture object

A Table Stream is a SQL object that records a table's DML changes (INSERT/UPDATE/DELETE); **the offset advances automatically after the stream is consumed**.

Key differences from Snowflake Stream:
- **`change_tracking` must be enabled first** (Snowflake does not require this)
- Data written in real time becomes readable by the Stream only after a **1-minute** wait
- Streams can be created on a Table, Dynamic Table, Materialized View, or Kafka external table

```sql
-- Step 1: enable change tracking (required)
ALTER TABLE orders SET PROPERTIES ('change_tracking' = 'true');

-- Step 2: create the Stream
CREATE TABLE STREAM orders_stream
  ON TABLE orders
  WITH PROPERTIES ('TABLE_STREAM_MODE' = 'STANDARD', 'SHOW_INITIAL_ROWS' = 'FALSE');

-- Consume the Stream (the offset advances automatically after the DML operation)
INSERT INTO orders_summary
SELECT __change_type, order_id, amount
FROM orders_stream
WHERE __change_type = 'INSERT';
```

Stream metadata fields:
- `__change_type`: INSERT / UPDATE_BEFORE / UPDATE_AFTER / DELETE
- `__commit_version`: commit version number
- `__commit_timestamp`: commit timestamp

### 5. Pipe: a continuous ingestion object

A Pipe is a SQL object that continuously and automatically loads data from Kafka or object storage into a table.

**ClickZetta-specific restriction**: each Pipe must have its own dedicated Volume; a Volume cannot be reused across Pipes.

```sql
-- Continuous ingestion from OSS (LIST_PURGE scan mode)
CREATE PIPE oss_orders_pipe
  VIRTUAL_CLUSTER = 'default_gp'
  INGEST_MODE = 'LIST_PURGE'
AS COPY INTO orders
FROM VOLUME my_oss_volume
USING CSV OPTIONS('header'='true');

-- Continuous ingestion from Kafka
CREATE PIPE kafka_events_pipe
  VIRTUAL_CLUSTER = 'default_gp'
  BATCH_INTERVAL_IN_SECONDS = '60'
  BATCH_SIZE_PER_KAFKA_PARTITION = '500000'
AS COPY INTO events
FROM (SELECT * FROM READ_KAFKA(...));
```
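
The one-Volume-per-Pipe restriction is easy to violate when many Pipes are defined in configuration. The following is a minimal sketch of a pre-deployment check; the function name and the `pipe name -> volume name` mapping are assumptions made for illustration, not something the platform provides.

```python
from collections import defaultdict


def find_shared_volumes(pipe_to_volume: dict) -> dict:
    """Return every volume referenced by more than one pipe.

    `pipe_to_volume` maps a pipe name to the volume it reads from
    (a hypothetical planning structure, not a platform API).
    The result maps each offending volume to its sorted pipe names.
    """
    pipes_by_volume = defaultdict(list)
    for pipe, volume in pipe_to_volume.items():
        pipes_by_volume[volume].append(pipe)
    return {
        volume: sorted(pipes)
        for volume, pipes in pipes_by_volume.items()
        if len(pipes) > 1
    }


planned = {
    "oss_orders_pipe": "my_oss_volume",
    "oss_refunds_pipe": "my_oss_volume",  # reuses the same Volume
    "oss_users_pipe": "users_volume",
}
conflicts = find_shared_volumes(planned)
```

An empty result means every planned Pipe has its own Volume.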

### 6. Three-layer cache system

ClickZetta has three independent caches; understanding them is essential for performance tuning:

| Cache type | Scope | Applicable clusters | Notes |
|---|---|---|---|
| Result Cache (query result cache) | Shared within the workspace | GP + AP | An identical SQL statement returns the cached result directly |
| Metadata Cache | Shared within the workspace | GP + AP | Caches table schemas and partition information |
| Local Disk Cache | Local nodes of the cluster | GP + AP | Caches hot data files; released when the cluster stops |

**Proactive caching** (AP clusters only): when the cluster starts, it automatically preloads the latest data of the specified tables.

```sql
-- Set the tables an AP cluster preloads (cached automatically when the cluster starts)
ALTER VCLUSTER my_ap SET PRELOAD_TABLES = "sales.orders,sales.products";

-- Check cache status
SHOW PRELOAD CACHED STATUS;
SHOW EXTENDED PRELOAD CACHED STATUS;
```

### 7. Synonym: cross-Schema aliases

Synonym is a ClickZetta-specific object type that creates cross-Schema aliases for tables, Streams, Dynamic Tables, Materialized Views, Volumes, and functions, **without copying data**.

```sql
-- Create an alias in schema_b for a table in schema_a
CREATE SYNONYM schema_b.orders_alias FOR schema_a.orders;

-- Query the alias directly (the data always matches the source table)
SELECT * FROM schema_b.orders_alias;

-- Volume synonym (the VOLUME keyword is required)
CREATE VOLUME SYNONYM my_schema.vol_alias FOR data_schema.raw_volume;

-- Function synonym (the FUNCTION keyword is required)
CREATE FUNCTION SYNONYM my_schema.fn_alias FOR data_schema.my_function;
```

### 8. Sort Key smart recommendations (Auto Index)

ClickZetta automatically analyzes query history and recommends the best sort columns (Sort Key); the recommendations can be viewed through `information_schema.sortkey_candidates`.

```sql
-- Enable automatic analysis (collected daily; analyzes jobs from the last 150 minutes)
ALTER WORKSPACE quick_start SET PROPERTIES ('auto_index' = 'day');

-- View the recommendations
SELECT table_name, col, statement, ratio
FROM information_schema.sortkey_candidates
ORDER BY ratio DESC;

-- Apply a recommendation (run the SQL from the statement column)
ALTER TABLE sales.orders SET PROPERTIES ("hint.sort.columns" = "order_date");
```

---

## Core Concepts in Detail

### Workspace

The **business isolation unit** and the main boundary for day-to-day operations.

- Must be specified when connecting (equivalent to a Snowflake Database / Databricks Catalog)
- Each Workspace has its own users and roles, VClusters, task scheduling, and INFORMATION_SCHEMA
- Three-part SQL names: `workspace_name.schema_name.table_name`
- Switch the default Schema with `USE SCHEMA`

### Schema

A **namespace** and the boundary for granting privileges.

- Types: `MANAGED` (platform-managed storage) / `EXTERNAL` (external data lake path)
- A Workspace can contain multiple Schemas
- Privileges can be granted in bulk on an entire Schema: `GRANT SELECT ON ALL TABLES IN SCHEMA my_schema TO ROLE ...`

### Key Points of the Permission Model

- **No superuser**: every operation must be explicitly authorized; permission checks cannot be bypassed
- **Instance roles and workspace roles are independent of each other**: `instance_admin` cannot operate on workspace data directly
- **Custom roles exist only at the workspace level**: instance-level custom roles are not supported, and custom roles can only be created through SQL
- **New users have no privileges by default**: after joining an instance they receive the `instance_user` role and must be explicitly granted a workspace role before they can work with data

```sql
-- A new user must be granted a workspace role after joining
GRANT ROLE workspace_analyst TO USER new_user;

-- Create a custom role (SQL only; not supported in the web UI)
CREATE ROLE data_engineer;
GRANT SELECT, INSERT ON ALL TABLES IN SCHEMA dw TO ROLE data_engineer;
GRANT ROLE data_engineer TO USER alice;
```
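
The separation between instance roles and workspace roles can be pictured with a deliberately coarse toy model. This is only a sketch of the rule described above, not how the platform evaluates privileges: the function name is hypothetical, and it treats any role that is not one of the two instance roles as a workspace-level role, without looking at object-level grants.

```python
# Instance-level roles named in the text; they carry no data privileges.
INSTANCE_ROLES = {"instance_admin", "instance_user"}


def can_touch_workspace_data(roles) -> bool:
    """Toy check: data access needs at least one workspace-level role.

    Assumption: every role outside INSTANCE_ROLES is a workspace role,
    which follows from custom roles existing only at the workspace level.
    Object-level grants (SELECT, INSERT, ...) are ignored here.
    """
    return any(role not in INSTANCE_ROLES for role in roles)


# A freshly added user versus the same user after GRANT ROLE workspace_analyst.
new_user_ok = can_touch_workspace_data({"instance_user"})
granted_user_ok = can_touch_workspace_data({"instance_user", "workspace_analyst"})
admin_only_ok = can_touch_workspace_data({"instance_admin"})
```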
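
---

## Key Points of the Storage Architecture

- **Separation of storage and compute**: a stopped VCluster incurs no compute charges; storage is billed separately per GiB
- **Open format**: managed tables are based on Apache Iceberg and can be read directly by Spark/Trino
- **Multi-cloud, multi-region**: Alibaba Cloud Shanghai; Tencent Cloud Shanghai/Beijing/Guangzhou; AWS Beijing
- **Bring your own storage (BYOS)**: data can be stored in your own OSS/S3/COS account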
@@ -1,79 +0,0 @@
# ClickZetta Brand Relationship and Service Addresses

## Brand Relationship

ClickZetta is the technology brand name; the same product is offered under different brands in different markets:

| Brand | Market | Website | Documentation |
|---|---|---|---|
| **云器 (Yunqi)** | China | www.yunqi.tech | www.yunqi.tech/documents |
| **Singdata** | International | www.singdata.com | www.singdata.com/documents |
| **ClickZetta** | Technology brand (used everywhere) | N/A | N/A |

> **Yunqi Lakehouse = ClickZetta Lakehouse = Singdata Lakehouse**: all three refer to the same product.
> When a user mentions "云器" (Yunqi), "Singdata", or "ClickZetta", they mean the same Lakehouse platform.

---

## China (Yunqi) Service Addresses

Console: `https://<instance_name>.app.clickzetta.com`

JDBC URL format: `jdbc:clickzetta://<instance_name>.<region_code>.api.clickzetta.com/<workspace>`

| Cloud provider | Region | Region Code | API address |
|---|---|---|---|
| Alibaba Cloud | Shanghai | `cn-shanghai-alicloud` | `<instance>.cn-shanghai-alicloud.api.clickzetta.com` |
| Alibaba Cloud | Hangzhou | `cn-hangzhou-alicloud` | `<instance>.cn-hangzhou-alicloud.api.clickzetta.com` |
| Alibaba Cloud | Beijing | `cn-beijing-alicloud` | `<instance>.cn-beijing-alicloud.api.clickzetta.com` |
| Tencent Cloud | Shanghai | `cn-shanghai-tencentcloud` | `<instance>.cn-shanghai-tencentcloud.api.clickzetta.com` |
| Huawei Cloud | Shanghai | `cn-shanghai-huaweicloud` | `<instance>.cn-shanghai-huaweicloud.api.clickzetta.com` |

---

## International (Singdata) Service Addresses

Account console: `https://accounts.app.singdata.com` or `https://<account_name>.accounts.app.singdata.com`

Instance console: `https://<instance_name>.app.singdata.com`

Workspace list: `https://<instance_name>.app.lakehouse.singdata.com/workspace`

JDBC URL format: `jdbc:clickzetta://<instance_name>.<region_code>.api.singdata.com/<workspace>`

Streaming API Host: `<instance_name>.streamingapi.singdata.com`

| Cloud provider | Region | Region Code | API address |
|---|---|---|---|
| Alibaba Cloud | Singapore | `ap-southeast-1-alicloud` | `<instance>.ap-southeast-1-alicloud.api.singdata.com` |
| Amazon Web Services | Singapore | `ap-southeast-1-aws` | `<instance>.ap-southeast-1-aws.api.singdata.com` |

---

## Address Format in SDK / Connection Parameters

For the Python SDK (`clickzetta-connector-python`), set the `service` parameter to the API address (without the `jdbc:clickzetta://` prefix and without the instance name):

```python
# China (Yunqi)
conn = connect(service='cn-shanghai-alicloud.api.clickzetta.com', instance='your_instance', ...)

# International (Singdata)
conn = connect(service='ap-southeast-1-alicloud.api.singdata.com', instance='your_instance', ...)
```
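
The relationship between the `service` value and the JDBC URL can be made explicit with two small string builders. This is a sketch under stated assumptions: the function names and the `"yunqi"` / `"singdata"` brand keys are invented for illustration, and the formats are simply the two patterns quoted in this document.

```python
# API domain per brand, taken from the address formats in this document.
API_DOMAINS = {
    "yunqi": "api.clickzetta.com",    # China
    "singdata": "api.singdata.com",   # international
}


def service_address(brand: str, region_code: str) -> str:
    """Build the SDK `service` value: region code plus API domain, no instance name."""
    return f"{region_code}.{API_DOMAINS[brand]}"


def jdbc_url(brand: str, instance: str, region_code: str, workspace: str) -> str:
    """Build a JDBC URL: the instance name is prepended to the service address."""
    return f"jdbc:clickzetta://{instance}.{service_address(brand, region_code)}/{workspace}"


cn_service = service_address("yunqi", "cn-shanghai-alicloud")
intl_jdbc = jdbc_url("singdata", "your_instance", "ap-southeast-1-aws", "my_workspace")
```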

The `.service()` parameter of the Java SDK (`clickzetta-java`) works the same way:

```java
// China (Yunqi)
ClickZettaClient.newBuilder()
    .service("cn-shanghai-alicloud.api.clickzetta.com")
    .instance("your_instance")
    ...

// International (Singdata)
ClickZettaClient.newBuilder()
    .service("ap-southeast-1-alicloud.api.singdata.com")
    .instance("your_instance")
    ...
```
@@ -1,311 +0,0 @@
# ClickZetta Lakehouse Object Model: Complete Reference

> Source: official product documentation at yunqi.tech
> Reference: clickzetta-lakehouse-architecture.html

---

## ClickZetta-Specific Concepts: Quick Reference

| Concept | What is distinctive | Common pitfalls |
|---|---|---|
| CRU | Unified compute unit across clouds; the old sizes XS/S/M/L have been migrated to the numbers 1/2/4/8 | Not a Snowflake Credit and not a DBU |
| Three VCluster types | GP/AP/Integration each fit different scenarios; Dynamic Table must use GP | AP clusters do not support small-file compaction |
| Dynamic Table | The CBO adaptively chooses incremental or full refresh; `OR REPLACE` preserves data | Minimum interval is 1 minute; not second-level streaming |
| Table Stream | Requires `ALTER TABLE SET PROPERTIES ('change_tracking'='true')` first | Data written in real time is readable only after 1 minute |
| Pipe | Each Pipe maps to its own dedicated Volume, which cannot be reused | Not Snowflake Snowpipe; no automatic triggering |
| Synonym | Supports cross-Schema aliases; VOLUME/FUNCTION synonyms need the keyword stated explicitly | Not a view; does not copy data |
| Permission model | No superuser; instance roles and workspace roles are independent of each other | instance_admin cannot operate on workspace data directly |
| Workspace | Must be specified when connecting; ≈ Snowflake Database | Not a Databricks Workspace (that one is instance-level) |
| Schema TYPE | MANAGED (internally managed) / EXTERNAL (external data lake) | EXTERNAL Schema does not support DML |

---

## Complete Object Hierarchy

```
Account
│   Globally unique · SSO/MFA · Real-name verification
│
└── Instance (service instance)
    │   Resource isolation · Multi-cloud, multi-region · Instance Role
    │
    └── Workspace
        │   Business isolation · Workspace Role · VCluster binding · Task scheduling
        │
        ├── Schema (database/namespace)
        │   │   MANAGED / EXTERNAL type
        │   │
        │   ├── Managed Table (internal table): Iceberg · ACID · Time Travel · Indexes
        │   ├── External Table: Delta/Hudi/Kafka · read-only
        │   ├── View: virtual · no storage
        │   ├── Dynamic Table: declarative incremental refresh
        │   ├── Materialized View: precomputed · scheduled refresh
        │   ├── Volume: User/Table/External (OSS/S3/COS)
        │   ├── Table Stream: CDC change capture
        │   ├── Pipe: continuous ingestion from Kafka/OSS
        │   ├── Function / External Function: SQL UDF / Python / Java
        │   ├── Index: BloomFilter / Inverted / Vector (HNSW)
        │   └── Synonym: cross-Schema alias
        │
        ├── Share: cross-account zero-copy data sharing
        ├── Connection: Storage (OSS/COS/S3) / API (cloud functions)
        └── External Catalog: Hive HMS / Iceberg REST / Databricks Unity
```

---

## Workspace in Detail

### Core Role

A Workspace is the **smallest unit of business isolation** in ClickZetta and the object that must be specified when connecting.

- Equivalent to a Snowflake **Database** or a Databricks **Catalog**
- Each Workspace has its own users and roles, VClusters, task scheduling, and INFORMATION_SCHEMA
- The `workspace` field in the connection parameters specifies this object

### Management Commands

```sql
-- List all workspaces (requires instance_admin)
SHOW WORKSPACES;

-- Show workspace details
DESC WORKSPACE my_workspace;

-- Change the comment
ALTER WORKSPACE my_workspace SET COMMENT 'Production environment';

-- Show properties
SHOW PROPERTIES IN WORKSPACE my_workspace;
```

### DESC WORKSPACE Output Fields

| Field | Description |
|---|---|
| name | Workspace name |
| creator | Creator |
| created_time | Creation time |
| last_modified_time | Last modification time |
| comment | Comment |

---

## Schema in Detail

### Core Role

A Schema is a **namespace** in ClickZetta, used to organize data objects.

- Equivalent to a **Database** or **Schema** in traditional databases (note: the name differs between systems)
- The boundary for granting privileges (privileges can be granted on an entire Schema)
- Types: `MANAGED` (platform-managed storage) / `EXTERNAL` (external data lake path)

### Management Commands

```sql
-- Create a Schema
CREATE SCHEMA my_schema;

-- Create an external Schema (pointing to an external data lake)
CREATE EXTERNAL SCHEMA ext_schema LOCATION 'oss://bucket/path/';

-- Switch the default Schema
USE SCHEMA my_schema;

-- List all Schemas
SHOW SCHEMAS;

-- Show Schema details
DESC SCHEMA my_schema;

-- Alter a Schema
ALTER SCHEMA my_schema RENAME TO new_schema;
ALTER SCHEMA my_schema SET COMMENT 'Data warehouse layer';

-- Drop a Schema (its objects must be dropped first)
DROP SCHEMA my_schema;
DROP SCHEMA IF EXISTS my_schema CASCADE;  -- cascade: also drops all contained objects
```

---

## VCluster (Compute Cluster) in Detail

### Comparison of the Three Types

| Attribute | General-purpose (GENERAL) | Analytics (ANALYTICS) | Integration (INTEGRATION) |
|---|---|---|---|
| Use cases | ETL, batch loading, ad hoc | High-concurrency BI, online queries | Data integration, CDC sync |
| Elasticity | Vertical (resize) | Horizontal (1-10 replicas) | N/A |
| Minimum size | 1 CRU | 1 CRU | 0.25 CRU |
| Maximum size | 256 CRU | 256 CRU | 256 CRU |
| Size step | 1 CRU | 2^n CRU | 0.25 CRU |
| Local cache | Not supported | Supported (PRELOAD) | Not supported |
| Small-file compaction | Supported (recommended for Dynamic Table) | Not supported | N/A |
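
The minimum, maximum, and step rows of the table can be turned into a quick validity check for a requested size. This is a minimal sketch with a hypothetical function name; it assumes that a "1 CRU step" means whole-number sizes, that "2^n" means powers of two starting at 2^0 = 1, and that a "0.25 CRU step" means multiples of 0.25.

```python
def is_valid_vcluster_size(cluster_type: str, size: float) -> bool:
    """Check a requested CRU size against the min/max/step rules in the table.

    Assumed reading of the table (not confirmed by the platform):
      GENERAL      whole numbers from 1 to 256
      ANALYTICS    powers of two from 1 to 256
      INTEGRATION  multiples of 0.25 from 0.25 to 256
    """
    kind = cluster_type.upper()
    if kind == "GENERAL":
        return 1 <= size <= 256 and float(size).is_integer()
    if kind == "ANALYTICS":
        if not float(size).is_integer():
            return False
        n = int(size)
        # A positive integer is a power of two when it has a single set bit.
        return 1 <= n <= 256 and n & (n - 1) == 0
    if kind == "INTEGRATION":
        return 0.25 <= size <= 256 and float(size * 4).is_integer()
    raise ValueError(f"unknown cluster type: {cluster_type}")


checks = {
    ("GENERAL", 3): is_valid_vcluster_size("GENERAL", 3),
    ("ANALYTICS", 3): is_valid_vcluster_size("ANALYTICS", 3),
    ("INTEGRATION", 0.75): is_valid_vcluster_size("INTEGRATION", 0.75),
}
```

Under these assumptions a size of 3 CRU is acceptable for a general-purpose cluster but not for an analytics cluster, which would need 2 or 4.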

### Task Types and Matching Clusters

| Task type | Recommended cluster |
|---|---|
| SQL ETL / batch loading | General-purpose |
| Ad hoc queries / BI | Analytics |
| Dynamic Table (low frequency, large volume) | General-purpose |
| Dynamic Table (high frequency, small volume) | Analytics |
| Offline sync / real-time sync / CDC | Integration |
| Python / Shell / JDBC tasks | Does not use a VCluster |

### Management Commands

```sql
-- Create a general-purpose cluster
CREATE VCLUSTER my_gp TYPE GENERAL SIZE 4;

-- Create an analytics cluster (elastic, 1-4 replicas)
CREATE VCLUSTER my_ap TYPE ANALYTICS SIZE 8 MIN_INSTANCE 1 MAX_INSTANCE 4;

-- Start / stop
ALTER VCLUSTER my_gp RESUME;
ALTER VCLUSTER my_gp SUSPEND;

-- List all clusters
SHOW VCLUSTERS;
```

---

## Users and the Permission Model

### User Hierarchy

```
Global User (account-level user)
│   Managed at the account level; user_name is globally unique
│
└── Instance User
    │   Global users are synced automatically and receive the instance_user role by default (no data privileges)
    │
    └── Workspace User
            Can operate on data only after being granted a workspace role via GRANT ROLE
```

### User Types

| Type | Description |
|---|---|
| Regular user | Represents a real person; can log in through the web UI |
| System service user | Built into the platform; disabled by default (for example, sysservice_auto_mv) |
| Custom service user | For automated programs; cannot log in through the web UI; can use JDBC |

### Predefined Roles

| Role | Level | Scope of privileges |
|---|---|---|
| instance_admin | Instance | Manages all workspaces, users, and External Catalogs |
| instance_user | Instance | Default role; no data privileges |
| workspace_admin | Workspace | Manages all objects and users in the workspace |
| workspace_dev | Workspace | Read/write privileges plus task management |
| workspace_analyst | Workspace | Read-only privileges |

### Grant Commands

```sql
-- Grant a role to a user
GRANT ROLE workspace_dev TO USER alice;

-- Grant table privileges
GRANT SELECT ON TABLE my_schema.my_table TO ROLE analyst_role;
GRANT SELECT ON ALL TABLES IN SCHEMA my_schema TO ROLE analyst_role;

-- Grant query privileges on information_schema
GRANT ALL ON ALL VIEWS IN SCHEMA information_schema TO ROLE analyst_role;

-- Revoke a privilege
REVOKE SELECT ON TABLE my_schema.my_table FROM ROLE analyst_role;

-- Create a custom role (workspace level only; SQL only)
CREATE ROLE my_custom_role;
```

---

## Data Types Quick Reference

| Category | Types |
|---|---|
| Integer | TINYINT / SMALLINT / INT / BIGINT |
| Floating point / decimal | FLOAT / DOUBLE / DECIMAL(p,s) |
| String | CHAR(n) / VARCHAR(n) / STRING (max 16 MB) |
| Date and time | DATE / TIMESTAMP (with time zone, LTZ) / TIMESTAMP_NTZ / INTERVAL |
| Boolean | BOOLEAN |
| Complex | ARRAY\<T\> / MAP\<K,V\> / STRUCT\<field:type,...\> |
| AI-specific | VECTOR(FLOAT, n) (up to 65535 dimensions) / VECTOR(TINYINT, n) |
| Special | JSON / BINARY / BITMAP (Roaring Bitmap) |

---

## Platform Architecture Layers

```
Client layer: Studio IDE · JDBC/ODBC · Python SDK · ZettaPark · BI tools · MCP Server
        ↓
Compute layer: VCluster (GENERAL / ANALYTICS / INTEGRATION)
        ↓
Service layer: SQL parsing and optimization · Vectorized execution engine · Dynamic Table · AI Gateway · Result Cache
        ↓
Storage layer: Managed tables (Iceberg) · External tables · Volume · Time Travel · External Catalog · Share
        ↓
Underlying object storage: Alibaba Cloud OSS · AWS S3 · Tencent Cloud COS
```

**Separation of storage and compute**: the compute and storage layers scale independently; a stopped VCluster incurs no compute charges, and storage is billed per GiB.

---

## Side-by-Side Comparison of Data Objects

### Dynamic Table vs Materialized View vs View

| Dimension | Dynamic Table | Materialized View | View |
|---|---|---|---|
| Data storage | Yes (materialized) | Yes (materialized) | No (virtual) |
| Refresh method | Automatic incremental/full (decided by the CBO) | Manual or scheduled full refresh | Executed on every query |
| Minimum refresh interval | 1 minute | No limit (manual) | N/A |
| Time Travel | Supported | Not supported | Not supported |
| UNDROP | Supported | Not supported | Not supported |
| CREATE OR REPLACE | Supported (preserves data and privileges) | Supported | Supported |
| Recommended cluster | GP (general-purpose) | GP or AP | N/A |
| Use cases | Real-time ETL, multi-level cascades | BI acceleration, fixed aggregations | Encapsulating simple logic |

### The Two Table Stream Modes

| Mode | What is captured | Typical use |
|---|---|---|
| STANDARD | INSERT + UPDATE_BEFORE + UPDATE_AFTER + DELETE | CDC UPSERT, consumed with MERGE INTO |
| APPEND_ONLY | INSERT only | Log appends, simple ETL |

**Delta semantics of STANDARD mode**: the stream records the net change between two offsets. If a row is INSERTed and then DELETEd, it disappears from the delta (no separate INSERT and DELETE records appear).
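
The net-change idea can be simulated in a few lines. This is an illustrative sketch, not the platform's implementation: the function and its `(operation, key, value)` input format are hypothetical, and only the insert-then-delete case is stated in the text; how the other cases collapse (for example, insert-then-update yielding a single INSERT) is an assumption about what "net change" means.

```python
def net_delta(initial: dict, changes: list) -> list:
    """Collapse a change log into net change records between two offsets.

    initial: key -> value snapshot at the starting offset.
    changes: (op, key, value) tuples in commit order, op in
             {"INSERT", "UPDATE", "DELETE"} (hypothetical input format).
    Returns (change_type, key, value) records using the stream's change types.
    """
    final = dict(initial)
    for op, key, value in changes:
        if op == "DELETE":
            final.pop(key, None)
        else:  # INSERT or UPDATE both set the latest value
            final[key] = value

    records = []
    for key in sorted(set(initial) | set(final)):
        existed, exists = key in initial, key in final
        if exists and not existed:
            records.append(("INSERT", key, final[key]))
        elif existed and not exists:
            records.append(("DELETE", key, initial[key]))
        elif existed and exists and initial[key] != final[key]:
            records.append(("UPDATE_BEFORE", key, initial[key]))
            records.append(("UPDATE_AFTER", key, final[key]))
    return records


# A row inserted and deleted between the two offsets leaves no trace.
vanished = net_delta({}, [("INSERT", 1, "a"), ("DELETE", 1, None)])
updated = net_delta({1: "a"}, [("UPDATE", 1, "b")])
```

Comparing the two snapshots, rather than replaying every event, is what makes intermediate states invisible to the consumer.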

### The Two Pipe Ingestion Modes

| Mode | Trigger | Use cases | Cloud support |
|---|---|---|---|
| LIST_PURGE | Periodically scans the Volume directory | General purpose; any object storage | All |
| EVENT_NOTIFICATION | Triggered by cloud message-queue events | Low latency, near real time | Alibaba Cloud OSS and AWS S3 only |

---

## Regions and Connection Information

| Cloud provider | Region | Region code | API Endpoint |
|---|---|---|---|
| Alibaba Cloud | East China 2 (Shanghai) | cn-shanghai-alicloud | cn-shanghai-alicloud.api.clickzetta.com |
| Tencent Cloud | East China (Shanghai) | ap-shanghai-tencentcloud | ap-shanghai-tencentcloud.api.clickzetta.com |
| Tencent Cloud | North China (Beijing) | ap-beijing-tencentcloud | ap-beijing-tencentcloud.api.clickzetta.com |
| Tencent Cloud | South China (Guangzhou) | ap-guangzhou-tencentcloud | ap-guangzhou-tencentcloud.api.clickzetta.com |
| AWS | Beijing | cn-north-1-aws | cn-north-1-aws.api.clickzetta.com |

JDBC URL format: `jdbc:clickzetta://<instance_name>.<region_id>.api.clickzetta.com/`