@clickzetta/cz-cli-linux-x64 0.3.4 → 0.3.5

Files changed (118)
  1. package/bin/cz-cli +0 -0
  2. package/package.json +1 -1
  3. package/bin/skills/clickzetta-access-control/SKILL.md +0 -243
  4. package/bin/skills/clickzetta-access-control/references/dynamic-masking.md +0 -86
  5. package/bin/skills/clickzetta-access-control/references/grant-revoke.md +0 -103
  6. package/bin/skills/clickzetta-access-control/references/role-management.md +0 -66
  7. package/bin/skills/clickzetta-access-control/references/user-management.md +0 -61
  8. package/bin/skills/clickzetta-ai-vector-search/SKILL.md +0 -160
  9. package/bin/skills/clickzetta-ai-vector-search/references/vector-search.md +0 -155
  10. package/bin/skills/clickzetta-app-python-sdk/SKILL.md +0 -153
  11. package/bin/skills/clickzetta-app-python-sdk/references/bulkload.md +0 -196
  12. package/bin/skills/clickzetta-app-python-sdk/references/connector.md +0 -143
  13. package/bin/skills/clickzetta-app-python-sdk/references/realtime.md +0 -122
  14. package/bin/skills/clickzetta-batch-sync-pipeline/SKILL.md +0 -293
  15. package/bin/skills/clickzetta-bi-connect/SKILL.md +0 -176
  16. package/bin/skills/clickzetta-bi-connect/references/bi-tools.md +0 -170
  17. package/bin/skills/clickzetta-cdc-sync-pipeline/SKILL.md +0 -457
  18. package/bin/skills/clickzetta-concepts/SKILL.md +0 -282
  19. package/bin/skills/clickzetta-concepts/references/brands-and-endpoints.md +0 -79
  20. package/bin/skills/clickzetta-concepts/references/object-model.md +0 -311
  21. package/bin/skills/clickzetta-data-ingest-pipeline/SKILL.md +0 -165
  22. package/bin/skills/clickzetta-data-lifecycle/SKILL.md +0 -211
  23. package/bin/skills/clickzetta-data-lifecycle/references/lifecycle-reference.md +0 -175
  24. package/bin/skills/clickzetta-data-recovery/SKILL.md +0 -215
  25. package/bin/skills/clickzetta-data-recovery/evals/evals.json +0 -35
  26. package/bin/skills/clickzetta-data-science/SKILL.md +0 -125
  27. package/bin/skills/clickzetta-data-science/references/bitmap-profile.md +0 -146
  28. package/bin/skills/clickzetta-data-science/references/data-patterns.md +0 -110
  29. package/bin/skills/clickzetta-data-science/references/setup.md +0 -160
  30. package/bin/skills/clickzetta-data-science/references/stats-functions.md +0 -195
  31. package/bin/skills/clickzetta-data-science/references/write-and-infer.md +0 -122
  32. package/bin/skills/clickzetta-data-science/references/zettapark-api.md +0 -156
  33. package/bin/skills/clickzetta-data-sharing/SKILL.md +0 -160
  34. package/bin/skills/clickzetta-data-sharing/references/share-ddl.md +0 -134
  35. package/bin/skills/clickzetta-dba-guide/SKILL.md +0 -540
  36. package/bin/skills/clickzetta-dw-modeling/SKILL.md +0 -259
  37. package/bin/skills/clickzetta-dw-modeling/references/modeling-patterns.md +0 -100
  38. package/bin/skills/clickzetta-dynamic-table/SKILL.md +0 -112
  39. package/bin/skills/clickzetta-dynamic-table/best-practices/dimension-table-join-guide.md +0 -257
  40. package/bin/skills/clickzetta-dynamic-table/best-practices/medallion-and-stream-patterns.md +0 -124
  41. package/bin/skills/clickzetta-dynamic-table/best-practices/non-partitioned-merge-into-warning.md +0 -96
  42. package/bin/skills/clickzetta-dynamic-table/best-practices/performance-optimization.md +0 -109
  43. package/bin/skills/clickzetta-dynamic-table/dt-creator/SKILL.md +0 -15
  44. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/dt-declaration-strategy.md +0 -185
  45. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/incremental-config-reference.md +0 -429
  46. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/refresh-history-guide.md +0 -268
  47. package/bin/skills/clickzetta-dynamic-table/dt-creator/references/sql-limitations.md +0 -80
  48. package/bin/skills/clickzetta-dynamic-table/dynamic-table-alter/SKILL.md +0 -190
  49. package/bin/skills/clickzetta-external-catalog/SKILL.md +0 -120
  50. package/bin/skills/clickzetta-external-catalog/references/external-catalog-ddl.md +0 -130
  51. package/bin/skills/clickzetta-external-function/SKILL.md +0 -203
  52. package/bin/skills/clickzetta-external-function/references/external-function-ddl.md +0 -171
  53. package/bin/skills/clickzetta-file-import-pipeline/SKILL.md +0 -156
  54. package/bin/skills/clickzetta-index-manager/SKILL.md +0 -140
  55. package/bin/skills/clickzetta-index-manager/references/bloomfilter-index.md +0 -67
  56. package/bin/skills/clickzetta-index-manager/references/index-management.md +0 -73
  57. package/bin/skills/clickzetta-index-manager/references/inverted-index.md +0 -80
  58. package/bin/skills/clickzetta-index-manager/references/vector-index.md +0 -81
  59. package/bin/skills/clickzetta-information-schema/SKILL.md +0 -367
  60. package/bin/skills/clickzetta-information-schema/references/instance-views-reference.md +0 -276
  61. package/bin/skills/clickzetta-information-schema/references/metering-views-reference.md +0 -137
  62. package/bin/skills/clickzetta-information-schema/references/views-reference.md +0 -271
  63. package/bin/skills/clickzetta-java-sdk/SKILL.md +0 -186
  64. package/bin/skills/clickzetta-java-sdk/references/bulkload.md +0 -163
  65. package/bin/skills/clickzetta-java-sdk/references/realtime.md +0 -212
  66. package/bin/skills/clickzetta-kafka-ingest-pipeline/SKILL.md +0 -639
  67. package/bin/skills/clickzetta-kafka-ingest-pipeline/references/kafka-pipe-syntax.md +0 -324
  68. package/bin/skills/clickzetta-lakehouse-connect/SKILL.md +0 -218
  69. package/bin/skills/clickzetta-lakehouse-connect/evals/evals.json +0 -35
  70. package/bin/skills/clickzetta-lakehouse-connect/references/config-file.md +0 -435
  71. package/bin/skills/clickzetta-lakehouse-connect/references/jdbc.md +0 -478
  72. package/bin/skills/clickzetta-lakehouse-connect/references/python-sdk.md +0 -225
  73. package/bin/skills/clickzetta-lakehouse-connect/references/sqlalchemy.md +0 -468
  74. package/bin/skills/clickzetta-lakehouse-connect/references/zettapark-session.md +0 -445
  75. package/bin/skills/clickzetta-manage-comments/SKILL.md +0 -219
  76. package/bin/skills/clickzetta-metadata-query/SKILL.md +0 -298
  77. package/bin/skills/clickzetta-metadata-query/references/show-desc-reference.md +0 -326
  78. package/bin/skills/clickzetta-monitoring/SKILL.md +0 -199
  79. package/bin/skills/clickzetta-monitoring/references/job-history-analysis.md +0 -97
  80. package/bin/skills/clickzetta-monitoring/references/show-jobs.md +0 -48
  81. package/bin/skills/clickzetta-oss-ingest-pipeline/SKILL.md +0 -427
  82. package/bin/skills/clickzetta-query-optimizer/SKILL.md +0 -156
  83. package/bin/skills/clickzetta-query-optimizer/references/explain.md +0 -56
  84. package/bin/skills/clickzetta-query-optimizer/references/hints-and-sortkey.md +0 -78
  85. package/bin/skills/clickzetta-query-optimizer/references/optimize.md +0 -65
  86. package/bin/skills/clickzetta-query-optimizer/references/result-cache.md +0 -49
  87. package/bin/skills/clickzetta-query-optimizer/references/show-jobs.md +0 -42
  88. package/bin/skills/clickzetta-realtime-sync-pipeline/SKILL.md +0 -197
  89. package/bin/skills/clickzetta-semantic-view/SKILL.md +0 -207
  90. package/bin/skills/clickzetta-semantic-view/references/semantic-view-reference.md +0 -167
  91. package/bin/skills/clickzetta-spark-flink-connector/SKILL.md +0 -92
  92. package/bin/skills/clickzetta-spark-flink-connector/references/flink.md +0 -147
  93. package/bin/skills/clickzetta-spark-flink-connector/references/spark.md +0 -132
  94. package/bin/skills/clickzetta-sql-pipeline-manager/SKILL.md +0 -379
  95. package/bin/skills/clickzetta-sql-pipeline-manager/evals/evals.json +0 -166
  96. package/bin/skills/clickzetta-sql-pipeline-manager/references/dynamic-table.md +0 -185
  97. package/bin/skills/clickzetta-sql-pipeline-manager/references/materialized-view.md +0 -129
  98. package/bin/skills/clickzetta-sql-pipeline-manager/references/pipe.md +0 -222
  99. package/bin/skills/clickzetta-sql-pipeline-manager/references/table-stream.md +0 -125
  100. package/bin/skills/clickzetta-sql-syntax-guide/SKILL.md +0 -172
  101. package/bin/skills/clickzetta-sql-syntax-guide/references/ddl-reference.md +0 -350
  102. package/bin/skills/clickzetta-sql-syntax-guide/references/dml-reference.md +0 -279
  103. package/bin/skills/clickzetta-sql-syntax-guide/references/dql-reference.md +0 -504
  104. package/bin/skills/clickzetta-sql-syntax-guide/references/functions-reference.md +0 -372
  105. package/bin/skills/clickzetta-sql-syntax-guide/references/migration-databricks.md +0 -260
  106. package/bin/skills/clickzetta-sql-syntax-guide/references/migration-snowflake.md +0 -382
  107. package/bin/skills/clickzetta-sql-syntax-guide/references/vs-snowflake.md +0 -346
  108. package/bin/skills/clickzetta-sql-syntax-guide/references/vs-spark.md +0 -229
  109. package/bin/skills/clickzetta-studio-overview/SKILL.md +0 -170
  110. package/bin/skills/clickzetta-studio-overview/references/studio-modules.md +0 -173
  111. package/bin/skills/clickzetta-table-stream-pipeline/SKILL.md +0 -206
  112. package/bin/skills/clickzetta-vcluster-manager/SKILL.md +0 -212
  113. package/bin/skills/clickzetta-vcluster-manager/references/vc-cache.md +0 -54
  114. package/bin/skills/clickzetta-vcluster-manager/references/vcluster-ddl.md +0 -150
  115. package/bin/skills/clickzetta-volume-manager/SKILL.md +0 -292
  116. package/bin/skills/clickzetta-volume-manager/references/volume-ddl.md +0 -199
  117. package/bin/skills/clickzetta-zettapark/SKILL.md +0 -248
  118. package/bin/skills/clickzetta-zettapark/references/zettapark-api.md +0 -283
@@ -1,367 +0,0 @@
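- ---
- name: clickzetta-information-schema
- description: |
-   Query the ClickZetta Lakehouse INFORMATION_SCHEMA metadata views to retrieve table structures,
-   column information, job history, user privileges, Volumes, Connections, and other metadata.
-   Queries are supported at two levels: workspace level (the current workspace) and instance level
-   (all workspaces, requires INSTANCE ADMIN).
-   Cost analysis: JOB_HISTORY.CRU records compute consumption, TABLES.BYTES records storage usage,
-   and SYS.information_schema.WORKSPACES.WORKSPACE_STORAGE aggregates storage across workspaces,
-   which allows cost attribution and trend analysis by user, workspace, or time period.
-   Triggered when the user says "show table structure", "show column information", "show job history",
-   "show JOB history", "show slow queries", "show CRU consumption", "expense analysis", "cost analysis",
-   "compute cost", "storage cost", "usage statistics", "cost attribution",
-   "which user consumes the most", "storage usage ranking",
-   "list users", "show roles", "show privileges",
-   "list Volumes", "show Connections", "show materialized view refresh history",
-   "metadata query", "information_schema", "list all tables", "list Schemas",
-   "summarize storage usage", or "show deleted tables".
-   Keywords: information_schema, metadata, table info, column info, job history, system view
- ---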
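-
- # ClickZetta Lakehouse INFORMATION_SCHEMA Query Guide
-
- ## Overview
-
- INFORMATION_SCHEMA provides read-only access to Lakehouse metadata at two levels:
-
- | Level | Access path | Required privilege | Scope |
- |---|---|---|---|
- | Instance level | `SYS.information_schema.<view_name>` | INSTANCE ADMIN | Metadata for all workspaces |
- | Workspace level | `information_schema.<view_name>` | workspace_admin | Metadata for the current workspace |
-
- **Important limits:**
- - All views are read-only; they cannot be written to
- - Data lags by about 15 minutes
- - Prefer `SELECT <specific columns>` over `SELECT *`, so that changes to a view's structure do not break jobs
- - Workspace-level views show only objects that currently exist (no DELETE_TIME column); instance-level views include deleted objects, so filter with `WHERE delete_time IS NULL`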
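The limits above suggest a simple discipline when generating metadata queries: always name the columns, and add the deleted-object filter only for instance-level views. A minimal Python sketch follows. It assumes a hypothetical helper `build_metadata_query` (not part of cz-cli or any ClickZetta SDK) that only assembles SQL text and never connects to a Lakehouse.

```python
def build_metadata_query(view, columns, instance_level=False, has_delete_time=True):
    """Assemble a SELECT over an INFORMATION_SCHEMA view (hypothetical helper).

    Columns are always listed explicitly, never `SELECT *`.
    """
    if not columns:
        raise ValueError("list the columns explicitly instead of using SELECT *")
    prefix = "SYS.information_schema" if instance_level else "information_schema"
    sql = f"SELECT {', '.join(columns)} FROM {prefix}.{view}"
    # Only instance-level views carry deleted objects; workspace-level views
    # have no DELETE_TIME column, so the filter is skipped for them.
    if instance_level and has_delete_time:
        sql += " WHERE delete_time IS NULL"
    return sql


workspace_sql = build_metadata_query("tables", ["table_schema", "table_name"])
instance_sql = build_metadata_query("tables", ["table_name"], instance_level=True)
```

Some instance-level views do not expose DELETE_TIME, so the caller is expected to pass `has_delete_time=False` for those.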
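-
- ---
-
- ## Quick reference: available views
-
- ### Workspace-level views (`information_schema.*`)
-
- | View | Description |
- |---|---|
- | SCHEMAS | All Schemas in the current workspace |
- | TABLES | All tables in the current workspace (including views and materialized views) |
- | COLUMNS | Column information for all tables |
- | VIEWS | All view definitions |
- | USERS | Users in the workspace and their roles |
- | ROLES | Roles in the workspace and their members |
- | JOB_HISTORY | Job execution history (retained for 60 days; has a PT_DATE partition column) |
- | MATERIALIZED_VIEW_REFRESH_HISTORY | Materialized view refresh history (has a PT_DATE partition column) |
- | AUTOMV_REFRESH_HISTORY | Automatic materialized view refresh history (has a PT_DATE partition column) |
- | VOLUMES | Volume object information |
- | CONNECTIONS | Storage connection object information |
- | SORTKEY_CANDIDATES | Recommended sort columns (generated automatically by system analysis) |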
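-
- ### Instance-level views (`SYS.information_schema.*`)
-
- | View | Description |
- |---|---|
- | WORKSPACES | Information on all workspaces (including storage usage) |
- | SCHEMAS | Schemas in all workspaces (including deleted records) |
- | TABLES | Tables in all workspaces (including deleted records) |
- | COLUMNS | Columns in all workspaces (including deleted records) |
- | VIEWS | Views in all workspaces |
- | USERS | Users in all workspaces |
- | ROLES | Roles in all workspaces |
- | JOB_HISTORY | Job history for all workspaces |
- | MATERIALIZED_VIEW_REFRESH_HISTORY | Materialized view refresh history for all workspaces |
- | AUTOMV_REFRESH_HISTORY | Automatic materialized view refresh history for all workspaces |
- | VOLUMES | Volumes in all workspaces |
- | CONNECTIONS | Connection objects in all workspaces |
- | OBJECT_PRIVILEGES | Privilege grant records |
- | SORTKEY_CANDIDATES | Sort column recommendations for all workspaces |
- | **STORAGE_METERING** ⭐ | **Storage cost details (managed storage / multi-version storage / network transfer), per day and per workspace** |
- | **INSTANCE_USAGE** ⭐ | **Compute cost details (AP/GP clusters / task scheduling / data integration), per day and per workspace** |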
-
- ---
-
- ## Common query examples
-
- ### Inspect table structure
-
- ```sql
- -- List all managed tables in the current workspace
- SELECT table_schema, table_name, table_type, row_count, bytes, create_time
- FROM information_schema.tables
- WHERE table_type = 'MANAGED_TABLE'
- ORDER BY table_schema, table_name;
-
- -- Show the columns of one table
- SELECT column_name, data_type, is_nullable, is_primary_key, is_clustering_column, comment
- FROM information_schema.columns
- WHERE table_schema = 'my_schema'
-   AND table_name = 'my_table'
- ORDER BY column_name;
-
- -- Find tables that contain a given column name
- SELECT table_schema, table_name, column_name, data_type
- FROM information_schema.columns
- WHERE column_name ILIKE '%user_id%';
- ```
-
- ### Inspect Schema information
-
- ```sql
- -- List all Schemas
- SELECT schema_name, type, schema_creator, create_time, comment
- FROM information_schema.schemas
- ORDER BY create_time DESC;
-
- -- Find external Schemas
- SELECT schema_name, schema_creator, create_time
- FROM information_schema.schemas
- WHERE type = 'EXTERNAL';
- ```
-
- ### Inspect job history
-
- ```sql
- -- Jobs from roughly the last 24 hours (pt_date is a date, so this returns everything since yesterday)
- SELECT job_id, job_creator, status, execution_time, cru,
-        input_bytes, output_bytes, start_time
- FROM information_schema.job_history
- WHERE pt_date >= CAST(CURRENT_DATE - INTERVAL 1 DAY AS DATE)
- ORDER BY start_time DESC;
-
- -- Failed jobs (last 7 days)
- SELECT job_id, job_creator, job_text, error_message, start_time
- FROM information_schema.job_history
- WHERE status = 'FAILED'
-   AND pt_date >= CAST(CURRENT_DATE - INTERVAL 7 DAY AS DATE)
- ORDER BY start_time DESC;
-
- -- CRU consumption per user (last 30 days)
- -- Note: the success status value is 'SUCCEED' (not 'SUCCEEDED')
- -- Filtering on the pt_date partition column is recommended for better performance
- SELECT job_creator,
-        COUNT(*) AS job_count,
-        SUM(cru) AS total_cru,
-        AVG(execution_time) AS avg_exec_sec
- FROM information_schema.job_history
- WHERE pt_date >= CAST(CURRENT_DATE - INTERVAL 30 DAY AS DATE)
-   AND status = 'SUCCEED'
- GROUP BY job_creator
- ORDER BY total_cru DESC;
-
- -- Slow queries (longer than 60 seconds, last 7 days)
- SELECT job_id, job_creator, execution_time, input_bytes, job_text
- FROM information_schema.job_history
- WHERE execution_time > 60
-   AND pt_date >= CAST(CURRENT_DATE - INTERVAL 7 DAY AS DATE)
- ORDER BY execution_time DESC
- LIMIT 20;
- ```
-
- ### Inspect Volume information
-
- ```sql
- -- List all external Volumes
- SELECT volume_name, volume_type, volume_region, volume_creator,
-        connection_name, create_time
- FROM information_schema.volumes
- WHERE volume_type = 'EXTERNAL';
-
- -- Find the Volumes under a given Schema
- SELECT volume_name, volume_url, volume_type, volume_creator
- FROM information_schema.volumes
- WHERE volume_schema = 'my_schema';
- ```
-
- ### Inspect users and privileges
-
- ```sql
- -- List all users in the workspace and their roles
- SELECT user_name, role_names, email, create_time
- FROM information_schema.users
- ORDER BY create_time DESC;
-
- -- Show role members
- SELECT role_name, user_names
- FROM information_schema.roles
- ORDER BY role_name;
- ```
-
- ### Monitor materialized view refreshes
-
- ```sql
- -- Materialized views whose refresh failed in the last 7 days
- SELECT schema_name, materialized_view_name, status,
-        start_time, end_time, error_message
- FROM information_schema.materialized_view_refresh_history
- WHERE status = 'FAILED'
-   AND pt_date >= CAST(CURRENT_DATE - INTERVAL 7 DAY AS DATE)
- ORDER BY start_time DESC;
-
- -- Refresh duration statistics per materialized view
- SELECT materialized_view_name,
-        COUNT(*) AS refresh_count,
-        AVG(DATEDIFF('second', start_time, end_time)) AS avg_seconds,
-        SUM(cru) AS total_cru
- FROM information_schema.materialized_view_refresh_history
- WHERE status = 'SUCCEED'
- GROUP BY materialized_view_name
- ORDER BY avg_seconds DESC;
- ```
-
- ### Cost analysis (requires INSTANCE ADMIN)
-
- Cost analysis relies on two views that exist only at the instance level; **JOB_HISTORY.CRU cannot substitute for them**:
- - `STORAGE_METERING`: storage costs (managed storage / multi-version storage / network transfer), including actual amounts
- - `INSTANCE_USAGE`: compute costs (AP/GP clusters / task scheduling / data integration / streaming integration), including actual amounts
-
- ```sql
- -- Compute cost for the current month, by workspace (AP/GP/scheduling/integration)
- SELECT workspace_name,
-        sku_name,
-        ROUND(SUM(measurements_consumption), 2) AS total_cru,
-        ROUND(SUM(amount), 2) AS total_amount_yuan
- FROM SYS.information_schema.instance_usage
- WHERE measurement_start >= DATE_TRUNC('month', CURRENT_DATE)
-   AND sku_category = 'compute'
- GROUP BY workspace_name, sku_name
- ORDER BY total_amount_yuan DESC;
-
- -- Storage cost for the current month, by workspace
- SELECT workspace_name,
-        sku_name,
-        ROUND(SUM(measurements_consumption), 4) AS consumption,
-        measurements_unit,
-        ROUND(SUM(amount), 4) AS total_amount_yuan
- FROM SYS.information_schema.storage_metering
- WHERE measurement_start >= DATE_TRUNC('month', CURRENT_DATE)
- GROUP BY workspace_name, sku_name, measurements_unit
- ORDER BY workspace_name, total_amount_yuan DESC;
-
- -- Combined storage + compute cost summary (current month)
- SELECT cost_type, workspace_name,
-        ROUND(SUM(total_amount), 2) AS total_yuan
- FROM (
-     SELECT 'compute' AS cost_type, workspace_name, amount AS total_amount
-     FROM SYS.information_schema.instance_usage
-     WHERE measurement_start >= DATE_TRUNC('month', CURRENT_DATE)
-     UNION ALL
-     SELECT 'storage' AS cost_type, workspace_name, amount AS total_amount
-     FROM SYS.information_schema.storage_metering
-     WHERE measurement_start >= DATE_TRUNC('month', CURRENT_DATE)
- ) t
- GROUP BY cost_type, workspace_name
- ORDER BY cost_type, total_yuan DESC;
-
- -- Daily compute cost trend (last 30 days)
- SELECT DATE(measurement_start) AS dt,
-        sku_name,
-        ROUND(SUM(amount), 2) AS daily_amount_yuan
- FROM SYS.information_schema.instance_usage
- WHERE measurement_start >= CURRENT_DATE - INTERVAL 30 DAY
-   AND sku_category = 'compute'
- GROUP BY DATE(measurement_start), sku_name
- ORDER BY dt, daily_amount_yuan DESC;
- ```
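The combined storage + compute rollup is the least obvious of the queries above, so here is a small way to see what it returns. This is only a sketch: it runs the same `UNION ALL` shape against an in-memory SQLite database, the two tables are stand-ins that carry just the `workspace_name` and `amount` columns, the month filter is omitted, and every row is invented sample data rather than real billing output.

```python
import sqlite3

# In-memory SQLite stand-in; table and column names mirror the queries above,
# but the rows are made-up sample data, not real billing records.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE instance_usage (workspace_name TEXT, amount REAL);
    CREATE TABLE storage_metering (workspace_name TEXT, amount REAL);
    INSERT INTO instance_usage VALUES ('ws_a', 10.5), ('ws_a', 4.5), ('ws_b', 3.0);
    INSERT INTO storage_metering VALUES ('ws_a', 1.25), ('ws_b', 0.75);
""")

# Tag each source with a cost_type, stack the rows, then aggregate once.
rollup = conn.execute("""
    SELECT cost_type, workspace_name, ROUND(SUM(total_amount), 2) AS total_yuan
    FROM (
        SELECT 'compute' AS cost_type, workspace_name, amount AS total_amount
        FROM instance_usage
        UNION ALL
        SELECT 'storage' AS cost_type, workspace_name, amount AS total_amount
        FROM storage_metering
    ) t
    GROUP BY cost_type, workspace_name
    ORDER BY cost_type, total_yuan DESC
""").fetchall()

for cost_type, workspace, total in rollup:
    print(cost_type, workspace, total)
```

Each workspace appears once per cost type, which is why the outer query groups by both `cost_type` and `workspace_name`.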
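-
- **INSTANCE_USAGE SKU enumeration values (sku_category = 'compute'):**
-
- | sku_name | Description |
- |---|---|
- | AP类型计算集群 | AP-type compute cluster: analytical VCluster cost |
- | GP类型计算集群 | GP-type compute cluster: general-purpose VCluster cost |
- | 任务调度 | Task scheduling: Studio task scheduling cost |
- | 数据集成 | Data integration: offline/real-time sync task cost |
- | 流式集成 | Streaming integration: streaming data integration cost |
-
- **STORAGE_METERING SKU enumeration values:**
-
- | sku_category | sku_name | Description |
- |---|---|---|
- | storage | 托管存储容量 | Managed storage capacity: data storage for internal tables |
- | storage | 多版本未删除存储 | Multi-version undeleted storage: Time Travel historical version storage |
- | network | 数据查询Internet数据传输 | Internet data transfer for data queries: public network transfer cost |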
-
- ```sql
- -- CRU consumption per user (from JOB_HISTORY; no monetary amounts)
- SELECT job_creator,
-        COUNT(*) AS job_count,
-        ROUND(SUM(cru), 2) AS total_cru,
-        ROUND(AVG(execution_time), 1) AS avg_exec_sec
- FROM information_schema.job_history
- WHERE pt_date >= CAST(CURRENT_DATE - INTERVAL 30 DAY AS DATE)
-   AND status = 'SUCCEED'
- GROUP BY job_creator
- ORDER BY total_cru DESC;
-
- -- Daily CRU trend (from JOB_HISTORY)
- SELECT pt_date,
-        COUNT(*) AS job_count,
-        ROUND(SUM(cru), 2) AS daily_cru
- FROM information_schema.job_history
- WHERE pt_date >= CAST(CURRENT_DATE - INTERVAL 30 DAY AS DATE)
- GROUP BY pt_date
- ORDER BY pt_date;
-
- -- Storage usage ranking (current workspace, by table)
- SELECT table_schema, table_name,
-        ROUND(bytes / 1024.0 / 1024 / 1024, 3) AS size_gb,
-        row_count
- FROM information_schema.tables
- WHERE table_type = 'MANAGED_TABLE'
- ORDER BY bytes DESC
- LIMIT 20;
-
- -- Storage summary across workspaces (requires INSTANCE ADMIN)
- SELECT workspace_name,
-        ROUND(workspace_storage / 1024.0 / 1024 / 1024, 2) AS storage_gb
- FROM SYS.information_schema.workspaces
- WHERE delete_time IS NULL
- ORDER BY workspace_storage DESC;
- ```
-
- ### Instance-level queries (requires INSTANCE ADMIN)
-
- ```sql
- -- Storage usage for all workspaces
- SELECT workspace_name, workspace_creator,
-        ROUND(workspace_storage / 1024.0 / 1024 / 1024, 2) AS storage_gb,
-        create_time
- FROM SYS.information_schema.workspaces
- WHERE delete_time IS NULL
- ORDER BY workspace_storage DESC;
-
- -- Find large tables across workspaces (larger than 10 GB)
- SELECT table_catalog, table_schema, table_name,
-        row_count,
-        ROUND(bytes / 1024.0 / 1024 / 1024, 2) AS size_gb
- FROM SYS.information_schema.tables
- WHERE delete_time IS NULL
-   AND bytes > 10 * 1024 * 1024 * 1024
- ORDER BY bytes DESC;
-
- -- Show privilege grant records
- SELECT grantor, grantee, granted_to, object_type,
-        object_schema, object_name, privilege_type, authorization_time
- FROM SYS.information_schema.object_privileges
- WHERE grantee = 'some_user'
- ORDER BY authorization_time DESC;
- ```
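The storage queries divide byte counts by 1024 three times, so the values labelled GB are binary gigabytes (GiB). A quick Python sketch of the same arithmetic, handy for checking a threshold before putting it into a `WHERE` clause; `bytes_to_gib` and `LARGE_TABLE_BYTES` are illustrative names, not part of any ClickZetta tool.

```python
def bytes_to_gib(num_bytes, digits=2):
    """Convert a byte count to GiB, mirroring `bytes / 1024.0 / 1024 / 1024`."""
    return round(num_bytes / 1024.0 / 1024 / 1024, digits)


# The 10 GB cut-off from the large-table query, expressed in bytes.
LARGE_TABLE_BYTES = 10 * 1024 * 1024 * 1024

print(LARGE_TABLE_BYTES, bytes_to_gib(LARGE_TABLE_BYTES))
```

Python's `round` and the engine's `ROUND` may differ on exact ties, so treat this as a sanity check rather than a way to reproduce query output digit for digit.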
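-
- ---
-
- ## Related documents
-
- - [View column reference](references/views-reference.md)
- - [Instance-level view column reference](references/instance-views-reference.md)
- - [Cost view column reference](references/metering-views-reference.md)
-
- ---
-
- ## Notes
-
- 1. **ROW_COUNT / BYTES are estimates**: they may be inaccurate for PRIMARY KEY tables, tables with real-time writes, and after partition operations
- 2. **No consistency guarantee under concurrent DDL**: long-running queries may not see the most recently created objects
- 3. **JOB_HISTORY is retained for 60 days**: records older than 60 days are purged automatically
- 4. **Workspace-level views have no DELETE_TIME**: they show only objects that currently exist; instance-level views include deleted objects, so filter with `WHERE delete_time IS NULL`
- 5. **JOB_HISTORY has a PT_DATE partition column**: filter with `pt_date >= CAST(CURRENT_DATE - INTERVAL N DAY AS DATE)`, which performs better than filtering on `start_time`
- 6. **Mind the STATUS values**: the JOB_HISTORY success status is `'SUCCEED'` (not `'SUCCEEDED'`); a successful MV refresh is `'SUCCEED'` (not `'FINISHED'`)
- 7. **SYS.information_schema contains data for all workspaces**: without a `table_catalog` filter, a query returns results from every workspace (more rows, not 0 rows). To query only the current workspace, add `WHERE table_catalog = '<workspace>'`. The column name is `create_time` (not `created_time`)
- 8. **STORAGE_METERING / INSTANCE_USAGE are instance-level only**: they require the INSTANCE ADMIN privilege and are accessed via `SYS.information_schema.*`; they contain actual amount columns and are the authoritative source for cost analysis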
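Notes 5 and 6 are easy to get wrong when queries are written by hand. One possible guard is to generate the JOB_HISTORY predicate from a single place. The sketch below assumes a hypothetical helper `job_history_where` that returns SQL text only; the `pt_date` expression and the `'SUCCEED'` value are taken from the notes, everything else is illustrative.

```python
def job_history_where(days, status=None):
    """Build a WHERE clause for JOB_HISTORY (hypothetical helper).

    Filters on the PT_DATE partition column, using the expression from note 5.
    """
    if not isinstance(days, int) or days < 1:
        raise ValueError("days must be a positive integer")
    clause = f"WHERE pt_date >= CAST(CURRENT_DATE - INTERVAL {days} DAY AS DATE)"
    if status is not None:
        # Note 6: the success status is 'SUCCEED', so 'SUCCEEDED' is a likely typo.
        if status == "SUCCEEDED":
            raise ValueError("use 'SUCCEED', not 'SUCCEEDED'")
        clause += f" AND status = '{status}'"
    return clause


last_week = job_history_where(7)
last_month_ok = job_history_where(30, "SUCCEED")
```

The status value is interpolated directly into the string, so this is only suitable for fixed, trusted values such as `'SUCCEED'` or `'FAILED'`.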
@@ -1,276 +0,0 @@
- # Instance-Level INFORMATION_SCHEMA View Column Reference
-
- > Source: https://www.yunqi.tech/documents/instance-informaiton-schema
- > Verified against a live Lakehouse connection (cn-shanghai-alicloud, f8866243)
-
- Access path: `SYS.information_schema.<view_name>`
- Required privilege: INSTANCE ADMIN
-
- Instance-level views cover all workspaces and include deleted objects (filter with `DELETE_TIME IS NULL` to keep only existing objects).
-
- **Complete view list (excluding mysql-related views):**
- WORKSPACES · SCHEMAS · TABLES · COLUMNS · VIEWS · USERS · ROLES ·
- JOB_HISTORY · MATERIALIZED_VIEW_REFRESH_HISTORY · AUTOMV_REFRESH_HISTORY ·
- VOLUMES · CONNECTIONS · OBJECT_PRIVILEGES · SORTKEY_CANDIDATES ·
- **STORAGE_METERING** (storage costs) · **INSTANCE_USAGE** (compute costs)
-
- > For the columns of the cost views (STORAGE_METERING / INSTANCE_USAGE), see [metering-views-reference.md](metering-views-reference.md)
-
- ---
-
- ## WORKSPACES view
-
- | Column | Type | Description |
- |---|---|---|
- | WORKSPACE_ID | STRING | Workspace ID |
- | WORKSPACE_NAME | STRING | Workspace name |
- | WORKSPACE_CREATOR | STRING | Workspace owner |
- | WORKSPACE_CREATOR_ID | STRING | Owner account ID |
- | WORKSPACE_STORAGE | BIGINT | Storage usage (bytes; excludes external tables and external data lakes) |
- | CREATE_TIME | TIMESTAMP | Creation time |
- | LAST_MODIFY_TIME | TIMESTAMP | Last modification time |
- | COMMENT | STRING | Comment |
- | DELETE_TIME | TIMESTAMP | Deletion time (NULL means not deleted) |
- | PROPERTIES | MAP\<STRING,STRING> | Custom properties |
-
- ---
-
- ## SCHEMAS view
-
- | Column | Type | Description |
- |---|---|---|
- | CATALOG_NAME | STRING | Name of the owning WORKSPACE |
- | SCHEMA_ID | STRING | Schema ID |
- | SCHEMA_NAME | STRING | Schema name |
- | TYPE | STRING | EXTERNAL / MANAGED |
- | SCHEMA_CREATOR | STRING | Owner account name |
- | SCHEMA_CREATOR_ID | STRING | Owner account ID |
- | CREATE_TIME | TIMESTAMP | Creation time |
- | LAST_MODIFY_TIME | TIMESTAMP | Last modification time |
- | COMMENT | STRING | Comment |
- | DELETE_TIME | TIMESTAMP | Deletion time (NULL means not deleted) |
- | PROPERTIES | MAP\<STRING,STRING> | Custom properties |
-
- ---
-
- ## TABLES view
-
- | Column | Type | Description |
- |---|---|---|
- | TABLE_CATALOG | STRING | Name of the owning WORKSPACE |
- | TABLE_CATALOG_ID | STRING | WORKSPACE ID |
- | TABLE_SCHEMA | STRING | Owning Schema |
- | TABLE_SCHEMA_ID | STRING | Schema ID |
- | TABLE_NAME | STRING | Table name |
- | TABLE_ID | STRING | Table ID |
- | TABLE_CREATOR | STRING | Table owner |
- | TABLE_CREATOR_ID | STRING | Table creator ID |
- | TABLE_TYPE | STRING | EXTERNAL TABLE / VIRTUAL_VIEW / MATERIALIZED VIEW / MANAGED_TABLE |
- | ROW_COUNT | BIGINT | Row count (estimate) |
- | BYTES | BIGINT | Storage size (estimate) |
- | CREATE_TIME | TIMESTAMP | Creation time |
- | LAST_MODIFY_TIME | TIMESTAMP | Last modification time |
- | DATA_LIFECYCLE | BIGINT | Lifecycle in days; NULL means permanent |
- | IS_PARTITIONED | BOOLEAN | Whether the table is partitioned |
- | IS_CLUSTERED | BOOLEAN | Whether the table is bucketed |
- | COMMENT | STRING | Table comment |
- | DELETE_TIME | TIMESTAMP | Deletion time (NULL means not deleted) |
- | PROPERTIES | MAP\<STRING,STRING> | Custom properties |
-
- ---
-
- ## COLUMNS view
-
- | Column | Type | Description |
- |---|---|---|
- | TABLE_CATALOG | STRING | Name of the owning WORKSPACE |
- | TABLE_CATALOG_ID | STRING | WORKSPACE ID |
- | TABLE_SCHEMA | STRING | Owning Schema |
- | TABLE_SCHEMA_ID | STRING | Schema ID |
- | TABLE_NAME | STRING | Table name |
- | TABLE_ID | STRING | Table ID |
- | COLUMN_NAME | STRING | Column name |
- | COLUMN_ID | STRING | Column ID |
- | COLUMN_DEFAULT | STRING | Column default value (reserved) |
- | IS_NULLABLE | BOOLEAN | Whether the column can be NULL |
- | DATA_TYPE | STRING | Column data type |
- | IS_PARTITIONING_COLUMN | BOOLEAN | Whether the column is a partition column |
- | IS_CLUSTERING_COLUMN | BOOLEAN | Whether the column is a CLUSTER column |
- | IS_PRIMARY_KEY | BOOLEAN | Whether the column is part of the primary key |
- | COMMENT | STRING | Column comment |
- | DELETE_TIME | TIMESTAMP | Deletion time (NULL means not deleted) |
-
- ---
-
- ## VIEWS view
-
- | Column | Type | Description |
- |---|---|---|
- | TABLE_CATALOG | STRING | Name of the owning WORKSPACE |
- | TABLE_CATALOG_ID | STRING | WORKSPACE ID |
- | TABLE_SCHEMA | STRING | Owning Schema |
- | TABLE_SCHEMA_ID | STRING | Schema ID |
- | TABLE_NAME | STRING | View name |
- | TABLE_ID | STRING | View ID |
- | TABLE_CREATOR | STRING | View owner account name |
- | TABLE_CREATOR_ID | STRING | View owner account ID |
- | VIEW_DEFINITION | STRING | SQL statement that created the view |
- | CREATE_TIME | TIMESTAMP | Creation time |
- | LAST_MODIFY_TIME | TIMESTAMP | Last modification time |
- | COMMENT | STRING | View comment |
- | DELETE_TIME | TIMESTAMP | Deletion time (NULL means not deleted) |
-
- ---
-
- ## USERS view
-
- | Column | Type | Description |
- |---|---|---|
- | WORKSPACE_NAME | STRING | Workspace the user belongs to |
- | WORKSPACE_ID | STRING | Workspace ID |
- | USER_ID | STRING | System-generated user ID |
- | USER_NAME | STRING | User name (concatenation of WORKSPACE_NAME and USER_NAME) |
- | ROLE_NAME | STRING | Roles held (comma-separated) |
- | ADD_TIME | TIMESTAMP | User creation time |
- | EMAIL | STRING | User email |
- | TELEPHONE | STRING | User phone number |
- | LAST_SUCCESS_LOGIN | TIMESTAMP | Time of the last successful login |
- | COMMENT | STRING | Description |
- | DELETE_TIME | TIMESTAMP | Deletion time (NULL means not deleted) |
- | PROPERTIES | MAP\<STRING,STRING> | Custom properties |
-
- ---
-
- ## ROLES view
-
- | Column | Type | Description |
- |---|---|---|
- | WORKSPACE_NAME | STRING | Workspace the role belongs to |
- | WORKSPACE_ID | STRING | Workspace ID |
- | ROLE_NAME | STRING | Role name |
- | ROLE_ID | STRING | Role ID |
- | USER_NAME | STRING | Users granted this role (comma-separated) |
- | USER_ID | STRING | IDs of the users granted this role |
- | COMMENT | STRING | Description |
- | DELETE_TIME | TIMESTAMP | Deletion time (NULL means not deleted) |
-
- ---
-
- ## JOB_HISTORY view
-
- | Column | Type | Description |
- |---|---|---|
- | WORKSPACE_NAME | STRING | Workspace the job ran in |
- | WORKSPACE_ID | STRING | Workspace ID |
- | JOB_ID | STRING | Job ID |
- | JOB_NAME | STRING | Job name |
- | JOB_CREATOR_ID | STRING | ID of the user who ran the job |
- | JOB_CREATOR | STRING | User who ran the job |
- | STATUS | STRING | SETUP / RESUMING_CLUSTER / QUEUED / RUNNING / SUCCESS / FAILED / CANCELED |
- | CRU | DECIMAL(38,5) | Compute resources consumed |
- | ERROR_MESSAGE | STRING | Error message |
- | JOB_TYPE | STRING | Job type: SQL |
- | JOB_TEXT | STRING | SQL statement executed |
- | START_TIME | TIMESTAMP | Start time |
- | END_TIME | TIMESTAMP | End time |
- | EXECUTION_TIME | DOUBLE | Execution time (seconds) |
- | INPUT_BYTES | BIGINT | Amount of data actually scanned |
- | CACHE_HIT | BIGINT | Amount of data read from cache |
- | OUTPUT_BYTES | BIGINT | Output bytes |
- | INPUT_OBJECTS | STRING | Input table names (format: [SCHEMA].[TABLE]; multiple entries separated by commas) |
- | OUTPUT_OBJECTS | STRING | Output table names |
- | CLIENT_INFO | STRING | Client information (JDBC/SDK/Web/Java SDK) |
- | VIRTUAL_CLUSTER | STRING | Compute cluster used |
- | VIRTUAL_CLUSTER_ID | BIGINT | Compute cluster ID |
- | ROWS_PRODUCED | BIGINT | Total number of records processed |
- | ROWS_INSERTED | BIGINT | Rows inserted |
- | ROWS_UPDATED | BIGINT | Rows updated |
- | ROWS_DELETED | BIGINT | Rows deleted |
- | JOB_CONFIG | STRING | Parameters supplied at submission |
- | JOB_PRIORITY | STRING | Job priority |
- | INPUT_TABLES | STRING | Input tables (JSON array) |
- | OUTPUT_TABLES | STRING | Output object names |
- | QUERY_TAG | STRING | Tag set by the user |
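INPUT_OBJECTS is described above as a comma-separated list of `[SCHEMA].[TABLE]` entries, which usually needs splitting before any lineage-style analysis. A rough Python sketch, assuming each entry is a plain `schema.table` string without the literal brackets; the exact encoding is not confirmed by the table, and `split_objects` is a hypothetical helper.

```python
def split_objects(input_objects):
    """Split a comma-separated INPUT_OBJECTS value into (schema, table) pairs.

    Assumes each entry looks like `schema.table`; entries without a dot are
    returned with schema set to None.
    """
    pairs = []
    for entry in (input_objects or "").split(","):
        entry = entry.strip()
        if not entry:
            continue
        # Split on the last dot so that a dotted schema prefix stays together.
        schema, dot, table = entry.rpartition(".")
        pairs.append((schema if dot else None, table))
    return pairs


pairs = split_objects("sales.orders, ods.users")
```

INPUT_TABLES carries the same information as a JSON array, so parsing that column with a JSON parser is probably the more robust route where it is populated.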
-
- ---
-
- ## MATERIALIZED_VIEW_REFRESH_HISTORY view
-
- | Column | Type | Description |
- |---|---|---|
- | WORKSPACE_ID | BIGINT | Workspace ID |
- | WORKSPACE_NAME | STRING | Workspace name |
- | SCHEMA_ID | BIGINT | Schema ID |
- | SCHEMA_NAME | STRING | Schema name |
- | MATERIALIZED_VIEW_ID | BIGINT | Materialized view ID |
- | MATERIALIZED_VIEW_NAME | STRING | Materialized view name |
- | CREDITS_USED | DECIMAL | Credits consumed by the refresh |
- | VIRTUAL_CLUSTER_ID | BIGINT | Virtual cluster ID |
- | VIRTUAL_CLUSTER | STRING | Virtual cluster name (populated for automatic refreshes) |
- | STATUS | STRING | PENDING / RUNNING / FINISHED / FAILED |
- | REFRESH_MODE | STRING | INCREMENTAL / FULL_REFRESH / NO_DATA |
- | STATISTICS | STRING | Number of records in the incremental refresh |
- | SCHEDULE_START_TIME | TIMESTAMP_LTZ | Scheduled refresh time |
- | START_TIME | TIMESTAMP_LTZ | Actual start time |
- | END_TIME | TIMESTAMP_LTZ | End time |
- | ERROR_MESSAGE | STRING | Refresh failure message |
-
- ---
-
- ## VOLUMES view
-
- | Column | Type | Description |
- |---|---|---|
- | VOLUME_CATALOG | STRING | Name of the owning Workspace |
- | VOLUME_CATALOG_ID | STRING | ID of the owning Workspace |
- | VOLUME_SCHEMA | STRING | Name of the owning Schema |
- | VOLUME_SCHEMA_ID | STRING | Schema ID |
- | VOLUME_NAME | STRING | Volume name |
- | VOLUME_ID | STRING | Volume ID |
- | VOLUME_URL | STRING | URL bound to the Volume |
- | VOLUME_REGION | STRING | Region of the Volume |
- | VOLUME_TYPE | STRING | INTERNAL / EXTERNAL |
- | VOLUME_CREATOR | STRING | Volume owner |
- | CONNECTION_NAME | STRING | Name of the referenced Connection |
- | CONNECTION_ID | STRING | ID of the referenced Connection |
- | PROPERTIES | MAP\<STRING,STRING> | Reserved field |
- | COMMENT | STRING | Comment |
- | CREATE_TIME | TIMESTAMP | Creation time |
- | LAST_MODIFY_TIME | TIMESTAMP | Last modification time |
-
- ---
-
- ## CONNECTIONS view
-
- | Column | Type | Description |
- |---|---|---|
- | WORKSPACE_NAME | STRING | Workspace the connection belongs to |
- | WORKSPACE_ID | STRING | Workspace ID |
- | CONNECTION_NAME | STRING | Connection object name |
- | CONNECTION_ID | STRING | Connection ID |
- | CONNECTION_KIND | STRING | STORAGE CONNECTION / API CONNECTION |
- | TYPE | STRING | FILE_SYSTEM / CLOUD_FUNCTION |
- | PROVIDER | STRING | For FILE_SYSTEM: OSS / COS; for CLOUD_FUNCTION: aliyun / tencent |
- | REGION | STRING | Region of the connection (e.g. ap-shanghai / cn-beijing) |
- | SOURCE_CREATOR | STRING | Creator |
- | CREATED_TIME | TIMESTAMP | Creation time |
- | COMMENT | STRING | Comment |
- | PROPERTIES | MAP\<STRING,STRING> | Reserved field |
-
- ---
-
- ## OBJECT_PRIVILEGES view
-
- | Column | Type | Description |
- |---|---|---|
- | GRANTOR | TEXT | User who granted the privilege |
- | GRANTEE | TEXT | user_name or role_name that received the privilege |
- | GRANTED_TO | TEXT | USER / ROLE |
- | OBJECT_CATALOG | TEXT | Name of the workspace or catalog that contains the granted object |
- | OBJECT_SCHEMA | TEXT | Schema that contains the granted object (empty if the object is not under a Schema) |
- | OBJECT_NAME | TEXT | Name of the object the privilege applies to |
- | OBJECT_TYPE | TEXT | Type of the object the privilege applies to |
- | SUB_OBJECT_TYPE | TEXT | Sub-object type |
- | PRIVILEGE_TYPE | TEXT | Specific privilege granted |
- | IS_GRANTABLE | TEXT | Whether the grant included WITH GRANT OPTION |
- | AUTHORIZATION_TIME | TIMESTAMP_LTZ | Time the privilege was granted |