@clickzetta/cz-cli-darwin-x64 0.3.94 → 0.3.96

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (1893) hide show
  1. package/bin/cz-cli +0 -0
  2. package/bin/skills/clickzetta-batch-sync-pipeline/SKILL.md +1 -30
  3. package/bin/skills/clickzetta-cdc-sync-pipeline/SKILL.md +100 -534
  4. package/bin/skills/clickzetta-cdc-sync-pipeline/references/troubleshooting.md +155 -0
  5. package/bin/skills/clickzetta-data-ingest-pipeline/SKILL.md +5 -5
  6. package/bin/skills/clickzetta-data-science/SKILL.md +82 -84
  7. package/bin/skills/clickzetta-dbt-modeling/SKILL.md +347 -0
  8. package/bin/skills/clickzetta-dbt-modeling/references/grant-patterns.md +69 -0
  9. package/bin/skills/clickzetta-dbt-modeling/references/incremental-patterns.md +165 -0
  10. package/bin/skills/clickzetta-dbt-modeling/references/materialization-guide.md +324 -0
  11. package/bin/skills/clickzetta-dbt-modeling/references/test-strategy.md +80 -0
  12. package/bin/skills/clickzetta-dbt-project-setup/SKILL.md +176 -0
  13. package/bin/skills/clickzetta-dbt-project-setup/references/dbt-clickzetta-adapter.md +101 -0
  14. package/bin/skills/clickzetta-dbt-project-setup/references/elt-standards.md +220 -0
  15. package/bin/skills/clickzetta-dbt-studio-pipeline/SKILL.md +206 -0
  16. package/bin/skills/clickzetta-dbt-studio-pipeline/references/parameter-guide.md +118 -0
  17. package/bin/skills/clickzetta-dbt-studio-pipeline/references/studio-task-sop.md +115 -0
  18. package/bin/skills/clickzetta-dw-modeling/SKILL.md +130 -261
  19. package/bin/skills/clickzetta-dw-modeling/references/modeling-patterns.md +159 -67
  20. package/bin/skills/clickzetta-dynamic-table/SKILL.md +15 -51
  21. package/bin/skills/clickzetta-dynamic-table/dt-creator/SKILL.md +3 -5
  22. package/bin/skills/clickzetta-dynamic-table/sql-to-dt/SKILL.md +7 -1
  23. package/bin/skills/clickzetta-file-import-pipeline/SKILL.md +81 -106
  24. package/bin/skills/clickzetta-kafka-ingest-pipeline/SKILL.md +56 -327
  25. package/bin/skills/clickzetta-kafka-ingest-pipeline/references/operations.md +76 -0
  26. package/bin/skills/clickzetta-oss-ingest-pipeline/SKILL.md +1 -30
  27. package/bin/skills/clickzetta-overview/SKILL.md +54 -62
  28. package/bin/skills/clickzetta-overview/references/brands-and-endpoints.md +33 -35
  29. package/bin/skills/clickzetta-overview/references/object-model.md +167 -168
  30. package/bin/skills/clickzetta-overview/references/studio-modules.md +91 -91
  31. package/bin/skills/clickzetta-pipeline-review/SKILL.md +125 -210
  32. package/bin/skills/clickzetta-realtime-sync-pipeline/SKILL.md +1 -22
  33. package/bin/skills/clickzetta-spark-flink-connector/SKILL.md +28 -27
  34. package/bin/skills/clickzetta-spark-flink-connector/references/flink.md +356 -49
  35. package/bin/skills/clickzetta-sql-migration/SKILL.md +10 -20
  36. package/bin/skills/clickzetta-sql-pipeline-manager/SKILL.md +79 -419
  37. package/bin/skills/clickzetta-sql-pipeline-manager/references/dynamic-table.md +45 -45
  38. package/bin/skills/clickzetta-sql-pipeline-manager/references/materialized-view.md +25 -25
  39. package/bin/skills/clickzetta-sql-pipeline-manager/references/pipe.md +48 -48
  40. package/bin/skills/clickzetta-sql-pipeline-manager/references/scenarios.md +141 -0
  41. package/bin/skills/clickzetta-sql-pipeline-manager/references/table-stream.md +30 -30
  42. package/bin/skills/clickzetta-sql-pipeline-manager/references/troubleshooting.md +57 -0
  43. package/bin/skills/clickzetta-studio-task-manager/SKILL.md +83 -576
  44. package/bin/skills/clickzetta-studio-task-manager/references/engineering-sop.md +123 -0
  45. package/bin/skills/clickzetta-studio-task-manager/references/troubleshooting.md +63 -0
  46. package/bin/skills/clickzetta-table-lineage/SKILL.md +4 -3
  47. package/bin/skills/clickzetta-table-stream-pipeline/SKILL.md +116 -114
  48. package/bin/skills/clickzetta-zettapark/SKILL.md +36 -203
  49. package/bin/skills/clickzetta-zettapark/references/zettapark-api.md +143 -64
  50. package/bin/skills/cz-cli/SKILL.md +16 -16
  51. package/bin/skills/cz-cli/references/profile-setup.md +8 -8
  52. package/bin/skills/lakehouse-doc-en/SKILL.md +130 -0
  53. package/bin/skills/lakehouse-doc-en/SKILL.md.template +49 -0
  54. package/bin/skills/lakehouse-doc-en/references/AI Gateway Guide.md +1 -0
  55. package/bin/skills/lakehouse-doc-en/references/AIGateway.md +27 -0
  56. package/bin/skills/lakehouse-doc-en/references/AI_COMPLETE.md +239 -0
  57. package/bin/skills/lakehouse-doc-en/references/AI_EMBEDDING.md +192 -0
  58. package/bin/skills/lakehouse-doc-en/references/AI_Gateway.md +253 -0
  59. package/bin/skills/lakehouse-doc-en/references/AI_Gateway_Guide.md +1 -0
  60. package/bin/skills/lakehouse-doc-en/references/AI_eco.md +34 -0
  61. package/bin/skills/lakehouse-doc-en/references/AI_function_in_SQL.md +355 -0
  62. package/bin/skills/lakehouse-doc-en/references/AI_function_overview.md +25 -0
  63. package/bin/skills/lakehouse-doc-en/references/AI_ready_data_overview.md +82 -0
  64. package/bin/skills/lakehouse-doc-en/references/ALTER-EXTERNAL-TABLE.md +120 -0
  65. package/bin/skills/lakehouse-doc-en/references/ALTER-SCHEMA.md +86 -0
  66. package/bin/skills/lakehouse-doc-en/references/ALTER-TABLE-COLUMN.md +223 -0
  67. package/bin/skills/lakehouse-doc-en/references/ALTERTABLE.md +73 -0
  68. package/bin/skills/lakehouse-doc-en/references/APIKEYManage.md +1 -0
  69. package/bin/skills/lakehouse-doc-en/references/ARRAY.md +261 -0
  70. package/bin/skills/lakehouse-doc-en/references/Analysis.md +149 -0
  71. package/bin/skills/lakehouse-doc-en/references/AnalyticsModernDataStack.md +386 -0
  72. package/bin/skills/lakehouse-doc-en/references/Application_list.md +26 -0
  73. package/bin/skills/lakehouse-doc-en/references/Approval.md +1 -0
  74. package/bin/skills/lakehouse-doc-en/references/Approval_list.md +61 -0
  75. package/bin/skills/lakehouse-doc-en/references/BIGINT.md +68 -0
  76. package/bin/skills/lakehouse-doc-en/references/BINARY.md +113 -0
  77. package/bin/skills/lakehouse-doc-en/references/BOOLEAN.md +106 -0
  78. package/bin/skills/lakehouse-doc-en/references/BP_AI_Function_Image2text.md +203 -0
  79. package/bin/skills/lakehouse-doc-en/references/BestPracticesforUsingDynamicTables.md +336 -0
  80. package/bin/skills/lakehouse-doc-en/references/Billing.md +63 -0
  81. package/bin/skills/lakehouse-doc-en/references/BluepipeOracleLakehouse_DataSync.md +261 -0
  82. package/bin/skills/lakehouse-doc-en/references/CHAR.md +37 -0
  83. package/bin/skills/lakehouse-doc-en/references/CLAUDE.md +606 -0
  84. package/bin/skills/lakehouse-doc-en/references/CONNECTION.md +26 -0
  85. package/bin/skills/lakehouse-doc-en/references/COPY-INTO-Location.md +372 -0
  86. package/bin/skills/lakehouse-doc-en/references/COPY_INTO_Location.md +369 -0
  87. package/bin/skills/lakehouse-doc-en/references/CREAREUSER.md +38 -0
  88. package/bin/skills/lakehouse-doc-en/references/CREATE-BLOOMFILTER-INDEX.md +111 -0
  89. package/bin/skills/lakehouse-doc-en/references/CREATE-CONNECTION.md +56 -0
  90. package/bin/skills/lakehouse-doc-en/references/CREATECONNECTION.md +11 -0
  91. package/bin/skills/lakehouse-doc-en/references/CREATEEXTERNAlLSCHEMA.md +251 -0
  92. package/bin/skills/lakehouse-doc-en/references/CREATEMATERIALIZEDVIEW.md +489 -0
  93. package/bin/skills/lakehouse-doc-en/references/CREATEROLE.md +62 -0
  94. package/bin/skills/lakehouse-doc-en/references/CREATESCHEMA.md +33 -0
  95. package/bin/skills/lakehouse-doc-en/references/CREATEVIEW.md +55 -0
  96. package/bin/skills/lakehouse-doc-en/references/CREATE_EXTERNAL_FUNCTION.md +232 -0
  97. package/bin/skills/lakehouse-doc-en/references/CTERevenueCohort.md +275 -0
  98. package/bin/skills/lakehouse-doc-en/references/ClickZettaInsight.md +84 -0
  99. package/bin/skills/lakehouse-doc-en/references/Computation.md +6 -0
  100. package/bin/skills/lakehouse-doc-en/references/ComputeResourceDDL.md +1 -0
  101. package/bin/skills/lakehouse-doc-en/references/Create_Embeding_Function.md +236 -0
  102. package/bin/skills/lakehouse-doc-en/references/Create_LLM_Function.md +242 -0
  103. package/bin/skills/lakehouse-doc-en/references/CreatingAWSCloudVPCEndpoint.md +69 -0
  104. package/bin/skills/lakehouse-doc-en/references/CreatingAWSCloudVPCPrivatelinkService.md +108 -0
  105. package/bin/skills/lakehouse-doc-en/references/CreditScoringwithZettaparkandPythonMLlibraryNew.md +853 -0
  106. package/bin/skills/lakehouse-doc-en/references/DATE.md +84 -0
  107. package/bin/skills/lakehouse-doc-en/references/DDL.md +1 -0
  108. package/bin/skills/lakehouse-doc-en/references/DECIMAL.md +196 -0
  109. package/bin/skills/lakehouse-doc-en/references/DELETE.md +113 -0
  110. package/bin/skills/lakehouse-doc-en/references/DESC-INDEX.md +37 -0
  111. package/bin/skills/lakehouse-doc-en/references/DESC-JOB.md +92 -0
  112. package/bin/skills/lakehouse-doc-en/references/DESCCONNECTION.md +35 -0
  113. package/bin/skills/lakehouse-doc-en/references/DESCMATERIALIZEDVIEW.md +28 -0
  114. package/bin/skills/lakehouse-doc-en/references/DESCSCHEMAS.md +47 -0
  115. package/bin/skills/lakehouse-doc-en/references/DESCTABLE.md +98 -0
  116. package/bin/skills/lakehouse-doc-en/references/DESCVIEW.md +61 -0
  117. package/bin/skills/lakehouse-doc-en/references/DOUBLE.md +104 -0
  118. package/bin/skills/lakehouse-doc-en/references/DQL.md +93 -0
  119. package/bin/skills/lakehouse-doc-en/references/DROP-INDEX.md +26 -0
  120. package/bin/skills/lakehouse-doc-en/references/DROPCONNECTION.md +27 -0
  121. package/bin/skills/lakehouse-doc-en/references/DROPF_UNCTIOn.md +32 -0
  122. package/bin/skills/lakehouse-doc-en/references/DROPMATERIALIZEDVIEW.md +38 -0
  123. package/bin/skills/lakehouse-doc-en/references/DROPROLE.md +44 -0
  124. package/bin/skills/lakehouse-doc-en/references/DROPSCHEMA.md +31 -0
  125. package/bin/skills/lakehouse-doc-en/references/DROPTABLE.md +40 -0
  126. package/bin/skills/lakehouse-doc-en/references/DROPUSER.md +28 -0
  127. package/bin/skills/lakehouse-doc-en/references/DROPVIEW.md +26 -0
  128. package/bin/skills/lakehouse-doc-en/references/DataGPT_release.md +1 -0
  129. package/bin/skills/lakehouse-doc-en/references/DataQuality.md +99 -0
  130. package/bin/skills/lakehouse-doc-en/references/DataSourceConfigGuide.md +1 -0
  131. package/bin/skills/lakehouse-doc-en/references/DataSource_ADBMySQL.md +35 -0
  132. package/bin/skills/lakehouse-doc-en/references/DataSource_ADB_PostgreSQL.md +36 -0
  133. package/bin/skills/lakehouse-doc-en/references/DataSource_AMQP.md +37 -0
  134. package/bin/skills/lakehouse-doc-en/references/DataSource_Amazon_DocumentDB.md +113 -0
  135. package/bin/skills/lakehouse-doc-en/references/DataSource_Amazon_OpenSearch.md +41 -0
  136. package/bin/skills/lakehouse-doc-en/references/DataSource_Aurora_MySQL.md +35 -0
  137. package/bin/skills/lakehouse-doc-en/references/DataSource_Aurora_PostgreSQL.md +36 -0
  138. package/bin/skills/lakehouse-doc-en/references/DataSource_AutoMQ.md +124 -0
  139. package/bin/skills/lakehouse-doc-en/references/DataSource_COS.md +31 -0
  140. package/bin/skills/lakehouse-doc-en/references/DataSource_ClickHouse.md +36 -0
  141. package/bin/skills/lakehouse-doc-en/references/DataSource_DB2.md +36 -0
  142. package/bin/skills/lakehouse-doc-en/references/DataSource_DM.md +35 -0
  143. package/bin/skills/lakehouse-doc-en/references/DataSource_Databricks.md +38 -0
  144. package/bin/skills/lakehouse-doc-en/references/DataSource_Doris.md +34 -0
  145. package/bin/skills/lakehouse-doc-en/references/DataSource_DynamoDB.md +37 -0
  146. package/bin/skills/lakehouse-doc-en/references/DataSource_ElasticSearch.md +30 -0
  147. package/bin/skills/lakehouse-doc-en/references/DataSource_Greenplum.md +36 -0
  148. package/bin/skills/lakehouse-doc-en/references/DataSource_HANA.md +53 -0
  149. package/bin/skills/lakehouse-doc-en/references/DataSource_HBase.md +29 -0
  150. package/bin/skills/lakehouse-doc-en/references/DataSource_Hive.md +50 -0
  151. package/bin/skills/lakehouse-doc-en/references/DataSource_Hologres.md +36 -0
  152. package/bin/skills/lakehouse-doc-en/references/DataSource_Kafka.md +32 -0
  153. package/bin/skills/lakehouse-doc-en/references/DataSource_MariaDB.md +36 -0
  154. package/bin/skills/lakehouse-doc-en/references/DataSource_MaxCompute.md +32 -0
  155. package/bin/skills/lakehouse-doc-en/references/DataSource_MongoDB.md +36 -0
  156. package/bin/skills/lakehouse-doc-en/references/DataSource_MySQL.md +35 -0
  157. package/bin/skills/lakehouse-doc-en/references/DataSource_OSS.md +31 -0
  158. package/bin/skills/lakehouse-doc-en/references/DataSource_Oracle.md +35 -0
  159. package/bin/skills/lakehouse-doc-en/references/DataSource_PolarDB.md +37 -0
  160. package/bin/skills/lakehouse-doc-en/references/DataSource_PorarDB.md +37 -0
  161. package/bin/skills/lakehouse-doc-en/references/DataSource_PostgreSQL.md +36 -0
  162. package/bin/skills/lakehouse-doc-en/references/DataSource_Redis.md +55 -0
  163. package/bin/skills/lakehouse-doc-en/references/DataSource_Redshift.md +49 -0
  164. package/bin/skills/lakehouse-doc-en/references/DataSource_RestApi.md +30 -0
  165. package/bin/skills/lakehouse-doc-en/references/DataSource_S3.md +25 -0
  166. package/bin/skills/lakehouse-doc-en/references/DataSource_SLS.md +31 -0
  167. package/bin/skills/lakehouse-doc-en/references/DataSource_StarRocks.md +35 -0
  168. package/bin/skills/lakehouse-doc-en/references/DataSource_TiDB.md +36 -0
  169. package/bin/skills/lakehouse-doc-en/references/DataSupplementationTask.md +51 -0
  170. package/bin/skills/lakehouse-doc-en/references/Data_Catalog_Management.md +1 -0
  171. package/bin/skills/lakehouse-doc-en/references/Data_Encryption.md +81 -0
  172. package/bin/skills/lakehouse-doc-en/references/Datalake_StorageConnection.md +12 -0
  173. package/bin/skills/lakehouse-doc-en/references/Datasource_SQLServer.md +36 -0
  174. package/bin/skills/lakehouse-doc-en/references/Datus_Lakehouse_Integrated_Guide.md +1 -0
  175. package/bin/skills/lakehouse-doc-en/references/Datus_Lakehouse_MCPServer.md +153 -0
  176. package/bin/skills/lakehouse-doc-en/references/Dify_Integrated_with_LakehouseMCPServer.md +71 -0
  177. package/bin/skills/lakehouse-doc-en/references/Dify_Integreated_with_LakehouseMCPServer.md +71 -0
  178. package/bin/skills/lakehouse-doc-en/references/DynamicTableDevPractical.md +1 -0
  179. package/bin/skills/lakehouse-doc-en/references/ELTModernDataStack.md +485 -0
  180. package/bin/skills/lakehouse-doc-en/references/ELT_practice.md +1 -0
  181. package/bin/skills/lakehouse-doc-en/references/EXPLAIN.md +91 -0
  182. package/bin/skills/lakehouse-doc-en/references/EXTERNALFUNCTIONDevGuide.md +121 -0
  183. package/bin/skills/lakehouse-doc-en/references/EXTERNALSCHEMA.md +95 -0
  184. package/bin/skills/lakehouse-doc-en/references/EXTERNALSCHMEA.md +90 -0
  185. package/bin/skills/lakehouse-doc-en/references/EditionsOverview.md +68 -0
  186. package/bin/skills/lakehouse-doc-en/references/ExternalFunctionDevGuideJava.md +554 -0
  187. package/bin/skills/lakehouse-doc-en/references/FLOAT.md +84 -0
  188. package/bin/skills/lakehouse-doc-en/references/FeatureEngineeringForExpandingCustomerFeatureswithZettapark.md +431 -0
  189. package/bin/skills/lakehouse-doc-en/references/FileCommand.md +1 -0
  190. package/bin/skills/lakehouse-doc-en/references/FileFunction.md +1 -0
  191. package/bin/skills/lakehouse-doc-en/references/FineBI.md +192 -0
  192. package/bin/skills/lakehouse-doc-en/references/Foreach.md +90 -0
  193. package/bin/skills/lakehouse-doc-en/references/Full_Text_Search.md +1 -0
  194. package/bin/skills/lakehouse-doc-en/references/GET.md +87 -0
  195. package/bin/skills/lakehouse-doc-en/references/GET_PRESIGNED_URL.md +59 -0
  196. package/bin/skills/lakehouse-doc-en/references/GrantPriveleges.md +156 -0
  197. package/bin/skills/lakehouse-doc-en/references/Hive_connection.md +50 -0
  198. package/bin/skills/lakehouse-doc-en/references/IDENTITY-Column.md +67 -0
  199. package/bin/skills/lakehouse-doc-en/references/INSERT.md +230 -0
  200. package/bin/skills/lakehouse-doc-en/references/INSERTOVERWRITE.md +61 -0
  201. package/bin/skills/lakehouse-doc-en/references/INT.md +74 -0
  202. package/bin/skills/lakehouse-doc-en/references/INTERVAL.md +247 -0
  203. package/bin/skills/lakehouse-doc-en/references/Ingesting_Data_from_Alibaba_Cloud_Data_Lake_into_Lakehouse.md +920 -0
  204. package/bin/skills/lakehouse-doc-en/references/Ingestion.md +87 -0
  205. package/bin/skills/lakehouse-doc-en/references/Introduction.md +148 -0
  206. package/bin/skills/lakehouse-doc-en/references/JDBC-Driver.md +62 -0
  207. package/bin/skills/lakehouse-doc-en/references/JDBC_MindsDB_ML_LLM.md +255 -0
  208. package/bin/skills/lakehouse-doc-en/references/JOIN.md +265 -0
  209. package/bin/skills/lakehouse-doc-en/references/JSON.md +164 -0
  210. package/bin/skills/lakehouse-doc-en/references/JSON_DataType.md +49 -0
  211. package/bin/skills/lakehouse-doc-en/references/KAFKA_Storage_connection.md +1 -0
  212. package/bin/skills/lakehouse-doc-en/references/Kafka_connection.md +36 -0
  213. package/bin/skills/lakehouse-doc-en/references/Key_Concepts.md +116 -0
  214. package/bin/skills/lakehouse-doc-en/references/LATERALVIEW.md +157 -0
  215. package/bin/skills/lakehouse-doc-en/references/Lakehouse-client-repository.md +11 -0
  216. package/bin/skills/lakehouse-doc-en/references/LakehouseAI-overview.md +99 -0
  217. package/bin/skills/lakehouse-doc-en/references/LakehouseAI.md +0 -0
  218. package/bin/skills/lakehouse-doc-en/references/LakehouseAI_overview.md +16 -0
  219. package/bin/skills/lakehouse-doc-en/references/LakehouseAI/346/246/202/350/277/260.md +0 -0
  220. package/bin/skills/lakehouse-doc-en/references/LakehouseDataGPT-tour.md +64 -0
  221. package/bin/skills/lakehouse-doc-en/references/LakehouseDataGPTTour.md +64 -0
  222. package/bin/skills/lakehouse-doc-en/references/LakehouseMCPServer-intro.md +493 -0
  223. package/bin/skills/lakehouse-doc-en/references/LakehouseMCPServer.md +2 -0
  224. package/bin/skills/lakehouse-doc-en/references/LakehouseMCPServer_intro.md +493 -0
  225. package/bin/skills/lakehouse-doc-en/references/LakehousePython-zettapark.md +48 -0
  226. package/bin/skills/lakehouse-doc-en/references/LakehouseStudio-tour.md +185 -0
  227. package/bin/skills/lakehouse-doc-en/references/LakehouseStudioTour.md +185 -0
  228. package/bin/skills/lakehouse-doc-en/references/Lakehouse_Insight.md +98 -0
  229. package/bin/skills/lakehouse-doc-en/references/Lakehouse_Platform_Release_Note.md +1 -0
  230. package/bin/skills/lakehouse-doc-en/references/Lakehouse_Studio_101.md +1 -0
  231. package/bin/skills/lakehouse-doc-en/references/Lakehouse_Studio_Release_Note.md +1 -0
  232. package/bin/skills/lakehouse-doc-en/references/Lakehouse_Zilliz_MakeDataReadyforBIandAI.md +222 -0
  233. package/bin/skills/lakehouse-doc-en/references/Langchain_plug_installation.md +244 -0
  234. package/bin/skills/lakehouse-doc-en/references/Langchain_plug_quick_start.md +225 -0
  235. package/bin/skills/lakehouse-doc-en/references/Langchain_plugins_overview.md +409 -0
  236. package/bin/skills/lakehouse-doc-en/references/Limitation.md +7 -0
  237. package/bin/skills/lakehouse-doc-en/references/LoaddataintoLakehouse.md +1 -0
  238. package/bin/skills/lakehouse-doc-en/references/Logstash.md +181 -0
  239. package/bin/skills/lakehouse-doc-en/references/MAP.md +158 -0
  240. package/bin/skills/lakehouse-doc-en/references/MATERIALIZEDVIEW.md +111 -0
  241. package/bin/skills/lakehouse-doc-en/references/MCPServers.md +268 -0
  242. package/bin/skills/lakehouse-doc-en/references/MERGE.md +471 -0
  243. package/bin/skills/lakehouse-doc-en/references/ManageAccounts.md +187 -0
  244. package/bin/skills/lakehouse-doc-en/references/Management_All.md +1 -0
  245. package/bin/skills/lakehouse-doc-en/references/ManagingFilesonDatalakeVolumewithZettapark.md +161 -0
  246. package/bin/skills/lakehouse-doc-en/references/MigrateSnowflakeRealtimeETLPipelinetoClickzettaLakehouse.md +845 -0
  247. package/bin/skills/lakehouse-doc-en/references/Migrate_Spark_DataEngineeringBestPractices_Project_to_Lakehouse.md +300 -0
  248. package/bin/skills/lakehouse-doc-en/references/ModelMarket.md +1 -0
  249. package/bin/skills/lakehouse-doc-en/references/ModernDataStackWithEcosystemTools.md +1 -0
  250. package/bin/skills/lakehouse-doc-en/references/MonitoringItemSpecification.md +44 -0
  251. package/bin/skills/lakehouse-doc-en/references/N8N_AI_Workflow_Integration.md +1 -0
  252. package/bin/skills/lakehouse-doc-en/references/N8N_Integrated_with_LakehouseMCPServer.md +128 -0
  253. package/bin/skills/lakehouse-doc-en/references/Notebook.md +98 -0
  254. package/bin/skills/lakehouse-doc-en/references/OPTIMIZE.md +125 -0
  255. package/bin/skills/lakehouse-doc-en/references/ObjectModelDesign.md +1 -0
  256. package/bin/skills/lakehouse-doc-en/references/ObtainingAWSCloudARNandExternalID.md +188 -0
  257. package/bin/skills/lakehouse-doc-en/references/Op_and_Monitoring.md +1 -0
  258. package/bin/skills/lakehouse-doc-en/references/PUT.md +94 -0
  259. package/bin/skills/lakehouse-doc-en/references/PerformingVectorandScalarRetrievalinheSameTableinLakehouse.md +84 -0
  260. package/bin/skills/lakehouse-doc-en/references/Permission_Management.md +1 -0
  261. package/bin/skills/lakehouse-doc-en/references/Permission_application.md +43 -0
  262. package/bin/skills/lakehouse-doc-en/references/PowerBI.md +112 -0
  263. package/bin/skills/lakehouse-doc-en/references/PythonSDKVersionHistory.md +22 -0
  264. package/bin/skills/lakehouse-doc-en/references/PythonSample_put_gharchive2oss.md +153 -0
  265. package/bin/skills/lakehouse-doc-en/references/PythonSample_put_github_rt_events.md +336 -0
  266. package/bin/skills/lakehouse-doc-en/references/PythonSqlAlchemyVersionHistory.md +23 -0
  267. package/bin/skills/lakehouse-doc-en/references/Python_Task.md +27 -0
  268. package/bin/skills/lakehouse-doc-en/references/QuickStartwithCopycommand.md +65 -0
  269. package/bin/skills/lakehouse-doc-en/references/REFRESH.md +83 -0
  270. package/bin/skills/lakehouse-doc-en/references/REMOTEFUNCTION.md +1 -0
  271. package/bin/skills/lakehouse-doc-en/references/Refactor_ELT_practice.md +241 -0
  272. package/bin/skills/lakehouse-doc-en/references/ReferenceManual.md +1 -0
  273. package/bin/skills/lakehouse-doc-en/references/RemoteFunction-as-udf.md +43 -0
  274. package/bin/skills/lakehouse-doc-en/references/RemoteFunction-best-practice.md +350 -0
  275. package/bin/skills/lakehouse-doc-en/references/RemoteFunction-dev-guide-python3.md +570 -0
  276. package/bin/skills/lakehouse-doc-en/references/RemoteFunction-intro.md +54 -0
  277. package/bin/skills/lakehouse-doc-en/references/RemoteFunction-on-acr.md +249 -0
  278. package/bin/skills/lakehouse-doc-en/references/RemoteFunctionAsUDF.md +1 -0
  279. package/bin/skills/lakehouse-doc-en/references/RemoteFunctionAsUDF_sql.md +1 -0
  280. package/bin/skills/lakehouse-doc-en/references/RemoteFunctionBestPractice.md +343 -0
  281. package/bin/skills/lakehouse-doc-en/references/RemoteFunctionDevGuidePython3.md +145 -0
  282. package/bin/skills/lakehouse-doc-en/references/RemoteFunctionOnACR.md +245 -0
  283. package/bin/skills/lakehouse-doc-en/references/RemoteFunctionintro.md +54 -0
  284. package/bin/skills/lakehouse-doc-en/references/RevokePriveleges.md +81 -0
  285. package/bin/skills/lakehouse-doc-en/references/Row_LevelPermissions.md +42 -0
  286. package/bin/skills/lakehouse-doc-en/references/SCHEMA.md +63 -0
  287. package/bin/skills/lakehouse-doc-en/references/SCHEMADDL.md +0 -0
  288. package/bin/skills/lakehouse-doc-en/references/SHOW-INDEX.md +67 -0
  289. package/bin/skills/lakehouse-doc-en/references/SHOWCONNECTIONS.md +32 -0
  290. package/bin/skills/lakehouse-doc-en/references/SHOWFUNCTIONS.md +30 -0
  291. package/bin/skills/lakehouse-doc-en/references/SHOWGRANTS.md +54 -0
  292. package/bin/skills/lakehouse-doc-en/references/SHOWROLES.md +55 -0
  293. package/bin/skills/lakehouse-doc-en/references/SHOWTABLES.md +39 -0
  294. package/bin/skills/lakehouse-doc-en/references/SHOWUSERS.md +17 -0
  295. package/bin/skills/lakehouse-doc-en/references/SMALLINT.md +71 -0
  296. package/bin/skills/lakehouse-doc-en/references/SQL_Approx_Aggregate_Functions_Guide.md +371 -0
  297. package/bin/skills/lakehouse-doc-en/references/SQL_Array_Explode_Guide.md +360 -0
  298. package/bin/skills/lakehouse-doc-en/references/SQL_Array_Map_Processing_Guide.md +391 -0
  299. package/bin/skills/lakehouse-doc-en/references/SQL_Attribution_Guide.md +445 -0
  300. package/bin/skills/lakehouse-doc-en/references/SQL_Batch_Insert_Guide.md +171 -0
  301. package/bin/skills/lakehouse-doc-en/references/SQL_Bitmap_Guide.md +282 -0
  302. package/bin/skills/lakehouse-doc-en/references/SQL_CREATE_TABLE_GUIDE.md +1208 -0
  303. package/bin/skills/lakehouse-doc-en/references/SQL_Clone_Guide.md +188 -0
  304. package/bin/skills/lakehouse-doc-en/references/SQL_Copy_Into_Guide.md +181 -0
  305. package/bin/skills/lakehouse-doc-en/references/SQL_DML_Considerations.md +601 -0
  306. package/bin/skills/lakehouse-doc-en/references/SQL_Deduplication_Guide.md +203 -0
  307. package/bin/skills/lakehouse-doc-en/references/SQL_Dynamic_Table_Guide.md +215 -0
  308. package/bin/skills/lakehouse-doc-en/references/SQL_Explain_Guide.md +142 -0
  309. package/bin/skills/lakehouse-doc-en/references/SQL_External_Catalog_Guide.md +147 -0
  310. package/bin/skills/lakehouse-doc-en/references/SQL_External_Table_Guide.md +156 -0
  311. package/bin/skills/lakehouse-doc-en/references/SQL_Filter_Sort_Guide.md +225 -0
  312. package/bin/skills/lakehouse-doc-en/references/SQL_FullText_Search_Guide.md +337 -0
  313. package/bin/skills/lakehouse-doc-en/references/SQL_Funnel_Analysis_Guide.md +473 -0
  314. package/bin/skills/lakehouse-doc-en/references/SQL_Group_Aggregation_Guide.md +230 -0
  315. package/bin/skills/lakehouse-doc-en/references/SQL_Hierarchy_Workaround_Guide.md +435 -0
  316. package/bin/skills/lakehouse-doc-en/references/SQL_Index_Guide.md +155 -0
  317. package/bin/skills/lakehouse-doc-en/references/SQL_JSON_Parsing_Guide.md +221 -0
  318. package/bin/skills/lakehouse-doc-en/references/SQL_Join_Guide.md +660 -0
  319. package/bin/skills/lakehouse-doc-en/references/SQL_Null_Handling_Guide.md +273 -0
  320. package/bin/skills/lakehouse-doc-en/references/SQL_Object_Storage_Guide.md +1 -0
  321. package/bin/skills/lakehouse-doc-en/references/SQL_Optimize_Guide.md +137 -0
  322. package/bin/skills/lakehouse-doc-en/references/SQL_Pipe_Guide.md +178 -0
  323. package/bin/skills/lakehouse-doc-en/references/SQL_Pivot_Guide.md +346 -0
  324. package/bin/skills/lakehouse-doc-en/references/SQL_Query_Analysis_Guide.md +1 -0
  325. package/bin/skills/lakehouse-doc-en/references/SQL_Ranking_Guide.md +254 -0
  326. package/bin/skills/lakehouse-doc-en/references/SQL_Realtime_Guide.md +1 -0
  327. package/bin/skills/lakehouse-doc-en/references/SQL_Retention_Cohort_Guide.md +463 -0
  328. package/bin/skills/lakehouse-doc-en/references/SQL_Running_Total_Guide.md +324 -0
  329. package/bin/skills/lakehouse-doc-en/references/SQL_SELECT_Considerations.md +1831 -0
  330. package/bin/skills/lakehouse-doc-en/references/SQL_Sampling_Guide.md +154 -0
  331. package/bin/skills/lakehouse-doc-en/references/SQL_Sessionization_Guide.md +407 -0
  332. package/bin/skills/lakehouse-doc-en/references/SQL_Set_Operations_Guide.md +218 -0
  333. package/bin/skills/lakehouse-doc-en/references/SQL_Share_Guide.md +156 -0
  334. package/bin/skills/lakehouse-doc-en/references/SQL_String_Processing_Guide.md +663 -0
  335. package/bin/skills/lakehouse-doc-en/references/SQL_Syntax_Guide.md +1 -0
  336. package/bin/skills/lakehouse-doc-en/references/SQL_Table_Stream_Guide.md +178 -0
  337. package/bin/skills/lakehouse-doc-en/references/SQL_TimeSeries_Guide.md +362 -0
  338. package/bin/skills/lakehouse-doc-en/references/SQL_Time_Travel_Guide.md +173 -0
  339. package/bin/skills/lakehouse-doc-en/references/SQL_Type_Conversion_Guide.md +241 -0
  340. package/bin/skills/lakehouse-doc-en/references/SQL_Update_Delete_Guide.md +229 -0
  341. package/bin/skills/lakehouse-doc-en/references/SQL_Upsert_Guide.md +199 -0
  342. package/bin/skills/lakehouse-doc-en/references/SQL_Vector_Search_Guide.md +555 -0
  343. package/bin/skills/lakehouse-doc-en/references/SQL_View_Guide.md +195 -0
  344. package/bin/skills/lakehouse-doc-en/references/SQL_Volume_Guide.md +149 -0
  345. package/bin/skills/lakehouse-doc-en/references/SQL_With_CTE_Guide.md +1526 -0
  346. package/bin/skills/lakehouse-doc-en/references/SQL_Write_Change_Guide.md +1 -0
  347. package/bin/skills/lakehouse-doc-en/references/SQL_customers.md +73 -0
  348. package/bin/skills/lakehouse-doc-en/references/SQL_revenue.md +31 -0
  349. package/bin/skills/lakehouse-doc-en/references/STRING.md +74 -0
  350. package/bin/skills/lakehouse-doc-en/references/STRUCT.md +199 -0
  351. package/bin/skills/lakehouse-doc-en/references/SUMMARY.md +1485 -0
  352. package/bin/skills/lakehouse-doc-en/references/SYNCPOINT.md +4 -0
  353. package/bin/skills/lakehouse-doc-en/references/Security_system_inventory_and_optimization_based_Information_Schema.md +411 -0
  354. package/bin/skills/lakehouse-doc-en/references/SecurityandGovernance.md +1 -0
  355. package/bin/skills/lakehouse-doc-en/references/SharedData.md +1 -0
  356. package/bin/skills/lakehouse-doc-en/references/SlowlyChangingDimensionsInLakehouseUsingStreamsandTasks.md +587 -0
  357. package/bin/skills/lakehouse-doc-en/references/Spark_Lakehouse_iceberg_REST.md +151 -0
  358. package/bin/skills/lakehouse-doc-en/references/Structured_and_semi-structured_data_analysis.md +171 -0
  359. package/bin/skills/lakehouse-doc-en/references/Studio.md +78 -0
  360. package/bin/skills/lakehouse-doc-en/references/StudioDI_PrivateLinkVPC_fromRDS.md +105 -0
  361. package/bin/skills/lakehouse-doc-en/references/Supported_Cloud_Platforms.md +34 -0
  362. package/bin/skills/lakehouse-doc-en/references/TABLE.md +183 -0
  363. package/bin/skills/lakehouse-doc-en/references/TIMESTAMP.md +117 -0
  364. package/bin/skills/lakehouse-doc-en/references/TIMETRAVEL.md +150 -0
  365. package/bin/skills/lakehouse-doc-en/references/TINYINT.md +77 -0
  366. package/bin/skills/lakehouse-doc-en/references/TPC-H100G_experience.md +45 -0
  367. package/bin/skills/lakehouse-doc-en/references/TRUNCATE.md +123 -0
  368. package/bin/skills/lakehouse-doc-en/references/TableDesign.md +270 -0
  369. package/bin/skills/lakehouse-doc-en/references/TableauConnectToLakehouse.md +73 -0
  370. package/bin/skills/lakehouse-doc-en/references/Transformation.md +1 -0
  371. package/bin/skills/lakehouse-doc-en/references/UNDROP-TABLE.md +171 -0
  372. package/bin/skills/lakehouse-doc-en/references/UPDATE.md +217 -0
  373. package/bin/skills/lakehouse-doc-en/references/USESCHEMA.md +52 -0
  374. package/bin/skills/lakehouse-doc-en/references/Unstructured_io.md +749 -0
  375. package/bin/skills/lakehouse-doc-en/references/Usage_Statistics.md +1 -0
  376. package/bin/skills/lakehouse-doc-en/references/VARCHARleghth.md +33 -0
  377. package/bin/skills/lakehouse-doc-en/references/VARCHARlength.md +73 -0
  378. package/bin/skills/lakehouse-doc-en/references/VIEW.md +74 -0
  379. package/bin/skills/lakehouse-doc-en/references/WEBIDE.md +1 -0
  380. package/bin/skills/lakehouse-doc-en/references/WINDOWCALUSE.md +66 -0
  381. package/bin/skills/lakehouse-doc-en/references/WINDOWFUNCTION.md +561 -0
  382. package/bin/skills/lakehouse-doc-en/references/WITH.md +137 -0
  383. package/bin/skills/lakehouse-doc-en/references/ZettaparkQuickStart.md +466 -0
  384. package/bin/skills/lakehouse-doc-en/references/Zettapark_Data_Engineering_Demo.md +351 -0
  385. package/bin/skills/lakehouse-doc-en/references/a_comprehensive_guide_to_ingesting_data_into_clickzetta_lakehouse.md +66 -0
  386. package/bin/skills/lakehouse-doc-en/references/access-control-configuration.md +249 -0
  387. package/bin/skills/lakehouse-doc-en/references/access-control-general.md +81 -0
  388. package/bin/skills/lakehouse-doc-en/references/access-control.md +235 -0
  389. package/bin/skills/lakehouse-doc-en/references/account-funds.md +87 -0
  390. package/bin/skills/lakehouse-doc-en/references/account.md +2 -0
  391. package/bin/skills/lakehouse-doc-en/references/account_user_management.md +108 -0
  392. package/bin/skills/lakehouse-doc-en/references/agg_function.md +1 -0
  393. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/approx_count_distinct.md +50 -0
  394. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/bit_and.md +40 -0
  395. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/bit_or.md +41 -0
  396. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/bit_xor.md +50 -0
  397. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/bool_and.md +39 -0
  398. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/bool_or.md +47 -0
  399. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/collect_list.md +58 -0
  400. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/collect_set.md +45 -0
  401. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/count.md +44 -0
  402. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/group_bitmap_merge_state.md +40 -0
  403. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/group_bitmap_state.md +110 -0
  404. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/max.md +58 -0
  405. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/min.md +51 -0
  406. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/percentile.md +48 -0
  407. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/stddev_pop.md +51 -0
  408. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/stddev_samp.md +55 -0
  409. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/sum.md +51 -0
  410. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/var_pop.md +60 -0
  411. package/bin/skills/lakehouse-doc-en/references/aggregate_functions/var_samp.md +46 -0
  412. package/bin/skills/lakehouse-doc-en/references/ai_app_dev_practical.md +3 -0
  413. package/bin/skills/lakehouse-doc-en/references/ai_classify.md +263 -0
  414. package/bin/skills/lakehouse-doc-en/references/ai_extract.md +248 -0
  415. package/bin/skills/lakehouse-doc-en/references/ai_fix_grammar.md +200 -0
  416. package/bin/skills/lakehouse-doc-en/references/ai_function_best_practice.md +1 -0
  417. package/bin/skills/lakehouse-doc-en/references/ai_functions_overview.md +259 -0
  418. package/bin/skills/lakehouse-doc-en/references/ai_mask.md +188 -0
  419. package/bin/skills/lakehouse-doc-en/references/ai_sentiment.md +204 -0
  420. package/bin/skills/lakehouse-doc-en/references/ai_similarity.md +201 -0
  421. package/bin/skills/lakehouse-doc-en/references/ai_summarize.md +168 -0
  422. package/bin/skills/lakehouse-doc-en/references/ai_transcribe.md +188 -0
  423. package/bin/skills/lakehouse-doc-en/references/ai_translate.md +197 -0
  424. package/bin/skills/lakehouse-doc-en/references/airbyte.md +107 -0
  425. package/bin/skills/lakehouse-doc-en/references/alert.md +139 -0
  426. package/bin/skills/lakehouse-doc-en/references/alicloud-arn-externalid.md +49 -0
  427. package/bin/skills/lakehouse-doc-en/references/alicloud_byos_configuration.md +129 -0
  428. package/bin/skills/lakehouse-doc-en/references/aliyun_storage_connection.md +133 -0
  429. package/bin/skills/lakehouse-doc-en/references/alter-dynamic-table.md +327 -0
  430. package/bin/skills/lakehouse-doc-en/references/alter-external-schema.md +86 -0
  431. package/bin/skills/lakehouse-doc-en/references/alter-materialized-view.md +212 -0
  432. package/bin/skills/lakehouse-doc-en/references/alter-network-policy.md +62 -0
  433. package/bin/skills/lakehouse-doc-en/references/alter-pipe.md +43 -0
  434. package/bin/skills/lakehouse-doc-en/references/alter-share.md +39 -0
  435. package/bin/skills/lakehouse-doc-en/references/alter-table.md +349 -0
  436. package/bin/skills/lakehouse-doc-en/references/alter-user.md +53 -0
  437. package/bin/skills/lakehouse-doc-en/references/alter-vcluster.md +154 -0
  438. package/bin/skills/lakehouse-doc-en/references/alter-volume.md +51 -0
  439. package/bin/skills/lakehouse-doc-en/references/alter-workspace.md +43 -0
  440. package/bin/skills/lakehouse-doc-en/references/alter.md +72 -0
  441. package/bin/skills/lakehouse-doc-en/references/analysis_internet_data_nyc_green_data.md +449 -0
  442. package/bin/skills/lakehouse-doc-en/references/analytics-agent-core-concepts.md +34 -0
  443. package/bin/skills/lakehouse-doc-en/references/analytics-modern-data-stack.md +386 -0
  444. package/bin/skills/lakehouse-doc-en/references/analytics_cluster_best_practices.md +367 -0
  445. package/bin/skills/lakehouse-doc-en/references/analyze-table.md +51 -0
  446. package/bin/skills/lakehouse-doc-en/references/answer-accuracy-improve.md +179 -0
  447. package/bin/skills/lakehouse-doc-en/references/application-list.md +26 -0
  448. package/bin/skills/lakehouse-doc-en/references/approval-list.md +61 -0
  449. package/bin/skills/lakehouse-doc-en/references/array_function.md +1 -0
  450. package/bin/skills/lakehouse-doc-en/references/array_size.md +31 -0
  451. package/bin/skills/lakehouse-doc-en/references/authentication.md +53 -0
  452. package/bin/skills/lakehouse-doc-en/references/authority-management.md +157 -0
  453. package/bin/skills/lakehouse-doc-en/references/authoritymanagement.md +1 -0
  454. package/bin/skills/lakehouse-doc-en/references/auto-index.md +52 -0
  455. package/bin/skills/lakehouse-doc-en/references/aws_storage_connection.md +112 -0
  456. package/bin/skills/lakehouse-doc-en/references/awscloud_privatelink.md +1 -0
  457. package/bin/skills/lakehouse-doc-en/references/backfilling_data.md +50 -0
  458. package/bin/skills/lakehouse-doc-en/references/batch-load-parquet-file-into-lakehouse.md +79 -0
  459. package/bin/skills/lakehouse-doc-en/references/batch_sync.md +62 -0
  460. package/bin/skills/lakehouse-doc-en/references/batch_sync_Sop.md +135 -0
  461. package/bin/skills/lakehouse-doc-en/references/batchloadparquetfileintoLakehouse.md +79 -0
  462. package/bin/skills/lakehouse-doc-en/references/benchmark.md +1 -0
  463. package/bin/skills/lakehouse-doc-en/references/benchmark_guide.md +43 -0
  464. package/bin/skills/lakehouse-doc-en/references/best_ractices.md +1 -0
  465. package/bin/skills/lakehouse-doc-en/references/bestpractice_bazhuanyu.md +1 -0
  466. package/bin/skills/lakehouse-doc-en/references/bit_function.md +1 -0
  467. package/bin/skills/lakehouse-doc-en/references/bitmap-type.md +381 -0
  468. package/bin/skills/lakehouse-doc-en/references/bitmap_function.md +1 -0
  469. package/bin/skills/lakehouse-doc-en/references/bitmap_uba_guide.md +1234 -0
  470. package/bin/skills/lakehouse-doc-en/references/bloomfilter-summary.md +136 -0
  471. package/bin/skills/lakehouse-doc-en/references/bluepipe-oracle-lakehouse-datasync.md +244 -0
  472. package/bin/skills/lakehouse-doc-en/references/bring_your_own_storage.md +1 -0
  473. package/bin/skills/lakehouse-doc-en/references/build-index.md +75 -0
  474. package/bin/skills/lakehouse-doc-en/references/build_rag_with_langchain.md +652 -0
  475. package/bin/skills/lakehouse-doc-en/references/bulkload-summary.md +37 -0
  476. package/bin/skills/lakehouse-doc-en/references/bulkloadv1-java-sdk.md +177 -0
  477. package/bin/skills/lakehouse-doc-en/references/bulkloadv1-python-sdk.md +169 -0
  478. package/bin/skills/lakehouse-doc-en/references/business-audit-trail-with-table-stream.md +281 -0
  479. package/bin/skills/lakehouse-doc-en/references/byok.md +1 -0
  480. package/bin/skills/lakehouse-doc-en/references/byos_general.md +165 -0
  481. package/bin/skills/lakehouse-doc-en/references/byos_tencentcloud_configuration.md +138 -0
  482. package/bin/skills/lakehouse-doc-en/references/cache-command.md +31 -0
  483. package/bin/skills/lakehouse-doc-en/references/cancel-job.md +49 -0
  484. package/bin/skills/lakehouse-doc-en/references/cardinality_array.md +43 -0
  485. package/bin/skills/lakehouse-doc-en/references/charge_analysis_with_lakehouse_mcp_server.md +393 -0
  486. package/bin/skills/lakehouse-doc-en/references/chart-auto-refresh-guide.md +30 -0
  487. package/bin/skills/lakehouse-doc-en/references/clickzetta-sample-data.md +239 -0
  488. package/bin/skills/lakehouse-doc-en/references/clone-doc.md +101 -0
  489. package/bin/skills/lakehouse-doc-en/references/cloud_object_storage.md +1 -0
  490. package/bin/skills/lakehouse-doc-en/references/cluster-table-guide.md +64 -0
  491. package/bin/skills/lakehouse-doc-en/references/cluster-table.md +111 -0
  492. package/bin/skills/lakehouse-doc-en/references/code_approval.md +34 -0
  493. package/bin/skills/lakehouse-doc-en/references/composite_task.md +192 -0
  494. package/bin/skills/lakehouse-doc-en/references/comprehensive_comparison.md +83 -0
  495. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_3rd_tools.md +11 -0
  496. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_dbv_sql_put.md +44 -0
  497. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_environment_and_data_generate.md +669 -0
  498. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_javasdk_bulkload_realtime.md +741 -0
  499. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_kafka_realtime_sync.md +73 -0
  500. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_local_file_into_table_by_studio.md +64 -0
  501. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_overview.md +66 -0
  502. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_pipe_kafka.md +8 -0
  503. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_pipe_oss.md +7 -0
  504. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_studio_batchload_public_network.md +52 -0
  505. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_studio_python_node.md +113 -0
  506. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_studio_realtime_cdc_public_network.md +253 -0
  507. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_studio_sql_insert.md +174 -0
  508. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_zettapark_put_file_to_lake.md +222 -0
  509. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_zettapark_save_as_table.md +87 -0
  510. package/bin/skills/lakehouse-doc-en/references/comprehensive_guide_to_ingesting_zettapark_sql_insert.md +121 -0
  511. package/bin/skills/lakehouse-doc-en/references/compute-overview.md +142 -0
  512. package/bin/skills/lakehouse-doc-en/references/compute-resource-ddl.md +128 -0
  513. package/bin/skills/lakehouse-doc-en/references/concepts.md +209 -0
  514. package/bin/skills/lakehouse-doc-en/references/concurrency_scaling.md +56 -0
  515. package/bin/skills/lakehouse-doc-en/references/conditional_function.md +1 -0
  516. package/bin/skills/lakehouse-doc-en/references/config-datasource.md +82 -0
  517. package/bin/skills/lakehouse-doc-en/references/config_volume_dify_storage.md +342 -0
  518. package/bin/skills/lakehouse-doc-en/references/connect-with-cli.md +127 -0
  519. package/bin/skills/lakehouse-doc-en/references/connect_to_Lakehouse.md +1 -0
  520. package/bin/skills/lakehouse-doc-en/references/connection-guide.md +57 -0
  521. package/bin/skills/lakehouse-doc-en/references/connection-overview.md +171 -0
  522. package/bin/skills/lakehouse-doc-en/references/considerations-for-using-sql.md +1 -0
  523. package/bin/skills/lakehouse-doc-en/references/context_function.md +1 -0
  524. package/bin/skills/lakehouse-doc-en/references/continue-job.md +256 -0
  525. package/bin/skills/lakehouse-doc-en/references/conversational_analytics_datagpt.md +3 -0
  526. package/bin/skills/lakehouse-doc-en/references/copy-into-table.md +442 -0
  527. package/bin/skills/lakehouse-doc-en/references/copy.md +74 -0
  528. package/bin/skills/lakehouse-doc-en/references/cos_storage_connection.md +73 -0
  529. package/bin/skills/lakehouse-doc-en/references/cos_volume_creation.md +39 -0
  530. package/bin/skills/lakehouse-doc-en/references/cost_management.md +2 -0
  531. package/bin/skills/lakehouse-doc-en/references/create-api-connection.md +352 -0
  532. package/bin/skills/lakehouse-doc-en/references/create-catalog-connection.md +253 -0
  533. package/bin/skills/lakehouse-doc-en/references/create-dynamic-table.md +461 -0
  534. package/bin/skills/lakehouse-doc-en/references/create-embedding-function.md +244 -0
  535. package/bin/skills/lakehouse-doc-en/references/create-external-catalog.md +220 -0
  536. package/bin/skills/lakehouse-doc-en/references/create-external-delta.md +76 -0
  537. package/bin/skills/lakehouse-doc-en/references/create-external-schema.md +165 -0
  538. package/bin/skills/lakehouse-doc-en/references/create-external-table.md +75 -0
  539. package/bin/skills/lakehouse-doc-en/references/create-external-volume.md +102 -0
  540. package/bin/skills/lakehouse-doc-en/references/create-hive-catalog.md +64 -0
  541. package/bin/skills/lakehouse-doc-en/references/create-inverted-index.md +154 -0
  542. package/bin/skills/lakehouse-doc-en/references/create-kafka-external.md +298 -0
  543. package/bin/skills/lakehouse-doc-en/references/create-llm-function.md +245 -0
  544. package/bin/skills/lakehouse-doc-en/references/create-materialized-view.md +529 -0
  545. package/bin/skills/lakehouse-doc-en/references/create-network-policy.md +54 -0
  546. package/bin/skills/lakehouse-doc-en/references/create-pipe.md +60 -0
  547. package/bin/skills/lakehouse-doc-en/references/create-role.md +94 -0
  548. package/bin/skills/lakehouse-doc-en/references/create-schema-from-share.md +55 -0
  549. package/bin/skills/lakehouse-doc-en/references/create-schema.md +88 -0
  550. package/bin/skills/lakehouse-doc-en/references/create-share.md +132 -0
  551. package/bin/skills/lakehouse-doc-en/references/create-sql-function.md +119 -0
  552. package/bin/skills/lakehouse-doc-en/references/create-storage-connection.md +316 -0
  553. package/bin/skills/lakehouse-doc-en/references/create-synonym.md +64 -0
  554. package/bin/skills/lakehouse-doc-en/references/create-table-ddl.md +390 -0
  555. package/bin/skills/lakehouse-doc-en/references/create-table-stream.md +217 -0
  556. package/bin/skills/lakehouse-doc-en/references/create-user.md +59 -0
  557. package/bin/skills/lakehouse-doc-en/references/create-vector-index.md +105 -0
  558. package/bin/skills/lakehouse-doc-en/references/create-view.md +128 -0
  559. package/bin/skills/lakehouse-doc-en/references/create-volume.md +64 -0
  560. package/bin/skills/lakehouse-doc-en/references/create.md +93 -0
  561. package/bin/skills/lakehouse-doc-en/references/create_cluster.md +117 -0
  562. package/bin/skills/lakehouse-doc-en/references/creating_alicloud_privatelinkendpoint.md +37 -0
  563. package/bin/skills/lakehouse-doc-en/references/creating_alicloud_privatelinkservice.md +31 -0
  564. package/bin/skills/lakehouse-doc-en/references/creating_tencentcloud_privatelinkendpoint.md +35 -0
  565. package/bin/skills/lakehouse-doc-en/references/creating_tencentcloud_privatelinkservice.md +19 -0
  566. package/bin/skills/lakehouse-doc-en/references/credit-scoring-with-zettapark.md +873 -0
  567. package/bin/skills/lakehouse-doc-en/references/crete-external-catalog.md +16 -0
  568. package/bin/skills/lakehouse-doc-en/references/cz-cli-agent.md +142 -0
  569. package/bin/skills/lakehouse-doc-en/references/cz-cli-datasource.md +63 -0
  570. package/bin/skills/lakehouse-doc-en/references/cz-cli-sql.md +396 -0
  571. package/bin/skills/lakehouse-doc-en/references/cz-cli-studio-tasks.md +241 -0
  572. package/bin/skills/lakehouse-doc-en/references/cz-cli.md +130 -0
  573. package/bin/skills/lakehouse-doc-en/references/czguide-intro-to-cdc-using-clickzetta-rtsync-dynamic-tables.md +738 -0
  574. package/bin/skills/lakehouse-doc-en/references/dashboard-version-management-guide.md +109 -0
  575. package/bin/skills/lakehouse-doc-en/references/data-catalog.md +1 -0
  576. package/bin/skills/lakehouse-doc-en/references/data-integration-intro.md +58 -0
  577. package/bin/skills/lakehouse-doc-en/references/data-integration.md +51 -0
  578. package/bin/skills/lakehouse-doc-en/references/data-lifecycle.md +38 -0
  579. package/bin/skills/lakehouse-doc-en/references/data-load-summary.md +57 -0
  580. package/bin/skills/lakehouse-doc-en/references/data-mamager-tool.md +1 -0
  581. package/bin/skills/lakehouse-doc-en/references/data-manager-tool.md +1 -0
  582. package/bin/skills/lakehouse-doc-en/references/data-quality.md +99 -0
  583. package/bin/skills/lakehouse-doc-en/references/data-recover.md +52 -0
  584. package/bin/skills/lakehouse-doc-en/references/data-recovery-with-time-travel.md +248 -0
  585. package/bin/skills/lakehouse-doc-en/references/data-sharing-concept.md +110 -0
  586. package/bin/skills/lakehouse-doc-en/references/data-sharing-overview.md +149 -0
  587. package/bin/skills/lakehouse-doc-en/references/data-sharing.md +320 -0
  588. package/bin/skills/lakehouse-doc-en/references/data-sources.md +62 -0
  589. package/bin/skills/lakehouse-doc-en/references/data-sync-intro.md +23 -0
  590. package/bin/skills/lakehouse-doc-en/references/data-type-guide.md +86 -0
  591. package/bin/skills/lakehouse-doc-en/references/data-type.md +219 -0
  592. package/bin/skills/lakehouse-doc-en/references/data-types-timestamp-ntz.md +128 -0
  593. package/bin/skills/lakehouse-doc-en/references/data.md +1 -0
  594. package/bin/skills/lakehouse-doc-en/references/data_catalog.md +109 -0
  595. package/bin/skills/lakehouse-doc-en/references/data_clean_with_sql.md +378 -0
  596. package/bin/skills/lakehouse-doc-en/references/data_ops.md +7 -0
  597. package/bin/skills/lakehouse-doc-en/references/data_org.md +1 -0
  598. package/bin/skills/lakehouse-doc-en/references/data_partition.md +1 -0
  599. package/bin/skills/lakehouse-doc-en/references/data_privacy.md +43 -0
  600. package/bin/skills/lakehouse-doc-en/references/data_result_profile.md +21 -0
  601. package/bin/skills/lakehouse-doc-en/references/data_security.md +65 -0
  602. package/bin/skills/lakehouse-doc-en/references/data_share.md +1 -0
  603. package/bin/skills/lakehouse-doc-en/references/data_sharing_between_accounts_guide.md +331 -0
  604. package/bin/skills/lakehouse-doc-en/references/data_sharing_guide.md +117 -0
  605. package/bin/skills/lakehouse-doc-en/references/data_sync.md +6 -0
  606. package/bin/skills/lakehouse-doc-en/references/data_transfer_datalake.md +1 -0
  607. package/bin/skills/lakehouse-doc-en/references/data_visualization.md +96 -0
  608. package/bin/skills/lakehouse-doc-en/references/dataagent.md +80 -0
  609. package/bin/skills/lakehouse-doc-en/references/databricks_yunqi_integration_guide_v2.md +842 -0
  610. package/bin/skills/lakehouse-doc-en/references/datacatalog.md +127 -0
  611. package/bin/skills/lakehouse-doc-en/references/datagpt-release.md +1 -0
  612. package/bin/skills/lakehouse-doc-en/references/datagpt_bestpractice.md +1 -0
  613. package/bin/skills/lakehouse-doc-en/references/datagpt_data_source.md +57 -0
  614. package/bin/skills/lakehouse-doc-en/references/datagpt_get_accurate_answers.md +40 -0
  615. package/bin/skills/lakehouse-doc-en/references/datagpt_intro.md +75 -0
  616. package/bin/skills/lakehouse-doc-en/references/datagpt_introduction.md +129 -0
  617. package/bin/skills/lakehouse-doc-en/references/datagpt_quickstart.md +99 -0
  618. package/bin/skills/lakehouse-doc-en/references/datagpt_tutorial.md +1 -0
  619. package/bin/skills/lakehouse-doc-en/references/datalake-acceleration.md +5 -0
  620. package/bin/skills/lakehouse-doc-en/references/datalake-storage-connection.md +12 -0
  621. package/bin/skills/lakehouse-doc-en/references/datalake_FAQ.md +48 -0
  622. package/bin/skills/lakehouse-doc-en/references/datalake_overview.md +17 -0
  623. package/bin/skills/lakehouse-doc-en/references/datalake_privilege.md +55 -0
  624. package/bin/skills/lakehouse-doc-en/references/datalake_query_ingest.md +18 -0
  625. package/bin/skills/lakehouse-doc-en/references/datalake_unstructure_data.md +3 -0
  626. package/bin/skills/lakehouse-doc-en/references/datalake_volume.md +114 -0
  627. package/bin/skills/lakehouse-doc-en/references/datalake_volume_analytics.md +37 -0
  628. package/bin/skills/lakehouse-doc-en/references/datalake_volume_anlytics.md +1 -0
  629. package/bin/skills/lakehouse-doc-en/references/datalake_volume_object.md +1 -0
  630. package/bin/skills/lakehouse-doc-en/references/datalakeanalysis.md +272 -0
  631. package/bin/skills/lakehouse-doc-en/references/dataops_practice.md +104 -0
  632. package/bin/skills/lakehouse-doc-en/references/datasharing.md +328 -0
  633. package/bin/skills/lakehouse-doc-en/references/datasharing_catalog.md +1 -0
  634. package/bin/skills/lakehouse-doc-en/references/datasource-config-guide.md +1 -0
  635. package/bin/skills/lakehouse-doc-en/references/datasource_ip_whitelist.md +93 -0
  636. package/bin/skills/lakehouse-doc-en/references/datasources.md +62 -0
  637. package/bin/skills/lakehouse-doc-en/references/datatype-cast.md +89 -0
  638. package/bin/skills/lakehouse-doc-en/references/datatype-conversion.md +331 -0
  639. package/bin/skills/lakehouse-doc-en/references/datetime_patterns.md +61 -0
  640. package/bin/skills/lakehouse-doc-en/references/datus_lakehouse_installation.md +418 -0
  641. package/bin/skills/lakehouse-doc-en/references/datus_lakehouse_solution_overview.md +148 -0
  642. package/bin/skills/lakehouse-doc-en/references/db_dw_connection.md +1 -0
  643. package/bin/skills/lakehouse-doc-en/references/dbt-advanced-features.md +261 -0
  644. package/bin/skills/lakehouse-doc-en/references/dbt-bigquery-to-clickzetta-migration.md +458 -0
  645. package/bin/skills/lakehouse-doc-en/references/dbt-data-quality.md +237 -0
  646. package/bin/skills/lakehouse-doc-en/references/dbt-incremental.md +340 -0
  647. package/bin/skills/lakehouse-doc-en/references/dbt-practice-series.md +90 -0
  648. package/bin/skills/lakehouse-doc-en/references/dbt-realtime-pipeline.md +308 -0
  649. package/bin/skills/lakehouse-doc-en/references/dbt-snowflake-to-clickzetta-migration.md +384 -0
  650. package/bin/skills/lakehouse-doc-en/references/default-value.md +87 -0
  651. package/bin/skills/lakehouse-doc-en/references/delta-lake.md +190 -0
  652. package/bin/skills/lakehouse-doc-en/references/desc-catalog-table.md +27 -0
  653. package/bin/skills/lakehouse-doc-en/references/desc-catalog.md +28 -0
  654. package/bin/skills/lakehouse-doc-en/references/desc-connection.md +38 -0
  655. package/bin/skills/lakehouse-doc-en/references/desc-dynamic-table.md +95 -0
  656. package/bin/skills/lakehouse-doc-en/references/desc-external-schemas.md +57 -0
  657. package/bin/skills/lakehouse-doc-en/references/desc-external-table.md +63 -0
  658. package/bin/skills/lakehouse-doc-en/references/desc-function.md +37 -0
  659. package/bin/skills/lakehouse-doc-en/references/desc-history-dynamic-table.md +44 -0
  660. package/bin/skills/lakehouse-doc-en/references/desc-history-table.md +61 -0
  661. package/bin/skills/lakehouse-doc-en/references/desc-history.md +91 -0
  662. package/bin/skills/lakehouse-doc-en/references/desc-materialized-view.md +110 -0
  663. package/bin/skills/lakehouse-doc-en/references/desc-network-policy.md +43 -0
  664. package/bin/skills/lakehouse-doc-en/references/desc-pipe.md +37 -0
  665. package/bin/skills/lakehouse-doc-en/references/desc-schemas.md +59 -0
  666. package/bin/skills/lakehouse-doc-en/references/desc-share.md +48 -0
  667. package/bin/skills/lakehouse-doc-en/references/desc-table-stream.md +64 -0
  668. package/bin/skills/lakehouse-doc-en/references/desc-table.md +103 -0
  669. package/bin/skills/lakehouse-doc-en/references/desc-vcluster.md +79 -0
  670. package/bin/skills/lakehouse-doc-en/references/desc-view.md +66 -0
  671. package/bin/skills/lakehouse-doc-en/references/desc-volume.md +84 -0
  672. package/bin/skills/lakehouse-doc-en/references/desc-workspace.md +49 -0
  673. package/bin/skills/lakehouse-doc-en/references/describe.md +80 -0
  674. package/bin/skills/lakehouse-doc-en/references/dify_config_lakehouse_as_vectordb.md +314 -0
  675. package/bin/skills/lakehouse-doc-en/references/dify_yunqilakehouse_integration_overview.md +232 -0
  676. package/bin/skills/lakehouse-doc-en/references/discovery_analysis_data_in_json_file_on_external_volume.md +625 -0
  677. package/bin/skills/lakehouse-doc-en/references/discovery_analysis_data_in_parquet_file_on_external_volume.md +599 -0
  678. package/bin/skills/lakehouse-doc-en/references/dml.md +90 -0
  679. package/bin/skills/lakehouse-doc-en/references/download-data-with-copy.md +27 -0
  680. package/bin/skills/lakehouse-doc-en/references/download-data.md +1 -0
  681. package/bin/skills/lakehouse-doc-en/references/drop-connection.md +34 -0
  682. package/bin/skills/lakehouse-doc-en/references/drop-dynamic-table.md +76 -0
  683. package/bin/skills/lakehouse-doc-en/references/drop-external-schema.md +40 -0
  684. package/bin/skills/lakehouse-doc-en/references/drop-external-table.md +32 -0
  685. package/bin/skills/lakehouse-doc-en/references/drop-function.md +51 -0
  686. package/bin/skills/lakehouse-doc-en/references/drop-materialized-view.md +73 -0
  687. package/bin/skills/lakehouse-doc-en/references/drop-network-policy.md +36 -0
  688. package/bin/skills/lakehouse-doc-en/references/drop-pipe.md +15 -0
  689. package/bin/skills/lakehouse-doc-en/references/drop-role.md +61 -0
  690. package/bin/skills/lakehouse-doc-en/references/drop-schema.md +37 -0
  691. package/bin/skills/lakehouse-doc-en/references/drop-share.md +73 -0
  692. package/bin/skills/lakehouse-doc-en/references/drop-synonym.md +31 -0
  693. package/bin/skills/lakehouse-doc-en/references/drop-table-stream.md +31 -0
  694. package/bin/skills/lakehouse-doc-en/references/drop-table.md +70 -0
  695. package/bin/skills/lakehouse-doc-en/references/drop-user.md +43 -0
  696. package/bin/skills/lakehouse-doc-en/references/drop-vcluster.md +63 -0
  697. package/bin/skills/lakehouse-doc-en/references/drop-view.md +63 -0
  698. package/bin/skills/lakehouse-doc-en/references/drop-volume.md +55 -0
  699. package/bin/skills/lakehouse-doc-en/references/drop.md +84 -0
  700. package/bin/skills/lakehouse-doc-en/references/dynamic-mask.md +202 -0
  701. package/bin/skills/lakehouse-doc-en/references/dynamic-table-bestpractice.md +193 -0
  702. package/bin/skills/lakehouse-doc-en/references/dynamic-table-dml-sql.md +54 -0
  703. package/bin/skills/lakehouse-doc-en/references/dynamic-table-dml.md +48 -0
  704. package/bin/skills/lakehouse-doc-en/references/dynamic-table-incre.md +38 -0
  705. package/bin/skills/lakehouse-doc-en/references/dynamic-table-introduce.md +320 -0
  706. package/bin/skills/lakehouse-doc-en/references/dynamic-table-parameters.md +423 -0
  707. package/bin/skills/lakehouse-doc-en/references/dynamic-table-scheduling.md +42 -0
  708. package/bin/skills/lakehouse-doc-en/references/dynamic-table.md +96 -0
  709. package/bin/skills/lakehouse-doc-en/references/dynamicTable-DML-sql.md +59 -0
  710. package/bin/skills/lakehouse-doc-en/references/dynamicTable-dml.md +47 -0
  711. package/bin/skills/lakehouse-doc-en/references/dynamicTable-parmaters.md +250 -0
  712. package/bin/skills/lakehouse-doc-en/references/dynamic_table_summary.md +366 -0
  713. package/bin/skills/lakehouse-doc-en/references/dynamic_table_task.md +68 -0
  714. package/bin/skills/lakehouse-doc-en/references/dynamic_table_using_studio.md +163 -0
  715. package/bin/skills/lakehouse-doc-en/references/dynamictable.md +53 -0
  716. package/bin/skills/lakehouse-doc-en/references/eco_integration/Zeppelin.md +79 -0
  717. package/bin/skills/lakehouse-doc-en/references/eco_integration/airbyte.md +74 -0
  718. package/bin/skills/lakehouse-doc-en/references/eco_integration/datagrip-lakehouse.md +55 -0
  719. package/bin/skills/lakehouse-doc-en/references/eco_integration/datax.md +163 -0
  720. package/bin/skills/lakehouse-doc-en/references/eco_integration/dbeaver-lakehouse.md +65 -0
  721. package/bin/skills/lakehouse-doc-en/references/eco_integration/dbt-jaffle-shop-quickstart.md +234 -0
  722. package/bin/skills/lakehouse-doc-en/references/eco_integration/dbt.md +300 -0
  723. package/bin/skills/lakehouse-doc-en/references/eco_integration/rath.md +84 -0
  724. package/bin/skills/lakehouse-doc-en/references/eco_integration/sqlline.md +72 -0
  725. package/bin/skills/lakehouse-doc-en/references/eco_integration/sqlworkbench-j-lakehouse.md +52 -0
  726. package/bin/skills/lakehouse-doc-en/references/eco_integration/streamlit.md +104 -0
  727. package/bin/skills/lakehouse-doc-en/references/eco_integration/superset.md +98 -0
  728. package/bin/skills/lakehouse-doc-en/references/eco_integration/trino.md +73 -0
  729. package/bin/skills/lakehouse-doc-en/references/ecosystem-all.md +121 -0
  730. package/bin/skills/lakehouse-doc-en/references/encryption-function.md +1 -0
  731. package/bin/skills/lakehouse-doc-en/references/export_data_with_data-integration.md +3 -0
  732. package/bin/skills/lakehouse-doc-en/references/external-Volume.md +10 -0
  733. package/bin/skills/lakehouse-doc-en/references/external-catalog-concept.md +329 -0
  734. package/bin/skills/lakehouse-doc-en/references/external-catalog-summary.md +38 -0
  735. package/bin/skills/lakehouse-doc-en/references/external-catalog.md +1 -0
  736. package/bin/skills/lakehouse-doc-en/references/external-function-dev-guide-java.md +556 -0
  737. package/bin/skills/lakehouse-doc-en/references/external-function-summary.md +0 -0
  738. package/bin/skills/lakehouse-doc-en/references/external-hudi-table.md +188 -0
  739. package/bin/skills/lakehouse-doc-en/references/external-schema.md +100 -0
  740. package/bin/skills/lakehouse-doc-en/references/external-table-guide.md +101 -0
  741. package/bin/skills/lakehouse-doc-en/references/external_catalog_schema.md +182 -0
  742. package/bin/skills/lakehouse-doc-en/references/external_object_user_guide.md +298 -0
  743. package/bin/skills/lakehouse-doc-en/references/external_volume.md +199 -0
  744. package/bin/skills/lakehouse-doc-en/references/f6fc6447ee.md +161 -0
  745. package/bin/skills/lakehouse-doc-en/references/feature-engineering-with-zettapark.md +429 -0
  746. package/bin/skills/lakehouse-doc-en/references/federation-query.md +74 -0
  747. package/bin/skills/lakehouse-doc-en/references/file_functions.md +1 -0
  748. package/bin/skills/lakehouse-doc-en/references/filter-clause.md +76 -0
  749. package/bin/skills/lakehouse-doc-en/references/finebi-mysql.md +106 -0
  750. package/bin/skills/lakehouse-doc-en/references/flink-write-connector.md +646 -0
  751. package/bin/skills/lakehouse-doc-en/references/from_lakehouse_to_volume.md +109 -0
  752. package/bin/skills/lakehouse-doc-en/references/from_volume_to_table.md +45 -0
  753. package/bin/skills/lakehouse-doc-en/references/full-text-search.md +1 -0
  754. package/bin/skills/lakehouse-doc-en/references/fulltext_indexes_guide.md +1197 -0
  755. package/bin/skills/lakehouse-doc-en/references/function-overview.md +98 -0
  756. package/bin/skills/lakehouse-doc-en/references/functions.md +101 -0
  757. package/bin/skills/lakehouse-doc-en/references/funnel-analysis-with-window-functions.md +350 -0
  758. package/bin/skills/lakehouse-doc-en/references/generated-column.md +99 -0
  759. package/bin/skills/lakehouse-doc-en/references/generated_columns_guide.md +847 -0
  760. package/bin/skills/lakehouse-doc-en/references/geo.md +1 -0
  761. package/bin/skills/lakehouse-doc-en/references/geospatial_analysis.md +566 -0
  762. package/bin/skills/lakehouse-doc-en/references/get-started-with-sample-data.md +79 -0
  763. package/bin/skills/lakehouse-doc-en/references/getting_started_with_vcluster_for_processing_analytics.md +442 -0
  764. package/bin/skills/lakehouse-doc-en/references/gitfolder.md +57 -0
  765. package/bin/skills/lakehouse-doc-en/references/grant-privileges.md +140 -0
  766. package/bin/skills/lakehouse-doc-en/references/grant-to-share.md +54 -0
  767. package/bin/skills/lakehouse-doc-en/references/grant-user-privileges.md +112 -0
  768. package/bin/skills/lakehouse-doc-en/references/groupby.md +1258 -0
  769. package/bin/skills/lakehouse-doc-en/references/guides-overview-connecting.md +46 -0
  770. package/bin/skills/lakehouse-doc-en/references/hash.md +1 -0
  771. package/bin/skills/lakehouse-doc-en/references/high_order_function.md +1 -0
  772. package/bin/skills/lakehouse-doc-en/references/hive-connection.md +50 -0
  773. package/bin/skills/lakehouse-doc-en/references/hive-to-lakehouse-ecommerce.md +362 -0
  774. package/bin/skills/lakehouse-doc-en/references/ide.md +8 -0
  775. package/bin/skills/lakehouse-doc-en/references/identity-auth.md +8 -0
  776. package/bin/skills/lakehouse-doc-en/references/if_else_task.md +160 -0
  777. package/bin/skills/lakehouse-doc-en/references/ifnull.md +62 -0
  778. package/bin/skills/lakehouse-doc-en/references/ilike.md +83 -0
  779. package/bin/skills/lakehouse-doc-en/references/import-data-by-python-into-lakehouse.md +141 -0
  780. package/bin/skills/lakehouse-doc-en/references/import-feishu-data.md +389 -0
  781. package/bin/skills/lakehouse-doc-en/references/import_data_with_data-integration.md +3 -0
  782. package/bin/skills/lakehouse-doc-en/references/importdatabypythonintoLakehouse.md +134 -0
  783. package/bin/skills/lakehouse-doc-en/references/incremental-computing.md +482 -0
  784. package/bin/skills/lakehouse-doc-en/references/incremetal-compute.md +1 -0
  785. package/bin/skills/lakehouse-doc-en/references/index-overview.md +25 -0
  786. package/bin/skills/lakehouse-doc-en/references/index-partition-overview.md +137 -0
  787. package/bin/skills/lakehouse-doc-en/references/index-t.md +1 -0
  788. package/bin/skills/lakehouse-doc-en/references/index2.md +1 -0
  789. package/bin/skills/lakehouse-doc-en/references/index_guide.md +81 -0
  790. package/bin/skills/lakehouse-doc-en/references/indexs.md +1 -0
  791. package/bin/skills/lakehouse-doc-en/references/information_schema_guide.md +188 -0
  792. package/bin/skills/lakehouse-doc-en/references/ingesting-data-from-alibaba-cloud-datalake.md +977 -0
  793. package/bin/skills/lakehouse-doc-en/references/input_output.md +70 -0
  794. package/bin/skills/lakehouse-doc-en/references/instance-information-schema-summary.md +53 -0
  795. package/bin/skills/lakehouse-doc-en/references/instance-information-schema.md +344 -0
  796. package/bin/skills/lakehouse-doc-en/references/instance-information_schema.md +1 -0
  797. package/bin/skills/lakehouse-doc-en/references/internal_volume.md +213 -0
  798. package/bin/skills/lakehouse-doc-en/references/intro-supported-features.md +48 -0
  799. package/bin/skills/lakehouse-doc-en/references/inverted-index.md +350 -0
  800. package/bin/skills/lakehouse-doc-en/references/inverted_idx_bm25_param.md +251 -0
  801. package/bin/skills/lakehouse-doc-en/references/inverted_idx_multi-match.md +89 -0
  802. package/bin/skills/lakehouse-doc-en/references/ip_function.md +1 -0
  803. package/bin/skills/lakehouse-doc-en/references/is-null.md +53 -0
  804. package/bin/skills/lakehouse-doc-en/references/it-operation-management.md +1 -0
  805. package/bin/skills/lakehouse-doc-en/references/java_reference/bulkload-upload.md +164 -0
  806. package/bin/skills/lakehouse-doc-en/references/java_reference/client.md +48 -0
  807. package/bin/skills/lakehouse-doc-en/references/java_reference/java-sdk-release-notes.md +32 -0
  808. package/bin/skills/lakehouse-doc-en/references/java_reference/java-sdk-summary.md +187 -0
  809. package/bin/skills/lakehouse-doc-en/references/java_reference/jdbc.md +211 -0
  810. package/bin/skills/lakehouse-doc-en/references/java_reference/realtime-upload.md +288 -0
  811. package/bin/skills/lakehouse-doc-en/references/java_sdk_guide.md +35 -0
  812. package/bin/skills/lakehouse-doc-en/references/jdbc_task.md +37 -0
  813. package/bin/skills/lakehouse-doc-en/references/job-manage.md +57 -0
  814. package/bin/skills/lakehouse-doc-en/references/job_history_analysis_with_information_schema.md +674 -0
  815. package/bin/skills/lakehouse-doc-en/references/jobprofile-bestpractices.md +108 -0
  816. package/bin/skills/lakehouse-doc-en/references/json_analyze.md +424 -0
  817. package/bin/skills/lakehouse-doc-en/references/json_data_process_guide.md +881 -0
  818. package/bin/skills/lakehouse-doc-en/references/json_function.md +1 -0
  819. package/bin/skills/lakehouse-doc-en/references/json_guide_for_complex_biz_cases.md +1897 -0
  820. package/bin/skills/lakehouse-doc-en/references/kafka-connection.md +36 -0
  821. package/bin/skills/lakehouse-doc-en/references/kafka-external-table.md +93 -0
  822. package/bin/skills/lakehouse-doc-en/references/key-concepts.md +394 -0
  823. package/bin/skills/lakehouse-doc-en/references/lakehouse-acceleration-guide.md +136 -0
  824. package/bin/skills/lakehouse-doc-en/references/lakehouse-ai.md +1 -0
  825. package/bin/skills/lakehouse-doc-en/references/lakehouse-data-engineering-handbook.md +519 -0
  826. package/bin/skills/lakehouse-doc-en/references/lakehouse-index-best-practice.md +670 -0
  827. package/bin/skills/lakehouse-doc-en/references/lakehouse-insight.md +104 -0
  828. package/bin/skills/lakehouse-doc-en/references/lakehouse-llms-index-skill.md +209 -0
  829. package/bin/skills/lakehouse-doc-en/references/lakehouse-platform-release-note.md +1 -0
  830. package/bin/skills/lakehouse-doc-en/references/lakehouse-quick-experience_guide.md +980 -0
  831. package/bin/skills/lakehouse-doc-en/references/lakehouse-studio-101.md +1 -0
  832. package/bin/skills/lakehouse-doc-en/references/lakehouse-studio-concept.md +46 -0
  833. package/bin/skills/lakehouse-doc-en/references/lakehouse-studio-release-note.md +1 -0
  834. package/bin/skills/lakehouse-doc-en/references/lakehouse-table-stream-best-practices.md +607 -0
  835. package/bin/skills/lakehouse-doc-en/references/lakehouse-zilliz-make-data-ready-for-bi-and-ai.md +256 -0
  836. package/bin/skills/lakehouse-doc-en/references/lakehouse_billing_anomaly_alert_configuration_guide.md +294 -0
  837. package/bin/skills/lakehouse-doc-en/references/lakehouse_instance_overview.md +43 -0
  838. package/bin/skills/lakehouse-doc-en/references/lakehouse_studio.md +1 -0
  839. package/bin/skills/lakehouse-doc-en/references/lakehouse_table_design_guide.md +2674 -0
  840. package/bin/skills/lakehouse-doc-en/references/lance_guide.md +320 -0
  841. package/bin/skills/lakehouse-doc-en/references/langchain-plug-installation.md +276 -0
  842. package/bin/skills/lakehouse-doc-en/references/langchain-plug-quick-start.md +305 -0
  843. package/bin/skills/lakehouse-doc-en/references/langchain-plugins-overview.md +463 -0
  844. package/bin/skills/lakehouse-doc-en/references/langchain.md +95 -0
  845. package/bin/skills/lakehouse-doc-en/references/langchain_basic_samples.md +761 -0
  846. package/bin/skills/lakehouse-doc-en/references/langchain_integration.md +1 -0
  847. package/bin/skills/lakehouse-doc-en/references/left.md +43 -0
  848. package/bin/skills/lakehouse-doc-en/references/like.md +71 -0
  849. package/bin/skills/lakehouse-doc-en/references/list-partition.md +88 -0
  850. package/bin/skills/lakehouse-doc-en/references/llama-index.md +52 -0
  851. package/bin/skills/lakehouse-doc-en/references/load-data-local.md +85 -0
  852. package/bin/skills/lakehouse-doc-en/references/load-data-oss.md +175 -0
  853. package/bin/skills/lakehouse-doc-en/references/load-history-function.md +14 -0
  854. package/bin/skills/lakehouse-doc-en/references/load_history.md +17 -0
  855. package/bin/skills/lakehouse-doc-en/references/logging-in.md +68 -0
  856. package/bin/skills/lakehouse-doc-en/references/manage-accounts.md +187 -0
  857. package/bin/skills/lakehouse-doc-en/references/management.md +181 -0
  858. package/bin/skills/lakehouse-doc-en/references/management_guide.md +115 -0
  859. package/bin/skills/lakehouse-doc-en/references/managing-files-on-volume-with-zettapark.md +165 -0
  860. package/bin/skills/lakehouse-doc-en/references/managing-instance.md +68 -0
  861. package/bin/skills/lakehouse-doc-en/references/managing-security.md +1 -0
  862. package/bin/skills/lakehouse-doc-en/references/map-function.md +1 -0
  863. package/bin/skills/lakehouse-doc-en/references/mapjoin.md +98 -0
  864. package/bin/skills/lakehouse-doc-en/references/materialized_ddl.md +90 -0
  865. package/bin/skills/lakehouse-doc-en/references/math_function.md +1 -0
  866. package/bin/skills/lakehouse-doc-en/references/maxcompute-to-lakehouse-ecommerce.md +405 -0
  867. package/bin/skills/lakehouse-doc-en/references/medallion-lakehouse-from-scratch.md +720 -0
  868. package/bin/skills/lakehouse-doc-en/references/meta-objects-and-privileges.md +273 -0
  869. package/bin/skills/lakehouse-doc-en/references/metabase.md +59 -0
  870. package/bin/skills/lakehouse-doc-en/references/metadata_show_desc_command_guide.md +711 -0
  871. package/bin/skills/lakehouse-doc-en/references/metrics_answer_build.md +45 -0
  872. package/bin/skills/lakehouse-doc-en/references/migrate-snowflake-realtime-etl-to-lakehouse.md +862 -0
  873. package/bin/skills/lakehouse-doc-en/references/migrate-spark-data-engineering-best-practices-to-lakehouse.md +293 -0
  874. package/bin/skills/lakehouse-doc-en/references/migration-spark-sql.md +610 -0
  875. package/bin/skills/lakehouse-doc-en/references/migration-sql-compatibility.md +184 -0
  876. package/bin/skills/lakehouse-doc-en/references/mindsdb.md +282 -0
  877. package/bin/skills/lakehouse-doc-en/references/modelprice.md +155 -0
  878. package/bin/skills/lakehouse-doc-en/references/modern-data-stack-with-ecosystem-tools.md +1 -0
  879. package/bin/skills/lakehouse-doc-en/references/monitor_and_alert.md +177 -0
  880. package/bin/skills/lakehouse-doc-en/references/monitoring_and_alerting.md +177 -0
  881. package/bin/skills/lakehouse-doc-en/references/monitoring_item_specification.md +44 -0
  882. package/bin/skills/lakehouse-doc-en/references/multi_cloud_instance_manage_with_mcp_server.md +284 -0
  883. package/bin/skills/lakehouse-doc-en/references/multitable_batch_sync.md +465 -0
  884. package/bin/skills/lakehouse-doc-en/references/multitable_realtime_sync.md +401 -0
  885. package/bin/skills/lakehouse-doc-en/references/multitable_realtime_sync_sop.md +588 -0
  886. package/bin/skills/lakehouse-doc-en/references/n8n_Integreated_with_lakehouse_mcp_server.md +538 -0
  887. package/bin/skills/lakehouse-doc-en/references/navicat-mysql.md +67 -0
  888. package/bin/skills/lakehouse-doc-en/references/network_policy.md +286 -0
  889. package/bin/skills/lakehouse-doc-en/references/notes-and-guidelines-for-partition-tables.md +1724 -0
  890. package/bin/skills/lakehouse-doc-en/references/number_type_guide.md +56 -0
  891. package/bin/skills/lakehouse-doc-en/references/nyc_green_taxi_data_clean_transform_with_mcp_server.md +315 -0
  892. package/bin/skills/lakehouse-doc-en/references/object-model-overview.md +69 -0
  893. package/bin/skills/lakehouse-doc-en/references/object_identifier.md +270 -0
  894. package/bin/skills/lakehouse-doc-en/references/object_model_design.md +244 -0
  895. package/bin/skills/lakehouse-doc-en/references/om-api-connection.md +35 -0
  896. package/bin/skills/lakehouse-doc-en/references/om-bloomfilter.md +39 -0
  897. package/bin/skills/lakehouse-doc-en/references/om-catalog-connection.md +36 -0
  898. package/bin/skills/lakehouse-doc-en/references/om-data-sharing.md +39 -0
  899. package/bin/skills/lakehouse-doc-en/references/om-dynamic-mask.md +40 -0
  900. package/bin/skills/lakehouse-doc-en/references/om-dynamic-table.md +223 -0
  901. package/bin/skills/lakehouse-doc-en/references/om-external-function.md +49 -0
  902. package/bin/skills/lakehouse-doc-en/references/om-external-volume.md +57 -0
  903. package/bin/skills/lakehouse-doc-en/references/om-internal-volume.md +91 -0
  904. package/bin/skills/lakehouse-doc-en/references/om-inverted-index.md +38 -0
  905. package/bin/skills/lakehouse-doc-en/references/om-named-volume.md +75 -0
  906. package/bin/skills/lakehouse-doc-en/references/om-network-policy.md +33 -0
  907. package/bin/skills/lakehouse-doc-en/references/om-pipe.md +217 -0
  908. package/bin/skills/lakehouse-doc-en/references/om-roles.md +46 -0
  909. package/bin/skills/lakehouse-doc-en/references/om-row-level-permission.md +38 -0
  910. package/bin/skills/lakehouse-doc-en/references/om-schema.md +44 -0
  911. package/bin/skills/lakehouse-doc-en/references/om-semantic-view.md +37 -0
  912. package/bin/skills/lakehouse-doc-en/references/om-storage-connection.md +47 -0
  913. package/bin/skills/lakehouse-doc-en/references/om-synonym.md +119 -0
  914. package/bin/skills/lakehouse-doc-en/references/om-table-stream.md +196 -0
  915. package/bin/skills/lakehouse-doc-en/references/om-table.md +193 -0
  916. package/bin/skills/lakehouse-doc-en/references/om-time-travel.md +46 -0
  917. package/bin/skills/lakehouse-doc-en/references/om-user-management.md +42 -0
  918. package/bin/skills/lakehouse-doc-en/references/om-vcluster.md +101 -0
  919. package/bin/skills/lakehouse-doc-en/references/om-view.md +199 -0
  920. package/bin/skills/lakehouse-doc-en/references/om-workspace.md +39 -0
  921. package/bin/skills/lakehouse-doc-en/references/opensource/travel.md +126 -0
  922. package/bin/skills/lakehouse-doc-en/references/operation-maintenance.md +149 -0
  923. package/bin/skills/lakehouse-doc-en/references/optimizing-computing-resources.md +1 -0
  924. package/bin/skills/lakehouse-doc-en/references/org-hierarchy.md +56 -0
  925. package/bin/skills/lakehouse-doc-en/references/oss_volume_creation.md +39 -0
  926. package/bin/skills/lakehouse-doc-en/references/overview.md +187 -0
  927. package/bin/skills/lakehouse-doc-en/references/pandas-to-zettapark-migration-retail.md +262 -0
  928. package/bin/skills/lakehouse-doc-en/references/partition-func.md +1 -0
  929. package/bin/skills/lakehouse-doc-en/references/partition_table.md +324 -0
  930. package/bin/skills/lakehouse-doc-en/references/partition_table_guide.md +319 -0
  931. package/bin/skills/lakehouse-doc-en/references/performance_optimization.md +34 -0
  932. package/bin/skills/lakehouse-doc-en/references/performence_test.md +1 -0
  933. package/bin/skills/lakehouse-doc-en/references/permission-application.md +43 -0
  934. package/bin/skills/lakehouse-doc-en/references/permissions-of-built-in-workspace-level-roles.md +131 -0
  935. package/bin/skills/lakehouse-doc-en/references/pipe-introduction.md +97 -0
  936. package/bin/skills/lakehouse-doc-en/references/pipe-kafka-bestpractice-1.md +431 -0
  937. package/bin/skills/lakehouse-doc-en/references/pipe-kafka-table-stream.md +182 -0
  938. package/bin/skills/lakehouse-doc-en/references/pipe-kafka.md +217 -0
  939. package/bin/skills/lakehouse-doc-en/references/pipe-overview.md +167 -0
  940. package/bin/skills/lakehouse-doc-en/references/pipe-storage-object.md +263 -0
  941. package/bin/skills/lakehouse-doc-en/references/pipe-summary.md +107 -0
  942. package/bin/skills/lakehouse-doc-en/references/pipe-syntax.md +183 -0
  943. package/bin/skills/lakehouse-doc-en/references/pipe.md +1 -0
  944. package/bin/skills/lakehouse-doc-en/references/pow.md +26 -0
  945. package/bin/skills/lakehouse-doc-en/references/power_bi.md +117 -0
  946. package/bin/skills/lakehouse-doc-en/references/practice_data_analysis.md +1 -0
  947. package/bin/skills/lakehouse-doc-en/references/practice_data_import_and_export.md +1 -0
  948. package/bin/skills/lakehouse-doc-en/references/practice_python_task.md +203 -0
  949. package/bin/skills/lakehouse-doc-en/references/preparetion.md +39 -0
  950. package/bin/skills/lakehouse-doc-en/references/pricing.md +314 -0
  951. package/bin/skills/lakehouse-doc-en/references/primary key.md +82 -0
  952. package/bin/skills/lakehouse-doc-en/references/primary-key.md +204 -0
  953. package/bin/skills/lakehouse-doc-en/references/privacy-policy.md +113 -0
  954. package/bin/skills/lakehouse-doc-en/references/private-link-general.md +66 -0
  955. package/bin/skills/lakehouse-doc-en/references/private_link.md +1 -0
  956. package/bin/skills/lakehouse-doc-en/references/private_storage_overview.md +165 -0
  957. package/bin/skills/lakehouse-doc-en/references/product-trial-agreement.md +93 -0
  958. package/bin/skills/lakehouse-doc-en/references/product_concept.md +1 -0
  959. package/bin/skills/lakehouse-doc-en/references/put-get.md +1 -0
  960. package/bin/skills/lakehouse-doc-en/references/put_get_volume.md +3 -0
  961. package/bin/skills/lakehouse-doc-en/references/pyspark-to-zettapark-migration-f1.md +512 -0
  962. package/bin/skills/lakehouse-doc-en/references/python-igs.md +313 -0
  963. package/bin/skills/lakehouse-doc-en/references/python-sample-put-gharchive2oss.md +186 -0
  964. package/bin/skills/lakehouse-doc-en/references/python-sample-put-github-rt-events.md +348 -0
  965. package/bin/skills/lakehouse-doc-en/references/python-sdk-version-history.md +24 -0
  966. package/bin/skills/lakehouse-doc-en/references/python-sqlalchemy-version-history.md +23 -0
  967. package/bin/skills/lakehouse-doc-en/references/python-task-dev.md +1 -0
  968. package/bin/skills/lakehouse-doc-en/references/python-task.md +28 -0
  969. package/bin/skills/lakehouse-doc-en/references/python_package_install_import_guide.md +53 -0
  970. package/bin/skills/lakehouse-doc-en/references/python_reference/bulkload-upload.md +143 -0
  971. package/bin/skills/lakehouse-doc-en/references/python_reference/connector.md +248 -0
  972. package/bin/skills/lakehouse-doc-en/references/python_reference/connector_advanced.md +258 -0
  973. package/bin/skills/lakehouse-doc-en/references/python_reference/connector_examples.md +241 -0
  974. package/bin/skills/lakehouse-doc-en/references/python_reference/python-sdk-summary.md +88 -0
  975. package/bin/skills/lakehouse-doc-en/references/python_reference/sqlalchemy.md +145 -0
  976. package/bin/skills/lakehouse-doc-en/references/python_sdk_guide.md +40 -0
  977. package/bin/skills/lakehouse-doc-en/references/python_shell_datasource.md +356 -0
  978. package/bin/skills/lakehouse-doc-en/references/query-json-sy.md +80 -0
  979. package/bin/skills/lakehouse-doc-en/references/query-snowflake-open-catalog-iceberg-table.md +114 -0
  980. package/bin/skills/lakehouse-doc-en/references/query-syntax.md +294 -0
  981. package/bin/skills/lakehouse-doc-en/references/quick_start_batch_sync_data.md +116 -0
  982. package/bin/skills/lakehouse-doc-en/references/quick_start_bi_analysis.md +588 -0
  983. package/bin/skills/lakehouse-doc-en/references/quick_start_create_workspace.md +58 -0
  984. package/bin/skills/lakehouse-doc-en/references/quick_start_data_quality.md +75 -0
  985. package/bin/skills/lakehouse-doc-en/references/quick_start_etl.md +131 -0
  986. package/bin/skills/lakehouse-doc-en/references/quick_start_monitoring_and_alerting.md +93 -0
  987. package/bin/skills/lakehouse-doc-en/references/quick_start_sql_query.md +93 -0
  988. package/bin/skills/lakehouse-doc-en/references/quick_start_upload_data.md +69 -0
  989. package/bin/skills/lakehouse-doc-en/references/quick_start_user_management.md +72 -0
  990. package/bin/skills/lakehouse-doc-en/references/quick_start_workspace.md +72 -0
  991. package/bin/skills/lakehouse-doc-en/references/quick_start_workspace_user.md +67 -0
  992. package/bin/skills/lakehouse-doc-en/references/quickstart-with-copy-command.md +79 -0
  993. package/bin/skills/lakehouse-doc-en/references/quickstart.md +144 -0
  994. package/bin/skills/lakehouse-doc-en/references/quickstart_bak.md +592 -0
  995. package/bin/skills/lakehouse-doc-en/references/quickstart_datashare_between_companies.md +247 -0
  996. package/bin/skills/lakehouse-doc-en/references/quickstart_envirment_for_team.md +269 -0
  997. package/bin/skills/lakehouse-doc-en/references/quickstart_local_csv.md +90 -0
  998. package/bin/skills/lakehouse-doc-en/references/rdd-to-zettapark-migration-weblog.md +512 -0
  999. package/bin/skills/lakehouse-doc-en/references/realtime-pipeline-selection-guide.md +348 -0
  1000. package/bin/skills/lakehouse-doc-en/references/realtime-sales-dashboard-with-dynamic-table.md +317 -0
  1001. package/bin/skills/lakehouse-doc-en/references/realtime_sync.md +30 -0
  1002. package/bin/skills/lakehouse-doc-en/references/realtime_sync_and_analysis_practice.md +183 -0
  1003. package/bin/skills/lakehouse-doc-en/references/realtimesync_m.md +189 -0
  1004. package/bin/skills/lakehouse-doc-en/references/refactor-elt-practice.md +241 -0
  1005. package/bin/skills/lakehouse-doc-en/references/refresh-dynamic-table.md +35 -0
  1006. package/bin/skills/lakehouse-doc-en/references/refresh-history.md +54 -0
  1007. package/bin/skills/lakehouse-doc-en/references/refresh-materialized-view.md +35 -0
  1008. package/bin/skills/lakehouse-doc-en/references/regexp-function.md +1 -0
  1009. package/bin/skills/lakehouse-doc-en/references/regexp-statement.md +75 -0
  1010. package/bin/skills/lakehouse-doc-en/references/register_account.md +77 -0
  1011. package/bin/skills/lakehouse-doc-en/references/release-note-0414-1.1.md +56 -0
  1012. package/bin/skills/lakehouse-doc-en/references/release-note-2026-05-19.md +64 -0
  1013. package/bin/skills/lakehouse-doc-en/references/release-notes.md +1 -0
  1014. package/bin/skills/lakehouse-doc-en/references/releasenotes.md +1 -0
  1015. package/bin/skills/lakehouse-doc-en/references/releasenotesupdata.md +1 -0
  1016. package/bin/skills/lakehouse-doc-en/references/remove-volume.md +73 -0
  1017. package/bin/skills/lakehouse-doc-en/references/restore-dynamic-table.md +118 -0
  1018. package/bin/skills/lakehouse-doc-en/references/restore.md +118 -0
  1019. package/bin/skills/lakehouse-doc-en/references/result_cache.md +102 -0
  1020. package/bin/skills/lakehouse-doc-en/references/revoke-from-share.md +71 -0
  1021. package/bin/skills/lakehouse-doc-en/references/revoke-privileges.md +108 -0
  1022. package/bin/skills/lakehouse-doc-en/references/revoke-user-privileges.md +107 -0
  1023. package/bin/skills/lakehouse-doc-en/references/right.md +74 -0
  1024. package/bin/skills/lakehouse-doc-en/references/rlike.md +69 -0
  1025. package/bin/skills/lakehouse-doc-en/references/role-privilege-manage.md +80 -0
  1026. package/bin/skills/lakehouse-doc-en/references/role-privlilige-manage.md +1 -0
  1027. package/bin/skills/lakehouse-doc-en/references/roles.md +123 -0
  1028. package/bin/skills/lakehouse-doc-en/references/rom_lakehouse_to_volume.md +1 -0
  1029. package/bin/skills/lakehouse-doc-en/references/row_level_permission.md +41 -0
  1030. package/bin/skills/lakehouse-doc-en/references/rrf-fulltext-vector-hybrid-search-best-practices.md +233 -0
  1031. package/bin/skills/lakehouse-doc-en/references/s3_volume_creation.md +37 -0
  1032. package/bin/skills/lakehouse-doc-en/references/sample-data-using.md +768 -0
  1033. package/bin/skills/lakehouse-doc-en/references/scd2-product-dimension-with-merge-into.md +314 -0
  1034. package/bin/skills/lakehouse-doc-en/references/scheduled_task.md +54 -0
  1035. package/bin/skills/lakehouse-doc-en/references/search-functions.md +1 -0
  1036. package/bin/skills/lakehouse-doc-en/references/security-policy-overview.md +112 -0
  1037. package/bin/skills/lakehouse-doc-en/references/security-system-inventory-based-information-schema.md +411 -0
  1038. package/bin/skills/lakehouse-doc-en/references/security_compliance_audit_dir_guide.md +1 -0
  1039. package/bin/skills/lakehouse-doc-en/references/security_compliance_audit_guide.md +393 -0
  1040. package/bin/skills/lakehouse-doc-en/references/security_overview.md +41 -0
  1041. package/bin/skills/lakehouse-doc-en/references/select-catalog-table.md +48 -0
  1042. package/bin/skills/lakehouse-doc-en/references/semantic-view-advanced.md +124 -0
  1043. package/bin/skills/lakehouse-doc-en/references/semantic-view-agent-guide.md +474 -0
  1044. package/bin/skills/lakehouse-doc-en/references/semantic-view-ai.md +184 -0
  1045. package/bin/skills/lakehouse-doc-en/references/semantic-view-best-practices.md +163 -0
  1046. package/bin/skills/lakehouse-doc-en/references/semantic-view-create.md +202 -0
  1047. package/bin/skills/lakehouse-doc-en/references/semantic-view-manage.md +143 -0
  1048. package/bin/skills/lakehouse-doc-en/references/semantic-view-overview.md +42 -0
  1049. package/bin/skills/lakehouse-doc-en/references/semantic-view-query.md +194 -0
  1050. package/bin/skills/lakehouse-doc-en/references/semantic_view.md +891 -0
  1051. package/bin/skills/lakehouse-doc-en/references/server-data-for-ai.md +134 -0
  1052. package/bin/skills/lakehouse-doc-en/references/service-aggrement.md +299 -0
  1053. package/bin/skills/lakehouse-doc-en/references/set-command.md +200 -0
  1054. package/bin/skills/lakehouse-doc-en/references/set-operations.md +199 -0
  1055. package/bin/skills/lakehouse-doc-en/references/set-properties.md +119 -0
  1056. package/bin/skills/lakehouse-doc-en/references/setup.md +38 -0
  1057. package/bin/skills/lakehouse-doc-en/references/setup_cz_cli.md +495 -0
  1058. package/bin/skills/lakehouse-doc-en/references/share-ddl.md +53 -0
  1059. package/bin/skills/lakehouse-doc-en/references/show-cached-status.md +29 -0
  1060. package/bin/skills/lakehouse-doc-en/references/show-catalog-schema.md +43 -0
  1061. package/bin/skills/lakehouse-doc-en/references/show-catalog-table.md +51 -0
  1062. package/bin/skills/lakehouse-doc-en/references/show-catalog.md +43 -0
  1063. package/bin/skills/lakehouse-doc-en/references/show-columns.md +78 -0
  1064. package/bin/skills/lakehouse-doc-en/references/show-connections.md +42 -0
  1065. package/bin/skills/lakehouse-doc-en/references/show-create-dynamic-table.md +37 -0
  1066. package/bin/skills/lakehouse-doc-en/references/show-create-external-table.md +105 -0
  1067. package/bin/skills/lakehouse-doc-en/references/show-create-materialized-view.md +35 -0
  1068. package/bin/skills/lakehouse-doc-en/references/show-create-pipe.md +15 -0
  1069. package/bin/skills/lakehouse-doc-en/references/show-create-table.md +93 -0
  1070. package/bin/skills/lakehouse-doc-en/references/show-dynamic-table.md +60 -0
  1071. package/bin/skills/lakehouse-doc-en/references/show-external-functions.md +36 -0
  1072. package/bin/skills/lakehouse-doc-en/references/show-external-schemas.md +108 -0
  1073. package/bin/skills/lakehouse-doc-en/references/show-external-table.md +72 -0
  1074. package/bin/skills/lakehouse-doc-en/references/show-finctions.md +21 -0
  1075. package/bin/skills/lakehouse-doc-en/references/show-functions.md +38 -0
  1076. package/bin/skills/lakehouse-doc-en/references/show-grants-user.md +62 -0
  1077. package/bin/skills/lakehouse-doc-en/references/show-grants.md +129 -0
  1078. package/bin/skills/lakehouse-doc-en/references/show-jobs.md +87 -0
  1079. package/bin/skills/lakehouse-doc-en/references/show-materialized-view.md +81 -0
  1080. package/bin/skills/lakehouse-doc-en/references/show-network-policy.md +40 -0
  1081. package/bin/skills/lakehouse-doc-en/references/show-pipes.md +29 -0
  1082. package/bin/skills/lakehouse-doc-en/references/show-roles.md +122 -0
  1083. package/bin/skills/lakehouse-doc-en/references/show-schemas.md +41 -0
  1084. package/bin/skills/lakehouse-doc-en/references/show-shares.md +95 -0
  1085. package/bin/skills/lakehouse-doc-en/references/show-storage-connections.md +41 -0
  1086. package/bin/skills/lakehouse-doc-en/references/show-synonyms.md +23 -0
  1087. package/bin/skills/lakehouse-doc-en/references/show-table-streams.md +42 -0
  1088. package/bin/skills/lakehouse-doc-en/references/show-tables-history.md +81 -0
  1089. package/bin/skills/lakehouse-doc-en/references/show-tables.md +76 -0
  1090. package/bin/skills/lakehouse-doc-en/references/show-users.md +39 -0
  1091. package/bin/skills/lakehouse-doc-en/references/show-vclusters.md +59 -0
  1092. package/bin/skills/lakehouse-doc-en/references/show-views.md +78 -0
  1093. package/bin/skills/lakehouse-doc-en/references/show-volume.md +114 -0
  1094. package/bin/skills/lakehouse-doc-en/references/show-workspaces.md +48 -0
  1095. package/bin/skills/lakehouse-doc-en/references/show.md +105 -0
  1096. package/bin/skills/lakehouse-doc-en/references/simpletosimple_bazhuayu_datagpt.md +150 -0
  1097. package/bin/skills/lakehouse-doc-en/references/singdata_overview.md +1 -0
  1098. package/bin/skills/lakehouse-doc-en/references/slowly-changing-dimensions-with-streams-and-tasks.md +640 -0
  1099. package/bin/skills/lakehouse-doc-en/references/small_file_optimization.md +120 -0
  1100. package/bin/skills/lakehouse-doc-en/references/snowflake-dynamic-tables-to-lakehouse.md +268 -0
  1101. package/bin/skills/lakehouse-doc-en/references/snowflake-snowpark-to-zettapark-migration.md +390 -0
  1102. package/bin/skills/lakehouse-doc-en/references/spark-connector-summary.md +307 -0
  1103. package/bin/skills/lakehouse-doc-en/references/spark-connector-use.md +225 -0
  1104. package/bin/skills/lakehouse-doc-en/references/spark-connector.md +1 -0
  1105. package/bin/skills/lakehouse-doc-en/references/spark-lakehouse-iceberg-rest.md +180 -0
  1106. package/bin/skills/lakehouse-doc-en/references/spark-migration-guide.md +158 -0
  1107. package/bin/skills/lakehouse-doc-en/references/sql-commands.md +77 -0
  1108. package/bin/skills/lakehouse-doc-en/references/sql-comments.md +31 -0
  1109. package/bin/skills/lakehouse-doc-en/references/sql-function-guide.md +234 -0
  1110. package/bin/skills/lakehouse-doc-en/references/sql-parmaters.md +430 -0
  1111. package/bin/skills/lakehouse-doc-en/references/sql-qualify.md +286 -0
  1112. package/bin/skills/lakehouse-doc-en/references/sql-query-overview.md +39 -0
  1113. package/bin/skills/lakehouse-doc-en/references/sql-reference.md +79 -0
  1114. package/bin/skills/lakehouse-doc-en/references/sql_data_transfom_NestedDataTypes.md +452 -0
  1115. package/bin/skills/lakehouse-doc-en/references/sql_data_transform.md +114 -0
  1116. package/bin/skills/lakehouse-doc-en/references/sql_data_transform_basic.md +572 -0
  1117. package/bin/skills/lakehouse-doc-en/references/sql_data_transform_cte.md +166 -0
  1118. package/bin/skills/lakehouse-doc-en/references/sql_data_transform_tips.md +405 -0
  1119. package/bin/skills/lakehouse-doc-en/references/sql_data_transform_windows.md +430 -0
  1120. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/any_value.md +66 -0
  1121. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/approx_count_distinct.md +41 -0
  1122. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/approx_histogram.md +50 -0
  1123. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/approx_percentile.md +56 -0
  1124. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/approx_top_k.md +54 -0
  1125. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/array_agg.md +50 -0
  1126. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/avg.md +68 -0
  1127. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/bit_and.md +60 -0
  1128. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/bit_or.md +64 -0
  1129. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/bit_xor.md +60 -0
  1130. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/bitmap_agg.md +30 -0
  1131. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/bitmap_union.md +41 -0
  1132. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/bitmap_union_count.md +40 -0
  1133. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/bool_and.md +66 -0
  1134. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/bool_or.md +68 -0
  1135. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/collect_list.md +58 -0
  1136. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/collect_list_on_array.md +82 -0
  1137. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/collect_set.md +85 -0
  1138. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/collect_set_on_array.md +57 -0
  1139. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/corr.md +103 -0
  1140. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/count.md +96 -0
  1141. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/count_distinct.md +96 -0
  1142. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/count_if.md +57 -0
  1143. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/covar_pop.md +89 -0
  1144. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/covar_samp.md +113 -0
  1145. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/first_value.md +122 -0
  1146. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group-array.md +90 -0
  1147. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/groupArray.md +90 -0
  1148. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap.md +56 -0
  1149. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_and.md +59 -0
  1150. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_and_state.md +71 -0
  1151. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_merge.md +56 -0
  1152. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_merge_state.md +28 -0
  1153. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_or.md +56 -0
  1154. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_or_state.md +54 -0
  1155. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_state.md +46 -0
  1156. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_xor.md +60 -0
  1157. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_bitmap_xor_state.md +89 -0
  1158. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/group_concat.md +74 -0
  1159. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/json_array_agg.md +49 -0
  1160. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/json_merge_agg.md +31 -0
  1161. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/json_object_agg.md +38 -0
  1162. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/last_value.md +100 -0
  1163. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/map_agg.md +79 -0
  1164. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/max.md +70 -0
  1165. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/max_by.md +46 -0
  1166. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/median.md +75 -0
  1167. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/min.md +58 -0
  1168. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/min_by.md +68 -0
  1169. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/percentile.md +51 -0
  1170. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/percentile_approx.md +71 -0
  1171. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/percentile_rank.md +33 -0
  1172. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/std.md +72 -0
  1173. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/stddev.md +67 -0
  1174. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/stddev_pop.md +67 -0
  1175. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/stddev_samp.md +67 -0
  1176. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/sum.md +67 -0
  1177. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/topn.md +42 -0
  1178. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/topn_array.md +42 -0
  1179. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/var_pop.md +65 -0
  1180. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/var_samp.md +65 -0
  1181. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/variance.md +72 -0
  1182. package/bin/skills/lakehouse-doc-en/references/sql_functions/aggregate_functions/wm_concat.md +72 -0
  1183. package/bin/skills/lakehouse-doc-en/references/sql_functions/context_functions/current_instance_id.md +17 -0
  1184. package/bin/skills/lakehouse-doc-en/references/sql_functions/context_functions/current_schema.md +29 -0
  1185. package/bin/skills/lakehouse-doc-en/references/sql_functions/context_functions/current_session_id.md +30 -0
  1186. package/bin/skills/lakehouse-doc-en/references/sql_functions/context_functions/current_user.md +24 -0
  1187. package/bin/skills/lakehouse-doc-en/references/sql_functions/context_functions/current_user_id.md +22 -0
  1188. package/bin/skills/lakehouse-doc-en/references/sql_functions/context_functions/current_vcluster.md +38 -0
  1189. package/bin/skills/lakehouse-doc-en/references/sql_functions/context_functions/current_workspace.md +19 -0
  1190. package/bin/skills/lakehouse-doc-en/references/sql_functions/context_functions/current_workspace_id.md +25 -0
  1191. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/binary_to_bitmap.md +25 -0
  1192. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_and.md +35 -0
  1193. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_and_cardinality.md +48 -0
  1194. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_andnot.md +60 -0
  1195. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_andnot_cardinality.md +58 -0
  1196. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_build.md +53 -0
  1197. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_cardinality.md +49 -0
  1198. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_contains.md +45 -0
  1199. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_count.md +45 -0
  1200. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_empty.md +52 -0
  1201. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_has_all.md +48 -0
  1202. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_has_any.md +63 -0
  1203. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_hash.md +52 -0
  1204. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_hash64.md +62 -0
  1205. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_max.md +25 -0
  1206. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_min.md +28 -0
  1207. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_or.md +58 -0
  1208. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_or_cardinality.md +22 -0
  1209. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_remove.md +42 -0
  1210. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_subset_in_range.md +48 -0
  1211. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_subset_limit.md +33 -0
  1212. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_to_array.md +23 -0
  1213. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_to_binary.md +23 -0
  1214. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_to_rb32.md +62 -0
  1215. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_to_rb64.md +62 -0
  1216. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_to_string.md +41 -0
  1217. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_transform.md +40 -0
  1218. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_xor.md +53 -0
  1219. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/bitmap_xor_cardinality.md +29 -0
  1220. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/rb32_to_bitmap.md +63 -0
  1221. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/rb64_to_bitmap.md +63 -0
  1222. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/string_to_bitmap.md +56 -0
  1223. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/sub_bitmap.md +33 -0
  1224. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitmap_functions/to_bitmap.md +78 -0
  1225. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitwise_functions/bit_count.md +38 -0
  1226. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitwise_functions/bitnot.md +30 -0
  1227. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitwise_functions/shiftleft.md +85 -0
  1228. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitwise_functions/shiftright.md +88 -0
  1229. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/bitwise_functions/shiftrightunsigned.md +88 -0
  1230. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/assert_true.md +74 -0
  1231. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/between.md +51 -0
  1232. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/case_when.md +87 -0
  1233. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/coalesce.md +43 -0
  1234. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/decode.md +45 -0
  1235. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/if.md +53 -0
  1236. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/ifnull.md +72 -0
  1237. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/in.md +32 -0
  1238. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/is-null.md +59 -0
  1239. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/is_false.md +63 -0
  1240. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/is_not_null.md +36 -0
  1241. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/is_null.md +32 -0
  1242. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/is_true.md +63 -0
  1243. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/multiif.md +48 -0
  1244. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/nullif.md +44 -0
  1245. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/nvl.md +33 -0
  1246. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/nvl2.md +41 -0
  1247. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/raise_error.md +52 -0
  1248. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/conditional_functions/typeof.md +70 -0
  1249. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/add_days.md +61 -0
  1250. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/add_months.md +38 -0
  1251. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/add_years.md +53 -0
  1252. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/adddate.md +53 -0
  1253. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/convert_timezone.md +45 -0
  1254. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/current_date.md +49 -0
  1255. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/current_timestamp.md +44 -0
  1256. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/date.md +39 -0
  1257. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/date_add.md +35 -0
  1258. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/date_format.md +40 -0
  1259. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/date_format_mysql.md +56 -0
  1260. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/date_format_pg.md +54 -0
  1261. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/date_sub.md +50 -0
  1262. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/date_trunc.md +58 -0
  1263. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/dateadd.md +59 -0
  1264. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/datediff.md +48 -0
  1265. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/datetime_patterns.md +60 -0
  1266. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/day.md +47 -0
  1267. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/dayofmonth.md +36 -0
  1268. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/dayofweek.md +30 -0
  1269. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/dayofweek_iso.md +45 -0
  1270. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/dayofyear.md +40 -0
  1271. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/days.md +25 -0
  1272. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/days_diff.md +39 -0
  1273. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/extract.md +47 -0
  1274. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/from_unixtime.md +42 -0
  1275. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/from_utc_timestamp.md +40 -0
  1276. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/hour.md +46 -0
  1277. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/hours.md +25 -0
  1278. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/hours_diff.md +39 -0
  1279. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/last_day.md +35 -0
  1280. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/localtimestamp.md +17 -0
  1281. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/makde_date.md +21 -0
  1282. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/make_date.md +20 -0
  1283. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/make_dt_interval.md +55 -0
  1284. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/make_ym_interval.md +49 -0
  1285. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/milliseconds_diff.md +39 -0
  1286. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/minute.md +38 -0
  1287. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/minutes_diff.md +39 -0
  1288. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/month.md +42 -0
  1289. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/months.md +25 -0
  1290. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/months_between.md +50 -0
  1291. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/months_diff.md +39 -0
  1292. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/next_day.md +51 -0
  1293. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/now.md +43 -0
  1294. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/quarter.md +44 -0
  1295. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/second.md +34 -0
  1296. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/seconds_diff.md +39 -0
  1297. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/str_to_date_mysql.md +37 -0
  1298. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/sub_days.md +58 -0
  1299. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/subdate.md +53 -0
  1300. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/timestamp_micros.md +32 -0
  1301. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/timestamp_millis.md +45 -0
  1302. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/timestamp_seconds.md +54 -0
  1303. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/timestampadd.md +59 -0
  1304. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/timestampdiff.md +67 -0
  1305. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/to_date.md +39 -0
  1306. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/to_start_of_interval.md +28 -0
  1307. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/to_timestamp.md +59 -0
  1308. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/to_timestamp_ntz.md +46 -0
  1309. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/to_unix_timestamp.md +49 -0
  1310. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/to_unix_timestamp_ms.md +45 -0
  1311. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/to_unix_timestamp_us.md +37 -0
  1312. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/to_utc_timestamp.md +56 -0
  1313. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/toyyyymmdd.md +62 -0
  1314. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/trunc.md +38 -0
  1315. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/unix_timestamp.md +86 -0
  1316. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/week.md +29 -0
  1317. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/weekday.md +48 -0
  1318. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/weekofyear.md +26 -0
  1319. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/weeks_diff.md +39 -0
  1320. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/year.md +35 -0
  1321. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/yearofweek.md +54 -0
  1322. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/years.md +25 -0
  1323. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/datetime_functions/years_diff.md +39 -0
  1324. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/file_functions/GET_PRESIGNED_URL.md +91 -0
  1325. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/geo_functions/st_geohash.md +33 -0
  1326. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/geo_functions/st_latfromgeohash.md +53 -0
  1327. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/geo_functions/st_longfromgeohash.md +52 -0
  1328. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/bucket.md +33 -0
  1329. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/crc32.md +60 -0
  1330. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/general_complexhash2.md +33 -0
  1331. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/general_hash.md +47 -0
  1332. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/general_murmurhash3.md +33 -0
  1333. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/hash_combine.md +35 -0
  1334. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/hash_combine_commutative.md +56 -0
  1335. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/md5.md +69 -0
  1336. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/murmurhash.md +33 -0
  1337. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/murmurhash3_32.md +33 -0
  1338. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/murmurhash3_64.md +33 -0
  1339. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/sha1.md +67 -0
  1340. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/sha2.md +106 -0
  1341. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/sha224.md +59 -0
  1342. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/sha256.md +59 -0
  1343. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/sha384.md +59 -0
  1344. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/sha512.md +59 -0
  1345. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/hash_functions/sm3.md +60 -0
  1346. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/array_aggregate.md +88 -0
  1347. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/array_filter_doris.md +78 -0
  1348. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/array_map_doris.md +39 -0
  1349. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/array_sort_by_key.md +40 -0
  1350. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/element_at.md +59 -0
  1351. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/exists.md +60 -0
  1352. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/filter.md +39 -0
  1353. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/forall.md +62 -0
  1354. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/high_order_functions.md +34 -0
  1355. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/map_filter.md +33 -0
  1356. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/map_zip_with.md +43 -0
  1357. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/transform.md +43 -0
  1358. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/transform_keys.md +42 -0
  1359. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/transform_values.md +34 -0
  1360. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/high_order_functions/zip_with.md +45 -0
  1361. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/ip_functions/get_ip_info.md +197 -0
  1362. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/ip_functions/ipv4_num_to_string.md +27 -0
  1363. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/ip_functions/ipv4_string_to_num.md +30 -0
  1364. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/ip_functions/ipv6_num_to_string.md +31 -0
  1365. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/ip_functions/ipv6_string_to_num.md +40 -0
  1366. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/ip_functions/is_ip_address_in_range.md +39 -0
  1367. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/from_json.md +82 -0
  1368. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/get_json_object.md +77 -0
  1369. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_array.md +36 -0
  1370. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_array_get.md +38 -0
  1371. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_contains.md +91 -0
  1372. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract.md +59 -0
  1373. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_bigint.md +33 -0
  1374. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_boolean.md +36 -0
  1375. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_date.md +35 -0
  1376. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_double.md +33 -0
  1377. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_float.md +37 -0
  1378. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_int.md +37 -0
  1379. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_string.md +37 -0
  1380. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_timestamp.md +35 -0
  1381. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_extract_timestamp_ntz.md +35 -0
  1382. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_length.md +37 -0
  1383. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_minify.md +33 -0
  1384. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_normalize.md +39 -0
  1385. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_object.md +47 -0
  1386. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_parse.md +45 -0
  1387. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_remove.md +39 -0
  1388. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_type.md +56 -0
  1389. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_unquote.md +39 -0
  1390. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/json_valid.md +45 -0
  1391. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/schema_of_json.md +63 -0
  1392. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/to_json.md +62 -0
  1393. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/json_functions/try_from_json.md +39 -0
  1394. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/abs.md +44 -0
  1395. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/acos.md +37 -0
  1396. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/acosh.md +39 -0
  1397. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/asin.md +38 -0
  1398. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/asinh.md +45 -0
  1399. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/atan.md +25 -0
  1400. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/atan2.md +40 -0
  1401. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/atanh.md +22 -0
  1402. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/bround.md +42 -0
  1403. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/cbrt.md +56 -0
  1404. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/ceil.md +45 -0
  1405. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/ceiling.md +47 -0
  1406. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/ceilling.md +41 -0
  1407. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/cos.md +40 -0
  1408. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/cosh.md +41 -0
  1409. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/cot.md +49 -0
  1410. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/csc.md +33 -0
  1411. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/degrees.md +38 -0
  1412. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/div.md +33 -0
  1413. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/e.md +29 -0
  1414. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/erf.md +74 -0
  1415. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/exp.md +31 -0
  1416. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/exp2.md +32 -0
  1417. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/expm1.md +49 -0
  1418. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/floor.md +42 -0
  1419. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/greatest.md +36 -0
  1420. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/hypot.md +35 -0
  1421. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/isnan.md +28 -0
  1422. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/least.md +38 -0
  1423. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/lgamma.md +75 -0
  1424. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/ln.md +52 -0
  1425. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/log.md +39 -0
  1426. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/log10.md +47 -0
  1427. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/log1p.md +45 -0
  1428. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/log2.md +28 -0
  1429. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/median.md +21 -0
  1430. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/mod.md +34 -0
  1431. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/monotonically_increasing_id.md +79 -0
  1432. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/negative.md +15 -0
  1433. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/operators.md +600 -0
  1434. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/pi.md +37 -0
  1435. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/pmod.md +32 -0
  1436. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/positive.md +15 -0
  1437. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/pow.md +37 -0
  1438. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/power.md +35 -0
  1439. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/radians.md +25 -0
  1440. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/rand.md +31 -0
  1441. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/randn.md +25 -0
  1442. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/random.md +54 -0
  1443. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/round.md +42 -0
  1444. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/shiftleft.md +78 -0
  1445. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/shiftright.md +81 -0
  1446. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/shiftrightunsigned.md +81 -0
  1447. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/sign.md +44 -0
  1448. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/sin.md +61 -0
  1449. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/sinh.md +44 -0
  1450. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/sqrt.md +38 -0
  1451. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/tan.md +41 -0
  1452. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/tanh.md +38 -0
  1453. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/math_functions/tgamma.md +75 -0
  1454. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array.md +45 -0
  1455. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_append.md +30 -0
  1456. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_compact.md +35 -0
  1457. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_contains.md +35 -0
  1458. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_distinct.md +38 -0
  1459. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_except.md +32 -0
  1460. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_intersect.md +39 -0
  1461. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_join.md +47 -0
  1462. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_max.md +52 -0
  1463. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_min.md +54 -0
  1464. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_position.md +44 -0
  1465. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_prepend.md +30 -0
  1466. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_remove.md +30 -0
  1467. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_repeat.md +50 -0
  1468. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_size.md +31 -0
  1469. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_sort.md +44 -0
  1470. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_sort_reverse.md +30 -0
  1471. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/array_union.md +32 -0
  1472. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/arrays_overlap.md +38 -0
  1473. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/arrays_zip.md +47 -0
  1474. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/cardinality.md +43 -0
  1475. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/concat.md +47 -0
  1476. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/concat_ws.md +67 -0
  1477. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/element_at.md +40 -0
  1478. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/flatten.md +50 -0
  1479. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/hasany_clickhouse.md +40 -0
  1480. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map.md +214 -0
  1481. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_concat.md +36 -0
  1482. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_concat_array.md +30 -0
  1483. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_contains_key.md +33 -0
  1484. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_entries.md +40 -0
  1485. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_equal.md +32 -0
  1486. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_except.md +34 -0
  1487. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_from_arrays.md +40 -0
  1488. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_from_entries.md +50 -0
  1489. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_keys.md +34 -0
  1490. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/map_values.md +41 -0
  1491. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/multimap_from_entries.md +50 -0
  1492. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/named_struct.md +51 -0
  1493. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/sequence.md +33 -0
  1494. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/size.md +30 -0
  1495. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/slice.md +37 -0
  1496. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/sort_array.md +34 -0
  1497. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/struct.md +33 -0
  1498. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/struct_insert.md +48 -0
  1499. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/struct_update.md +39 -0
  1500. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/trans_array.md +219 -0
  1501. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/nested_functions/try_element_at.md +46 -0
  1502. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/partition/max_pt.md +36 -0
  1503. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/search_functions/match_all.md +52 -0
  1504. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/search_functions/match_any.md +52 -0
  1505. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/search_functions/match_phrase.md +50 -0
  1506. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/search_functions/match_phrase_prefix.md +35 -0
  1507. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/search_functions/match_regexp.md +28 -0
  1508. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/search_functions/tokenize.md +56 -0
  1509. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/aes_decrypt.md +37 -0
  1510. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/aes_decrypt_mysql.md +30 -0
  1511. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/aes_encrypt.md +33 -0
  1512. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/aes_encrypt_mysql.md +47 -0
  1513. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/ascii.md +34 -0
  1514. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/base64.md +27 -0
  1515. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/binary.md +36 -0
  1516. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/btrim.md +38 -0
  1517. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/char.md +51 -0
  1518. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/char_length.md +46 -0
  1519. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/character_length.md +46 -0
  1520. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/chr.md +26 -0
  1521. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/collation_sort_key.md +56 -0
  1522. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/concat.md +69 -0
  1523. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/concat_ws.md +67 -0
  1524. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/contains.md +29 -0
  1525. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/conv.md +48 -0
  1526. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/endswith.md +68 -0
  1527. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/find_in_set.md +49 -0
  1528. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/format_string.md +47 -0
  1529. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/gunzip.md +60 -0
  1530. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/gzip.md +60 -0
  1531. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/hex.md +43 -0
  1532. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/ilike.md +89 -0
  1533. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/instr.md +41 -0
  1534. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/is_ascii.md +27 -0
  1535. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/is_utf8.md +27 -0
  1536. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/lcase.md +45 -0
  1537. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/left.md +40 -0
  1538. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/length.md +30 -0
  1539. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/lengthb.md +42 -0
  1540. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/like.md +106 -0
  1541. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/locate.md +42 -0
  1542. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/lower.md +54 -0
  1543. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/lpad.md +42 -0
  1544. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/ltrim.md +43 -0
  1545. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/mask.md +36 -0
  1546. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/mask_inner.md +74 -0
  1547. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/mask_outer.md +74 -0
  1548. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/md5.md +59 -0
  1549. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/octet_length.md +47 -0
  1550. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/parse_url.md +72 -0
  1551. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/position.md +35 -0
  1552. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/regexp-statement.md +81 -0
  1553. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/regexp_count.md +112 -0
  1554. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/regexp_extract.md +37 -0
  1555. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/regexp_extract_all.md +41 -0
  1556. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/regexp_instr.md +146 -0
  1557. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/regexp_replace.md +33 -0
  1558. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/regexp_split_to_array.md +34 -0
  1559. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/repeat.md +44 -0
  1560. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/replace.md +37 -0
  1561. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/reverse.md +64 -0
  1562. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/right.md +39 -0
  1563. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/rlike.md +59 -0
  1564. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/rpad.md +46 -0
  1565. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/rtrim.md +42 -0
  1566. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/sha1.md +59 -0
  1567. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/space.md +30 -0
  1568. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/split.md +57 -0
  1569. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/split_by_regexp.md +34 -0
  1570. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/split_by_string.md +33 -0
  1571. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/split_part.md +42 -0
  1572. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/startswith.md +68 -0
  1573. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/str_to_map.md +46 -0
  1574. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/strpos.md +80 -0
  1575. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/substr.md +40 -0
  1576. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/substring.md +47 -0
  1577. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/substring_index.md +46 -0
  1578. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/translate.md +42 -0
  1579. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/trim.md +51 -0
  1580. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/typeof.md +63 -0
  1581. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/ucase.md +46 -0
  1582. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/unbase64.md +50 -0
  1583. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/unhex.md +46 -0
  1584. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/upper.md +45 -0
  1585. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/url_decode.md +36 -0
  1586. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/url_encode.md +37 -0
  1587. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/string_functions/uuid.md +37 -0
  1588. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/binary_quantize.md +52 -0
  1589. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/cosine_distance.md +47 -0
  1590. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/dot_product.md +48 -0
  1591. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/fill_vector.md +53 -0
  1592. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/hamming_distance.md +55 -0
  1593. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/jaccard_distance.md +53 -0
  1594. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/l2_distance.md +47 -0
  1595. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/l2_norm.md +46 -0
  1596. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/l2_normalize.md +48 -0
  1597. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/print_vector_bits.md +52 -0
  1598. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/vector.md +77 -0
  1599. package/bin/skills/lakehouse-doc-en/references/sql_functions/scalar_functions/vector_functions/vector_add_scalar.md +53 -0
  1600. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/explode.md +64 -0
  1601. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/explode_json_array_double.md +29 -0
  1602. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/explode_json_array_int.md +29 -0
  1603. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/explode_json_array_json.md +28 -0
  1604. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/explode_json_array_string.md +29 -0
  1605. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/explode_json_object.md +29 -0
  1606. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/explode_outer.md +78 -0
  1607. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/inline.md +42 -0
  1608. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/json_tuple.md +72 -0
  1609. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/load_history.md +55 -0
  1610. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/posexplode.md +79 -0
  1611. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/posexplode_outer.md +78 -0
  1612. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/read_kafka.md +65 -0
  1613. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/stack.md +62 -0
  1614. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/table_changes.md +136 -0
  1615. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/unnest.md +105 -0
  1616. package/bin/skills/lakehouse-doc-en/references/sql_functions/table_functions/unnset.md +101 -0
  1617. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/avg.md +79 -0
  1618. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/count.md +80 -0
  1619. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/cume_dist.md +92 -0
  1620. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/dense_rank.md +109 -0
  1621. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/first.md +148 -0
  1622. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/first_value.md +128 -0
  1623. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/lag.md +59 -0
  1624. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/last.md +158 -0
  1625. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/last_value.md +154 -0
  1626. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/lead.md +45 -0
  1627. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/max.md +159 -0
  1628. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/min.md +127 -0
  1629. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/nth_value.md +108 -0
  1630. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/ntile.md +53 -0
  1631. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/percent_rank.md +53 -0
  1632. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/rank.md +131 -0
  1633. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/row_number.md +134 -0
  1634. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/sum.md +99 -0
  1635. package/bin/skills/lakehouse-doc-en/references/sql_functions/window_functions/window_clause.md +81 -0
  1636. package/bin/skills/lakehouse-doc-en/references/sql_functions_guide.md +1 -0
  1637. package/bin/skills/lakehouse-doc-en/references/sql_practice.md +1 -0
  1638. package/bin/skills/lakehouse-doc-en/references/sql_rfm.md +197 -0
  1639. package/bin/skills/lakehouse-doc-en/references/sqlalchemy.md +76 -0
  1640. package/bin/skills/lakehouse-doc-en/references/ssb-benchmark.md +216 -0
  1641. package/bin/skills/lakehouse-doc-en/references/sso-configuration.md +160 -0
  1642. package/bin/skills/lakehouse-doc-en/references/storage_encryption.md +81 -0
  1643. package/bin/skills/lakehouse-doc-en/references/streaming_data_pipeline_overview.md +46 -0
  1644. package/bin/skills/lakehouse-doc-en/references/streaming_data_pipeline_overview1.md +1 -0
  1645. package/bin/skills/lakehouse-doc-en/references/streaming_pipeline_with_dynamic_table.md +93 -0
  1646. package/bin/skills/lakehouse-doc-en/references/string_function.md +1 -0
  1647. package/bin/skills/lakehouse-doc-en/references/string_guide.md +42 -0
  1648. package/bin/skills/lakehouse-doc-en/references/struct-function.md +1 -0
  1649. package/bin/skills/lakehouse-doc-en/references/structure_data_analysis.md +223 -0
  1650. package/bin/skills/lakehouse-doc-en/references/studio-di-privatelink-vpc-from-rds.md +105 -0
  1651. package/bin/skills/lakehouse-doc-en/references/studio-incremental-sync-practice.md +318 -0
  1652. package/bin/skills/lakehouse-doc-en/references/studio-jdbc-task.md +238 -0
  1653. package/bin/skills/lakehouse-doc-en/references/studio-python-task-connector.md +286 -0
  1654. package/bin/skills/lakehouse-doc-en/references/studio-python-task-zettapark.md +368 -0
  1655. package/bin/skills/lakehouse-doc-en/references/studio-shell-task.md +191 -0
  1656. package/bin/skills/lakehouse-doc-en/references/studio-task-practice.md +20 -0
  1657. package/bin/skills/lakehouse-doc-en/references/studio_manual.md +172 -0
  1658. package/bin/skills/lakehouse-doc-en/references/studio_overview.md +59 -0
  1659. package/bin/skills/lakehouse-doc-en/references/synonym.md +120 -0
  1660. package/bin/skills/lakehouse-doc-en/references/table-design.md +270 -0
  1661. package/bin/skills/lakehouse-doc-en/references/table-overview.md +68 -0
  1662. package/bin/skills/lakehouse-doc-en/references/table-stream-title.md +71 -0
  1663. package/bin/skills/lakehouse-doc-en/references/table-stream.md +1 -0
  1664. package/bin/skills/lakehouse-doc-en/references/table-summary.md +1 -0
  1665. package/bin/skills/lakehouse-doc-en/references/table_changes.md +127 -0
  1666. package/bin/skills/lakehouse-doc-en/references/table_function.md +1 -0
  1667. package/bin/skills/lakehouse-doc-en/references/table_rendering.md +34 -0
  1668. package/bin/skills/lakehouse-doc-en/references/table_stream.md +311 -0
  1669. package/bin/skills/lakehouse-doc-en/references/tableau-connect-to-lakehouse.md +64 -0
  1670. package/bin/skills/lakehouse-doc-en/references/tablesample.md +482 -0
  1671. package/bin/skills/lakehouse-doc-en/references/tablestream_summary.md +515 -0
  1672. package/bin/skills/lakehouse-doc-en/references/task-develop.md +277 -0
  1673. package/bin/skills/lakehouse-doc-en/references/task-instance-maintenance.md +198 -0
  1674. package/bin/skills/lakehouse-doc-en/references/task_development.md +54 -0
  1675. package/bin/skills/lakehouse-doc-en/references/task_group.md +158 -0
  1676. package/bin/skills/lakehouse-doc-en/references/task_instance.md +82 -0
  1677. package/bin/skills/lakehouse-doc-en/references/task_instance_operations.md +206 -0
  1678. package/bin/skills/lakehouse-doc-en/references/task_param.md +130 -0
  1679. package/bin/skills/lakehouse-doc-en/references/task_param_examples.md +220 -0
  1680. package/bin/skills/lakehouse-doc-en/references/task_param_reference.md +322 -0
  1681. package/bin/skills/lakehouse-doc-en/references/task_scheduling.md +1 -0
  1682. package/bin/skills/lakehouse-doc-en/references/task_scheduling_dependency.md +70 -0
  1683. package/bin/skills/lakehouse-doc-en/references/taskdevelop.md +268 -0
  1684. package/bin/skills/lakehouse-doc-en/references/tencentcloud_arn_and_externalid.md +29 -0
  1685. package/bin/skills/lakehouse-doc-en/references/tencentcloud_privatelink.md +1 -0
  1686. package/bin/skills/lakehouse-doc-en/references/time-function.md +67 -0
  1687. package/bin/skills/lakehouse-doc-en/references/time-travel-concept.md +175 -0
  1688. package/bin/skills/lakehouse-doc-en/references/time_date_guide.md +47 -0
  1689. package/bin/skills/lakehouse-doc-en/references/time_travel_guide.md +73 -0
  1690. package/bin/skills/lakehouse-doc-en/references/timetravel-summary.md +45 -0
  1691. package/bin/skills/lakehouse-doc-en/references/tools_AI.md +1 -0
  1692. package/bin/skills/lakehouse-doc-en/references/tools_BI.md +1 -0
  1693. package/bin/skills/lakehouse-doc-en/references/tools_app.md +1 -0
  1694. package/bin/skills/lakehouse-doc-en/references/tpcds-3tb-benchmark.md +754 -0
  1695. package/bin/skills/lakehouse-doc-en/references/tpcds-benchmark.md +756 -0
  1696. package/bin/skills/lakehouse-doc-en/references/tpch-benchmark.md +884 -0
  1697. package/bin/skills/lakehouse-doc-en/references/transform-dt.md +291 -0
  1698. package/bin/skills/lakehouse-doc-en/references/trial-account-quotas-and-limits.md +85 -0
  1699. package/bin/skills/lakehouse-doc-en/references/tutorial_DataGPT.md +1 -0
  1700. package/bin/skills/lakehouse-doc-en/references/tutorial_connect_to_lakehouse.md +1 -0
  1701. package/bin/skills/lakehouse-doc-en/references/tutorial_data_ingestion.md +1 -0
  1702. package/bin/skills/lakehouse-doc-en/references/tutorial_data_transformation.md +1 -0
  1703. package/bin/skills/lakehouse-doc-en/references/tutorial_migration.md +49 -0
  1704. package/bin/skills/lakehouse-doc-en/references/tutorial_virtual_cluster.md +149 -0
  1705. package/bin/skills/lakehouse-doc-en/references/tutorial_work_with_workspace.md +1 -0
  1706. package/bin/skills/lakehouse-doc-en/references/tutorial_zettapark.md +1 -0
  1707. package/bin/skills/lakehouse-doc-en/references/tutorials-streaming-data-pipeline-with_dynamic-table.md +123 -0
  1708. package/bin/skills/lakehouse-doc-en/references/tutorials.md +168 -0
  1709. package/bin/skills/lakehouse-doc-en/references/undrop-dynamic-table.md +74 -0
  1710. package/bin/skills/lakehouse-doc-en/references/undrop-materialized-view.md +74 -0
  1711. package/bin/skills/lakehouse-doc-en/references/undrop.md +36 -0
  1712. package/bin/skills/lakehouse-doc-en/references/unified-workflow-demo.md +175 -0
  1713. package/bin/skills/lakehouse-doc-en/references/unified-workflow-intro.md +31 -0
  1714. package/bin/skills/lakehouse-doc-en/references/unified-workflow.md +2 -0
  1715. package/bin/skills/lakehouse-doc-en/references/unifiedWorkflow.md +1 -0
  1716. package/bin/skills/lakehouse-doc-en/references/unloa-data-summary.md +17 -0
  1717. package/bin/skills/lakehouse-doc-en/references/unload-data-local.md +71 -0
  1718. package/bin/skills/lakehouse-doc-en/references/unload-data-summary.md +17 -0
  1719. package/bin/skills/lakehouse-doc-en/references/unstructure_data_analysis.md +1 -0
  1720. package/bin/skills/lakehouse-doc-en/references/unstructured-io.md +795 -0
  1721. package/bin/skills/lakehouse-doc-en/references/unstructured_etl_pipeline_notebook.md +11 -0
  1722. package/bin/skills/lakehouse-doc-en/references/unstructured_etl_pipeline_user_guide.md +1099 -0
  1723. package/bin/skills/lakehouse-doc-en/references/unstructured_etl_python_api.md +942 -0
  1724. package/bin/skills/lakehouse-doc-en/references/upload-data-with-sdk.md +4 -0
  1725. package/bin/skills/lakehouse-doc-en/references/upload-data.md +1 -0
  1726. package/bin/skills/lakehouse-doc-en/references/upload-json-jsonl.md +135 -0
  1727. package/bin/skills/lakehouse-doc-en/references/upload_data.md +122 -0
  1728. package/bin/skills/lakehouse-doc-en/references/use-copy-upload-data.md +88 -0
  1729. package/bin/skills/lakehouse-doc-en/references/use-dbt-dev.md +428 -0
  1730. package/bin/skills/lakehouse-doc-en/references/use-external-schema.md +34 -0
  1731. package/bin/skills/lakehouse-doc-en/references/use-external-upload-hive-data.md +18 -0
  1732. package/bin/skills/lakehouse-doc-en/references/use-java-sdk-realtime-uploaddata.md +163 -0
  1733. package/bin/skills/lakehouse-doc-en/references/use-java-sdk-upload-data-local.md +140 -0
  1734. package/bin/skills/lakehouse-doc-en/references/use-mysql-client.md +184 -0
  1735. package/bin/skills/lakehouse-doc-en/references/use-python-sdk-upload-data.md +103 -0
  1736. package/bin/skills/lakehouse-doc-en/references/use-schema.md +49 -0
  1737. package/bin/skills/lakehouse-doc-en/references/use-sql-analyze.md +1 -0
  1738. package/bin/skills/lakehouse-doc-en/references/use-vcluster.md +58 -0
  1739. package/bin/skills/lakehouse-doc-en/references/user-aggrement.md +1 -0
  1740. package/bin/skills/lakehouse-doc-en/references/user-external-function.md +56 -0
  1741. package/bin/skills/lakehouse-doc-en/references/user-identification.md +58 -0
  1742. package/bin/skills/lakehouse-doc-en/references/user_permission_grand_guide.md +322 -0
  1743. package/bin/skills/lakehouse-doc-en/references/user_rights_management.md +1 -0
  1744. package/bin/skills/lakehouse-doc-en/references/using-google-authenticator.md +44 -0
  1745. package/bin/skills/lakehouse-doc-en/references/using-udf-in-dynamic-table.md +162 -0
  1746. package/bin/skills/lakehouse-doc-en/references/using_mcp_solute_data_pipeline_issue.md +342 -0
  1747. package/bin/skills/lakehouse-doc-en/references/uuid.md +39 -0
  1748. package/bin/skills/lakehouse-doc-en/references/validate_schema_evolution.md +169 -0
  1749. package/bin/skills/lakehouse-doc-en/references/values.md +97 -0
  1750. package/bin/skills/lakehouse-doc-en/references/vc-job.md +1 -0
  1751. package/bin/skills/lakehouse-doc-en/references/vc_cache.md +85 -0
  1752. package/bin/skills/lakehouse-doc-en/references/vcluster_size_description.md +96 -0
  1753. package/bin/skills/lakehouse-doc-en/references/vector-and-scalar-retrieval-in-same-table.md +84 -0
  1754. package/bin/skills/lakehouse-doc-en/references/vector-functions.md +1 -0
  1755. package/bin/skills/lakehouse-doc-en/references/vector-search.md +140 -0
  1756. package/bin/skills/lakehouse-doc-en/references/vector-type.md +201 -0
  1757. package/bin/skills/lakehouse-doc-en/references/vector_data_process_guide.md +952 -0
  1758. package/bin/skills/lakehouse-doc-en/references/vector_search_ai.md +423 -0
  1759. package/bin/skills/lakehouse-doc-en/references/version-update.md +20 -0
  1760. package/bin/skills/lakehouse-doc-en/references/virtual-cluster.md +213 -0
  1761. package/bin/skills/lakehouse-doc-en/references/volume-introduction.md +163 -0
  1762. package/bin/skills/lakehouse-doc-en/references/volume-list.md +50 -0
  1763. package/bin/skills/lakehouse-doc-en/references/volume-overview.md +397 -0
  1764. package/bin/skills/lakehouse-doc-en/references/volume_best_practices.md +1147 -0
  1765. package/bin/skills/lakehouse-doc-en/references/volumeobject.md +1 -0
  1766. package/bin/skills/lakehouse-doc-en/references/web-job-history.md +178 -0
  1767. package/bin/skills/lakehouse-doc-en/references/web_search.md +30 -0
  1768. package/bin/skills/lakehouse-doc-en/references/what-is-clickzetta-lakehouse.md +55 -0
  1769. package/bin/skills/lakehouse-doc-en/references/what_is_clickzetta_lakehouse.md +90 -0
  1770. package/bin/skills/lakehouse-doc-en/references/window-frame.md +135 -0
  1771. package/bin/skills/lakehouse-doc-en/references/window-function-summary.md +134 -0
  1772. package/bin/skills/lakehouse-doc-en/references/windowframe.md +143 -0
  1773. package/bin/skills/lakehouse-doc-en/references/working_with_Vclusters.md +175 -0
  1774. package/bin/skills/lakehouse-doc-en/references/working_with_cache.md +96 -0
  1775. package/bin/skills/lakehouse-doc-en/references/worksheet.md +15 -0
  1776. package/bin/skills/lakehouse-doc-en/references/workspace-information_schema-views.md +203 -0
  1777. package/bin/skills/lakehouse-doc-en/references/workspace-informationschema-summary.md +52 -0
  1778. package/bin/skills/lakehouse-doc-en/references/workspace-introduction.md +43 -0
  1779. package/bin/skills/lakehouse-doc-en/references/workspace-sql.md +1 -0
  1780. package/bin/skills/lakehouse-doc-en/references/workspace_information_schema.md +1 -0
  1781. package/bin/skills/lakehouse-doc-en/references/zettapark-data-engineering-demo.md +348 -0
  1782. package/bin/skills/lakehouse-doc-en/references/zettapark-dataframe-guide.md +342 -0
  1783. package/bin/skills/lakehouse-doc-en/references/zettapark-dynamic-table-guide.md +204 -0
  1784. package/bin/skills/lakehouse-doc-en/references/zettapark-etl-guide.md +257 -0
  1785. package/bin/skills/lakehouse-doc-en/references/zettapark-feature-engineering.md +311 -0
  1786. package/bin/skills/lakehouse-doc-en/references/zettapark-functions-guide.md +321 -0
  1787. package/bin/skills/lakehouse-doc-en/references/zettapark-quick-start.md +123 -0
  1788. package/bin/skills/lakehouse-doc-en/references/zettapark-stream-guide.md +205 -0
  1789. package/bin/skills/lakehouse-doc-en/references/zettapark-volume-guide.md +291 -0
  1790. package/package.json +1 -1
  1791. package/bin/skills/clickzetta-access-control/LICENSE +0 -16
  1792. package/bin/skills/clickzetta-access-control/SKILL.md +0 -243
  1793. package/bin/skills/clickzetta-access-control/eval_cases.jsonl +0 -3
  1794. package/bin/skills/clickzetta-access-control/references/dynamic-masking.md +0 -86
  1795. package/bin/skills/clickzetta-access-control/references/grant-revoke.md +0 -103
  1796. package/bin/skills/clickzetta-access-control/references/role-management.md +0 -66
  1797. package/bin/skills/clickzetta-access-control/references/user-management.md +0 -61
  1798. package/bin/skills/clickzetta-ai-function/eval_cases.jsonl +0 -4
  1799. package/bin/skills/clickzetta-app-python-sdk/LICENSE +0 -16
  1800. package/bin/skills/clickzetta-app-python-sdk/SKILL.md +0 -153
  1801. package/bin/skills/clickzetta-app-python-sdk/eval_cases.jsonl +0 -12
  1802. package/bin/skills/clickzetta-app-python-sdk/references/bulkload.md +0 -196
  1803. package/bin/skills/clickzetta-app-python-sdk/references/connector.md +0 -143
  1804. package/bin/skills/clickzetta-app-python-sdk/references/realtime.md +0 -122
  1805. package/bin/skills/clickzetta-batch-sync-pipeline/eval_cases.jsonl +0 -5
  1806. package/bin/skills/clickzetta-bi-connect/LICENSE +0 -16
  1807. package/bin/skills/clickzetta-bi-connect/SKILL.md +0 -177
  1808. package/bin/skills/clickzetta-bi-connect/eval_cases.jsonl +0 -5
  1809. package/bin/skills/clickzetta-bi-connect/references/bi-tools.md +0 -170
  1810. package/bin/skills/clickzetta-cdc-sync-pipeline/eval_cases.jsonl +0 -5
  1811. package/bin/skills/clickzetta-data-ingest-pipeline/eval_cases.jsonl +0 -5
  1812. package/bin/skills/clickzetta-data-retention/LICENSE +0 -16
  1813. package/bin/skills/clickzetta-data-retention/SKILL.md +0 -160
  1814. package/bin/skills/clickzetta-data-retention/eval_cases.jsonl +0 -5
  1815. package/bin/skills/clickzetta-data-retention/references/lifecycle-reference.md +0 -175
  1816. package/bin/skills/clickzetta-data-science/eval_cases.jsonl +0 -12
  1817. package/bin/skills/clickzetta-data-sharing/LICENSE +0 -16
  1818. package/bin/skills/clickzetta-data-sharing/SKILL.md +0 -160
  1819. package/bin/skills/clickzetta-data-sharing/eval_cases.jsonl +0 -3
  1820. package/bin/skills/clickzetta-data-sharing/references/share-ddl.md +0 -134
  1821. package/bin/skills/clickzetta-dba-guide/LICENSE +0 -16
  1822. package/bin/skills/clickzetta-dba-guide/SKILL.md +0 -542
  1823. package/bin/skills/clickzetta-dba-guide/eval_cases.jsonl +0 -3
  1824. package/bin/skills/clickzetta-dw-modeling/eval_cases.jsonl +0 -4
  1825. package/bin/skills/clickzetta-dynamic-table/eval_cases.jsonl +0 -5
  1826. package/bin/skills/clickzetta-external-catalog/LICENSE +0 -16
  1827. package/bin/skills/clickzetta-external-catalog/SKILL.md +0 -123
  1828. package/bin/skills/clickzetta-external-catalog/eval_cases.jsonl +0 -5
  1829. package/bin/skills/clickzetta-external-catalog/references/external-catalog-ddl.md +0 -130
  1830. package/bin/skills/clickzetta-external-function/eval_cases.jsonl +0 -4
  1831. package/bin/skills/clickzetta-file-import-pipeline/eval_cases.jsonl +0 -5
  1832. package/bin/skills/clickzetta-index-manager/LICENSE +0 -16
  1833. package/bin/skills/clickzetta-index-manager/SKILL.md +0 -140
  1834. package/bin/skills/clickzetta-index-manager/eval_cases.jsonl +0 -5
  1835. package/bin/skills/clickzetta-index-manager/references/bloomfilter-index.md +0 -67
  1836. package/bin/skills/clickzetta-index-manager/references/index-management.md +0 -73
  1837. package/bin/skills/clickzetta-index-manager/references/inverted-index.md +0 -80
  1838. package/bin/skills/clickzetta-index-manager/references/vector-index.md +0 -81
  1839. package/bin/skills/clickzetta-java-sdk/LICENSE +0 -16
  1840. package/bin/skills/clickzetta-java-sdk/SKILL.md +0 -187
  1841. package/bin/skills/clickzetta-java-sdk/eval_cases.jsonl +0 -12
  1842. package/bin/skills/clickzetta-java-sdk/references/bulkload.md +0 -163
  1843. package/bin/skills/clickzetta-java-sdk/references/realtime.md +0 -212
  1844. package/bin/skills/clickzetta-kafka-ingest-pipeline/eval_cases.jsonl +0 -5
  1845. package/bin/skills/clickzetta-lakehouse-connect/LICENSE +0 -16
  1846. package/bin/skills/clickzetta-lakehouse-connect/SKILL.md +0 -218
  1847. package/bin/skills/clickzetta-lakehouse-connect/eval_cases.jsonl +0 -3
  1848. package/bin/skills/clickzetta-lakehouse-connect/evals/evals.json +0 -35
  1849. package/bin/skills/clickzetta-lakehouse-connect/references/config-file.md +0 -435
  1850. package/bin/skills/clickzetta-lakehouse-connect/references/jdbc.md +0 -478
  1851. package/bin/skills/clickzetta-lakehouse-connect/references/python-sdk.md +0 -225
  1852. package/bin/skills/clickzetta-lakehouse-connect/references/sqlalchemy.md +0 -468
  1853. package/bin/skills/clickzetta-lakehouse-connect/references/zettapark-session.md +0 -445
  1854. package/bin/skills/clickzetta-manage-comments/LICENSE +0 -16
  1855. package/bin/skills/clickzetta-manage-comments/SKILL.md +0 -219
  1856. package/bin/skills/clickzetta-manage-comments/eval_cases.jsonl +0 -3
  1857. package/bin/skills/clickzetta-metadata/LICENSE +0 -16
  1858. package/bin/skills/clickzetta-metadata/SKILL.md +0 -502
  1859. package/bin/skills/clickzetta-metadata/eval_cases.jsonl +0 -5
  1860. package/bin/skills/clickzetta-metadata/references/instance-views-reference.md +0 -276
  1861. package/bin/skills/clickzetta-metadata/references/metering-views-reference.md +0 -137
  1862. package/bin/skills/clickzetta-metadata/references/show-desc-reference.md +0 -326
  1863. package/bin/skills/clickzetta-metadata/references/views-reference.md +0 -271
  1864. package/bin/skills/clickzetta-monitoring/LICENSE +0 -16
  1865. package/bin/skills/clickzetta-monitoring/SKILL.md +0 -215
  1866. package/bin/skills/clickzetta-monitoring/eval_cases.jsonl +0 -5
  1867. package/bin/skills/clickzetta-monitoring/references/job-history-analysis.md +0 -97
  1868. package/bin/skills/clickzetta-monitoring/references/show-jobs.md +0 -48
  1869. package/bin/skills/clickzetta-oss-ingest-pipeline/eval_cases.jsonl +0 -5
  1870. package/bin/skills/clickzetta-overview/eval_cases.jsonl +0 -5
  1871. package/bin/skills/clickzetta-query-optimizer/LICENSE +0 -16
  1872. package/bin/skills/clickzetta-query-optimizer/SKILL.md +0 -156
  1873. package/bin/skills/clickzetta-query-optimizer/eval_cases.jsonl +0 -5
  1874. package/bin/skills/clickzetta-query-optimizer/references/explain.md +0 -56
  1875. package/bin/skills/clickzetta-query-optimizer/references/hints-and-sortkey.md +0 -78
  1876. package/bin/skills/clickzetta-query-optimizer/references/optimize.md +0 -65
  1877. package/bin/skills/clickzetta-query-optimizer/references/result-cache.md +0 -49
  1878. package/bin/skills/clickzetta-query-optimizer/references/show-jobs.md +0 -42
  1879. package/bin/skills/clickzetta-realtime-sync-pipeline/eval_cases.jsonl +0 -5
  1880. package/bin/skills/clickzetta-semantic-view/eval_cases.jsonl +0 -12
  1881. package/bin/skills/clickzetta-spark-flink-connector/eval_cases.jsonl +0 -5
  1882. package/bin/skills/clickzetta-sql-migration/eval_cases.jsonl +0 -10
  1883. package/bin/skills/clickzetta-sql-pipeline-manager/eval_cases.jsonl +0 -12
  1884. package/bin/skills/clickzetta-sql-pipeline-manager/evals/evals.json +0 -166
  1885. package/bin/skills/clickzetta-table-lineage/eval_cases.jsonl +0 -1
  1886. package/bin/skills/clickzetta-table-stream-pipeline/eval_cases.jsonl +0 -5
  1887. package/bin/skills/clickzetta-vcluster-manager/LICENSE +0 -16
  1888. package/bin/skills/clickzetta-vcluster-manager/SKILL.md +0 -212
  1889. package/bin/skills/clickzetta-vcluster-manager/eval_cases.jsonl +0 -5
  1890. package/bin/skills/clickzetta-vcluster-manager/references/vc-cache.md +0 -54
  1891. package/bin/skills/clickzetta-vcluster-manager/references/vcluster-ddl.md +0 -150
  1892. package/bin/skills/clickzetta-volume-manager/eval_cases.jsonl +0 -5
  1893. package/bin/skills/clickzetta-zettapark/eval_cases.jsonl +0 -12
@@ -0,0 +1,2674 @@
1
+ # Singdata Lakehouse Table Design Best Practices Guide
2
+
3
+ ## Content Overview
4
+
5
+ ### Document Introduction
6
+
7
+ This guide is a comprehensive reference manual for table design on the Singdata Lakehouse platform, covering everything from basic data type selection to complex enterprise-level architecture patterns.
8
+
9
+ ### How to Use This Guide
10
+
11
+ Depending on your role and needs, we recommend the following reading paths:
12
+
13
+ * **Data Architects**: Focus on Design Philosophy (Chapter 1), Partition Architecture (Chapter 5), and Enterprise Design Patterns (Chapter 11)
14
+ * **Data Engineers**: Dive into Data Type Design (Chapter 3), Index Architecture (Chapter 6), and Performance Optimization (Chapter 9)
15
+ * **Backend Developers**: Concentrate on Table Structure Design (Chapter 4), Complex Data Types (Section 3.3), and Troubleshooting (Chapter 10)
16
+ * **Quick Start**: Refer directly to the Design Review Checklist (Chapter 9) as a project guidance framework
17
+
18
+ ### Core Chapter Overview
19
+
20
+ 1. **Design Philosophy and Principles** - Foundational design philosophy and decision framework
21
+ 2. **Data Type Design Strategy** - Detailed type selection guide and use cases
22
+ 3. **Table Structure Design Patterns** - Effective use of constraints, defaults, and generated columns
23
+ 4. **Partition Architecture Design** - Partition type selection and optimization strategies
24
+ 5. **Bucketing and Sorting Optimization** - Best practices for physical data organization
25
+ 6. **Index Architecture Design** - Vector, inverted, and bloom filter indexes in detail
26
+ 7. **Performance Optimization Strategies** - Query performance and storage cost optimization techniques
27
+ 8. **Common Design Pitfalls and Solutions** - Avoiding common mistakes and optimization recommendations
28
+ 9. **Design Review Checklist** - Comprehensive design validation process
29
+ 10. **Enterprise Design Patterns in Practice** - Four advanced application architectures in detail
30
+ 11. **Lab Environment Cleanup Guide** - Resource management best practices
31
+ 12. **Summary** - Content summary
32
+
33
+ On first reading, we recommend going through the design philosophy section to understand core principles, then diving into relevant chapters based on your specific needs. Every code example can be copied and used directly to help you quickly apply them in practice.
34
+
35
+ ***
36
+
37
+ ## Design Philosophy and Principles
38
+
39
+ ### Core Design Thinking
40
+
41
+ On the Singdata Lakehouse, excellent table design should balance **performance, maintainability, and business requirements**. This guide follows these validated core principles:
42
+
43
+ 1. **Business-Driven Design** - Table structure should reflect business models and query patterns
44
+ 2. **Performance-First Consideration** - Proper partitioning, bucketing, and indexing strategies are critical
45
+ 3. **Future-Oriented Scalability** - Design with data growth and business evolution in mind
46
+ 4. **Operations-Friendly** - Simplify daily maintenance and troubleshooting complexity
47
+
48
+ ### Design Decision Framework
49
+
50
+ Each design decision should consider the following dimensions:
51
+
52
+ * **Query Patterns**: Primary data access methods and frequency
53
+ * **Data Characteristics**: Data volume, growth rate, distribution characteristics
54
+ * **Business Requirements**: Real-time requirements, consistency needs, scalability demands
55
+ * **Resource Constraints**: Storage cost, compute resources, operational complexity
56
+
57
+ ***
58
+
59
+ ## Data Type Design Strategy
60
+
61
+ ### Numeric Type Selection Guide
62
+
63
+ #### Auto-Increment Primary Key Design
64
+
65
+ **Key Limitation**: IDENTITY columns only support the BIGINT type
66
+
67
+ ```sql
68
+ -- Correct IDENTITY usage (only supported syntax)
69
+ CREATE TABLE business_events (
70
+ event_id BIGINT IDENTITY, -- Only BIGINT type is supported
71
+ event_data JSON,
72
+ created_at TIMESTAMP DEFAULT current_timestamp()
73
+ );
74
+
75
+ -- IDENTITY with seed value
76
+ CREATE TABLE user_accounts (
77
+ user_id BIGINT IDENTITY(1000), -- Auto-increment starting from 1000
78
+ username VARCHAR(50) NOT NULL
79
+ );
80
+ ```
81
+
82
+ **Unsupported IDENTITY Syntax (confirmed to fail in testing)**:
83
+
84
+ ```sql
85
+ -- These will all result in error: invalid identity column type int, currently only BIGINT is supported
86
+ CREATE TABLE wrong_examples (
87
+ id INT IDENTITY, -- Fails
88
+ small_id SMALLINT IDENTITY, -- Fails
89
+ str_id VARCHAR(50) IDENTITY -- Fails
90
+ );
91
+ ```
92
+
93
+ #### Business Numeric Field Selection
94
+
95
+ | Data Type | Storage | Value Range | Recommended Scenario | Practical Example |
96
+ | --------------- | ------- | ------------------------ | ---------------------- | ----------------------------- |
97
+ | `TINYINT` | 1 byte | -128 to 127 | Status codes, levels | `status TINYINT DEFAULT 1` |
98
+ | `SMALLINT` | 2 bytes | -32,768 to 32,767 | Years, counters | `birth_year SMALLINT` |
99
+ | `INT` | 4 bytes | +/-2.1 billion | Business IDs, large counts | `user_id INT NOT NULL` |
100
+ | `BIGINT` | 8 bytes | +/-9.22 quintillion | Auto-increment PK, large values | `id BIGINT IDENTITY` |
101
+ | `DECIMAL(p,s)` | Variable| Up to 38-digit precision | Financial calculations | `amount DECIMAL(15,2)` |
102
+ | `FLOAT` | 4 bytes | Single-precision float | Scientific computing, coordinates | `temperature FLOAT` |
103
+ | `DOUBLE` | 8 bytes | Double-precision float | High-precision calculations | `coordinate DOUBLE` |
104
+
105
+ ### String Type Strategy
106
+
107
+ #### Length Planning Principles (Based on Actual Business Requirements)
108
+
109
+ | Business Scenario | Recommended Type | Length Setting | Coverage Rate | Design Considerations |
110
+ | ----------------- | ------------------ | -------------- | ------------- | --------------------------- |
111
+ | Email address | `VARCHAR(320)` | RFC5321 standard | 99.9% | International standard length |
112
+ | Username | `VARCHAR(50)` | Research-based | 99.5% | Balance storage and usability |
113
+ | Phone number | `VARCHAR(20)` | International format | 100% | Supports +86-138\*\*\*\* |
114
+ | URL address | `VARCHAR(2048)` | Measured | 98% | Includes complex query params |
115
+ | Article title | `VARCHAR(200)` | SEO optimized | 95% | Search engine friendly |
116
+ | Product description| `VARCHAR(2000)` | E-commerce needs | 90% | Detail page display |
117
+ | Long-form text | `STRING` | Unlimited length | 100% | Blog posts, comments, etc. |
118
+
119
+ ```sql
120
+ -- String type best practices
121
+ CREATE TABLE user_profiles (
122
+ user_id BIGINT IDENTITY,
123
+
124
+ -- Fixed format uses CHAR
125
+ country_code CHAR(2), -- CN, US, JP
126
+ currency_code CHAR(3), -- USD, CNY, EUR
127
+
128
+ -- Business fields use reasonable VARCHAR lengths
129
+ username VARCHAR(50) NOT NULL,
130
+ email VARCHAR(320) NOT NULL,
131
+ mobile_phone VARCHAR(20),
132
+
133
+ -- Descriptive content
134
+ nickname VARCHAR(100),
135
+ bio VARCHAR(500), -- Personal bio
136
+ full_description STRING, -- Detailed description, variable length
137
+
138
+ -- Structured data
139
+ preferences JSON DEFAULT '{}'
140
+ );
141
+ ```
142
+
143
+ ### Vector Type Use Cases
144
+
145
+ #### Vector Type Syntax and Applications
146
+
147
+ **Standard Syntax**: `VECTOR(scalar_type, dimension)` or `VECTOR(dimension)`
148
+
149
+ | Scalar Type | Storage Overhead | Use Case | Recommended Dimensions | Application Example |
150
+ | ----------- | ---------------- | --------------------- | ---------------------- | -------------------------- |
151
+ | `FLOAT` | 4 bytes/dim | Semantic vectors, general AI | 128-2048 | `VECTOR(FLOAT, 768)` |
152
+ | `INT` | 4 bytes/dim | Discrete features, count vectors | 64-1024 | `VECTOR(INT, 256)` |
153
+ | `TINYINT` | 1 byte/dim | Compressed vectors, mobile | 64-512 | `VECTOR(TINYINT, 128)` |
154
+
155
+ **Practical Application Examples**:
156
+
157
+ ```sql
158
+ CREATE TABLE ai_content_vectors (
159
+ content_id BIGINT IDENTITY,
160
+ content_type VARCHAR(50),
161
+
162
+ -- Vector configurations for different business scenarios
163
+ text_embedding VECTOR(FLOAT, 768), -- BERT/RoBERTa output
164
+ image_features VECTOR(FLOAT, 512), -- ResNet/CNN features
165
+ user_preference VECTOR(INT, 256), -- Recommendation system user profile
166
+ mobile_compact VECTOR(TINYINT, 128), -- Mobile lightweight
167
+ general_vector VECTOR(512) -- Default FLOAT type
168
+ );
169
+
170
+ -- Vector data insert syntax (note: dimensions must match strictly)
171
+ INSERT INTO ai_content_vectors (content_type, text_embedding) VALUES (
172
+ 'document',
173
+ cast(concat('[', repeat('0.1,', 767), '0.1]') as VECTOR(FLOAT, 768))
174
+ );
175
+ ```
176
+
177
+ ### Complex Data Type Usage Guide
178
+
179
+ #### Proper STRUCT Type Usage
180
+
181
+ **Correct STRUCT Data Insert Syntax**:
182
+
183
+ ```sql
184
+ CREATE TABLE user_complex_data (
185
+ user_id BIGINT IDENTITY,
186
+
187
+ -- Simple struct
188
+ basic_info STRUCT<id:INT, name:STRING, age:INT>,
189
+
190
+ -- Complex nested struct
191
+ detailed_profile STRUCT<
192
+ personal:STRUCT<name:STRING, email:STRING>,
193
+ address:STRUCT<city:STRING, country:STRING>,
194
+ preferences:MAP<STRING, STRING>
195
+ >
196
+ );
197
+
198
+ -- Method 1: Using struct function (positional arguments)
199
+ INSERT INTO user_complex_data (basic_info) VALUES (
200
+ struct(123, 'Alice', 25)
201
+ );
202
+
203
+ -- Method 2: Using named_struct function (recommended, explicit field names)
204
+ INSERT INTO user_complex_data (basic_info) VALUES (
205
+ named_struct('id', 123, 'name', 'Alice', 'age', 25)
206
+ );
207
+
208
+ -- Inserting complex nested structures
209
+ INSERT INTO user_complex_data (detailed_profile) VALUES (
210
+ named_struct(
211
+ 'personal', named_struct('name', 'Bob', 'email', 'bob@test.com'),
212
+ 'address', named_struct('city', 'Shanghai', 'country', 'China'),
213
+ 'preferences', map('lang', 'zh', 'theme', 'dark')
214
+ )
215
+ );
216
+ ```
217
+
218
+ #### ARRAY and MAP Type Usage
219
+
220
+ ```sql
221
+ CREATE TABLE collection_types_demo (
222
+ record_id BIGINT IDENTITY,
223
+
224
+ -- Array types
225
+ tags ARRAY<STRING>,
226
+ scores ARRAY<INT>,
227
+ nested_arrays ARRAY<ARRAY<STRING>>,
228
+
229
+ -- Map types
230
+ config MAP<STRING, STRING>,
231
+ metrics MAP<STRING, DOUBLE>,
232
+ complex_map MAP<STRING, ARRAY<INT>>
233
+ );
234
+
235
+ -- Correct insert syntax
236
+ INSERT INTO collection_types_demo (
237
+ tags, scores, nested_arrays, config, metrics, complex_map
238
+ ) VALUES (
239
+ array('tech', 'AI', 'database'), -- String array
240
+ array(85, 92, 78), -- Integer array
241
+ array(array('group1', 'item1'), array('group2', 'item2')), -- Nested arrays
242
+ map('env', 'prod', 'version', 'v2.2'), -- String map
243
+ map('cpu_usage', 0.75, 'memory_usage', 0.60), -- Numeric map
244
+ map('feature1', array(1, 2, 3), 'feature2', array(4, 5, 6)) -- Complex map
245
+ );
246
+ ```
247
+
248
+ ***
249
+
250
+ ## Table Structure Design Patterns
251
+
252
+ ### Constraint Design Strategies
253
+
254
+ #### Proper Use of NOT NULL Constraints
255
+
256
+ NOT NULL constraints not only ensure data integrity but also serve as important hints for the query optimizer:
257
+
258
+ ```sql
259
+ CREATE TABLE order_management (
260
+ order_id BIGINT IDENTITY,
261
+
262
+ -- Core business fields: must be non-null
263
+ customer_id INT NOT NULL, -- Core business association
264
+ order_time TIMESTAMP NOT NULL, -- Core time dimension
265
+ order_status TINYINT NOT NULL DEFAULT 0, -- Business status
266
+ total_amount DECIMAL(12,2) NOT NULL, -- Core amount field
267
+
268
+ -- Optional business fields: nullable
269
+ coupon_code VARCHAR(20), -- Coupon (optional)
270
+ customer_notes VARCHAR(500), -- Customer notes (optional)
271
+ gift_message VARCHAR(200), -- Gift message (optional)
272
+
273
+ -- System fields: non-null with defaults
274
+ created_at TIMESTAMP NOT NULL DEFAULT current_timestamp(),
275
+ updated_at TIMESTAMP, -- Update time (NULL on first creation)
276
+
277
+ -- Partition field (generated column)
278
+ date_partition STRING GENERATED ALWAYS AS (
279
+ date_format(order_time, 'yyyy-MM-dd')
280
+ )
281
+ )
282
+ PARTITIONED BY (date_partition);
283
+ ```
284
+
285
+ #### Using Default Values
286
+
287
+ Default value design should reflect business logic and system behavior:
288
+
289
+ ```sql
290
+ CREATE TABLE user_account_enhanced (
291
+ user_id BIGINT IDENTITY,
292
+ username VARCHAR(50) NOT NULL,
293
+
294
+ -- Reasonable defaults for business statuses
295
+ account_status TINYINT DEFAULT 1, -- 1=normal, 0=disabled, 2=locked
296
+ email_verified BOOLEAN DEFAULT false, -- Default not verified
297
+ phone_verified BOOLEAN DEFAULT false, -- Default not verified
298
+
299
+ -- Business defaults for numeric fields
300
+ credit_balance DECIMAL(10,2) DEFAULT 0.00, -- Default balance 0
301
+ loyalty_points INT DEFAULT 0, -- Default points 0
302
+ login_attempts TINYINT DEFAULT 0, -- Default login attempts 0
303
+
304
+ -- System defaults for time fields
305
+ registration_time TIMESTAMP DEFAULT current_timestamp(),
306
+ last_login_time TIMESTAMP, -- NULL before first login
307
+ password_changed_at TIMESTAMP DEFAULT current_timestamp(),
308
+
309
+ -- Defaults for JSON fields
310
+ user_preferences JSON DEFAULT '{}', -- Default empty object
311
+ security_settings JSON DEFAULT '{"two_factor": false, "login_notifications": true}'
312
+ );
313
+ ```
314
+
315
+ ### Complete Generated Column Function List
316
+
317
+ Generated columns only support **deterministic scalar functions**. Below is the complete list of functions verified through testing:
318
+
319
+ #### Date/Time Functions
320
+
321
+ | Function Name | Description | Input Type | Return Type | Usage Example | Verified |
322
+ | ----------------- | ------------------------- | ---------------- | ----------- | -------------------------------- | -------- |
323
+ | `year()` | Extract year | DATE/TIMESTAMP | INT | `year(order_date)` | Passed |
324
+ | `month()` | Extract month | DATE/TIMESTAMP | INT | `month(order_date)` | Passed |
325
+ | `day()` | Extract day | DATE/TIMESTAMP | INT | `day(order_date)` | Passed |
326
+ | `hour()` | Extract hour | TIMESTAMP | INT | `hour(event_time)` | Passed |
327
+ | `minute()` | Extract minute | TIMESTAMP | INT | `minute(event_time)` | Passed |
328
+ | `second()` | Extract second | TIMESTAMP | INT | `second(event_time)` | Passed |
329
+ | `dayofweek()` | Day of week (1-7) | DATE/TIMESTAMP | INT | `dayofweek(order_date)` | Passed |
330
+ | `dayofyear()` | Day of year | DATE/TIMESTAMP | INT | `dayofyear(order_date)` | Passed |
331
+ | `quarter()` | Quarter (1-4) | DATE/TIMESTAMP | INT | `quarter(order_date)` | Passed |
332
+ | `date_format()` | Format date | DATE/TIMESTAMP | STRING | `date_format(dt, 'yyyy-MM-dd')` | Passed |
333
+
334
+ #### Math Functions
335
+
336
+ | Function Name | Description | Usage Example | Verified |
337
+ | ------------- | ---------------- | ------------------- | -------- |
338
+ | `abs()` | Absolute value | `abs(profit_loss)` | Passed |
339
+ | `round()` | Round | `round(amount, 2)` | Passed |
340
+ | `ceil()` | Ceiling | `ceil(price)` | Passed |
341
+ | `floor()` | Floor | `floor(score)` | Passed |
342
+ | `power()` | Power | `power(base, 2)` | Passed |
343
+ | `sqrt()` | Square root | `sqrt(area)` | Passed |
344
+ | `mod()` | Modulo | `mod(id, 10)` | Passed |
345
+
346
+ #### String Functions
347
+
348
+ | Function Name | Description | Usage Example | Return Type | Verified |
349
+ | ------------- | ------------------- | ------------------------------------- | ----------- | -------- |
350
+ | `concat()` | String concatenation| `concat(first_name, ' ', last_name)` | STRING | Passed |
351
+ | `length()` | String length | `length(username)` | INT | Passed |
352
+ | `upper()` | To uppercase | `upper(code)` | STRING | Passed |
353
+ | `lower()` | To lowercase | `lower(email)` | STRING | Passed |
354
+ | `trim()` | Remove leading/trailing spaces | `trim(input_text)` | STRING | Passed |
355
+ | `substr()` | Extract substring | `substr(phone, 1, 3)` | STRING | Passed |
356
+ | `replace()` | String replacement | `replace(text, 'old', 'new')` | STRING | Passed |
357
+
358
+ #### Type Conversion and Conditional Functions
359
+
360
+ | Function Name | Description | Usage Example | Verified |
361
+ | ------------- | ---------------------- | -------------------------------------------- | -------- |
362
+ | `cast()` | Type conversion | `cast(amount AS STRING)` | Passed |
363
+ | `string()` | To string | `string(user_id)` | Passed |
364
+ | `int()` | To integer | `int(price_str)` | Passed |
365
+ | `if()` | Simple conditional | `if(amount > 0, 'positive', 'negative')` | Passed |
366
+ | `coalesce()` | Null handling | `coalesce(nickname, username, 'anonymous')` | Passed |
367
+ | `nullif()` | Null conversion | `nullif(status, '')` | Passed |
368
+
369
+ #### Unsupported Non-Deterministic Functions (Confirmed by Testing)
370
+
371
+ The following functions are not supported in generated columns and will cause syntax errors:
372
+
373
+ * `current_timestamp()` - Current timestamp
374
+ * `current_date()` - Current date
375
+ * `random()` - Random number generation
376
+ * `uuid()` - UUID generation
377
+ * `current_user()` - Current user
378
+
379
+ **Comprehensive Generated Column Application Example**:
380
+
381
+ ```sql
382
+ CREATE TABLE comprehensive_generated_columns (
383
+ order_id BIGINT IDENTITY,
384
+ customer_name VARCHAR(100),
385
+ order_time TIMESTAMP NOT NULL,
386
+ total_amount DECIMAL(12,2),
387
+ discount_rate DECIMAL(5,4) DEFAULT 0,
388
+
389
+ -- Time dimension generated columns (for partitioning and analysis)
390
+ order_year INT GENERATED ALWAYS AS (year(order_time)),
391
+ order_month INT GENERATED ALWAYS AS (month(order_time)),
392
+ order_date STRING GENERATED ALWAYS AS (date_format(order_time, 'yyyy-MM-dd')),
393
+ order_hour INT GENERATED ALWAYS AS (hour(order_time)),
394
+ quarter_label STRING GENERATED ALWAYS AS (concat('Q', string(quarter(order_time)))),
395
+ weekday INT GENERATED ALWAYS AS (dayofweek(order_time)),
396
+
397
+ -- Business calculation generated columns
398
+ final_amount DECIMAL(12,2) GENERATED ALWAYS AS (round(total_amount * (1 - discount_rate), 2)),
399
+ amount_category STRING GENERATED ALWAYS AS (
400
+ if(total_amount < 100, 'small',
401
+ if(total_amount < 1000, 'medium', 'large'))
402
+ ),
403
+
404
+ -- String processing generated columns
405
+ customer_initial STRING GENERATED ALWAYS AS (upper(substr(trim(customer_name), 1, 1))),
406
+ name_length INT GENERATED ALWAYS AS (length(trim(customer_name))),
407
+ display_name STRING GENERATED ALWAYS AS (concat('[', string(order_id), '] ', customer_name)),
408
+ normalized_name STRING GENERATED ALWAYS AS (lower(trim(customer_name)))
409
+ )
410
+ PARTITIONED BY (order_date) -- Use generated column as partition key
411
+ COMMENT 'Order table - demonstrating various real-world use cases of generated columns';
412
+ ```
413
+
414
+ ***
415
+
416
+ ## Partition Architecture Design
417
+
418
+ ### Partition Strategy Selection Framework
419
+
420
+ #### Supported Partition Data Types (Confirmed by Testing)
421
+
422
+ | Type | Supported | Usage Advice | Practical Example | Test Status |
423
+ | -------------- | --------- | ---------------------------------- | ------------------------- | ------------- |
424
+ | `TINYINT` | Yes | Status/level partitioning | `status TINYINT` | Verified |
425
+ | `SMALLINT` | Yes | Year/month partitioning | `year_part SMALLINT` | Verified |
426
+ | `INT` | Yes | Common partition type | `user_id INT` | Verified |
427
+ | `BIGINT` | Yes | Large value partitioning | `account_id BIGINT` | Verified |
428
+ | `STRING` | Yes | **Most commonly used partition type** | `date_partition STRING` | Verified |
429
+ | `VARCHAR(n)` | Yes | Variable-length string partitioning| `region VARCHAR(50)` | Verified |
430
+ | `CHAR(n)` | Yes | Fixed-length partitioning | `country CHAR(2)` | Verified |
431
+ | `BOOLEAN` | Yes | Binary partitioning | `is_active BOOLEAN` | Verified |
432
+ | `DATE` | Yes | Date partitioning | `order_date DATE` | Verified |
433
+ | `TIMESTAMP` | No | Needs conversion to other type | Use generated column conversion | Confirmed limit |
434
+ | `FLOAT/DOUBLE` | No | Not recommended due to precision | Avoid | Confirmed limit |
435
+ | `DECIMAL` | No | Precision and performance concerns | Avoid | Confirmed limit |
436
+
437
+ #### Time Series Partition Patterns
438
+
439
+ **Pattern 1: Daily Partitioning (Recommended, Most Common)**
440
+
441
+ ```sql
442
+ CREATE TABLE daily_business_logs (
443
+ log_id BIGINT IDENTITY,
444
+ application VARCHAR(50) NOT NULL,
445
+ log_level VARCHAR(10) NOT NULL,
446
+ message STRING,
447
+ user_id INT,
448
+ log_timestamp TIMESTAMP NOT NULL,
449
+
450
+ -- Use generated column to create date partition key
451
+ date_partition STRING GENERATED ALWAYS AS (
452
+ date_format(log_timestamp, 'yyyy-MM-dd')
453
+ )
454
+ )
455
+ PARTITIONED BY (date_partition)
456
+ HASH CLUSTERED BY (application)
457
+ SORTED BY (log_timestamp DESC)
458
+ INTO 128 BUCKETS
459
+ COMMENT 'Business log table - partitioned by date for easy log management and querying';
460
+ ```
461
+
462
+ **Pattern 2: Hourly Partitioning (High-Frequency Data)**
463
+
464
+ ```sql
465
+ CREATE TABLE realtime_metrics (
466
+ metric_id BIGINT IDENTITY,
467
+ sensor_id VARCHAR(100) NOT NULL,
468
+ metric_value DOUBLE,
469
+ collect_time TIMESTAMP NOT NULL,
470
+
471
+ -- Hourly partitioning for real-time monitoring
472
+ hour_partition STRING GENERATED ALWAYS AS (
473
+ date_format(collect_time, 'yyyy-MM-dd-HH')
474
+ )
475
+ )
476
+ PARTITIONED BY (hour_partition)
477
+ HASH CLUSTERED BY (sensor_id)
478
+ SORTED BY (collect_time DESC)
479
+ INTO 512 BUCKETS
480
+ COMMENT 'Real-time metrics table - hourly partitioning for high-frequency data ingestion';
481
+ ```
482
+
483
+ **Pattern 3: Monthly Partitioning (Historical Archive)**
484
+
485
+ ```sql
486
+ CREATE TABLE monthly_report_data (
487
+ report_id BIGINT IDENTITY,
488
+ business_data JSON,
489
+ created_time TIMESTAMP NOT NULL,
490
+
491
+ -- Monthly partitioning to reduce partition count
492
+ month_partition STRING GENERATED ALWAYS AS (
493
+ date_format(created_time, 'yyyy-MM')
494
+ )
495
+ )
496
+ PARTITIONED BY (month_partition)
497
+ COMMENT 'Monthly report data - partitioned by month for optimized long-term storage';
498
+ ```
499
+
500
+ #### Business Dimension Partitioning Patterns
501
+
502
+ **Multi-Tenant Partitioning Pattern**:
503
+
504
+ ```sql
505
+ CREATE TABLE saas_tenant_data (
506
+ record_id BIGINT IDENTITY,
507
+ tenant_id VARCHAR(50) NOT NULL,
508
+ entity_type VARCHAR(50) NOT NULL,
509
+ entity_data JSON,
510
+ created_time TIMESTAMP DEFAULT current_timestamp(),
511
+
512
+ -- Partition by tenant for data isolation
513
+ tenant_partition STRING GENERATED ALWAYS AS (tenant_id)
514
+ )
515
+ PARTITIONED BY (tenant_partition)
516
+ HASH CLUSTERED BY (entity_type)
517
+ SORTED BY (created_time DESC)
518
+ INTO 64 BUCKETS
519
+ COMMENT 'Multi-tenant data table - partitioned by tenant ID for complete data isolation';
520
+ ```
521
+
522
+ **Geographic Region Partitioning Pattern**:
523
+
524
+ ```sql
525
+ CREATE TABLE global_order_data (
526
+ order_id BIGINT IDENTITY,
527
+ customer_id INT NOT NULL,
528
+ region VARCHAR(50) NOT NULL, -- Geographic region
529
+ country VARCHAR(50) NOT NULL,
530
+ order_data JSON,
531
+ order_time TIMESTAMP
532
+ )
533
+ PARTITIONED BY (region) -- Partition by region
534
+ HASH CLUSTERED BY (customer_id)
535
+ SORTED BY (order_time DESC)
536
+ INTO 128 BUCKETS
537
+ COMMENT 'Global order data - partitioned by geographic region for regionalized queries';
538
+ ```
539
+
540
+ #### Composite Partition Strategy (Advanced)
541
+
542
+ **Time + Business Dimension Dual Partitioning**:
543
+
544
+ ```sql
545
+ CREATE TABLE advanced_partitioning_example (
546
+ event_id BIGINT IDENTITY,
547
+ user_id INT NOT NULL,
548
+ business_type VARCHAR(50) NOT NULL,
549
+ event_time TIMESTAMP NOT NULL,
550
+ event_data JSON,
551
+
552
+ -- Composite partition keys
553
+ date_partition STRING GENERATED ALWAYS AS (date_format(event_time, 'yyyy-MM-dd')),
554
+ business_partition STRING GENERATED ALWAYS AS (business_type)
555
+ )
556
+ PARTITIONED BY (date_partition, business_partition) -- Dual partitioning
557
+ HASH CLUSTERED BY (user_id)
558
+ SORTED BY (event_time DESC)
559
+ INTO 256 BUCKETS
560
+ COMMENT 'Advanced partitioning example - dual partitioning by time and business dimension';
561
+ ```
562
+
563
+ ### Partition Management and Optimization
564
+
565
+ #### Dynamic Partition Limits
566
+
567
+ **Key Limitation**: A single insert task can create a maximum of 2048 dynamic partitions
568
+
569
+ ```sql
570
+ -- Operations that may exceed the limit
571
+ INSERT INTO large_partition_table
572
+ SELECT * FROM source_table_with_many_partitions; -- Fails if source table has >2048 partitions
573
+
574
+ -- Solution 1: Batch insert
575
+ INSERT INTO large_partition_table
576
+ SELECT * FROM source_table_with_many_partitions
577
+ WHERE date_column BETWEEN '2024-01-01' AND '2024-01-10'; -- Limit partition range
578
+
579
+ -- Solution 2: Loop insert (application-level implementation)
580
+ -- In the application, batch insert by dimensions like date/region, controlling to within 2000 partitions per batch
581
+ ```
582
+
583
+ #### Data Lifecycle Management
584
+
585
+ ```sql
586
+ -- Set table-level data lifecycle
587
+ CREATE TABLE lifecycle_managed_table (
588
+ record_id BIGINT IDENTITY,
589
+ business_data JSON,
590
+ created_time TIMESTAMP,
591
+
592
+ date_partition STRING GENERATED ALWAYS AS (date_format(created_time, 'yyyy-MM-dd'))
593
+ )
594
+ PARTITIONED BY (date_partition)
595
+ PROPERTIES ('data_lifecycle' = '90') -- Auto-cleanup after 90 days
596
+ COMMENT 'Lifecycle-managed table - 90-day data retention policy';
597
+ ```
598
+
599
+ ***
600
+
601
+ ## Bucketing and Sorting Optimization
602
+
603
+ ### Bucketing Strategy Design
604
+
605
+ #### Bucket Count Planning Guide
606
+
607
+ Bucket configuration recommendations based on practical testing:
608
+
609
+ | Data Size | Recommended Buckets | Target Size Per Bucket | Use Case | Test Result |
610
+ | ----------- | ------------------- | ---------------------- | --------------------- | ----------- |
611
+ | < 10GB | 16-32 | ~512MB | Small business tables, dimension tables | Passed |
612
+ | 10GB-1TB | 64-256 | ~1GB | Main business tables, fact tables | Passed |
613
+ | 1TB-10TB | 256-1024 | ~2GB | Large analytical tables, history tables | Recommended |
614
+ | > 10TB | 1024+ | ~4GB | Very large data warehouse tables | Architecture supports |
615
+
616
+ #### Bucket Column Selection Principles
617
+
618
+ 1. **High Cardinality Principle**: Choose columns with evenly distributed, high-cardinality values
619
+ 2. **Query Affinity**: Prioritize key columns used in JOINs and GROUP BY
620
+ 3. **Write Balance**: Avoid data skew and write hot spots
621
+
622
+ ```sql
623
+ -- Best practice: User behavior analysis table
624
+ CREATE TABLE user_behavior_optimized (
625
+ behavior_id BIGINT IDENTITY,
626
+ user_id INT NOT NULL, -- High cardinality, evenly distributed
627
+ session_id VARCHAR(100) NOT NULL,
628
+ behavior_type VARCHAR(50), -- Browse, click, purchase, etc.
629
+ behavior_time TIMESTAMP NOT NULL,
630
+ product_id INT,
631
+
632
+ -- Partition strategy
633
+ date_partition STRING GENERATED ALWAYS AS (date_format(behavior_time, 'yyyy-MM-dd'))
634
+ )
635
+ PARTITIONED BY (date_partition)
636
+ HASH CLUSTERED BY (user_id) -- User dimension bucketing for user behavior analysis
637
+ SORTED BY (behavior_time DESC, behavior_type ASC) -- Time descending + behavior type ascending
638
+ INTO 256 BUCKETS; -- Suitable for medium-to-large data volumes
639
+
640
+ -- Index optimization
641
+ CREATE BLOOMFILTER INDEX user_lookup_idx ON TABLE user_behavior_optimized(user_id);
642
+ CREATE BLOOMFILTER INDEX product_filter_idx ON TABLE user_behavior_optimized(product_id);
643
+ CREATE INVERTED INDEX behavior_type_idx ON TABLE user_behavior_optimized(behavior_type);
644
+ ```
645
+
646
+ ### Sorting Strategy Optimization
647
+
648
+ The choice of sort fields directly impacts query performance, especially for range queries and TOP-N queries:
649
+
650
+ ```sql
651
+ -- Sort optimization for financial transaction tables
652
+ CREATE TABLE financial_transactions_optimized (
653
+ transaction_id BIGINT IDENTITY,
654
+ account_id INT NOT NULL,
655
+ transaction_time TIMESTAMP NOT NULL,
656
+ amount DECIMAL(15,2) NOT NULL,
657
+ transaction_type VARCHAR(20) NOT NULL,
658
+ risk_score DECIMAL(5,3),
659
+
660
+ date_partition STRING GENERATED ALWAYS AS (date_format(transaction_time, 'yyyy-MM-dd'))
661
+ )
662
+ PARTITIONED BY (date_partition)
663
+ HASH CLUSTERED BY (account_id) -- Bucket by account
664
+ SORTED BY (
665
+ transaction_time DESC, -- Time descending: latest transactions first
666
+ amount DESC, -- Amount descending: large transactions first
667
+ risk_score DESC -- Risk score descending: high risk first
668
+ )
669
+ INTO 512 BUCKETS
670
+ COMMENT 'Financial transaction table - optimized for time, amount, and risk dimension query performance';
671
+ ```
672
+
673
+ ***
674
+
675
+ ## Index Architecture Design
676
+
677
+ ### Vector Index Detailed Configuration
678
+
679
+ #### Complete Distance Function Support List (All Verified)
680
+
681
+ **Full Test Verification**: All of the following distance functions have been thoroughly tested and confirmed to be fully available in the current version of the Singdata Lakehouse
682
+
683
+ | Distance Function | Use Case | Mathematical Property | Performance | Verification Status |
684
+ | -------------------- | -------------------------- | -------------------------- | ---------------- | ------------------- |
685
+ | `cosine_distance` | Text semantic similarity, recommendation systems | Angular distance, normalization-independent | Medium performance | **Fully Verified** |
686
+ | `l2_distance` | Image feature matching, Euclidean space | Euclidean distance | Higher performance | **Fully Verified** |
687
+ | `dot_product` | Dot product similarity, normalized vectors | Dot product (optimized for min/max) | **High Performance** | **Fully Verified** |
688
+ | `jaccard_distance` | Set similarity, sparse vectors | Intersection/union ratio | Medium performance | **Fully Verified** |
689
+ | `hamming_distance` | Binary features, hash codes | Bit difference count | High performance | **Fully Verified** |
690
+
691
+ #### Vector Index Scalar Type Configuration
692
+
693
+ | Scalar Type | Storage Precision | Supported Vector Column Types | Performance Impact | Use Case |
694
+ | ----------- | ----------------- | ----------------------------- | --------------------------- | ---------------------------- |
695
+ | `f32` | 32-bit float | INT, FLOAT | Standard performance, balanced precision | General recommendation, production-grade |
696
+ | `f16` | 16-bit float | INT, FLOAT | Higher performance, slight precision loss | Mobile, fast retrieval |
697
+ | `i8` | 8-bit integer | TINYINT, INT, FLOAT | High performance, quantized precision | Extreme performance requirements |
698
+ | `b1` | 1-bit binary | TINYINT, INT, FLOAT | Highest performance, smallest storage | Binary vectors, bloom filter |
699
+
700
+ #### HNSW Algorithm Parameter Details
701
+
702
+ | Parameter | Default | Recommended Range | Description | Performance Impact |
703
+ | ------------------- | ------- | ----------------- | ------------------------------------ | ---------------------------- |
704
+ | `m` | 16 | 8-64 | Maximum connections per node | Higher -> Better precision, higher memory |
705
+ | `ef.construction` | 128 | 64-1000 | Candidate set size during construction | Higher -> Better quality, longer build time |
706
+ | `max.elements` | auto | Based on data size| Estimated max vector count | Proper setting avoids rebuild |
707
+
708
+ #### Complete Vector Index Configuration Examples
709
+
710
+ ```sql
711
+ -- Create table with multiple vector types
712
+ CREATE TABLE comprehensive_vector_demo (
713
+ doc_id INT,
714
+ title VARCHAR(200),
715
+
716
+ -- Vector configurations for different scenarios
717
+ semantic_vector VECTOR(FLOAT, 768), -- Semantic search vector
718
+ image_vector VECTOR(FLOAT, 512), -- Image feature vector
719
+ user_vector VECTOR(INT, 256), -- User profile vector
720
+ binary_vector VECTOR(TINYINT, 128) -- Binary feature vector
721
+ );
722
+
723
+ -- High-quality semantic search index
724
+ CREATE VECTOR INDEX semantic_search_idx
725
+ ON TABLE comprehensive_vector_demo(semantic_vector)
726
+ PROPERTIES (
727
+ "distance.function" = "cosine_distance", -- Preferred for semantic similarity
728
+ "scalar.type" = "f32", -- Standard precision
729
+ "m" = "32", -- Higher connections for better precision
730
+ "ef.construction" = "400", -- High-quality construction
731
+ "reuse.vector.column" = "false", -- Independent storage for best performance
732
+ "compress.codec" = "uncompressed" -- No compression for guaranteed performance
733
+ );
734
+
735
+ -- Fast image retrieval index
736
+ CREATE VECTOR INDEX image_search_idx
737
+ ON TABLE comprehensive_vector_demo(image_vector)
738
+ PROPERTIES (
739
+ "distance.function" = "l2_distance", -- L2 distance suitable for image features
740
+ "scalar.type" = "f16", -- Half-precision for speed
741
+ "m" = "16", -- Standard connections
742
+ "ef.construction" = "128", -- Balance quality and speed
743
+ "reuse.vector.column" = "true", -- Reuse data to save space
744
+ "compress.codec" = "lz4" -- Light compression
745
+ );
746
+
747
+ -- Extreme performance binary index
748
+ CREATE VECTOR INDEX binary_search_idx
749
+ ON TABLE comprehensive_vector_demo(binary_vector)
750
+ PROPERTIES (
751
+ "distance.function" = "hamming_distance", -- Dedicated for binary vectors
752
+ "scalar.type" = "b1", -- 1-bit storage for minimal size
753
+ "m" = "16",
754
+ "ef.construction" = "128",
755
+ "conversion.rule" = "as_bits", -- Process per bit
756
+ "compress.codec" = "zstd", -- High compression ratio
757
+ "compress.level" = "best" -- Maximum compression
758
+ );
759
+
760
+ -- Recommendation system user profile index
761
+ CREATE VECTOR INDEX user_profile_idx
762
+ ON TABLE comprehensive_vector_demo(user_vector)
763
+ PROPERTIES (
764
+ "distance.function" = "dot_product", -- Dot product distance function
765
+ "scalar.type" = "i8", -- 8-bit integer suitable for discrete features
766
+ "m" = "24", -- Moderate connections
767
+ "ef.construction" = "200" -- Balanced construction quality
768
+ );
769
+ ```
770
+
771
+ ### Full-Text Search Index Configuration (Inverted Index)
772
+
773
+ #### Tokenizer Selection Guide
774
+
775
+ | Tokenizer | Language Support | Tokenization Rule | Case Handling | Use Case | Performance |
776
+ | ---------- | ---------------- | --------------------------- | ------------- | -------------------------- | ---------------- |
777
+ | `keyword` | Universal | No tokenization, exact match | Preserve case | Status codes, tags, IDs | **Highest Performance** |
778
+ | `english` | English | ASCII alphanumeric boundaries | Lowercase | English documents, product descriptions | Higher Performance |
779
+ | `chinese` | Chinese-English mixed | Chinese segmentation + English word | English lowercase | Chinese content, mixed text | Medium Performance |
780
+ | `unicode` | Multilingual | Unicode text boundaries | Lowercase | International content, multilingual | Lower Performance |
781
+
782
+ #### Inverted Index Support by Data Type
783
+
784
+ | Data Type | Index Support | Tokenizer Requirement | Use Case | Notes |
785
+ | ----------------- | --------------- | --------------------- | -------------------------- | --------------------------------------- |
786
+ | `STRING` | Supported | **Recommended** | Full-text search on long text | Recommend specifying analyzer for string types |
787
+ | `VARCHAR(n)` | Supported | **Recommended** | Title, description field search | Same requirements as STRING |
788
+ | `CHAR(n)` | Supported | **Recommended** | Fixed-length text | Less common use case |
789
+ | `INT/BIGINT` | Supported | Not needed | Numeric range query optimization | Auto-handled, efficient |
790
+ | `DECIMAL` | Supported | Not needed | Precise numeric queries | Common in financial scenarios |
791
+ | `DATE/TIMESTAMP` | Supported | Not needed | Time range query optimization | Essential for time-series data |
792
+ | `BOOLEAN` | Supported | Not needed | Boolean fast filtering | Status filtering optimization |
793
+ | `ARRAY<T>` | **Partially supported** | **analyzer NOT supported** | Tag lists, etc. | ARRAY type columns do not support the analyzer parameter |
794
+
795
+ #### Complete Inverted Index Application Examples
796
+
797
+ ```sql
798
+ -- Table design for comprehensive search scenarios
799
+ CREATE TABLE comprehensive_search_demo (
800
+ record_id BIGINT IDENTITY,
801
+
802
+ -- Text search fields
803
+ title VARCHAR(200) NOT NULL,
804
+ content STRING,
805
+ tags ARRAY<STRING>,
806
+ author VARCHAR(100),
807
+ category VARCHAR(50),
808
+
809
+ -- Numeric and time fields
810
+ price DECIMAL(10,2),
811
+ view_count INT,
812
+ rating TINYINT,
813
+ created_date DATE,
814
+ updated_time TIMESTAMP,
815
+ is_featured BOOLEAN DEFAULT false
816
+ );
817
+
818
+ -- Chinese title search index
819
+ CREATE INVERTED INDEX title_chinese_idx
820
+ ON TABLE comprehensive_search_demo(title)
821
+ PROPERTIES ('analyzer' = 'chinese');
822
+
823
+ -- Full-text content search index (multilingual)
824
+ CREATE INVERTED INDEX content_unicode_idx
825
+ ON TABLE comprehensive_search_demo(content)
826
+ PROPERTIES ('analyzer' = 'unicode');
827
+
828
+ -- Tag array index (cannot specify analyzer)
829
+ CREATE INVERTED INDEX tags_idx
830
+ ON TABLE comprehensive_search_demo(tags);
831
+
832
+ -- Author name search index
833
+ CREATE INVERTED INDEX author_keyword_idx
834
+ ON TABLE comprehensive_search_demo(author)
835
+ PROPERTIES ('analyzer' = 'keyword');
836
+
837
+ -- Numeric field range query optimization
838
+ CREATE INVERTED INDEX price_range_idx
839
+ ON TABLE comprehensive_search_demo(price);
840
+
841
+ CREATE INVERTED INDEX view_count_idx
842
+ ON TABLE comprehensive_search_demo(view_count);
843
+
844
+ CREATE INVERTED INDEX rating_idx
845
+ ON TABLE comprehensive_search_demo(rating);
846
+
847
+ -- Time field query optimization
848
+ CREATE INVERTED INDEX created_date_idx
849
+ ON TABLE comprehensive_search_demo(created_date);
850
+
851
+ CREATE INVERTED INDEX updated_time_idx
852
+ ON TABLE comprehensive_search_demo(updated_time);
853
+
854
+ -- Boolean field fast filtering
855
+ CREATE INVERTED INDEX featured_filter_idx
856
+ ON TABLE comprehensive_search_demo(is_featured);
857
+ ```
858
+
859
+ ### Bloom Filter Index Application (High-Cardinality Column Optimization)
860
+
861
+ #### Use Case Analysis
862
+
863
+ | Use Case | Cardinality Feature | Query Pattern | Optimization Effect | Practical Application |
864
+ | ---------------- | --------------------------- | ---------------- | ------------------- | --------------------- |
865
+ | User ID lookup | Extremely high (millions+) | = Exact match | Significant improvement | User behavior analysis |
866
+ | Email verification| High cardinality, strong uniqueness | = Existence check | Fast filtering | Registration dedup verification |
867
+ | Product SKU search| High cardinality, business unique | = Inventory query | Fast location | E-commerce inventory system |
868
+ | Order number query| Extremely high, unique | = Order lookup | Millisecond response | Order management system |
869
+ | Device ID monitoring| High cardinality, device unique | = Device status | Efficient filtering | IoT monitoring platform |
870
+
871
+ #### Bloom Filter Best Practices
872
+
873
+ ```sql
874
+ -- High-cardinality user management table
875
+ CREATE TABLE user_management_optimized (
876
+ user_id BIGINT IDENTITY,
877
+ username VARCHAR(50) NOT NULL,
878
+ email VARCHAR(320) NOT NULL,
879
+ mobile_phone VARCHAR(20),
880
+ id_card_hash VARCHAR(64), -- ID card hash
881
+ device_fingerprint VARCHAR(200), -- Device fingerprint
882
+
883
+ -- Core business fields
884
+ registration_date DATE,
885
+ last_login_time TIMESTAMP,
886
+ account_status TINYINT DEFAULT 1, -- 1=normal, 0=disabled, 2=locked
887
+ verification_level TINYINT DEFAULT 0 -- 0=unverified, 1=email, 2=phone, 3=real-name
888
+ );
889
+
890
+ -- Bloom filter indexes for high-cardinality fields
891
+ CREATE BLOOMFILTER INDEX username_bloom_idx
892
+ ON TABLE user_management_optimized(username);
893
+
894
+ CREATE BLOOMFILTER INDEX email_bloom_idx
895
+ ON TABLE user_management_optimized(email);
896
+
897
+ CREATE BLOOMFILTER INDEX phone_bloom_idx
898
+ ON TABLE user_management_optimized(mobile_phone);
899
+
900
+ CREATE BLOOMFILTER INDEX idcard_bloom_idx
901
+ ON TABLE user_management_optimized(id_card_hash);
902
+
903
+ CREATE BLOOMFILTER INDEX device_bloom_idx
904
+ ON TABLE user_management_optimized(device_fingerprint);
905
+
906
+ -- Practical query application examples
907
+ -- 1. Fast duplicate check during user registration
908
+ SELECT COUNT(*) FROM user_management_optimized
909
+ WHERE email = 'newuser@example.com'; -- Bloom filter fast filtering
910
+
911
+ -- 2. Fast location during user login
912
+ SELECT user_id, account_status, verification_level
913
+ FROM user_management_optimized
914
+ WHERE username = 'target_username'; -- Bloom filter accelerates lookup
915
+
916
+ -- 3. Device risk control check
917
+ SELECT user_id, COUNT(*) as device_usage_count
918
+ FROM user_management_optimized
919
+ WHERE device_fingerprint = 'specific_device_fp' -- Bloom filter fast matching
920
+ GROUP BY user_id;
921
+ ```
922
+
923
+ ### Index Naming and Management Standards
924
+
925
+ #### Index Naming Best Practices
926
+
927
+ **Note**: The current version of Singdata Lakehouse **strictly enforces schema-level uniqueness** for index naming.
928
+
929
+ #### Recommended Index Naming Convention
930
+
931
+ **Naming Format**: `{table_name}_{index_type}_{column_name}_idx`
932
+
933
+ **Index Type Abbreviations**:
934
+
935
+ * `vec` - Vector Index (VECTOR INDEX)
936
+ * `inv` - Inverted Index (INVERTED INDEX)
937
+ * `bloom` - Bloom Filter Index (BLOOMFILTER INDEX)
938
+
939
+ ```sql
940
+ -- Correct index naming practice
941
+ CREATE TABLE product_catalog (
942
+ product_id INT,
943
+ product_name VARCHAR(200),
944
+ description STRING,
945
+ category VARCHAR(100),
946
+ price DECIMAL(10,2),
947
+ features_vector VECTOR(FLOAT, 512)
948
+ );
949
+
950
+ -- Unique and descriptive index names
951
+ CREATE VECTOR INDEX products_vec_features_idx
952
+ ON TABLE product_catalog(features_vector)
953
+ PROPERTIES ("distance.function" = "cosine_distance");
954
+
955
+ CREATE INVERTED INDEX products_inv_name_idx
956
+ ON TABLE product_catalog(product_name)
957
+ PROPERTIES ('analyzer' = 'chinese');
958
+
959
+ CREATE INVERTED INDEX products_inv_desc_idx
960
+ ON TABLE product_catalog(description)
961
+ PROPERTIES ('analyzer' = 'unicode');
962
+
963
+ CREATE BLOOMFILTER INDEX products_bloom_category_idx
964
+ ON TABLE product_catalog(category);
965
+
966
+ -- Another table uses different index name prefixes
967
+ CREATE TABLE user_content (
968
+ content_id BIGINT IDENTITY,
969
+ content_text STRING,
970
+ content_vector VECTOR(FLOAT, 768)
971
+ );
972
+
973
+ CREATE VECTOR INDEX users_vec_content_idx -- Different table name prefix
974
+ ON TABLE user_content(content_vector)
975
+ PROPERTIES ("distance.function" = "cosine_distance");
976
+
977
+ CREATE INVERTED INDEX users_inv_text_idx -- Different table name prefix
978
+ ON TABLE user_content(content_text)
979
+ PROPERTIES ('analyzer' = 'chinese');
980
+ ```
981
+
982
+ ### Index Feature Limitations
983
+
984
+ #### IF NOT EXISTS Syntax Current Status
985
+
986
+ Based on the latest test verification, the index creation syntax currently **does not support** the IF NOT EXISTS option:
987
+
988
+ ```sql
989
+ -- Unsupported IF NOT EXISTS index syntax (causes syntax errors)
990
+ CREATE VECTOR INDEX IF NOT EXISTS vec_idx
991
+ ON TABLE example_table(embedding)
992
+ PROPERTIES ("distance.function" = "cosine_distance");
993
+
994
+ CREATE INVERTED INDEX IF NOT EXISTS text_idx
995
+ ON TABLE example_table(content)
996
+ PROPERTIES ('analyzer'='chinese');
997
+
998
+ CREATE BLOOMFILTER INDEX IF NOT EXISTS bloom_idx
999
+ ON TABLE example_table(user_id);
1000
+ ```
1001
+
1002
+ Before creating an index, it is recommended to first check whether the index exists to avoid errors:
1003
+
1004
+ ```sql
1005
+ -- Recommended approach: check if index exists first
1006
+ -- Then create
1007
+ CREATE VECTOR INDEX vec_idx ON TABLE example_table(embedding)
1008
+ PROPERTIES ("distance.function" = "cosine_distance");
1009
+ ```
1010
+
1011
+ #### Index Limitations on ARRAY Type Columns
1012
+
1013
+ Through testing, the following limitations exist when creating inverted indexes on ARRAY type columns:
1014
+
1015
+ ```sql
1016
+ -- ARRAY type columns do not support specifying the analyzer parameter
1017
+ CREATE TABLE array_column_table (
1018
+ id INT,
1019
+ tags ARRAY<STRING>
1020
+ );
1021
+
1022
+ -- Error: Specifying analyzer on ARRAY type column
1023
+ CREATE INVERTED INDEX tags_analyzer_idx
1024
+ ON TABLE array_column_table(tags)
1025
+ PROPERTIES ('analyzer' = 'keyword'); -- Fails!
1026
+
1027
+ -- Correct: Do not specify analyzer on ARRAY type column
1028
+ CREATE INVERTED INDEX tags_idx
1029
+ ON TABLE array_column_table(tags); -- Succeeds
1030
+
1031
+ -- Alternative: Use STRING type to store tags
1032
+ CREATE TABLE string_tags_table (
1033
+ id INT,
1034
+ tags_str STRING -- Comma-separated tag string
1035
+ );
1036
+
1037
+ CREATE INVERTED INDEX tags_str_idx
1038
+ ON TABLE string_tags_table(tags_str)
1039
+ PROPERTIES ('analyzer' = 'keyword'); -- Succeeds
1040
+ ```
1041
+
1042
+ ***
1043
+
1044
+ ## Performance Optimization Strategies
1045
+
1046
+ ### Query Performance Optimization Techniques
1047
+
1048
+ #### Partition Pruning Optimization
1049
+
1050
+ Ensure query conditions effectively leverage partition pruning:
1051
+
1052
+ ```sql
1053
+ -- Excellent query pattern: Full partition pruning utilization
1054
+ SELECT user_id, COUNT(*) as activity_count,
1055
+ AVG(session_duration) as avg_duration
1056
+ FROM user_activity_logs
1057
+ WHERE date_partition BETWEEN '2024-01-01' AND '2024-01-31' -- Partition pruning
1058
+ AND user_id IN (12345, 67890, 54321) -- Bucket targeting
1059
+ AND activity_type = 'purchase' -- Index filtering
1060
+ GROUP BY user_id
1061
+ ORDER BY activity_count DESC;
1062
+
1063
+ -- Query pattern to avoid: Cannot utilize partition pruning
1064
+ SELECT user_id, COUNT(*) as activity_count
1065
+ FROM user_activity_logs
1066
+ WHERE activity_time >= '2024-01-01 00:00:00' -- Using raw time column, no partition pruning
1067
+ AND activity_time <= '2024-01-31 23:59:59'
1068
+ GROUP BY user_id;
1069
+ ```
1070
+
1071
+ #### Multi-Dimensional Index Collaborative Optimization
1072
+
1073
+ ```sql
1074
+ -- Table structure designed for complex business queries
1075
+ CREATE TABLE business_analytics_optimized (
1076
+ record_id BIGINT IDENTITY,
1077
+ user_id INT NOT NULL,
1078
+ product_category VARCHAR(50) NOT NULL,
1079
+ event_type VARCHAR(50) NOT NULL,
1080
+ channel VARCHAR(30) NOT NULL,
1081
+ event_data JSON,
1082
+ revenue_amount DECIMAL(12,2),
1083
+ event_timestamp TIMESTAMP NOT NULL,
1084
+
1085
+ -- Partition key
1086
+ date_partition STRING GENERATED ALWAYS AS (date_format(event_timestamp, 'yyyy-MM-dd'))
1087
+ )
1088
+ PARTITIONED BY (date_partition) -- Time dimension partition pruning
1089
+ HASH CLUSTERED BY (user_id) -- User dimension bucket targeting
1090
+ SORTED BY (event_timestamp DESC, revenue_amount DESC) -- Dual sort by time and revenue
1091
+ INTO 512 BUCKETS;
1092
+
1093
+ -- Multi-dimensional index strategy
1094
+ CREATE BLOOMFILTER INDEX analytics_user_idx ON TABLE business_analytics_optimized(user_id);
1095
+ CREATE BLOOMFILTER INDEX analytics_category_idx ON TABLE business_analytics_optimized(product_category);
1096
+ CREATE BLOOMFILTER INDEX analytics_event_idx ON TABLE business_analytics_optimized(event_type);
1097
+ CREATE BLOOMFILTER INDEX analytics_channel_idx ON TABLE business_analytics_optimized(channel);
1098
+ CREATE INVERTED INDEX analytics_revenue_idx ON TABLE business_analytics_optimized(revenue_amount);
1099
+ CREATE INVERTED INDEX analytics_data_search_idx ON TABLE business_analytics_optimized(event_data)
1100
+ PROPERTIES ('analyzer' = 'unicode');
1101
+
1102
+ -- Efficient multi-dimensional business query
1103
+ SELECT
1104
+ product_category,
1105
+ event_type,
1106
+ COUNT(*) as event_count,
1107
+ SUM(revenue_amount) as total_revenue,
1108
+ AVG(revenue_amount) as avg_revenue
1109
+ FROM business_analytics_optimized
1110
+ WHERE date_partition = '2024-01-15' -- Partition pruning
1111
+ AND user_id IN (SELECT user_id FROM vip_users) -- Bucket targeting + bloom filter
1112
+ AND product_category = 'electronics' -- Bloom filter
1113
+ AND event_type = 'purchase' -- Bloom filter
1114
+ AND channel = 'mobile_app' -- Bloom filter
1115
+ AND revenue_amount > 100 -- Inverted index range query
1116
+ GROUP BY product_category, event_type
1117
+ ORDER BY total_revenue DESC;
1118
+ ```
1119
+
1120
+ #### Vector Similarity Query Optimization
1121
+
1122
+ ```sql
1123
+ -- Vector search performance optimization example
1124
+ CREATE TABLE vector_search_performance (
1125
+ doc_id INT,
1126
+ doc_title VARCHAR(200),
1127
+ doc_category VARCHAR(50),
1128
+ content_embedding VECTOR(FLOAT, 768),
1129
+ summary_embedding VECTOR(FLOAT, 256), -- Lower dimension for fast pre-filtering
1130
+ created_date DATE,
1131
+
1132
+ date_partition STRING GENERATED ALWAYS AS (date_format(created_date, 'yyyy-MM-dd'))
1133
+ )
1134
+ PARTITIONED BY (date_partition);
1135
+
1136
+ -- High-performance vector index
1137
+ CREATE VECTOR INDEX content_semantic_idx
1138
+ ON TABLE vector_search_performance(content_embedding)
1139
+ PROPERTIES (
1140
+ "distance.function" = "cosine_distance",
1141
+ "scalar.type" = "f32",
1142
+ "m" = "32", -- Higher connections for better recall
1143
+ "ef.construction" = "400", -- High-quality construction
1144
+ "reuse.vector.column" = "false" -- Independent storage for optimal performance
1145
+ );
1146
+
1147
+ -- Fast pre-filter vector index
1148
+ CREATE VECTOR INDEX summary_fast_idx
1149
+ ON TABLE vector_search_performance(summary_embedding)
1150
+ PROPERTIES (
1151
+ "distance.function" = "dot_product", -- Dot product distance function
1152
+ "scalar.type" = "f16", -- Half-precision for speed
1153
+ "m" = "16",
1154
+ "ef.construction" = "128"
1155
+ );
1156
+
1157
+ -- Traditional index-assisted filtering
1158
+ CREATE BLOOMFILTER INDEX doc_category_idx ON TABLE vector_search_performance(doc_category);
1159
+
1160
+ -- Multi-level vector search strategy example
1161
+ -- 1. Coarse filtering: Use small vectors for fast pre-filtering
1162
+ -- 2. Fine ranking: Use large vectors for precise calculation
1163
+ -- 3. Filtering: Combine with traditional indexes for further filtering
1164
+ ```
1165
+
1166
+ ### Storage Cost Optimization Strategies
1167
+
1168
+ #### Precise Data Type Selection (Storage Optimization)
1169
+
1170
+ ```sql
1171
+ -- Table design example for storage cost optimization
1172
+ CREATE TABLE storage_cost_optimized (
1173
+ -- Primary key field: necessary storage overhead
1174
+ record_id BIGINT IDENTITY, -- 8 bytes, required auto-increment PK
1175
+
1176
+ -- Business ID fields: Choose types based on actual needs
1177
+ user_id INT NOT NULL, -- 4 bytes, supports 4.2 billion users
1178
+ product_id INT NOT NULL, -- 4 bytes, supports 4.2 billion products
1179
+ order_id BIGINT NOT NULL, -- 8 bytes, supports very large order volumes
1180
+
1181
+ -- Status enum fields: Use smallest type
1182
+ order_status TINYINT DEFAULT 1, -- 1 byte vs VARCHAR(20) 20 bytes, saves 95%
1183
+ priority_level TINYINT DEFAULT 0, -- 1 byte, 0-255 levels sufficient
1184
+ user_level TINYINT DEFAULT 1, -- 1 byte, VIP level enumeration
1185
+
1186
+ -- Boolean fields: Clear semantics
1187
+ is_paid BOOLEAN DEFAULT false, -- 1 byte vs VARCHAR(10) 10 bytes, saves 90%
1188
+ is_shipped BOOLEAN DEFAULT false, -- 1 byte, clear boolean semantics
1189
+ is_gift BOOLEAN DEFAULT false, -- 1 byte, gift flag
1190
+
1191
+ -- Time fields: Choose based on precision requirements
1192
+ order_date DATE, -- 4 bytes, scenarios not needing time of day
1193
+ created_timestamp TIMESTAMP, -- 8 bytes, scenarios needing precise time
1194
+ shipped_date DATE, -- 4 bytes, ship date is sufficient
1195
+
1196
+ -- Amount fields: Precise calculation
1197
+ item_price DECIMAL(10,2), -- Precise amount vs DOUBLE precision risk
1198
+ total_amount DECIMAL(12,2), -- Supports larger amounts
1199
+ discount_amount DECIMAL(8,2), -- Discount amount range is smaller
1200
+
1201
+ -- String fields: Precise length settings
1202
+ customer_name VARCHAR(100), -- 100 chars covers 99.5% of real-world cases
1203
+ email VARCHAR(320), -- RFC5321 standard length
1204
+ phone VARCHAR(20), -- Supports international format +86-13812345678
1205
+ address VARCHAR(500), -- Reasonable length for address info
1206
+
1207
+ -- Complex data: Use appropriately
1208
+ order_metadata JSON, -- Extended properties vs many sparse columns
1209
+
1210
+ -- Category IDs: Use integers instead of strings
1211
+ category_id SMALLINT, -- 2 bytes ID vs VARCHAR(50) 50 bytes, saves 96%
1212
+ subcategory_id SMALLINT, -- 2 bytes, supports 65K categories
1213
+ brand_id SMALLINT -- 2 bytes, brand ID
1214
+ )
1215
+ COMMENT 'Storage cost optimization design - achieving optimal balance between functional requirements and storage costs';
1216
+
1217
+ -- Storage savings analysis:
1218
+ -- Status fields: VARCHAR(20) -> TINYINT, saving 19 bytes per row
1219
+ -- Boolean fields: VARCHAR(10) -> BOOLEAN, saving 9 bytes per row
1220
+ -- Category fields: VARCHAR(50) -> SMALLINT, saving 48 bytes per row
1221
+ -- Total savings: ~76 bytes per row, ~760MB saved for tens of millions of records
1222
+ ```
1223
+
1224
+ #### Bucket Count Optimization Strategy
1225
+
1226
+ ```sql
1227
+ -- Bucket optimization examples based on data scale
1228
+
1229
+ -- Small table optimization (< 10GB): Avoid excessive bucketing
1230
+ CREATE TABLE small_table_optimized (
1231
+ id BIGINT IDENTITY,
1232
+ name VARCHAR(100),
1233
+ category VARCHAR(50),
1234
+ data JSON
1235
+ )
1236
+ HASH CLUSTERED BY (category) -- Bucket by business dimension
1237
+ SORTED BY (id ASC) -- Simple sorting
1238
+ INTO 16 BUCKETS -- Moderate bucket count, avoids small file issues
1239
+ COMMENT 'Small table optimization - 16 buckets balance performance and management complexity';
1240
+
1241
+ -- Medium table optimization (10GB-1TB): Standard configuration
1242
+ CREATE TABLE medium_table_optimized (
1243
+ record_id BIGINT IDENTITY,
1244
+ user_id INT NOT NULL,
1245
+ business_data JSON,
1246
+ created_time TIMESTAMP,
1247
+
1248
+ date_partition STRING GENERATED ALWAYS AS (date_format(created_time, 'yyyy-MM-dd'))
1249
+ )
1250
+ PARTITIONED BY (date_partition)
1251
+ HASH CLUSTERED BY (user_id) -- High-cardinality column bucketing
1252
+ SORTED BY (created_time DESC) -- Time sorting
1253
+ INTO 128 BUCKETS -- Standard bucket count, balances concurrency and file size
1254
+ COMMENT 'Medium table optimization - 128 buckets suitable for mainstream business scenarios';
1255
+
1256
+ -- Large table optimization (> 1TB): High concurrency configuration
1257
+ CREATE TABLE large_table_optimized (
1258
+ event_id BIGINT IDENTITY,
1259
+ user_id INT NOT NULL,
1260
+ session_id VARCHAR(100),
1261
+ event_data JSON,
1262
+ event_time TIMESTAMP,
1263
+
1264
+ date_partition STRING GENERATED ALWAYS AS (date_format(event_time, 'yyyy-MM-dd'))
1265
+ )
1266
+ PARTITIONED BY (date_partition)
1267
+ HASH CLUSTERED BY (user_id, session_id) -- Composite bucketing for better distribution uniformity
1268
+ SORTED BY (event_time DESC)
1269
+ INTO 512 BUCKETS -- High bucket count for high-concurrency writes and queries
1270
+ COMMENT 'Large table optimization - 512 buckets support large-scale concurrent processing';
1271
+ ```
1272
+
1273
+ ***
1274
+
1275
+ ## Common Design Pitfalls and Solutions
1276
+
1277
+ ### Data Type Design Pitfalls
1278
+
1279
+ #### Pitfall 1: Wrong IDENTITY Column Type
1280
+
1281
+ **Error Scenario**:
1282
+
1283
+ ```sql
1284
+ -- All of the following IDENTITY declarations will fail
1285
+ CREATE TABLE identity_type_errors (
1286
+ id INT IDENTITY, -- Fails: INT type not supported
1287
+ small_id SMALLINT IDENTITY, -- Fails: SMALLINT type not supported
1288
+ char_id CHAR(10) IDENTITY, -- Fails: character type not supported
1289
+ decimal_id DECIMAL(10,0) IDENTITY -- Fails: DECIMAL type not supported
1290
+ );
1291
+
1292
+ -- Error message: invalid identity column type int, currently only BIGINT is supported
1293
+ ```
1294
+
1295
+ **Correct Solution**:
1296
+
1297
+ ```sql
1298
+ -- Correct: Uniformly use BIGINT IDENTITY
1299
+ CREATE TABLE identity_correct_usage (
1300
+ id BIGINT IDENTITY, -- Only supported IDENTITY type
1301
+ user_id INT NOT NULL, -- Business IDs use other appropriate types
1302
+ order_code VARCHAR(50) NOT NULL, -- Business codes use strings
1303
+ sequence_num INT DEFAULT 1 -- Sequence numbers use plain INT
1304
+ ) COMMENT 'IDENTITY column correct usage example';
1305
+ ```
1306
+
1307
+ #### Pitfall 2: Improper VARCHAR Length Settings
1308
+
1309
+ **Problem Analysis**:
1310
+
1311
+ ```sql
1312
+ -- Common length setting errors
1313
+ CREATE TABLE varchar_length_problems (
1314
+ name VARCHAR(10000), -- Overallocation: wastes storage space
1315
+ email VARCHAR(50), -- Insufficient length: email standard is 320 characters
1316
+ phone VARCHAR(255), -- Overallocation: 20 characters sufficient for phone
1317
+ title VARCHAR(100), -- Insufficient length: article titles typically need 200 chars
1318
+ description VARCHAR(500000) -- Massive allocation: should use STRING type
1319
+ );
1320
+ ```
1321
+
1322
+ **Optimized Solution**:
1323
+
1324
+ ```sql
1325
+ -- Reasonable length settings based on actual business requirements
1326
+ CREATE TABLE varchar_length_optimized (
1327
+ name VARCHAR(100), -- Name: covers 99.5% of real-world cases
1328
+ email VARCHAR(320), -- Email: RFC5321 international standard length
1329
+ phone VARCHAR(20), -- Phone: supports international format +86-13812345678
1330
+ title VARCHAR(200), -- Title: balances SEO needs and storage efficiency
1331
+ summary VARCHAR(500), -- Summary: reasonable summary length
1332
+ description STRING -- Long description: use STRING for variable length
1333
+ ) COMMENT 'VARCHAR length optimization - reasonable settings based on actual business research';
1334
+ ```
1335
+
1336
+ #### Pitfall 3: Using Float Types for Financial Calculations
1337
+
1338
+ **Risk Demonstration**:
1339
+
1340
+ ```sql
1341
+ -- Precision problems with float types in financial calculations
1342
+ CREATE TABLE financial_precision_risks (
1343
+ account_id INT,
1344
+ balance DOUBLE, -- Risk: floating point precision issues
1345
+ interest_rate FLOAT, -- Risk: cumulative compound calculation errors
1346
+ transaction_amount DOUBLE -- Risk: transaction amount calculation errors
1347
+ );
1348
+
1349
+ -- Precision problem demonstration
1350
+ INSERT INTO financial_precision_risks VALUES
1351
+ (1, 0.1 + 0.2, 0.001, 1.0);
1352
+ -- Expected: balance = 0.3
1353
+ -- Actual: balance = 0.30000000000000004 (precision error)
1354
+
1355
+ -- Compound calculation error demonstration
1356
+ SELECT
1357
+ balance * interest_rate as calculated_interest, -- May produce precision errors
1358
+ (balance * interest_rate * 12) as annual_interest -- Errors are amplified
1359
+ FROM financial_precision_risks;
1360
+ ```
1361
+
1362
+ **Correct Solution**:
1363
+
1364
+ ```sql
1365
+ -- Use precise DECIMAL type for financial calculations
1366
+ CREATE TABLE financial_precision_correct (
1367
+ account_id INT,
1368
+ balance DECIMAL(15,2), -- Precise: supports tens of millions, 2 decimal places
1369
+ interest_rate DECIMAL(8,6), -- Precise: supports interest rate, 6 decimal precision
1370
+ transaction_amount DECIMAL(15,2), -- Precise: no precision loss in transaction amounts
1371
+
1372
+ -- DECIMAL configurations for different business scenarios
1373
+ daily_limit DECIMAL(10,2), -- Daily limit: ten-thousand-level amounts
1374
+ annual_fee DECIMAL(8,2), -- Annual fee: thousand-level amounts
1375
+ exchange_rate DECIMAL(10,8) -- Exchange rate: high-precision decimal
1376
+ ) COMMENT 'Financial data precise calculation - using DECIMAL to ensure calculation accuracy';
1377
+
1378
+ -- Precise calculation verification
1379
+ INSERT INTO financial_precision_correct VALUES
1380
+ (1, 0.30, 0.001000, 1.00, 5000.00, 200.00, 6.78901234);
1381
+
1382
+ -- Precise compound calculations
1383
+ SELECT
1384
+ balance * interest_rate as precise_interest, -- Precise calculation
1385
+ balance * interest_rate * 12 as precise_annual, -- Precise annualized calculation
1386
+ transaction_amount * exchange_rate as precise_conversion -- Precise exchange rate conversion
1387
+ FROM financial_precision_correct;
1388
+ ```
1389
+
1390
+ ### Partition Design Pitfalls
1391
+
1392
+ #### Pitfall 4: Unsupported Partition Column Types
1393
+
1394
+ **Error Scenario**:
1395
+
1396
+ ```sql
1397
+ -- Unsupported partition column types (confirmed to fail in testing)
1398
+ CREATE TABLE partition_type_errors (
1399
+ id INT,
1400
+ amount DECIMAL(10,2), -- DECIMAL not supported for direct partitioning
1401
+ price DOUBLE, -- DOUBLE not supported for partitioning
1402
+ created_time TIMESTAMP, -- TIMESTAMP cannot be used directly for partitioning
1403
+ location_point STRUCT<lat:DOUBLE,lng:DOUBLE> -- Complex types not supported for partitioning
1404
+ )
1405
+ PARTITIONED BY (created_time); -- Fails!
1406
+
1407
+ -- Error message example:
1408
+ -- Unsupported data type for partition transform: timestamp_ltz
1409
+ ```
1410
+
1411
+ **Correct Solution**:
1412
+
1413
+ ```sql
1414
+ -- Use generated columns to convert to supported partition types
1415
+ CREATE TABLE partition_type_solutions (
1416
+ id INT,
1417
+ amount DECIMAL(10,2),
1418
+ price DOUBLE,
1419
+ created_time TIMESTAMP,
1420
+ location_point STRUCT<lat:DOUBLE,lng:DOUBLE>,
1421
+
1422
+ -- Use generated column to convert TIMESTAMP to STRING (supports partitioning)
1423
+ date_partition STRING GENERATED ALWAYS AS (
1424
+ date_format(created_time, 'yyyy-MM-dd')
1425
+ ),
1426
+
1427
+ -- Use generated column to convert DECIMAL to category (supports partitioning)
1428
+ amount_range STRING GENERATED ALWAYS AS (
1429
+ if(amount < 100, 'small',
1430
+ if(amount < 1000, 'medium', 'large'))
1431
+ ),
1432
+
1433
+ -- Use generated column to extract field from complex type (supports partitioning)
1434
+ location_region STRING GENERATED ALWAYS AS (
1435
+ if(location_point.lat > 35, 'north', 'south')
1436
+ )
1437
+ )
1438
+ PARTITIONED BY (date_partition) -- Success: STRING type supports partitioning
1439
+ COMMENT 'Partition type solutions - using generated columns to convert unsupported types';
1440
+ ```
1441
+
1442
+ #### Pitfall 5: Dynamic Partition Count Exceeded
1443
+
1444
+ **Problem Scenario**:
1445
+
1446
+ ```sql
1447
+ -- Operations that may exceed dynamic partition limit
1448
+ INSERT INTO large_partition_table
1449
+ SELECT * FROM source_table_with_many_dates; -- Fails if source table has >2048 distinct dates
1450
+
1451
+ -- Error message:
1452
+ -- The count of dynamic partitions exceeds the maximum number 2048
1453
+ ```
1454
+
1455
+ **Solution Strategies**:
1456
+
1457
+ ```sql
1458
+ -- Strategy 1: Batch insert by time range
1459
+ INSERT INTO large_partition_table
1460
+ SELECT * FROM source_table_with_many_dates
1461
+ WHERE event_date BETWEEN '2024-01-01' AND '2024-01-10'; -- Limit partition range
1462
+
1463
+ INSERT INTO large_partition_table
1464
+ SELECT * FROM source_table_with_many_dates
1465
+ WHERE event_date BETWEEN '2024-02-01' AND '2024-02-29'; -- Second batch: 29 partitions
1466
+
1467
+ -- Strategy 2: Batch insert by partition value
1468
+ INSERT INTO large_partition_table
1469
+ SELECT * FROM source_table_with_many_dates
1470
+ WHERE region IN ('north', 'south', 'east', 'west'); -- Limit to 4 partitions
1471
+
1472
+ -- Strategy 3: Pre-filter data
1473
+ WITH filtered_source AS (
1474
+ SELECT *,
1475
+ date_format(event_timestamp, 'yyyy-MM-dd') as date_part
1476
+ FROM source_table_with_many_dates
1477
+ WHERE event_timestamp >= '2024-01-01' -- Pre-filter to reduce partition count
1478
+ AND event_timestamp < '2024-02-01'
1479
+ )
1480
+ INSERT INTO large_partition_table
1481
+ SELECT * FROM filtered_source;
1482
+
1483
+ -- Strategy 4: Application-level loop control (pseudocode)
1484
+ -- for month in ['2024-01', '2024-02', ...]:
1485
+ -- INSERT INTO table SELECT * FROM source WHERE month_partition = month
1486
+ ```
1487
+
1488
+ ### Index Design Pitfalls
1489
+
1490
+ #### Pitfall 6: Index Naming Management
1491
+
1492
+ **Important Update**: Through testing, the current version of the Singdata Lakehouse strictly enforces schema-level uniqueness for index names.
1493
+
1494
+ ```sql
1495
+ -- Use table name prefix for unique index naming
1496
+ CREATE TABLE orders (
1497
+ order_id INT,
1498
+ customer_id INT,
1499
+ order_content STRING
1500
+ );
1501
+ CREATE INVERTED INDEX orders_inv_customer_idx ON TABLE orders(customer_id);
1502
+ CREATE INVERTED INDEX orders_inv_content_idx ON TABLE orders(order_content)
1503
+ PROPERTIES('analyzer'='keyword');
1504
+
1505
+ CREATE TABLE products (
1506
+ product_id INT,
1507
+ customer_id INT,
1508
+ product_description STRING
1509
+ );
1510
+ CREATE INVERTED INDEX products_inv_customer_idx ON TABLE products(customer_id);
1511
+ CREATE INVERTED INDEX products_inv_desc_idx ON TABLE products(product_description)
1512
+ PROPERTIES('analyzer'='chinese');
1513
+
1514
+ -- Recommended index naming convention
1515
+ -- Format: {table_name}_{index_type}_{column_name}_idx
1516
+ -- Examples: users_bloom_email_idx, orders_vec_features_idx
1517
+ ```
1518
+
1519
+ #### Pitfall 7: PRIMARY KEY Constraint Conflict with HASH CLUSTERED BY
1520
+
1521
+ **Problem Scenario**:
1522
+
1523
+ ```sql
1524
+ -- PRIMARY KEY constraint conflicts with HASH CLUSTERED BY
1525
+ CREATE TABLE table_with_conflict (
1526
+ tenant_id VARCHAR(50) PRIMARY KEY,
1527
+ tenant_name VARCHAR(200) NOT NULL,
1528
+ tenant_status TINYINT DEFAULT 1
1529
+ )
1530
+ HASH CLUSTERED BY (tenant_id) -- Conflicts with PRIMARY KEY
1531
+ INTO 32 BUCKETS;
1532
+
1533
+ -- Error message: CLUSTERED BY definition conflicts with enforced PRIMARY KEY
1534
+ -- or UNIQUE constraints defined at :[31,2], must HASH CLUSTERED BY ... SORTED BY ... ASC
1535
+ -- with all PRIMARY KEY or UNIQUE columns
1536
+ ```
1537
+
1538
+ **Solutions**:
1539
+
1540
+ ```sql
1541
+ -- Solution 1: Remove PRIMARY KEY constraint, use plain non-null column
1542
+ CREATE TABLE solution_remove_pk (
1543
+ tenant_id VARCHAR(50) NOT NULL, -- Remove PRIMARY KEY
1544
+ tenant_name VARCHAR(200) NOT NULL,
1545
+ tenant_status TINYINT DEFAULT 1
1546
+ )
1547
+ HASH CLUSTERED BY (tenant_id)
1548
+ INTO 32 BUCKETS;
1549
+
1550
+ -- Solution 2: Adjust HASH CLUSTERED BY and SORTED BY to meet requirements
1551
+ CREATE TABLE solution_adjust_cluster (
1552
+ tenant_id VARCHAR(50) PRIMARY KEY,
1553
+ tenant_name VARCHAR(200) NOT NULL,
1554
+ tenant_status TINYINT DEFAULT 1
1555
+ )
1556
+ HASH CLUSTERED BY (tenant_id) -- Keep consistent with PRIMARY KEY
1557
+ SORTED BY (tenant_id ASC) -- Add sorting with ASC
1558
+ INTO 32 BUCKETS;
1559
+ ```
1560
+
1561
+ #### PRIMARY KEY and Bucketing Strategy Best Practices
1562
+
1563
+ Based on test-verified results, we recommend the following design guidance:
1564
+
1565
+ 1. **Avoid using both simultaneously**: In most scenarios, avoid using both PRIMARY KEY constraints and HASH CLUSTERED BY. Choose one:
1566
+ * For uniqueness constraint scenarios, use PRIMARY KEY
1567
+ * For performance optimization on large tables, use HASH CLUSTERED BY with bloom filter indexes
1568
+
1569
+ 2. **Rules when both must be used**: If business needs require using both, the following conditions must ALL be met:
1570
+ * The HASH CLUSTERED BY column(s) must include ALL PRIMARY KEY columns
1571
+ * A SORTED BY clause must be added
1572
+ * The SORTED BY clause must include ALL PRIMARY KEY columns
1573
+ * All PRIMARY KEY columns in SORTED BY must use ASC sort direction
1574
+
1575
+ 3. **Reference examples**:
1576
+
1577
+ ```sql
1578
+ -- Best practice 1: Only use PRIMARY KEY (recommended for small tables)
1579
+ CREATE TABLE customer_profiles (
1580
+ customer_id INT PRIMARY KEY,
1581
+ customer_name VARCHAR(100) NOT NULL,
1582
+ customer_email VARCHAR(200)
1583
+ );
1584
+
1585
+ -- Best practice 2: Only use HASH CLUSTERED BY (recommended for large tables)
1586
+ CREATE TABLE customer_events (
1587
+ event_id BIGINT IDENTITY,
1588
+ customer_id INT NOT NULL,
1589
+ event_type VARCHAR(50),
1590
+ event_time TIMESTAMP
1591
+ )
1592
+ HASH CLUSTERED BY (customer_id)
1593
+ SORTED BY (event_time DESC)
1594
+ INTO 128 BUCKETS;
1595
+
1596
+ -- Create bloom filter index for efficient lookup
1597
+ CREATE BLOOMFILTER INDEX customer_lookup_idx
1598
+ ON TABLE customer_events(customer_id);
1599
+
1600
+ -- Best practice 3: Correct configuration when both must be used
1601
+ CREATE TABLE order_items (
1602
+ order_id INT,
1603
+ item_id INT,
1604
+ product_id INT,
1605
+ quantity INT,
1606
+ PRIMARY KEY (order_id, item_id)
1607
+ )
1608
+ HASH CLUSTERED BY (order_id, item_id) -- Includes all PRIMARY KEY columns
1609
+ SORTED BY (order_id ASC, item_id ASC) -- Includes all PRIMARY KEY columns, all ASC
1610
+ INTO 64 BUCKETS;
1611
+ ```
1612
+
1613
+ #### Pitfall 8: Incorrect analyzer Usage on ARRAY Type Columns
1614
+
1615
+ **Error Scenario**:
1616
+
1617
+ ```sql
1618
+ -- Using analyzer on ARRAY type columns causes errors
1619
+ CREATE TABLE array_column_table (
1620
+ id INT,
1621
+ tags ARRAY<STRING>
1622
+ );
1623
+
1624
+ CREATE INVERTED INDEX tags_analyzer_idx
1625
+ ON TABLE array_column_table(tags)
1626
+ PROPERTIES ('analyzer' = 'keyword'); -- Fails! ARRAY type does not support analyzer parameter
1627
+
1628
+ -- Error message example:
1629
+ -- invalid.inverted.index.analyzer.type, array<string>
1630
+ ```
1631
+
1632
+ **Correct Solution**:
1633
+
1634
+ ```sql
1635
+ -- Correct: Create inverted index on ARRAY column without specifying analyzer
1636
+ CREATE INVERTED INDEX tags_idx
1637
+ ON TABLE array_column_table(tags); -- Succeeds: no analyzer specified
1638
+
1639
+ -- Or use STRING type with delimiter
1640
+ CREATE TABLE string_tags_table (
1641
+ id INT,
1642
+ tags_str STRING -- Comma-separated tag string
1643
+ );
1644
+
1645
+ CREATE INVERTED INDEX tags_str_idx
1646
+ ON TABLE string_tags_table(tags_str)
1647
+ PROPERTIES ('analyzer' = 'keyword'); -- Succeeds: STRING type supports analyzer
1648
+ ```
1649
+
1650
+ ### Generated Column Design Pitfalls
1651
+
1652
+ #### Pitfall 9: Using Non-Deterministic Functions in Generated Columns
1653
+
1654
+ **Error Scenario**:
1655
+
1656
+ ```sql
1657
+ -- Using non-deterministic functions in generated columns (confirmed to fail in testing)
1658
+ CREATE TABLE generated_column_errors (
1659
+ id INT,
1660
+ event_data VARCHAR(1000),
1661
+
1662
+ -- All of the following generated columns will cause creation failure
1663
+ auto_timestamp TIMESTAMP GENERATED ALWAYS AS (current_timestamp()), -- Fails
1664
+ random_id DOUBLE GENERATED ALWAYS AS (random()), -- Fails
1665
+ current_user_name STRING GENERATED ALWAYS AS (current_user()), -- Fails
1666
+ uuid_value STRING GENERATED ALWAYS AS (uuid()) -- Fails
1667
+ );
1668
+
1669
+ -- Error message: Generated column auto_timestamp only contains built-in/scalar/deterministic function
1670
+ ```
1671
+
1672
+ **Correct Solution**:
1673
+
1674
+ ```sql
1675
+ -- Distinguish between generated columns and default values
1676
+ CREATE TABLE generated_column_solutions (
1677
+ id INT,
1678
+ event_time TIMESTAMP,
1679
+ event_data VARCHAR(1000),
1680
+ amount DECIMAL(10,2),
1681
+
1682
+ -- Use DEFAULT values instead of generated columns (for non-deterministic functions)
1683
+ created_timestamp TIMESTAMP DEFAULT current_timestamp(),
1684
+ random_seed DOUBLE DEFAULT random(),
1685
+ creator_name STRING DEFAULT current_user(),
1686
+
1687
+ -- Generated columns use deterministic functions (computed from other columns)
1688
+ event_year INT GENERATED ALWAYS AS (year(event_time)),
1689
+ event_date STRING GENERATED ALWAYS AS (date_format(event_time, 'yyyy-MM-dd')),
1690
+ data_length INT GENERATED ALWAYS AS (length(event_data)),
1691
+ amount_category STRING GENERATED ALWAYS AS (
1692
+ if(amount < 100, 'small',
1693
+ if(amount < 1000, 'medium', 'large'))
1694
+ ),
1695
+ display_info STRING GENERATED ALWAYS AS (
1696
+ concat('[', string(id), '] ', substr(event_data, 1, 50))
1697
+ )
1698
+ ) COMMENT 'Generated column correct usage - distinguishing deterministic computation from default value settings';
1699
+ ```
1700
+
1701
+ ***
1702
+
1703
+ ## Troubleshooting Guide
1704
+
1705
+ ### Common Error Diagnosis and Solutions
1706
+
1707
+ #### Error 1: IDENTITY Column Type Error
1708
+
1709
+ **Error Message**:
1710
+
1711
+ ```
1712
+ invalid identity column type int, currently only BIGINT is supported
1713
+ ```
1714
+
1715
+ **Root Cause**: Attempting to use IDENTITY constraint on a non-BIGINT column
1716
+
1717
+ **Diagnosis Steps**:
1718
+
1719
+ 1. Check the IDENTITY column definition in the CREATE TABLE statement
1720
+ 2. Confirm whether the IDENTITY column data type is BIGINT
1721
+ 3. Check if INT, SMALLINT, or other numeric types were mistakenly used
1722
+
1723
+ **Solution**:
1724
+
1725
+ ```sql
1726
+ -- Incorrect usage
1727
+ CREATE TABLE wrong_table (id INT IDENTITY, name VARCHAR(50));
1728
+
1729
+ -- Correct usage
1730
+ CREATE TABLE correct_table (id BIGINT IDENTITY, name VARCHAR(50));
1731
+ ```
1732
+
1733
+ #### Error 2: Index Naming Management
1734
+
1735
+ The current version of the Singdata Lakehouse **may not strictly enforce schema-level uniqueness** for index naming. Although indexes with the same name can be created successfully, we still recommend using unique index names for code maintainability and future version compatibility.
1736
+
1737
+ **Best Practice**:
1738
+
1739
+ ```sql
1740
+ -- Recommended unique index naming
1741
+ CREATE INVERTED INDEX table1_inv_content_idx ON TABLE table1(content);
1742
+ CREATE INVERTED INDEX table2_inv_content_idx ON TABLE table2(content);
1743
+
1744
+ -- Naming convention: {table_name}_{index_type}_{column_name}_idx
1745
+ ```
1746
+
1747
+ #### Error 3: Generated Column Function Not Supported
1748
+
1749
+ **Error Message**:
1750
+
1751
+ ```
1752
+ Generated column auto_timestamp only contains built-in/scalar/deterministic function
1753
+ ```
1754
+
1755
+ **Root Cause**: Non-deterministic function used in a generated column
1756
+
1757
+ **Diagnosis Steps**:
1758
+
1759
+ 1. Check the functions used in generated column expressions
1760
+ 2. Cross-reference with the deterministic function support list
1761
+ 3. Distinguish between default values and generated column use cases
1762
+
1763
+ **Solution**:
1764
+
1765
+ ```sql
1766
+ -- Incorrect: Using non-deterministic function in generated column
1767
+ created_at TIMESTAMP GENERATED ALWAYS AS (current_timestamp())
1768
+
1769
+ -- Correct: Use default value
1770
+ created_at TIMESTAMP DEFAULT current_timestamp()
1771
+
1772
+ -- Correct: Generated column using deterministic function
1773
+ date_part STRING GENERATED ALWAYS AS (date_format(some_timestamp, 'yyyy-MM-dd'))
1774
+ ```
1775
+
1776
+ #### Error 4: Unsupported Partition Type
1777
+
1778
+ **Error Message**:
1779
+
1780
+ ```
1781
+ Unsupported data type for partition transform: timestamp_ltz
1782
+ ```
1783
+
1784
+ **Root Cause**: Using a data type not supported for partitioning
1785
+
1786
+ **Diagnosis Steps**:
1787
+
1788
+ 1. Check the data type of the partition column
1789
+ 2. Cross-reference with the supported partition data type list
1790
+ 3. Evaluate if a generated column conversion can be used
1791
+
1792
+ **Solution**:
1793
+
1794
+ ```sql
1795
+ -- Incorrect: Using TIMESTAMP directly for partitioning
1796
+ PARTITIONED BY (created_time)
1797
+
1798
+ -- Correct: Use generated column conversion
1799
+ CREATE TABLE correct_partition (
1800
+ created_time TIMESTAMP,
1801
+ date_part STRING GENERATED ALWAYS AS (date_format(created_time, 'yyyy-MM-dd'))
1802
+ ) PARTITIONED BY (date_part);
1803
+ ```
1804
+
1805
+ #### Error 5: Dynamic Partition Count Exceeded
1806
+
1807
+ **Error Message**:
1808
+
1809
+ ```
1810
+ The count of dynamic partitions exceeds the maximum number 2048
1811
+ ```
1812
+
1813
+ **Root Cause**: A single insert operation involves more than 2048 dynamic partitions
1814
+
1815
+ **Diagnosis Steps**:
1816
+
1817
+ 1. Analyze the partition key distribution in the source data
1818
+ 2. Count the number of distinct partition values
1819
+ 3. Evaluate the data insertion strategy
1820
+
1821
+ **Solution**:
1822
+
1823
+ ```sql
1824
+ -- Query partition distribution in source data
1825
+ SELECT partition_column, COUNT(*)
1826
+ FROM source_table
1827
+ GROUP BY partition_column
1828
+ ORDER BY COUNT(*) DESC;
1829
+
1830
+ -- Batch insert data
1831
+ INSERT INTO target_table
1832
+ SELECT * FROM source_table
1833
+ WHERE date_column BETWEEN '2024-01-01' AND '2024-01-31';
1834
+ ```
1835
+
1836
+ #### Error 6: Specifying analyzer on ARRAY Type Column Index
1837
+
1838
+ **Error Message**:
1839
+
1840
+ ```
1841
+ invalid.inverted.index.analyzer.type, array<string>
1842
+ ```
1843
+
1844
+ **Root Cause**: Specifying the analyzer parameter when creating an inverted index on an ARRAY type column
1845
+
1846
+ **Diagnosis Steps**:
1847
+
1848
+ 1. Check the CREATE INVERTED INDEX statement
1849
+ 2. Confirm whether the index column is of ARRAY type
1850
+ 3. Check if the analyzer parameter is included
1851
+
1852
+ **Solution**:
1853
+
1854
+ ```sql
1855
+ -- Incorrect: Specifying analyzer on ARRAY type
1856
+ CREATE INVERTED INDEX tags_analyzer_idx
1857
+ ON TABLE array_column_table(tags)
1858
+ PROPERTIES ('analyzer' = 'keyword');
1859
+
1860
+ -- Correct: Do not specify analyzer
1861
+ CREATE INVERTED INDEX tags_idx
1862
+ ON TABLE array_column_table(tags);
1863
+ ```
1864
+
1865
+ ### Performance Issue Diagnosis
1866
+
1867
+ #### Slow Query Performance
1868
+
1869
+ **Possible Causes and Solutions**:
1870
+
1871
+ 1. **Partition pruning not taking effect**
1872
+ ```sql
1873
+ -- Check if query uses partition column
1874
+ EXPLAIN SELECT * FROM table WHERE partition_column = 'value';
1875
+
1876
+ -- Ensure WHERE condition includes partition column
1877
+ WHERE date_partition = '2024-01-15' -- Instead of WHERE original_date = '2024-01-15'
1878
+ ```
1879
+
1880
+ 2. **Missing appropriate indexes**
1881
+ ```sql
1882
+ -- Create indexes for high-frequency query columns
1883
+ CREATE BLOOMFILTER INDEX table_column_idx ON TABLE table_name(column_name);
1884
+ ```
1885
+
1886
+ 3. **Improper bucketing strategy**
1887
+ ```sql
1888
+ -- Check cardinality distribution of bucket column
1889
+ SELECT bucket_column, COUNT(*)
1890
+ FROM table_name
1891
+ GROUP BY bucket_column
1892
+ ORDER BY COUNT(*) DESC;
1893
+
1894
+ -- Choose high-cardinality, evenly distributed columns as bucket keys
1895
+ ```
1896
+
1897
+ #### Poor Write Performance
1898
+
1899
+ **Possible Causes and Solutions**:
1900
+
1901
+ 1. **Improper bucket count setting**
1902
+ ```sql
1903
+ -- Small table with too many buckets -> reduce bucket count
1904
+ -- Large table with too few buckets -> increase bucket count
1905
+ ```
1906
+
1907
+ 2. **Data skew issues**
1908
+ ```sql
1909
+ -- Choose a more evenly distributed bucket key
1910
+ HASH CLUSTERED BY (more_uniform_column)
1911
+ ```
1912
+
1913
+ 3. **Excessive index maintenance overhead**
1914
+ ```sql
1915
+ -- Drop unnecessary indexes
1916
+ DROP INDEX unnecessary_index_name;
1917
+ ```
1918
+
1919
+ ### Error Prevention Checklist
1920
+
1921
+ #### Pre-Table Creation Checks
1922
+
1923
+ * [ ] IDENTITY column uses BIGINT type
1924
+ * [ ] Partition column types are in the supported list
1925
+ * [ ] Generated columns use only deterministic functions
1926
+ * [ ] VARCHAR lengths are set reasonably
1927
+ * [ ] Financial fields use DECIMAL type
1928
+
1929
+ #### Pre-Index Creation Checks
1930
+
1931
+ * [ ] Index names are unique and descriptive
1932
+ * [ ] Inverted indexes specify appropriate tokenizer
1933
+ * [ ] ARRAY type columns do not specify analyzer
1934
+ * [ ] Vector index parameters are correctly configured
1935
+ * [ ] PRIMARY KEY and HASH CLUSTERED BY configurations are compatible
1936
+
1937
+ #### Pre-Data Insertion Checks
1938
+
1939
+ * [ ] Estimated dynamic partition count does not exceed the limit
1940
+ * [ ] Complex type data insert syntax is verified
1941
+ * [ ] Data type matching is confirmed
1942
+ * [ ] Constraint conditions are satisfied
1943
+
1944
+ ***
1945
+
1946
+ ## Design Review Checklist
1947
+
1948
+ ### Table Structure Design Checks
1949
+
1950
+ #### Data Type Design
1951
+
1952
+ * [ ] **IDENTITY Column Type**: Uniformly use BIGINT IDENTITY (product limitation)
1953
+ * [ ] **Financial Data Types**: Use DECIMAL instead of FLOAT/DOUBLE (precision guarantee)
1954
+ * [ ] **String Lengths**: Set reasonable lengths based on actual business requirements (storage optimization)
1955
+ * [ ] **Vector Type Syntax**: Use correct VECTOR(scalar_type, dimension) format
1956
+ * [ ] **Complex Type Insertion**: Use struct() or named_struct() functions for STRUCT (correct syntax)
1957
+
1958
+ #### Constraints and Default Values
1959
+
1960
+ * [ ] **NOT NULL Constraints**: Add NOT NULL constraints on core business fields
1961
+ * [ ] **Default Value Settings**: Set reasonable defaults for system fields
1962
+ * [ ] **Generated Column Functions**: Use only deterministic scalar functions (verified support list)
1963
+ * [ ] **Primary Key Design**: Avoid using primary keys (unless specially required)
1964
+
1965
+ #### Partition Strategy
1966
+
1967
+ * [ ] **Partition Column Types**: Use data types supported for partitioning (confirmed support list)
1968
+ * [ ] **Partition Granularity**: Choose appropriate partition granularity to avoid too many small partitions
1969
+ * [ ] **Generated Column Partitions**: Use generated columns to convert unsupported types like TIMESTAMP
1970
+ * [ ] **Dynamic Partition Limits**: Control within 2048 partitions per single operation
1971
+
1972
+ ### Performance Optimization Checks
1973
+
1974
+ #### Bucketing Design
1975
+
1976
+ * [ ] **Bucket Column Selection**: Choose high-cardinality, evenly distributed columns (test-verified)
1977
+ * [ ] **Bucket Count**: Set reasonable bucket count based on data scale (test-verified recommendations provided)
1978
+ * [ ] **Sorting Strategy**: Choose sort columns that support main query scenarios
1979
+ * [ ] **Composite Bucketing**: Consider multi-column composite bucketing for large tables
1980
+
1981
+ #### Index Strategy
1982
+
1983
+ * [ ] **Index Naming**: Follow unique naming convention (recommended to still follow)
1984
+ * [ ] **Vector Index**: Distance function and parameters optimized for business scenarios (distance function support confirmed)
1985
+ * [ ] **Inverted Index**: Specify appropriate tokenizer for string types (verified)
1986
+ * [ ] **ARRAY Index**: Do not specify analyzer for ARRAY types (confirmed limitation)
1987
+ * [ ] **Bloom Filter**: Used for fast filtering on high-cardinality columns (verified)
1988
+ * [ ] **PRIMARY KEY and Bucketing**: Ensure configuration compatibility (confirmed conflict)
1989
+
1990
+ #### Query Optimization
1991
+
1992
+ * [ ] **Partition Pruning**: Main queries can leverage partition pruning
1993
+ * [ ] **Bucket Targeting**: JOIN keys align with bucket columns
1994
+ * [ ] **Index Utilization**: Common filter conditions have corresponding index support
1995
+ * [ ] **Multi-Dimensional Queries**: Design multi-level index strategy for complex queries
1996
+
1997
+ ### Operations and Scalability Checks
1998
+
1999
+ #### Maintainability
2000
+
2001
+ * [ ] **Naming Convention**: Table, field, and index names follow consistent conventions
2002
+ * [ ] **Complete Comments**: Tables and key fields have clear business comments
2003
+ * [ ] **Lifecycle**: Set reasonable data retention policies
2004
+ * [ ] **Version Management**: Important design decisions are documented
2005
+
2006
+ #### Scalability
2007
+
2008
+ * [ ] **Data Growth**: Design considers future data volume growth
2009
+ * [ ] **Business Expansion**: Reserve expansion field space (e.g., JSON columns)
2010
+ * [ ] **Index Expansion**: Index strategy supports new query patterns
2011
+ * [ ] **Bucket Headroom**: Bucket count reserves expansion capacity
2012
+
2013
+ #### Fault Handling
2014
+
2015
+ * [ ] **Error Prevention**: Follow common pitfall avoidance strategies
2016
+ * [ ] **Monitoring Setup**: Establish performance and capacity monitoring
2017
+ * [ ] **Backup Strategy**: Develop data backup and recovery plans
2018
+ * [ ] **Emergency Plans**: Prepare handling plans for common issues
2019
+
2020
+ ### Cost Optimization Checks
2021
+
2022
+ #### Storage Costs
2023
+
2024
+ * [ ] **Type Optimization**: Use the smallest appropriate type for storage
2025
+ * [ ] **Length Control**: VARCHAR lengths set based on actual requirements
2026
+ * [ ] **Compression Strategy**: Use vector index compression parameters appropriately
2027
+ * [ ] **Lifecycle**: Set automatic data cleanup policies
2028
+
2029
+ #### Compute Costs
2030
+
2031
+ * [ ] **Index Count**: Avoid creating too many unnecessary indexes
2032
+ * [ ] **Query Optimization**: Ensure queries can execute efficiently
2033
+ * [ ] **Partition Strategy**: Avoid too many small partitions increasing metadata overhead
2034
+ * [ ] **Resource Configuration**: Bucket count matches cluster resources
2035
+
2036
+ ***
2037
+
2038
+ ## Enterprise Design Patterns in Practice
2039
+
2040
+ ### Pattern 1: Event Sourcing Architecture (Complete Implementation)
2041
+
2042
+ **Use Case**: Financial transactions, audit compliance, user behavior analysis, and other scenarios requiring complete historical records
2043
+
2044
+ ```sql
2045
+ -- Event store main table
2046
+ CREATE TABLE event_store_transactions (
2047
+ event_id BIGINT IDENTITY,
2048
+
2049
+ -- Event identification information
2050
+ aggregate_id VARCHAR(100) NOT NULL, -- Aggregate root ID (user ID, order ID, etc.)
2051
+ aggregate_type VARCHAR(50) NOT NULL, -- Aggregate type (User, Order, Payment, etc.)
2052
+ event_type VARCHAR(50) NOT NULL, -- Event type (Created, Updated, Deleted, etc.)
2053
+ event_version INT NOT NULL DEFAULT 1, -- Event version, supports schema evolution
2054
+
2055
+ -- Event time information
2056
+ event_timestamp TIMESTAMP NOT NULL, -- Business event occurrence time
2057
+ ingestion_timestamp TIMESTAMP DEFAULT current_timestamp(), -- System ingestion time
2058
+
2059
+ -- Event data and metadata
2060
+ event_data JSON NOT NULL, -- Event detailed data
2061
+ event_metadata JSON DEFAULT '{}', -- Event metadata (IP, device, etc.)
2062
+
2063
+ -- Tracing information
2064
+ causation_id VARCHAR(100), -- Causation ID
2065
+ correlation_id VARCHAR(100), -- Correlation ID for business process tracing
2066
+ session_id VARCHAR(100), -- Session ID
2067
+
2068
+ -- Business context
2069
+ tenant_id VARCHAR(50), -- Tenant ID for multi-tenant scenarios
2070
+ user_id VARCHAR(100), -- Operating user ID
2071
+ source_system VARCHAR(50), -- Source system identifier
2072
+
2073
+ -- Partition and performance optimization
2074
+ date_partition STRING GENERATED ALWAYS AS (date_format(event_timestamp, 'yyyy-MM-dd')),
2075
+ hour_partition INT GENERATED ALWAYS AS (hour(event_timestamp))
2076
+ )
2077
+ PARTITIONED BY (date_partition)
2078
+ HASH CLUSTERED BY (aggregate_id) -- Bucket by aggregate root for entity reconstruction
2079
+ SORTED BY (event_timestamp ASC, event_version ASC) -- Guarantee event order
2080
+ INTO 512 BUCKETS
2081
+ COMMENT 'Event sourcing storage table - records all business events, supporting full audit trails';
2082
+
2083
+ -- Event query optimization indexes
2084
+ CREATE BLOOMFILTER INDEX events_aggregate_idx ON TABLE event_store_transactions(aggregate_id);
2085
+ CREATE BLOOMFILTER INDEX events_type_idx ON TABLE event_store_transactions(event_type);
2086
+ CREATE BLOOMFILTER INDEX events_tenant_idx ON TABLE event_store_transactions(tenant_id);
2087
+ CREATE INVERTED INDEX events_data_search_idx ON TABLE event_store_transactions(event_data)
2088
+ PROPERTIES ('analyzer' = 'unicode');
2089
+
2090
+ -- Snapshot table (performance optimization)
2091
+ CREATE TABLE aggregate_snapshots (
2092
+ snapshot_id BIGINT IDENTITY,
2093
+ aggregate_id VARCHAR(100) NOT NULL,
2094
+ aggregate_type VARCHAR(50) NOT NULL,
2095
+ snapshot_version INT NOT NULL,
2096
+
2097
+ -- Snapshot data
2098
+ snapshot_data JSON NOT NULL, -- Complete state snapshot of the aggregate root
2099
+
2100
+ -- Snapshot metadata
2101
+ snapshot_timestamp TIMESTAMP NOT NULL,
2102
+ last_event_id BIGINT NOT NULL, -- Last event ID included in the snapshot
2103
+ last_event_version INT NOT NULL, -- Last event version included in the snapshot
2104
+
2105
+ -- Performance optimization
2106
+ created_at TIMESTAMP DEFAULT current_timestamp(),
2107
+
2108
+ date_partition STRING GENERATED ALWAYS AS (date_format(snapshot_timestamp, 'yyyy-MM-dd'))
2109
+ )
2110
+ PARTITIONED BY (date_partition)
2111
+ HASH CLUSTERED BY (aggregate_id)
2112
+ SORTED BY (snapshot_timestamp DESC)
2113
+ INTO 128 BUCKETS
2114
+ COMMENT 'Aggregate snapshot table - periodically saves aggregate state, optimizing reconstruction performance';
2115
+
2116
+ -- Set data lifecycle
2117
+ ALTER TABLE event_store_transactions SET TBLPROPERTIES ('data_lifecycle' = '2555'); -- 7-year retention
2118
+ ALTER TABLE aggregate_snapshots SET TBLPROPERTIES ('data_lifecycle' = '365'); -- 1-year retention
2119
+ ```
2120
+
2121
+ ### Pattern 2: Real-Time Data Lake Architecture (Lambda Enhanced)
2122
+
2123
+ **Use Case**: Real-time analytics, big data processing, machine learning feature engineering
2124
+
2125
+ ```sql
2126
+ -- Real-time data stream layer (Speed Layer)
2127
+ CREATE TABLE realtime_data_stream (
2128
+ stream_id BIGINT IDENTITY,
2129
+
2130
+ -- Data source identification
2131
+ source_system VARCHAR(50) NOT NULL,
2132
+ data_type VARCHAR(50) NOT NULL, -- metrics, events, logs, etc.
2133
+
2134
+ -- Business identification
2135
+ user_id INT,
2136
+ session_id VARCHAR(100),
2137
+ entity_id VARCHAR(100),
2138
+
2139
+ -- Real-time data
2140
+ raw_data JSON NOT NULL, -- Raw data
2141
+ processed_data JSON, -- Preprocessed data
2142
+
2143
+ -- Time information
2144
+ event_timestamp TIMESTAMP NOT NULL, -- Business time
2145
+ ingestion_timestamp TIMESTAMP DEFAULT current_timestamp(), -- Ingestion time
2146
+ processing_timestamp TIMESTAMP, -- Processing time
2147
+
2148
+ -- Data quality
2149
+ data_quality_score DECIMAL(3,2), -- Data quality score
2150
+ validation_errors ARRAY<STRING>, -- Validation error list
2151
+
2152
+ -- Real-time partitioning (by hour)
2153
+ hour_partition STRING GENERATED ALWAYS AS (
2154
+ date_format(event_timestamp, 'yyyy-MM-dd-HH')
2155
+ )
2156
+ )
2157
+ PARTITIONED BY (hour_partition)
2158
+ HASH CLUSTERED BY (user_id)
2159
+ SORTED BY (event_timestamp DESC)
2160
+ INTO 1024 BUCKETS
2161
+ COMMENT 'Real-time data stream table - Lambda architecture speed layer, processing streaming data';
2162
+
2163
+ -- Real-time query optimization
2164
+ CREATE BLOOMFILTER INDEX realtime_user_idx ON TABLE realtime_data_stream(user_id);
2165
+ CREATE BLOOMFILTER INDEX realtime_source_idx ON TABLE realtime_data_stream(source_system);
2166
+ CREATE INVERTED INDEX realtime_data_search_idx ON TABLE realtime_data_stream(raw_data)
2167
+ PROPERTIES ('analyzer' = 'unicode');
2168
+
2169
+ -- Batch aggregation layer (Batch Layer)
2170
+ CREATE TABLE batch_aggregated_analytics (
2171
+ agg_id BIGINT IDENTITY,
2172
+
2173
+ -- Aggregation dimensions
2174
+ user_id INT NOT NULL,
2175
+ data_type VARCHAR(50) NOT NULL,
2176
+ source_system VARCHAR(50) NOT NULL,
2177
+
2178
+ -- Time windows
2179
+ window_start TIMESTAMP NOT NULL,
2180
+ window_end TIMESTAMP NOT NULL,
2181
+ window_type VARCHAR(20) NOT NULL, -- HOUR, DAY, WEEK, MONTH
2182
+
2183
+ -- Aggregation metrics
2184
+ event_count INT,
2185
+ unique_sessions INT,
2186
+ total_duration BIGINT, -- Milliseconds
2187
+ avg_quality_score DECIMAL(5,3),
2188
+
2189
+ -- Statistical indicators
2190
+ min_value DOUBLE,
2191
+ max_value DOUBLE,
2192
+ avg_value DOUBLE,
2193
+ std_deviation DOUBLE,
2194
+ percentile_50 DOUBLE,
2195
+ percentile_95 DOUBLE,
2196
+ percentile_99 DOUBLE,
2197
+
2198
+ -- Business metrics
2199
+ conversion_rate DECIMAL(5,4),
2200
+ error_rate DECIMAL(5,4),
2201
+
2202
+ -- Batch processing metadata
2203
+ batch_id VARCHAR(100),
2204
+ batch_timestamp TIMESTAMP DEFAULT current_timestamp(),
2205
+ processing_version VARCHAR(20) DEFAULT '2.2',
2206
+
2207
+ date_partition STRING GENERATED ALWAYS AS (date_format(window_start, 'yyyy-MM-dd'))
2208
+ )
2209
+ PARTITIONED BY (date_partition)
2210
+ HASH CLUSTERED BY (user_id, data_type)
2211
+ SORTED BY (window_start DESC)
2212
+ INTO 256 BUCKETS
2213
+ COMMENT 'Batch aggregation table - Lambda architecture batch layer, providing accurate historical analysis';
2214
+
2215
+ -- Serving Layer unified view
2216
+ CREATE TABLE serving_layer_unified_view (
2217
+ view_id BIGINT IDENTITY,
2218
+
2219
+ -- Identification information
2220
+ user_id INT NOT NULL,
2221
+ metric_name VARCHAR(100) NOT NULL,
2222
+
2223
+ -- Real-time data (last 1 hour)
2224
+ realtime_value DOUBLE,
2225
+ realtime_timestamp TIMESTAMP,
2226
+ realtime_confidence DECIMAL(3,2),
2227
+
2228
+ -- Batch data (historical aggregation)
2229
+ batch_value DOUBLE,
2230
+ batch_timestamp TIMESTAMP,
2231
+ batch_window_type VARCHAR(20),
2232
+
2233
+ -- Unified result (intelligent merge)
2234
+ unified_value DOUBLE,
2235
+ data_source VARCHAR(20), -- realtime, batch, hybrid
2236
+ confidence_level DECIMAL(3,2),
2237
+
2238
+ -- Update information
2239
+ last_updated TIMESTAMP DEFAULT current_timestamp(),
2240
+
2241
+ date_partition STRING GENERATED ALWAYS AS (date_format(last_updated, 'yyyy-MM-dd'))
2242
+ )
2243
+ PARTITIONED BY (date_partition)
2244
+ HASH CLUSTERED BY (user_id)
2245
+ SORTED BY (last_updated DESC)
2246
+ INTO 128 BUCKETS
2247
+ COMMENT 'Serving layer unified view - merging real-time and batch results, providing unified query interface';
2248
+
2249
+ -- Set data lifecycle for different layers
2250
+ ALTER TABLE realtime_data_stream SET TBLPROPERTIES ('data_lifecycle' = '7'); -- Real-time data 7 days
2251
+ ALTER TABLE batch_aggregated_analytics SET TBLPROPERTIES ('data_lifecycle' = '365'); -- Batch data 1 year
2252
+ ALTER TABLE serving_layer_unified_view SET TBLPROPERTIES ('data_lifecycle' = '90'); -- Serving layer 3 months
2253
+ ```
2254
+
2255
+ ### Pattern 3: Multi-Tenant SaaS Data Architecture (Enterprise)
2256
+
2257
+ **Use Case**: Enterprise SaaS platforms, multi-tenant applications, scenarios requiring strict data isolation
2258
+
2259
+ ```sql
2260
+ -- Tenant master data table
2261
+ CREATE TABLE saas_tenant_registry (
2262
+ tenant_id VARCHAR(50) NOT NULL, -- Removed PRIMARY KEY for HASH CLUSTERED BY compatibility
2263
+ tenant_name VARCHAR(200) NOT NULL,
2264
+
2265
+ -- Tenant basic information
2266
+ subscription_plan VARCHAR(50) NOT NULL, -- free, basic, premium, enterprise
2267
+ tenant_status TINYINT DEFAULT 1, -- 1=active, 0=suspended, 2=trial
2268
+
2269
+ -- Configuration information
2270
+ data_region VARCHAR(20) DEFAULT 'default', -- Data storage region
2271
+ schema_version VARCHAR(10) DEFAULT '2.2', -- Tenant schema version
2272
+ feature_flags JSON DEFAULT '{}', -- Feature flag configuration
2273
+ quota_settings JSON DEFAULT '{}', -- Quota limit settings
2274
+
2275
+ -- Tenant metadata
2276
+ created_at TIMESTAMP DEFAULT current_timestamp(),
2277
+ updated_at TIMESTAMP,
2278
+
2279
+ -- Contact information
2280
+ admin_email VARCHAR(320),
2281
+ billing_contact JSON
2282
+ )
2283
+ HASH CLUSTERED BY (tenant_id)
2284
+ INTO 32 BUCKETS
2285
+ COMMENT 'Tenant registry table - manages basic information and configuration for all tenants';
2286
+
2287
+ -- Multi-tenant business data table (core table)
2288
+ CREATE TABLE saas_multi_tenant_data (
2289
+ record_id BIGINT IDENTITY,
2290
+ tenant_id VARCHAR(50) NOT NULL,
2291
+
2292
+ -- Business entity information
2293
+ entity_type VARCHAR(50) NOT NULL, -- user, order, product, invoice, etc.
2294
+ entity_id VARCHAR(100) NOT NULL, -- Entity ID within the tenant
2295
+ entity_status TINYINT DEFAULT 1, -- Entity status
2296
+
2297
+ -- Business data
2298
+ core_data JSON NOT NULL, -- Core business data
2299
+ extended_data JSON DEFAULT '{}', -- Extended data
2300
+ custom_fields JSON DEFAULT '{}', -- Tenant custom fields
2301
+
2302
+ -- Data classification and tags
2303
+ data_category VARCHAR(50), -- Data classification
2304
+ tags ARRAY<STRING>, -- Business tags
2305
+ priority_level TINYINT DEFAULT 1, -- Priority: 1=normal, 2=high, 3=critical
2306
+
2307
+ -- Audit information
2308
+ created_by VARCHAR(100),
2309
+ updated_by VARCHAR(100),
2310
+ created_at TIMESTAMP DEFAULT current_timestamp(),
2311
+ updated_at TIMESTAMP,
2312
+ version_number INT DEFAULT 1,
2313
+
2314
+ -- Data governance
2315
+ data_classification VARCHAR(20) DEFAULT 'internal', -- public, internal, confidential, restricted
2316
+ retention_policy VARCHAR(50), -- Data retention policy
2317
+
2318
+ -- Performance optimization
2319
+ tenant_partition STRING GENERATED ALWAYS AS (tenant_id)
2320
+ )
2321
+ PARTITIONED BY (tenant_partition) -- Tenant-level data isolation
2322
+ HASH CLUSTERED BY (entity_id) -- Entity dimension bucketing
2323
+ SORTED BY (updated_at DESC, priority_level DESC) -- Latest and high-priority data first
2324
+ INTO 256 BUCKETS
2325
+ COMMENT 'Multi-tenant business data table - realizing tenant-level data isolation and efficient querying';
2326
+
2327
+ -- Multi-tenant query optimization indexes
2328
+ CREATE BLOOMFILTER INDEX saas_entity_type_idx ON TABLE saas_multi_tenant_data(entity_type);
2329
+ CREATE BLOOMFILTER INDEX saas_entity_id_idx ON TABLE saas_multi_tenant_data(entity_id);
2330
+ CREATE INVERTED INDEX saas_tags_idx ON TABLE saas_multi_tenant_data(tags);
2331
+ CREATE INVERTED INDEX saas_core_data_idx ON TABLE saas_multi_tenant_data(core_data)
2332
+ PROPERTIES ('analyzer' = 'unicode');
2333
+
2334
+ -- Tenant usage statistics table (billing and monitoring)
2335
+ CREATE TABLE saas_tenant_usage_stats (
2336
+ usage_id BIGINT IDENTITY,
2337
+ tenant_id VARCHAR(50) NOT NULL,
2338
+
2339
+ -- Statistics time window
2340
+ stat_date DATE NOT NULL,
2341
+ stat_hour TINYINT, -- 0-23, NULL indicates daily statistics
2342
+
2343
+ -- Usage statistics
2344
+ api_calls_count INT DEFAULT 0,
2345
+ storage_bytes_used BIGINT DEFAULT 0,
2346
+ data_transfer_bytes BIGINT DEFAULT 0,
2347
+ compute_seconds_used INT DEFAULT 0,
2348
+
2349
+ -- Feature usage statistics
2350
+ active_users_count INT DEFAULT 0,
2351
+ unique_sessions_count INT DEFAULT 0,
2352
+ feature_usage_stats JSON DEFAULT '{}',
2353
+
2354
+ -- Performance indicators
2355
+ avg_response_time_ms INT,
2356
+ error_rate DECIMAL(5,4),
2357
+ availability_percentage DECIMAL(5,2),
2358
+
2359
+ -- Cost allocation
2360
+ estimated_cost_usd DECIMAL(10,4),
2361
+
2362
+ -- Update information
2363
+ last_updated TIMESTAMP DEFAULT current_timestamp(),
2364
+
2365
+ date_partition STRING GENERATED ALWAYS AS (string(stat_date))
2366
+ )
2367
+ PARTITIONED BY (date_partition)
2368
+ HASH CLUSTERED BY (tenant_id)
2369
+ SORTED BY (stat_date DESC, stat_hour DESC)
2370
+ INTO 64 BUCKETS
2371
+ COMMENT 'Tenant usage statistics table - supporting billing, monitoring, and resource management';
2372
+
2373
+ -- Set data lifecycle policies
2374
+ ALTER TABLE saas_multi_tenant_data SET TBLPROPERTIES ('data_lifecycle' = '1095'); -- 3 years business data
2375
+ ALTER TABLE saas_tenant_usage_stats SET TBLPROPERTIES ('data_lifecycle' = '730'); -- 2 years statistics data
2376
+ ```
2377
+
2378
+ ### Pattern 4: IoT Time-Series Data Architecture (Industrial Grade)
2379
+
2380
+ **Use Case**: Industrial IoT, smart manufacturing, device monitoring, sensor data processing
2381
+
2382
+ ```sql
2383
+ -- Device master data table
2384
+ CREATE TABLE iot_device_registry (
2385
+ device_id VARCHAR(100) NOT NULL,
2386
+
2387
+ -- Device basic information
2388
+ device_name VARCHAR(200),
2389
+ device_type VARCHAR(50) NOT NULL, -- sensor, actuator, gateway, edge
2390
+ device_model VARCHAR(100),
2391
+ manufacturer VARCHAR(100),
2392
+ firmware_version VARCHAR(50),
2393
+
2394
+ -- Deployment information
2395
+ installation_location VARCHAR(200),
2396
+ geo_location JSON, -- {"lat": 39.9042, "lng": 116.4074}
2397
+ facility_id VARCHAR(50),
2398
+ production_line VARCHAR(50),
2399
+
2400
+ -- Device configuration
2401
+ measurement_interval_seconds INT DEFAULT 60,
2402
+ data_retention_days INT DEFAULT 90,
2403
+ alert_thresholds JSON DEFAULT '{}',
2404
+ calibration_params JSON DEFAULT '{}',
2405
+
2406
+ -- Device status
2407
+ device_status TINYINT DEFAULT 1, -- 1=online, 0=offline, 2=maintenance
2408
+ last_heartbeat TIMESTAMP,
2409
+ health_score DECIMAL(3,2), -- 0.00-1.00
2410
+
2411
+ -- Management information
2412
+ created_at TIMESTAMP DEFAULT current_timestamp(),
2413
+ updated_at TIMESTAMP
2414
+ )
2415
+ HASH CLUSTERED BY (device_type)
2416
+ INTO 32 BUCKETS
2417
+ COMMENT 'IoT device registry table - manages metadata for all IoT devices';
2418
+
2419
+ -- High-frequency time-series data table
2420
+ CREATE TABLE iot_timeseries_measurements (
2421
+ measurement_id BIGINT IDENTITY,
2422
+
2423
+ -- Device and measurement identification
2424
+ device_id VARCHAR(100) NOT NULL,
2425
+ sensor_id VARCHAR(100), -- Sensor ID in composite devices
2426
+ measurement_type VARCHAR(50) NOT NULL, -- temperature, pressure, vibration, current, etc.
2427
+
2428
+ -- Measurement data
2429
+ measurement_value DOUBLE, -- Primary value
2430
+ measurement_unit VARCHAR(20), -- Unit: C, Pa, Hz, A, etc.
2431
+ secondary_values JSON, -- Auxiliary measurement values (multi-dimensional sensors)
2432
+
2433
+ -- Time information (high precision)
2434
+ measurement_timestamp TIMESTAMP NOT NULL, -- Device timestamp
2435
+ collection_timestamp TIMESTAMP DEFAULT current_timestamp(), -- Collection timestamp
2436
+
2437
+ -- Data quality and status
2438
+ data_quality_code TINYINT DEFAULT 1, -- 1=good, 2=uncertain, 3=bad
2439
+ measurement_status TINYINT DEFAULT 0, -- 0=normal, 1=warning, 2=alarm, 3=fault
2440
+ confidence_level DECIMAL(3,2), -- Measurement confidence
2441
+
2442
+ -- Anomaly detection results
2443
+ is_anomaly BOOLEAN DEFAULT false,
2444
+ anomaly_score DECIMAL(5,3), -- Anomaly score
2445
+ anomaly_type VARCHAR(50), -- Anomaly type
2446
+
2447
+ -- Context information
2448
+ environment_context JSON, -- Environmental parameters (temperature, humidity, pressure, etc.)
2449
+ operational_context JSON, -- Operational parameters (load, RPM, etc.)
2450
+
2451
+ -- High-frequency data partitioned by hour
2452
+ hour_partition STRING GENERATED ALWAYS AS (
2453
+ date_format(measurement_timestamp, 'yyyy-MM-dd-HH')
2454
+ )
2455
+ )
2456
+ PARTITIONED BY (hour_partition) -- Hourly partitioning for time range queries
2457
+ HASH CLUSTERED BY (device_id) -- Bucket by device
2458
+ SORTED BY (measurement_timestamp DESC) -- Time descending, latest data first
2459
+ INTO 2048 BUCKETS -- Large number of devices requires more buckets
2460
+ COMMENT 'IoT time-series measurement data table - storing high-frequency sensor data and anomaly detection results';
2461
+
2462
+ -- Time-series data query optimization indexes
2463
+ CREATE BLOOMFILTER INDEX iot_device_lookup_idx ON TABLE iot_timeseries_measurements(device_id);
2464
+ CREATE BLOOMFILTER INDEX iot_measurement_type_idx ON TABLE iot_timeseries_measurements(measurement_type);
2465
+ CREATE INVERTED INDEX iot_anomaly_filter_idx ON TABLE iot_timeseries_measurements(is_anomaly);
2466
+ CREATE INVERTED INDEX iot_status_filter_idx ON TABLE iot_timeseries_measurements(measurement_status);
2467
+
2468
+ -- Device status aggregation table (real-time computed results)
2469
+ CREATE TABLE iot_device_status_aggregated (
2470
+ agg_id BIGINT IDENTITY,
2471
+ device_id VARCHAR(100) NOT NULL,
2472
+
2473
+ -- Aggregation time window
2474
+ window_start TIMESTAMP NOT NULL,
2475
+ window_end TIMESTAMP NOT NULL,
2476
+ window_type VARCHAR(20) NOT NULL, -- MINUTE, HOUR, DAY
2477
+ measurement_type VARCHAR(50) NOT NULL,
2478
+
2479
+ -- Statistical indicators
2480
+ measurement_count INT,
2481
+ valid_measurement_count INT, -- Count of good quality measurements
2482
+
2483
+ -- Numeric statistics
2484
+ min_value DOUBLE,
2485
+ max_value DOUBLE,
2486
+ avg_value DOUBLE,
2487
+ median_value DOUBLE,
2488
+ std_deviation DOUBLE,
2489
+
2490
+ -- Anomaly statistics
2491
+ anomaly_count INT DEFAULT 0,
2492
+ alarm_count INT DEFAULT 0,
2493
+ fault_count INT DEFAULT 0,
2494
+
2495
+ -- Device health indicators
2496
+ uptime_percentage DECIMAL(5,2),
2497
+ data_quality_avg DECIMAL(3,2),
2498
+ health_trend TINYINT, -- 1=improving, 0=stable, -1=degrading
2499
+
2500
+ -- Predictive maintenance indicators
2501
+ maintenance_score DECIMAL(5,3), -- Maintenance requirement score
2502
+ estimated_rul_hours INT, -- Estimated remaining useful life (hours)
2503
+ next_maintenance_date DATE,
2504
+
2505
+ -- Computation metadata
2506
+ computed_timestamp TIMESTAMP DEFAULT current_timestamp(),
2507
+ computation_version VARCHAR(20) DEFAULT '2.2',
2508
+ model_version VARCHAR(20), -- Predictive model version
2509
+
2510
+ date_partition STRING GENERATED ALWAYS AS (date_format(window_start, 'yyyy-MM-dd'))
2511
+ )
2512
+ PARTITIONED BY (date_partition)
2513
+ HASH CLUSTERED BY (device_id)
2514
+ SORTED BY (window_start DESC)
2515
+ INTO 512 BUCKETS
2516
+ COMMENT 'Device status aggregation table - real-time computed device health status and predictive maintenance indicators';
2517
+
2518
+ -- Device alert event table
2519
+ CREATE TABLE iot_device_alerts (
2520
+ alert_id BIGINT IDENTITY,
2521
+
2522
+ -- Alert identification
2523
+ device_id VARCHAR(100) NOT NULL,
2524
+ alert_type VARCHAR(50) NOT NULL, -- threshold, anomaly, fault, offline
2525
+ alert_level TINYINT NOT NULL, -- 1=info, 2=warning, 3=error, 4=critical
2526
+
2527
+ -- Alert content
2528
+ alert_title VARCHAR(200),
2529
+ alert_description STRING,
2530
+ alert_data JSON, -- Alert-related data
2531
+
2532
+ -- Alert status
2533
+ alert_status TINYINT DEFAULT 1, -- 1=active, 2=acknowledged, 3=resolved
2534
+ acknowledged_by VARCHAR(100),
2535
+ resolved_by VARCHAR(100),
2536
+
2537
+ -- Time information
2538
+ alert_timestamp TIMESTAMP NOT NULL,
2539
+ acknowledged_at TIMESTAMP,
2540
+ resolved_at TIMESTAMP,
2541
+
2542
+ -- Business impact
2543
+ business_impact VARCHAR(100), -- Business impact description
2544
+ estimated_downtime_minutes INT, -- Estimated downtime
2545
+
2546
+ date_partition STRING GENERATED ALWAYS AS (date_format(alert_timestamp, 'yyyy-MM-dd'))
2547
+ )
2548
+ PARTITIONED BY (date_partition)
2549
+ HASH CLUSTERED BY (device_id)
2550
+ SORTED BY (alert_timestamp DESC, alert_level DESC)
2551
+ INTO 128 BUCKETS
2552
+ COMMENT 'Device alert event table - records and manages all device alert information';
2553
+
2554
+ -- Set tiered data lifecycle
2555
+ ALTER TABLE iot_timeseries_measurements SET TBLPROPERTIES ('data_lifecycle' = '90'); -- Raw data 3 months
2556
+ ALTER TABLE iot_device_status_aggregated SET TBLPROPERTIES ('data_lifecycle' = '730'); -- Aggregated data 2 years
2557
+ ALTER TABLE iot_device_alerts SET TBLPROPERTIES ('data_lifecycle' = '1095'); -- Alert records 3 years
2558
+ ```
2559
+
2560
+ ## Lab Environment Cleanup Guide
2561
+
2562
+ To ensure proper resource usage and avoid unnecessary storage overhead, the following cleanup operations should be performed after completing table design experiments:
2563
+
2564
+ ### Table Resource Cleanup
2565
+
2566
+ ```sql
2567
+ -- 1. Clean up test tables
2568
+ DROP TABLE IF EXISTS test_identity_table;
2569
+ DROP TABLE IF EXISTS test_identity_seed_table;
2570
+ DROP TABLE IF EXISTS test_string_types;
2571
+ DROP TABLE IF EXISTS test_vector_table;
2572
+ DROP TABLE IF EXISTS test_complex_types;
2573
+ DROP TABLE IF EXISTS test_constraints;
2574
+ DROP TABLE IF EXISTS test_generated_columns;
2575
+
2576
+ -- 2. Clean up partition test tables
2577
+ DROP TABLE IF EXISTS test_partition_daily;
2578
+ DROP TABLE IF EXISTS test_partition_hourly;
2579
+ DROP TABLE IF EXISTS test_partition_tenant;
2580
+ DROP TABLE IF EXISTS test_partition_multi;
2581
+ DROP TABLE IF EXISTS partition_type_solutions;
2582
+
2583
+ -- 3. Clean up index test tables
2584
+ DROP TABLE IF EXISTS test_vector_index_table;
2585
+ DROP TABLE IF EXISTS test_inverted_index_table;
2586
+ DROP TABLE IF EXISTS test_bloom_index_table;
2587
+ DROP TABLE IF EXISTS comprehensive_vector_demo;
2588
+ DROP TABLE IF EXISTS comprehensive_search_demo;
2589
+ DROP TABLE IF EXISTS user_management_optimized;
2590
+ DROP TABLE IF EXISTS product_catalog;
2591
+ DROP TABLE IF EXISTS user_content;
2592
+
2593
+ -- 4. Clean up optimization test tables
2594
+ DROP TABLE IF EXISTS user_behavior_optimized;
2595
+ DROP TABLE IF EXISTS financial_transactions_optimized;
2596
+ DROP TABLE IF EXISTS business_analytics_optimized;
2597
+ DROP TABLE IF EXISTS vector_search_performance;
2598
+ DROP TABLE IF EXISTS storage_cost_optimized;
2599
+ DROP TABLE IF EXISTS small_table_optimized;
2600
+ DROP TABLE IF EXISTS medium_table_optimized;
2601
+ DROP TABLE IF EXISTS large_table_optimized;
2602
+
2603
+ -- 5. Clean up enterprise architecture pattern tables
2604
+ -- Event sourcing architecture
2605
+ DROP TABLE IF EXISTS event_store_transactions;
2606
+ DROP TABLE IF EXISTS aggregate_snapshots;
2607
+
2608
+ -- Real-time data lake architecture
2609
+ DROP TABLE IF EXISTS realtime_data_stream;
2610
+ DROP TABLE IF EXISTS batch_aggregated_analytics;
2611
+ DROP TABLE IF EXISTS serving_layer_unified_view;
2612
+
2613
+ -- Multi-tenant SaaS architecture
2614
+ DROP TABLE IF EXISTS saas_tenant_registry;
2615
+ DROP TABLE IF EXISTS saas_multi_tenant_data;
2616
+ DROP TABLE IF EXISTS saas_tenant_usage_stats;
2617
+
2618
+ -- IoT time-series data architecture
2619
+ DROP TABLE IF EXISTS iot_device_registry;
2620
+ DROP TABLE IF EXISTS iot_timeseries_measurements;
2621
+ DROP TABLE IF EXISTS iot_device_status_aggregated;
2622
+ DROP TABLE IF EXISTS iot_device_alerts;
2623
+ ```
2624
+
2625
+ ***
2626
+
2627
+ ## Summary
2628
+
2629
+ ### Verification Results
2630
+
2631
+ This guide has been fully verified in the Singdata Lakehouse environment, with all key functional points confirmed to be available:
2632
+
2633
+ #### Verified Features
2634
+
2635
+ * **Data Types**: IDENTITY (BIGINT only), vector types, complex types (STRUCT/ARRAY/MAP)
2636
+ * **Constraints and Generated Columns**: Deterministic function list, default value syntax
2637
+ * **Partition Strategies**: Supported partition types, generated column partition conversion
2638
+ * **Bucketing and Sorting**: Bucket count configuration, sorting strategy optimization
2639
+ * **Index Architecture**: 5 vector index distance functions, inverted index tokenizers, bloom filters
2640
+ * **Performance Optimization**: Query pruning, multi-dimensional index collaboration
2641
+ * **Enterprise Architecture**: Complete implementation of four design patterns
2642
+
2643
+ #### Key Findings and Corrections
2644
+
2645
+ 1. **Index Naming**: Current version enforces schema-level uniqueness; follow unique naming
2646
+ 2. **Vector Dimensions**: Must strictly match defined dimensions on insert
2647
+ 3. **ARRAY Index**: analyzer parameter not supported
2648
+ 4. **PRIMARY KEY Conflict**: Strict conditions must be met when used simultaneously with HASH CLUSTERED BY
2649
+
2650
+ ### Core Value
2651
+
2652
+ The core value of this guide lies in:
2653
+
2654
+ 1. **Practicality**: All examples are practically verified and can be directly applied in production
2655
+ 2. **Completeness**: Covers full-stack design guidance from basic types to enterprise architecture
2656
+ 3. **Forward-Looking**: Based on the latest product features, adapting to technology trends
2657
+ 4. **Maintainability**: Provides a complete troubleshooting and design review system
2658
+
2659
+ ### Usage Recommendations
2660
+
2661
+ 1. **New Projects**: Build a design framework following the design philosophy chapter, reference enterprise patterns to choose the right architecture
2662
+ 2. **Existing Systems**: Use the design review checklist for system optimization and issue diagnosis
2663
+ 3. **Team Training**: Combine with actual business scenarios, study and practice chapter by chapter
2664
+ 4. **Continuous Optimization**: Regularly evaluate and adjust design strategies based on business development and data growth
2665
+
2666
+ **Best Practice Recommendation**: Strictly following the design principles and verified SQL syntax in this guide will significantly improve system performance, reduce operational complexity, and provide a reliable data infrastructure foundation for business growth.
2667
+
2668
+ ## References
2669
+
2670
+ [Create Table Syntax](create-table-ddl.md)
2671
+
2672
+ ***
2673
+
2674
+ *Note: This guide is based on testing results from the Singdata Lakehouse version as of May 2025. Subsequent versions may vary. Please check the official documentation regularly for the latest information.*