@clickzetta/cz-cli-darwin-x64 0.3.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/cz-cli +0 -0
- package/bin/skills/cz-cli/SKILL.md +58 -0
- package/bin/skills/cz-cli/references/profile-setup.md +88 -0
- package/bin/skills/cz-cli-inner/SKILL.md +96 -0
- package/bin/skills/dt-creator/SKILL.md +15 -0
- package/bin/skills/dt-creator/references/dt-declaration-strategy.md +185 -0
- package/bin/skills/dt-creator/references/incremental-config-reference.md +429 -0
- package/bin/skills/dt-creator/references/refresh-history-guide.md +268 -0
- package/bin/skills/dt-creator/references/sql-limitations.md +80 -0
- package/bin/skills/dynamic-table-alter/SKILL.md +190 -0
- package/bin/skills/lakehouse-doc/SKILL.md +107 -0
- package/bin/skills/lakehouse-doc/references/.gitattributes +1 -0
- package/bin/skills/lakehouse-doc/references/.gitlab-ci.yml +11 -0
- package/bin/skills/lakehouse-doc/references/702bc8f656.md +387 -0
- package/bin/skills/lakehouse-doc/references/774c65e217.md +136 -0
- package/bin/skills/lakehouse-doc/references/9164fed65a.md +1 -0
- package/bin/skills/lakehouse-doc/references/AIGateway.md +3 -0
- package/bin/skills/lakehouse-doc/references/AI_COMPLETE.md +157 -0
- package/bin/skills/lakehouse-doc/references/AI_EMBEDDING.md +70 -0
- package/bin/skills/lakehouse-doc/references/AI_Gateway.md +255 -0
- package/bin/skills/lakehouse-doc/references/AI_eco.md +1 -0
- package/bin/skills/lakehouse-doc/references/AI_function_in_SQL.md +25 -0
- package/bin/skills/lakehouse-doc/references/AI_function_overview.md +25 -0
- package/bin/skills/lakehouse-doc/references/ALTER-EXTERNAL-TABLE.md +78 -0
- package/bin/skills/lakehouse-doc/references/ALTER-SCHEMA.md +85 -0
- package/bin/skills/lakehouse-doc/references/ALTER-TABLE-COLUMN.md +223 -0
- package/bin/skills/lakehouse-doc/references/ALTERTABLE.md +92 -0
- package/bin/skills/lakehouse-doc/references/ARRAY.md +99 -0
- package/bin/skills/lakehouse-doc/references/Account.md +2 -0
- package/bin/skills/lakehouse-doc/references/Analysis.md +1 -0
- package/bin/skills/lakehouse-doc/references/AnalyticsModernDataStack.md +390 -0
- package/bin/skills/lakehouse-doc/references/Application_list.md +26 -0
- package/bin/skills/lakehouse-doc/references/Approval.md +1 -0
- package/bin/skills/lakehouse-doc/references/Approval_list.md +61 -0
- package/bin/skills/lakehouse-doc/references/BIGINT.md +49 -0
- package/bin/skills/lakehouse-doc/references/BINARY.md +104 -0
- package/bin/skills/lakehouse-doc/references/BOOLEAN.md +47 -0
- package/bin/skills/lakehouse-doc/references/BP_AI_Function_Image2text.md +203 -0
- package/bin/skills/lakehouse-doc/references/BluepipeOracleLakehouse_DataSync.md +244 -0
- package/bin/skills/lakehouse-doc/references/CHAR.md +38 -0
- package/bin/skills/lakehouse-doc/references/CONNECTION.md +1 -0
- package/bin/skills/lakehouse-doc/references/COPY-INTO-Location.md +404 -0
- package/bin/skills/lakehouse-doc/references/COPY_INTO_Location.md +371 -0
- package/bin/skills/lakehouse-doc/references/CREAREUSER.md +42 -0
- package/bin/skills/lakehouse-doc/references/CREATE-BLOOMFILTER-INDEX.md +138 -0
- package/bin/skills/lakehouse-doc/references/CREATECONNECTION.md +11 -0
- package/bin/skills/lakehouse-doc/references/CREATEEXTERNAlLSCHEMA.md +290 -0
- package/bin/skills/lakehouse-doc/references/CREATEMATERIALIZEDVIEW.md +518 -0
- package/bin/skills/lakehouse-doc/references/CREATEROLE.md +71 -0
- package/bin/skills/lakehouse-doc/references/CREATESCHEMA.md +40 -0
- package/bin/skills/lakehouse-doc/references/CREATEVIEW.md +63 -0
- package/bin/skills/lakehouse-doc/references/CREATE_EXTERNATL_FUNCTION.md +219 -0
- package/bin/skills/lakehouse-doc/references/CTERevenueCohort.md +275 -0
- package/bin/skills/lakehouse-doc/references/ComputeResourceDDL.md +1 -0
- package/bin/skills/lakehouse-doc/references/Concepts.md +1 -0
- package/bin/skills/lakehouse-doc/references/Create_Embeding_Function.md +236 -0
- package/bin/skills/lakehouse-doc/references/Create_LLM_Function.md +242 -0
- package/bin/skills/lakehouse-doc/references/CreditScoringwithZettaparkandPythonMLlibraryNew.md +873 -0
- package/bin/skills/lakehouse-doc/references/DATE.md +50 -0
- package/bin/skills/lakehouse-doc/references/DDL.md +1 -0
- package/bin/skills/lakehouse-doc/references/DECIMAL.md +27 -0
- package/bin/skills/lakehouse-doc/references/DELETE.md +48 -0
- package/bin/skills/lakehouse-doc/references/DESC-INDEX.md +41 -0
- package/bin/skills/lakehouse-doc/references/DESC-JOB.md +31 -0
- package/bin/skills/lakehouse-doc/references/DESCCONNECTION.md +39 -0
- package/bin/skills/lakehouse-doc/references/DESCMATERIALIZEDVIEW.md +31 -0
- package/bin/skills/lakehouse-doc/references/DESCSCHEMAS.md +59 -0
- package/bin/skills/lakehouse-doc/references/DESCTABLE.md +105 -0
- package/bin/skills/lakehouse-doc/references/DESCVIEW.md +66 -0
- package/bin/skills/lakehouse-doc/references/DOUBLE.md +58 -0
- package/bin/skills/lakehouse-doc/references/DQL.md +1 -0
- package/bin/skills/lakehouse-doc/references/DROP-INDEX.md +29 -0
- package/bin/skills/lakehouse-doc/references/DROPCONNECTION.md +32 -0
- package/bin/skills/lakehouse-doc/references/DROPMATERIALIZEDVIEW.md +42 -0
- package/bin/skills/lakehouse-doc/references/DROPROLE.md +56 -0
- package/bin/skills/lakehouse-doc/references/DROPSCHEMA.md +37 -0
- package/bin/skills/lakehouse-doc/references/DROPTABLE.md +46 -0
- package/bin/skills/lakehouse-doc/references/DROPUSER.md +34 -0
- package/bin/skills/lakehouse-doc/references/DROPVIEW.md +33 -0
- package/bin/skills/lakehouse-doc/references/DataQuality.md +99 -0
- package/bin/skills/lakehouse-doc/references/DataSourceConfigGuide.md +1 -0
- package/bin/skills/lakehouse-doc/references/DataSource_ADBMySQL.md +35 -0
- package/bin/skills/lakehouse-doc/references/DataSource_ADB_PostgreSQL.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_AMQP.md +37 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Amazon_DocumentDB.md +115 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Amazon_OpenSearch.md +42 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Aurora_MySQL.md +35 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Aurora_PostgreSQL.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_AutoMQ.md +124 -0
- package/bin/skills/lakehouse-doc/references/DataSource_COS.md +31 -0
- package/bin/skills/lakehouse-doc/references/DataSource_ClickHouse.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_DB2.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_DM.md +35 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Databricks.md +38 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Doris.md +34 -0
- package/bin/skills/lakehouse-doc/references/DataSource_DynamoDB.md +37 -0
- package/bin/skills/lakehouse-doc/references/DataSource_ElasticSearch.md +30 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Greenplum.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_HANA.md +53 -0
- package/bin/skills/lakehouse-doc/references/DataSource_HBase.md +29 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Hive.md +52 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Hologres.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Kafka.md +32 -0
- package/bin/skills/lakehouse-doc/references/DataSource_MariaDB.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_MaxCompute.md +32 -0
- package/bin/skills/lakehouse-doc/references/DataSource_MongoDB.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_MySQL.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_OSS.md +31 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Oracle.md +35 -0
- package/bin/skills/lakehouse-doc/references/DataSource_PorarDB.md +37 -0
- package/bin/skills/lakehouse-doc/references/DataSource_PostgreSQL.md +36 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Redis.md +55 -0
- package/bin/skills/lakehouse-doc/references/DataSource_Redshift.md +49 -0
- package/bin/skills/lakehouse-doc/references/DataSource_RestApi.md +30 -0
- package/bin/skills/lakehouse-doc/references/DataSource_S3.md +25 -0
- package/bin/skills/lakehouse-doc/references/DataSource_SLS.md +31 -0
- package/bin/skills/lakehouse-doc/references/DataSource_StarRocks.md +35 -0
- package/bin/skills/lakehouse-doc/references/DataSource_TiDB.md +36 -0
- package/bin/skills/lakehouse-doc/references/Datalake_StorageConnection.md +12 -0
- package/bin/skills/lakehouse-doc/references/Datasource_SQLServer.md +36 -0
- package/bin/skills/lakehouse-doc/references/Datus_Lakehouse_Integrated_Guide.md +3 -0
- package/bin/skills/lakehouse-doc/references/Datus_Lakehouse_MCPServer.md +111 -0
- package/bin/skills/lakehouse-doc/references/Dify_Integreated_with_LakehouseMCPServer.md +71 -0
- package/bin/skills/lakehouse-doc/references/ELTModernDataStack.md +497 -0
- package/bin/skills/lakehouse-doc/references/ELT_practice.md +1 -0
- package/bin/skills/lakehouse-doc/references/EXPLAIN.md +92 -0
- package/bin/skills/lakehouse-doc/references/EXTERNALFUNCITON.md +1 -0
- package/bin/skills/lakehouse-doc/references/EXTERNALFUNCTION.md +0 -0
- package/bin/skills/lakehouse-doc/references/EXTERNALFUNCTION/345/274/200/345/217/221/346/214/207/345/215/227.md +142 -0
- package/bin/skills/lakehouse-doc/references/EXTERNALSCHEMA.md +1 -0
- package/bin/skills/lakehouse-doc/references/EXTERNALSCHMEA.md +94 -0
- package/bin/skills/lakehouse-doc/references/ExternalFunctionDevGuideJava.md +556 -0
- package/bin/skills/lakehouse-doc/references/FLOAT.md +33 -0
- package/bin/skills/lakehouse-doc/references/FeatureEngineeringForExpandingCustomerFeatureswithZettapark.md +427 -0
- package/bin/skills/lakehouse-doc/references/FileCommand.md +1 -0
- package/bin/skills/lakehouse-doc/references/FileFunction.md +1 -0
- package/bin/skills/lakehouse-doc/references/FineBI.md +195 -0
- package/bin/skills/lakehouse-doc/references/Full_Text_Search.md +1 -0
- package/bin/skills/lakehouse-doc/references/GET.md +63 -0
- package/bin/skills/lakehouse-doc/references/GET_PRESIGNED_URL.md +91 -0
- package/bin/skills/lakehouse-doc/references/GrantPriveleges.md +113 -0
- package/bin/skills/lakehouse-doc/references/Hive_connection.md +50 -0
- package/bin/skills/lakehouse-doc/references/IDENTITY-Column.md +74 -0
- package/bin/skills/lakehouse-doc/references/INSERT.md +186 -0
- package/bin/skills/lakehouse-doc/references/INT.md +36 -0
- package/bin/skills/lakehouse-doc/references/INTERVAL.md +143 -0
- package/bin/skills/lakehouse-doc/references/Ingesting_Data_from_Alibaba_Cloud_Data_Lake_into_Lakehouse.md +976 -0
- package/bin/skills/lakehouse-doc/references/Ingestion.md +1 -0
- package/bin/skills/lakehouse-doc/references/JDBC-Driver.md +67 -0
- package/bin/skills/lakehouse-doc/references/JDBC_MindsDB_ML_LLM.md +237 -0
- package/bin/skills/lakehouse-doc/references/JOIN.md +204 -0
- package/bin/skills/lakehouse-doc/references/JSON.md +423 -0
- package/bin/skills/lakehouse-doc/references/JSON_DataType.md +49 -0
- package/bin/skills/lakehouse-doc/references/KAFKA_Storage_connection.md +1 -0
- package/bin/skills/lakehouse-doc/references/Kafka_connection.md +36 -0
- package/bin/skills/lakehouse-doc/references/Key_Concepts.md +112 -0
- package/bin/skills/lakehouse-doc/references/LATERALVIEW.md +78 -0
- package/bin/skills/lakehouse-doc/references/Lakehouse-client-repository.md +11 -0
- package/bin/skills/lakehouse-doc/references/LakehouseAI.md +0 -0
- package/bin/skills/lakehouse-doc/references/LakehouseAI_overview.md +16 -0
- package/bin/skills/lakehouse-doc/references/LakehouseAI/346/246/202/350/277/260.md +0 -0
- package/bin/skills/lakehouse-doc/references/LakehouseDataGPTTour.md +64 -0
- package/bin/skills/lakehouse-doc/references/LakehouseMCPServer.md +1 -0
- package/bin/skills/lakehouse-doc/references/LakehouseMCPServer_intro.md +493 -0
- package/bin/skills/lakehouse-doc/references/LakehousePythonZettapark.md +1 -0
- package/bin/skills/lakehouse-doc/references/LakehouseStudioTour.md +185 -0
- package/bin/skills/lakehouse-doc/references/Lakehouse_Index_Best_Practice.md +681 -0
- package/bin/skills/lakehouse-doc/references/Lakehouse_Insight.md +104 -0
- package/bin/skills/lakehouse-doc/references/Lakehouse_Platform_Release_Note.md +1 -0
- package/bin/skills/lakehouse-doc/references/Lakehouse_Studio_101.md +1 -0
- package/bin/skills/lakehouse-doc/references/Lakehouse_Studio_Release_Note.md +1 -0
- package/bin/skills/lakehouse-doc/references/Lakehouse_Zilliz_MakeDataReadyforBIandAI.md +228 -0
- package/bin/skills/lakehouse-doc/references/Langchain_plug_installation.md +244 -0
- package/bin/skills/lakehouse-doc/references/Langchain_plug_quick_start.md +225 -0
- package/bin/skills/lakehouse-doc/references/Langchain_plugins_overview.md +406 -0
- package/bin/skills/lakehouse-doc/references/Limitation.md +8 -0
- package/bin/skills/lakehouse-doc/references/LoggingIn.md +67 -0
- package/bin/skills/lakehouse-doc/references/Logstash.md +172 -0
- package/bin/skills/lakehouse-doc/references/MAP.md +42 -0
- package/bin/skills/lakehouse-doc/references/MATERIALIZEDVIEW.md +112 -0
- package/bin/skills/lakehouse-doc/references/MCPServers.md +267 -0
- package/bin/skills/lakehouse-doc/references/MERGE.md +498 -0
- package/bin/skills/lakehouse-doc/references/ManageAccounts.md +184 -0
- package/bin/skills/lakehouse-doc/references/ManagingFilesonDatalakeVolumewithZettapark.md +145 -0
- package/bin/skills/lakehouse-doc/references/MigrateSnowflakeRealtimeETLPipelinetoClickzettaLakehouse.md +865 -0
- package/bin/skills/lakehouse-doc/references/Migrate_Spark_DataEngineeringBestPractices_Project_to_Lakehouse.md +292 -0
- package/bin/skills/lakehouse-doc/references/ModernDataStackWithEcosystemTools.md +1 -0
- package/bin/skills/lakehouse-doc/references/N8N_AI_Workflow_Integration.md +1 -0
- package/bin/skills/lakehouse-doc/references/N8N_Integrated_with_LakehouseMCPServer.md +128 -0
- package/bin/skills/lakehouse-doc/references/Notebook.md +109 -0
- package/bin/skills/lakehouse-doc/references/NotesandGuidelinesforUsingPartitionTables.md +1627 -0
- package/bin/skills/lakehouse-doc/references/OPTIMIZE.md +80 -0
- package/bin/skills/lakehouse-doc/references/OptimizingComputingResources.md +1 -0
- package/bin/skills/lakehouse-doc/references/Overview.md +1 -0
- package/bin/skills/lakehouse-doc/references/PUT.md +70 -0
- package/bin/skills/lakehouse-doc/references/PerformingVectorandScalarRetrievalinheSameTableinLakehouse.md +84 -0
- package/bin/skills/lakehouse-doc/references/Permission_application.md +43 -0
- package/bin/skills/lakehouse-doc/references/PowerBI.md +113 -0
- package/bin/skills/lakehouse-doc/references/PythonSDKVersionHistory.md +24 -0
- package/bin/skills/lakehouse-doc/references/PythonSample_put_gharchive2oss.md +153 -0
- package/bin/skills/lakehouse-doc/references/PythonSample_put_github_rt_events.md +336 -0
- package/bin/skills/lakehouse-doc/references/PythonSqlAlchemyVersionHistory.md +23 -0
- package/bin/skills/lakehouse-doc/references/PythonTaskDev.md +1 -0
- package/bin/skills/lakehouse-doc/references/Python_Task.md +28 -0
- package/bin/skills/lakehouse-doc/references/Query_SnowflakeOpenCatalog_Icebergtable.md +114 -0
- package/bin/skills/lakehouse-doc/references/QuickStartwithCopycommand.md +79 -0
- package/bin/skills/lakehouse-doc/references/README.md +1 -0
- package/bin/skills/lakehouse-doc/references/REFRESH.md +37 -0
- package/bin/skills/lakehouse-doc/references/REMOTEFUNCTION.md +1 -0
- package/bin/skills/lakehouse-doc/references/RN_2023-08-07.md +67 -0
- package/bin/skills/lakehouse-doc/references/RN_2023-09-05.md +75 -0
- package/bin/skills/lakehouse-doc/references/RN_2023-09-18.md +50 -0
- package/bin/skills/lakehouse-doc/references/RN_2023-09-20.md +55 -0
- package/bin/skills/lakehouse-doc/references/RN_2023-10-25.md +102 -0
- package/bin/skills/lakehouse-doc/references/RN_2023-11-09.md +84 -0
- package/bin/skills/lakehouse-doc/references/RN_2023-12-25.md +84 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-01-05.md +78 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-02-05.md +87 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-03-22.md +76 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-04-10.md +61 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-04-16.md +38 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-05-10.md +47 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-05-15.md +43 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-05-24.md +143 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-06-06.md +59 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-06-07.md +71 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-06-27.md +55 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-07-22.md +121 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-07-24.md +58 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-08-07.md +42 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-09-26.md +106 -0
- package/bin/skills/lakehouse-doc/references/RN_2024-10-15.md +49 -0
- package/bin/skills/lakehouse-doc/references/RN_2024_11_11.md +50 -0
- package/bin/skills/lakehouse-doc/references/RN_2024_12_12.md +52 -0
- package/bin/skills/lakehouse-doc/references/RN_2024_12_25.md +40 -0
- package/bin/skills/lakehouse-doc/references/RN_2025-03-05.md +68 -0
- package/bin/skills/lakehouse-doc/references/RN_2025-04-01.md +74 -0
- package/bin/skills/lakehouse-doc/references/RN_2025-05-20.md +90 -0
- package/bin/skills/lakehouse-doc/references/RN_2025-07-03.md +100 -0
- package/bin/skills/lakehouse-doc/references/RN_2025-08-25.md +106 -0
- package/bin/skills/lakehouse-doc/references/RN_2025_03_03.md +141 -0
- package/bin/skills/lakehouse-doc/references/RN_2025_04_22.md +110 -0
- package/bin/skills/lakehouse-doc/references/RN_2025_07_15.md +60 -0
- package/bin/skills/lakehouse-doc/references/RN_2025_10_23.md +85 -0
- package/bin/skills/lakehouse-doc/references/RN_2025_10_30.md +68 -0
- package/bin/skills/lakehouse-doc/references/RN_2025_12_17.md +71 -0
- package/bin/skills/lakehouse-doc/references/RN_2026_1_30-2.0.0.md +43 -0
- package/bin/skills/lakehouse-doc/references/RN_LH_2025_12_30.md +73 -0
- package/bin/skills/lakehouse-doc/references/RN_LH_2026_03_13.md +47 -0
- package/bin/skills/lakehouse-doc/references/Refactor_ELT_practice.md +241 -0
- package/bin/skills/lakehouse-doc/references/RemoteFunctionAsUDF.md +1 -0
- package/bin/skills/lakehouse-doc/references/RemoteFunctionBestPractice.md +350 -0
- package/bin/skills/lakehouse-doc/references/RemoteFunctionDevGuidePython3.md +571 -0
- package/bin/skills/lakehouse-doc/references/RemoteFunctionOnACR.md +249 -0
- package/bin/skills/lakehouse-doc/references/RemoteFunctionintro.md +54 -0
- package/bin/skills/lakehouse-doc/references/RemoteFunction/344/273/213/347/273/215.md +1 -0
- package/bin/skills/lakehouse-doc/references/RemoteFunction/345/274/200/345/217/221/346/214/207/345/215/227Python3.md +1 -0
- package/bin/skills/lakehouse-doc/references/RemoteFunction/346/234/200/344/275/263/345/256/236/350/267/265.md +1 -0
- package/bin/skills/lakehouse-doc/references/RevokePriveleges.md +98 -0
- package/bin/skills/lakehouse-doc/references/SCHEMA.md +48 -0
- package/bin/skills/lakehouse-doc/references/SCHEMADDL.md +0 -0
- package/bin/skills/lakehouse-doc/references/SHOW-INDEX.md +22 -0
- package/bin/skills/lakehouse-doc/references/SHOWCONNECTIONS.md +44 -0
- package/bin/skills/lakehouse-doc/references/SHOWFUNCTIONS.md +38 -0
- package/bin/skills/lakehouse-doc/references/SHOWGRANTS.md +62 -0
- package/bin/skills/lakehouse-doc/references/SHOWROLES.md +59 -0
- package/bin/skills/lakehouse-doc/references/SHOWTABLES.md +46 -0
- package/bin/skills/lakehouse-doc/references/SHOWUSERS.md +26 -0
- package/bin/skills/lakehouse-doc/references/SMALLINT.md +48 -0
- package/bin/skills/lakehouse-doc/references/SQL_CREATE_TABLE_GUIDE.md +1210 -0
- package/bin/skills/lakehouse-doc/references/SQL_DML_Considerations.md +601 -0
- package/bin/skills/lakehouse-doc/references/SQL_Join_Guide.md +655 -0
- package/bin/skills/lakehouse-doc/references/SQL_SELECT_Considerations.md +1818 -0
- package/bin/skills/lakehouse-doc/references/SQL_With_CTE_Guide.md +1510 -0
- package/bin/skills/lakehouse-doc/references/SQL_customers.md +74 -0
- package/bin/skills/lakehouse-doc/references/SQL_revenue.md +31 -0
- package/bin/skills/lakehouse-doc/references/STRING.md +80 -0
- package/bin/skills/lakehouse-doc/references/STRUCT.md +33 -0
- package/bin/skills/lakehouse-doc/references/SUMMARY.md +1279 -0
- package/bin/skills/lakehouse-doc/references/Security_system_inventory_and_optimization_based_Information_Schema.md +412 -0
- package/bin/skills/lakehouse-doc/references/Server_data_for_AI.md +15 -0
- package/bin/skills/lakehouse-doc/references/SlowlyChangingDimensionsInLakehouseUsingStreamsandTasks.md +616 -0
- package/bin/skills/lakehouse-doc/references/Spark_Lakehouse_iceberg_REST.md +151 -0
- package/bin/skills/lakehouse-doc/references/StudioDI_PrivateLinkVPC_fromRDS.md +105 -0
- package/bin/skills/lakehouse-doc/references/Supported_Cloud_Platforms.md +40 -0
- package/bin/skills/lakehouse-doc/references/TABLE.md +49 -0
- package/bin/skills/lakehouse-doc/references/TIMESTAMP.md +56 -0
- package/bin/skills/lakehouse-doc/references/TIMETRAVEL.md +207 -0
- package/bin/skills/lakehouse-doc/references/TINYINT.md +63 -0
- package/bin/skills/lakehouse-doc/references/TPC-H100G_experience.md +49 -0
- package/bin/skills/lakehouse-doc/references/TRUNCATE.md +144 -0
- package/bin/skills/lakehouse-doc/references/TableDesign.md +270 -0
- package/bin/skills/lakehouse-doc/references/TableauConnectToLakehouse.md +64 -0
- package/bin/skills/lakehouse-doc/references/Tutorials.md +1 -0
- package/bin/skills/lakehouse-doc/references/UNDROP-TABLE.md +163 -0
- package/bin/skills/lakehouse-doc/references/UPDATE.md +70 -0
- package/bin/skills/lakehouse-doc/references/USESCHEMA.md +53 -0
- package/bin/skills/lakehouse-doc/references/UnifiedWorkflowIntro.md +31 -0
- package/bin/skills/lakehouse-doc/references/UnifiedWorkflow_demo.md +175 -0
- package/bin/skills/lakehouse-doc/references/Unstructured_io.md +735 -0
- package/bin/skills/lakehouse-doc/references/VARCHARleghth.md +42 -0
- package/bin/skills/lakehouse-doc/references/VIEW.md +47 -0
- package/bin/skills/lakehouse-doc/references/Volume_LIST.md +52 -0
- package/bin/skills/lakehouse-doc/references/WINDOWFUNCTION.md +561 -0
- package/bin/skills/lakehouse-doc/references/WITH.md +41 -0
- package/bin/skills/lakehouse-doc/references/ZettaparkQuickStart.md +453 -0
- package/bin/skills/lakehouse-doc/references/Zettapark_Data_Engineering_Demo.md +348 -0
- package/bin/skills/lakehouse-doc/references/a_comprehensive_guide_to_ingesting_data_into_clickzetta_lakehouse.md +66 -0
- package/bin/skills/lakehouse-doc/references/access-control-configration.md +249 -0
- package/bin/skills/lakehouse-doc/references/access-control-general.md +82 -0
- package/bin/skills/lakehouse-doc/references/access-control.md +240 -0
- package/bin/skills/lakehouse-doc/references/account_user_management.md +105 -0
- package/bin/skills/lakehouse-doc/references/accountfunds.md +87 -0
- package/bin/skills/lakehouse-doc/references/agg_function.md +1 -0
- package/bin/skills/lakehouse-doc/references/ai_ready_data_overview.md +13 -0
- package/bin/skills/lakehouse-doc/references/airbyte.md +95 -0
- package/bin/skills/lakehouse-doc/references/alert.md +143 -0
- package/bin/skills/lakehouse-doc/references/alicloud-arn-externalid.md +51 -0
- package/bin/skills/lakehouse-doc/references/alicloud_byos_configration.md +129 -0
- package/bin/skills/lakehouse-doc/references/aliyun_storage_connection.md +135 -0
- package/bin/skills/lakehouse-doc/references/alter-dynamic-table.md +375 -0
- package/bin/skills/lakehouse-doc/references/alter-external-schema.md +20 -0
- package/bin/skills/lakehouse-doc/references/alter-materialzied-view.md +238 -0
- package/bin/skills/lakehouse-doc/references/alter-share.md +43 -0
- package/bin/skills/lakehouse-doc/references/alter-user.md +13 -0
- package/bin/skills/lakehouse-doc/references/alter-vcluster.md +134 -0
- package/bin/skills/lakehouse-doc/references/alter-worksapce.md +55 -0
- package/bin/skills/lakehouse-doc/references/alter.md +35 -0
- package/bin/skills/lakehouse-doc/references/analysis_internet_data_nyc_green_data.md +449 -0
- package/bin/skills/lakehouse-doc/references/analytics_cluster_best_practices.md +377 -0
- package/bin/skills/lakehouse-doc/references/analyze-table.md +58 -0
- package/bin/skills/lakehouse-doc/references/array_size.md +34 -0
- package/bin/skills/lakehouse-doc/references/authentication.md +53 -0
- package/bin/skills/lakehouse-doc/references/authoritymanagement.md +1 -0
- package/bin/skills/lakehouse-doc/references/auto-index.md +57 -0
- package/bin/skills/lakehouse-doc/references/aws_storage_connection.md +114 -0
- package/bin/skills/lakehouse-doc/references/backfilling_data.md +60 -0
- package/bin/skills/lakehouse-doc/references/batch_sync.md +54 -0
- package/bin/skills/lakehouse-doc/references/batch_sync_Sop.md +135 -0
- package/bin/skills/lakehouse-doc/references/batchloadparquertfileintoLakehouse.md +79 -0
- package/bin/skills/lakehouse-doc/references/bestpractice_bazhuanyu.md +1 -0
- package/bin/skills/lakehouse-doc/references/billing.md +62 -0
- package/bin/skills/lakehouse-doc/references/bitmap-type.md +524 -0
- package/bin/skills/lakehouse-doc/references/bitmap_uba_guide.md +1190 -0
- package/bin/skills/lakehouse-doc/references/bloomfilter-summary.md +164 -0
- package/bin/skills/lakehouse-doc/references/book.json +17 -0
- package/bin/skills/lakehouse-doc/references/bring_your_own_storage.md +1 -0
- package/bin/skills/lakehouse-doc/references/build-inverted-index.md +27 -0
- package/bin/skills/lakehouse-doc/references/build_rag_with_langchain.md +616 -0
- package/bin/skills/lakehouse-doc/references/bulkload-summary.md +37 -0
- package/bin/skills/lakehouse-doc/references/bulkloadv1-java-sdk.md +178 -0
- package/bin/skills/lakehouse-doc/references/bulkloadv1-python-sdk.md +169 -0
- package/bin/skills/lakehouse-doc/references/byos_general.md +165 -0
- package/bin/skills/lakehouse-doc/references/byos_tencentcloud_configration.md +138 -0
- package/bin/skills/lakehouse-doc/references/cache-command.md +39 -0
- package/bin/skills/lakehouse-doc/references/cancel-job.md +51 -0
- package/bin/skills/lakehouse-doc/references/cardinality_array.md +45 -0
- package/bin/skills/lakehouse-doc/references/charge_analysis_with_lakehouse_mcp_server.md +393 -0
- package/bin/skills/lakehouse-doc/references/clone-doc.md +111 -0
- package/bin/skills/lakehouse-doc/references/cloud_object_storage.md +1 -0
- package/bin/skills/lakehouse-doc/references/cluster-table-guide.md +68 -0
- package/bin/skills/lakehouse-doc/references/cluster-table.md +64 -0
- package/bin/skills/lakehouse-doc/references/composite_task.md +178 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_3rd_tools.md +11 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_dbv_sql_put.md +48 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_environment_and_data_generate.md +627 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_javasdk_buckload_realtime.md +740 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_kafka_realtime_sync.md +71 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_local_file_into_table_by_studio.md +64 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_overview.md +66 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_pipe_kafka.md +7 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_pipe_oss.md +7 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_studio_batchload_public_network.md +52 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_studio_python_node.md +111 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_studio_realtime_cdc_public_network.md +249 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_studio_sql_insert.md +180 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_zettapark_put_file_to_lake.md +206 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_zettapark_save_as_table.md +79 -0
- package/bin/skills/lakehouse-doc/references/comprehensive_guide_to_ingesting_zettapark_sql_insert.md +113 -0
- package/bin/skills/lakehouse-doc/references/computation.md +6 -0
- package/bin/skills/lakehouse-doc/references/concurrency_scaling.md +57 -0
- package/bin/skills/lakehouse-doc/references/config-datasource.md +79 -0
- package/bin/skills/lakehouse-doc/references/config_volume_dify_storage.md +254 -0
- package/bin/skills/lakehouse-doc/references/connect-with-cli.md +155 -0
- package/bin/skills/lakehouse-doc/references/connect_to_Lakehouse.md +1 -0
- package/bin/skills/lakehouse-doc/references/connection-guide.md +61 -0
- package/bin/skills/lakehouse-doc/references/continue-job.md +260 -0
- package/bin/skills/lakehouse-doc/references/conversational_analytics_datagpt.md +1 -0
- package/bin/skills/lakehouse-doc/references/copy-into-table.md +398 -0
- package/bin/skills/lakehouse-doc/references/cos_storage_connection.md +73 -0
- package/bin/skills/lakehouse-doc/references/cos_volume_creation.md +39 -0
- package/bin/skills/lakehouse-doc/references/cost_management.md +1 -0
- package/bin/skills/lakehouse-doc/references/create-api-connection.md +363 -0
- package/bin/skills/lakehouse-doc/references/create-catalog-connection.md +220 -0
- package/bin/skills/lakehouse-doc/references/create-dynamic-table.md +910 -0
- package/bin/skills/lakehouse-doc/references/create-external-catalog.md +305 -0
- package/bin/skills/lakehouse-doc/references/create-external-table.md +78 -0
- package/bin/skills/lakehouse-doc/references/create-hive-catalog.md +77 -0
- package/bin/skills/lakehouse-doc/references/create-inverted-index.md +163 -0
- package/bin/skills/lakehouse-doc/references/create-kafka-external.md +300 -0
- package/bin/skills/lakehouse-doc/references/create-schema-from-share.md +45 -0
- package/bin/skills/lakehouse-doc/references/create-share.md +46 -0
- package/bin/skills/lakehouse-doc/references/create-sql-function.md +120 -0
- package/bin/skills/lakehouse-doc/references/create-storage-connection.md +346 -0
- package/bin/skills/lakehouse-doc/references/create-synonym.md +75 -0
- package/bin/skills/lakehouse-doc/references/create-table-ddl.md +405 -0
- package/bin/skills/lakehouse-doc/references/create-table-stream.md +226 -0
- package/bin/skills/lakehouse-doc/references/create-vector-index.md +115 -0
- package/bin/skills/lakehouse-doc/references/create.md +46 -0
- package/bin/skills/lakehouse-doc/references/create_cluster.md +121 -0
- package/bin/skills/lakehouse-doc/references/creating_alicloud_privatelinkendpoint.md +37 -0
- package/bin/skills/lakehouse-doc/references/creating_alicloud_privatelinkservice.md +31 -0
- package/bin/skills/lakehouse-doc/references/creating_tencentcloud_privatelinkendpoint.md +33 -0
- package/bin/skills/lakehouse-doc/references/creating_tencentcloud_privatelinkservice.md +19 -0
- package/bin/skills/lakehouse-doc/references/czguide-intro-to-cdc-using-clickzetta-rtsync-dynamic-tables.md +717 -0
- package/bin/skills/lakehouse-doc/references/data-catalog.md +1 -0
- package/bin/skills/lakehouse-doc/references/data-integration-intro.md +60 -0
- package/bin/skills/lakehouse-doc/references/data-integration.md +10 -0
- package/bin/skills/lakehouse-doc/references/data-lifecycle.md +46 -0
- package/bin/skills/lakehouse-doc/references/data-load-summary.md +71 -0
- package/bin/skills/lakehouse-doc/references/data-mamager-tool.md +1 -0
- package/bin/skills/lakehouse-doc/references/data-recover.md +52 -0
- package/bin/skills/lakehouse-doc/references/data-type.md +1 -0
- package/bin/skills/lakehouse-doc/references/data-types-timestamp-ntz.md +139 -0
- package/bin/skills/lakehouse-doc/references/data.md +1 -0
- package/bin/skills/lakehouse-doc/references/data_catalog.md +106 -0
- package/bin/skills/lakehouse-doc/references/data_clean_with_sql.md +406 -0
- package/bin/skills/lakehouse-doc/references/data_ecosystem.md +1 -0
- package/bin/skills/lakehouse-doc/references/data_ops.md +7 -0
- package/bin/skills/lakehouse-doc/references/data_org.md +1 -0
- package/bin/skills/lakehouse-doc/references/data_privacy.md +50 -0
- package/bin/skills/lakehouse-doc/references/data_result_profile.md +22 -0
- package/bin/skills/lakehouse-doc/references/data_security.md +1 -0
- package/bin/skills/lakehouse-doc/references/data_sharing_between_accounts_guide.md +331 -0
- package/bin/skills/lakehouse-doc/references/data_transfer_datalake.md +1 -0
- package/bin/skills/lakehouse-doc/references/data_visualization.md +96 -0
- package/bin/skills/lakehouse-doc/references/databricks_yunqi_integration_guide_v2.md +811 -0
- package/bin/skills/lakehouse-doc/references/datagpt_bestpractice.md +1 -0
- package/bin/skills/lakehouse-doc/references/datagpt_data_source.md +58 -0
- package/bin/skills/lakehouse-doc/references/datagpt_get_accurate_answers.md +34 -0
- package/bin/skills/lakehouse-doc/references/datagpt_intro.md +1 -0
- package/bin/skills/lakehouse-doc/references/datagpt_quickstart.md +94 -0
- package/bin/skills/lakehouse-doc/references/datagpt_tutorial.md +1 -0
- package/bin/skills/lakehouse-doc/references/datalake_FAQ.md +54 -0
- package/bin/skills/lakehouse-doc/references/datalake_overview.md +17 -0
- package/bin/skills/lakehouse-doc/references/datalake_privilege.md +55 -0
- package/bin/skills/lakehouse-doc/references/datalake_query_ingest.md +18 -0
- package/bin/skills/lakehouse-doc/references/datalake_unstructure_data.md +3 -0
- package/bin/skills/lakehouse-doc/references/datalake_volume.md +92 -0
- package/bin/skills/lakehouse-doc/references/datalake_volume_anlytics.md +1 -0
- package/bin/skills/lakehouse-doc/references/datalake_volume_object.md +1 -0
- package/bin/skills/lakehouse-doc/references/dataops_practice.md +105 -0
- package/bin/skills/lakehouse-doc/references/datasharing.md +322 -0
- package/bin/skills/lakehouse-doc/references/datasharing_catalog.md +1 -0
- package/bin/skills/lakehouse-doc/references/datasource_ip_whitelist.md +93 -0
- package/bin/skills/lakehouse-doc/references/datasources.md +62 -0
- package/bin/skills/lakehouse-doc/references/datatype-cast.md +105 -0
- package/bin/skills/lakehouse-doc/references/datatype-conversion.md +85 -0
- package/bin/skills/lakehouse-doc/references/datetime_patterns.md +61 -0
- package/bin/skills/lakehouse-doc/references/datus_lakehouse_installation.md +376 -0
- package/bin/skills/lakehouse-doc/references/datus_lakehouse_solution_overview.md +148 -0
- package/bin/skills/lakehouse-doc/references/db_dw_connection.md +1 -0
- package/bin/skills/lakehouse-doc/references/default-value.md +89 -0
- package/bin/skills/lakehouse-doc/references/delta-lake.md +185 -0
- package/bin/skills/lakehouse-doc/references/desc-catalog-table.md +33 -0
- package/bin/skills/lakehouse-doc/references/desc-catalog.md +31 -0
- package/bin/skills/lakehouse-doc/references/desc-dynamic-table.md +62 -0
- package/bin/skills/lakehouse-doc/references/desc-external-schemas.md +66 -0
- package/bin/skills/lakehouse-doc/references/desc-external-table.md +70 -0
- package/bin/skills/lakehouse-doc/references/desc-function.md +16 -0
- package/bin/skills/lakehouse-doc/references/desc-history-dynamic-table.md +50 -0
- package/bin/skills/lakehouse-doc/references/desc-history-table.md +73 -0
- package/bin/skills/lakehouse-doc/references/desc-history.md +73 -0
- package/bin/skills/lakehouse-doc/references/desc-share.md +59 -0
- package/bin/skills/lakehouse-doc/references/desc-table-stream.md +44 -0
- package/bin/skills/lakehouse-doc/references/desc-vcluster.md +42 -0
- package/bin/skills/lakehouse-doc/references/describe.md +38 -0
- package/bin/skills/lakehouse-doc/references/dify_config_lakehouse_as_vectordb.md +286 -0
- package/bin/skills/lakehouse-doc/references/dify_yunqilakehouse_integration_overview.md +188 -0
- package/bin/skills/lakehouse-doc/references/discovery_analysis_data_in_json_file_on_external_volume.md +625 -0
- package/bin/skills/lakehouse-doc/references/discovery_analysis_data_in_parquert_file_on_external_volume.md +599 -0
- package/bin/skills/lakehouse-doc/references/download-data.md +1 -0
- package/bin/skills/lakehouse-doc/references/drop-dynamic-table.md +46 -0
- package/bin/skills/lakehouse-doc/references/drop-external-schema.md +44 -0
- package/bin/skills/lakehouse-doc/references/drop-external-table.md +43 -0
- package/bin/skills/lakehouse-doc/references/drop-function.md +36 -0
- package/bin/skills/lakehouse-doc/references/drop-share.md +43 -0
- package/bin/skills/lakehouse-doc/references/drop-synonym.md +35 -0
- package/bin/skills/lakehouse-doc/references/drop-table-stream.md +42 -0
- package/bin/skills/lakehouse-doc/references/drop-vcluster.md +52 -0
- package/bin/skills/lakehouse-doc/references/drop.md +32 -0
- package/bin/skills/lakehouse-doc/references/dynamic-mask.md +201 -0
- package/bin/skills/lakehouse-doc/references/dynamic-table-incre.md +62 -0
- package/bin/skills/lakehouse-doc/references/dynamic-table-introduce.md +339 -0
- package/bin/skills/lakehouse-doc/references/dynamicTable-DML-sql.md +52 -0
- package/bin/skills/lakehouse-doc/references/dynamicTable-dml.md +48 -0
- package/bin/skills/lakehouse-doc/references/dynamicTable-parmaters.md +425 -0
- package/bin/skills/lakehouse-doc/references/dynamic_table_summary.md +366 -0
- package/bin/skills/lakehouse-doc/references/dynamic_table_task.md +73 -0
- package/bin/skills/lakehouse-doc/references/dynamic_table_using_studio.md +159 -0
- package/bin/skills/lakehouse-doc/references/dynamictable.md +56 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/Zeppelin.md +84 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/airbyte.md +75 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/datagrip-lakehouse.md +56 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/datax.md +154 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/dbeaver-lakehouse.md +67 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/dbt.md +139 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/rath.md +87 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/sqlline.md +82 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/sqlworkbench-j-lakehouse.md +54 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/streamlit.md +117 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/superset.md +109 -0
- package/bin/skills/lakehouse-doc/references/eco_integration/trino.md +75 -0
- package/bin/skills/lakehouse-doc/references/ecosystem-all.md +24 -0
- package/bin/skills/lakehouse-doc/references/export_data_with_data-integration.md +3 -0
- package/bin/skills/lakehouse-doc/references/external-Volume.md +10 -0
- package/bin/skills/lakehouse-doc/references/external-catalog-summary.md +42 -0
- package/bin/skills/lakehouse-doc/references/external-function-summary.md +0 -0
- package/bin/skills/lakehouse-doc/references/external-hudi-table.md +187 -0
- package/bin/skills/lakehouse-doc/references/external-table-guide.md +96 -0
- package/bin/skills/lakehouse-doc/references/external_object_user_guide.md +298 -0
- package/bin/skills/lakehouse-doc/references/external_volume.md +1 -0
- package/bin/skills/lakehouse-doc/references/f6fc6447ee.md +151 -0
- package/bin/skills/lakehouse-doc/references/federation-query.md +1 -0
- package/bin/skills/lakehouse-doc/references/finebi-mysql.md +104 -0
- package/bin/skills/lakehouse-doc/references/flink-write-connector.md +695 -0
- package/bin/skills/lakehouse-doc/references/from_lakehouse_to_volume.md +98 -0
- package/bin/skills/lakehouse-doc/references/from_volume_to_table.md +45 -0
- package/bin/skills/lakehouse-doc/references/fulltext_indexes_guide.md +1180 -0
- package/bin/skills/lakehouse-doc/references/generated-column.md +113 -0
- package/bin/skills/lakehouse-doc/references/generated_columns_guide.md +847 -0
- package/bin/skills/lakehouse-doc/references/geospatial_analysis.md +558 -0
- package/bin/skills/lakehouse-doc/references/get-started-with-sample-data.md +83 -0
- package/bin/skills/lakehouse-doc/references/getting_started_with_vcluster_for_processing_analytics.md +471 -0
- package/bin/skills/lakehouse-doc/references/grant-to-share.md +55 -0
- package/bin/skills/lakehouse-doc/references/grant-user-privileges.md +100 -0
- package/bin/skills/lakehouse-doc/references/groupby.md +1260 -0
- package/bin/skills/lakehouse-doc/references/guides-overview-connecting.md +46 -0
- package/bin/skills/lakehouse-doc/references/ide.md +2 -0
- package/bin/skills/lakehouse-doc/references/ifnull.md +72 -0
- package/bin/skills/lakehouse-doc/references/ilike.md +89 -0
- package/bin/skills/lakehouse-doc/references/import_data_with_data-integration.md +3 -0
- package/bin/skills/lakehouse-doc/references/importdatabypythonintoLakehouse.md +134 -0
- package/bin/skills/lakehouse-doc/references/index2.md +1 -0
- package/bin/skills/lakehouse-doc/references/instance-informaiton-schema-summary.md +51 -0
- package/bin/skills/lakehouse-doc/references/instance-informaiton-schema.md +278 -0
- package/bin/skills/lakehouse-doc/references/instance-information_schema.md +1 -0
- package/bin/skills/lakehouse-doc/references/internal_volume.md +271 -0
- package/bin/skills/lakehouse-doc/references/intro-supported-features.md +48 -0
- package/bin/skills/lakehouse-doc/references/inverted-index.md +445 -0
- package/bin/skills/lakehouse-doc/references/inverted_idx_bm25_param.md +251 -0
- package/bin/skills/lakehouse-doc/references/inverted_idx_multi-match.md +89 -0
- package/bin/skills/lakehouse-doc/references/is-null.md +59 -0
- package/bin/skills/lakehouse-doc/references/it-operation-management.md +1 -0
- package/bin/skills/lakehouse-doc/references/java_reference/client.md +49 -0
- package/bin/skills/lakehouse-doc/references/java_reference/java-sdk-release-notes.md +32 -0
- package/bin/skills/lakehouse-doc/references/java_reference/java-sdk-summary.md +138 -0
- package/bin/skills/lakehouse-doc/references/java_reference/jdbc.md +211 -0
- package/bin/skills/lakehouse-doc/references/java_reference/realtime-upload.md +295 -0
- package/bin/skills/lakehouse-doc/references/jdbc_task.md +37 -0
- package/bin/skills/lakehouse-doc/references/job-manage.md +1 -0
- package/bin/skills/lakehouse-doc/references/job_history_analysis_with_information_schema.md +597 -0
- package/bin/skills/lakehouse-doc/references/jobprofile-bestpractices.md +104 -0
- package/bin/skills/lakehouse-doc/references/json_analyze.md +422 -0
- package/bin/skills/lakehouse-doc/references/json_data_process_guide.md +881 -0
- package/bin/skills/lakehouse-doc/references/json_guide_for_complex_biz_cases.md +1899 -0
- package/bin/skills/lakehouse-doc/references/kafka-external-table.md +103 -0
- package/bin/skills/lakehouse-doc/references/lakehouse-ai.md +1 -0
- package/bin/skills/lakehouse-doc/references/lakehouse-quick-experience_guide.md +964 -0
- package/bin/skills/lakehouse-doc/references/lakehouse-table-stream-best-practices.md +500 -0
- package/bin/skills/lakehouse-doc/references/lakehouse_billing_anomaly_alert_configuration_guide.md +226 -0
- package/bin/skills/lakehouse-doc/references/lakehouse_instance_overview.md +39 -0
- package/bin/skills/lakehouse-doc/references/lakehouse_table_design_guide.md +2676 -0
- package/bin/skills/lakehouse-doc/references/langchain.md +71 -0
- package/bin/skills/lakehouse-doc/references/langchain_basic_samples.md +606 -0
- package/bin/skills/lakehouse-doc/references/langchain_integration.md +1 -0
- package/bin/skills/lakehouse-doc/references/left.md +51 -0
- package/bin/skills/lakehouse-doc/references/like.md +115 -0
- package/bin/skills/lakehouse-doc/references/list-partition.md +121 -0
- package/bin/skills/lakehouse-doc/references/llama-index.md +57 -0
- package/bin/skills/lakehouse-doc/references/llms-full.txt +1286 -0
- package/bin/skills/lakehouse-doc/references/llms.txt +71 -0
- package/bin/skills/lakehouse-doc/references/load-data-local.md +82 -0
- package/bin/skills/lakehouse-doc/references/load-data-oss.md +174 -0
- package/bin/skills/lakehouse-doc/references/management.md +5 -0
- package/bin/skills/lakehouse-doc/references/managing-instance.md +67 -0
- package/bin/skills/lakehouse-doc/references/mapjoin.md +62 -0
- package/bin/skills/lakehouse-doc/references/materialized_ddl.md +1 -0
- package/bin/skills/lakehouse-doc/references/meta-objects-and-privileges.md +271 -0
- package/bin/skills/lakehouse-doc/references/metabase.md +73 -0
- package/bin/skills/lakehouse-doc/references/metadata_show_desc_command_guide.md +711 -0
- package/bin/skills/lakehouse-doc/references/metrics_answer_build.md +46 -0
- package/bin/skills/lakehouse-doc/references/mindsdb.md +269 -0
- package/bin/skills/lakehouse-doc/references/monitoring_and_alerting.md +177 -0
- package/bin/skills/lakehouse-doc/references/monitoring_item_specification.md +44 -0
- package/bin/skills/lakehouse-doc/references/multi_cloud_instance_manage_with_mcp_server.md +281 -0
- package/bin/skills/lakehouse-doc/references/multitable_batch_sync.md +463 -0
- package/bin/skills/lakehouse-doc/references/multitable_realtime_sync.md +412 -0
- package/bin/skills/lakehouse-doc/references/multitable_realtime_sync_sop.md +593 -0
- package/bin/skills/lakehouse-doc/references/n8n_Integreated_with_lakehouse_mcp_server.md +494 -0
- package/bin/skills/lakehouse-doc/references/navicat-mysql.md +65 -0
- package/bin/skills/lakehouse-doc/references/network_policy.md +281 -0
- package/bin/skills/lakehouse-doc/references/nyc_green_taxi_data_clean_transform_with_mcp_server.md +315 -0
- package/bin/skills/lakehouse-doc/references/object-model-overview.md +70 -0
- package/bin/skills/lakehouse-doc/references/object_identifier.md +259 -0
- package/bin/skills/lakehouse-doc/references/object_model_design.md +1 -0
- package/bin/skills/lakehouse-doc/references/opensource/travel.md +134 -0
- package/bin/skills/lakehouse-doc/references/operation-maintenance.md +172 -0
- package/bin/skills/lakehouse-doc/references/oss_volume_creation.md +39 -0
- package/bin/skills/lakehouse-doc/references/partition_table.md +344 -0
- package/bin/skills/lakehouse-doc/references/partition_table_guide.md +340 -0
- package/bin/skills/lakehouse-doc/references/performance_optimization.md +1 -0
- package/bin/skills/lakehouse-doc/references/performence_test.md +1 -0
- package/bin/skills/lakehouse-doc/references/permissions-of-built-in-workspace-level-roles.md +131 -0
- package/bin/skills/lakehouse-doc/references/pipe-kafka-bestpractice-1.md +431 -0
- package/bin/skills/lakehouse-doc/references/pipe-kafka-table-stream.md +180 -0
- package/bin/skills/lakehouse-doc/references/pipe-kafka.md +210 -0
- package/bin/skills/lakehouse-doc/references/pipe-storage-object.md +247 -0
- package/bin/skills/lakehouse-doc/references/pipe-summary.md +114 -0
- package/bin/skills/lakehouse-doc/references/pipe-syntax.md +200 -0
- package/bin/skills/lakehouse-doc/references/practice_data_analysis.md +1 -0
- package/bin/skills/lakehouse-doc/references/practice_data_import_and_export.md +1 -0
- package/bin/skills/lakehouse-doc/references/practice_python_task.md +157 -0
- package/bin/skills/lakehouse-doc/references/pricing.md +225 -0
- package/bin/skills/lakehouse-doc/references/primary key.md +86 -0
- package/bin/skills/lakehouse-doc/references/primary-key.md +187 -0
- package/bin/skills/lakehouse-doc/references/privacy-policy.md +364 -0
- package/bin/skills/lakehouse-doc/references/private-link-general.md +68 -0
- package/bin/skills/lakehouse-doc/references/private_link.md +1 -0
- package/bin/skills/lakehouse-doc/references/product-trial-agreement.md +99 -0
- package/bin/skills/lakehouse-doc/references/product_concept.md +1 -0
- package/bin/skills/lakehouse-doc/references/put-get.md +1 -0
- package/bin/skills/lakehouse-doc/references/put_get_volume.md +3 -0
- package/bin/skills/lakehouse-doc/references/python-igs.md +297 -0
- package/bin/skills/lakehouse-doc/references/python_package_install_import_guide.md +53 -0
- package/bin/skills/lakehouse-doc/references/python_reference/connector.md +281 -0
- package/bin/skills/lakehouse-doc/references/python_reference/python-sdk-summary.md +13 -0
- package/bin/skills/lakehouse-doc/references/python_reference/sqlalchemy.md +77 -0
- package/bin/skills/lakehouse-doc/references/python_shell_datasource.md +334 -0
- package/bin/skills/lakehouse-doc/references/query-json-sy.md +47 -0
- package/bin/skills/lakehouse-doc/references/query-syntax.md +234 -0
- package/bin/skills/lakehouse-doc/references/quick_start_batch_sync_data.md +116 -0
- package/bin/skills/lakehouse-doc/references/quick_start_bi_analysis.md +589 -0
- package/bin/skills/lakehouse-doc/references/quick_start_create_workspace.md +58 -0
- package/bin/skills/lakehouse-doc/references/quick_start_data_quality.md +75 -0
- package/bin/skills/lakehouse-doc/references/quick_start_etl.md +131 -0
- package/bin/skills/lakehouse-doc/references/quick_start_monitoring_and_alerting.md +93 -0
- package/bin/skills/lakehouse-doc/references/quick_start_sql_query.md +93 -0
- package/bin/skills/lakehouse-doc/references/quick_start_upload_data.md +69 -0
- package/bin/skills/lakehouse-doc/references/quick_start_user_management.md +73 -0
- package/bin/skills/lakehouse-doc/references/quick_start_workspace.md +72 -0
- package/bin/skills/lakehouse-doc/references/quick_start_workspace_user.md +67 -0
- package/bin/skills/lakehouse-doc/references/quickstart_datashare_between_companies.md +249 -0
- package/bin/skills/lakehouse-doc/references/quickstart_envirment_for_team.md +271 -0
- package/bin/skills/lakehouse-doc/references/quickstart_local_csv.md +99 -0
- package/bin/skills/lakehouse-doc/references/realtime_sync.md +36 -0
- package/bin/skills/lakehouse-doc/references/realtime_sync_and_analysis_practice.md +187 -0
- package/bin/skills/lakehouse-doc/references/realtimesync_m.md +190 -0
- package/bin/skills/lakehouse-doc/references/refresh-history.md +63 -0
- package/bin/skills/lakehouse-doc/references/regexp-statement.md +80 -0
- package/bin/skills/lakehouse-doc/references/releasenotes.md +1 -0
- package/bin/skills/lakehouse-doc/references/releasenotesupdata.md +1 -0
- package/bin/skills/lakehouse-doc/references/remove-volume.md +32 -0
- package/bin/skills/lakehouse-doc/references/restore-dynamic-table.md +126 -0
- package/bin/skills/lakehouse-doc/references/restore.md +127 -0
- package/bin/skills/lakehouse-doc/references/result_cache.md +102 -0
- package/bin/skills/lakehouse-doc/references/revoke-from-share.md +48 -0
- package/bin/skills/lakehouse-doc/references/revoke-user-privileges.md +91 -0
- package/bin/skills/lakehouse-doc/references/right.md +84 -0
- package/bin/skills/lakehouse-doc/references/rlike.md +78 -0
- package/bin/skills/lakehouse-doc/references/rn_2024_11_12.md +63 -0
- package/bin/skills/lakehouse-doc/references/role-privlilige-manage.md +1 -0
- package/bin/skills/lakehouse-doc/references/roles.md +123 -0
- package/bin/skills/lakehouse-doc/references/rom_lakehouse_to_volume.md +1 -0
- package/bin/skills/lakehouse-doc/references/s3_volume_creation.md +37 -0
- package/bin/skills/lakehouse-doc/references/sample-data-using.md +768 -0
- package/bin/skills/lakehouse-doc/references/security_compliance_audit_guide.md +1 -0
- package/bin/skills/lakehouse-doc/references/security_overview.md +41 -0
- package/bin/skills/lakehouse-doc/references/select-catalog-table.md +26 -0
- package/bin/skills/lakehouse-doc/references/semantic_view.md +711 -0
- package/bin/skills/lakehouse-doc/references/setup.md +33 -0
- package/bin/skills/lakehouse-doc/references/share-ddl.md +1 -0
- package/bin/skills/lakehouse-doc/references/show-cached-status.md +34 -0
- package/bin/skills/lakehouse-doc/references/show-catalog-schema.md +48 -0
- package/bin/skills/lakehouse-doc/references/show-catalog-table.md +22 -0
- package/bin/skills/lakehouse-doc/references/show-catalog.md +26 -0
- package/bin/skills/lakehouse-doc/references/show-columns.md +75 -0
- package/bin/skills/lakehouse-doc/references/show-create-dynamic-table.md +44 -0
- package/bin/skills/lakehouse-doc/references/show-create-external-table.md +45 -0
- package/bin/skills/lakehouse-doc/references/show-create-materialized-view.md +43 -0
- package/bin/skills/lakehouse-doc/references/show-create-table.md +107 -0
- package/bin/skills/lakehouse-doc/references/show-dynamic-table.md +68 -0
- package/bin/skills/lakehouse-doc/references/show-external-schemas.md +35 -0
- package/bin/skills/lakehouse-doc/references/show-external-table.md +38 -0
- package/bin/skills/lakehouse-doc/references/show-finctions.md +25 -0
- package/bin/skills/lakehouse-doc/references/show-functions.md +44 -0
- package/bin/skills/lakehouse-doc/references/show-grants-user.md +41 -0
- package/bin/skills/lakehouse-doc/references/show-jobs.md +48 -0
- package/bin/skills/lakehouse-doc/references/show-materialized-view.md +42 -0
- package/bin/skills/lakehouse-doc/references/show-schemas.md +41 -0
- package/bin/skills/lakehouse-doc/references/show-shares.md +94 -0
- package/bin/skills/lakehouse-doc/references/show-synonyms.md +29 -0
- package/bin/skills/lakehouse-doc/references/show-table-streams.md +53 -0
- package/bin/skills/lakehouse-doc/references/show-tables-history.md +91 -0
- package/bin/skills/lakehouse-doc/references/show-tables.md +50 -0
- package/bin/skills/lakehouse-doc/references/show-users.md +46 -0
- package/bin/skills/lakehouse-doc/references/show-vclusters.md +59 -0
- package/bin/skills/lakehouse-doc/references/show-views.md +31 -0
- package/bin/skills/lakehouse-doc/references/show-volume.md +57 -0
- package/bin/skills/lakehouse-doc/references/show.md +168 -0
- package/bin/skills/lakehouse-doc/references/simpletosimple_bazhuayu_datagpt.md +152 -0
- package/bin/skills/lakehouse-doc/references/small_file_optimization.md +101 -0
- package/bin/skills/lakehouse-doc/references/spark-connector-summary.md +329 -0
- package/bin/skills/lakehouse-doc/references/spark-connector-use.md +233 -0
- package/bin/skills/lakehouse-doc/references/sql-parmaters.md +594 -0
- package/bin/skills/lakehouse-doc/references/sql-qualify.md +236 -0
- package/bin/skills/lakehouse-doc/references/sql-reference.md +1 -0
- package/bin/skills/lakehouse-doc/references/sql_data_transfom_NestedDataTypes.md +451 -0
- package/bin/skills/lakehouse-doc/references/sql_data_transform.md +1 -0
- package/bin/skills/lakehouse-doc/references/sql_data_transform_basic.md +576 -0
- package/bin/skills/lakehouse-doc/references/sql_data_transform_cte.md +177 -0
- package/bin/skills/lakehouse-doc/references/sql_data_transform_tips.md +407 -0
- package/bin/skills/lakehouse-doc/references/sql_data_transform_windows.md +430 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/any_value.md +64 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/approx_count_distinct.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/approx_histogram.md +56 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/approx_percentile.md +65 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/approx_top_k.md +62 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/avg.md +68 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/bit_and.md +81 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/bit_or.md +78 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/bit_xor.md +79 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/bool_and.md +85 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/bool_or.md +86 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/collect_list.md +95 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/collect_list_on_array.md +64 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/collect_set.md +85 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/collect_set_on_array.md +80 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/corr.md +103 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/count.md +96 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/count_distinct.md +96 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/count_if.md +62 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/covar_pop.md +89 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/covar_samp.md +113 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/first_value.md +122 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_and.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_and_state.md +47 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_merge.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_merge_state.md +29 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_or.md +44 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_or_state.md +64 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_state.md +27 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_xor.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_bitmap_xor_state.md +99 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/group_concat.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/last_value.md +100 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/map_agg.md +79 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/max.md +102 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/max_by.md +67 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/median.md +60 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/min.md +89 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/min_by.md +99 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/percentile.md +69 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/percentile_approx.md +33 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/percentile_rank.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/std.md +47 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/stddev.md +68 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/stddev_pop.md +67 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/stddev_samp.md +67 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/sum.md +67 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/var_pop.md +65 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/var_samp.md +65 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/variance.md +47 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/aggregate_functions/wm_concat.md +79 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/context_functions/current_instance_id.md +16 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/context_functions/current_schema.md +29 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/context_functions/current_session_id.md +30 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/context_functions/current_user.md +29 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/context_functions/current_user_id.md +21 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/context_functions/current_vcluster.md +40 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/context_functions/current_workspace.md +19 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/context_functions/current_workspace_id.md +24 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/binary_to_bitmap.md +29 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_and.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_and_cardinality.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_andnot.md +61 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_andnot_cardinality.md +60 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_build.md +58 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_cardinality.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_contains.md +50 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_count.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_empty.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_has_all.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_has_any.md +65 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_hash.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_max.md +26 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_min.md +33 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_or.md +61 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_or_cardinality.md +27 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_remove.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_subset_in_range.md +60 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_subset_limit.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_to_array.md +24 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_to_binary.md +28 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_to_string.md +41 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_transform.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_xor.md +59 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/bitmap_xor_cardinality.md +30 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/string_to_bitmap.md +56 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/sub_bitmap.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitmap_functions/to_bitmap.md +78 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitwise_functions/bit_count.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/bitwise_functions/bitnot.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/assert_true.md +44 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/between.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/case_when.md +99 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/coalesce.md +51 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/decode.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/if.md +56 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/in.md +33 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/is_false.md +63 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/is_not_null.md +40 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/is_null.md +44 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/is_true.md +63 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/multiif.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/nvl.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/conditional_functions/raise_error.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/add_days.md +65 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/add_months.md +38 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/add_years.md +61 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/adddate.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/convert_timezone.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/current_date.md +56 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/current_timestamp.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/date.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/date_add.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/date_format.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/date_format_mysql.md +58 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/date_format_pg.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/date_sub.md +58 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/date_trunc.md +61 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/dateadd.md +60 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/datediff.md +50 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/datetime_patterns.md +61 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/day.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/dayofmonth.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/dayofweek.md +31 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/dayofweek_iso.md +56 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/dayofyear.md +40 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/days.md +21 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/extract.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/from_unixtime.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/from_utc_timestamp.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/hour.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/hours.md +21 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/last_day.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/localtimestamp.md +19 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/makde_date.md +20 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/make_date.md +20 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/make_dt_interval.md +66 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/make_ym_interval.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/minute.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/month.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/months.md +21 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/months_between.md +18 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/next_day.md +16 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/now.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/quarter.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/second.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/str_to_date_mysql.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/sub_days.md +66 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/timestamp_micros.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/timestamp_millis.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/timestamp_seconds.md +62 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/timestampadd.md +59 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/timestampdiff.md +78 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/to_date.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/to_start_of_interval.md +25 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/to_timestamp.md +66 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/to_timestamp_ntz.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/to_unix_timestamp.md +59 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/to_unix_timestamp_ms.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/to_unix_timestamp_us.md +40 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/to_utc_timestamp.md +62 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/toyyyymmdd.md +69 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/trunc.md +38 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/unix_timestamp.md +96 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/week.md +31 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/weekday.md +51 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/weekofyear.md +29 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/year.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/yearofweek.md +64 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/datetime_functions/years.md +21 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/geo_functions/st_geohash.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/geo_functions/st_latfromgeohash.md +59 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/geo_functions/st_longfromgeohash.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/hash_functions/bucket.md +33 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/hash_functions/general_hash.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/hash_functions/hash_combine.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/hash_functions/hash_combine_commutative.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/hash_functions/murmurhash.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/array_sort_by_key.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/element_at.md +59 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/exists.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/filter.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/forall.md +68 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/high_order_functions.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/map_filter.md +25 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/map_zip_with.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/transform.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/transform_keys.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/transform_values.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/high_order_functions/zip_with.md +38 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/ip_functions/get_ip_info.md +199 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/ip_functions/ipv4_num_to_string.md +29 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/ip_functions/ipv4_string_to_num.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/ip_functions/ipv6_num_to_string.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/ip_functions/ipv6_string_to_num.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/ip_functions/is_ip_address_in_range.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/from_json.md +86 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/get_json_object.md +83 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_array.md +30 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_contains.md +86 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_extract.md +64 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_minify.md +28 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_normalize.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_object.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_parse.md +59 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_remove.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_type.md +44 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/json_valid.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/schema_of_json.md +79 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/json_functions/to_json.md +68 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/abs.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/acos.md +30 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/acosh.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/asin.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/asinh.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/atan.md +27 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/atan2.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/atanh.md +24 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/bround.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/cbrt.md +56 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/ceil.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/ceilling.md +47 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/cos.md +41 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/cosh.md +41 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/cot.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/csc.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/degrees.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/div.md +28 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/e.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/exp.md +32 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/exp2.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/expm1.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/floor.md +41 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/greatest.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/hypot.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/isnan.md +27 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/least.md +38 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/ln.md +57 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/log.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/log10.md +47 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/log1p.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/log2.md +28 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/median.md +25 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/mod.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/monotonically_increasing_id.md +81 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/negative.md +15 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/operators.md +427 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/pi.md +41 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/pmod.md +31 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/positive.md +15 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/pow.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/radians.md +30 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/rand.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/randn.md +26 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/random.md +67 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/round.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/shiftleft.md +67 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/shiftright.md +70 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/shiftrightunsigned.md +70 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/sign.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/sin.md +67 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/sinh.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/sqrt.md +40 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/tan.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/math_functions/tanh.md +38 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_append.md +26 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_compact.md +29 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_contains.md +38 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_distinct.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_except.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_intersect.md +39 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_join.md +51 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_max.md +57 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_min.md +60 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_position.md +50 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_prepend.md +26 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_remove.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_repeat.md +51 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_size.md +32 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_sort.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_sort_reverse.md +31 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/array_union.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/arrays_overlap.md +41 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/arrays_zip.md +50 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/cardinality.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/concat.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/concat_ws.md +73 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/element_at.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/flatten.md +62 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map.md +44 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_concat.md +38 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_contains_key.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_entries.md +44 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_equal.md +33 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_except.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_from_arrays.md +28 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_from_entries.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_keys.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/map_values.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/multimap_from_entries.md +47 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/named_struct.md +55 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/sequence.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/size.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/slice.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/sort_array.md +22 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/struct.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/struct_insert.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/struct_update.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/nested_functions/trans_array.md +147 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/partition/max_pt.md +41 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/search_functions/match_all.md +56 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/search_functions/match_any.md +50 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/search_functions/match_phrase.md +65 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/search_functions/match_phrase_prefix.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/search_functions/match_regexp.md +30 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/search_functions/tokenize.md +57 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/aes_decrypt.md +40 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/aes_decrypt_mysql.md +30 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/aes_encrypt.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/aes_encrypt_mysql.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/ascii.md +36 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/base64.md +27 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/binary.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/btrim.md +38 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/char.md +55 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/char_length.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/character_length.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/chr.md +26 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/collation_sort_key.md +58 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/concat.md +75 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/concat_ws.md +72 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/contains.md +31 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/conv.md +50 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/endswith.md +70 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/find_in_set.md +19 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/format_string.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/hex.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/instr.md +41 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/is_ascii.md +27 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/is_utf8.md +27 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/lcase.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/left.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/length.md +33 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/lengthb.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/like.md +63 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/locate.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/lower.md +61 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/lpad.md +45 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/ltrim.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/mask.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/md5.md +71 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/octet_length.md +47 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/parse_url.md +78 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/position.md +35 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/regexp_count.md +115 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/regexp_extract.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/regexp_extract_all.md +31 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/regexp_instr.md +152 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/regexp_replace.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/repeat.md +44 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/replace.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/reverse.md +70 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/right.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/rlike.md +64 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/rpad.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/rtrim.md +37 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/sha1.md +67 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/space.md +34 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/split.md +61 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/split_part.md +42 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/startswith.md +70 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/str_to_map.md +60 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/strpos.md +60 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/substr.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/substring.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/substring_index.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/translate.md +43 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/trim.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/typeof.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/ucase.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/unbase64.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/unhex.md +46 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/upper.md +51 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/url_decode.md +32 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/url_encode.md +33 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/string_functions/uuid.md +33 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/binary_quantize.md +51 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/cosine_distance.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/dot_product.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/hamming_distance.md +57 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/jaccard_distance.md +55 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/l2_distance.md +53 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/l2_norm.md +52 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/l2_normalize.md +54 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/print_vector_bits.md +56 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/vector.md +70 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/scalar_functions/vector_functions/vector_add_scalar.md +57 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/explode.md +71 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/inline.md +48 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/json_tuple.md +44 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/load_history.md +17 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/posexplode.md +89 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/read_kafka.md +85 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/stack.md +69 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/table_changes.md +136 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/table_functions/unnset.md +106 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/avg.md +91 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/count.md +88 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/cume_dist.md +104 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/dense_rank.md +119 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/first.md +159 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/first_value.md +133 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/lag.md +72 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/last.md +166 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/last_value.md +160 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/lead.md +49 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/max.md +167 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/min.md +138 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/nth_value.md +58 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/ntile.md +57 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/percent_rank.md +58 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/rank.md +139 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/row_number.md +141 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/sum.md +105 -0
- package/bin/skills/lakehouse-doc/references/sql_functions/window_functions/window_clause.md +91 -0
- package/bin/skills/lakehouse-doc/references/sql_practice.md +1 -0
- package/bin/skills/lakehouse-doc/references/sql_rfm.md +181 -0
- package/bin/skills/lakehouse-doc/references/sqlalchemy.md +82 -0
- package/bin/skills/lakehouse-doc/references/ssb-benchmark.md +224 -0
- package/bin/skills/lakehouse-doc/references/sso-configuration.md +157 -0
- package/bin/skills/lakehouse-doc/references/storage_encryption.md +81 -0
- package/bin/skills/lakehouse-doc/references/streaming_data_pipeline_overview.md +20 -0
- package/bin/skills/lakehouse-doc/references/streaming_data_pipeline_overview1.md +1 -0
- package/bin/skills/lakehouse-doc/references/streaming_pipeline_with_dynamic_table.md +66 -0
- package/bin/skills/lakehouse-doc/references/structure_data_analysis.md +229 -0
- package/bin/skills/lakehouse-doc/references/studio_manual.md +1 -0
- package/bin/skills/lakehouse-doc/references/studio_overview.md +71 -0
- package/bin/skills/lakehouse-doc/references/synonym.md +139 -0
- package/bin/skills/lakehouse-doc/references/table-stream-title.md +1 -0
- package/bin/skills/lakehouse-doc/references/table-stream.md +1 -0
- package/bin/skills/lakehouse-doc/references/table-summary.md +1 -0
- package/bin/skills/lakehouse-doc/references/table_funciotn.md +1 -0
- package/bin/skills/lakehouse-doc/references/table_stream.md +118 -0
- package/bin/skills/lakehouse-doc/references/tablesample.md +474 -0
- package/bin/skills/lakehouse-doc/references/tablestream_summary.md +515 -0
- package/bin/skills/lakehouse-doc/references/task-instance-maintenance.md +207 -0
- package/bin/skills/lakehouse-doc/references/task_development.md +56 -0
- package/bin/skills/lakehouse-doc/references/task_group.md +151 -0
- package/bin/skills/lakehouse-doc/references/task_param.md +978 -0
- package/bin/skills/lakehouse-doc/references/task_scheduling.md +1 -0
- package/bin/skills/lakehouse-doc/references/task_scheduling_dependency.md +74 -0
- package/bin/skills/lakehouse-doc/references/taskdevelop.md +268 -0
- package/bin/skills/lakehouse-doc/references/tencentcloud_arn_and_externalid.md +29 -0
- package/bin/skills/lakehouse-doc/references/time-function.md +67 -0
- package/bin/skills/lakehouse-doc/references/timetravel-summary.md +47 -0
- package/bin/skills/lakehouse-doc/references/tools_AI.md +1 -0
- package/bin/skills/lakehouse-doc/references/tools_BI.md +1 -0
- package/bin/skills/lakehouse-doc/references/tpcds-benchmark.md +754 -0
- package/bin/skills/lakehouse-doc/references/tpch-benchmark.md +887 -0
- package/bin/skills/lakehouse-doc/references/transformt-dt.md +291 -0
- package/bin/skills/lakehouse-doc/references/trial-account-quotas-and-limits.md +81 -0
- package/bin/skills/lakehouse-doc/references/tutorial_DataGPT.md +1 -0
- package/bin/skills/lakehouse-doc/references/tutorial_connect_to_lakehouse.md +1 -0
- package/bin/skills/lakehouse-doc/references/tutorial_data_transformation.md +1 -0
- package/bin/skills/lakehouse-doc/references/tutorial_migration.md +1 -0
- package/bin/skills/lakehouse-doc/references/tutorial_virtual_cluster.md +1 -0
- package/bin/skills/lakehouse-doc/references/tutorial_work_with_workspace.md +1 -0
- package/bin/skills/lakehouse-doc/references/tutorial_zettapark.md +1 -0
- package/bin/skills/lakehouse-doc/references/tutorials-streaming-data-pipeline-with_dynamic-table.md +124 -0
- package/bin/skills/lakehouse-doc/references/undrop-dynamic-table.md +79 -0
- package/bin/skills/lakehouse-doc/references/undrop-materialized-view.md +80 -0
- package/bin/skills/lakehouse-doc/references/unifiedWorkflow.md +1 -0
- package/bin/skills/lakehouse-doc/references/unloa-data-summary.md +17 -0
- package/bin/skills/lakehouse-doc/references/unload-data-local.md +72 -0
- package/bin/skills/lakehouse-doc/references/unstructure_data_analysis.md +1 -0
- package/bin/skills/lakehouse-doc/references/unstructured_etl_pipeline_notebook.md +12 -0
- package/bin/skills/lakehouse-doc/references/unstructured_etl_pipeline_user_guide.md +949 -0
- package/bin/skills/lakehouse-doc/references/unstructured_etl_python_api.md +896 -0
- package/bin/skills/lakehouse-doc/references/upload-data.md +1 -0
- package/bin/skills/lakehouse-doc/references/upload_data.md +123 -0
- package/bin/skills/lakehouse-doc/references/use-dbt-dev.md +441 -0
- package/bin/skills/lakehouse-doc/references/use-external-schema.md +42 -0
- package/bin/skills/lakehouse-doc/references/use-java-sdk-releatime-uploaddata.md +168 -0
- package/bin/skills/lakehouse-doc/references/use-java-sdk-upload-dta-local.md +140 -0
- package/bin/skills/lakehouse-doc/references/use-mysql-client.md +189 -0
- package/bin/skills/lakehouse-doc/references/use-python-sdk-upload-data.md +99 -0
- package/bin/skills/lakehouse-doc/references/use-schema.md +49 -0
- package/bin/skills/lakehouse-doc/references/use-vcluster.md +38 -0
- package/bin/skills/lakehouse-doc/references/user-aggrement.md +229 -0
- package/bin/skills/lakehouse-doc/references/user-external-funciton.md +1 -0
- package/bin/skills/lakehouse-doc/references/user-identification.md +58 -0
- package/bin/skills/lakehouse-doc/references/user_permission_grand_guide.md +322 -0
- package/bin/skills/lakehouse-doc/references/using-google-authenticator.md +48 -0
- package/bin/skills/lakehouse-doc/references/using-udf-in-dynamic-table.md +162 -0
- package/bin/skills/lakehouse-doc/references/using_mcp_solute_data_pipeline_issue.md +343 -0
- package/bin/skills/lakehouse-doc/references/uuid.md +47 -0
- package/bin/skills/lakehouse-doc/references/validate_schema_evolution.md +167 -0
- package/bin/skills/lakehouse-doc/references/vc-job.md +1 -0
- package/bin/skills/lakehouse-doc/references/vc_cache.md +71 -0
- package/bin/skills/lakehouse-doc/references/vcluster_size_description.md +98 -0
- package/bin/skills/lakehouse-doc/references/vector-search.md +144 -0
- package/bin/skills/lakehouse-doc/references/vector-type.md +52 -0
- package/bin/skills/lakehouse-doc/references/vector_data_process_guide.md +952 -0
- package/bin/skills/lakehouse-doc/references/vector_search_ai.md +423 -0
- package/bin/skills/lakehouse-doc/references/version-update.md +21 -0
- package/bin/skills/lakehouse-doc/references/virtual-cluster.md +221 -0
- package/bin/skills/lakehouse-doc/references/volume_best_practices.md +1141 -0
- package/bin/skills/lakehouse-doc/references/web-job-history.md +163 -0
- package/bin/skills/lakehouse-doc/references/what_is_clickzetta_lakehouse.md +92 -0
- package/bin/skills/lakehouse-doc/references/window-function-summary.md +134 -0
- package/bin/skills/lakehouse-doc/references/windowframe.md +139 -0
- package/bin/skills/lakehouse-doc/references/working_with_Vclusters.md +171 -0
- package/bin/skills/lakehouse-doc/references/working_with_cache.md +102 -0
- package/bin/skills/lakehouse-doc/references/worksapce-informaiton_schema-views.md +207 -0
- package/bin/skills/lakehouse-doc/references/worksheet.md +15 -0
- package/bin/skills/lakehouse-doc/references/workspace-introduction.md +41 -0
- package/bin/skills/lakehouse-doc/references/worskapce-infroamtionschema-summary.md +56 -0
- package/package.json +13 -0
|
@@ -0,0 +1,1627 @@
|
|
|
1
|
+
# Lakehouse分区使用指南
|
|
2
|
+
|
|
3
|
+
## 文档目标
|
|
4
|
+
|
|
5
|
+
**如果您正在从其他数据平台迁移到云器 Lakehouse,这份文档将帮助您充分发挥 Lakehouse 分区的先进优势。**
|
|
6
|
+
|
|
7
|
+
云器 Lakehouse 的分区基于 Apache Iceberg 的隐藏分区理念,相比传统分区有显著优势:无需手动指定分区条件、自动分区裁剪、更灵活的分区演化。本文档面向来自各大数据平台的有经验数据工程师,涵盖Hive、Spark、MaxCompute、Snowflake、Databricks等主流平台的迁移场景,重点关注**如何成功迁移**和**发挥最大价值**。
|
|
8
|
+
|
|
9
|
+
### 🎯 **您将获得什么**
|
|
10
|
+
|
|
11
|
+
* **先进理念理解**:掌握隐藏分区的创新价值和使用方法
|
|
12
|
+
* **迁移最佳实践**:经过验证的成功迁移策略和实施步骤
|
|
13
|
+
* **性能优化指导**:充分发挥 Lakehouse 分区性能优势的实用技巧
|
|
14
|
+
* **架构升级方案**:将复杂分区架构优化为更高效的 Lakehouse 方案
|
|
15
|
+
* **实战验证方法**:确保分区表正确创建和性能达标的验证步骤
|
|
16
|
+
|
|
17
|
+
### 💡 **Lakehouse分区的核心优势**
|
|
18
|
+
|
|
19
|
+
* **智能化分区裁剪**:查询时无需手动指定分区条件,系统自动优化
|
|
20
|
+
* **简化的分区管理**:告别复杂的分区维护,专注业务逻辑
|
|
21
|
+
* **更好的性能可预测性**:避免过度分区,确保稳定的查询性能
|
|
22
|
+
* **现代化架构设计**:基于Apache Iceberg的先进分区理念
|
|
23
|
+
|
|
24
|
+
***
|
|
25
|
+
|
|
26
|
+
## 核心差异和优势理解
|
|
27
|
+
|
|
28
|
+
### 🚀 **Lakehouse分区的5大创新特性**
|
|
29
|
+
|
|
30
|
+
#### 1. **智能转换分区函数**
|
|
31
|
+
|
|
32
|
+
**创新价值**:避免与标准SQL函数冲突,提供更精确的时间分区控制
|
|
33
|
+
|
|
34
|
+
```sql
|
|
35
|
+
-- ✅ Lakehouse的优雅设计
|
|
36
|
+
CREATE TABLE events PARTITIONED BY (days(event_date)); -- 精确的天级分区
|
|
37
|
+
|
|
38
|
+
-- 💡 为什么使用复数形式?
|
|
39
|
+
-- 避免与SQL标准函数year(), month()等冲突
|
|
40
|
+
-- 提供更清晰的语义:years = 年数,而不是提取年份
|
|
41
|
+
-- 返回值是计算后的数值:days('2024-06-01') = 19875
|
|
42
|
+
-- years('2024-06-01') = 54(从1970年开始计算的年份偏移)
|
|
43
|
+
```
|
|
44
|
+
|
|
45
|
+
**迁移时的理解要点**:
|
|
46
|
+
|
|
47
|
+
* `years/months/days/hours` 是复数形式,语义更准确
|
|
48
|
+
* 转换分区避免逻辑冲突,确保分区策略的一致性
|
|
49
|
+
* 返回值是计算后的数值,系统自动处理转换逻辑
|
|
50
|
+
* 时间函数计算基准:years 从 1970 年开始,days 从 1970-01-01 开始
|
|
51
|
+
|
|
52
|
+
#### 2. **隐藏分区的自动优化**
|
|
53
|
+
|
|
54
|
+
**创新价值**:用户无需关心分区实现细节,专注业务逻辑
|
|
55
|
+
|
|
56
|
+
```sql
|
|
57
|
+
-- ✅ Lakehouse中的简洁查询
|
|
58
|
+
SELECT * FROM sales WHERE order_date = '2024-06-01';
|
|
59
|
+
-- 系统自动转换为分区条件,无需手动指定分区
|
|
60
|
+
|
|
61
|
+
-- 🎯 对比传统方式的优势
|
|
62
|
+
-- 1. 查询更简洁,无需复杂的分区条件
|
|
63
|
+
-- 2. 分区策略变更不影响查询逻辑
|
|
64
|
+
-- 3. 系统自动选择最优的分区扫描策略
|
|
65
|
+
```
|
|
66
|
+
|
|
67
|
+
#### 3. **智能分区数量控制**
|
|
68
|
+
|
|
69
|
+
**设计理念**:分区数量限制是性能保护机制,鼓励更好的分区设计
|
|
70
|
+
|
|
71
|
+
```sql
|
|
72
|
+
-- 💡 Lakehouse的分区哲学:质量优于数量
|
|
73
|
+
-- 传统思维:尽可能细分分区
|
|
74
|
+
-- Lakehouse理念:合理分区粒度 + 索引优化
|
|
75
|
+
|
|
76
|
+
-- ✅ 推荐的高效设计
|
|
77
|
+
CREATE TABLE user_events (
|
|
78
|
+
event_id INT,
|
|
79
|
+
user_id INT,
|
|
80
|
+
event_data JSON,
|
|
81
|
+
event_time TIMESTAMP_LTZ
|
|
82
|
+
) PARTITIONED BY (days(event_time)); -- 时间分区(主要)
|
|
83
|
+
|
|
84
|
+
CREATE BLOOMFILTER INDEX idx_user ON TABLE user_events(user_id); -- 索引优化(辅助)
|
|
85
|
+
|
|
86
|
+
-- 🎯 优势:避免小文件问题,确保每个分区有足够数据量
|
|
87
|
+
-- 📊 限制说明:建议单次操作控制在合理范围内,避免过多小分区
|
|
88
|
+
```
|
|
89
|
+
|
|
90
|
+
#### 4. **转换分区的逻辑一致性**
|
|
91
|
+
|
|
92
|
+
**设计原则**:避免冲突的分区维度,确保分区逻辑清晰
|
|
93
|
+
|
|
94
|
+
< '2025-01-01' -- 年级过滤
|
|
95
|
+
```
|
|
96
|
+
|
|
97
|
+
#### 5. **类型安全的分区设计**
|
|
98
|
+
|
|
99
|
+
**安全保障**:通过类型检查避免数据写入错误
|
|
100
|
+
|
|
101
|
+
```sql
|
|
102
|
+
-- 💡 Lakehouse的类型安全机制
|
|
103
|
+
CREATE TABLE orders (
|
|
104
|
+
id INT,
|
|
105
|
+
amount DOUBLE
|
|
106
|
+
) PARTITIONED BY (order_date STRING); -- 明确的STRING类型
|
|
107
|
+
|
|
108
|
+
-- ⚠️ 类型不匹配会报错
|
|
109
|
+
INSERT INTO orders VALUES (1, 100.0, '2024-06-01'); -- 字符串写入STRING分区:正确
|
|
110
|
+
INSERT INTO orders VALUES (1, 100.0, DATE('2024-06-01')); -- DATE写入STRING分区:错误
|
|
111
|
+
|
|
112
|
+
-- ✅ 推荐做法:使用转换分区避免类型问题
|
|
113
|
+
CREATE TABLE orders_safe (
|
|
114
|
+
id INT,
|
|
115
|
+
amount DOUBLE,
|
|
116
|
+
order_timestamp TIMESTAMP_LTZ
|
|
117
|
+
) PARTITIONED BY (days(order_timestamp)); -- 让系统处理类型转换
|
|
118
|
+
```
|
|
119
|
+
|
|
120
|
+
### 🔄 **认知升级:从复杂到简洁**
|
|
121
|
+
|
|
122
|
+
#### **传统分区 vs Lakehouse分区**
|
|
123
|
+
|
|
124
|
+
| 维度 | 传统分区思维 | Lakehouse分区理念 | 优势 |
|
|
125
|
+
| ---------- | ---------- | ------------- | ----------- |
|
|
126
|
+
| **分区策略** | 越细越好,多维度分区 | 合理粒度,重点维度 | 避免小文件,性能稳定 |
|
|
127
|
+
| **查询方式** | 必须指定分区条件 | 自动分区裁剪 | 查询更简洁,维护更容易 |
|
|
128
|
+
| **类型处理** | 手动处理类型转换 | 系统自动类型安全 | 减少错误,提高可靠性 |
|
|
129
|
+
| **维护成本** | 复杂的分区管理 | 简化的分区维护 | 降低运维成本 |
|
|
130
|
+
| **性能可预测性** | 依赖分区设计经验 | 系统保障性能 | 更稳定的查询表现 |
|
|
131
|
+
|
|
132
|
+
***
|
|
133
|
+
|
|
134
|
+
## 🔍 **分区表创建验证(最佳实践**)
|
|
135
|
+
|
|
136
|
+
### ⚡ **创建后验证:确保分区表正确生效**
|
|
137
|
+
|
|
138
|
+
无论使用什么方式创建分区表(SQL、工具、脚本),都建议创建后立即验证。这是确保分区功能正常的最佳实践。
|
|
139
|
+
|
|
140
|
+
#### 推荐的创建和验证流程
|
|
141
|
+
|
|
142
|
+
```sql
|
|
143
|
+
-- 1. 创建分区表(推荐使用原生SQL以确保语法准确)
|
|
144
|
+
CREATE TABLE orders_partitioned (
|
|
145
|
+
id INT,
|
|
146
|
+
amount DOUBLE,
|
|
147
|
+
order_date DATE
|
|
148
|
+
) PARTITIONED BY (days(order_date));
|
|
149
|
+
|
|
150
|
+
-- 2. 立即验证分区表是否正确创建(必做步骤)
|
|
151
|
+
SHOW PARTITIONS orders_partitioned;
|
|
152
|
+
-- ✅ 正确:显示分区列表或空列表
|
|
153
|
+
-- ❌ 异常:报错 "not a partitioned table"
|
|
154
|
+
```
|
|
155
|
+
|
|
156
|
+
#### 完整验证清单(每次创建后执行)
|
|
157
|
+
|
|
158
|
+
```sql
|
|
159
|
+
-- 🔍 验证步骤1:确认是分区表
|
|
160
|
+
SHOW PARTITIONS orders_partitioned;
|
|
161
|
+
|
|
162
|
+
-- 🔍 验证步骤2:测试数据插入和分区创建
|
|
163
|
+
INSERT INTO orders_partitioned VALUES (1, 100.50, DATE('2024-06-01'));
|
|
164
|
+
SHOW PARTITIONS orders_partitioned;
|
|
165
|
+
-- ✅ 正确:显示类似 "days(order_date)=19875" 的分区
|
|
166
|
+
|
|
167
|
+
-- 🔍 验证步骤3:验证分区裁剪生效
|
|
168
|
+
SELECT * FROM orders_partitioned WHERE order_date = '2024-06-01';
|
|
169
|
+
-- 应该能正常返回数据,且性能良好
|
|
170
|
+
|
|
171
|
+
-- 🔍 验证步骤4:检查表结构
|
|
172
|
+
DESCRIBE TABLE orders_partitioned;
|
|
173
|
+
-- 确认列结构正确,分区字段类型匹配
|
|
174
|
+
|
|
175
|
+
-- 🔍 验证步骤5:测试最大分区获取(兼容性验证)
|
|
176
|
+
SELECT max_pt('orders_partitioned');
|
|
177
|
+
-- 应该返回最大的分区值,用于兼容原平台的类似功能
|
|
178
|
+
```
|
|
179
|
+
|
|
180
|
+
#### 验证失败时的解决方案
|
|
181
|
+
|
|
182
|
+
| 验证失败现象 | 可能原因 | 解决方案 |
|
|
183
|
+
| --------------------------- | ------------- | ---------- |
|
|
184
|
+
| `not a partitioned table` | 建表语法错误或工具创建异常 | 用原生SQL重新创建 |
|
|
185
|
+
| `implicit cast not allowed` | 分区字段类型不匹配 | 检查插入数据类型 |
|
|
186
|
+
| 无分区显示 | 分区函数语法错误 | 检查是否用了复数形式 |
|
|
187
|
+
| 性能无提升 | 查询未利用分区字段 | 优化WHERE条件 |
|
|
188
|
+
|
|
189
|
+
**为什么需要验证**?
|
|
190
|
+
|
|
191
|
+
* 确保分区定义语法正确,避免性能问题
|
|
192
|
+
* 及早发现配置错误,减少后续排查成本
|
|
193
|
+
* 验证分区策略是否符合查询模式
|
|
194
|
+
|
|
195
|
+
***
|
|
196
|
+
|
|
197
|
+
## 📊 **SHOW PARTITIONS 完整功能指南**
|
|
198
|
+
|
|
199
|
+
### **基础语法和高级用法**
|
|
200
|
+
|
|
201
|
+
```sql
|
|
202
|
+
-- 完整语法
|
|
203
|
+
SHOW PARTITIONS [EXTENDED] table_name
|
|
204
|
+
[ PARTITION ( partition_col_name = partition_col_val [, ...] ) ]
|
|
205
|
+
[WHERE <expr>
|
|
206
|
+
|
|
207
|
+
### **基础用法**
|
|
208
|
+
|
|
209
|
+
```sql
|
|
210
|
+
-- 查看所有分区
|
|
211
|
+
SHOW PARTITIONS sales_table;
|
|
212
|
+
|
|
213
|
+
-- 查看分区详细信息
|
|
214
|
+
SHOW PARTITIONS EXTENDED sales_table;
|
|
215
|
+
|
|
216
|
+
-- 查看特定分区
|
|
217
|
+
SHOW PARTITIONS sales_table PARTITION (pt1 = '2023');
|
|
218
|
+
|
|
219
|
+
-- 多级分区过滤
|
|
220
|
+
SHOW PARTITIONS sales_table PARTITION (pt1 = '2023', pt2 = '01');
|
|
221
|
+
|
|
222
|
+
-- 限制返回数量
|
|
223
|
+
SHOW PARTITIONS EXTENDED sales_table LIMIT 10;
|
|
224
|
+
```
|
|
225
|
+
|
|
226
|
+
### **高级用法:分区健康检查神器**
|
|
227
|
+
|
|
228
|
+
⚠️ **重要限制**:`SHOW PARTITIONS` 不支持 `ORDER BY` 子句,如需排序请使用 WITH 子查询。
|
|
229
|
+
|
|
230
|
+
< 100*1024*1024;
|
|
231
|
+
|
|
232
|
+
-- 🔍 分区信息查看(支持LIMIT)
|
|
233
|
+
SHOW PARTITIONS EXTENDED table_name LIMIT 10;
|
|
234
|
+
|
|
235
|
+
-- 🔍 分区排序(通过WITH子查询实现)
|
|
236
|
+
WITH partition_info AS (
|
|
237
|
+
SELECT partitions, bytes, total_rows, total_files, created_time
|
|
238
|
+
FROM (SHOW PARTITIONS EXTENDED table_name)
|
|
239
|
+
)
|
|
240
|
+
SELECT * FROM partition_info ORDER BY CAST(bytes AS BIGINT) DESC LIMIT 10;
|
|
241
|
+
```
|
|
242
|
+
|
|
243
|
+
### **MAX_PT 函数 - 获取最新分区**
|
|
244
|
+
|
|
245
|
+
```sql
|
|
246
|
+
-- 💡 MAX_PT函数:获取分区表中最大分区的值
|
|
247
|
+
-- 语法:max_pt('schema_name.table_name' | 'table_name')
|
|
248
|
+
|
|
249
|
+
-- 基本用法:查询最新分区的数据
|
|
250
|
+
SELECT * FROM sales_table WHERE pt = max_pt('sales_table');
|
|
251
|
+
|
|
252
|
+
-- 跨schema使用
|
|
253
|
+
SELECT max_pt('prod_schema.sales_table');
|
|
254
|
+
|
|
255
|
+
-- 实际应用场景:
|
|
256
|
+
-- 1. 增量数据处理:总是处理最新分区
|
|
257
|
+
INSERT INTO target_table
|
|
258
|
+
SELECT * FROM source_table
|
|
259
|
+
WHERE pt = max_pt('source_table');
|
|
260
|
+
|
|
261
|
+
-- 2. 数据质量检查:检查最新分区的数据质量
|
|
262
|
+
SELECT COUNT(*), AVG(amount), MAX(created_time)
|
|
263
|
+
FROM orders
|
|
264
|
+
WHERE pt = max_pt('orders');
|
|
265
|
+
```
|
|
266
|
+
|
|
267
|
+
**MAX_PT 函数的迁移价值**:
|
|
268
|
+
|
|
269
|
+
* **MaxCompute用户**:直接替代原有的max\_pt函数,无需修改查询逻辑
|
|
270
|
+
* **Hive用户**:简化复杂的最大分区查询,提高开发效率
|
|
271
|
+
* **其他平台用户**:提供便捷的最新数据查询方式
|
|
272
|
+
|
|
273
|
+
***
|
|
274
|
+
|
|
275
|
+
## 🏗️ **高级分区表创建指南**
|
|
276
|
+
|
|
277
|
+
### **分区 + 分桶 + 排序组合**
|
|
278
|
+
|
|
279
|
+
```sql
|
|
280
|
+
-- ✅ 方案1:分区 + 分桶(推荐用于散列分布)
|
|
281
|
+
CREATE TABLE events_clustered (
|
|
282
|
+
user_id INT,
|
|
283
|
+
event_type STRING,
|
|
284
|
+
event_data JSON,
|
|
285
|
+
event_time TIMESTAMP_LTZ
|
|
286
|
+
) PARTITIONED BY (days(event_time))
|
|
287
|
+
CLUSTERED BY (user_id) INTO 32 BUCKETS;
|
|
288
|
+
|
|
289
|
+
-- ✅ 方案2:分区 + 排序(推荐用于范围查询)
|
|
290
|
+
CREATE TABLE events_sorted (
|
|
291
|
+
user_id INT,
|
|
292
|
+
event_type STRING,
|
|
293
|
+
event_data JSON,
|
|
294
|
+
event_time TIMESTAMP_LTZ
|
|
295
|
+
) PARTITIONED BY (days(event_time))
|
|
296
|
+
SORTED BY (event_type);
|
|
297
|
+
|
|
298
|
+
CREATE TABLE events_both
|
|
299
|
+
(
|
|
300
|
+
user_id INT,
|
|
301
|
+
event_type STRING,
|
|
302
|
+
event_data JSON,
|
|
303
|
+
event_time TIMESTAMP_LTZ
|
|
304
|
+
)
|
|
305
|
+
PARTITIONED BY (days(event_time))
|
|
306
|
+
CLUSTERED BY (user_id)
|
|
307
|
+
SORTED BY (event_type)
|
|
308
|
+
INTO 32 BUCKETS;
|
|
309
|
+
```
|
|
310
|
+
|
|
311
|
+
### **bucket函数使用指南**
|
|
312
|
+
|
|
313
|
+
**参数范围和建议**:
|
|
314
|
+
|
|
315
|
+
```sql
|
|
316
|
+
-- ✅ 推荐的bucket数量范围
|
|
317
|
+
CREATE TABLE sales PARTITIONED BY (
|
|
318
|
+
days(sale_date),
|
|
319
|
+
bucket(10, user_id) -- 推荐:1-1000的合理范围
|
|
320
|
+
);
|
|
321
|
+
|
|
322
|
+
-- 📊 bucket数量选择指南:
|
|
323
|
+
-- 1-10个桶:适合小数据量表(<100万行)
|
|
324
|
+
-- 10-100个桶:适合中等数据量表(100万-1000万行)
|
|
325
|
+
-- 100-1000个桶:适合大数据量表(>
|
|
326
|
+
|
|
327
|
+
### **复杂数据类型支持**
|
|
328
|
+
|
|
329
|
+
```sql
|
|
330
|
+
-- ✅ Lakehouse支持所有现代数据类型的分区
|
|
331
|
+
CREATE TABLE modern_table (
|
|
332
|
+
user_id INT,
|
|
333
|
+
user_profile STRUCT<name: STRING, age: INT, location: STRING>,
|
|
334
|
+
tags ARRAY<STRING>,
|
|
335
|
+
metadata MAP<STRING, STRING>,
|
|
336
|
+
config JSON,
|
|
337
|
+
created_date DATE,
|
|
338
|
+
created_timestamp TIMESTAMP_LTZ,
|
|
339
|
+
created_timestamp_ntz TIMESTAMP_NTZ
|
|
340
|
+
) PARTITIONED BY (days(created_date));
|
|
341
|
+
|
|
342
|
+
-- 📝 支持的分区字段类型:
|
|
343
|
+
-- 基础类型:INT, BIGINT, STRING, DATE, TIMESTAMP_LTZ, TIMESTAMP_NTZ
|
|
344
|
+
-- 转换分区:years(), months(), days(), hours(), bucket()
|
|
345
|
+
-- 不支持:STRUCT, ARRAY, MAP, JSON作为直接分区字段
|
|
346
|
+
```
|
|
347
|
+
|
|
348
|
+
### **分区值的特殊情况处理**
|
|
349
|
+
|
|
350
|
+
```sql
|
|
351
|
+
-- 💡 NULL值分区处理
|
|
352
|
+
CREATE TABLE user_regions (
|
|
353
|
+
user_id INT,
|
|
354
|
+
region STRING -- 允许NULL值
|
|
355
|
+
) PARTITIONED BY (region);
|
|
356
|
+
|
|
357
|
+
INSERT INTO user_regions VALUES (1, NULL);
|
|
358
|
+
-- 结果:创建 region=NULL 的分区
|
|
359
|
+
|
|
360
|
+
-- 💡 特殊字符支持
|
|
361
|
+
INSERT INTO user_regions VALUES
|
|
362
|
+
(2, 'beijing'), -- 正常字符
|
|
363
|
+
(3, 'shang-hai'), -- 支持破折号
|
|
364
|
+
(4, 'guang_zhou'), -- 支持下划线
|
|
365
|
+
(5, 'xi an'), -- 支持空格
|
|
366
|
+
(6, 'very_long_city_name_with_many_characters'); -- 支持长字符串
|
|
367
|
+
|
|
368
|
+
-- 查看分区结果
|
|
369
|
+
SHOW PARTITIONS user_regions;
|
|
370
|
+
-- 结果显示:region=NULL, region=beijing, region=shang-hai, 等等
|
|
371
|
+
|
|
372
|
+
-- 📝 特殊字符支持总结:
|
|
373
|
+
-- ✅ 支持:字母、数字、下划线、破折号、空格、中文
|
|
374
|
+
-- ✅ 长度:支持很长的分区值(测试过100+字符)
|
|
375
|
+
-- ⚠️ 建议:避免使用特殊符号如@#$%等,虽然可能支持但不推荐
|
|
376
|
+
```
|
|
377
|
+
|
|
378
|
+
***
|
|
379
|
+
|
|
380
|
+
## 复杂分区迁移策略
|
|
381
|
+
|
|
382
|
+
### 🏗️ **多级分区架构迁移挑战**
|
|
383
|
+
|
|
384
|
+
如果您在原平台上已经实现了复杂的分区策略,迁移到Lakehouse时会遇到架构限制和设计选择问题。
|
|
385
|
+
|
|
386
|
+
|
|
387
|
+
|
|
388
|
+
#### 复合维度分区的迁移挑战
|
|
389
|
+
|
|
390
|
+
**原平台的复合分区**:
|
|
391
|
+
|
|
392
|
+
```sql
|
|
393
|
+
-- MaxCompute/Hive中常见的复合分区
|
|
394
|
+
CREATE TABLE sales PARTITIONED BY (
|
|
395
|
+
dt STRING, -- 日期:20240601
|
|
396
|
+
region STRING, -- 地区:beijing, shanghai
|
|
397
|
+
channel STRING -- 渠道:online, offline
|
|
398
|
+
);
|
|
399
|
+
|
|
400
|
+
-- 分区数量 = 日期数 × 地区数 × 渠道数
|
|
401
|
+
-- 例如:365 × 10 × 3 = 10,950个分区/年
|
|
402
|
+
```
|
|
403
|
+
|
|
404
|
+
**❌ 直接迁移的问题**:
|
|
405
|
+
|
|
406
|
+
<= '2024-06-30' -- 分区裁剪
|
|
407
|
+
AND region = 'beijing' -- 索引加速
|
|
408
|
+
AND channel = 'online'; -- 索引加速
|
|
409
|
+
```
|
|
410
|
+
|
|
411
|
+
**策略2:散列分区重设计**
|
|
412
|
+
|
|
413
|
+
```sql
|
|
414
|
+
-- 使用散列函数减少分区数量
|
|
415
|
+
CREATE TABLE sales (
|
|
416
|
+
sale_id INT,
|
|
417
|
+
amount DOUBLE,
|
|
418
|
+
region STRING,
|
|
419
|
+
channel STRING,
|
|
420
|
+
sale_date DATE
|
|
421
|
+
) PARTITIONED BY (
|
|
422
|
+
days(sale_date), -- 时间分区
|
|
423
|
+
bucket(10, region) -- 地区散列到10个桶
|
|
424
|
+
);
|
|
425
|
+
|
|
426
|
+
-- 分区数量 = 365 × 10 = 3,650个分区/年(可控)
|
|
427
|
+
```
|
|
428
|
+
|
|
429
|
+
### 🔄 **分区演化和维护策略迁移**
|
|
430
|
+
|
|
431
|
+
#### 动态分区管理的差异
|
|
432
|
+
|
|
433
|
+
**原平台的分区管理**:
|
|
434
|
+
|
|
435
|
+
```sql
|
|
436
|
+
-- Hive中的分区管理
|
|
437
|
+
-- 1. 动态分区自动创建
|
|
438
|
+
INSERT OVERWRITE TABLE target PARTITION(dt, region)
|
|
439
|
+
SELECT ..., dt, region FROM source; -- 自动创建所有dt×region组合
|
|
440
|
+
|
|
441
|
+
-- 2. 分区修复
|
|
442
|
+
MSCK REPAIR TABLE target; -- 自动发现新分区
|
|
443
|
+
|
|
444
|
+
-- 3. 分区删除
|
|
445
|
+
ALTER TABLE target DROP PARTITION (dt<'20240101'); -- 批量删除
|
|
446
|
+
```
|
|
447
|
+
|
|
448
|
+
**Lakehouse中的等价实现**:
|
|
449
|
+
|
|
450
|
+
```sql
|
|
451
|
+
-- 1. 动态分区创建(需要注意分区数量)
|
|
452
|
+
-- 建议先检查分区数量
|
|
453
|
+
SELECT COUNT(DISTINCT CONCAT(dt, '/', region)) FROM source;
|
|
454
|
+
|
|
455
|
+
-- 分批处理(如果数量较多)
|
|
456
|
+
INSERT INTO target
|
|
457
|
+
SELECT ..., dt, region FROM source
|
|
458
|
+
WHERE dt BETWEEN '20240601' AND '20240615';
|
|
459
|
+
|
|
460
|
+
-- 2. 分区发现(自动完成,无需手动修复)
|
|
461
|
+
-- Lakehouse自动管理分区元数据
|
|
462
|
+
|
|
463
|
+
-- 3. 分区清理(功能更强大)
|
|
464
|
+
TRUNCATE TABLE target PARTITION (dt = '20240101');
|
|
465
|
+
```
|
|
466
|
+
|
|
467
|
+
#### 分区生命周期管理迁移
|
|
468
|
+
|
|
469
|
+
**MaxCompute的自动生命周期**:
|
|
470
|
+
|
|
471
|
+
```sql
|
|
472
|
+
-- MaxCompute中设置表级生命周期
|
|
473
|
+
ALTER TABLE events SET LIFECYCLE 90; -- 90天后自动删除
|
|
474
|
+
|
|
475
|
+
-- 分区级生命周期
|
|
476
|
+
ALTER TABLE events PARTITION(dt='20240101') SET LIFECYCLE 30; -- 单分区30天
|
|
477
|
+
```
|
|
478
|
+
|
|
479
|
+
**Lakehouse中的分区清理功能**:
|
|
480
|
+
|
|
481
|
+
```sql
|
|
482
|
+
-- 基础分区清理(针对STRING分区)
|
|
483
|
+
TRUNCATE TABLE events PARTITION (dt = '20240101');
|
|
484
|
+
|
|
485
|
+
-- 转换分区清理(需要使用具体分区值)
|
|
486
|
+
TRUNCATE TABLE events_with_days PARTITION (days(event_date) = 19875);
|
|
487
|
+
|
|
488
|
+
-- ✅ 高级功能:条件过滤清理
|
|
489
|
+
-- 删除90天前的所有分区(需要先计算具体分区值)
|
|
490
|
+
-- 对于STRING分区:
|
|
491
|
+
TRUNCATE TABLE events
|
|
492
|
+
PARTITION (dt < date_format(date_sub(current_date(), 90), 'yyyyMMdd'));
|
|
493
|
+
|
|
494
|
+
-- 复合条件清理:删除特定日期和地区的数据
|
|
495
|
+
TRUNCATE TABLE sales
|
|
496
|
+
PARTITION (days(sale_date) = 19875 AND region = 'beijing');
|
|
497
|
+
|
|
498
|
+
-- 批量分区清理:同时清理多个分区
|
|
499
|
+
TRUNCATE TABLE logs
|
|
500
|
+
PARTITION (days(log_date) = 19875),
|
|
501
|
+
PARTITION (days(log_date) = 19876);
|
|
502
|
+
```
|
|
503
|
+
|
|
504
|
+
**分区清理最佳实践**:
|
|
505
|
+
|
|
506
|
+
```sql
|
|
507
|
+
-- 1. 清理前先检查要删除的分区
|
|
508
|
+
SHOW PARTITIONS EXTENDED table_name
|
|
509
|
+
WHERE dt < '2024-01-01';
|
|
510
|
+
|
|
511
|
+
-- 2. 分阶段清理大量分区(避免长时间锁表)
|
|
512
|
+
-- 对于STRING分区:
|
|
513
|
+
TRUNCATE TABLE large_table
|
|
514
|
+
PARTITION (dt = '20230101');
|
|
515
|
+
-- 然后继续清理下一天的数据
|
|
516
|
+
|
|
517
|
+
-- 对于转换分区,需要先查询分区值:
|
|
518
|
+
-- SELECT days('2023-01-01'); -- 获取具体分区值
|
|
519
|
+
TRUNCATE TABLE large_table_with_days
|
|
520
|
+
PARTITION (days(date_col) = 具体分区值);
|
|
521
|
+
|
|
522
|
+
-- 3. 定期清理调度脚本示例
|
|
523
|
+
-- 每日凌晨2点执行,清理30天前的分区(STRING分区)
|
|
524
|
+
TRUNCATE TABLE daily_logs
|
|
525
|
+
PARTITION (dt < date_format(date_sub(current_date(), 30), 'yyyyMMdd'));
|
|
526
|
+
```
|
|
527
|
+
|
|
528
|
+
### 🎯 **性能优化策略的迁移**
|
|
529
|
+
|
|
530
|
+
#### Z-Order优化的等价实现
|
|
531
|
+
|
|
532
|
+
**Databricks Delta Lake的Z-Order**:
|
|
533
|
+
|
|
534
|
+
```sql
|
|
535
|
+
-- Delta Lake中的多维优化
|
|
536
|
+
OPTIMIZE events ZORDER BY (user_id, event_type, timestamp);
|
|
537
|
+
-- 实现多个字段的联合优化
|
|
538
|
+
```
|
|
539
|
+
|
|
540
|
+
**Lakehouse中的等价策略**:
|
|
541
|
+
|
|
542
|
+
```sql
|
|
543
|
+
-- 策略1:分区+分桶组合(推荐)
|
|
544
|
+
CREATE TABLE events (
|
|
545
|
+
user_id INT,
|
|
546
|
+
event_type STRING,
|
|
547
|
+
event_data JSON,
|
|
548
|
+
timestamp TIMESTAMP_LTZ
|
|
549
|
+
) PARTITIONED BY (days(timestamp)) -- 时间分区
|
|
550
|
+
CLUSTERED BY (user_id) INTO 32 BUCKETS; -- 用户散列分桶
|
|
551
|
+
|
|
552
|
+
-- 策略2:分区+排序组合
|
|
553
|
+
CREATE TABLE events_sorted (
|
|
554
|
+
user_id INT,
|
|
555
|
+
event_type STRING,
|
|
556
|
+
event_data JSON,
|
|
557
|
+
timestamp TIMESTAMP_LTZ
|
|
558
|
+
) PARTITIONED BY (days(timestamp)) -- 时间分区
|
|
559
|
+
SORTED BY (event_type); -- 事件类型排序
|
|
560
|
+
|
|
561
|
+
-- 策略3:重写优化(类似OPTIMIZE)
|
|
562
|
+
INSERT OVERWRITE events
|
|
563
|
+
SELECT * FROM events
|
|
564
|
+
ORDER BY user_id, event_type, timestamp; -- 手动重排数据
|
|
565
|
+
```
|
|
566
|
+
|
|
567
|
+
#### 分区裁剪优化的迁移
|
|
568
|
+
|
|
569
|
+
**原平台的分区裁剪逻辑**:
|
|
570
|
+
|
|
571
|
+
```sql
|
|
572
|
+
-- Spark中的复杂分区过滤
|
|
573
|
+
df.filter(
|
|
574
|
+
(col("year") >
|
|
575
|
+
|
|
576
|
+
**Lakehouse 中的等价查询**:
|
|
577
|
+
|
|
578
|
+
< '2024-09-01' -- month <= 8
|
|
579
|
+
AND region = 'beijing'; -- region = "beijing"
|
|
580
|
+
|
|
581
|
+
-- 💡 关键:将复杂的分区逻辑转换为简单的时间范围查询
|
|
582
|
+
```
|
|
583
|
+
|
|
584
|
+
***
|
|
585
|
+
|
|
586
|
+
## 分平台迁移注意事项
|
|
587
|
+
|
|
588
|
+
### 🐘 **Hive用户特别注意**
|
|
589
|
+
|
|
590
|
+
#### 语法兼容性差异
|
|
591
|
+
|
|
592
|
+
```sql
|
|
593
|
+
-- ✅ Hive语法在Lakehouse中仍然支持
|
|
594
|
+
CREATE TABLE hive_style (
|
|
595
|
+
order_id INT,
|
|
596
|
+
amount DOUBLE
|
|
597
|
+
) PARTITIONED BY (dt STRING, region STRING); -- 完全兼容
|
|
598
|
+
|
|
599
|
+
-- ⚠️ 但要注意ADD COLUMN的位置问题
|
|
600
|
+
-- Hive: 新列会加到分区列之前的最后位置
|
|
601
|
+
-- Lakehouse: 建议明确指定列位置
|
|
602
|
+
ALTER TABLE hive_style ADD COLUMN new_col STRING AFTER amount; -- 明确指定位置
|
|
603
|
+
```
|
|
604
|
+
|
|
605
|
+
#### 动态分区配置差异
|
|
606
|
+
|
|
607
|
+
```sql
|
|
608
|
+
-- Hive中需要的配置在Lakehouse中不需要
|
|
609
|
+
-- set hive.exec.dynamic.partition=true; -- Lakehouse中不需要
|
|
610
|
+
-- set hive.exec.dynamic.partition.mode=nonstrict; -- Lakehouse中不需要
|
|
611
|
+
|
|
612
|
+
-- 但要注意分区数量管理
|
|
613
|
+
-- Hive: hive.exec.max.dynamic.partitions=1000(可调)
|
|
614
|
+
-- Lakehouse: 建议单次操作控制在合理范围内
|
|
615
|
+
```
|
|
616
|
+
|
|
617
|
+
#### Hive用户的真实痛点补充
|
|
618
|
+
|
|
619
|
+
```sql
|
|
620
|
+
-- Hive用户常遇到的额外问题:
|
|
621
|
+
-- 1. 分区字段类型限制
|
|
622
|
+
CREATE TABLE hive_table PARTITIONED BY (dt STRING); -- Hive分区字段通常必须是STRING
|
|
623
|
+
|
|
624
|
+
-- 2. 分区目录结构依赖
|
|
625
|
+
-- Hive严格依赖 /data/table/year=2024/month=06/day=01/ 这样的目录结构
|
|
626
|
+
-- Lakehouse中无此限制,更灵活
|
|
627
|
+
|
|
628
|
+
-- 3. 分区修复的频繁需求
|
|
629
|
+
-- Hive: MSCK REPAIR TABLE table_name; -- 经常需要手动修复
|
|
630
|
+
-- Lakehouse: 自动维护元数据,无需手动修复
|
|
631
|
+
```
|
|
632
|
+
|
|
633
|
+
### ⚡ **Spark用户特别注意**
|
|
634
|
+
|
|
635
|
+
#### DataFrame写入方式差异
|
|
636
|
+
|
|
637
|
+
```sql
|
|
638
|
+
-- Spark DataFrame常见写入方式
|
|
639
|
+
-- df.write.mode("overwrite").partitionBy("date", "region").saveAsTable("table")
|
|
640
|
+
|
|
641
|
+
-- 迁移到Lakehouse SQL时需要注意
|
|
642
|
+
-- 1. 确保表已经创建并正确分区
|
|
643
|
+
CREATE TABLE spark_migrated PARTITIONED BY (date STRING, region STRING);
|
|
644
|
+
|
|
645
|
+
-- 2. 使用INSERT语句而不是saveAsTable
|
|
646
|
+
INSERT OVERWRITE spark_migrated SELECT * FROM source_data;
|
|
647
|
+
```
|
|
648
|
+
|
|
649
|
+
#### 转换函数名映射
|
|
650
|
+
|
|
651
|
+
| Spark函数 | Lakehouse函数 | 说明 |
|
|
652
|
+
| ---------------- | ------------- | ------ |
|
|
653
|
+
| `year(col)` | `years(col)` | 注意复数形式 |
|
|
654
|
+
| `month(col)` | `months(col)` | 注意复数形式 |
|
|
655
|
+
| `dayofyear(col)` | `days(col)` | 函数名不同 |
|
|
656
|
+
| `hour(col)` | `hours(col)` | 注意复数形式 |
|
|
657
|
+
|
|
658
|
+
#### Spark用户的额外挑战
|
|
659
|
+
|
|
660
|
+
```scala
|
|
661
|
+
// Spark用户更常遇到的问题:
|
|
662
|
+
// 1. 分区发现问题
|
|
663
|
+
spark.sql("MSCK REPAIR TABLE table_name") // Spark也有这个问题
|
|
664
|
+
|
|
665
|
+
// 2. 动态分区写入的性能陷阱
|
|
666
|
+
df.write.mode("append")
|
|
667
|
+
.option("maxRecordsPerFile", "50000") // 控制文件大小
|
|
668
|
+
.partitionBy("date")
|
|
669
|
+
.saveAsTable("table")
|
|
670
|
+
|
|
671
|
+
// 3. 分区列自动推断类型问题
|
|
672
|
+
df.write.partitionBy($"date".cast("string")) // 必须转string
|
|
673
|
+
```
|
|
674
|
+
|
|
675
|
+
### ☁️ **MaxCompute用户特别注意**
|
|
676
|
+
|
|
677
|
+
#### 分区使用习惯调整
|
|
678
|
+
|
|
679
|
+
```sql
|
|
680
|
+
-- MaxCompute强制要求分区条件
|
|
681
|
+
-- SELECT * FROM table WHERE pt='20240601'; -- 必须带分区条件,否则报错
|
|
682
|
+
|
|
683
|
+
-- Lakehouse中分区条件是自动的
|
|
684
|
+
SELECT * FROM table WHERE order_date='2024-06-01'; -- 自动分区裁剪,更灵活
|
|
685
|
+
|
|
686
|
+
-- ⚠️ 但仍然建议在查询中包含分区条件以获得最佳性能
|
|
687
|
+
```
|
|
688
|
+
|
|
689
|
+
#### 生命周期管理差异
|
|
690
|
+
|
|
691
|
+
```sql
|
|
692
|
+
-- MaxCompute的自动生命周期
|
|
693
|
+
-- ALTER TABLE table SET LIFECYCLE 30; -- 30天后自动删除
|
|
694
|
+
|
|
695
|
+
-- Lakehouse中需要手动管理或通过调度
|
|
696
|
+
TRUNCATE TABLE table PARTITION (order_date = '2024-05-01'); -- 手动清理
|
|
697
|
+
```
|
|
698
|
+
|
|
699
|
+
#### MaxCompute用户的真实痛点
|
|
700
|
+
|
|
701
|
+
```sql
|
|
702
|
+
-- MaxCompute用户最常遇到的问题:
|
|
703
|
+
-- 1. 强制分区过滤的习惯
|
|
704
|
+
-- MaxCompute: 不带分区条件会直接报错
|
|
705
|
+
-- Lakehouse: 允许全表扫描,但建议带分区条件
|
|
706
|
+
|
|
707
|
+
-- 2. 分区表的INSERT OVERWRITE语法差异
|
|
708
|
+
-- MaxCompute: INSERT OVERWRITE TABLE target PARTITION(dt='20240601')
|
|
709
|
+
-- Lakehouse: INSERT OVERWRITE target ...(自动识别分区)
|
|
710
|
+
|
|
711
|
+
-- 3. 跨项目访问语法变化
|
|
712
|
+
-- MaxCompute: SELECT * FROM project.table WHERE pt='20240601';
|
|
713
|
+
-- Lakehouse: SELECT * FROM catalog.schema.table WHERE pt='20240601';
|
|
714
|
+
```
|
|
715
|
+
|
|
716
|
+
### ❄️ **Snowflake用户特别注意**
|
|
717
|
+
|
|
718
|
+
#### 🚨 **主要认知转变:从自动优化到主动分区设计**
|
|
719
|
+
|
|
720
|
+
**传统Snowflake用户的习惯**:您可能较少关注底层分区设计
|
|
721
|
+
|
|
722
|
+
```sql
|
|
723
|
+
-- 传统Snowflake中的典型使用模式
|
|
724
|
+
CREATE TABLE orders (id INT, amount DOUBLE, order_date DATE); -- 系统自动管理存储
|
|
725
|
+
ALTER TABLE orders CLUSTER BY (order_date); -- 设置聚簇键,系统自动微分区管理
|
|
726
|
+
|
|
727
|
+
-- 查询时完全不用考虑分区
|
|
728
|
+
SELECT * FROM orders WHERE amount >
|
|
729
|
+
|
|
730
|
+
**现代 Snowflake 用户(Iceberg 表)**:语法基本相似,但有细微差异
|
|
731
|
+
|
|
732
|
+
```sql
|
|
733
|
+
-- Snowflake Iceberg表
|
|
734
|
+
CREATE ICEBERG TABLE orders_iceberg PARTITION BY (year(order_date));
|
|
735
|
+
|
|
736
|
+
-- Lakehouse中的对应语法
|
|
737
|
+
CREATE TABLE orders_lakehouse PARTITIONED BY (years(order_date)); -- 注意复数
|
|
738
|
+
```
|
|
739
|
+
|
|
740
|
+
**迁移到 Lakehouse 的调整**:
|
|
741
|
+
|
|
742
|
+
```sql
|
|
743
|
+
-- ❌ 传统Snowflake思维在Lakehouse中可能不够优化
|
|
744
|
+
CREATE TABLE orders (id INT, amount DOUBLE, order_date DATE); -- 创建了普通表,不是分区表
|
|
745
|
+
-- 结果:查询性能可能不够理想
|
|
746
|
+
|
|
747
|
+
-- ✅ 学会主动设计分区策略
|
|
748
|
+
CREATE TABLE orders (
|
|
749
|
+
id INT,
|
|
750
|
+
amount DOUBLE,
|
|
751
|
+
order_date DATE
|
|
752
|
+
) PARTITIONED BY (days(order_date)); -- 主动设计分区
|
|
753
|
+
|
|
754
|
+
-- 查询时虽然自动裁剪,但分区设计直接影响性能
|
|
755
|
+
SELECT * FROM orders
|
|
756
|
+
WHERE order_date >= '2024-06-01' -- 分区条件(建议包含)
|
|
757
|
+
AND amount > 1000; -- 业务条件
|
|
758
|
+
```
|
|
759
|
+
|
|
760
|
+
#### Snowflake用户的学习路径
|
|
761
|
+
|
|
762
|
+
```sql
|
|
763
|
+
-- 阶段1:理解分区的价值
|
|
764
|
+
-- 分区 = 物理上将数据按某个字段分别存储
|
|
765
|
+
-- 目的:查询时只扫描相关分区,而不是全表
|
|
766
|
+
|
|
767
|
+
-- 阶段2:学会分区设计
|
|
768
|
+
-- 问自己:我的查询最常用哪个字段做过滤?
|
|
769
|
+
-- 时间字段:order_date, created_at, updated_at
|
|
770
|
+
-- 业务字段:region, department, customer_type
|
|
771
|
+
|
|
772
|
+
-- 阶段3:验证分区效果
|
|
773
|
+
-- 对比分区表 vs 非分区表的查询性能
|
|
774
|
+
```
|
|
775
|
+
|
|
776
|
+
#### Snowflake用户的认知重点
|
|
777
|
+
|
|
778
|
+
```sql
|
|
779
|
+
-- Snowflake用户需要重点理解:
|
|
780
|
+
-- 1. 不是所有数据库都会自动优化
|
|
781
|
+
SELECT * FROM large_table WHERE complex_condition; -- 需要考虑分区设计
|
|
782
|
+
|
|
783
|
+
-- 2. 分区设计的重要性
|
|
784
|
+
-- Snowflake的micro-partitions是自动的,但Lakehouse中需要主动设计
|
|
785
|
+
|
|
786
|
+
-- 3. 文件系统优化概念
|
|
787
|
+
-- 理解"小文件问题"、"分区数量控制"等概念
|
|
788
|
+
|
|
789
|
+
-- 4. 查询优化意识
|
|
790
|
+
SELECT * FROM large_partitioned_table
|
|
791
|
+
WHERE order_date >= '2024-06-01'; -- 包含分区条件的查询习惯
|
|
792
|
+
```
|
|
793
|
+
|
|
794
|
+
### 🧱 **Databricks 用户特别注意**
|
|
795
|
+
|
|
796
|
+
#### Delta Lake vs Iceberg差异
|
|
797
|
+
|
|
798
|
+
```sql
|
|
799
|
+
-- Delta Lake的分区语法
|
|
800
|
+
-- CREATE TABLE delta_table PARTITIONED BY (year, month);
|
|
801
|
+
|
|
802
|
+
-- Lakehouse (Iceberg)中的等价语法
|
|
803
|
+
CREATE TABLE iceberg_table PARTITIONED BY (years(date_col)); -- 只能用单一时间粒度
|
|
804
|
+
|
|
805
|
+
-- ⚠️ 不能像Delta Lake那样同时使用多个时间粒度
|
|
806
|
+
```
|
|
807
|
+
|
|
808
|
+
#### 优化命令差异
|
|
809
|
+
|
|
810
|
+
```sql
|
|
811
|
+
-- Delta Lake的优化命令
|
|
812
|
+
-- OPTIMIZE table_name;
|
|
813
|
+
-- OPTIMIZE table_name ZORDER BY (col1, col2);
|
|
814
|
+
|
|
815
|
+
-- Lakehouse中通过重写实现类似效果
|
|
816
|
+
INSERT OVERWRITE table_name SELECT * FROM table_name; -- 文件合并优化
|
|
817
|
+
```
|
|
818
|
+
|
|
819
|
+
#### Databricks用户的高级功能迁移
|
|
820
|
+
|
|
821
|
+
```sql
|
|
822
|
+
-- Databricks用户更常遇到的问题:
|
|
823
|
+
-- 1. Delta Lake的时间旅行习惯
|
|
824
|
+
-- Delta: SELECT * FROM table TIMESTAMP AS OF '2024-01-01 00:00:00';
|
|
825
|
+
-- Lakehouse: SELECT * FROM table TIMESTAMP AS OF '2024-01-01 00:00:00'; -- 语法相似
|
|
826
|
+
|
|
827
|
+
-- 2. OPTIMIZE和Z-ORDER的重度依赖
|
|
828
|
+
-- Delta: OPTIMIZE table ZORDER BY (col1, col2); -- 这是日常操作
|
|
829
|
+
-- Lakehouse: 需要通过分区+排序+分桶组合实现
|
|
830
|
+
|
|
831
|
+
-- 3. Unity Catalog的影响
|
|
832
|
+
-- Delta: CREATE TABLE catalog.schema.table; -- 三层命名空间习惯
|
|
833
|
+
-- Lakehouse: CREATE TABLE schema.table; -- 两层命名空间
|
|
834
|
+
```
|
|
835
|
+
|
|
836
|
+
***
|
|
837
|
+
|
|
838
|
+
## 实战避坑指南
|
|
839
|
+
|
|
840
|
+
### 分区创建避坑
|
|
841
|
+
|
|
842
|
+
#### Do's and Don'ts 对比
|
|
843
|
+
|
|
844
|
+
```sql
|
|
845
|
+
-- ❌ 错误做法:不明确的分区设计
|
|
846
|
+
CREATE TABLE bad_partition (
|
|
847
|
+
id INT,
|
|
848
|
+
data STRING,
|
|
849
|
+
timestamp_col TIMESTAMP_LTZ
|
|
850
|
+
); -- 没有PARTITIONED BY,不是分区表
|
|
851
|
+
|
|
852
|
+
INSERT INTO bad_partition VALUES (1, 'test', CURRENT_TIMESTAMP());
|
|
853
|
+
-- 后面发现查询慢,再想加分区就晚了
|
|
854
|
+
|
|
855
|
+
-- ✅ 正确做法:提前规划分区策略
|
|
856
|
+
CREATE TABLE good_partition (
|
|
857
|
+
id INT,
|
|
858
|
+
data STRING,
|
|
859
|
+
timestamp_col TIMESTAMP_LTZ
|
|
860
|
+
) PARTITIONED BY (days(timestamp_col));
|
|
861
|
+
|
|
862
|
+
-- 立即验证分区是否正确创建
|
|
863
|
+
SHOW PARTITIONS good_partition; -- 应该能正常执行,不报错
|
|
864
|
+
```
|
|
865
|
+
|
|
866
|
+
#### 分区字段类型选择
|
|
867
|
+
|
|
868
|
+
```sql
|
|
869
|
+
-- ❌ 容易踩坑的类型选择
|
|
870
|
+
CREATE TABLE date_type_trap (
|
|
871
|
+
id INT,
|
|
872
|
+
order_date DATE -- DATE类型
|
|
873
|
+
) PARTITIONED BY (order_date);
|
|
874
|
+
|
|
875
|
+
-- 插入数据时可能报类型错误
|
|
876
|
+
INSERT INTO date_type_trap VALUES (1, '2024-06-01'); -- 字符串vs DATE类型
|
|
877
|
+
|
|
878
|
+
-- ✅ 推荐做法:使用STRING类型分区
|
|
879
|
+
CREATE TABLE string_partition (
|
|
880
|
+
id INT,
|
|
881
|
+
order_date_str STRING -- 用STRING类型避免类型转换问题
|
|
882
|
+
) PARTITIONED BY (order_date_str);
|
|
883
|
+
|
|
884
|
+
-- 或者使用转换分区
|
|
885
|
+
CREATE TABLE transform_partition (
|
|
886
|
+
id INT,
|
|
887
|
+
order_date DATE
|
|
888
|
+
) PARTITIONED BY (days(order_date)); -- 让系统处理类型转换
|
|
889
|
+
```
|
|
890
|
+
|
|
891
|
+
### 数据写入避坑
|
|
892
|
+
|
|
893
|
+
#### 大批量写入策略
|
|
894
|
+
|
|
895
|
+
<= 'end_value';
|
|
896
|
+
```
|
|
897
|
+
|
|
898
|
+
#### 分区数据一致性
|
|
899
|
+
|
|
900
|
+
```sql
|
|
901
|
+
-- ⚠️ 注意:转换分区的时区问题
|
|
902
|
+
CREATE TABLE timezone_sensitive (
|
|
903
|
+
id INT,
|
|
904
|
+
event_time TIMESTAMP_LTZ -- 带时区的时间戳
|
|
905
|
+
) PARTITIONED BY (days(event_time));
|
|
906
|
+
|
|
907
|
+
-- 不同时区的相同本地时间可能落在不同分区
|
|
908
|
+
INSERT INTO timezone_sensitive VALUES
|
|
909
|
+
(1, TIMESTAMP '2024-06-01 23:30:00 UTC'), -- UTC时区
|
|
910
|
+
(2, TIMESTAMP '2024-06-01 23:30:00'); -- 系统默认时区
|
|
911
|
+
|
|
912
|
+
-- 查看分区分布
|
|
913
|
+
SHOW PARTITIONS timezone_sensitive;
|
|
914
|
+
-- 可能看到两个不同的分区值:days(event_time)=19875 和 days(event_time)=19876
|
|
915
|
+
```
|
|
916
|
+
|
|
917
|
+
### 查询性能避坑
|
|
918
|
+
|
|
919
|
+
#### 分区裁剪失效场景
|
|
920
|
+
|
|
921
|
+
```sql
|
|
922
|
+
-- ❌ 无法利用分区裁剪的查询
|
|
923
|
+
SELECT * FROM partitioned_table
|
|
924
|
+
WHERE YEAR(order_date) = 2024; -- 函数包装分区字段
|
|
925
|
+
|
|
926
|
+
SELECT * FROM partitioned_table
|
|
927
|
+
WHERE order_date LIKE '2024%'; -- 模糊匹配
|
|
928
|
+
|
|
929
|
+
-- ✅ 能够有效分区裁剪的查询
|
|
930
|
+
SELECT * FROM partitioned_table
|
|
931
|
+
WHERE order_date >
|
|
932
|
+
|
|
933
|
+
### 🚨 **分区故障快速排查**
|
|
934
|
+
|
|
935
|
+
#### 问题:分区表性能比非分区表还差
|
|
936
|
+
|
|
937
|
+
**排查命令**:
|
|
938
|
+
|
|
939
|
+
< 10*1024*1024;
|
|
940
|
+
|
|
941
|
+
-- 4. 检查查询是否利用分区
|
|
942
|
+
EXPLAIN SELECT * FROM your_table WHERE partition_col = 'value';
|
|
943
|
+
```
|
|
944
|
+
|
|
945
|
+
**常见原因和解决方案**:
|
|
946
|
+
|
|
947
|
+
* **过度分区(分区太小太多**) → 重新设计分区粒度
|
|
948
|
+
* **查询未包含分区字段** → 优化查询WHERE条件
|
|
949
|
+
* **创建的不是真正的分区表** → 用原生SQL重建
|
|
950
|
+
|
|
951
|
+
***
|
|
952
|
+
|
|
953
|
+
## 分区性能验证实战指南
|
|
954
|
+
|
|
955
|
+
### 🎯 **分区效果验证的标准方法**
|
|
956
|
+
|
|
957
|
+
#### 1. 创建对比测试
|
|
958
|
+
|
|
959
|
+
```sql
|
|
960
|
+
-- 创建相同数据的分区表和非分区表
|
|
961
|
+
CREATE TABLE sales_partitioned (
|
|
962
|
+
id INT, amount DOUBLE, sale_date DATE
|
|
963
|
+
) PARTITIONED BY (days(sale_date));
|
|
964
|
+
|
|
965
|
+
CREATE TABLE sales_normal (
|
|
966
|
+
id INT, amount DOUBLE, sale_date DATE
|
|
967
|
+
);
|
|
968
|
+
|
|
969
|
+
-- 插入相同的大量数据(建议>
|
|
970
|
+
|
|
971
|
+
#### 2. 性能基准测试
|
|
972
|
+
|
|
973
|
+
<= '2024-06-30';
|
|
974
|
+
```
|
|
975
|
+
|
|
976
|
+
#### 3. 分区健康检查
|
|
977
|
+
|
|
978
|
+
```sql
|
|
979
|
+
-- 检查分区大小分布
|
|
980
|
+
SHOW PARTITIONS EXTENDED table_name;
|
|
981
|
+
|
|
982
|
+
-- 理想状态:
|
|
983
|
+
-- ✅ 每个分区 128MB - 1GB
|
|
984
|
+
-- ✅ 分区大小相对均匀
|
|
985
|
+
-- ❌ 大量小于10MB的小分区
|
|
986
|
+
-- ❌ 单个分区超过5GB
|
|
987
|
+
```
|
|
988
|
+
|
|
989
|
+
### 📊 **性能问题诊断**
|
|
990
|
+
|
|
991
|
+
#### 分区表比非分区表还慢?
|
|
992
|
+
|
|
993
|
+
**可能原因与解决方案**:
|
|
994
|
+
|
|
995
|
+
1. **过度分区**:分区太细,元数据开销大
|
|
996
|
+
```sql
|
|
997
|
+
-- 问题:每小时分区,导致大量小分区
|
|
998
|
+
PARTITIONED BY (hours(timestamp))
|
|
999
|
+
|
|
1000
|
+
-- 解决:改为天级分区
|
|
1001
|
+
PARTITIONED BY (days(timestamp))
|
|
1002
|
+
```
|
|
1003
|
+
|
|
1004
|
+
2. **查询未利用分区**:WHERE条件没有包含分区字段
|
|
1005
|
+
```sql
|
|
1006
|
+
-- ❌ 无法利用分区
|
|
1007
|
+
SELECT * FROM partitioned_table WHERE amount >
|
|
1008
|
+
|
|
1009
|
+
3. **分区字段选择错误**:分区字段不是查询热点
|
|
1010
|
+
```sql
|
|
1011
|
+
-- 问题分析:检查查询模式
|
|
1012
|
+
-- 如果查询主要按user_id过滤,但按date分区,效果不佳
|
|
1013
|
+
|
|
1014
|
+
-- 解决:重新设计分区策略
|
|
1015
|
+
PARTITIONED BY (bucket(100, user_id)) -- 改按用户散列分区
|
|
1016
|
+
```
|
|
1017
|
+
|
|
1018
|
+
### 🎛️ **分区调优实战**
|
|
1019
|
+
|
|
1020
|
+
#### 1. 分区粒度选择
|
|
1021
|
+
|
|
1022
|
+
< 10K:月级分区 months()
|
|
1023
|
+
```
|
|
1024
|
+
|
|
1025
|
+
#### 2. 复合分区优化
|
|
1026
|
+
|
|
1027
|
+
```sql
|
|
1028
|
+
-- 原始复合分区问题
|
|
1029
|
+
CREATE TABLE sales_old PARTITIONED BY (region, sale_date, channel);
|
|
1030
|
+
-- 问题:10个地区 × 365天 × 3渠道 = 10,950分区
|
|
1031
|
+
|
|
1032
|
+
-- 优化方案1:主次分区
|
|
1033
|
+
CREATE TABLE sales_optimized (
|
|
1034
|
+
..., region STRING, channel STRING
|
|
1035
|
+
) PARTITIONED BY (days(sale_date)); -- 主分区:时间
|
|
1036
|
+
CREATE BLOOMFILTER INDEX idx_region ON TABLE sales_optimized(region);
|
|
1037
|
+
CREATE BLOOMFILTER INDEX idx_channel ON TABLE sales_optimized(channel);
|
|
1038
|
+
|
|
1039
|
+
-- 优化方案2:散列压缩
|
|
1040
|
+
CREATE TABLE sales_hash PARTITIONED BY (
|
|
1041
|
+
days(sale_date), -- 时间分区:365个
|
|
1042
|
+
bucket(5, region) -- 地区散列到5个桶
|
|
1043
|
+
);
|
|
1044
|
+
-- 结果:365 × 5 = 1,825分区(可控)
|
|
1045
|
+
```
|
|
1046
|
+
|
|
1047
|
+
#### 3. 分区演化策略
|
|
1048
|
+
|
|
1049
|
+
```sql
|
|
1050
|
+
-- 定期分区健康检查
|
|
1051
|
+
WITH partition_health AS (
|
|
1052
|
+
SELECT
|
|
1053
|
+
partitions,
|
|
1054
|
+
total_rows,
|
|
1055
|
+
bytes,
|
|
1056
|
+
CASE
|
|
1057
|
+
WHEN CAST(bytes AS BIGINT) < 10*1024*1024 THEN 'TOO_SMALL'
|
|
1058
|
+
WHEN CAST(bytes AS BIGINT) >
|
|
1059
|
+
|
|
1060
|
+
#### 迁移效果对比
|
|
1061
|
+
|
|
1062
|
+
<= '2024-06-01 17:59:59';
|
|
1063
|
+
|
|
1064
|
+
-- 性能提升:
|
|
1065
|
+
-- 📈 查询速度:提升40%(避免小分区扫描)
|
|
1066
|
+
-- 🔧 维护成本:降低60%(分区数量大幅减少)
|
|
1067
|
+
-- 📝 查询复杂度:降低80%(无需计算年月日时)
|
|
1068
|
+
```
|
|
1069
|
+
|
|
1070
|
+
### 📋 **案例2:日志分析表迁移(MaxCompute → 云器Lakehouse**)
|
|
1071
|
+
|
|
1072
|
+
#### 原始MaxCompute表
|
|
1073
|
+
|
|
1074
|
+
```sql
|
|
1075
|
+
-- 原MaxCompute表:严格分区限制
|
|
1076
|
+
CREATE TABLE app_logs (
|
|
1077
|
+
user_id STRING,
|
|
1078
|
+
event_type STRING,
|
|
1079
|
+
event_data STRING
|
|
1080
|
+
) PARTITIONED BY (
|
|
1081
|
+
pt STRING, -- 格式:20240601
|
|
1082
|
+
region STRING, -- 地区:beijing, shanghai
|
|
1083
|
+
app_version STRING -- 版本:1.0, 1.1, 1.2
|
|
1084
|
+
);
|
|
1085
|
+
|
|
1086
|
+
-- MaxCompute特点:
|
|
1087
|
+
-- ✅ 强制分区条件:SELECT必须带WHERE pt='20240601'
|
|
1088
|
+
-- ❌ 分区数量爆炸:365 × 10 × 20 = 73,000/年
|
|
1089
|
+
-- ❌ 数据倾斜:北京地区数据多,其他地区很少
|
|
1090
|
+
```
|
|
1091
|
+
|
|
1092
|
+
#### 云器Lakehouse迁移策略
|
|
1093
|
+
|
|
1094
|
+
```sql
|
|
1095
|
+
-- 步骤1:分析原有查询模式
|
|
1096
|
+
-- 发现:90%查询都是按时间范围 + 地区过滤
|
|
1097
|
+
-- 决策:时间作为主分区,地区用索引
|
|
1098
|
+
|
|
1099
|
+
-- 步骤2:重设计分区架构
|
|
1100
|
+
CREATE TABLE app_logs_new (
|
|
1101
|
+
user_id STRING,
|
|
1102
|
+
event_type STRING,
|
|
1103
|
+
event_data STRING,
|
|
1104
|
+
region STRING, -- 不分区,用索引
|
|
1105
|
+
app_version STRING, -- 不分区,用索引
|
|
1106
|
+
log_date DATE
|
|
1107
|
+
) PARTITIONED BY (days(log_date)); -- 只按时间分区
|
|
1108
|
+
|
|
1109
|
+
-- 步骤3:创建索引优化非分区查询
|
|
1110
|
+
CREATE BLOOMFILTER INDEX idx_region ON TABLE app_logs_new(region);
|
|
1111
|
+
CREATE BLOOMFILTER INDEX idx_version ON TABLE app_logs_new(app_version);
|
|
1112
|
+
|
|
1113
|
+
-- 步骤4:验证查询性能
|
|
1114
|
+
-- 原查询:
|
|
1115
|
+
SELECT * FROM app_logs WHERE pt='20240601' AND region='beijing';
|
|
1116
|
+
|
|
1117
|
+
-- 新查询:
|
|
1118
|
+
SELECT * FROM app_logs_new
|
|
1119
|
+
WHERE log_date='2024-06-01' AND region='beijing';
|
|
1120
|
+
-- 结果:性能相当,但分区数量从73000降至365
|
|
1121
|
+
```
|
|
1122
|
+
|
|
1123
|
+
#### 迁移过程中的挑战
|
|
1124
|
+
|
|
1125
|
+
```sql
|
|
1126
|
+
-- 挑战1:数据类型不一致
|
|
1127
|
+
-- MaxCompute: pt STRING '20240601'
|
|
1128
|
+
-- 云器Lakehouse: log_date DATE '2024-06-01'
|
|
1129
|
+
|
|
1130
|
+
-- 解决:ETL转换脚本
|
|
1131
|
+
INSERT INTO app_logs_new
|
|
1132
|
+
SELECT
|
|
1133
|
+
user_id,
|
|
1134
|
+
event_type,
|
|
1135
|
+
event_data,
|
|
1136
|
+
region,
|
|
1137
|
+
app_version,
|
|
1138
|
+
DATE(CONCAT(
|
|
1139
|
+
SUBSTR(pt, 1, 4), '-',
|
|
1140
|
+
SUBSTR(pt, 5, 2), '-',
|
|
1141
|
+
SUBSTR(pt, 7, 2)
|
|
1142
|
+
)) as log_date
|
|
1143
|
+
FROM maxcompute_source;
|
|
1144
|
+
|
|
1145
|
+
-- 挑战2:习惯性写法需要调整
|
|
1146
|
+
-- MaxCompute习惯:WHERE pt='20240601'(字符串精确匹配)
|
|
1147
|
+
-- 云器Lakehouse:WHERE log_date='2024-06-01'(自动分区裁剪)
|
|
1148
|
+
|
|
1149
|
+
-- 挑战3:最大分区查询方式变化
|
|
1150
|
+
-- MaxCompute原有:WHERE pt = max_pt()
|
|
1151
|
+
-- 云器Lakehouse新方法:WHERE log_date = (SELECT DATE(max_pt('app_logs_new')))
|
|
1152
|
+
-- 或者重新设计为STRING分区,直接使用:WHERE pt = max_pt('app_logs_new')
|
|
1153
|
+
```
|
|
1154
|
+
|
|
1155
|
+
#### 分区维护自动化
|
|
1156
|
+
|
|
1157
|
+
```sql
|
|
1158
|
+
-- MaxCompute风格的自动化脚本(适应新平台)
|
|
1159
|
+
-- 1. 每日清理30天前的分区
|
|
1160
|
+
TRUNCATE TABLE app_logs_new
|
|
1161
|
+
PARTITION (log_date < current_date() - INTERVAL '30' DAY);
|
|
1162
|
+
|
|
1163
|
+
-- 2. 智能清理:保留最新分区,清理小分区
|
|
1164
|
+
WITH old_partitions AS (
|
|
1165
|
+
SELECT partitions
|
|
1166
|
+
FROM (SHOW PARTITIONS EXTENDED app_logs_new)
|
|
1167
|
+
WHERE CAST(bytes AS BIGINT) < 1000000 -- 小于1MB的分区
|
|
1168
|
+
AND partitions < max_pt('app_logs_new') - INTERVAL '7' DAY
|
|
1169
|
+
)
|
|
1170
|
+
-- 逐个清理小分区(注意:需要具体分区值)
|
|
1171
|
+
|
|
1172
|
+
-- 3. 分区健康检查脚本
|
|
1173
|
+
WITH partition_stats AS (
|
|
1174
|
+
SELECT
|
|
1175
|
+
partitions,
|
|
1176
|
+
CAST(total_rows AS BIGINT) as rows,
|
|
1177
|
+
CAST(bytes AS BIGINT) as size_bytes
|
|
1178
|
+
FROM (SHOW PARTITIONS EXTENDED app_logs_new)
|
|
1179
|
+
)
|
|
1180
|
+
SELECT
|
|
1181
|
+
COUNT(*) as total_partitions,
|
|
1182
|
+
AVG(size_bytes)/1024/1024 as avg_size_mb,
|
|
1183
|
+
SUM(CASE WHEN size_bytes < 10*1024*1024 THEN 1 ELSE 0 END) as small_partitions,
|
|
1184
|
+
max_pt('app_logs_new') as latest_partition
|
|
1185
|
+
FROM partition_stats;
|
|
1186
|
+
```
|
|
1187
|
+
|
|
1188
|
+
### 📋 **案例3:实时数据表迁移(Spark → 云器Lakehouse**)
|
|
1189
|
+
|
|
1190
|
+
#### 原始Spark Delta表
|
|
1191
|
+
|
|
1192
|
+
```sql
|
|
1193
|
+
-- 原Spark表:小时级分区 + Z-Order优化
|
|
1194
|
+
CREATE TABLE user_events USING DELTA
|
|
1195
|
+
PARTITIONED BY (date_hour STRING) -- 格式:2024060109
|
|
1196
|
+
OPTIONS (
|
|
1197
|
+
'path' '/data/user_events'
|
|
1198
|
+
);
|
|
1199
|
+
|
|
1200
|
+
-- 定期优化
|
|
1201
|
+
OPTIMIZE user_events ZORDER BY (user_id, event_type);
|
|
1202
|
+
```
|
|
1203
|
+
|
|
1204
|
+
#### 云器Lakehouse迁移挑战
|
|
1205
|
+
|
|
1206
|
+
```sql
|
|
1207
|
+
-- 挑战1:没有直接的Z-Order等价功能
|
|
1208
|
+
-- 挑战2:小时级分区可能过细
|
|
1209
|
+
-- 挑战3:实时写入性能要求高
|
|
1210
|
+
|
|
1211
|
+
-- 解决方案1:分区 + 分桶组合(推荐)
|
|
1212
|
+
CREATE TABLE user_events_new (
|
|
1213
|
+
user_id INT,
|
|
1214
|
+
event_type STRING,
|
|
1215
|
+
event_data JSON,
|
|
1216
|
+
event_time TIMESTAMP_LTZ
|
|
1217
|
+
) PARTITIONED BY (hours(event_time)) -- 保持小时级分区(实时需求)
|
|
1218
|
+
CLUSTERED BY (user_id) INTO 32 BUCKETS; -- 用户ID分桶
|
|
1219
|
+
|
|
1220
|
+
-- 解决方案2:分区 + 排序组合
|
|
1221
|
+
CREATE TABLE user_events_sorted (
|
|
1222
|
+
user_id INT,
|
|
1223
|
+
event_type STRING,
|
|
1224
|
+
event_data JSON,
|
|
1225
|
+
event_time TIMESTAMP_LTZ
|
|
1226
|
+
) PARTITIONED BY (hours(event_time)) -- 保持小时级分区
|
|
1227
|
+
SORTED BY (event_type); -- 事件类型排序
|
|
1228
|
+
|
|
1229
|
+
-- 实现类似Z-Order的效果:
|
|
1230
|
+
-- 1. 分区:按时间物理隔离
|
|
1231
|
+
-- 2. 分桶:按用户ID散列分布
|
|
1232
|
+
-- 3. 排序:按事件类型聚集存储
|
|
1233
|
+
```
|
|
1234
|
+
|
|
1235
|
+
#### 性能调优过程
|
|
1236
|
+
|
|
1237
|
+
```sql
|
|
1238
|
+
-- 调优1:监控分区大小
|
|
1239
|
+
WITH partition_analysis AS (
|
|
1240
|
+
SELECT
|
|
1241
|
+
partitions,
|
|
1242
|
+
CAST(bytes AS BIGINT)/1024/1024 as size_mb,
|
|
1243
|
+
CAST(total_rows AS BIGINT) as row_count
|
|
1244
|
+
FROM (SHOW PARTITIONS EXTENDED user_events_new)
|
|
1245
|
+
)
|
|
1246
|
+
SELECT * FROM partition_analysis
|
|
1247
|
+
ORDER BY size_mb DESC LIMIT 10;
|
|
1248
|
+
|
|
1249
|
+
-- 发现问题:夜间分区太小(<10MB)
|
|
1250
|
+
-- 解决:动态调整分区策略
|
|
1251
|
+
|
|
1252
|
+
-- 调优2:白天用小时分区,夜间合并
|
|
1253
|
+
-- 通过ETL实现智能分区策略:
|
|
1254
|
+
-- 8:00-22:00高峰期:小时级分区
|
|
1255
|
+
-- 22:00-8:00低峰期:合并到天级分区
|
|
1256
|
+
```
|
|
1257
|
+
|
|
1258
|
+
### 📋 **案例4:数据仓库迁移(Snowflake → 云器Lakehouse**)
|
|
1259
|
+
|
|
1260
|
+
#### Snowflake用户的特殊挑战
|
|
1261
|
+
|
|
1262
|
+
```sql
|
|
1263
|
+
-- 传统Snowflake:自动存储管理
|
|
1264
|
+
CREATE TABLE sales (
|
|
1265
|
+
order_id INT,
|
|
1266
|
+
customer_id INT,
|
|
1267
|
+
amount DECIMAL(10,2),
|
|
1268
|
+
order_date DATE
|
|
1269
|
+
);
|
|
1270
|
+
|
|
1271
|
+
-- 设置聚簇键(用户相对无感知)
|
|
1272
|
+
ALTER TABLE sales CLUSTER BY (order_date);
|
|
1273
|
+
|
|
1274
|
+
-- 查询:不用特别关心物理存储
|
|
1275
|
+
SELECT * FROM sales WHERE amount >
|
|
1276
|
+
|
|
1277
|
+
#### 云器Lakehouse学习路径
|
|
1278
|
+
|
|
1279
|
+
```sql
|
|
1280
|
+
-- 第1阶段:理解分区概念
|
|
1281
|
+
-- 问题:什么是分区?为什么需要分区?
|
|
1282
|
+
-- 答案:分区是将数据按某个字段物理分开存储,查询时只扫描相关分区
|
|
1283
|
+
|
|
1284
|
+
-- 第2阶段:学会分区设计
|
|
1285
|
+
-- 问题:应该按什么字段分区?
|
|
1286
|
+
-- 分析:检查最常用的WHERE条件
|
|
1287
|
+
SELECT
|
|
1288
|
+
COUNT(*) as query_count,
|
|
1289
|
+
'order_date filter' as filter_type
|
|
1290
|
+
FROM query_log
|
|
1291
|
+
WHERE query_text LIKE '%WHERE%order_date%'
|
|
1292
|
+
UNION ALL
|
|
1293
|
+
SELECT
|
|
1294
|
+
COUNT(*),
|
|
1295
|
+
'customer_id filter'
|
|
1296
|
+
FROM query_log
|
|
1297
|
+
WHERE query_text LIKE '%WHERE%customer_id%';
|
|
1298
|
+
|
|
1299
|
+
-- 结果:order_date过滤占80%,customer_id占20%
|
|
1300
|
+
-- 决策:按order_date分区,customer_id用索引
|
|
1301
|
+
|
|
1302
|
+
-- 第3阶段:正确的分区表设计
|
|
1303
|
+
CREATE TABLE sales_partitioned (
|
|
1304
|
+
order_id INT,
|
|
1305
|
+
customer_id INT,
|
|
1306
|
+
amount DECIMAL(10,2),
|
|
1307
|
+
order_date DATE
|
|
1308
|
+
) PARTITIONED BY (days(order_date));
|
|
1309
|
+
|
|
1310
|
+
CREATE BLOOMFILTER INDEX idx_customer ON TABLE sales_partitioned(customer_id);
|
|
1311
|
+
|
|
1312
|
+
-- 第4阶段:验证性能提升
|
|
1313
|
+
-- 对比查询:
|
|
1314
|
+
SELECT * FROM sales WHERE order_date = '2024-06-01'; -- 全表扫描
|
|
1315
|
+
SELECT * FROM sales_partitioned WHERE order_date = '2024-06-01'; -- 分区扫描
|
|
1316
|
+
|
|
1317
|
+
-- 结果:分区表通常有显著性能提升
|
|
1318
|
+
```
|
|
1319
|
+
|
|
1320
|
+
#### Snowflake用户常见调整和实际困惑
|
|
1321
|
+
|
|
1322
|
+
```sql
|
|
1323
|
+
-- 调整1:创建分区表而不是普通表
|
|
1324
|
+
CREATE TABLE sales_correct (
|
|
1325
|
+
order_id INT,
|
|
1326
|
+
amount DECIMAL(10,2),
|
|
1327
|
+
order_date DATE
|
|
1328
|
+
) PARTITIONED BY (days(order_date)); -- ✅ 这是分区表
|
|
1329
|
+
|
|
1330
|
+
-- 调整2:查询时包含分区条件
|
|
1331
|
+
SELECT * FROM sales_partitioned
|
|
1332
|
+
WHERE order_date >= '2024-06-01' -- ✅ 利用分区
|
|
1333
|
+
AND amount > 1000; -- ✅ 业务过滤
|
|
1334
|
+
|
|
1335
|
+
-- 调整3:理解分区字段的重要性
|
|
1336
|
+
SELECT * FROM sales_partitioned WHERE amount > 1000; -- 可以优化为包含分区条件
|
|
1337
|
+
|
|
1338
|
+
-- 调整4:不是所有查询都会自动很快
|
|
1339
|
+
-- Snowflake用户习惯了系统自动处理一切存储优化
|
|
1340
|
+
-- Lakehouse中需要更多的主动设计意识
|
|
1341
|
+
```
|
|
1342
|
+
|
|
1343
|
+
#### 认知转变的实际过程
|
|
1344
|
+
|
|
1345
|
+
```markdown
|
|
1346
|
+
### Snowflake用户的常见疑问:
|
|
1347
|
+
"为什么我的查询在Snowflake上很快,在Lakehouse上需要考虑分区?"
|
|
1348
|
+
|
|
1349
|
+
**答案**:Snowflake的自动优化 vs Lakehouse的主动分区设计各有优势
|
|
1350
|
+
|
|
1351
|
+
### 学习过程中的典型问题:
|
|
1352
|
+
1. "为什么要我来决定分区策略?"
|
|
1353
|
+
2. "什么是小文件问题?我之前没考虑过文件大小"
|
|
1354
|
+
3. "分区数量控制是什么意思?"
|
|
1355
|
+
|
|
1356
|
+
### 理解突破时刻:
|
|
1357
|
+
当用户看到分区表比非分区表有明显性能优势时,开始理解分区的价值
|
|
1358
|
+
```
|
|
1359
|
+
|
|
1360
|
+
***
|
|
1361
|
+
|
|
1362
|
+
## 迁移验证清单
|
|
1363
|
+
|
|
1364
|
+
### 📋 **分区创建验证**
|
|
1365
|
+
|
|
1366
|
+
```sql
|
|
1367
|
+
-- ✅ 检查点1:表确实是分区表
|
|
1368
|
+
SHOW PARTITIONS your_table_name;
|
|
1369
|
+
-- 应该能正常执行,不报"not a partitioned table"错误
|
|
1370
|
+
|
|
1371
|
+
-- ✅ 检查点2:分区字段类型正确
|
|
1372
|
+
DESCRIBE TABLE your_table_name;
|
|
1373
|
+
-- 确认分区字段的数据类型与预期一致
|
|
1374
|
+
|
|
1375
|
+
-- ✅ 检查点3:测试数据写入
|
|
1376
|
+
INSERT INTO your_table_name VALUES (测试数据);
|
|
1377
|
+
SHOW PARTITIONS your_table_name;
|
|
1378
|
+
-- 应该能看到新创建的分区
|
|
1379
|
+
|
|
1380
|
+
-- ✅ 检查点4:验证分区值格式
|
|
1381
|
+
-- 转换分区会生成数值,如 days('2024-06-01') = 19875
|
|
1382
|
+
-- years('2024-06-01') = 54(从1970年开始的年份计数)
|
|
1383
|
+
-- 确认分区值符合预期
|
|
1384
|
+
|
|
1385
|
+
-- ✅ 检查点5:测试最大分区获取(兼容性验证)
|
|
1386
|
+
SELECT max_pt('your_table_name');
|
|
1387
|
+
-- 应该返回最大的分区值,用于兼容原平台的类似功能
|
|
1388
|
+
```
|
|
1389
|
+
|
|
1390
|
+
### 📋 **性能验证**
|
|
1391
|
+
|
|
1392
|
+
```sql
|
|
1393
|
+
-- ✅ 检查点6:分区裁剪是否生效
|
|
1394
|
+
-- 对比这两个查询的执行时间
|
|
1395
|
+
SELECT COUNT(*) FROM your_table_name; -- 全表扫描
|
|
1396
|
+
SELECT COUNT(*) FROM your_table_name WHERE partition_col = 'specific_value'; -- 分区查询
|
|
1397
|
+
|
|
1398
|
+
-- 分区查询应该明显更快
|
|
1399
|
+
|
|
1400
|
+
-- ✅ 检查点7:分区大小是否合理
|
|
1401
|
+
SHOW PARTITIONS EXTENDED your_table_name;
|
|
1402
|
+
-- 检查每个分区的大小,理想范围:128MB - 1GB
|
|
1403
|
+
```
|
|
1404
|
+
|
|
1405
|
+
### 📋 **兼容性验证**
|
|
1406
|
+
|
|
1407
|
+
```sql
|
|
1408
|
+
-- ✅ 检查点8:原有查询语句是否需要修改
|
|
1409
|
+
-- 将原平台的查询语句在Lakehouse中测试
|
|
1410
|
+
-- 特别注意:
|
|
1411
|
+
-- 1. 转换函数名是否需要调整(year -> years)
|
|
1412
|
+
-- 2. 分区条件是否能自动识别
|
|
1413
|
+
-- 3. 数据类型转换是否正常
|
|
1414
|
+
|
|
1415
|
+
-- ✅ 检查点9:分区演化策略是否可行
|
|
1416
|
+
-- 验证分区清理、维护脚本是否正常工作
|
|
1417
|
+
```
|
|
1418
|
+
|
|
1419
|
+
### 🚨 **常见错误自查**
|
|
1420
|
+
|
|
1421
|
+
如果遇到以下错误,按对应方案解决:
|
|
1422
|
+
|
|
1423
|
+
| 错误信息 | 可能原因 | 解决方案 |
|
|
1424
|
+
| ------------------------------------ | -------------------------- | ------------------ |
|
|
1425
|
+
| `not a partitioned table` | 建表时未加PARTITIONED BY或语法错误 | 用原生SQL重新创建表,添加分区定义 |
|
|
1426
|
+
| `implicit cast not allowed` | 分区字段类型不匹配 | 检查插入数据类型 |
|
|
1427
|
+
| `exceeds maximum number` | 单次操作分区过多 | 调整参数或分批处理 |
|
|
1428
|
+
| `conflicts with` | 转换分区逻辑冲突 | 选择单一时间粒度 |
|
|
1429
|
+
| `months conflicts with years` | 多级时间分区设计错误 | 改用`days()`单一粒度 |
|
|
1430
|
+
| `Syntax error at or near 'ORDER'` | SHOW PARTITIONS使用了ORDER BY | 用WITH子查询实现排序 |
|
|
1431
|
+
| `cannot resolve column 'total_rows'` | TRUNCATE PARTITION中使用了分区属性 | 只能使用分区字段本身 |
|
|
1432
|
+
| `operator not found` | 类型不匹配的比较 | 确保数据类型一致 |
|
|
1433
|
+
| `duplicate.syntax.element` | CLUSTERED BY和SORTED BY同时使用 | 选择其中一种语法 |
|
|
1434
|
+
| 查询性能比原平台差 | 分区设计是否合理 | 重新评估分区策略 |
|
|
1435
|
+
| 分区过多过小 | 复合分区维度过多 | 减少分区维度,用索引替代 |
|
|
1436
|
+
| `max_pt function not found` | 可能是表名错误或权限问题 | 检查表名和schema权限 |
|
|
1437
|
+
| `TRUNCATE PARTITION failed` | 分区条件语法错误 | 检查分区过滤表达式语法 |
|
|
1438
|
+
|
|
1439
|
+
### 💡 **性能基准检验标准**
|
|
1440
|
+
|
|
1441
|
+
#### 基础迁移成功标志
|
|
1442
|
+
|
|
1443
|
+
* ✅ 原有查询逻辑无需大改即可在Lakehouse中运行
|
|
1444
|
+
* ✅ 查询性能达到或超过原平台水平
|
|
1445
|
+
* ✅ 分区维护工作量可控且自动化
|
|
1446
|
+
* ✅ 团队成员能够独立处理常见分区问题
|
|
1447
|
+
|
|
1448
|
+
#### 复杂迁移成功标志
|
|
1449
|
+
|
|
1450
|
+
* ✅ 新分区策略比原策略更简洁但性能不降低
|
|
1451
|
+
* ✅ 分区数量在合理范围内
|
|
1452
|
+
* ✅ 分区大小分布均匀(128MB-1GB/分区)
|
|
1453
|
+
* ✅ 查询模式和分区设计高度匹配
|
|
1454
|
+
* ✅ 分区维护自动化程度不低于原平台
|
|
1455
|
+
|
|
1456
|
+
***
|
|
1457
|
+
|
|
1458
|
+
## 总结:成功迁移的关键要素
|
|
1459
|
+
|
|
1460
|
+
### 🎯 **核心认知转变**
|
|
1461
|
+
|
|
1462
|
+
1. **从显式分区到隐藏分区**:不需要在每个查询中手动指定分区条件
|
|
1463
|
+
2. **从自动优化到主动设计**:分区策略需要提前规划和设计
|
|
1464
|
+
3. **从单一语法到多样选择**:支持多种分区创建方式,选择最适合的
|
|
1465
|
+
4. **从较少关注到分区意识**:特别是Snowflake等平台用户,需要增强分区设计意识
|
|
1466
|
+
5. **从复杂分区到简化设计**:多级分区需要重新设计为单一粒度分区
|
|
1467
|
+
|
|
1468
|
+
### 🛡️ **避坑要点总结**
|
|
1469
|
+
|
|
1470
|
+
#### 基础语法陷阱
|
|
1471
|
+
|
|
1472
|
+
1. **转换函数名记住复数形式**:`years`, `months`, `days`, `hours`
|
|
1473
|
+
2. **分区类型保持一致**:避免STRING和DATE类型混用
|
|
1474
|
+
3. **分区数量控制在合理范围**:避免过多小分区
|
|
1475
|
+
4. **分区设计考虑查询模式**:根据WHERE条件设计分区字段
|
|
1476
|
+
5. **及时验证分区效果**:创建后立即检查分区是否正确
|
|
1477
|
+
|
|
1478
|
+
#### 复杂迁移陷阱
|
|
1479
|
+
|
|
1480
|
+
6. **多级时间分区不能直接迁移**:`year+month+day`要改为`days()`单一粒度
|
|
1481
|
+
7. **复合分区维度要精简**:过多维度会导致分区数量过多
|
|
1482
|
+
8. **分区演化策略要重建**:从自动管理改为手动调度管理
|
|
1483
|
+
9. **性能优化策略要调整**:Z-Order等高级优化需要重新设计
|
|
1484
|
+
10. **生命周期管理要补齐**:从自动清理改为脚本定期清理
|
|
1485
|
+
|
|
1486
|
+
#### 创建验证陷阱
|
|
1487
|
+
|
|
1488
|
+
11. **建议使用原生SQL创建分区表**:确保语法准确性
|
|
1489
|
+
12. **每次创建后必须验证**:执行完整的验证清单
|
|
1490
|
+
13. **性能测试要对比基准**:确保分区表确实比非分区表快
|
|
1491
|
+
14. **分区健康状态要监控**:定期检查分区大小和数量分布
|
|
1492
|
+
|
|
1493
|
+
#### 高级语法陷阱
|
|
1494
|
+
|
|
1495
|
+
15. **SHOW PARTITIONS不支持ORDER BY**:用WITH子查询实现排序
|
|
1496
|
+
16. **bucket函数参数要合理**:推荐使用1-1000范围的正整数
|
|
1497
|
+
17. **注意NULL值和特殊字符**:系统会创建对应的分区
|
|
1498
|
+
|
|
1499
|
+
### 💡 **成功迁移的经验**
|
|
1500
|
+
|
|
1501
|
+
#### 对于简单分区场景
|
|
1502
|
+
|
|
1503
|
+
* **渐进式迁移**:先迁移小表验证,再迁移核心大表
|
|
1504
|
+
* **保留原有查询逻辑**:尽量让原有SQL无需大改即可工作
|
|
1505
|
+
* **性能基准对比**:迁移前后的查询性能对比验证
|
|
1506
|
+
* **完整验证流程**:严格按照验证清单执行
|
|
1507
|
+
|
|
1508
|
+
#### 对于复杂分区场景
|
|
1509
|
+
|
|
1510
|
+
* **分区策略重设计**:不要试图完全复制原有分区结构
|
|
1511
|
+
* **性能验证优先**:用代表性查询验证新分区策略的效果
|
|
1512
|
+
* **分阶段实施**:复杂迁移分为分析、重设计、测试、切换四个阶段
|
|
1513
|
+
* **回退方案准备**:确保迁移失败时能快速回退到原方案
|
|
1514
|
+
|
|
1515
|
+
### 🚀 **特别提醒**
|
|
1516
|
+
|
|
1517
|
+
**对于 Snowflake 等平台用户**:
|
|
1518
|
+
迁移到 Lakehouse 的挑战主要在于思维模式的调整。您需要从"较少关注分区"转变为"主动设计分区策略"。建议先在测试环境中体验分区对查询性能的影响,理解分区的价值,再开始设计生产环境的分区策略。
|
|
1519
|
+
|
|
1520
|
+
**对于有复杂分区架构的用户**:
|
|
1521
|
+
不要试图在Lakehouse中完全复制原有的分区结构。Lakehouse的分区哲学是"简化而不失性能"。多级分区、复合分区等复杂设计在Lakehouse中往往可以用更简单的方案达到同样或更好的效果。
|
|
1522
|
+
|
|
1523
|
+
**对于所有迁移用户**:
|
|
1524
|
+
分区表的创建验证是成功迁移的第一步,也是最关键的一步。建议使用原生SQL创建分区表,并严格执行验证清单。很多迁移问题都源于分区表没有正确创建,导致后续的性能优化无从谈起。
|
|
1525
|
+
|
|
1526
|
+
记住:**Lakehouse 的分区不是负担,而是性能优化的利器**。正确使用分区,您将获得比原平台更好的查询性能和更灵活的数据管理能力。
|
|
1527
|
+
|
|
1528
|
+
***
|
|
1529
|
+
|
|
1530
|
+
## 快速参考卡片
|
|
1531
|
+
|
|
1532
|
+
### 🔄 **分区管理常用命令**
|
|
1533
|
+
|
|
1534
|
+
| 功能 | 命令语法 | 使用场景 |
|
|
1535
|
+
| -------- | --------------------------------------------------------------------- | ----------- |
|
|
1536
|
+
| **查看分区** | `SHOW PARTITIONS table_name` | 基础分区查看 |
|
|
1537
|
+
| **分区详情** | `SHOW PARTITIONS EXTENDED table_name` | 查看分区大小、文件数等 |
|
|
1538
|
+
| **分区过滤** | `SHOW PARTITIONS EXTENDED table WHERE bytes > 100*1024*1024` | 健康检查 |
|
|
1539
|
+
| **特定分区** | `SHOW PARTITIONS table PARTITION (pt1 = '2023')` | 查看特定分区 |
|
|
1540
|
+
| **限制数量** | `SHOW PARTITIONS table LIMIT 10` | 限制返回结果 |
|
|
1541
|
+
| **最大分区** | `SELECT max_pt('table_name')` | 获取最新分区值 |
|
|
1542
|
+
| **清理分区** | `TRUNCATE TABLE table PARTITION (pt = 'value')` | 生命周期管理 |
|
|
1543
|
+
| **批量清理** | `TRUNCATE TABLE table PARTITION (pt1 = 'v1'), PARTITION (pt2 = 'v2')` | 复合条件清理 |
|
|
1544
|
+
|
|
1545
|
+
### 🔄 **平台语法快速对照**
|
|
1546
|
+
|
|
1547
|
+
| 功能 | 原平台语法 | Lakehouse语法 | 注意事项 |
|
|
1548
|
+
| -------- | ---------------------------- | -------------------------- | ----------------------- |
|
|
1549
|
+
| **年分区** | `year(date)` | `years(date)` | 复数形式,返回从1970年开始的年数 |
|
|
1550
|
+
| **月分区** | `month(date)` | `months(date)` | 复数形式 |
|
|
1551
|
+
| **天分区** | `day(date)` | `days(date)` | 复数形式,返回从1970-01-01开始的天数 |
|
|
1552
|
+
| **小时分区** | `hour(timestamp)` | `hours(timestamp)` | 复数形式 |
|
|
1553
|
+
| **组合分区** | `(year, month)` | `days(date)` | 不能组合冲突的转换 |
|
|
1554
|
+
| **动态分区** | 需配置开启 | 默认支持 | 注意分区数量控制 |
|
|
1555
|
+
| **最大分区** | `max_pt()` | `max_pt('table_name')` | 需要指定表名 |
|
|
1556
|
+
| **分区清理** | `ALTER TABLE DROP PARTITION` | `TRUNCATE TABLE PARTITION` | 语法更灵活 |
|
|
1557
|
+
|
|
1558
|
+
### 🚨 **错误速查表**
|
|
1559
|
+
|
|
1560
|
+
| 看到这个错误 | 立即检查这个 | 快速解决 |
|
|
1561
|
+
| ------------------------------------ | -------------------------- | ------------------ |
|
|
1562
|
+
| `not a partitioned table` | 建表语句是否有`PARTITIONED BY` | 用原生SQL重建表 |
|
|
1563
|
+
| `implicit cast not allowed` | 数据类型是否匹配 | 统一用STRING分区或使用转换分区 |
|
|
1564
|
+
| `exceeds maximum number` | 分区数量是否过多 | 分批插入或调参数 |
|
|
1565
|
+
| `conflicts with` | 转换分区是否冲突 | 使用单一时间粒度 |
|
|
1566
|
+
| `months conflicts with years` | 多级时间分区设计错误 | 改用`days()`单一粒度 |
|
|
1567
|
+
| `Syntax error at or near 'ORDER'` | SHOW PARTITIONS使用了ORDER BY | 用WITH子查询实现排序 |
|
|
1568
|
+
| `cannot resolve column 'total_rows'` | TRUNCATE PARTITION中使用了分区属性 | 只能使用分区字段本身 |
|
|
1569
|
+
| `operator not found` | 类型不匹配的比较 | 确保数据类型一致 |
|
|
1570
|
+
| `duplicate.syntax.element` | CLUSTERED BY和SORTED BY同时使用 | 选择其中一种语法 |
|
|
1571
|
+
| 查询性能比原平台差 | 分区设计是否合理 | 重新评估分区策略 |
|
|
1572
|
+
| 分区过多过小 | 复合分区维度过多 | 减少分区维度,用索引替代 |
|
|
1573
|
+
| `max_pt function not found` | 可能是表名错误或权限问题 | 检查表名和schema权限 |
|
|
1574
|
+
| `TRUNCATE PARTITION failed` | 分区条件语法错误 | 检查分区过滤表达式语法 |
|
|
1575
|
+
|
|
1576
|
+
### 🔍 **分区表验证速查**
|
|
1577
|
+
|
|
1578
|
+
| 验证项目 | 检查命令 | 正确结果 |
|
|
1579
|
+
| --------- | ------------------------------------- | ---------- |
|
|
1580
|
+
| **是否分区表** | `SHOW PARTITIONS table_name` | 不报错,显示分区列表 |
|
|
1581
|
+
| **分区创建** | 插入数据后再次查看分区 | 显示新分区值 |
|
|
1582
|
+
| **类型匹配** | `DESCRIBE TABLE table_name` | 列类型符合预期 |
|
|
1583
|
+
| **性能提升** | 对比分区查询vs全表扫描 | 分区查询明显更快 |
|
|
1584
|
+
| **最新分区** | `SELECT max_pt('table_name')` | 返回最大分区值 |
|
|
1585
|
+
| **分区健康** | `SHOW PARTITIONS EXTENDED table_name` | 分区大小合理分布 |
|
|
1586
|
+
|
|
1587
|
+
### 💡 **高级语法速查**
|
|
1588
|
+
|
|
1589
|
+
| 功能 | 正确语法 | 错误语法 |
|
|
1590
|
+
| ------------ | --------------------------------------------------------------- | ------------------------------ |
|
|
1591
|
+
| **分区+分桶** | `PARTITIONED BY (days(date)) CLUSTERED BY (id) INTO 32 BUCKETS` | ✅ 正确 |
|
|
1592
|
+
| **分区+排序** | `PARTITIONED BY (days(date)) SORTED BY (name)` | ✅ 正确 |
|
|
1593
|
+
| **分区+分桶+排序** | `不支持同时使用` | ❌ 会报duplicate.syntax.element错误 |
|
|
1594
|
+
| **bucket参数** | `bucket(10, user_id)` | ✅ 推荐1-1000范围 |
|
|
1595
|
+
| **分区排序** | `WITH t AS (SHOW PARTITIONS ...) SELECT * FROM t ORDER BY ...` | ✅ 用子查询 |
|
|
1596
|
+
| **NULL分区** | `INSERT ... VALUES (1, NULL)` → `col=NULL分区` | ✅ 支持 |
|
|
1597
|
+
|
|
1598
|
+
### 💡 **迁移优先级清单**
|
|
1599
|
+
|
|
1600
|
+
#### 🥇 **第一优先级(必做)**
|
|
1601
|
+
|
|
1602
|
+
* [ ] 使用原生SQL创建分区表
|
|
1603
|
+
* [ ] 执行完整验证清单确保分区表正确
|
|
1604
|
+
* [ ] 对比分区表vs非分区表性能
|
|
1605
|
+
* [ ] 验证原有查询在新平台上的执行效果
|
|
1606
|
+
|
|
1607
|
+
#### 🥈 **第二优先级(重要**)
|
|
1608
|
+
|
|
1609
|
+
* [ ] 简化复杂分区结构(多级→单级)
|
|
1610
|
+
* [ ] 控制分区数量在合理范围内
|
|
1611
|
+
* [ ] 为非分区高频查询字段创建索引
|
|
1612
|
+
* [ ] 建立分区健康监控机制
|
|
1613
|
+
|
|
1614
|
+
#### 🥉 **第三优先级(优化**)
|
|
1615
|
+
|
|
1616
|
+
* [ ] 建立分区清理自动化脚本
|
|
1617
|
+
* [ ] 团队培训新的分区概念和操作
|
|
1618
|
+
* [ ] 性能持续监控和调优
|
|
1619
|
+
* [ ] 分区策略的定期评估和调整
|
|
1620
|
+
|
|
1621
|
+
***
|
|
1622
|
+
|
|
1623
|
+
**使用建议**:这份文档可以作为Lakehouse分区迁移的权威参考,所有示例都可以直接在生产环境中使用。对于复杂迁移项目,建议按照文档中的验证清单逐步执行,确保每个步骤都得到正确验证。
|
|
1624
|
+
|
|
1625
|
+
***
|
|
1626
|
+
|
|
1627
|
+
**注意**:本文档基于 Lakehouse 2025 年 6 月的产品文档整理,建议定期查看官方文档获取最新更新。在生产环境中使用前,请务必在测试环境中验证所有操作的正确性和性能影响。
|