CREATE ICEBERG TABLE（使用 Snowflake 作为 Iceberg 目录）¶

在当前/指定架构中，创建或替换将 Snowflake 用作 Iceberg 目录的 Apache Iceberg™ 表。

该命令支持以下变体：

CREATE ICEBERG TABLE ...AS SELECT（创建一个已填充的表；也称为 CTAS）
CREATE ICEBERG TABLE ...LIKE （创建现有表的空副本）

本主题将 Iceberg 表简称为“表”（指定 Iceberg 表 的位置除外）以避免混淆。

备注

To store Iceberg data and metadata in your cloud storage, create an external volume and reference it from the table. For instructions, see 配置外部卷.

To use Snowflake Storage instead, set EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED' (or rely on defaults when the catalog is Snowflake). You don't need to create a separate external volume object in this case. For more information, see 适用于 Apache Iceberg™ 表的 Snowflake Storage.

另请参阅：: ALTER ICEBERG TABLE、DROP ICEBERG TABLE、SHOW ICEBERG TABLES、DESCRIBE ICEBERG TABLE、UNDROP ICEBERG TABLE

语法¶

CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name> (
    -- Column definition
    <col_name> <col_type> [ DEFAULT <col_default> ]
      [ inlineConstraint ]
      [ NOT NULL ]
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]
      [ [ WITH ] PROJECTION POLICY <policy_name> ]
      [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
      [ COMMENT '<string_literal>' ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ DEFAULT <col_default> ] [ ... ] ]

    -- Out-of-line constraints
    [ , outoflineConstraint [ ... ] ]
  )
  [ PARTITION BY ( partitionExpression [, partitionExpression , ...] ) ]
  [ PATH_LAYOUT = { FLAT | HIERARCHICAL } ]
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = 'SNOWFLAKE' ]
  [ BASE_LOCATION = '<directory_for_table_files>' ]
  [ TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }' ]
  [ CATALOG_SYNC = '<open_catalog_integration_name>']
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ CHANGE_TRACKING = { TRUE | FALSE } ]
  [ COPY GRANTS ]
  [ ERROR_LOGGING = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ ICEBERG_VERSION = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] AGGREGATION POLICY <policy_name> ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
  [ ENABLE_DATA_COMPACTION = { TRUE | FALSE } ]

其中：

inlineConstraint ::=
  [ CONSTRAINT <constraint_name> ]
  {   UNIQUE
    | PRIMARY KEY
    | [ FOREIGN KEY ] REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
    | CHECK ( <expr> )
  }
  [ <constraint_properties> ]
有关其他内联约束的详细信息，请参阅 CREATE | ALTER TABLE ... CONSTRAINT。
outoflineConstraint ::=
  [ CONSTRAINT <constraint_name> ]
  {   UNIQUE [ ( <col_name> [ , <col_name> , ... ] ) ]
    | PRIMARY KEY [ ( <col_name> [ , <col_name> , ... ] ) ]
    | [ FOREIGN KEY ] [ ( <col_name> [ , <col_name> , ... ] ) ]
      REFERENCES <ref_table_name> [ ( <ref_col_name> [ , <ref_col_name> , ... ] ) ]
    | CHECK ( <expr> )
  }
  [ <constraint_properties> ]
备注

Snowflake 表示定义为 PRIMARY KEY 的列，作为 Iceberg 元数据中的标识符字段。这些列的 IDs 在元数据中填充为标识符字段 IDs (https://iceberg.apache.org/spec/#identifier-field-ids)。

Snowflake 不对 Iceberg 表的 PRIMARY KEY 列强制执行 NOT NULL 和 UNIQUE 约束条件。

有关其他行外约束的详细信息，请参阅 CREATE | ALTER TABLE ... CONSTRAINT。
partitionExpression ::=
  <col_name> -- identity transform
  | BUCKET ( <num_buckets> , <col_name> )
  | TRUNCATE ( <width> , <col_name> )
  | YEAR ( <col_name> )
  | MONTH ( <col_name> )
  | DAY ( <col_name> )
  | HOUR ( <col_name> )

变体语法¶

CREATE ICEBERG TABLE ...AS SELECT（也称为 CTAS）¶

创建一个新表，其中填充了查询返回的数据。将 AS SELECT 子句放在语句末尾。

CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE <table_name> [ ( <col_name> [ <col_type> ] [ DEFAULT <col_default> ] , <col_name> [ <col_type> ] [ DEFAULT <col_default> ] , ... ) ]
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = 'SNOWFLAKE' ]
  [ BASE_LOCATION = '<relative_path_from_external_volume>' ]
  [ COPY GRANTS ]
  [ ICEBERG_VERSION = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ ... ]
  AS SELECT <query>

掩码策略可以应用于 CTAS 语句中的列。先指定列数据类型，然后指定掩码策略。同样，可以将行访问策略应用于表。例如：

CREATE ICEBERG TABLE <table_name> ( <col1> <data_type> [ DEFAULT <col_default> ] [ WITH ] MASKING POLICY <policy_name> [ , ... ] )
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = 'SNOWFLAKE' ]
  [ BASE_LOCATION = '<directory_for_table_files>' ]
  [ ICEBERG_VERSION = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col1> [ , ... ] )
  [ ... ]
  AS SELECT <query>

备注

在 CTAS 中，COPY GRANTS 参数仅在与 OR REPLACE 子句组合时才有效。COPY GRANTS 从要使用 CREATE OR REPLACE 替换的表（如果已存在）中复制权限，而不是从 SELECT 语句中查询的源表中复制权限。通过具有 COPY GRANTS 的 CTAS，您可以使用一组新数据覆盖表，同时将现有授权保留在该表上。

有关 COPY GRANTS 参数的更多信息，请参阅本文档中的 COPY GRANTS。

有关该变体语法的更多信息，请参阅使用说明。

CREATE ICEBERG TABLE ... LIKE¶

使用与现有表相同的列定义创建新表，但不从现有表中复制数据。列名称、类型、默认值和约束将复制到新表中：

CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE <table_name> LIKE <source_table>
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ COPY GRANTS ]
  [ ... ]

有关 COPY GRANTS 参数的更多信息，请参阅本文档中的 COPY GRANTS。

备注

CREATE TABLE ...LIKE 不支持用于具有通过数据共享访问的自动递增序列的表。

有关该变体语法的更多信息，请参阅使用说明。

CREATE ICEBERG TABLE ... CLONE¶

创建具有相同的列定义的新 Iceberg 表，并包含源表中的全部现有数据，而不会实际复制数据。您还可以利用此变体克隆过去的特定时间/点的表（使用 Time Travel）：

CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <name>
  CLONE <source_iceberg_table>
    [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
    [COPY GRANTS]
    ...

备注

如果该语句用于替换同名的现有 Iceberg 表，Snowflake 会从要替换的表中复制授权。如果没有该名称的现有表，Snowflake 会从要克隆的源表中复制授权。

有关 COPY GRANTS 参数的更多信息，请参阅本文档中的 COPY GRANTS。

有关克隆的更多信息，请参阅 CREATE <object> ... CLONE 和克隆和 Apache Iceberg™ 表。

必填参数¶

table_name

指定表的标识符（名称）；在创建表的架构中必须是唯一的。

此外，标识符必须以字母字符开头，且不能包含空格或特殊字符，除非整个标识符字符串放在双引号内（例如，"My object"）。放在双引号内的标识符也区分大小写。

有关更多信息，请参阅标识符要求。

col_name

指定列标识符（名称）。表标识符的所有要求也适用于列标识符。

有关更多信息，请参阅标识符要求和保留和受限关键字。

备注

除了标准的保留关键字之外，以下关键字不能用作列标识符，因为它们是为 ANSI 标准上下文函数保留的：

CURRENT_DATE
CURRENT_ROLE
CURRENT_TIME
CURRENT_TIMESTAMP
CURRENT_USER

有关保留关键字的列表，请参阅保留和受限关键字。

col_type

指定列的数据类型。

有关可为表列指定的数据类型的信息，请参阅 Apache Iceberg™ 表的数据类型。

备注

您不能使用 float 或 double 作为主键（根据 Apache Iceberg 规范 (https://iceberg.apache.org/spec/#identifier-field-ids)）。

可选参数¶

TRANSIENT

Creates a transient Iceberg table. Transient tables don't have a Fail-safe period, so they don't incur Fail-safe storage costs.

For Iceberg tables that use Snowflake-provided storage (EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'), the TRANSIENT keyword determines whether the table data is protected by Fail-safe. For more information, see 适用于 Apache Iceberg™ 表的 Snowflake Storage.

备注

Transient Iceberg tables are only supported with Snowflake-provided storage (EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'). You cannot create a transient Iceberg table with any other external volume.

col_name col_type DEFAULT col_default

For a table that conforms to Iceberg v3, specifies both the initial default and write default for the specified column. If the data type for the column is string, you must surround the default value with single quotes.

重要

When you specify a default value for a column, you must specify a static value; you can't specify an expression or function for the value. This requirement is in accordance with the Iceberg v3 specification and applies to both the initial default and write default.

Default values is an Iceberg v3 feature, so you can't specify a default value for a table that conforms to Iceberg v2. For more information about using default values with Iceberg tables, see Use default values with Iceberg tables.

备注

To change the write default for the column after you create the table, run ALTER ICEBERG TABLE ... ALTER COLUMN ... SET WRITE DEFAULT.

BASE_LOCATION = 'directory_for_table_files'

目录路径，Snowflake 使用该路径为表的数据和元数据文件构造写入路径。指定从表 EXTERNAL_VOLUME 的位置开始的相对路径。

如果未指定，Snowflake 将使用 BASE_LOCATION_PREFIX 参数值和表名称等属性构建写入路径。

有关更多信息，请参阅数据和元数据目录。

创建表后，此目录无法更改。

TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }'

Specifies a target Parquet file size for the table.

'{ 16MB | 32MB | 64MB | 128MB }' specifies a fixed target file size for the table.
'AUTO' works differently, depending on the table type:
- Snowflake-managed tables: AUTO specifies that Snowflake should choose the file size for the table based on table characteristics such as size, DML patterns, ingestion workload, and clustering configuration. Snowflake automatically adjusts the file size, starting at 16 MB, for better read and write performance in Snowflake. Use this option to optimize table performance in Snowflake.
- Externally managed tables: AUTO specifies that Snowflake should aggressively scale to the largest file size (128 MB).

有关更多信息，请参阅数据和元数据目录。

默认：

CONSTRAINT ...

为表中的指定列定义内联或行外约束。

有关语法信息，请参阅 CREATE | ALTER TABLE ... CONSTRAINT。有关约束条件的更多信息，请参阅约束。

MASKING POLICY = policy_name

指定要在列上设置的掩码策略。

PROJECTION POLICY policy_name

指定要在列上设置的投影策略。

COMMENT 'string_literal'

指定列的注释。

（请注意，可以在列级别或表级别指定注释。相应的语法略有不同。）

USING ( col_name , cond_col_1 ... )

指定要传递到条件掩码策略的 SQL 表达式的实参。

列表中的第一列指定用于掩码处理或标记数据的策略条件的列，并且必须与设置掩码策略的列匹配。

附加列指定要评估的列，以确定查询从第一列进行选择时是否对查询结果的每行中的数据进行掩码处理或标记化。

如果省略 USING 子句，Snowflake 会将条件掩码策略视为正常的掩码策略。

PARTITION BY = ( partitionExpression [ , partitionExpression , ... ] )

Specifies one or more partition expressions.

PATH_LAYOUT = { FLAT | HIERARCHICAL }

Specifies the path layout that Snowflake uses when writing Parquet data files to the table:

FLAT: Snowflake writes all Parquet data files under the data/ directory for the table.
HIERARCHICAL: Snowflake writes partitioned data under the data/ directory for the table by using a hierarchical path layout. With this layout, each partition column is represented as a directory level in the path. To define these partition columns, use the PARTITION BY parameter. This layout is also called "Hive-style" partitioning.

If you specify PATH_LAYOUT = HIERARCHICAL without a PARTITION BY clause, Snowflake stores the Parquet data files by using a flat layout path. You can't modify the path layout for an existing table, so you might set this parameter to HIERARCHICAL without specifying a PARTITION BY clause if you don't want to use partitioning with hierarchical paths now but you might in the future.

备注

For externally managed tables that you create in a standard Snowflake database, Snowflake infers and honors the partitioning scheme that is specified by the remote catalog.

Default: FLAT

CLUSTER BY ( expr [ , expr , ... ] )

将表中的一个或多个列或列表达式指定为群集密钥。有关更多信息，请参阅群集密钥和聚类表。

使用变体语法（LIKE、AS SELECT）时，请参阅变体语法使用说明。

默认：无值（未为表定义群集密钥）

重要

群集密钥并非旨在或建议用于所有表；它们通常有利于非常大（即多 TB）的表。

在为表指定群集密钥之前，应当对微分区有所了解。有关更多信息，请参阅了解 Snowflake 表结构。

EXTERNAL_VOLUME = 'external_volume_name'

Specifies where the Iceberg table stores its metadata files and data in Parquet format. Iceberg metadata and manifest files store the table schema, partitions, snapshots, and other metadata.

Use one of the following:

The identifier for an external volume that you created in your account. Iceberg data and metadata are stored in your cloud storage according to that volume's storage locations.
The reserved value SNOWFLAKE_MANAGED to use Snowflake-provided storage. SNOWFLAKE_MANAGED is not a user-created external volume object; you don't run CREATE EXTERNAL VOLUME for it. For more information, see 适用于 Apache Iceberg™ 表的 Snowflake Storage.

If you don't specify this parameter, the Iceberg table defaults to the external volume for the schema, database, or account. The schema takes precedence over the database, and the database takes precedence over the account. When the effective catalog is Snowflake (CATALOG = 'SNOWFLAKE'), the default external volume is SNOWFLAKE_MANAGED unless a different default is set at the schema, database, or account level.

CATALOG = 'SNOWFLAKE'

指定 Snowflake 作为 Iceberg 目录Snowflake 处理表的全部生命周期维护工作，例如压缩。

CATALOG_SYNC = 'open_catalog_integration_name'

（可选）指定为 Snowflake Open 目录配置的目录集成的名称。如果指定，Snowflake 会将表与 Snowflake Open Catalog 账户中的外部目录同步。有关将 Snowflake 管理的 Iceberg 表与 Open Catalog 同步的详细信息，请参阅将 Snowflake 管理的表与 Snowflake Open Catalog 同步。

有关此参数的详细信息，请参阅 CATALOG_SYNC。

ICEBERG_VERSION = integer

Specifies the version of the Apache Iceberg™ specification that the table conforms to.

小心

Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn't used by engines or applications that don't yet support v3. Downgrading format versions isn't supported in the Apache Iceberg specification. Therefore, all readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if needed. Using Snowflake to perform in-place version upgrades isn't supported at this time.

If you don't set this parameter, the Iceberg table defaults to the Iceberg version for the schema, database, or account. The schema takes precedence over the database, and the database takes precedence over the account.

2: The table conforms with Iceberg version 2.

3: The table conforms with Iceberg version 3.

Default: 2

For more information about this parameter, see ICEBERG_VERSION.

ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }

Specifies whether the table uses merge-on-read behavior.

If you don't set this parameter, the Iceberg table defaults to the merge-on-read behavior that is specified for the schema, database, or account. The schema takes precedence over the database, and the database takes precedence over the account.

Values:

TRUE: The table uses merge-on-read behavior. Depending on whether the table conforms to v2 or v3 of the Apache Iceberg™ table specification, the behavior is as described in the following list:

If the table conforms with v2, use positional delete files.
If the table conforms with v3, use deletion vectors.

FALSE: The table uses copy-on-write behavior.

Default: TRUE

For a detailed description of this parameter, see ENABLE_ICEBERG_MERGE_ON_READ.

STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED }

为表指定存储序列化策略。如果在创建表时未指定，表将继承在架构、数据库或账户级别设置的值。如果在任何级别均未指定该值，表将使用默认值。

创建表后，此参数的值无法更改。

COMPATIBLE：Snowflake 执行编码和压缩，确保与第三方计算引擎的互操作性。
OPTIMIZED：Snowflake 执行编码和压缩，可确保 Snowflake 中的最佳表性能。

默认：OPTIMIZED

DATA_RETENTION_TIME_IN_DAYS = integer

指定由 Snowflake 管理的表的保留期，以便可以对表中的历史数据执行 Time Travel 操作（SELECT、CLONE、UNDROP）。有关更多信息，请参阅了解和使用 Time Travel。

有关此对象级参数的详细说明以及有关对象参数的详细信息，请参阅参数。

值：

Standard Edition：0 或 1

Enterprise Edition：0 至 90 用于永久表

默认：

Standard Edition：1

Enterprise Edition（或更高版本）：1 （除非在架构、数据库或账户级别指定了不同的默认值）

备注

0 值实际上会为表禁用 Time Travel。

MAX_DATA_EXTENSION_TIME_IN_DAYS = integer

对象参数，指定 Snowflake 可以延长表的数据保留期以防止表上的流过时的最大天数。

有关此参数的详细说明，请参阅 MAX_DATA_EXTENSION_TIME_IN_DAYS。

CHANGE_TRACKING = { TRUE | FALSE }

指定是否对表启用变更跟踪。

TRUE 在表上启用变更跟踪。此设置将一对隐藏列添加到源表中，并开始在列中存储变更跟踪元数据。这些列会占用少量存储空间。

可以使用 SELECT 语句的 CHANGES 子句查询变更跟踪元数据，也可以通过在表上创建和查询一个或多个流来查询变更跟踪元数据。
FALSE 不在表上启用变更跟踪。

默认：FALSE

COPY GRANTS

指定在使用以下 CREATE TABLE 变体创建新表时保留原始表的访问权限：

CREATE OR REPLACE TABLE

CREATE TABLE ...LIKE

CREATE TABLE ...CLONE

该参数将除 OWNERSHIP 之外的所有权限从现有表复制到新表。新表不会继承为架构中的对象类型定义的任何未来授权。默认情况下，执行 CREATE TABLE 语句的角色拥有新表。

如果该参数未包含在 CREATE ICEBERG TABLE 语句中，则新表不会继承在原始表上授予的任何显式访问权限，但会继承为架构中的对象类型定义的任何未来授权。

注意：

借助数据共享：

如果现有表已共享到另一个账户，则替换表也会共享。

如果现有表已作为数据使用者与您的账户共享，并且进一步授予了对账户中其他角色的访问权限（在父数据库上使用 GRANT IMPORTED PRIVILEGES），则还会授予对替换表的访问权限。

替换表的 SHOW GRANTS 输出会将复制权限的获得者列为执行 CREATE ICEBERG TABLE 语句的角色，并附带执行语句时的当前时间戳。

复制授权的操作在 CREATE ICEBERG TABLE 命令中会以原子方式发生（即在同一事务中）。

ERROR_LOGGING = { TRUE | FALSE }

Specifies whether to turn on DML error logging for the table.

TRUE turns on DML error logging for the table.
FALSE turns off DML error logging for the table.

For more information, see DML 错误日志记录.

备注

If the OPT_OUT_ERROR_LOGGING parameter is set to TRUE for a session, DML error logging isn't turned on, regardless of whether it is turned on for specific tables.

COMMENT = 'string_literal'

指定注释。您可以在列级别或表级别指定注释。相应的语法略有不同。

默认：无值

WITH CONTACT ( purpose = contact [ , purpose = contact ...] )

将新对象与一个或多个联系人关联。

Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

ROW ACCESS POLICY policy_name ON ( col_name [ , col_name ... ] ): 指定要在表上设置的行访问策略。
AGGREGATION POLICY policy_name: 指定要在表上设置的聚合策略。

TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )

指定标签名称和标签字符串值。

标签值始终为字符串，标签值的最大字符数为 256。

有关在语句中指定标签的信息，请参阅 Tag quotas。

ENABLE_DATA_COMPACTION = { TRUE | FALSE }

Specifies whether Snowflake should enable data compaction on the table.

TRUE: Snowflake performs data compaction on the table.
FALSE: Snowflake doesn't perform data compaction on the table.

Default: TRUE

For more information, see ENABLE_DATA_COMPACTION and Set data compaction.

ICEBERG_VERSION = integer

Specifies the version of the Apache Iceberg™ specification that the table conforms to.

小心

2: The table conforms with Iceberg version 2.

3: The table conforms with Iceberg version 3.

Default: 2

For more information about this parameter, see ICEBERG_VERSION.

ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }

Specifies whether to enable merge-on-read behavior for Apache Iceberg™ tables.

Values:

TRUE: New tables use merge-on-read behavior.

FALSE: New tables use copy-on-write behavior.

Default:

TRUE

For a detailed description of this parameter, see ENABLE_ICEBERG_MERGE_ON_READ. For more information about merge-on-read and copy-on-write behavior in Snowflake, see 使用行级删除.

Partition expression parameters (`partitionExpression`)¶

Snowflake supports all partition transforms in version 2 of the Apache Iceberg specification. For more information, see Partition Transforms (https://iceberg.apache.org/spec/#partition-transforms) in the Apache Iceberg specification.

有关此参数的详细信息，请参阅 CATALOG_SYNC。

col_name

指定列的数据类型。

When used alone, without a transform such as YEAR, specifies an identity transform on the source column. For more information, see identity (https://iceberg.apache.org/spec/#partition-transforms).

col_type

Specifies a bucket transform. For more information, see Bucket Transform Details (https://iceberg.apache.org/spec/#bucket-transform-details).

num_buckets is the number of buckets to group the data into.

col_type

Specifies a truncate transform, which partitions the data based on the truncated values of the specified source column. For more information, see Truncate Transform Details (https://iceberg.apache.org/spec/#truncate-transform-details).

col_type

Specifies a year transform, which extracts the year from a date or timestamp source-column value. For more information, see Partition Transforms (https://iceberg.apache.org/spec/#partition-transforms).

col_type

Specifies a month transform. For more information, see Partition Transforms (https://iceberg.apache.org/spec/#partition-transforms).

col_type

Specifies a day transform, which extracts the day from a date or timestamp source-column value. For more information, see Partition Transforms (https://iceberg.apache.org/spec/#partition-transforms).

col_type

Specifies an hour transform, which extracts the hour from a timestamp source-column value. For more information, see Partition Transforms (https://iceberg.apache.org/spec/#partition-transforms).

访问控制要求¶

用于执行此操作的角色必须至少具有以下权限：


权限	对象	备注
CREATE ICEBERG TABLE	架构
CREATE EXTERNAL VOLUME	账户	需要创建新的外部卷。
USAGE	外部卷	需要引用现有的外部卷。

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

有关创建具有指定权限集的自定义角色的说明，请参阅创建自定义角色。

有关对安全对象执行 SQL 操作的相应角色和权限授予的一般信息，请参阅访问控制概述。

使用说明¶

运行此命令的注意事项：
- 使用 Snowflake 作为 Iceberg 目录时，目前不支持跨云和跨区域的 Iceberg 表。如果 CREATE ICEBERG TABLE 返回类似 "External volume <volume_name> must have a STORAGE_LOCATION defined in the local region ..." 的错误消息，请确保您的外部卷使用与 Snowflake 账户位于同一区域的活动存储位置。
- 如果您使用双引号标识符创建了外部卷，则必须完全按照在 CREATE ICEBERG TABLE 语句中创建的标识符（包括双引号）来指定标识符。未包含引号可能会导致 Object does not exist 错误（或类似类型的错误）。
  
  若要查看示例，请参阅 ` 示例 `_ （本主题内容）部分。
- 要使用 USING TEMPLATE 子句（以及从 INFER_SCHEMA 输出中派生的列定义）创建 Iceberg 表，必须为 INFER_SCHEMA 函数指定 KIND => 'ICEBERG'。
创建表的注意事项：
- 架构不能包含同名的表和/或视图。创建表时：
  如果架构中已存在同名视图，则会返回错误，并且不会创建表。
  
  如果架构中已存在同名的表，则会返回错误，并且不会创建表，除非命令中包含可选的 OR REPLACE 关键字。
- CREATE OR REPLACE <object> 语句是原子的。也就是说，当对象被替换时，旧对象将被删除，新对象将在单个事务中创建。
  
  这意味着与 CREATE OR REPLACE ICEBERG TABLE 操作并行的任何查询都使用旧的或新的表版本。
- OR REPLACE 和 IF NOT EXISTS 子句互斥。它们不能同时用于同一条语句中。
- 与保留关键字类似，ANSI 保留函数名称（CURRENT_DATE、CURRENT_TIMESTAMP 等）不能用作列名。
- 重新创建表（使用可选 OR REPLACE 关键字）会删除其历史记录，这会使表上的任何流都过时。过时的流是不可读的。

To troubleshoot issues with creating a Snowflake-managed table, see 您无法创建 Snowflake 管理的表.

示例¶

创建以 Snowflake 作为目录的 Iceberg 表¶

此示例创建以 Snowflake 作为 Iceberg 目录的 Iceberg 表。生成的表由 Snowflake 管理，并支持读写访问。

该示例将表名称 (my_iceberg_table) 设置为 BASE_LOCATION。这样，Snowflake 就会将数据和元数据写入到与外部卷位置中的表同名的目录。

CREATE ICEBERG TABLE my_iceberg_table (amount int)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'my_iceberg_table';

CREATE ICEBERG TABLE¶

The following example creates a Snowflake-managed Iceberg table by using the value of a column named c_nationkey to partition the table:

CREATE OR REPLACE ICEBERG TABLE customer_iceberg_partitioned (
  c_custkey INTEGER,
  c_name STRING,
  c_address STRING,
  c_nationkey INTEGER,
  c_phone STRING,
  c_acctbal INTEGER,
  c_mktsegment STRING,
  c_comment STRING
)
  PARTITION BY (c_nationkey)
  EXTERNAL_VOLUME = 'my_ext_vol'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'customer_iceberg_partitioned';

有关更多信息，请参阅数据和元数据目录。

Create a partitioned Iceberg table with hierarchical layout¶

The following example creates a Snowflake-managed Iceberg table by using the value of a column named c_nationkey to partition the table. Because PATH_LAYOUT = HIERARCHICAL, Snowflake writes data to the partitioned Iceberg table by using a hierarchical path layout for files where partitioning information is included in the file paths:

CREATE OR REPLACE ICEBERG TABLE customer_iceberg_partitioned (
  c_custkey INTEGER,
  c_name STRING,
  c_address STRING,
  c_nationkey INTEGER,
  c_phone STRING,
  c_acctbal INTEGER,
  c_mktsegment STRING,
  c_comment STRING
)
  PARTITION BY (c_nationkey)
  PATH_LAYOUT = HIERARCHICAL
  EXTERNAL_VOLUME = 'my_ext_vol'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'customer_iceberg_partitioned';

For more information, see Partitioning with hierarchical paths.

使用 CTAS 变体语法创建 Iceberg 表¶

此示例使用 CREATE ICEBERG TABLE ...AS SELECT 变体语法，从名为 base_iceberg_table 的表创建新 Iceberg 表。AS SELECT 子句必须位于语句末尾。

CREATE OR REPLACE ICEBERG TABLE iceberg_table_copy (column1 int)
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'iceberg_table_copy'
  AS SELECT * FROM base_iceberg_table;

使用双引号标识符指定外部卷¶

此示例创建一个具有外部卷的 Iceberg 表，表的标识符中包含双引号。放在双引号内的标识符区分大小写，并且通常包含特殊字符。

标识符 "external_volume_1" 指定为与创建时完全相同（包括双引号）。未包含引号可能会导致 Object does not exist 错误（或类似类型的错误）。

要了解更多信息，请参阅加双引号的标识符。

CREATE OR REPLACE ICEBERG TABLE table_with_quoted_external_volume
  EXTERNAL_VOLUME = '"external_volume_1"'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my/relative/path/from/external_volume';

Create a v3 Iceberg table¶

The following example creates a Snowflake-managed Apache Iceberg™ table that conforms to v3 of the Apache Iceberg™ specification:

CREATE ICEBERG TABLE my_v3_iceberg_table (
  record VARIANT,
  event_timestamp TIMESTAMP_LTZ(6)
)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'my_iceberg_table'
  ICEBERG_VERSION = 3;

CREATE ICEBERG TABLE（使用 Snowflake 作为 Iceberg 目录）¶

语法¶

变体语法¶

CREATE ICEBERG TABLE ...AS SELECT（也称为 CTAS）¶

CREATE ICEBERG TABLE ... LIKE¶

CREATE ICEBERG TABLE ... CLONE¶

必填参数¶

可选参数¶

Partition expression parameters (partitionExpression)¶

访问控制要求¶

使用说明¶

示例¶

创建以 Snowflake 作为目录的 Iceberg 表¶

CREATE ICEBERG TABLE¶

Create a partitioned Iceberg table with hierarchical layout¶

使用 CTAS 变体语法创建 Iceberg 表¶

使用双引号标识符指定外部卷¶

Create a v3 Iceberg table¶

Partition expression parameters (`partitionExpression`)¶