Snowflake Cortex AI Functions (including LLM functions)¶

Use Cortex AI Functions in Snowflake to run unstructured analytics on text and images with industry-leading LLMs from OpenAI, Anthropic, Meta, Mistral AI, and DeepSeek. AI Functions support use cases such as:

提取实体以丰富元数据并简化验证
汇总客户工单洞察分析
使用自然语言对内容进行筛选和分类
基于情绪和方面进行分析以改进服务
翻译和本地化多语言内容
解析用于分析和 RAG 管道的文档

所有模型都完全托管在 Snowflake 中，可确保性能、可扩展性和治理，同时保持数据安全且到位。

可用函数¶

Snowflake Cortex features are provided as SQL functions and are also available in Python. Cortex AI Functions can be grouped into the following categories:

Cortex AI functions
辅助函数

Cortex AI functions¶

These task-specific functions are purpose-built managed functions that automate routine tasks, like simple summaries and quick translations, that don't require any customization.

重要

The following features are in preview and should not be used in production:

AI_AGG
AI_FILTER
AI_SUMMARIZE_AGG

AI_COMPLETE：使用选定的 LLM 为给定的文本字符串或图像生成补全。将此函数用于大多数生成式 AI 任务。
- AI_COMPLETE is the updated version of COMPLETE (SNOWFLAKE.CORTEX).
AI_CLASSIFY：将文本或图像分类为用户定义的类别。
- AI_CLASSIFY is the updated version of CLASSIFY_TEXT (SNOWFLAKE.CORTEX) with support for multi-label and image classification.
AI_FILTER：对于给定的文本或图像输入，返回 True 或 False，允许您在 SELECT、WHERE 或 JOIN ... ON 子句中筛选结果。
AI_AGG：聚合文本列，并根据用户定义的提示返回跨多行的洞察。此函数不受上下文窗口限制。
AI_EMBED：为文本或图像输入生成嵌入向量，该向量可用于相似性搜索、聚类及分类任务。
- AI_EMBED is the updated version of EMBED_TEXT_1024 (SNOWFLAKE.CORTEX).
AI_EXTRACT: Extracts information from an input string or file, for example, text, images, and documents. Supports multiple languages.
- AI_EXTRACT is the updated version of EXTRACT_ANSWER (SNOWFLAKE.CORTEX).
AI_REDACT: Redacts personally identifiable information (PII) from text.
AI_SENTIMENT: Extracts sentiment from text.
- AI_SENTIMENT is the updated version of SENTIMENT (SNOWFLAKE.CORTEX).
AI_SUMMARIZE_AGG：聚合文本列并返回跨多行的摘要。此函数不受上下文窗口限制。
AI_SIMILARITY：计算两个输入之间的嵌入相似度。
AI_TRANSCRIBE: Transcribes audio and video files stored in a stage, extracting text, timestamps, and speaker information.
AI_PARSE_DOCUMENT: Extracts text (using OCR mode) or text with layout information (using LAYOUT mode) from documents in an internal or external stage.
- AI_PARSE_DOCUMENT is the updated version of PARSE_DOCUMENT (SNOWFLAKE.CORTEX).
AI_TRANSLATE: Translates text between supported languages.
- AI_TRANSLATE is the updated version of TRANSLATE (SNOWFLAKE.CORTEX).
SUMMARIZE (SNOWFLAKE.CORTEX)：返回您指定的文本摘要。

辅助函数¶

Helper functions are purpose-built managed functions that reduce cases of failures when running other Cortex AI Functions, for example by getting the count of tokens in an input prompt to ensure the call doesn't exceed a model limit.

TO_FILE: Creates a reference to a file in an internal or external stage for use with AI_COMPLETE and other functions that accept files.
AI_COUNT_TOKENS: Given an input text, returns the token count based on the model or Cortex function specified.
- AI_COUNT_TOKENS is the updated version of COUNT_TOKENS (SNOWFLAKE.CORTEX).
PROMPT: Helps you build prompt objects for use with AI_COMPLETE and other functions.
TRY_COMPLETE (SNOWFLAKE.CORTEX)：与 COMPLETE 函数类似，但在函数无法执行时返回 NULL，而不是返回错误代码。

Cortex Guard¶

Cortex Guard is an option of the AI_COMPLETE (or SNOWFLAKE.CORTEX.COMPLETE) function designed to filter possible unsafe and harmful responses from a language model. Cortex Guard is currently built with Meta's Llama Guard 3. Cortex Guard works by evaluating the responses of a language model before that output is returned to the application. Once you activate Cortex Guard, language model responses which may be associated with violent crimes, hate, sexual content, self-harm, and more are automatically filtered. See COMPLETE arguments for syntax and examples.

备注

Usage of Cortex Guard incurs compute charges based on the number of input tokens processed, in addition to the charges for the AI_COMPLETE function.

性能注意事项¶

Cortex AI Functions are optimized for throughput. We recommend using these functions to process numerous inputs such as text from large SQL tables. Batch processing is typically better suited for AI Functions. For more interactive use cases where latency is important, use the REST API. These are available for simple inference (Complete API), embedding (Embed API) and agentic applications (Agents API).

Cortex LLM privileges¶

CORTEX_USER database role¶

The CORTEX_USER database role in the SNOWFLAKE database includes the privileges that allow users to call Snowflake Cortex AI Functions. By default, the CORTEX_USER role is granted to the PUBLIC role. The PUBLIC role is automatically granted to all users and roles, so this allows all users in your account to use the Snowflake Cortex AI functions.

If you don't want all users to have this privilege, you can revoke access to the PUBLIC role and grant access to other roles. The SNOWFLAKE.CORTEX_USER database role cannot be granted directly to a user. For more information, see 使用 SNOWFLAKE 数据库角色.

要从 PUBLIC 角色中撤销 CORTEX_USER 数据库角色，请使用 ACCOUNTADMIN 角色运行以下命令：

REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER
  FROM ROLE PUBLIC;

REVOKE IMPORTED PRIVILEGES ON DATABASE SNOWFLAKE
  FROM ROLE PUBLIC;

Copy

You can then selectively provide access to specific roles. A user with the ACCOUNTADMIN role can grant this role to a custom role in order to allow users to access Cortex AI functions. In the following example, use the ACCOUNTADMIN role and grant the user some_user the CORTEX_USER database role via the account role cortex_user_role, which you create for this purpose.

USE ROLE ACCOUNTADMIN;

CREATE ROLE cortex_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE cortex_user_role;

GRANT ROLE cortex_user_role TO USER some_user;

Copy

You can also grant access to Snowflake Cortex AI functions through existing roles commonly used by specific groups of users. (See 用户角色.) For example, if you have created an analyst role that is used as a default role by analysts in your organization, you can easily grant these users access to Snowflake Cortex AI Functions with a single GRANT statement.

GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE analyst;

Copy

CORTEX_EMBED_USER database role¶

The CORTEX_EMBED_USER database role in the SNOWFLAKE database includes the privileges that allow users to call the text embedding functions AI_EMBED, EMBED_TEXT_768, and EMBED_TEXT_1024 and to create Cortex Search Services with managed vector embeddings. CORTEX_EMBED_USER allows you to grant embedding privileges separately from other Cortex AI capabilities.

备注

You can create Cortex Search Services with user-provided embeddings without the CORTEX_EMBED_USER role. In that case, you must generate the embeddings yourself, outside of Snowflake, and load them into a table.

Unlike the CORTEX_USER role, the CORTEX_EMBED_USER role is not granted to the PUBLIC role by default. You must explicitly grant this role to roles that require embedding capabilities if you have revoked the CORTEx_USER role. The CORTEX_EMBED_USER database role cannot be granted directly to users but must be granted to roles that users can assume. The following example illustrates this process.

USE ROLE ACCOUNTADMIN;

CREATE ROLE cortex_embed_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_EMBED_USER TO ROLE cortex_embed_user_role;

GRANT ROLE cortex_embed_user_role TO USER some_user;

Copy

Alternatively, to give all users access to embedding capabilities, grant the CORTEX_EMBED_USER role to the PUBLIC role as follows.

USE ROLE ACCOUNTADMIN;

GRANT DATABASE ROLE SNOWFLAKE.CORTEX_EMBED_USER TO ROLE PUBLIC;

Copy

Using AI Functions in stored procedures with EXECUTE AS RESTRICTED CALLER¶

To use AI Functions inside stored procedures with EXECUTE AS RESTRICTED CALLER, grant the following privileges to the role that created the stored procedure:

GRANT INHERITED CALLER USAGE ON ALL SCHEMAS IN DATABASE snowflake TO ROLE <role_that_created_the_stored_procedure>;
GRANT INHERITED CALLER USAGE ON ALL FUNCTIONS IN DATABASE snowflake TO ROLE <role_that_created_the_stored_procedure>;
GRANT CALLER USAGE ON DATABASE snowflake TO ROLE <role_that_created_the_stored_procedure>;

Copy

控制模型访问¶

Snowflake Cortex provides two independent mechanisms to enforce access to models:

:ref:`账户级别的允许列表参数 <label-cortex_llm_allowlist>`（简单、广泛的控制）
:ref:`基于角色的访问控制 (RBAC) <label-cortex_llm_rbac>`（精细控制）

You can use the account-level allowlist to control model access across your entire account, or you can use RBAC to control model access on a per-role basis. For maximum flexibility, you can also use both mechanisms together, if you can accept additional management complexity.

账户级别的允许列表参数¶

您可以使用 CORTEX_MODELS_ALLOWLIST 参数控制整个账户的模型访问权限。支持的功能将遵循此参数值，并阻止使用未在允许列表中的模型。

可以将 CORTEX_MODELS_ALLOWLIST 参数设置为 'All'、'None' 或以逗号分隔的型号名称列表。此参数只能在账户级别设置，不能在用户或会话级别设置。只有 ACCOUNTADMIN 角色可以使用 ALTER ACCOUNT 命令设置参数。

示例：

要允许访问所有模型，请执行以下操作：
```
ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'All';
```
Copy

要允许访问 mistral-large2 和 llama3.1-70b 模型，请执行以下操作：

ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'mistral-large2,llama3.1-70b';

Copy

要防止访问任何模型，请执行以下操作：

ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'None';

Copy

如下一节所述，使用 RBAC 为特定角色提供超出您在允许列表中指定的访问权限。

基于角色的访问控制 (RBAC)¶

尽管 Cortex 模型本身并非 Snowflake 对象，但 Snowflake 允许您在 SNOWFLAKE.MODELS 架构中创建代表 Cortex 模型的模型对象。通过对这些对象应用 RBAC，您可以像控制其他 Snowflake 对象一样管理模型访问权限。支持的功能可在任何需要指定模型的场景中，接受 SNOWFLAKE.MODELS 架构内对象的标识符。

小技巧

如需独占使用 RBAC，请将 CORTEX_MODELS_ALLOWLIST 设置为 'None'。

刷新模型对象和应用程序角色¶

SNOWFLAKE.MODELS 不会自动填充代表 Cortex 模型的对象。首次设置模型 RBAC 时必须创建这些对象，若需对新模型应用 RBAC 功能则需刷新对象。

As ACCOUNTADMIN, run the SNOWFLAKE.MODELS.CORTEX_BASE_MODELS_REFRESH stored procedure to populate the SNOWFLAKE.MODELS schema with objects representing currently available Cortex models, and to create application roles that correspond to the models. The procedure also creates CORTEX-MODEL-ROLE-ALL, a role that covers all models.

小技巧

您可以随时安全地调用 CORTEX_BASE_MODELS_REFRESH；它不会创建重复的对象或角色。

CALL SNOWFLAKE.MODELS.CORTEX_BASE_MODELS_REFRESH();

Copy

刷新模型对象后，您可按下述方式验证模型是否已出现在 SNOWFLAKE.MODELS 架构中：

SHOW MODELS IN SNOWFLAKE.MODELS;

Copy

返回的模型列表类似于以下内容：

created_on	name	model_type	database_name	schema_name	owner
2025-04-22 09:35:38.558 -0700	CLAUDE-3-5-SONNET	CORTEX_BASE	SNOWFLAKE	MODELS	SNOWFLAKE
2025-04-22 09:36:16.793 -0700	LLAMA3.1-405B	CORTEX_BASE	SNOWFLAKE	MODELS	SNOWFLAKE
2025-04-22 09:37:18.692 -0700	SNOWFLAKE-ARCTIC	CORTEX_BASE	SNOWFLAKE	MODELS	SNOWFLAKE

要验证您能否查看与这些模型关联的应用角色，请使用 SHOW APPLICATION ROLES 命令，如下例所示：

SHOW APPLICATION ROLES IN APPLICATION SNOWFLAKE;

Copy

应用程序角色列表类似于以下内容：

created_on	name	owner	comment	owner_role_type
2025-04-22 09:35:38.558 -0700	CORTEX-MODEL-ROLE-ALL	SNOWFLAKE	MODELS	APPLICATION
2025-04-22 09:36:16.793 -0700	CORTEX-MODEL-ROLE-LLAMA3.1-405B	SNOWFLAKE	MODELS	APPLICATION
2025-04-22 09:37:18.692 -0700	CORTEX-MODEL-ROLE-SNOWFLAKE-ARCTIC	SNOWFLAKE	MODELS	APPLICATION

将应用程序角色授予用户角色¶

创建模型对象和应用程序角色后，您可以将应用程序角色授予账户中的特定用户角色。

要授予角色访问特定模型的权限，请执行以下操作：

GRANT APPLICATION ROLE SNOWFLAKE."CORTEX-MODEL-ROLE-LLAMA3.1-70B" TO ROLE MY_ROLE;

Copy

To grant a role access to all current and future models:

GRANT APPLICATION ROLE SNOWFLAKE."CORTEX-MODEL-ROLE-ALL" TO ROLE MY_ROLE;

Copy

使用具有支持功能的模型对象¶

To use model objects with supported Cortex features, specify the identifier of the model object in SNOWFLAKE.MODELS as the model argument. You can use a fully-qualified identifier, a partial identifier, or a simple model name that will be automatically resolved to SNOWFLAKE.MODELS.

使用完全限定的标识符：

SELECT AI_COMPLETE('SNOWFLAKE.MODELS."LLAMA3.1-70B"', 'Hello');

Copy

使用部分标识符：

USE DATABASE SNOWFLAKE;
USE SCHEMA MODELS;
SELECT AI_COMPLETE('LLAMA3.1-70B', 'Hello');

Copy

Using automatic lookup with a simple model name:

-- Automatically resolves to SNOWFLAKE.MODELS."LLAMA3.1-70B"
SELECT AI_COMPLETE('llama3.1-70b', 'Hello');

Copy

使用 RBAC 搭配账户级允许列表¶

A number of Cortex features accept a model name as a string argument, for example AI_COMPLETE('model', 'prompt'). When you provide a model name:

Cortex first attempts to locate a matching model object in SNOWFLAKE.MODELS. If you provide an unqualified name like 'x', it automatically looks for SNOWFLAKE.MODELS."X".
If the model object is found, RBAC is applied to determine whether the user can use the model.
If no model object is found, the provided string is matched against the account-level allowlist.

以下示例说明了允许列表和 RBAC 组合的用法。在此示例中，允许列表设置为允许 mistral-large2 模型，且用户通过 RBAC 拥有 LLAMA3.1-70B 模型对象的访问权限。

-- set up access
USE SECONDARY ROLES NONE;
USE ROLE ACCOUNTADMIN;
ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'MISTRAL-LARGE2';
CALL SNOWFLAKE.MODELS.CORTEX_BASE_MODELS_REFRESH();
GRANT APPLICATION ROLE SNOWFLAKE."CORTEX-MODEL-ROLE-LLAMA3.1-70B" TO ROLE PUBLIC;

-- test access
USE ROLE PUBLIC;

-- this succeeds because mistral-large2 is in the allowlist
SELECT AI_COMPLETE('MISTRAL-LARGE2', 'Hello');

-- this succeeds because the role has access to the model object
SELECT AI_COMPLETE('SNOWFLAKE.MODELS."LLAMA3.1-70B"', 'Hello');

-- this fails because the first argument is
-- neither an identifier for an accessible model object
-- nor is it a model name in the allowlist
SELECT AI_COMPLETE('SNOWFLAKE-ARCTIC', 'Hello');

Copy

常见陷阱¶

访问模型（无论是通过允许列表还是 RBAC）并不总是意味着可以使用该模型。该模型可能仍受跨区域限制、版本弃用或其他可用性约束。这些限制可能导致与模型访问错误相似的信息提示。
模型访问控制仅管控模型的使用权限，不涉及功能本身的使用，功能可能设有独立的访问控制机制。例如，AI_COMPLETE 访问权限由 CORTEX_USER 数据库角色控制。有关更多信息，请参阅 Cortex LLM privileges。
并非所有功能都支持模型访问控制。请参阅支持的功能表，查看给定功能支持哪些访问控制方法。
次要角色可能造成权限模糊。例如，若用户将 ACCOUNTADMIN 设为次要角色，所有模型对象可能显示为可访问状态。验证权限时暂时禁用次要角色。
Qualified model object identifiers are quoted and therefore case-sensitive. See QUOTED_IDENTIFIERS_IGNORE_CASE for more information.

支持的功能¶

以下功能支持模型访问控制：

特征	账户级别允许列表	基于角色的访问控制	备注
AI_COMPLETE	✔	✔
AI_CLASSIFY	✔	✔	若支撑此功能的模型未获允许，错误信息将包含修改允许列表的指引。
AI_FILTER	✔	✔	若支撑此功能的模型未获允许，错误信息将包含修改允许列表的指引。
AI_AGG	✔	✔	若支撑此功能的模型未获允许，错误信息将包含修改允许列表的指引。
AI_SUMMARIZE_AGG	✔	✔	若支撑此功能的模型未获允许，错误信息将包含修改允许列表的指引。
COMPLETE (SNOWFLAKE.CORTEX)	✔	✔
TRY_COMPLETE (SNOWFLAKE.CORTEX)	✔	✔
Cortex REST API	✔	✔
Cortex Playground	✔	✔

Regional availability¶

Snowflake Cortex AI functions are available in the following regions. If your region is not listed for a particular function, use cross-region inference.

备注

TRY_COMPLETE 函数在相同区域以 COMPLETE 形式提供。
The AI_COUNT_TOKENS function is available in all regions for any model, but the models themselves are available only in the regions specified in the tables below.

The following functions and models are available in any region via cross-region inference.

函数 Model	跨云（任何区域）	AWS US （跨区域）	AWS US Commercial Gov （跨区域）	AWS EU （跨区域）	AWS APJ （跨区域）	Azure US （跨区域）
AI_COMPLETE
`claude-sonnet-4-5`	*	*	*	*
`claude-haiku-4-5`	*		*
`claude-4-sonnet`	✔	✔	✔	✔	✔
`claude-3-7-sonnet`	✔	✔	✔	✔
`claude-3-5-sonnet`	✔	✔
`llama4-maverick`	✔	✔
`llama4-scout`	✔	✔
`llama3.1-8b`	✔	✔	✔	✔	✔	✔
`llama3.1-70b`	✔	✔	✔	✔	✔	✔
`llama3.3-70b`	✔	✔
`snowflake-llama-3.3-70b`	✔	✔
`llama3.1-405b`	✔	✔	✔			✔
`openai-gpt-4.1`	✔					✔
`openai-gpt-5`	*					*
`openai-gpt-5-mini`	*					*
`openai-gpt-5-nano`	*					*
`openai-gpt-5-chat`	✔
`openai-gpt-oss-120b`	*
`openai-gpt-oss-20b`	*
`snowflake-llama-3.1-405b`	✔	✔	✔
`snowflake-arctic`	✔	✔				✔
`deepseek-r1`	✔	✔
`mistral-large2`	✔	✔	✔		✔	✔
`mixtral-8x7b`	✔	✔	✔	✔	✔	✔
`mistral-7b`	✔	✔	✔	✔	✔	✔

EMBED_TEXT_768
`e5-base-v2`	✔	✔	✔	✔	✔	✔
`snowflake-arctic-embed-m`	✔	✔	✔	✔	✔	✔
`snowflake-arctic-embed-m-v1.5`	✔	✔	✔	✔	✔	✔

EMBED_TEXT_1024
`snowflake-arctic-embed-l-v2.0`	✔	✔	✔	✔	✔	✔
`snowflake-arctic-embed-l-v2.0-8k`	✔	✔	✔	✔	✔	✔
`nv-embed-qa-4`	✔	✔
`multilingual-e5-large`	✔	✔	✔	✔	✔	✔
`voyage-multilingual-2`	✔	✔	✔	✔	✔	✔

AI_CLASSIFY TEXT	✔	✔		✔	✔	✔
AI_CLASSIFY IMAGE	✔
AI_EXTRACT	✔	✔		✔	✔	✔
AI_FILTER TEXT *	✔	✔		✔	✔	✔
AI_FILTER IMAGE *	✔
AI_AGG *	✔	✔		✔	✔	✔
AI_REDACT	✔	✔	✔	✔	✔	✔
AI_SENTIMENT	✔	✔		✔	✔	✔
AI_SIMILARITY TEXT	✔	✔		✔	✔	✔
AI_SIMILARITY IMAGE	✔	✔		✔
AI_SUMMARIZE_AGG *	✔	✔		✔	✔	✔
AI_TRANSCRIBE	✔	✔		✔		✔
SENTIMENT	✔	✔	✔	✔	✔	✔
ENTITY_SENTIMENT	✔	✔	✔	✔	✔	✔
EXTRACT_ANSWER	✔	✔	✔	✔	✔	✔
SUMMARIZE	✔	✔	✔	✔	✔	✔
TRANSLATE	✔	✔	✔	✔	✔	✔

The following functions and models are available natively in North American regions.

函数 Model	AWS US 西部 2 （俄勒冈）	AWS US 东部 1 （弗吉尼亚北部）	AWS US 东部（商业政府 – 弗吉尼亚北部）	Azure 东部 US 2 （弗吉尼亚）	Azure 西部 US （华盛顿）	Azure 中南部 US （德克萨斯）
AI_COMPLETE
`claude-4-sonnet`
`claude-3-7-sonnet`
`claude-3-5-sonnet`	✔	✔
`llama4-maverick`	✔
`llama4-scout`	✔
`llama3.1-8b`	✔	✔	✔	✔
`llama3.1-70b`	✔	✔	✔	✔
`llama3.3-70b`	✔
`snowflake-llama-3.3-70b`	✔
`llama3.1-405b`	✔	✔	✔	✔
`openai-gpt-4.1`				✔
`openai-gpt-oss-120b`	*
`openai-gpt-oss-20b`	*			*
`snowflake-llama-3.1-405b`	✔
`snowflake-arctic`	✔			✔
`deepseek-r1`	✔
`mistral-large2`	✔	✔	✔	✔
`mixtral-8x7b`	✔	✔	✔	✔
`mistral-7b`	✔	✔	✔	✔

EMBED_TEXT_768
`e5-base-v2`	✔	✔	✔	✔
`snowflake-arctic-embed-m`	✔	✔	✔	✔
`snowflake-arctic-embed-m-v1.5`	✔	✔	✔	✔

EMBED_TEXT_1024
`snowflake-arctic-embed-l-v2.0`	✔	✔	✔	✔
`snowflake-arctic-embed-l-v2.0-8k`	✔	✔	✔	✔
`nv-embed-qa-4`	✔
`multilingual-e5-large`	✔	✔	✔	✔
`voyage-multilingual-2`	✔	✔	✔	✔

AI_CLASSIFY TEXT	✔	✔		✔
AI_CLASSIFY IMAGE	✔	✔
AI_EXTRACT	✔	✔		✔	✔	✔
AI_FILTER TEXT *	✔	✔		✔
AI_FILTER IMAGE *	✔	✔
AI_AGG *	✔	✔		✔
AI_REDACT	✔	✔	✔	✔
AI_SIMILARITY TEXT	✔	✔		✔
AI_SIMILARITY IMAGE	✔	✔
AI_SUMMARIZE_AGG *	✔	✔		✔
AI_TRANSCRIBE	✔	✔		✔
SENTIMENT	✔	✔	✔	✔
ENTITY_SENTIMENT	✔	✔	✔	✔
EXTRACT_ANSWER	✔	✔	✔	✔
SUMMARIZE	✔	✔	✔	✔
TRANSLATE	✔	✔	✔	✔

The following functions and models are available natively in European regions.

函数 Model	AWS 欧洲中部 1 （法兰克福）	AWS 欧洲西部 1 （爱尔兰）	Azure 西欧（荷兰）
AI_COMPLETE
`claude-4-sonnet`
`claude-3-7-sonnet`
`claude-3-5-sonnet`
`llama4-maverick`
`llama4-scout`
`llama3.1-8b`	✔	✔	✔
`llama3.1-70b`	✔	✔	✔
`llama3.3-70b`
`snowflake-llama-3.3-70b`
`llama3.1-405b`
`openai-gpt-4.1`
`openai-gpt-oss-120b`
`openai-gpt-oss-20b`
`snowflake-llama-3.1-405b`
`snowflake-arctic`
`deepseek-r1`
`mistral-large2`	✔	✔	✔
`mixtral-8x7b`	✔	✔	✔
`mistral-7b`	✔	✔	✔

EMBED_TEXT_768
`e5-base-v2`	✔		✔
`snowflake-arctic-embed-m`	✔	✔	✔
`snowflake-arctic-embed-m-v1.5`	✔	✔	✔

EMBED_TEXT_1024
`snowflake-arctic-embed-l-v2.0`	✔	✔	✔
`snowflake-arctic-embed-l-v2.0-8k`	✔	✔	✔
`nv-embed-qa-4`
`multilingual-e5-large`	✔	✔	✔
`voyage-multilingual-2`	✔	✔	✔

AI_CLASSIFY TEXT	✔	✔	✔
AI_CLASSIFY IMAGE	✔
AI_EXTRACT	✔	✔	✔
AI_FILTER TEXT *	✔	✔	✔
AI_FILTER IMAGE *	✔
AI_AGG *	✔	✔	✔
AI_REDACT	✔	✔	✔
AI_SIMILARITY TEXT	✔	✔	✔
AI_SIMILARITY IMAGE	✔
AI_SUMMARIZE_AGG *	✔	✔	✔
AI_TRANSCRIBE	✔
SENTIMENT	✔	✔	✔
ENTITY_SENTIMENT	✔		✔
EXTRACT_ANSWER	✔	✔	✔
SUMMARIZE	✔	✔	✔
TRANSLATE	✔	✔	✔

The following functions and models are available natively in Asia-Pacific regions:

函数 \| Model	AWS AP 东南部 2 （悉尼）	AWS AP 东北部 1 （东京）
AI_COMPLETE
`claude-4-sonnet`
`claude-3-7-sonnet`
`claude-3-5-sonnet`	✔
`llama4-maverick`
`llama4-scout`
`llama3.1-8b`	✔	✔
`llama3.1-70b`	✔	✔
`llama3.3-70b`
`snowflake-llama-3.3-70b`
`llama3.1-405b`
`openai-gpt-4.1`
`snowflake-llama-3.1-405b`
`snowflake-arctic`
`deepseek-r1`
`mistral-large2`	✔	✔
`mixtral-8x7b`	✔	✔
`mistral-7b`	✔	✔

EMBED_TEXT_768
`e5-base-v2`	✔	✔
`snowflake-arctic-embed-m`	✔	✔
`snowflake-arctic-embed-m-v1.5`	✔	✔

EMBED_TEXT_1024
`snowflake-arctic-embed-l-v2.0`	✔	✔
`snowflake-arctic-embed-l-v2.0-8k`	✔	✔
`nv-embed-qa-4`
`multilingual-e5-large`	✔	✔
`voyage-multilingual-2`	✔	✔

AI_EXTRACT	✔	✔
AI_CLASSIFY TEXT	✔	✔
AI_CLASSIFY IMAGE
AI_FILTER TEXT *	✔	✔
AI_FILTER IMAGE *
AI_AGG *	✔	✔
AI_SIMILARITY TEXT	✔	✔
AI_SIMILARITY IMAGE
AI_SUMMARIZE_AGG *	✔	✔
AI_TRANSCRIBE
EXTRACT_ANSWER	✔	✔
SENTIMENT	✔	✔
ENTITY_SENTIMENT		✔
SUMMARIZE	✔	✔
TRANSLATE	✔	✔

* Indicates a preview function or model. Preview features are not suitable for production workloads.

The following Snowflake Cortex AI functions and models are available in the following extended regions.

函数 Model	AWS US 东部 2 （俄亥俄）	AWS CA 中部 1 （中部）	AWS SA 东部 1 （圣保罗）	AWS 欧洲西部 2 （伦敦）	AWS 欧洲中部 1 （法兰克福）	AWS 欧洲北部 1 （斯德哥尔摩）	AWS AP 东北部 1 （东京）	AWS AP 南部 1 （孟买）	AWS AP 东南部 2 （悉尼）	AWS AP 东南部 3 （雅加达）	Azure 中南部 US （德克萨斯）	Azure 西部 US 2 （华盛顿）	Azure UK 南部（伦敦）	Azure 北欧（爱尔兰）	Azure 瑞士北部（苏黎世）	Azure 印度中部（浦那）	Azure 日本东部（东京、琦玉）	Azure 东南亚（新加坡）	Azure 澳大利亚东部（新南威尔士）	Google Cloud Europe West 2 （伦敦）	Google Cloud Europe West 4 （荷兰）	Google Cloud US Central 1 （爱荷华）	Google Cloud US East 4 （弗吉尼亚北部）
EMBED_TEXT_768
`snowflake-arctic-embed-m-v1.5`	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔
`snowflake-arctic-embed-m`	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔
EMBED_TEXT_1024
`multilingual-e5-large`	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔	✔
AI_EXTRACT	✔	✔	✔	✔	✔	Cross-region only	✔	Cross-region only	✔	Cross-region only	✔	✔	Cross-region only	✔	Cross-region only	✔	✔	✔	✔	Cross-region only	Cross-region only	Cross-region only	Cross-region only

The following table lists availability of legacy models. These models have not been deprecated and can still be used. However, Snowflake recommends newer models for new development.

旧版¶
函数（模型）	AWS US 西部 2 （俄勒冈）	AWS US 东部 1 （弗吉尼亚北部）	AWS 欧洲中部 1 （法兰克福）	AWS AP 东南部 2 （悉尼）	AWS AP 东北部 1 （东京）	Azure 东部 US 2 （弗吉尼亚）	Azure 西欧（荷兰）
AI_COMPLETE
`llama3-8b`	✔	✔	✔	✔	✔	✔
`llama3-70b`	✔	✔	✔		✔	✔
`mistral-large`	✔	✔	✔			✔	✔
`openai-o4-mini`						✔

Create stage for media files¶

Cortex AI Functions that process media files (documents, images, audio, or video) require the files to be stored on an internal or external stage. The stage must use server-side encryption. If you want to be able to query the stage or programmatically process all the files stored there, the stage must have a directory table.

The SQL below creates a suitable internal stage:

CREATE OR REPLACE STAGE input_stage
  DIRECTORY = ( ENABLE = true )
  ENCRYPTION = ( TYPE = 'SNOWFLAKE_SSE' );

Copy

To process files from external object storage (e.g., Amazon S3), create a storage integration, then create an external stage that uses the storage integration. To learn how to configure a Snowflake Storage Integration, see our detailed guides:

Create an external stage that references the integration and points to your cloud storage container. This example points to an Amazon S3 bucket:

CREATE OR REPLACE STAGE my_aisql_media_files
  STORAGE_INTEGRATION = my_s3_integration
  URL = 's3://my_bucket/prefix/'
  DIRECTORY = ( ENABLE = TRUE )
  ENCRYPTION = ( TYPE = 'AWS_SSE_S3' );

Copy

With an internal or external stage created, and files stored there, you can use Cortex AI Functions to process media files stored in the stage. For more information, see:

备注

AI Functions are currently incompatible with custom network policies.

Cortex AI Functions storage best practices¶

You may find the following best practices helpful when working with media files in stages with Cortex AI Functions:

Establish a scheme for organizing media files in stages. For example, create a separate stage for each team or project, and store the different types of media files in subdirectories.
Enable directory listings on stages to allow querying and programmatic access to its files.

小技巧

To automatically refresh the directory table for the external stage when new or updated files are available, set AUTO_REFRESH = TRUE when creating the stage.
For external stages, use fine-grained policies on the cloud provider side (for example, AWS IAM policies) to restrict the storage integration's access to only what is necessary.
Always use encryption, such as AWS_SSE or SNOWFLAKE_SSE, to protect your data at rest.

成本注意事项¶

Snowflake Cortex AI 函数根据处理的词元数产生计算成本。请参阅 Snowflake 服务使用表，以了解每个函数的每百万个词元消耗的 credit 成本。

A token is the smallest unit of text processed by Snowflake Cortex AI functions. An industry convention for text is that a token is approximately equal to four characters, although this can vary by model, as can token equivalence for media files.

For functions that generate new text using provided text (AI_COMPLETE, AI_CLASSIFY, AI_FILTER, AI_AGG, AI_SUMMARIZE, and AI_TRANSLATE, and their previous versions in the SNOWFLAKE.CORTEX schema), both input and output tokens are billable.
For Cortex Guard, only input tokens are counted. The number of input tokens is based on the number of tokens output from AI_COMPLETE (or COMPLETE). Cortex Guard usage is billed in addition to the cost of the AI_COMPLETE (or COMPLETE) function.
For AI_SIMILARITY, AI_EMBED, and the SNOWFLAKE.CORTEX.EMBED_* functions, only input tokens are counted.
对于 EXTRACT_ANSWER，可计费词元的数量是 from_text 和 question 字段中的词元数量之和。
AI_CLASSIFY, AI_FILTER, AI_AGG, AI_SENTIMENT, AI_SUMMARIZE_AGG, SUMMARIZE, TRANSLATE, AI_TRANSLATE, EXTRACT_ANSWER, ENTITY_SENTIMENT, and SENTIMENT add a prompt to the input text in order to generate the response. As a result, the billed token count is higher than the number of tokens in the text you provide.
AI_CLASSIFY 标签、描述和示例会作为每条已处理记录的输入词元进行计算，而不仅针对每次 AI_CLASSIFY 调用计算一次。
For AI_PARSE_DOCUMENT (or SNOWFLAKE.CORTEX.PARSE_DOCUMENT), billing is based on the number of document pages processed.
TRY_COMPLETE (SNOWFLAKE.CORTEX) does not incur costs for error handling. If the TRY_COMPLETE(SNOWFLAKE.CORTEX) function returns NULL, no cost is incurred.
For AI_EXTRACT, both input and output tokens are counted. The responseFormat argument is counted as input tokens. For document formats consisting of pages, the number of pages processed is counted as input tokens. Each page in a document is counted as 970 tokens.
AI_COUNT_TOKENS incurs only compute cost to run the function. No additional token-based costs are incurred.

对于支持图像或音频等媒体文件的模型：

音频文件按每秒钟音频 50 个令牌计费。
图像的令牌等价性由所用模型决定。有关更多信息，请参阅 AI图像成本注意事项。

Snowflake recommends executing queries that call a Snowflake Cortex AI Function with a smaller warehouse (no larger than MEDIUM). Larger warehouses do not increase performance. The cost associated with keeping a warehouse active continues to apply when executing a query that calls a Snowflake Cortex LLM Function. For general information on compute costs, see Understanding compute cost.

Warehouse sizing¶

Snowflake recommends using a warehouse size no larger than MEDIUM when calling Snowflake Cortex AI Functions. Using a larger warehouse than necessary does not increase performance, but can result in unnecessary costs. This recommendation may change in the future as we continue to evolve Cortex AI Functions.

跟踪 AI 服务的成本¶

要跟踪账户中用于 AI 服务（包括 LLM 函数）的 credit，请使用 METERING_HISTORY 视图：

SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.METERING_DAILY_HISTORY
  WHERE SERVICE_TYPE='AI_SERVICES';

Copy

Track credit consumption for Cortex AI Functions¶

To view the credit and token consumption for each AI Function call, use the CORTEX_FUNCTIONS_USAGE_HISTORY 视图:

SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_USAGE_HISTORY;

Copy

您还可以在 Snowflake 账户中查看每次查询的 credit 和令牌使用量。查看每次查询的 credit 和令牌使用量可帮助您确定使用 credit 和令牌最多的查询。

以下示例查询使用 CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY 视图以显示您账户中所有查询的 Credit 和词元使用量。

SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY;

Copy

您还可以使用同一视图查看特定查询的 Credit和词元使用量。

SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY
WHERE query_id='<query-id>';

Copy

备注

您无法获取 REST API 请求的详细使用信息。

查询使用历史记录按查询中使用的模型分组。例如，如果您运行了以下命令：

SELECT AI_COMPLETE('mistral-7b', 'Is a hot dog a sandwich'), AI_COMPLETE('mistral-large', 'Is a hot dog a sandwich');

Copy

查询使用历史记录将显示两行，一行是 mistral-7b，一行是 mistral-large。

使用量配额¶

On-demand Snowflake accounts without a valid payment method (such as trial accounts) are limited to 10 credits per day for Snowflake Cortex AI Functions usage. To remove this limit, convert your trial account to a paid account.

模型限制¶

Snowflake Cortex 使用的模型具有大小限制，如下表所述。大小以词元数指定。词元通常表示大约四个字符的文本，因此限制对应的单词数小于词元数。超过限制的输入将导致错误。

模型可以生成的输出的最大大小受以下限制：

模型的输出词元限制。
模型使用输入词元后上下文窗口中的可用空间。

例如，claude-3-5-sonnet 具有 200,000 个词元的上下文窗口。如果使用 100,000 个词元进行输入，则模型最多可以生成 8,192 个词元。但是，如果使用 195,000 个词元作为输入，则模型最多只能生成 5,000 个词元，总共 200,000 个词元。

重要

在 AWS AP 东南部 2（悉尼）区域：

llama3-8b 和 mistral-7b 的上下文窗口为 4,096 个词元。
llama3.1-8b 的上下文窗口为 16,384 个词元。
来自 SUMMARIZE 函数的 Snowflake 托管模型的上下文窗口为 4,096 个词元。

在 AWS 欧洲西部 1（爱尔兰）区域：

llama3.1-8b 的上下文窗口为 16,384 个词元。
mistral-7b 的上下文窗口为 4,096 个词元。

函数	模型	上下文窗口（词元）	Max output (tokens)
COMPLETE	`llama4-maverick`	128,000	8,192
	`llama4-scout`	128,000	8,192
	`snowflake-arctic`	4,096	8,192
	`deepseek-r1`	32,768	8,192
	`claude-sonnet-4-5`	200,000	64,000
	`claude-haiku-4-5`	200,000	64,000
	`claude-4-sonnet`	200,000	32,000
	`claude-3-7-sonnet`	200,000	32,000
	`claude-3-5-sonnet`	200,000	8,192
	`mistral-large`	32,000	8,192
	`mistral-large2`	128,000	8,192
	`openai-gpt-4.1`	128,000	32,000
	`openai-o4-mini`	200,000	32,000
	`openai-gpt-5`	272,000	8,192
	`openai-gpt-5-mini`	272,000	8,192
	`openai-gpt-5-nano`	272,000	8,192
	`openai-gpt-5-chat`	128,000	8,192
	`openai-gpt-oss-120b`	128,000	8,192
	`openai-gpt-oss-20b`	128,000	8,192
	`mixtral-8x7b`	32,000	8,192
	`llama3-8b`	8,000	8,192
	`llama3-70b`	8,000	8,192
	`llama3.1-8b`	128,000	8,192
	`llama3.1-70b`	128,000	8,192
	`llama3.3-70b`	128,000	8,192
	`snowflake-llama-3.3-70b`	128,000	8,192
	`llama3.1-405b`	128,000	8,192
	`snowflake-llama-3.1-405b`	8,000	8,192
	`mistral-7b`	32,000	8,192
EMBED_TEXT_768	`e5-base-v2`	512	不适用
	`snowflake-arctic-embed-m`	512	不适用
EMBED_TEXT_1024	`nv-embed-qa-4`	512	不适用
	`multilingual-e5-large`	512	不适用
	`voyage-multilingual-2`	32,000	不适用
AI_EXTRACT	`arctic-extract`	128,000	51,200
AI_FILTER	Snowflake 托管模型	128,000	不适用
AI_CLASSIFY	Snowflake 托管模型	128,000	不适用
AI_AGG	Snowflake 托管模型	每行 128,000 可以跨多行使用	8,192
AI_SENTIMENT	Snowflake 托管模型	2,048	不适用
AI_SUMMARIZE_AGG	Snowflake 托管模型	每行 128,000 可以跨多行使用	8,192
ENTITY_SENTIMENT	Snowflake 托管模型	2,048	不适用
EXTRACT_ANSWER	Snowflake 托管模型	2,048（对于文本） 64（对于问题）	不适用
SENTIMENT	Snowflake 托管模型	512	不适用
SUMMARIZE	Snowflake 托管模型	32,000	4,096
TRANSLATE	Snowflake 托管模型	4,096	不适用

选择模型¶

The Snowflake Cortex AI_COMPLETE function supports multiple models of varying capability, latency, and cost. These models have been carefully chosen to align with common customer use cases. To achieve the best performance per credit, choose a model that's a good match for the content size and complexity of your task. Here are brief overviews of the available models.

大型模型¶

If you're not sure where to start, try the most capable models first to establish a baseline to evaluate other models. claude-3-7-sonnet and mistral-large2 are the most capable models offered by Snowflake Cortex, and will give you a good idea what a state-of-the-art model can do.

Claude 3-7 Sonnet 在一般推理和多模态能力方面处于领先地位。它在需要跨领域和模态推理的任务中表现优于其前几代。您可以利用其庞大的输出容量，从结构化或非结构化查询中获取更多信息。它的推理能力和大型上下文窗口使其非常适合代理工作流程。
deepseek-r1 是一个基于大规模强化学习 (RL) 训练的基础模型，未经过监督微调 (SFT)。它在数学、代码和推理任务上均能实现高性能表现。要访问模型，请将跨区域推理参数设为 AWS_US。
mistral-large2 是 Mistral AI 先进的大型语言模型，具有极强的推理能力。与 mistral-large 相比，它在代码生成、数学、推理方面的能力要强得多，并提供更强大的多语言支持，非常适合需要大量推理能力或高度专业化的复杂任务，例如合成文本生成、代码生成和多语言文本分析。
llama3.1-405b 是来自 Meta 的 llama3.1 模型系列的开源模型，具有 128000 的大型上下文窗口。它在长文档处理、多语言支持、合成数据生成和模型提取方面表现出色。
snowflake-llama3.1-405b is a model derived from the open source llama3.1 model. It uses the SwiftKV optimizations developed by the Snowflake AI research team to deliver up to a 75% inference cost reduction. SwiftKV achieves higher throughput performance with minimal accuracy loss.

中型模型¶

llama3.1-70b 是一种开源模型，具有先进的性能，非常适合聊天应用程序、内容创建和企业应用程序。它是一种高性能、高性价比的模型，可通过 128000 的上下文窗口实现各种用例。llama3-70b 仍受支持，其上下文窗口为 8000。
snowflake-llama3.3-70b is a model derived from the open source llama3.3 model. It uses the SwiftKV optimizations developed by the Snowflake AI research team to deliver up to a 75% inference cost reduction. SwiftKV achieves higher throughput performance with minimal accuracy loss.
snowflake-arctic 是 Snowflake 侧重于企业的一流 LLM。Arctic 擅长执行企业任务，例如 SQL 生成、编码和指令遵循基准测试。
mixtral-8x7b 非常适合文本生成、分类和问答用途。Mistral 模型针对低延迟和低内存要求进行了优化，从而能为企业用例带来更高吞吐量。

小型模型¶

llama3.1-8b is ideal for tasks that require low to moderate reasoning. It's a light-weight, ultra-fast model with a context window of 128K. llama3-8b provides a smaller context window and relatively lower accuracy.
mistral-7b 非常适合需要快速完成的最简单的摘要、结构化和问答任务。它通过其 32000 上下文窗口为多页文本提供低延迟和高吞吐量处理。

The following table provides information on how popular models perform on various benchmarks, including the models offered by Snowflake Cortex AI_COMPLETE as well as a few other popular models.

模型	上下文窗口（词元）	MMLU （推理）	HumanEval （编码）	GSM8K （算术推理）	Spider 1.0 (SQL)
GPT 4.o (https://openai.com/index/hello-gpt-4o/)	128,000	88.7	90.2	96.4	-
Claude 3.5 Sonnet (https://www.anthropic.com/claude)	200,000	88.3	92.0	96.4	-
llama3.1-405b (https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md)	128,000	88.6	89	96.8	-
llama3.1-70b (https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md)	128,000	86	80.5	95.1	-
mistral-large2 (https://mistral.ai/news/mistral-large-2407/)	128,000	84	92	93	-
llama3.1-8b (https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md)	128,000	73	72.6	84.9	-
mixtral-8x7b (https://mistral.ai/news/mixtral-of-experts/)	32,000	70.6	40.2	60.4	-
Snowflake Arctic	4,096	67.3	64.3	69.7	79
mistral-7b (https://mistral.ai/news/announcing-mistral-7b/)	32,000	62.5	26.2	52.1	-
GPT 3.5 Turbo^*	4,097	70	48.1	57.1	-

以前的模型版本¶

The Snowflake Cortex AI_COMPLETE and COMPLETE functions also supports the following older model versions. We recommend using the latest model versions instead of the versions listed in this table.

模型	上下文窗口（词元）	MMLU （推理）	HumanEval （编码）	GSM8K （算术推理）	Spider 1.0 (SQL)
mistral-large (https://mistral.ai/news/mistral-large/)	32,000	81.2	45.1	81	81
llama-2-70b-chat (link removed)	4,096	68.9	30.5	57.5	-

Using Snowflake Cortex AI Functions with Python¶

Call Cortex AI Functions in Snowpark Python¶

You can use Snowflake Cortex AI Functions in the Snowpark Python API. These functions include the following. Note that the functions in Snowpark Python have names in Pythonic "snake_case" format, with words separated by underscores and all letters in lowercase.

`ai_agg` example¶

The ai_agg function aggregates a column of text using natural language instructions in a similar manner to how you would ask an analyst to summarize or extract findings from grouped or ungrouped data.

The following example summarizes customer reviews for each product using the ai_agg function. The function takes a column of text and a natural language instruction to summarize the reviews.

from snowflake.snowpark.functions import ai_agg, col

df = session.create_dataframe([
    [1, "Excellent product!"],
    [1, "Great battery life."],
    [1, "A bit expensive but worth it."],
    [2, "Terrible customer service."],
    [2, "Won’t buy again."],
], schema=["product_id", "review"])

# Summarize reviews per product
summary_df = df.group_by("product_id").agg(
    ai_agg(col("review"), "Summarize the customer reviews in one sentence.")
)
summary_df.show()

Copy

备注

请使用围绕具体用例展开的详细任务描述。例如，“Summarize the customer feedback for an investor report”。

Classify text with `ai_classify`¶

The ai_classify function takes a string or image and classifies it into the categories that you define.

以下示例将旅行评论分为“travel”和“cooking”等类别。该函数采用一列文本和一个类别列表来对文本进行分类。

from snowflake.snowpark.functions import ai_classify, col

df = session.create_dataframe([
    ["I dream of backpacking across South America."],
    ["I made the best pasta yesterday."],
], schema=["sentence"])

df = df.select(
    "sentence",
    ai_classify(col("sentence"), ["travel", "cooking"]).alias("classification")
)
df.show()

Copy

备注

最多可以提供 500 个类别。可以对文本和图像进行分类。

Filter rows with `ai_filter`¶

The ai_filter function evaluates a natural language condition and returns True or False. You can use it to filter or tag rows.

from snowflake.snowpark.functions import ai_filter, prompt, col

df = session.create_dataframe(["Canada", "Germany", "Japan"], schema=["country"])

filtered_df = df.select(
    "country",
    ai_filter(prompt("Is {0} in Asia?", col("country"))).alias("is_in_asia")
)
filtered_df.show()

Copy

备注

You can filter on both strings and files. For dynamic prompts, use the :code:prompt function. For more information, see Snowpark Python reference.

Call Cortex AI Functions in Snowflake ML¶

Snowflake ML contains the older AI Functions, those with names that don't begin with "AI". These functions are supported in version 1.1.2 and later of Snowflake ML. The names are rendered in Pythonic "snake_case" format, with words separated by underscores and all letters in lowercase.

如果在 Snowflake 之外运行 Python 脚本，则必须创建 Snowpark 会话才能使用这些函数。有关说明，请参阅连接到 Snowflake。

Process single values¶

以下 Python 示例演示了如何对单个值调用 Snowflake Cortex AI 函数：

from snowflake.cortex import complete, extract_answer, sentiment, summarize, translate

text = """
    The Snowflake company was co-founded by Thierry Cruanes, Marcin Zukowski,
    and Benoit Dageville in 2012 and is headquartered in Bozeman, Montana.
"""

print(complete("llama3.1-8b", "how do snowflakes get their unique patterns?"))
print(extract_answer(text, "When was snowflake founded?"))
print(sentiment("I really enjoyed this restaurant. Fantastic service!"))
print(summarize(text))
print(translate(text, "en", "fr"))

Copy

Pass hyperparameter options¶

You can pass options that affect the model's hyperparameters when using the complete function. The following Python example illustrates modifying the maximum number of output tokens that the model can generate:

from snowflake.cortex import complete, CompleteOptions

model_options1 = CompleteOptions(
    {'max_tokens':30}
)

print(complete("llama3.1-8b", "how do snowflakes get their unique patterns?", options=model_options1))

Copy

Call functions on table columns¶

You can call an AI function on a table column, as shown below. This example requires a session object (stored in session) and a table articles containing a text column abstract_text, and creates a new column abstract_summary containing a summary of the abstract.

from snowflake.cortex import summarize
from snowflake.snowpark.functions import col

article_df = session.table("articles")
article_df = article_df.withColumn(
    "abstract_summary",
    summarize(col("abstract_text"))
)
article_df.collect()

Copy

备注

The advanced chat-style (multi-message) form of COMPLETE is not currently supported in Snowflake ML Python.

将 Snowflake Cortex AI 函数与 Snowflake CLI 一起使用¶

Snowflake Cortex AI Functions are available in Snowflake CLI version 2.4.0 and later. See 隆重推出 Snowflake CLI for more information about using Snowflake CLI. The functions are the old-style functions, those with names that don't begin with "AI".

以下示例说明了如何在单个值上使用 snow cortex 命令。-c 参数指定要使用的连接。

备注

高级聊天风格（多消息）形式的 COMPLETE 目前在 Snowflake CLI 中不受支持。

snow cortex complete "Is 5 more than 4? Please answer using one word without a period." -c "snowhouse"

Copy

snow cortex extract-answer "what is snowflake?" "snowflake is a company" -c "snowhouse"

Copy

snow cortex sentiment "Mary had a little Lamb" -c "snowhouse"

Copy

snow cortex summarize "John has a car. John's car is blue. John's car is old and John is thinking about buying a new car. There are a lot of cars to choose from and John cannot sleep because it's an important decision for John."

Copy

snow cortex translate herb --to pl

Copy

也可以使用包含要用于命令的文本的文件。对于此示例，假设文件 about_cortex.txt 包含以下内容：

Snowflake Cortex gives you instant access to industry-leading large language models (LLMs) trained by researchers at companies like Anthropic, Mistral, Reka, Meta, and Google, including Snowflake Arctic, an open enterprise-grade model developed by Snowflake.

Since these LLMs are fully hosted and managed by Snowflake, using them requires no setup. Your data stays within Snowflake, giving you the performance, scalability, and governance you expect.

Snowflake Cortex features are provided as SQL functions and are also available in Python. The available functions are summarized below.

COMPLETE: Given a prompt, returns a response that completes the prompt. This function accepts either a single prompt or a conversation with multiple prompts and responses.
EMBED_TEXT_768: Given a piece of text, returns a vector embedding that represents that text.
EXTRACT_ANSWER: Given a question and unstructured data, returns the answer to the question if it can be found in the data.
SENTIMENT: Returns a sentiment score, from -1 to 1, representing the detected positive or negative sentiment of the given text.
SUMMARIZE: Returns a summary of the given text.
TRANSLATE: Translates given text from any supported language to any other.

然后，您可以通过使用 --file 参数传入文件名来执行 snow cortex summarize 命令，如下所示：

snow cortex summarize --file about_cortex.txt

Copy

Snowflake Cortex offers instant access to industry-leading language models, including Snowflake Arctic, with SQL functions for completing prompts (COMPLETE), text embedding (EMBED\_TEXT\_768), extracting answers (EXTRACT\_ANSWER), sentiment analysis (SENTIMENT), summarizing text (SUMMARIZE), and translating text (TRANSLATE).

有关这些命令的更多信息，请参阅 snow cortex commands。

法律声明¶

输入和输出的 Data Classification 如下表所示。

输入 Data Classification	输出 Data Classification	名称
Usage Data	Customer Data	正式发布的功能是涵盖的 AI 功能。预览版功能属于预览版 AI 功能。[1]

有关更多信息，请参阅 Snowflake AI 和 ML。

Snowflake Cortex AI Functions (including LLM functions)¶

可用函数¶

Cortex AI functions¶

辅助函数¶

Cortex Guard¶

性能注意事项¶

Cortex LLM privileges¶

CORTEX_USER database role¶

CORTEX_EMBED_USER database role¶

Using AI Functions in stored procedures with EXECUTE AS RESTRICTED CALLER¶

控制模型访问¶

账户级别的允许列表参数¶

基于角色的访问控制 (RBAC)¶

刷新模型对象和应用程序角色¶

将应用程序角色授予用户角色¶

使用具有支持功能的模型对象¶

使用 RBAC 搭配账户级允许列表¶

常见陷阱¶

支持的功能¶

Regional availability¶

Create stage for media files¶

Cortex AI Functions storage best practices¶

成本注意事项¶

Warehouse sizing¶

跟踪 AI 服务的成本¶

Track credit consumption for Cortex AI Functions¶

使用量配额¶

模型限制¶

选择模型¶

大型模型¶

中型模型¶

小型模型¶

以前的模型版本¶

Using Snowflake Cortex AI Functions with Python¶

Call Cortex AI Functions in Snowpark Python¶

ai_agg example¶

Classify text with ai_classify¶

Filter rows with ai_filter¶

Call Cortex AI Functions in Snowflake ML¶

Process single values¶

Pass hyperparameter options¶

Call functions on table columns¶

将 Snowflake Cortex AI 函数与 Snowflake CLI 一起使用¶

法律声明¶

`ai_agg` example¶

Classify text with `ai_classify`¶

Filter rows with `ai_filter`¶