Cortex Agents¶

Get started with Cortex Agents

概述¶

Cortex Agents 可协调处理结构化和非结构化数据源，从而提供见解。该服务可规划任务，使用工具执行这些任务，并生成响应。代理使用 |cortex-analyst|（结构化）和 Cortex Search（非结构化）作为工具，并结合 LLMs，用分析数据。Cortex Search 从非结构化来源中提取见解，而 |cortex-analyst| 生成 SQL 以处理结构化数据。全面的工具识别与工具执行支持，可实现基于企业数据的复杂应用程序交付。

该工作流涉及四个关键组件：

规划：应用程序通常需要在结构化与非结构化数据源之间切换处理。以设计用于回答用户查询的对话式应用程序为例。企业用户可能首先查询按收入排序的顶级分销商（结构化数据），随后转向询问合同细节（非结构化数据）。Cortex Agents 可以解析请求以编排计划并得出解决方案或响应。
1. 探索选项：当用户提出模糊查询时（例如“请介绍 Acme Supplies”），代理程序会考量不同衍生维度（产品、地理位置或销售人员）以消除歧义并提升准确性。
2. 拆分为子任务：Cortex代理程序可将任务或请求（例如“Acme Supplies 与 Acme Stationery 的合同条款有何差异？”）拆分为多个部分，以提供更精准的响应。
3. 跨工具路由：代理会精准选择工具 Cortex Analyst 或 Cortex Search，以确保受控访问并符合企业策略。
工具使用：制定计划后，代理可以有效地检索数据。Cortex Search 从非结构化来源中提取见解，而 Cortex Analyst 生成 SQL 以处理结构化数据。全面的工具识别与工具执行支持，可实现基于企业数据的复杂应用程序交付。
反射：在每次工具调用后，代理会评估结果以确定后续步骤，包括请求澄清、迭代处理或生成最终响应。通过这种编排机制，代理可在 Snowflake 安全边界内处理复杂数据查询，同时确保准确性与合规性。
Monitor, evaluate, and iterate: After deployment, you can track metrics, analyze performance, perform evaluations, and refine behavior for continuous improvements. By monitoring and refining your agent, you can continuously improve performance and response accuracy.

有关帮助您入门的教程，请参阅 Cortex Agents 教程。

备注

尽管 Snowflake 致力于提供高质量响应，但 LLM 生成的响应内容及其引用来源的准确性不作保证。在向用户提供代理 API 生成的答案前，您应当审核所有响应内容。

访问控制要求¶

To make a request to Cortex Agent via agent:run API, you can use a role that has the SNOWFLAKE.CORTEX_USER or SNOWFLAKE.CORTEX_AGENT_USER role granted. The CORTEX_USER provides access to all Covered AI features including Cortex Agents whereas CORTEX_AGENT_USER provides access to the Agents feature.

备注

You must use the user's default role when calling or updating Cortex Agents. To allow another role to edit the agent, grant USAGE on the database, schema, and agent to that role.

GRANT USAGE ON DATABASE <database_name> to ROLE <role_name>;
GRANT USAGE ON SCHEMA <database_name>.<schema_name> to ROLE <role_name>;
GRANT USAGE ON AGENT <database_name>.<schema_name>.<agent_name> to ROLE <role_name>;

To use Cortex Agents with a semantic model, you also need the following privileges:


权限	对象	Notes
CREATE AGENT	Schema	Required to create the Cortex Agent.
USAGE	Cortex Search service	Required to run the Cortex Search services in the Cortex Agents request.
USAGE	Database, schema, table	Required for access the objects referenced in the Cortex Agents semantic model.
OWNERSHIP	Agent	OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the GRANT OWNERSHIP command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). In a managed access schema, only the schema owner (for example. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants.
MODIFY	Agent	Required to update the Cortex Agent.
MONITOR	Agent	Required to view threads, logs, and traces of the Cortex Agent.
USAGE	Agent	Required to query the Cortex Agent to generate responses.

Requests to the Cortex Agents API must include an authorization token. For details on how to authenticate to the API, see 使用 Snowflake 对 Snowflake REST APIs 进行身份验证. Note that the example in this topic uses a session token to authenticate to a Snowflake account.

Limiting access to specific roles

默认情况下，CORTEX_USER 角色被授予给 PUBLIC 角色。PUBLIC 角色会自动授予所有用户和角色。如果您不希望所有用户都拥有此权限，则可以撤销对 PUBLIC 角色的访问权限，并授予对特定角色的访问权限。有关更多信息，请参阅 Cortex LLM privileges。

To provide selective access to Cortex Agents so that only a subset of users have access to the feature, use the CORTEX_AGENTS_USER role.

Limiting access using the Cortex Agents user role

To provide selective access to Cortex Agents for specific users, use the SNOWFLAKE.CORTEX_AGENT_USER database role. This role includes the privileges needed to call the Cortex Agent API.

重要

If your user roles have the CORTEX_USER role, you must revoke access to the CORTEX_USER role. To revoke the CORTEX_USER database role from your user roles, run the following command using the ACCOUNTADMIN role:

REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER FROM ROLE agent;

To provide access to Cortex Agents, use the ACCOUNTADMIN role to do the following:

Grant the SNOWFLAKE.CORTEX_AGENT_USER database role to a custom role.
Assign this custom role to users.

备注

You can't grant database roles directly to users. For more information, see GRANT DATABASE ROLE.

以下示例：

Creates the custom role, cortex_agent_user_role.
Grants it the CORTEX_AGENT_USER database role.
Assigns this role to example_user.

USE ROLE ACCOUNTADMIN;
CREATE ROLE cortex_agent_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_AGENT_USER TO ROLE cortex_agent_user_role;

GRANT ROLE cortex_agent_user_role TO USER example_user;

You can also grant access to Cortex Agents through existing roles. For example, if you have an agent role used by agents in your organization, you can grant access with a single GRANT statement:

GRANT DATABASE ROLE SNOWFLAKE.CORTEX_AGENT_USER TO ROLE agent;

身份验证¶

Snowflake REST APIs 支持通过编程访问令牌 (PATs) 进行身份验证、使用 JSON Web 令牌 (JWTs) 进行密钥对身份验证，以及 OAuth。有关详细信息，请参阅使用 Snowflake 对 Snowflake REST APIs 进行身份验证。

重要

Cortex Agents 使用的模型可能并非在所有区域都可用。有关更多信息，请参阅可用性。

重要

Cortex Agent APIs are not supported from within a Streamlit in Snowflake (SiS) application using a warehouse runtime. To call Cortex Agent APIs from a SiS app, use a container runtime instead. For more information, see Streamlit 应用程序的运行时环境.

成本注意事项¶

Cortex Agents incur charges for the orchestration and use of tools.

The orchestration usage is charged based on the tokens used.

Cortex Analyst is charged per token.

Cortex Search charges depend on the size of the index and the time it has persisted.

Warehouse charges depend on the size of the warehouse and how long it runs.

For more information, see the Snowflake Service Consumption Table. Also, use of custom tools may incur warehouse costs.

模型¶

You can use the following models with Cortex Agents. If the model is not available in the local region, you must use cross-region inference.

When creating an agent, we recommend selecting auto for the model. With this option, Cortex automatically selects the highest quality model for your account, and the quality automatically improves as new models become available.

auto
claude-haiku-4-5
claude-sonnet-4-5
claude-4-6-sonnet
claude-4-sonnet
openai-gpt-4-1

The following tables show the models that are available for each region:


Model	Cross-cloud (Any region)	AWS US (Cross-region)	AWS EU (Cross-region)	AWS APJ (Cross-region)	Azure US (Cross-region)
`claude-haiku-4-5`	*	*
`claude-sonnet-4-5`	✔	✔	✔
`claude-4-sonnet`	✔	✔	✔	✔
`claude-4-6-sonnet`	✔	✔
`openai-gpt-4.1`	✔				✔

* Indicates a preview function or model. Preview features are not suitable for production workloads.

Cortex Agents¶

Cortex Agents use Cortex Analyst, Cortex Search and custom tools to plan tasks and generate responses. You can influence the orchestration with instructions. You can also specify attributes to dynamically select a tool based on business logic.

During an interaction, Agents use a thread to maintain context. A thread provides an easy retrieval of the entire conversation context for use in application logic.

You can collect feedback from end-users as you continuously iterate and refine the Agent. An explicit feedback mechanism (positive/negative rating) coupled with subjective feedback (text) allows you to capture user inputs throughout the lifecycle of the Agent.

Agent object¶

The agent configuration includes all metadata, orchestration settings, and tool details that are stored in the agent object. You can use the agent object to interact with the agent.

Threads¶

Threads persist the context of your interactions with the agent, so you don't have to maintain context on the client application. To use threads, you create a thread object and reference the thread ID in the agent interactions.

编排¶

Cortex Agents use LLM-based orchestration to plan tasks and generate responses. You can control the orchestration with the following settings:

模型¶

For information about the models you can use with Cortex Agents for orchestration, see 模型.

说明¶

Response instructions allow you to configure the agent responses to a brand and tone of your preference.

Sample questions¶

You can use these questions to seed the conversation in your client application. These are common questions that can get users started with the interaction.

工具¶

Cortex Agents can orchestrate across both structured and unstructured data. Also, custom tools allow agents to interact with other backend systems or implement custom logic.

备注

Tool names must be between 1 and 64 characters. If you're using a semantic view as a tool, the semantic view name is used as the tool name.

Cortex Analyst semantic view¶

您可以使用 Cortex Analyst 将自然语言转换为 SQL 查询。要使用 Cortex Analyst，您必须创建一个语义模型。有关更多信息，请参阅创建语义模型。

创建 Cortex Search 服务¶

使用 Cortex Search 搜索您的数据。有关更多信息，请参阅 CREATE CORTEX SEARCH SERVICE。

Agents can dynamically adjust the following search parameters if the user's query requires it: filter conditions, metadata columns to retrieve, number of results, per-index queries for multi-index services, and time-decay settings.

备注

查询用户的 DEFAULT_ROLE 必须对Cortex Search 服务及其所在的数据库和架构拥有 USAGE 权限。

Custom tools¶

You can use stored procedures and user defined functions (UDF) to implement custom business logic as a tool. For more information, see 存储过程概述 and 用户定义的函数概述.

Thinking and reflection¶

The Agent emits events throughout the interaction, providing insights into the reasoning process. These steps cover the initial splitting of tasks, sequencing into sub-tasks, and selection of tools for the sub-task. In addition, the agent also surfaces its reflections about tool results and how these influence further orchestration.

Monitor, evaluate, and iterate¶

You can collect feedback from the end user as a rating (positive/negative), along with any subjective inputs (as text). These can be used to refine and improve the agent over the lifecycle. For more information on how to perform monitoring and evaluation with native Snowflake features, see 监控 Cortex Agent 请求 and Cortex Agent 评估.

Web search¶

Before providing web search access to your agents, an ACCOUNTADMIN role must first enable web search access at the account level. To properly enable web search:

Sign in to Snowsight.
In the navigation menu, select AI & ML » Agents.
Select Settings.
Select the Web search toggle to enable the feature, as shown below.

After enabling web search at the account level, you can use the web search tool in your agents. For more information, see 创建代理.

Cortex Agents use the Brave Web Search API to query the web and retrieve results for real-time information during an interaction. The agent creates a query based on the user’s input and any relevant context from the interaction. The API returns results from Brave Search's independent web index. The query and the results leave Snowflake and traverse the public internet. The agent then incorporates the relevant results into its response alongside any data from other configured tools. Snowflake has enabled zero data retention (ZDR) with Brave, which means no search queries are stored by Brave for any length of time. This applies to the search query text, the results returned, and any metadata associated with the request. ZDR simplifies compliance obligations and reduces risk — because the data is never stored.

Interact with agents¶

Cortex Agents support two distinct methods of interacting with agents through the REST API:

Configure an agent object to interact with the agent: With this method, you first configure an agent object that can be reused for the entire interaction. Configuring an agent object simplifies client code and enables CI/CD for enterprise-ready applications.
Interact without an agent object: With this method, you must pass the agent configuration as part of every interaction request. Interaction without an agent object allows you to quickly try out use cases and experiment with different scenarios.

有关帮助您入门的教程，请参阅 Cortex Agents 教程。

Legal notices¶

Where your configuration of Cortex Agents uses a model provided on the Model and Service Flow-down Terms, your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.


Input data classification	Output data classification	Designation
Usage Data	Customer Data	Covered AI Features [1]

For additional information, refer to Snowflake AI 和 ML.