LIST

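Returns a list of files from one of the following Snowflake storage features: a named internal stage, a named external stage, the stage for a specified table, the stage for the current user, or a Git repository clone.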

LIST can be abbreviated to LS.

See also:

REMOVE, PUT, COPY INTO <table>, COPY INTO <location>, GET

Syntax

The syntax differs depending on whether you’re listing files in a stage or a Git repository clone.

For a stage

LIST { internalStage | externalStage } [ PATTERN = '<regex_pattern>' ]

Where:

internalStage ::=
    @[<namespace>.]<int_stage_name>[/<path>]
  | @[<namespace>.]%<table_name>[/<path>]
  | @~[/<path>]
externalStage ::=
  @[<namespace>.]<ext_stage_name>[/<path>]

For a Git repository clone

LIST repositoryClone [ PATTERN = '<regex_pattern>' ]

Where:

repositoryClone ::=
  @[<namespace>.]<repository_clone>/<path>

Required parameters

For a stage

internalStage | externalStage

Specifies the location where the data files are staged:

@[namespace.]int_stage_name[/path]
    Files are in the specified named internal stage.
@[namespace.]ext_stage_name[/path]
    Files are in the specified named external stage.
@[namespace.]%table_name[/path]
    Files are in the stage for the specified table.
@~[/path]
    Files are in the stage for the current user.

Where:

- namespace is the database and/or schema in which the named stage or table resides. It is optional if a database and schema are currently in use within the session; otherwise, it is required.
- path is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a common string) that limits access to a set of files. Paths are alternatively called prefixes or folders by different cloud storage services.

If the stage name or path includes spaces or special characters, it must be enclosed in single quotes (e.g. '@"my stage"' for a stage named "my stage").

Specifying a path provides a scope for the LIST command, potentially reducing the amount of time required to run the command.
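The quoting and scoping rules above can be sketched as follows. This is a minimal illustration, not a definitive reference: "my stage" is the stage name from the note above, mystage and path1 are borrowed from the Examples section, and the reports/ path is hypothetical.

```sql
-- Stage name contains a space: double-quote the identifier and
-- wrap the whole stage reference in single quotes.
-- reports/ is a hypothetical path used only for illustration.
LIST '@"my stage"/reports/';

-- Scope the listing to one path instead of the whole stage,
-- which may reduce the time the command takes to run.
LIST @mystage/path1/;
```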

For a Git repository clone

repositoryClone

Specifies the name of the repository clone and the branch, tag, or commit for which to list files.

@[namespace.]repository_clone/path

When listing files from a Git repository clone, the path is required and must begin with one of the following:

branches/branch_name
    List files from the specified branch.
tags/tag_name
    List files from the specified tag.
commits/commit_hash
    List files from the commit specified by the commit hash.

If the repository clone name or path includes spaces or special characters, it must be enclosed in single quotes (for example, '@"my repository"' for a repository named "my repository").
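As a rough sketch of the three path forms described above, the statements below list files from a branch, a tag, and a commit. The repository clone name my_repo, the branch main, the tag v1.0.0, and the commit hash are all hypothetical placeholders; substitute values from your own repository clone.

```sql
-- my_repo is a hypothetical repository clone in the current database and schema.

-- List files from a branch (assumed here to be named main).
LIST @my_repo/branches/main;

-- List files from a tag (v1.0.0 is an illustrative tag name).
LIST @my_repo/tags/v1.0.0;

-- List files from a specific commit (the hash below is made up).
LIST @my_repo/commits/0a1b2c3d4e5f60718293a4b5c6d7e8f901234567;
```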

Optional parameters

PATTERN = 'regex_pattern'

Specifies a regular expression pattern for filtering files from the output. The command lists all files in the specified path and applies the regular expression pattern to each of the files found.
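Because the pattern is applied to the files found under the given path, combining a narrower path with PATTERN is one way to keep the listing small. A minimal sketch, assuming the named stage mystage contains a hypothetical 2024/ path that holds gzip-compressed CSV files:

```sql
-- mystage/2024/ is a hypothetical location; adjust to your own stage and path.
-- The path scopes the listing; the pattern then filters the files found there.
-- The pattern is intended to match files whose names end in .csv.gz.
LIST @mystage/2024/ PATTERN='.*[.]csv[.]gz';
```

The character class [.] is used here to match a literal dot without relying on backslash escaping inside the string literal.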

Usage notes

  • To run this command with an external stage that uses a storage integration, you must use a role that has or inherits the USAGE privilege on the storage integration.

    For more information, see Stage privileges.

  • In contrast to named stages, table and user stages are not first-class database objects; rather, they are implicit stages associated with the table or user. As such, they have no grantable privileges of their own.
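For the first usage note (external stages that use a storage integration), a minimal sketch of granting the required privilege might look like the following. The integration my_s3_int, the role analyst, and the stage my_ext_stage are hypothetical names, and the sketch assumes the role already holds whatever other privileges it needs on the stage, database, and schema.

```sql
-- Run as a role that is allowed to grant privileges on the integration.
-- my_s3_int, analyst, and my_ext_stage are hypothetical names.
GRANT USAGE ON INTEGRATION my_s3_int TO ROLE analyst;

-- Switch to the role and list files in the external stage.
USE ROLE analyst;
LIST @my_ext_stage;
```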

Output

The command returns the columns described in the following tables. Column values differ depending on whether you’re using LIST with a stage or a Git repository clone.

For a stage

Column          Data type   Description
name            VARCHAR     Name of the staged file
size            NUMBER      Size of the file compressed (in bytes)
md5             VARCHAR     MD5 hash of the contents of the staged data file (see the notes after this table)
sha1            VARCHAR     Not used
last_modified   VARCHAR     Timestamp when the file was last updated in the stage

Notes on the md5 column:

The MD5 column stores an MD5 hash of the contents of the staged data file.

For internal stages with default encryption (SNOWFLAKE_FULL), during upload the source file is encrypted with a random key, and its resulting MD5 digest will always differ from the original local file.

Amazon S3 stages report the value via the S3 eTag field, which might not be an MD5 hash of the file contents.

For Google Cloud stages that use a customer-managed encryption key (CMEK), md5 is expected to be NULL.

For more information, see Customer-managed encryption keys (https://cloud.google.com/storage/docs/encryption/customer-managed-keys).
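The output columns for a stage can be post-processed with a follow-up query. The sketch below relies on RESULT_SCAN and LAST_QUERY_ID, which are Snowflake functions not covered in this section, so treat it as an assumption-laden illustration. It reuses the mystage/path1 location from the Examples section and must run in the same session, immediately after the LIST statement.

```sql
-- List the files first; the next statement reads this command's output.
LIST @mystage/path1;

-- Summarize the listing. The output column names are lowercase,
-- so they are referenced with double quotes.
SELECT COUNT(*)    AS file_count,
       SUM("size") AS total_bytes
FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```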

For a Git repository clone

Column          Data type   Description
name            VARCHAR     Full file path with extension
size            NUMBER      Size of the file compressed (in bytes)
md5             VARCHAR     Not used
sha1            VARCHAR     A unique identifier generated by applying the SHA-1 hashing algorithm to the file’s contents. It is used by Git to track and reference the exact version of a file in the repository, and can be used to detect changes in the file’s content.
last_modified   VARCHAR     Timestamp of the commit associated with the listed files. This does not necessarily indicate when the file content was last changed.

Examples

For a stage

List all the files in the stage for the mytable table:

LIST @%mytable;

List all the files in the path1 path of the mystage named stage:

LIST @mystage/path1;

List the files that match a regular expression (i.e. all file names containing the string data_0) in the stage for the mytable table:

LIST @%mytable PATTERN='.*data_0.*';

List the files in the /analysis/ path of the my_csv_stage named stage that match a regular expression (i.e. all file names containing the string data_0):

LIST @my_csv_stage/analysis/ PATTERN='.*data_0.*';

Use the abbreviated form of the command to list all the files in the stage for the current user:

LS @~;

For a Git repository clone

For examples, see View a list of repository files.