- 类别:
:doc:`/sql-reference/functions-string`(全文搜索)
SEARCH¶
从一个或多个表中搜索指定列中的字符数据(文本),包括 VARIANT、OBJECT 和 ARRAY 列中的字段。文本分析器将文本分解为词元,这些词元是文本的离散单位,例如单词或数字。如果您不指定分析器,则会应用默认分析器。
有关使用此函数的更多信息,请参阅 使用全文搜索。
语法¶
SEARCH( <search_data>, <search_string> [ , ANALYZER => '<analyzer_name>' ] )
实参¶
search_data
您要搜索的数据,以字符串字面量、列名或到 VARIANT 列中字段的 路径 的逗号分隔的列表表示。搜索数据也可以是一个单一的字面量字符串,这在测试函数时可能很有用。
您可以指定通配符字符 (
*
),其中*
扩展为函数范围内所有表中所有符合条件的列。符合条件的列是那些具有 VARCHAR(文本)、VARIANT、ARRAY 和 OBJECT 数据类型的列。VARIANT、ARRAY 和 OBJECT 数据已转换为文本以便搜索。当您将通配符传递给函数时,您可以使用表的名称或别名来限定通配符。例如,要传入名为
mytable
的表中的所有列,请指定以下内容:(mytable.*)
您还可以使用 ILIKE 和 EXCLUDE 关键字进行筛选:
ILIKE 筛选条件,用于查找与指定模式匹配的列名。只允许使用一种模式。例如:
(* ILIKE 'col1%')
EXCLUDE 筛选出与指定列或列不匹配的列名。例如:
(* EXCLUDE col1) (* EXCLUDE (col1, col2))
使用这些关键字时,限定符有效。以下示例使用 ILIKE 关键字筛选出与表
mytable
中的模式col1%
相匹配的所有列:(mytable.* ILIKE 'col1%')
ILIKE 和 EXCLUDE 关键字不能组合在单个函数调用中。
关于 ILIKE 和 EXCLUDE 关键字的更多信息,请参阅 SELECT 中的“参数”部分。
当多个表在范围内时,您可以通过联连表或使用 UNION 集合运算符从多个表中搜索列。要搜索联接或 UNION 查询的输出中的所有列,可以使用不带限定的
*
通配符,如下所示:SELECT * FROM t AS T1 JOIN t AS T2 USING (col1) WHERE SEARCH((*), 'string');
要在联接表时搜索特定列,您可能需要限定列名(例如,
table2.colname
)。您还可以使用限定的*
通配符,如下所示:SELECT * FROM t AS T1 JOIN t AS T2 USING (col1) WHERE SEARCH((T2.*), 'string');
然而,请注意,您不能为该函数指定
*
或table.*
超过一次。在前面的联接示例中,您不能指定SEARCH((T1.*, T2.*), 'string')
。此语法返回错误。当列出
*
、table.*
或多个项目时,search_data
实参需要括号。例如:SEARCH((col1, col2, col3), 'string') SEARCH((t1.*), 'string') SEARCH((*), 'string')
如果没有使用括号来分隔多个项目,则逗号会被解析为函数参数之间的分隔符。
另请参阅 预期错误案例的示例。
您可以通过指定列名、冒号或点以及用点分隔的子字段,在 VARIANT 数据中搜索字段。例如:
colname:fieldname.subfieldname
。有关在此类列中指定字段的更多信息,请参阅 遍历半结构化数据。search_string
一个 VARCHAR 字符串,包含一个或多个搜索词。此参数必须是字面量字符串;不支持列名。用一对单引号将整个字符串括起来。不用引号将个别搜索词或短语括起来。例如,请使用:
'blue red green'
请勿使用:
'blue' 'red' 'green'
搜索词列表是析取的。在这种情况下,搜索将查找包含
blue
ORred
ORgreen
的行。然而,当使用 NO_OP_ANALYZER 时,查询字符串会被完全匹配,不会划分词元,也没有析取语义。搜索不区分大小写(使用 NO_OP_ANALYZER 时除外),因此对字符串
'Once upon a time'
执行搜索词'ONCE'
搜索会返回 TRUE。搜索词的顺序并不重要,只要它们在搜索数据中的存在即可。
ANALYZER => 'analyzer_name'
可选参数,用于指定文本分析器的名称。名称必须放在单引号内。
分析器将搜索词(以及正在搜索的列中的文本)分解为词元。如果从搜索字符串中提取的任何词元与正在搜索的任何列或字段中提取的词元完全匹配,则该行匹配。
分析器对字符串进行标记化处理时,会在发现特定分隔符的地方将其打断。这些定界符不包括在结果词元中,不会提取空词元。
该参数可接受以下值:
DEFAULT_ANALYZER:根据以下分隔符将文本分解为词元:
字符
Unicode 代码
描述
U+0020
空格
[
U+005B
左方括号
]
U+005D
右方括号
;
U+003B
分号
<
U+003C
小于号
>
U+003E
大于号
(
U+0028
左括号
)
U+0029
右括号
{
U+007B
左花括号
}
U+007D
右花括号
|
U+007C
竖线
!
U+0021
感叹号
,
U+002C
逗号
'
U+0027
撇号
"
U+0022
引号
*
U+002A
星号
&
U+0026
和号
?
U+003F
问号
+
U+002B
加号
/
U+002F
斜杠
:
U+003A
冒号
=
U+003D
等号
@
U+0040
@ 号
.
U+002E
句号
-
U+002D
连字符
$
U+0024
美元符号
%
U+0025
百分号
\
U+005C
反斜杠
_
U+005F
下划线(底线)
\n
U+000A
新行(换行符)
\r
U+000D
回车
\t
U+0009
水平制表符
UNICODE_ANALYZER:基于 Unicode 分割规则进行词元划分处理;这些规则将空格和某些标点符号视为分隔符。这些内部规则是为自然语言搜索(以多种不同语言)而设计的。例如,默认分析器将 IP 地址中的句点和缩写中的撇号视为分隔符,但 Unicode 分析器则不这样做。请参阅 使用分析器来调整搜索行为。
有关 Unicode Text Segmentation 算法的更多信息,请参阅 https://unicode.org/reports/tr29/ (https://unicode.org/reports/tr29/)。
NO_OP_ANALYZER:既不对数据也不对查询字符串进行划分词元处理。搜索词必须完全匹配列或字段中的完整文本,包括区分大小写;否则,SEARCH 函数将返回 FALSE。即使查询字符串看起来包含多个词元(例如,
'sky blue'
),列或字段也必须完全等于整个查询字符串。在这种情况下,只有'sky blue'
是匹配项;而'sky'
和'blue'
不是匹配项。
有关不同分析器行为的更多信息,请参阅 如何将搜索词划分为词元。
返回¶
返回 BOOLEAN。
如果在
search_data
中找到任何search_string
词元,则值为 TRUE。如果任一实参为 NULL,则返回 NULL。
否则返回 FALSE。
使用说明¶
SEARCH 函数仅针对 VARCHAR、VARIANT、ARRAY 和 OBJECT 数据执行操作。如果
search_data
实参不包含这些数据类型的数据,则该函数会返回错误。当search_data
实参同时包含支持的数据类型和不支持的数据类型时,函数会搜索支持的数据类型的数据,并静默忽略不支持的数据类型的数据。有关示例,请参阅 预期错误案例的示例。您可以使用 ALTER TABLE 命令在作为 SEARCH 函数调用目标的列上添加 FULL_TEXT 搜索优化。例如:
ALTER TABLE lines ADD SEARCH OPTIMIZATION ON FULL_TEXT(play, character, line);
有关更多信息,请参阅 启用 FULL_TEXT 搜索优化。
如何将搜索词划分为词元¶
以下表格展示了一些示例,说明输入搜索词是如何被拆分成标记的,这取决于所使用的分析器所应用的规则。在表格中,逗号表示拆分词元的位置(如果有的话)。
搜索词 |
词元:DEFAULT_ANALYZER |
词元:UNICODE_ANALYZER |
NO_OP_ANALYZER(不拆分) |
---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
示例¶
以下示例展示了使用 SEARCH 函数的不同方式,从简单用法开始,逐渐过渡到更复杂的用例。
匹配字面量¶
最简单的 SEARCH 函数的例子是在字符串字面量上对 TRUE 或 FALSE 的测试。第一个示例返回 TRUE,因为第一个和第二个实参的字面量匹配,前提是比较不区分大小写。
SELECT SEARCH('king','KING');
+-----------------------------+
| SEARCH('KING','KING') |
|-----------------------------|
| True |
+-----------------------------+
第二个示例返回 FALSE,因为在为第一个实参指定的字面量 5.1.33
中没有出现词元 32
。
SELECT SEARCH('5.1.33','32');
+-----------------------------+
| SEARCH('5.1.33','32') |
|-----------------------------|
| False |
+-----------------------------+
匹配列引用¶
此示例使用表中的一列作为第一个实参。该函数返回 TRUE,因为搜索词之一 (king
) 存在于 character
列中。搜索词列表是析取的。(有关此处及后续示例中使用的表的信息,请参阅 SEARCH 的样本数据。)
SELECT SEARCH(character, 'king queen'),character
FROM lines
WHERE line_id=4;
+--------------------------------------+---------------+
| SEARCH(CHARACTER,'KING QUEEN') | CHARACTER |
|--------------------------------------+---------------|
| True | KING HENRY IV |
+--------------------------------------+---------------+
在一列上搜索 WHERE 子句¶
以下查询使用 SEARCH 函数在 line
列中查找包含单词 "wherefore" 的行:
SELECT *
FROM lines
WHERE SEARCH(line, 'wherefore')
ORDER BY character LIMIT 5;
+---------+----------------------+------------+----------------+-----------+-----------------------------------------------------+
| LINE_ID | PLAY | SPEECH_NUM | ACT_SCENE_LINE | CHARACTER | LINE |
|---------+----------------------+------------+----------------+-----------+-----------------------------------------------------|
| 100109 | Troilus and Cressida | 31 | 2.1.53 | ACHILLES | Why, how now, Ajax! wherefore do you thus? How now, |
| 16448 | As You Like It | 2 | 2.3.6 | ADAM | And wherefore are you gentle, strong and valiant? |
| 24055 | The Comedy of Errors | 14 | 5.1.41 | AEMELIA | Be quiet, people. Wherefore throng you hither? |
| 99330 | Troilus and Cressida | 30 | 1.1.102 | AENEAS | How now, Prince Troilus! wherefore not afield? |
| 92454 | The Tempest | 150 | 2.1.343 | ALONSO | Wherefore this ghastly looking? |
+---------+----------------------+------------+----------------+-----------+-----------------------------------------------------+
在多个列上搜索 WHERE 子句¶
以下查询使用 SEARCH 函数在 play
列和/或 character
列中查找包含单词 "king" 的行。对于第一个实参,括号是必需的。
SELECT play, character
FROM lines
WHERE SEARCH((play, character), 'king')
ORDER BY play, character LIMIT 10;
+---------------------------+-----------------+
| PLAY | CHARACTER |
|---------------------------+-----------------|
| All's Well That Ends Well | KING |
| Hamlet | KING CLAUDIUS |
| Hamlet | KING CLAUDIUS |
| Henry IV Part 1 | KING HENRY IV |
| Henry IV Part 1 | KING HENRY IV |
| King John | CHATILLON |
| King John | KING JOHN |
| King Lear | GLOUCESTER |
| King Lear | KENT |
| Richard II | KING RICHARD II |
+---------------------------+-----------------+
在表中对所有符合条件的列进行通配符搜索¶
您可以将 *
字符(或 table.*
)用作 SEARCH 函数的第一个实参,如本示例所示。搜索对您选择的表(在本例中为 lines
表)中的所有符合条件的列进行操作。
lines
表有四个列,具有搜索功能支持的数据类型。请注意,结果由在四个搜索列中的一个或多个列中出现 "king" 的行组成。对于其中一列 (act_scene_line
),该函数未找到匹配项,但其他三列都有匹配项。
SELECT play, character, line, act_scene_line
FROM lines
WHERE SEARCH((lines.*), 'king')
ORDER BY act_scene_line LIMIT 10;
+-----------------+-----------------+----------------------------------------------------+----------------+
| PLAY | CHARACTER | LINE | ACT_SCENE_LINE |
|-----------------+-----------------+----------------------------------------------------+----------------|
| Pericles | LODOVICO | This king unto him took a fere, | 1.0.21 |
| Richard II | KING RICHARD II | Old John of Gaunt, time-honour'd Lancaster, | 1.1.1 |
| Henry VI Part 3 | WARWICK | I wonder how the king escaped our hands. | 1.1.1 |
| King John | KING JOHN | Now, say, Chatillon, what would France with us? | 1.1.1 |
| King Lear | KENT | I thought the king had more affected the Duke of | 1.1.1 |
| Henry IV Part 1 | KING HENRY IV | So shaken as we are, so wan with care, | 1.1.1 |
| Henry IV Part 1 | KING HENRY IV | Which, like the meteors of a troubled heaven, | 1.1.10 |
| King Lear | GLOUCESTER | so often blushed to acknowledge him, that now I am | 1.1.10 |
| Cymbeline | First Gentleman | Is outward sorrow, though I think the king | 1.1.10 |
| King John | CHATILLON | To this fair island and the territories, | 1.1.10 |
+-----------------+-----------------+----------------------------------------------------+----------------+
您还可以使用 ILIKE 和 EXCLUDE 关键字进行筛选。有关这些关键字的更多信息,请参阅 SELECT。
此搜索使用 ILIKE 关键字仅在以字符串 line
结尾的列中进行搜索。
SELECT play, character, line, act_scene_line
FROM lines
WHERE SEARCH((lines.* ILIKE '%line'), 'king')
ORDER BY act_scene_line LIMIT 10;
+-----------------+-----------------+--------------------------------------------------+----------------+
| PLAY | CHARACTER | LINE | ACT_SCENE_LINE |
|-----------------+-----------------+--------------------------------------------------+----------------|
| Pericles | LODOVICO | This king unto him took a fere, | 1.0.21 |
| Henry VI Part 3 | WARWICK | I wonder how the king escaped our hands. | 1.1.1 |
| King Lear | KENT | I thought the king had more affected the Duke of | 1.1.1 |
| Cymbeline | First Gentleman | Is outward sorrow, though I think the king | 1.1.10 |
+-----------------+-----------------+--------------------------------------------------+----------------+
此搜索使用 EXCLUDE 关键字,使得该函数不在 character
列中搜索数据。
SELECT play, character, line, act_scene_line
FROM lines
WHERE SEARCH((lines.* EXCLUDE character), 'king')
ORDER BY act_scene_line LIMIT 10;
+-----------------+-----------------+----------------------------------------------------+----------------+
| PLAY | CHARACTER | LINE | ACT_SCENE_LINE |
|-----------------+-----------------+----------------------------------------------------+----------------|
| Pericles | LODOVICO | This king unto him took a fere, | 1.0.21 |
| Henry VI Part 3 | WARWICK | I wonder how the king escaped our hands. | 1.1.1 |
| King John | KING JOHN | Now, say, Chatillon, what would France with us? | 1.1.1 |
| King Lear | KENT | I thought the king had more affected the Duke of | 1.1.1 |
| Cymbeline | First Gentleman | Is outward sorrow, though I think the king | 1.1.10 |
| King Lear | GLOUCESTER | so often blushed to acknowledge him, that now I am | 1.1.10 |
| King John | CHATILLON | To this fair island and the territories, | 1.1.10 |
+-----------------+-----------------+----------------------------------------------------+----------------+
在 SELECT 列表中进行通配符搜索¶
您可以在 SELECT 列表中使用 *
字符(或 table.*
),如这些示例所示。
以下搜索对您选择的表(在本例中为 lines
表)中的所有符合条件的列进行操作。当 "king" 出现在四个搜索列中的一个或多个中时,搜索返回 True
。
SELECT SEARCH((*), 'king') result, *
FROM lines
ORDER BY act_scene_line LIMIT 10;
+--------+---------+---------------------------+------------+----------------+-----------------+--------------------------------------------------------+
| RESULT | LINE_ID | PLAY | SPEECH_NUM | ACT_SCENE_LINE | CHARACTER | LINE |
|--------+---------+---------------------------+------------+----------------+-----------------+--------------------------------------------------------|
| True | 75787 | Pericles | 178 | 1.0.21 | LODOVICO | This king unto him took a fere, |
| True | 43494 | King John | 1 | 1.1.1 | KING JOHN | Now, say, Chatillon, what would France with us? |
| True | 49031 | King Lear | 1 | 1.1.1 | KENT | I thought the king had more affected the Duke of |
| True | 78407 | Richard II | 1 | 1.1.1 | KING RICHARD II | Old John of Gaunt, time-honour'd Lancaster, |
| False | 67000 | A Midsummer Night's Dream | 1 | 1.1.1 | THESEUS | Now, fair Hippolyta, our nuptial hour |
| True | 4 | Henry IV Part 1 | 1 | 1.1.1 | KING HENRY IV | So shaken as we are, so wan with care, |
| False | 12664 | All's Well That Ends Well | 1 | 1.1.1 | COUNTESS | In delivering my son from me, I bury a second husband. |
| True | 9526 | Henry VI Part 3 | 1 | 1.1.1 | WARWICK | I wonder how the king escaped our hands. |
| False | 52797 | Love's Labour's Lost | 1 | 1.1.1 | FERDINAND | Let fame, that all hunt after in their lives, |
| True | 28487 | Cymbeline | 3 | 1.1.10 | First Gentleman | Is outward sorrow, though I think the king |
+--------+---------+---------------------------+------------+----------------+-----------------+--------------------------------------------------------+
您还可以使用 ILIKE 和 EXCLUDE 关键字进行筛选。有关这些关键字的更多信息,请参阅 SELECT。
此搜索使用 ILIKE 关键字仅在以字符串 line
结尾的列中进行搜索。
SELECT SEARCH(* ILIKE '%line', 'king') result, play, character, line
FROM lines
ORDER BY act_scene_line LIMIT 10;
+--------+---------------------------+-----------------+--------------------------------------------------------+
| RESULT | PLAY | CHARACTER | LINE |
|--------+---------------------------+-----------------+--------------------------------------------------------|
| True | Pericles | LODOVICO | This king unto him took a fere, |
| False | King John | KING JOHN | Now, say, Chatillon, what would France with us? |
| True | King Lear | KENT | I thought the king had more affected the Duke of |
| False | Richard II | KING RICHARD II | Old John of Gaunt, time-honour'd Lancaster, |
| False | A Midsummer Night's Dream | THESEUS | Now, fair Hippolyta, our nuptial hour |
| False | Henry IV Part 1 | KING HENRY IV | So shaken as we are, so wan with care, |
| False | All's Well That Ends Well | COUNTESS | In delivering my son from me, I bury a second husband. |
| True | Henry VI Part 3 | WARWICK | I wonder how the king escaped our hands. |
| False | Love's Labour's Lost | FERDINAND | Let fame, that all hunt after in their lives, |
| True | Cymbeline | First Gentleman | Is outward sorrow, though I think the king |
+--------+---------------------------+-----------------+--------------------------------------------------------+
此搜索使用 EXCLUDE 关键字,使得该函数不在 play
或 line
列中搜索数据。
SELECT SEARCH(* EXCLUDE (play, line), 'king') result, play, character, line
FROM lines
ORDER BY act_scene_line LIMIT 10;
+--------+---------------------------+-----------------+--------------------------------------------------------+
| RESULT | PLAY | CHARACTER | LINE |
|--------+---------------------------+-----------------+--------------------------------------------------------|
| False | Pericles | LODOVICO | This king unto him took a fere, |
| True | King John | KING JOHN | Now, say, Chatillon, what would France with us? |
| False | King Lear | KENT | I thought the king had more affected the Duke of |
| True | Richard II | KING RICHARD II | Old John of Gaunt, time-honour'd Lancaster, |
| False | A Midsummer Night's Dream | THESEUS | Now, fair Hippolyta, our nuptial hour |
| True | Henry IV Part 1 | KING HENRY IV | So shaken as we are, so wan with care, |
| False | All's Well That Ends Well | COUNTESS | In delivering my son from me, I bury a second husband. |
| False | Henry VI Part 3 | WARWICK | I wonder how the king escaped our hands. |
| False | Love's Labour's Lost | FERDINAND | Let fame, that all hunt after in their lives, |
| False | Cymbeline | First Gentleman | Is outward sorrow, though I think the king |
+--------+---------------------------+-----------------+--------------------------------------------------------+
在联接的表中对符合条件的列进行通配符搜索¶
此示例使用了两个小表,其中包含有关汽车型号的信息。表 t1
有两个字符列,表 t2
有三个。您可以创建和加载这些表,如下所示:
CREATE OR REPLACE TABLE t1 (col1 INT, col2 VARCHAR(20), col3 VARCHAR(20));
INSERT INTO t1 VALUES
(1,'Mini','Cooper'),
(2,'Mini','Cooper S'),
(3,'Mini','Countryman'),
(4,'Mini','Countryman S');
CREATE OR REPLACE TABLE t2 (col1 INT, col2 VARCHAR(20), col3 VARCHAR(20), col4 VARCHAR(20));
INSERT INTO t2 VALUES
(1,'Mini','Cooper', 'Convertible'),
(2,'Mini','Cooper S', 'Convertible'),
(3,'Mini','Countryman SE','ALL4'),
(4,'Mini','Countryman S','ALL4');
针对 t1.*
和 t2.*
执行搜索时,以下两个查询的结果不同。t1
中只有两列符合搜索条件,但 t2
中有三列符合条件。
SELECT * FROM t1 JOIN t2 USING(col1)
WHERE SEARCH((t1.*),'s all4');
+------+------+--------------+------+--------------+-------------+
| COL1 | COL2 | COL3 | COL2 | COL3 | COL4 |
|------+------+--------------+------+--------------+-------------|
| 2 | Mini | Cooper S | Mini | Cooper S | Convertible |
| 4 | Mini | Countryman S | Mini | Countryman S | ALL4 |
+------+------+--------------+------+--------------+-------------+
SELECT * FROM t1 JOIN t2 USING(col1)
WHERE SEARCH((t2.*),'s all4');
+------+------+--------------+------+---------------+-------------+
| COL1 | COL2 | COL3 | COL2 | COL3 | COL4 |
|------+------+--------------+------+---------------+-------------|
| 2 | Mini | Cooper S | Mini | Cooper S | Convertible |
| 3 | Mini | Countryman | Mini | Countryman SE | ALL4 |
| 4 | Mini | Countryman S | Mini | Countryman S | ALL4 |
+------+------+--------------+------+---------------+-------------+
针对 UNION 子查询的输出执行通配符搜索¶
以下示例使用与前一个示例相同的两个表。在本例中,搜索应用于来自 t3``(它是子查询产生的表)的所有符合条件的列。子查询会计算 ``t1
和 ``t2``(五行)中前三列的 UNION。搜索返回了 UNION 结果中的两个匹配行。
SELECT *
FROM (
SELECT col1, col2, col3 FROM t1
UNION
SELECT col1, col2, col3 FROM t2
) AS T3
WHERE SEARCH((T3.*),'s');
+------+------+--------------+
| COL1 | COL2 | COL3 |
|------+------+--------------|
| 2 | Mini | Cooper S |
| 4 | Mini | Countryman S |
+------+------+--------------+
查找匹配多个搜索字符串的行¶
以下示例展示了如何在同一查询中使用 SEARCH 两次,使得两个搜索字符串都必须返回 TRUE,才能使行符合结果。
SELECT act_scene_line, character, line
FROM lines
WHERE SEARCH(line, 'Rosencrantz')
AND SEARCH(line, 'Guildenstern')
AND act_scene_line IS NOT NULL;
+----------------+------------------+-----------------------------------------------------------+
| ACT_SCENE_LINE | CHARACTER | LINE |
|----------------+------------------+-----------------------------------------------------------|
| 2.2.1 | KING CLAUDIUS | Welcome, dear Rosencrantz and Guildenstern! |
| 2.2.35 | KING CLAUDIUS | Thanks, Rosencrantz and gentle Guildenstern. |
| 2.2.36 | QUEEN GERTRUDE | Thanks, Guildenstern and gentle Rosencrantz: |
| 2.2.241 | HAMLET | Guildenstern? Ah, Rosencrantz! Good lads, how do ye both? |
| 4.6.27 | HORATIO | where I am. Rosencrantz and Guildenstern hold their |
| 5.2.60 | HORATIO | So Guildenstern and Rosencrantz go to't. |
| 5.2.389 | First Ambassador | That Rosencrantz and Guildenstern are dead: |
+----------------+------------------+-----------------------------------------------------------+
在联接中搜索 VARIANT 和 VARCHAR 数据¶
以下示例显示了 car_rentals
和 car_sales
这两个表的联接,搜索应用于两个表中的列。car_sales
表包含 VARIANT 数据。car_sales
表及其数据在 查询半结构化数据 下描述。以下 SQL 语句创建 car_rentals
表并向其中插入数据:
CREATE OR REPLACE TABLE car_rentals(
vehicle_make varchar(30),
dealership varchar(30),
salesperson varchar(30));
INSERT INTO car_rentals VALUES
('Toyota', 'Tindel Toyota', 'Greg Northrup'),
('Honda', 'Valley View Auto Sales', 'Frank Beasley'),
('Tesla', 'Valley View Auto Sales', 'Arturo Sandoval');
运行查询:
SELECT SEARCH((r.vehicle_make, r.dealership, s.src:dealership), 'Toyota Tesla')
AS contains_toyota_tesla, r.vehicle_make, r.dealership,s.src:dealership
FROM car_rentals r JOIN car_sales s
ON r.SALESPERSON=s.src:salesperson.name;
+-----------------------+--------------+------------------------+--------------------------+
| CONTAINS_TOYOTA_TESLA | VEHICLE_MAKE | DEALERSHIP | S.SRC:DEALERSHIP |
|-----------------------+--------------+------------------------+--------------------------|
| True | Toyota | Tindel Toyota | "Tindel Toyota" |
| False | Honda | Valley View Auto Sales | "Valley View Auto Sales" |
+-----------------------+--------------+------------------------+--------------------------+
在这个第二个示例中,针对相同的数据,使用了不同的搜索词:
SELECT SEARCH((r.vehicle_make, r.dealership, s.src:dealership), 'Toyota Honda')
AS contains_toyota_honda, r.vehicle_make, r.dealership, s.src:dealership
FROM car_rentals r JOIN car_sales s
ON r.SALESPERSON =s.src:salesperson.name;
+-----------------------+--------------+------------------------+--------------------------+
| CONTAINS_TOYOTA_HONDA | VEHICLE_MAKE | DEALERSHIP | S.SRC:DEALERSHIP |
|-----------------------+--------------+------------------------+--------------------------|
| True | Toyota | Tindel Toyota | "Tindel Toyota" |
| True | Honda | Valley View Auto Sales | "Valley View Auto Sales" |
+-----------------------+--------------+------------------------+--------------------------+
使用分析器来调整搜索行为¶
以下示例展示了如何通过指定非默认分析器 UNICODE_ANALYZER 或 NO_OP_ANALYZER 来调整 SEARCH 函数的行为。
第一个示例使用 NO_OP_ANALYZER 来测试字符串 1.2.500
是否与 lines
表的 act_scene_line
列中任何行的确切内容匹配。两行都符合搜索条件。
SELECT line_id, act_scene_line FROM lines
WHERE SEARCH(act_scene_line, '1.2.500', ANALYZER=>'NO_OP_ANALYZER');
+---------+----------------+
| LINE_ID | ACT_SCENE_LINE |
|---------+----------------|
| 91998 | 1.2.500 |
| 108464 | 1.2.500 |
+---------+----------------+
如果您在此示例中移除了 NO_OP_ANALYZER,让它不再是实参,搜索将返回大量的行。默认分析器将 1、2 和 500 视为不同的词元;因此,对于存在 1、2 或 500 的所有行(无论顺序或组合如何),该函数都返回 TRUE。
如果您将此查询更改为仅包含第二个实参的前缀 1.2,则默认分析器会返回 TRUE,但 UNICODE_ANALYZER 和 NO_OP_ANALYZER 都返回 FALSE。默认分析器将这些值中的句点视为分隔符,但 Unicode 分析器则不这样做。
以下两个查询展示了使用 UNICODE_ANALYZER 而不是默认分析器的另一种效果。第一个查询使用 UNICODE_ANALYZER,只返回了一行。请注意,第二个参数中的额外单引号是用来转义撇号的。请参阅 单引号字符串常量。
SELECT DISTINCT(play)
FROM lines
WHERE SEARCH(play, 'love''s', ANALYZER=>'UNICODE_ANALYZER');
+----------------------+
| PLAY |
|----------------------|
| Love's Labour's Lost |
+----------------------+
第二个查询使用默认分析器,返回四行,因为默认分析器将撇号字符视为分隔符。任何包含字母 "s" 作为词元的字符串都符合搜索条件。在此示例中,对于包含“撇号 s”的每个字符串 ('s
),函数返回 TRUE。
SELECT DISTINCT(play) FROM lines WHERE SEARCH(play, 'love''s');
+---------------------------+
| PLAY |
|---------------------------|
| All's Well That Ends Well |
| Love's Labour's Lost |
| A Midsummer Night's Dream |
| The Winter's Tale |
+---------------------------+
预期错误案例的示例¶
以下示例显示了返回预期语法错误的查询。
此示例失败了,因为 5
不是 search_string
实参支持的数据类型。
SELECT SEARCH(line, 5) FROM lines;
001045 (22023): SQL compilation error:
argument needs to be a string: '1'
此示例失败了,因为不存在为 search_data
实参指定支持的数据类型的列。
SELECT SEARCH(line_id, 'dream') FROM lines;
001173 (22023): SQL compilation error: error line 1 at position 7: Expected non-empty set of columns supporting full-text search.
此示例成功了,因为存在为 search_data
实参指定支持的数据类型的列。该函数忽略了 line_id
列,因为它不是支持的数据类型
SELECT SEARCH((line_id, play), 'dream') FROM lines
ORDER BY play LIMIT 5;
+----------------------------------+
| SEARCH((LINE_ID, PLAY), 'DREAM') |
|----------------------------------|
| True |
| True |
| False |
| False |
| False |
+----------------------------------+
此示例失败了,因为针对第一个实参列出了多个字符串字面量,没有括号,导致实参不匹配:
SELECT SEARCH('docs@snowflake.com', 'careers@snowflake.com', '@');
001881 (42601): SQL compilation error: Expected 1 named argument(s), found 0
此示例失败了,因为针对第一个实参列出了多个列名,没有括号,导致实参过多。
SELECT SEARCH(play,line,'king', ANALYZER=>'UNICODE_ANALYZER') FROM lines;
000939 (22023): SQL compilation error: error line 1 at position 7
too many arguments for function [SEARCH(LINES.PLAY, LINES.LINE, 'king', 'UNICODE_ANALYZER')] expected 3, got 4
此示例失败了,因为不接受列名作为搜索字符串实参。
SELECT SEARCH(line, character) FROM lines;
001015 (22023): SQL compilation error:
argument 2 to function SEARCH needs to be constant, found 'LINES.CHARACTER'
SEARCH 的样本数据¶
本节中的一些示例查询包含莎士比亚戏剧中的文本的表。每行文本存储在表的单独一行中。其他列标识戏剧的名称、角色的名称等。lines
表具有以下结构:
DESCRIBE TABLE lines;
+----------------+---------------+--------+-------+-
| name | type | kind | null? |
|----------------+---------------+--------+-------+-
| LINE_ID | NUMBER(38,0) | COLUMN | Y |
| PLAY | VARCHAR(50) | COLUMN | Y |
| SPEECH_NUM | NUMBER(38,0) | COLUMN | Y |
| ACT_SCENE_LINE | VARCHAR(10) | COLUMN | Y |
| CHARACTER | VARCHAR(30) | COLUMN | Y |
| LINE | VARCHAR(2000) | COLUMN | Y |
+----------------+---------------+--------+-------+-
例如,这个表中的一行看起来像这样:
SELECT * FROM lines
WHERE line_id=34230;
+---------+--------+------------+----------------+-----------+--------------------------------------------+
| LINE_ID | PLAY | SPEECH_NUM | ACT_SCENE_LINE | CHARACTER | LINE |
|---------+--------+------------+----------------+-----------+--------------------------------------------|
| 34230 | Hamlet | 19 | 3.1.64 | HAMLET | To be, or not to be, that is the question: |
+---------+--------+------------+----------------+-----------+--------------------------------------------+
如果您想运行本节中的示例,请通过运行以下命令创建此表:
CREATE OR REPLACE TABLE lines(
line_id INT,
play VARCHAR(50),
speech_num INT,
act_scene_line VARCHAR(10),
character VARCHAR(30),
line VARCHAR(2000)
);
INSERT INTO lines VALUES
(4,'Henry IV Part 1',1,'1.1.1','KING HENRY IV','So shaken as we are, so wan with care,'),
(13,'Henry IV Part 1',1,'1.1.10','KING HENRY IV','Which, like the meteors of a troubled heaven,'),
(9526,'Henry VI Part 3',1,'1.1.1','WARWICK','I wonder how the king escaped our hands.'),
(12664,'All''s Well That Ends Well',1,'1.1.1','COUNTESS','In delivering my son from me, I bury a second husband.'),
(15742,'All''s Well That Ends Well',114,'5.3.378','KING','Your gentle hands lend us, and take our hearts.'),
(16448,'As You Like It',2,'2.3.6','ADAM','And wherefore are you gentle, strong and valiant?'),
(24055,'The Comedy of Errors',14,'5.1.41','AEMELIA','Be quiet, people. Wherefore throng you hither?'),
(28487,'Cymbeline',3,'1.1.10','First Gentleman','Is outward sorrow, though I think the king'),
(33522,'Hamlet',1,'2.2.1','KING CLAUDIUS','Welcome, dear Rosencrantz and Guildenstern!'),
(33556,'Hamlet',5,'2.2.35','KING CLAUDIUS','Thanks, Rosencrantz and gentle Guildenstern.'),
(33557,'Hamlet',6,'2.2.36','QUEEN GERTRUDE','Thanks, Guildenstern and gentle Rosencrantz:'),
(33776,'Hamlet',67,'2.2.241','HAMLET','Guildenstern? Ah, Rosencrantz! Good lads, how do ye both?'),
(34230,'Hamlet',19,'3.1.64','HAMLET','To be, or not to be, that is the question:'),
(35672,'Hamlet',7,'4.6.27','HORATIO','where I am. Rosencrantz and Guildenstern hold their'),
(36289,'Hamlet',14,'5.2.60','HORATIO','So Guildenstern and Rosencrantz go to''t.'),
(36640,'Hamlet',143,'5.2.389','First Ambassador','That Rosencrantz and Guildenstern are dead:'),
(43494,'King John',1,'1.1.1','KING JOHN','Now, say, Chatillon, what would France with us?'),
(43503,'King John',5,'1.1.10','CHATILLON','To this fair island and the territories,'),
(49031,'King Lear',1,'1.1.1','KENT','I thought the king had more affected the Duke of'),
(49040,'King Lear',4,'1.1.10','GLOUCESTER','so often blushed to acknowledge him, that now I am'),
(52797,'Love''s Labour''s Lost',1,'1.1.1','FERDINAND','Let fame, that all hunt after in their lives,'),
(55778,'Love''s Labour''s Lost',405,'5.2.971','ADRIANO DE ARMADO','Apollo. You that way: we this way.'),
(67000,'A Midsummer Night''s Dream',1,'1.1.1','THESEUS','Now, fair Hippolyta, our nuptial hour'),
(69296,'A Midsummer Night''s Dream',104,'5.1.428','PUCK','And Robin shall restore amends.'),
(75787,'Pericles',178,'1.0.21','LODOVICO','This king unto him took a fere,'),
(78407,'Richard II',1,'1.1.1','KING RICHARD II','Old John of Gaunt, time-honour''d Lancaster,'),
(91998,'The Tempest',108,'1.2.500','FERDINAND','Were I but where ''tis spoken.'),
(92454,'The Tempest',150,'2.1.343','ALONSO','Wherefore this ghastly looking?'),
(99330,'Troilus and Cressida',30,'1.1.102','AENEAS','How now, Prince Troilus! wherefore not afield?'),
(100109,'Troilus and Cressida',31,'2.1.53','ACHILLES','Why, how now, Ajax! wherefore do you thus? How now,'),
(108464,'The Winter''s Tale',106,'1.2.500','CAMILLO','As or by oath remove or counsel shake')
;