ExtractDocumentRawText 2025.3.28.13-SNAPSHOT

BUNDLE

com.snowflake.openflow.runtime | runtime-document-layout-nar

DESCRIPTION

Extracts the text from a Document and writes it to the FlowFile content. This does not include any text found in any Processing Elements.

TAGS

document, openflow, rag, retrieval augmented generation, text, unstructured

INPUT REQUIREMENT

REQUIRED

Supports Sensitive Dynamic Properties

false

RELATIONSHIPS

NAME

DESCRIPTION

failure

If the text of a FlowFile cannot be extracted for any reason, the input FlowFile will be routed to this relationship.

success

The text of the PDF is routed to the success relationship.

SEE ALSO

Language: English