SplitExcel 2.3.0¶
BUNDLE¶
org.apache.nifi | nifi-poi-nar
DESCRIPTION¶
This processor splits a multi sheet Microsoft Excel spreadsheet into multiple Microsoft Excel spreadsheets where each sheet from the original file is converted to an individual spreadsheet in its own flow file. Currently this processor is only capable of processing .xlsx (XSSF 2007 OOXML file format) Excel documents and not older .xls (HSSF ‘97(-2007) file format) documents. Please note all original cell styles are dropped and formulas are removed leaving only the calculated values. Even a single sheet Microsoft Excel spreadsheet is converted to its own flow file with all the original cell styles dropped and formulas removed.
INPUT REQUIREMENT¶
REQUIRED
Supports Sensitive Dynamic Properties¶
false
PROPERTIES¶
Property |
Description |
---|---|
Password |
The password for a password protected Excel spreadsheet |
Protection Type |
Specifies whether an Excel spreadsheet is protected by a password or not. |
RELATIONSHIPS¶
NAME |
DESCRIPTION |
---|---|
failure |
If a FlowFile cannot be transformed from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship. |
split |
The individual Excel ‘segments’ of the original Excel FlowFile will be routed to this relationship. |
original |
The original FlowFile that was split into segments. If the FlowFile fails processing, nothing will be sent to this relationship |
WRITES ATTRIBUTES¶
NAME |
DESCRIPTION |
---|---|
fragment.identifier |
All split Excel FlowFiles produced from the same parent Excel FlowFile will have the same randomly generated UUID added for this attribute |
fragment.index |
A one-up number that indicates the ordering of the split Excel FlowFiles that were created from a single parent Excel FlowFile |
fragment.count |
The number of split Excel FlowFiles generated from the parent Excel FlowFile |
segment.original.filename |
The filename of the parent Excel FlowFile |
sheetname |
The name of the Excel sheet from the original spreadsheet. |
total.rows |
The number of rows in the Excel sheet from the original spreadsheet. |