Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
en:cheatsheet [2020-06-06] – external edit 127.0.0.1 | en:cheatsheet [2024-03-21] – Kamil Nešetřil | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | ======Pentaho Data Integration Cheat Sheet====== | ||
+ | //This is a short guideline for Kettle: Pentaho Data Integration (< | ||
+ | > Since Hitachi acquired Pentaho, the development of the Pentaho platform has stagnated, and the platform' | ||
+ | |||
+ | {{ : | ||
+ | ===== How to start ===== | ||
+ | - Install [[https:// | ||
+ | If you cannot install it, use the [[https:// | ||
+ | When Java gets **updated** then '' | ||
+ | - Download | ||
+ | - Unzip the file to the folder of your choice. | ||
+ | - Run ''< | ||
+ | - Go to the **Design** tab. \\ {{: | ||
+ | - Drag and drop items from the left bar to the canvas. | ||
+ | |||
+ | ===== Some steps used in transformations ===== | ||
+ | ^ Symbol | ||
+ | | {{ : | ||
+ | |Other steps for data input and output from/to databases, other sources (e-mail, local computer, FTP, HTTP), and files (MS Excel, MS Access, [[wpcs> | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | | [[https:// | ||
+ | |||
+ | Other steps: | ||
+ | * Google Spreadsheet Input/ | ||
+ | * [[git> | ||
+ | * [[https:// | ||
+ | * [[git> | ||
+ | |||
+ | ===== Often used job entries ===== | ||
+ | ^ Symbol ^ Name ^ Description ^ | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | | {{ : | ||
+ | |||
+ | Official source with all [[https:// | ||
+ | |||
+ | =====Regular expressions===== | ||
+ | |||
+ | **Table** Selection of the input files (regex corresponds to the file name) | ||
+ | ^Description ^Reguar expression | ||
+ | |'' | ||
+ | |All files|'' | ||
+ | |Files starting with '' | ||
+ | |||
+ | **Table** Select part of a text string | ||
+ | ^Description | ||
+ | |Between brackets |<code reg> | ||
+ | |Up to '','','';'' | ||
+ | |Up to '','', | ||
+ | |||
+ | |||
+ | =====Tips and tricks===== | ||
+ | * Empty rows in GUI dialogs cause errors. | ||
+ | * Manage errors in separate streams. The first step in a transformation cannot deal with error rows because they do not exist in PDI yet. | ||
+ | * Use [[https:// | ||
+ | * Use variables and parameters (see above) | ||
+ | * Use relative paths ('' | ||
+ | * Check '' | ||
+ | * To export to e.g. SQLite, first create the table by SQL and then load data there (even in the same transformation because scripts are executed first). | ||
+ | * [[https:// | ||
+ | * [[git> | ||
+ | * [[https:// | ||
+ | * [[vid> | ||
+ | |||
+ | |||
+ | * **FAQ [[https:// | ||
+ | * //When I start ''< | ||
+ | * Edit the ''< | ||
+ | * Replace in the last line '' | ||
+ | * Add a '' | ||
+ | * Save and try it again. | ||
+ | * //How to use JNDI?// | ||
+ | * If you look inside the PDI main directory you'll see a sub-directory called '' | ||
+ | |||
+ | < | ||
+ | |||
+ | ===== References ===== | ||
+ | ROLDÁN, María Carina, 2017. //Learning Pentaho Data Integration 8 CE : Third Edition//. [[https:// |
Last modified: 2024-04-20