Pentaho Data Integration Beginner’s Guide Apr 2026
Primary Features and Benefits
PDI is metadata-oriented, meaning users specify what to do through the GUI rather than writing code for how to do it.
It supports data extraction from numerous sources, including relational databases, Excel, XML, Hadoop, and Amazon S3.

PDI utilizes a suite of tools, often collectively referred to by their original names (the "Kettle" project components):
Pan: A command-line tool specifically for executing transformations.
Kitchen: A command-line tool used to execute jobs.
Carte: A lightweight web server that allows for remote execution and monitoring of transformations and jobs.

Key Concepts: Transformations vs. Jobs
For beginners, understanding the distinction between these two building blocks is critical:
Transformations: Focused on moving and manipulating data. They consist of steps (e.g., reading a CSV, filtering rows) that typically run in parallel to maximize performance.
Jobs: Focused on high-level orchestration and flow control. They coordinate transformations and other job entries (like sending an email or checking if a file exists) in a sequential manner.
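The difference between a transformation and a job can be easier to see in a small analogy. The sketch below is plain Python, not PDI code: the function names, the sample data, and the log messages are all made up for illustration. A "transformation" is modeled as steps that pass rows to one another as a stream, and a "job" as entries that run strictly one after another. The sketch only imitates row streaming with generators; it does not run the steps concurrently the way the text describes for PDI.

```python
import csv
import io

# Hypothetical input standing in for a CSV file on disk.
CSV_TEXT = "name,amount\nalice,120\nbob,45\ncarol,300\n"


def read_csv_step(text):
    """Step 1: emit one row (a dict) per CSV record."""
    yield from csv.DictReader(io.StringIO(text))


def filter_rows_step(rows, minimum):
    """Step 2: pass along only rows whose amount is at least `minimum`."""
    for row in rows:
        if int(row["amount"]) >= minimum:
            yield row


def run_transformation(text, minimum=100):
    """Transformation analogy: rows flow from one step directly into the next."""
    return list(filter_rows_step(read_csv_step(text), minimum))


def run_job(text):
    """Job analogy: entries run in sequence, each gated on the previous one."""
    log = []
    if not text.strip():                 # entry 1: stand-in for "check if a file exists"
        log.append("abort: no input")
        return log
    log.append("input found")
    rows = run_transformation(text)      # entry 2: run the transformation
    log.append(f"transformation wrote {len(rows)} rows")
    log.append("notification sent")      # entry 3: stand-in for "send an email"
    return log
```

Calling `run_job(CSV_TEXT)` walks through the three entries in order, while `run_transformation(CSV_TEXT)` returns only the rows that survive the filter step. The point of the split is that row-level work lives in the transformation and the "what happens next" decisions live in the job.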

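Because one command-line tool executes transformations and Kitchen executes jobs, a script that launches PDI work has to pick the right launcher for each file. The following is a minimal sketch of that choice, under several assumptions: a Linux install where the launchers are named `pan.sh` and `kitchen.sh`, the usual `.ktr` (transformation) and `.kjb` (job) file extensions, and the `-file=`, `-level=`, and `-param:NAME=value` options. The install directory, file paths, and parameter names are hypothetical, and the exact option syntax should be checked against the documentation for your PDI version.

```python
def command_for(path, install_dir="/opt/pentaho/data-integration",
                level="Basic", params=None):
    """Build an argument list for running a PDI file from the command line.

    The install_dir default is a hypothetical location; adjust it to your setup.
    """
    if path.endswith(".ktr"):
        script = "pan.sh"        # transformations are executed by Pan
    elif path.endswith(".kjb"):
        script = "kitchen.sh"    # jobs are executed by Kitchen
    else:
        raise ValueError(f"not a .ktr or .kjb file: {path}")

    argv = [f"{install_dir}/{script}", f"-file={path}", f"-level={level}"]
    # Named parameters are passed one per argument (assumed syntax).
    for name, value in (params or {}).items():
        argv.append(f"-param:{name}={value}")
    return argv
```

The returned list can be handed to `subprocess.run(argv, check=True)` on a machine where PDI is installed. Building a list instead of a single string avoids shell-quoting problems when paths or parameter values contain spaces.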