The ultimate resource on building and deploying data integration solutions with Kettle Kettle is a scaleable and extensible open source ETL and data integration tool that lets you extract data from databases, flat and XML files, web services, ERP systems, and OLAP cubes. It provides over 120 built-in transformation steps to validate, cleanse, and conform data, as well as numerous options to load data into data warehouses and many other targets. Kettle is a comprehensive, low-cost
I wanted to do this review much sooner but I've been too busy using the book.
Jos and Roland have taken the proven formula they used in Pentaho Solutions and focused it on ETL and Kettle, AKA Pentaho Data Integration. Their magic formula is to seamlessly mix a product users guide with equal parts of real world examples and best practices training. With the addition of Matt Casters, Mr Kettle himself, the depth of knowledge in the book is now equal to it's breadth. The result is a book that you can read cover to cover and learn about all aspects of building and deploying ETL solutions, and is equally useful as a day to day reference.
The book is divided into five parts starting with an obligatory Getting Started. Getting Started, however, goes beyond the traditional "here's how to install it guide" and presents a nice tutorial on the sometimes confusing terminology and practices used in the data world. It explains how Kettle fits into this world and talks...Read more
I wanted to do this review much sooner but I've been too busy using the book.Jos and Roland have taken the proven formula they used in Pentaho Solutions and focused it on ETL and Kettle, AKA Pentaho Data Integration. Their magic formula is to seamlessly mix a product users guide with equal parts of real world examples and best practices training. With the addition of Matt Casters, Mr Kettle himself, the depth of knowledge in the book is now equal to it's breadth. The result is a book that you can read cover to cover and learn about all aspects of building and deploying ETL solutions, and is equally useful as a day to day reference.The book is divided into five parts starting with an obligatory Getting Started. Getting Started, however, goes beyond the traditional "here's how to install it guide" and presents a nice tutorial on the sometimes confusing terminology and practices used in the data world. It explains how Kettle fits into this world and talks...Read more