Scenario: We have a remote data source served by a gpfdist server. We need to import the data into a Greenplum database while performing some ETL manipulation during the import. Kettle can accomplish this with a simple transformation in a few steps.
In the first part of this article we created a job and a database connection, and defined the flow in Kettle. In the second part we’ll see how Kettle manages the data import from the CSV files.
I recently showed you how to import data from a CSV file into a Greenplum database using Talend Community Edition. In this article I’m going to perform the same task using another ETL tool, Kettle.