In the first part of this tutorial, we set up all the connections required for creating the job; now we can proceed with the data import. Let’s drag and drop an object named tMap into the visual editor. You can find it on the left, in the tool palette, inside the “elaboration” folder.
In this article we are going to show you how to write PL/Java functions in Greenplum. I assume that you have a working Greenplum (or Greenplum Community Edition) at your disposal. In this example we will use version **4.0.4**, installed in /usr/local/greenplum-db-220.127.116.11 (which is the default location).
When working with databases, one of the most common tasks is to load data from one or more CSV files. Several tools are available to achieve this task. Some are executed via command line, like COPY (using psql), some are more complex, like ETL systems. We will start today with Talend but, in the next weeks, […]
[*MADlib*](http://madlib.net) is an open-source library for scalable in-database analytics which targets the PostgreSQL and the Greenplum databases. MADlib version 0.2beta needs to be installed properly to follow this article, so we encourage you to read the [official documentation](http://github.com/madlib/madlib/wiki/Installation-Guide-%28v0.2beta%29) to install it in a Greenplum database. I’m going to show you how to perform Association Rules […]
Because of PostgreSQL Conference Europe, I had to reschedule the German training sessions. The next training will be the 4-Day Administration, Performance, Streaming Replication Training. There are still a few seats left. Schedule: October 7 – 10, 2011. Location: Bielefeld. Come to the nice East Westphalian town and join our training. Register now! Detailed information in German […]
Greenplum Community Edition is available in different flavours, including a VMware virtual machine based on CentOS with all the fancy tools and the documentation already installed. This allows you to easily try and evaluate this powerful platform for data warehousing. [Greg Smith from our 2ndQuadrant team recently explained how to install this image on Linux](http://www.greenplum.com/community/forums/showthread.php?486-Getting-Started-with-VMWare-on-Linux). […]
Picking back up this week’s theme of where you can publicize your PostgreSQL-related project, you’re probably reading this blog entry because it appeared on the Planet PostgreSQL blog aggregator. There are “Planet” feeds around many open-source projects. The Debian and GNOME ones spawned off the Planet software, which now powers a ton of […]
The software license PostgreSQL is released under makes it extremely friendly to businesses that would like to use the database in commercial products. Partly as a result of this, a significant amount of PostgreSQL development is donated by companies that sell products derived from the database (even entire forks of the source code). Normally this […]
One of the coolest features that Greenplum offers to data warehousing and business intelligence operators, as far as ETL is concerned, is the combination of read-only external tables with gpfdist, Greenplum’s parallel file distribution server. The typical use case for this solution is parallel data loading of text files (coming from heterogeneous sources – […]
During EuroPython 2011, the major annual event for Python developers and users in Europe, 2ndQuadrant will deliver a special hands-on training session entitled “The Python and the Elephant”. This 4-hour workshop will take place on Thursday June 23 and will cover the two main techniques for writing applications in Python for PostgreSQL: standard client applications […]