2ndQuadrant is now part of EDB

Bringing together some of the world's top PostgreSQL experts.

2ndQuadrant | PostgreSQL
Mission Critical Databases
  • Contact us
  • EN
    • FR
    • IT
    • ES
    • DE
    • PT
  • Support & Services
  • Products
  • Downloads
    • Installers
      • Postgres Installer
      • 2UDA – Unified Data Analytics
    • Whitepapers
      • Business Case for PostgreSQL Support
      • Security Best Practices for PostgreSQL
    • Case Studies
      • Performance Tuning
        • BenchPrep
        • tastyworks
      • Distributed Clusters
        • ClickUp
        • European Space Agency (ESA)
        • Telefónica del Sur
        • Animal Logic
      • Database Administration
        • Agilis Systems
      • Professional Training
        • Met Office
        • London & Partners
      • Database Upgrades
        • Alfred Wegener Institute (AWI)
      • Database Migration
        • International Game Technology (IGT)
        • Healthcare Software Solutions (HSS)
        • Navionics
  • Postgres Learning Center
    • Webinars
      • Upcoming Webinars
      • Webinar Library
    • Whitepapers
      • Business Case for PostgreSQL Support
      • Security Best Practices for PostgreSQL
    • Blog
    • Training
      • Course Catalogue
    • Case Studies
      • Performance Tuning
        • BenchPrep
        • tastyworks
      • Distributed Clusters
        • ClickUp
        • European Space Agency (ESA)
        • Telefónica del Sur
        • Animal Logic
      • Database Administration
        • Agilis Systems
      • Professional Training
        • Met Office
        • London & Partners
      • Database Upgrades
        • Alfred Wegener Institute (AWI)
      • Database Migration
        • International Game Technology (IGT)
        • Healthcare Software Solutions (HSS)
        • Navionics
    • Books
      • PostgreSQL 11 Administration Cookbook
      • PostgreSQL 10 Administration Cookbook
      • PostgreSQL High Availability Cookbook – 2nd Edition
      • PostgreSQL 9 Administration Cookbook – 3rd Edition
      • PostgreSQL Server Programming Cookbook – 2nd Edition
      • PostgreSQL 9 Cookbook – Chinese Edition
    • Videos
    • Events
    • PostgreSQL
      • PostgreSQL – History
      • Who uses PostgreSQL?
      • PostgreSQL FAQ
      • PostgreSQL vs MySQL
      • The Business Case for PostgreSQL
      • Security Information
      • Documentation
  • About Us
    • About 2ndQuadrant
    • 2ndQuadrant’s Passion for PostgreSQL
    • News
    • Careers
    • Team Profile
  • Blog
  • Menu Menu
You are here: Home1 / Blog2 / Data Mining3 / 2UDA RC1 – New features in Orange (Part 1)
Lan Zagar

2UDA RC1 – New features in Orange (Part 1)

December 28, 2015/2 Comments/in Data Mining /by Lan Zagar

The 2UDA installation package was updated recently to include the newly released PostgreSQL 9.5 RC1. Also found in the new package is an updated version of Orange bringing some new features, improvements, and bug fixes. Summary of the more noticeable changes can be found in 2UDA release notes.

In this first of a series of posts, I will explore changes related to working on data stored in PostgreSQL databases: logging, approximate preprocessing, materializing queries, and schema selection. There are lots of other features to talk about, stay tuned to read the subsequent blogs.

Logging Let’s start with the new logging functionality. For anyone interested in exploring larger databases or performing more complex analyses with 2UDA, it is now much easier to see where most of the time is being spent. From now on, Orange will log all SQL queries and their execution times into a log file. This can be analysed later to help us optimize the database structure or make the most of what is available and avoid bottlenecks. It should prove useful for a range of people from the developers, administrators, and to the end users.

Approximate preprocessing The benefits of sampling made possible by the new feature of PostgreSQL 9.5, the TABLESAMPLE clause, are finding their way into more and more components. This can greatly improve the experience on large data where a single pass over all rows takes too long. Discretize and Impute preprocessors now execute on 1 second samples of data so they can be used automatically in interactive widgets, without sending the user for another cup of coffee (or on holidays), waiting for the results. To split a numerical variable into a number of discrete intervals, it is not necessary to compute the thresholds from billions of values – a representative sample of thousands of values should be just as good.

Orange - Discretize widget

Fast discretization using data sampling

Materializing queries Another addition in the new version is related to the existing option of using custom SQL queries as input data. Normally Orange works on single tables containing data instances (rows) described with features (columns). However, the user has the option to input a (possibly complex) custom SQL query instead, which compiles the feature descriptions from many tables using e.g. joins and aggregate functions. A new option has been added to save the results to a table directly from Orange and use the stored results instead of repeating the (slower) query in many computations. A user that doesn’t mind some direct database work could create a materialized view there instead, but for others it is a convenient feature available directly from Orange.

Schema selection When using databases with many schemas and tables, it is now easier to select the desired table to be used in Orange by specifying the schema in addition to other database connection parameters. Before, Orange always showed a drop-down menu with all tables from schemas on the user’s search_path. Now, optionally specifying a specific schema can list tables that were previously not visible or filter the list to relevant tables only.

You can download the 2UDA package here.

Part 2 describes the new Color widget and reporting functionality.

Tags: 2UDA, Orange
Share this entry
  • Share on Facebook
  • Share on Twitter
  • Share on WhatsApp
  • Share on LinkedIn
2 replies
  1. Dave
    Dave says:
    January 14, 2016 at 10:04 am

    Will this software work with postgres extensions such as postgis which adds a lot of extra sql functions and a new datatype (a geometry)

    Reply
    • Lan Zagar
      Lan Zagar says:
      January 14, 2016 at 10:23 am

      Having additional extensions should not cause any compatibility problems. But working with new data types directly will probably not work out of the box – Orange expects tables with columns containing standard data types (numerical types, strings, …).

      However, if you can construct a SELECT statement that uses e.g. a table of geometry objects and extracts 2 numerical values (x and y coordinates) then Orange can load that data and use it (e.g. show distributions of those values, visualize scatter plots, etc.)
      If you want to try this, you can select the “Custom SQL” option in the SQL Table widget and input the SELECT statement directly.

      Reply

Leave a Reply

Want to join the discussion?
Feel free to contribute!

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

Get in touch with us!

Recent Posts

  • Random Data December 3, 2020
  • Webinar: COMMIT Without Fear – The Beauty of CAMO [Follow Up] November 13, 2020
  • Full-text search since PostgreSQL 8.3 November 5, 2020
  • Random numbers November 3, 2020
  • Webinar: Best Practices for Bulk Data Loading in PostgreSQL [Follow Up] November 2, 2020

Featured External Blogs

Tomas Vondra's Blog

Our Bloggers

  • Simon Riggs
  • Alvaro Herrera
  • Andrew Dunstan
  • Craig Ringer
  • Francesco Canovai
  • Gabriele Bartolini
  • Giulio Calacoci
  • Ian Barwick
  • Marco Nenciarini
  • Mark Wong
  • Pavan Deolasee
  • Petr Jelinek
  • Shaun Thomas
  • Tomas Vondra
  • Umair Shahid

PostgreSQL Cloud

2QLovesPG 2UDA 9.6 backup Barman BDR Business Continuity community conference database DBA development devops disaster recovery greenplum Hot Standby JSON JSONB logical replication monitoring OmniDB open source Orange performance PG12 pgbarman pglogical PG Phriday postgres Postgres-BDR postgres-xl PostgreSQL PostgreSQL 9.6 PostgreSQL10 PostgreSQL11 PostgreSQL 11 PostgreSQL 11 New Features postgresql repmgr Recovery replication security sql wal webinar webinars

Support & Services

24/7 Production Support

Developer Support

Remote DBA for PostgreSQL

PostgreSQL Database Monitoring

PostgreSQL Health Check

PostgreSQL Performance Tuning

Database Security Audit

Upgrade PostgreSQL

PostgreSQL Migration Assessment

Migrate from Oracle to PostgreSQL

Products

HA Postgres Clusters

Postgres-BDR®

2ndQPostgres

pglogical

repmgr

Barman

Postgres Cloud Manager

SQL Firewall

Postgres-XL

OmniDB

Postgres Installer

2UDA

Postgres Learning Center

Introducing Postgres

Blog

Webinars

Books

Videos

Training

Case Studies

Events

About Us

About 2ndQuadrant

What does 2ndQuadrant Mean?

News

Careers 

Team Profile

© 2ndQuadrant Ltd. All rights reserved. | Privacy Policy
  • Twitter
  • LinkedIn
  • Facebook
  • Youtube
  • Mail
Joins Don’t Scale! Why Logical Replication?
Scroll to top
×