Category: Blog

Self-Healing Greenplum – The Doctor Is Always In

Analytics On IaaS Must Think Differently Than It’s On Premise Implementations We have always maintained that having a data platform that is portable is not only one of the key differentiators of Greenplum, but should be a core functional requirement on anyone’s roadmap for how to best architect for their needs.  But doing so should never…
Read more

Introducing Pivotal Greenplum-Spark Connector, Integrating with Apache Spark

Introducing Pivotal Greenplum-Spark Connector, Integrating with Apache Spark We are excited to announce general availability of the new, native Greenplum-Spark Connector. Pivotal Greenplum-Spark Connector combines the best of both worlds – Greenplum, massively parallel processing (MPP) analytical data platform and Apache Spark, in-memory processing with the flexibility to scale elastic workloads. The connector supports Greenplum…
Read more

Introducing gpbackup & gprestore

Earlier this year the Greenplum team embarked down the path to create the next generation backup and restore tooling for the Greenplum Database.   After conducting dozens of customer interviews and reviewing a long list of enhancement requests, two overarching themes emerged:   Performance User Experience   

Install Greenplum OSS on Ubuntu

About Greenplum Database Greenplum Database is an MPP SQL Database based on PostgreSQL.  Its used in production in hundreds of large corporations and government agencies around the world and including the open source has over thousands of deployments globally. Greenplum Database scales to multi-petabyte data sizes with ease and allows a cluster of powerful servers…
Read more

Introduction to Writable External protocol of gpfdist

Gpfdist support both readable external table and writable external table. This blog will introduce how writable gpfdist external table works.

IoT, CEP, storage and NATS in between. Part 1 of 3.

Intro Hello, my name is Dmitry Dorofeev, I’m a software architect working for Luxms Group. We are a team of creative programmers touching technology which moves faster than we can imagine these days. This blog post is about building a small streaming analytics pipeline which is minimalistic, but can be adapted for bigger projects easily.…
Read more

Greenplum Database Tables and Compression

Greenplum Database is built for advanced Data Warehouse and Analytic workloads at scale. Whether the data set is five terabytes on a handful of servers, or over a petabyte in size on a hundred-plus nodes, the architecture of Greenplum allows it to easily grow to meet the data management and concurrent user access requirements of…
Read more

Conquering your database workloads using WLM

Conquering Your Database Workloads Howard Goldberg – Executive Director,  Morgan Stanley,  Head of Greenplum engineering 1  Introduction Everyone has been in some type of traffic delay, usually at the worst possible time. These traffic jams result from an unexpected accident, volume on the roadway, or lane closures forcing a merge from multiple lanes into a…
Read more

Altered States: Greenplum Alter Table Command by Howard Goldberg

A common question that is frequently asked when performing maintenance on Greenplum tables is “Why does my ALTER TABLE add column DDL statement take so long to run?” Although it appears to be a simple command and the expectations are that it will execute in minutes this is not true depending on the table organization…
Read more

Slash Teradata Spend & Modernize

      Setting the Stage Growing up in the enterprise data and analytics marketplace, I’ve had the good fortune to see a number of game-changing technologies born and rise in corporate adoption. In a subset of cases, I’ve seen the same technology collapse just as quickly as it rose. However, Teradata Database is not…
Read more