Category: Blog

Platform Extension Framework (PXF): Enabling Parallel Query Processing Over Heterogeneous Data Sources In Greenplum

Authors: Venkatesh Raghavan, Alexander Denissov, Francisco Guerrero, Oliver Albertini, Divya Bhargov, Lisa Owen, Shivram Mani, Lav Jain Abstract: With the explosion of data stores and cloud services, data now resides across many disparate systems and in a variety of formats. When multiple data sets exist in external systems, it is often neces- sary to perform…
Read more

Image Classification in Greenplum Database Using Deep Learning

Authors: Oliver Albertini, Divya Bhargov, Alexander Denissov, Francisco Guerrero, Nandish Jayaram, Nikhil Kak, Ekta Khanna, Orhan Kislal, Arun Kumar, Frank McQuillan, Lisa Owen, Venkatesh Raghavan, Domino Valdano, Yuhao Zhang Abstract: Artificial neural networks can be used to create highly accurate models in domains such as language processing and image recog- nition. For example, convolutional neural…
Read more

Bottom-up Join Enumeration in a Top-down Optimizer

Authors: Bhuvnesh Chaudhary, Hans Zeller, Sambitesh Dash, Venkatesh Raghavan MAPBU, VMWare Palo Alto CA, USA Abstract: Greenplum Database is a massively parallel processing (MPP) analytics database that adopts a shared-nothing architecture with multiple cooperating processors. A query submitted to the Greenplum master is optimized by the Orca query optimizer [7] which is based on state-of-the-art…
Read more

Multi-temperature data querying from heterogeneous data stores with Greenplum and PXF

Often in businesses, it is hard to fit all data into a single store. Data that is old and not accessed often (cold data) is generally archived and placed in long-term stores like a data lake or an S3 object store. Data that is recent and prone to frequent access (hot data) is stored in…
Read more

Greenplum Backups To S3

Recently I had the opportunity to demonstrate the ability to use Amazon S3 as the landing spot for backups of Greenplum. I thought that the steps involved in creating incremental backups of Greenplum to S3 would be of interest to many. Greenplum backups can be performed to any folder on a mount / filesystem as…
Read more

Using a Virtualized, Open Source Data Platform on AWS

Co-Authored by Ji Lim and Maurice Martin On April 2nd, 2020 VMware Tanzu Data and Amazon Web Services (AWS) participated in a joint webinar detailing the capabilities and benefits of running advanced analytics and data science models in Greenplum on AWS.  Our collective teams partnered to deliver a series of short presentations and demos to…
Read more

GPCC 6.0 Highlights

Greenplum Command Center (GPCC) is the single application needed by database administrators to manage and monitor Pivotal Greenplum.  In this post I will talk about some new changes that GPCC users should be aware of with the recent 6.0 release of GPCC that is designed to work with Pivotal Greenplum version 6. Turn on query…
Read more

Large Chinese Bank’s Successful TD to GP Migration

Today I used Google Translate to read this phenomenal success case of TD to GP by one of the world’s largest banks with CN¥23.22 trillion $3.375 trillion of total assets (2018). They are based in China and have a good relation with the GP community. The full text of the article in Chinese is here…
Read more

Procedure for Backup methods in Greenplum Database

Purpose of the Document Procedure for Greenplum Database Backup on any DB versions of Greenplum. Procedure ================================================================ Checking Disk Space Usage ================================================================ Before taking a backup of each schema’s just check the DB size of each schema from Greenplum database. Login in to server as a root user. Switch the user to gpadmin    :    su…
Read more

User and Schema Creation in Greenplum Database

Purpose of the Document Procedure for Creation of User / Database (Schema) on any DB versions of Greenplum. Procedure ================================================================      Create User Accounts ================================================================     Use the following procedure to create user account. (i)Login in to the server as a toot user (ii) switch in to gpadmin user (iii) Run the command :  psql postgres (iv)…
Read more