Category

Greenplum

Introduction of Readable External Protocol of gpfdist

As the fundamental of all ETL operation of Greenplum, it worth explaining a little more  about the detail of gpfdist to understand why it is faster than other tools and how could we improve in future. This blog will focus on the detail of communication of readable external

Read More »

Graphing Orlando IoT Temperature Sensor Readings

I wondered what temperatures in Orlando have done over this last week. You see I just happen to have a set of IoT devices which are streaming data that I persist into an archive. One of those sensors is on a covered patio in Orlando, so it would

Read More »

Introduction to Greenplum ETL tool – Overview

Why ETL is important for Greenplum As a data warehouse product of future, Greenplum is able to process huge set of data which is usually in petabyte level, but Greenplum can’t generate such number of data by itself. Data is often generated by millions of users or embedded

Read More »

On-Demand Machine Learning

Achieving Machine Learning Nirvana By Shailesh Doshi Recently, I have been in multiple discussions with clients who want to achieve consistent operationalized data science and machine learning pipelines while the business demands more ‘on-demand’ capability. Often the ‘on-demand’ conversation starts with ‘Apache Spark’ type usage for analytics use

Read More »

Meetup: Introducing Greenplum 5.0

Wednesday, September 20th, 2017 6:00 PM PST Pivotal 875 Howard St., 5th Floor, San Francisco, CA (map) Greenplum 5.0 is a commercially available and open source Data Warehouse. This is the next milestone for the Greenplum community since Greenplum was officially open sourced in October of 2015.

Read More »
You've reached the end of this page.