Massively Parallel PostgreSQL

for Analytics

An open-source massively parallel data platform for analytics, machine learning and AI 

All The Reasons to Choose Greenplum

Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas.

flexibility

True Flexibility

Develop Anywhere

Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor lock-in, and allowing open influence on product direction.

scale

Power at Scale

Petabyte-Scale Data Volumes

With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput.

ai

From BI to AI

All-in-One Environment

Greenplum reduces data silos by providing you with a single, scale-out environment for converging analytic and operational workloads, like streaming ingestion.

open

Open Source

Avoid Proprietary Vendor Lock-in

All major Greenplum contributions are part of the Greenplum Database project and share the same database core, including the MPP architecture, analytical interfaces, and security capabilities.

Top Features of Greenplum Database®

home mpp

MPP Architecture

All major Greenplum contributions share the same database core, including the MPP architecture, analytical interfaces, and security capabilities.

home federated

Federated Data Access

Query external data sources with the Greenplum optimizer and query processing engine, including Hadoop, Cloud Storage, ORC, AVRO, Parquet and other Polyglot data stores.

home poly

Polymorphic Data Storage

Fully control the configuration for your table and partition storage, execution, and compression. You can choose row or column-oriented storage and processing for any table or partition.

home analytics

Integrated In-Database Analytics

Tackle data science from experimentation to massive deployment with Apache MADlib, the open-source library of in-cluster machine learning functions for the PostgreSQL family of databases.

home query

Innovation in Query Optimization

Using the first open source cost-based query optimizer to scale interactive and batch mode analytics of large datasets in the petabytes without degrading query performance and throughput.

Commercial Greenplum Features

Easy Handling of Streaming Data

Get fast event processing and integrate cloud data by querying Amazon S3 objects in place. VMware Greenplum includes Kafka integration certified by Confluent.

Security and Disaster Recovery

Address regulatory requirements with security and authentication features, plus high availability, intelligent fault detection, backup and disaster recovery.

VMware-Certified Blueprint

Use Dell Greenplum Reference Architecture for optimal on-premises deployment. Or use HP- or Cisco-certified configurations or your own commodity hardware.

Learn more about GPDB Documentation

Report security vulnerability to GPDB Security

Contact Us

Interact with us on GPDB Slack

Post questions on GPDB Stack Overflow