Massively Parallel Postgres
for Analytics

An open-source massively parallel data platform for analytics, machine learning and AI 

All The Reasons to Choose Greenplum

Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas.

内核强大

Power at Scale
Petabyte-scale Data Volumes

With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput.

灵活稳定

True Flexibility
Deploy Anywhere

Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor lock-in, and allowing open influence on product direction.

机器学习

From BI to AI
All In One Environment

Greenplum reduces data silos by providing you with a single, scale-out environment for converging analytic and operational workloads, like streaming ingestion.

开源敏捷

Open Source
Avoid Propietary Vendor Lock-in

All major Greenplum contributions are part of the Greenplum Database project and share the same database core, including the MPP architecture, analytical interfaces, and security capabilities.

DB Architecture

MPP Architecture, Petabyte-Scale Loading

All major Greenplum contributions are part of the Greenplum Database project and share the same database core, including the MPP architecture, analytical interfaces, and security capabilities.

Federated Data Access

Query external data sources with the Greenplum optimizer and query processing engine. Including Hadoop, Cloud Storage, ORC, AVRO, Parquet and other Polyglot data stores.

Cloud Database
Data Storage

Polymorphic Data Storage

Fully control the configuration for your table and partition storage, execution, and compression. Design your tables based on the way data is accessed. Users have the choice of row or column-oriented storage and processing for any table or partition.

Integrated In-Database Analytics

Tackle data science from experimentation to massive deployment with Apache MADlib, the open-source library of in-cluster machine learning functions for the Postgres family of databases. MADlib with Greenplum provides multi-node, multi-GPU and deep learning capabilities.
Database Network
Optimization

Innovation in Query Optimization

The query optimizer available in Greenplum Database is the industry’s first open source cost-based query optimizer designed for big data workloads. It can scale interactive and batch mode analytics to large datasets in the petabytes without degrading query performance and throughput.

Latest Events

Attend the Latest Greenplum Talks, Meetups, and Conferences

2020 Greenplum Summit

Greenplum Summit, a virtual event, kicks off July 29! This online series is where decision makers, data scientists, analysts, DBAs, and developers meet to discuss, share, and shape the future of advanced data technologies.

10 Reasons Why Netezza Professionals Should Consider Greenplum

Listen to Kelly Carrigan from Eon Collective who has spent 13+ years architecting and deploying Netezza machines and Jacque Istok, Pivotal’s Head of Data as they talk about some key themes when making the decision…

READY TO GET START?

Experience the Fully Featured, Integrated, Open Source Analytics platform