Greenplum Database® is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte-scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer, delivering high analytical query performance on large data volumes.
The Greenplum Database® project is released under the Apache 2 license. We want to thank all our current community contributors and are interested in all new potential contributions. For the Greenplum Database community, no contribution is too small; we encourage all types of contributions. To ensure that the use of the Greenplum Database® trademarks and graphics marks will not lead to confusion, please follow the Greenplum Database trademark guidelines.
The Greenplum Database architecture provides automatic parallelization of all data and queries.
High-performance loading uses MPP technology. Loading speeds scale with each additional node to greater than 10 terabytes per hour, per rack.
The query optimizer available in Greenplum Database is the industry’s first cost-based query optimizer for big data workloads. It can scale interactive and batch mode analytics to large datasets in the petabytes without degrading query performance and throughput.
The table or partition storage, execution, and compression settings can be configured to suit the way data is accessed. Users have the choice of row or column-oriented storage and processing for any table or partition.
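As a rough illustration of choosing storage settings per table, the sketch below assembles a CREATE TABLE statement whose WITH clause selects row or column orientation and optional compression. The helper `build_create_table`, the table name, and the columns are hypothetical and exist only for this example. The storage options shown (`appendonly`, `orientation`, `compresstype`, `compresslevel`) and the DISTRIBUTED BY clause follow Greenplum's CREATE TABLE syntax as commonly documented, but the accepted values vary by version, so treat this as an assumption to check against your release.

```python
def build_create_table(name, columns, orientation="row",
                       compresstype=None, compresslevel=None,
                       distributed_by=None):
    """Return a CREATE TABLE statement with Greenplum-style storage options.

    Hypothetical helper for illustration only: identifiers are not quoted
    or validated, so do not feed it untrusted input.
    """
    if orientation not in ("row", "column"):
        raise ValueError("orientation must be 'row' or 'column'")

    # Column definitions, e.g. "id bigint, amount numeric".
    cols = ", ".join(f"{col} {typ}" for col, typ in columns)

    # Storage options go in the WITH clause; column orientation applies
    # to append-optimized tables, hence appendonly=true.
    options = ["appendonly=true", f"orientation={orientation}"]
    if compresstype:
        options.append(f"compresstype={compresstype}")
    if compresslevel is not None:
        options.append(f"compresslevel={compresslevel}")

    ddl = f"CREATE TABLE {name} ({cols}) WITH ({', '.join(options)})"
    if distributed_by:
        # The distribution key decides how rows are spread across segments.
        ddl += f" DISTRIBUTED BY ({distributed_by})"
    return ddl + ";"


# Example: a column-oriented, zlib-compressed table (names are made up).
ddl = build_create_table(
    "sales_facts",
    [("id", "bigint"), ("amount", "numeric")],
    orientation="column",
    compresstype="zlib",
    compresslevel=5,
    distributed_by="id",
)
print(ddl)
```

Column orientation tends to suit analytical scans that read a few columns from many rows, while row orientation tends to suit workloads that read or update whole rows; the same options can also be set per partition.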
Analytics are provided by Apache MADlib (incubating), a library for scalable in-database analytics that extends the SQL capabilities of Greenplum Database through user-defined functions.
Greenplum Database is the first massively parallel open source data warehouse. It is forever changing the data warehouse market, and we welcome all contributors who want to be part of this change. Below are the ways you can get involved with Greenplum. Development contributions are encouraged, but you don't have to be a developer to take part.
Use the email@example.com mailing list to ask questions about installation, configuration, usage, product documentation, or any other area where you need help. Feel free to send us links to your blogs and presentations so we can highlight them on greenplum.org. You can also take part in the Greenplum discussions on Stack Overflow.
Do you have an idea for a new feature or bug fix for Greenplum? Please discuss it on the firstname.lastname@example.org mailing list or open a pull request on GitHub.
Are you a Greenplum expert? Want to share your knowledge with others? We are a collaborative community that shares best practices.
Write an email to email@example.com.
Apache MADlib (incubating) is a SQL-based advanced analytics and machine learning library that works with the Greenplum database.
Mailing list for the Greenplum user community.
Mailing list for the Greenplum developer community.
Mailing list for all major Greenplum product announcements.
This includes every new release and any other critical announcements.
Mailing list to receive commit notifications by email.
Mailing list for all Greenplum-related jobs. The list is open to anyone who wants to announce jobs.
Mailing list for a modular query optimizer for big data.