Category

Blogs

gpsupport: Support Utility for VMware Greenplum

Author | Nihal Jain

Troubleshooting and identifying supportability issues in a complex database system can be a daunting task. However, with the advent of tools like gpsupport, the process has become much simpler and more efficient. gpsupport is a powerful diagnostic utility provided by VMware Greenplum that enables…

Improving Greenplum Upgrade Performance

Authors | Kevin Yeap & Brent Doil

Introduction

Greenplum Upgrade (gpupgrade) is a utility that allows in-place upgrades from Greenplum Database (GPDB) 5.x to 6.x. Version 1.7.0 boasts a significant performance improvement compared to earlier releases. The improvement resulted from several key optimizations that were implemented while…

5 Essential Tips for Managing Greenplum Databases

Greenplum Database is a massively parallel processing (MPP) database designed for high-performance analytics and data warehousing. Like other MPP databases, it requires routine query tuning, resource allocation adjustments, and data protection. This post covers five essential tips for managing Greenplum effectively.

10 Examples of Compression Enablement in Greenplum

Author | Wen Lin

Introduction

Greenplum is an open-source, massively parallel data warehouse designed for analytics and AI applications. Efficient data compression is vital in Greenplum because it reduces storage space and storage costs and can improve query performance. Greenplum offers several techniques for compressing data.

Parallel Restore and Partitioned Tables in GPDB

Author | Andrew Repp

The release of Greenplum Database 7 (GPDB7) brings with it many new features, and each of them requires some thought to make sure we get the most out of them. Today, I want to talk about the tweaks we on the Greenplum Kernel team…

How to Play Python3 with GPDB6

We recently released GreenplumPython, a Python library that allows users to interact with Greenplum or PostgreSQL in a Pythonic way. GreenplumPython provides a pandas-like table API that is familiar and intuitive to Python users. GreenplumPython is powerful for performing complex analyses such as statistical analysis with…

How to implement TPC-H queries with GreenplumPython

A quick demonstration and examples.

TPC-H benchmark

TPC-H is a benchmark developed to evaluate the performance of large-scale SQL and relational databases by executing sets of queries. It runs 22 queries against a standard database under controlled conditions. These queries give answers to real-world business questions…
