redshift and MPP

本文深入探讨了MPP数据库及其并行处理架构,特别关注Amazon Redshift作为基于PostgreSQL的MPP数据仓库解决方案。文章解释了Redshift如何通过采用列式存储改进数据查询效率,并介绍了其节点类型,包括领导者节点和计算节点的工作原理。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

MPP database

Massive Parallel Processing (MPP) database is a type of database that scales horizontally. MPP dbs adopted share-nothing architecture in that every “node” will maintain its own CPU, storage, etc. A query will be processed by multiple nodes in parallel and the results will be combined. In the early days, Teradata was the dominant vendor of MPP databases. Each node is a “database-like” program called AMP. Later on there are more MPP dbs. Most notable ones are Greenplum and Redshift. Both are based on PostgreSQL as basic nodes but both changed postgreSQL to columnar DB, whereas the regular postgreSQL is a row-based database. Another famous MPP and columnar database is Vertica, which originated from C-store.

Redshift

Redshift is Amazon’s version of MPP database and data warehouse (BI) that based on PostgreSQL 8.0.2. Since Redshift keeps the same interface as PostgreSQL, it is easy for customers to migrate their existing workload from PostgreSQL to Redshift.

There are several types of nodes: leader nodes, computer nodes. A computer nodes has dedicated CPU, disk resources and the resources are divided into node slices. The rows are distributed to node slices based on a distribution key. Then the leader node will distribute the work to node slices.

https://docs.aws.amazon.com/redshift/latest/dg/c_internal_arch_system_operation.html

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值