1 DryadLINQ, FlumeJava
Similar “distributed collection” API, but cannot reuse datasets efficiently across queries
2 Relational databases
Lineage/provenance, logical logging, materialized views
3 GraphLab, Piccolo, BigTable, RAMCloud
Fine-grained writes similar to distributed shared memory
4 Iterative MapReduce (e.g. Twister, HaLoop)
Implicit data sharing for a fixed computation pattern
5 Caching systems (e.g. Nectar)
Store data in files, no explicit control over what is cached
Similar “distributed collection” API, but cannot reuse datasets efficiently across queries
2 Relational databases
Lineage/provenance, logical logging, materialized views
3 GraphLab, Piccolo, BigTable, RAMCloud
Fine-grained writes similar to distributed shared memory
4 Iterative MapReduce (e.g. Twister, HaLoop)
Implicit data sharing for a fixed computation pattern
5 Caching systems (e.g. Nectar)
Store data in files, no explicit control over what is cached
本文探讨了分布式数据处理、关系数据库、图形处理、迭代式MapReduce、缓存系统等核心概念及其在信息技术领域的应用。
1906

被折叠的 条评论
为什么被折叠?



