From wikipedia, Apache Parquet is a free and open-source column-oriented data storage format of the Apache Hadoop ecosystem.
The open-source project to build Apache Parquet began as a joint effort between Twitter and Cloudera. Parquet was designed as an improvement upon the Trevni columnar storage format created by Hadoop creator Doug Cutting. The first version – Apache Parquet 1.0 – was released in July 2013. Since April 27, 2015, Apache Parquet is a top-level Apache Software Foundation-sponsored project.

最低0.47元/天 解锁文章
8421

被折叠的 条评论
为什么被折叠?



