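This article is translated from the official Spark documentation. Spark 3.2 introduced the pandas API on Spark, a heavyweight tool for data science that looks set to dominate the field.
Let's work through it together.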
Quickstart: Pandas API on Spark
This is a short introduction to pandas API on Spark, geared mainly toward new users. This notebook shows you some key differences between pandas and pandas API on Spark. You can run these examples yourself in the 'Live Notebook: pandas API on Spark' on the quickstart page.
Customarily, we import pandas API on Spark as follows:
import pandas as pd
imp