np.percentile()函数超详解异常值极端值百分位四分位数

最新推荐文章于 2024-09-25 09:45:00 发布

转载最新推荐文章于 2024-09-25 09:45:00 发布 · 1.9k 阅读

0 ·

CC 4.0 BY-SA版权

原文链接：https://blog.youkuaiyun.com/weixin_40845358/article/details/84638449

文章标签：

#python

本文详细解析了四分位数的计算方法，包括在数列中确定上四分位数的具体步骤，以及如何使用numpy库进行准确计算，避免常见错误。通过实例说明，帮助读者理解四分位数在数据分析中的应用。

20211115

当有空值存在时，四分位数会是空值

20211019

https://www.zhihu.com/question/58421946
https://baike.baidu.com/item/%E5%9B%9B%E5%88%86%E4%BD%8D%E6%95%B0/5040599?fr=aladdin
上四分位数是大的那端

https://jingyan.baidu.com/article/20095761f8299dcb0621b455.html
计算步骤

https://zhuanlan.zhihu.com/p/235345817?utm_source=wechat_session
四分位数
顺序从1开始计数这里的计算也是不对的,最好按numpy默认的配置计算的结果为准

在这里插入图片描述
举例数列 1 2 3 4 5 6 7 8

上四分位=2x(3-2)*0.75=2.75

第一步 (n+1)/4=求的位置

aa=ndarray=[[1,2,3,4,5,6,7,8,9,10,100],[1,2,2,4,4,4,5,5,5,5,100]]
bb=pd.DataFrame(aa)
min=np.percentile(bb,1,axis=1,interpolation=‘higher’)
max=np.percentile(bb,99,axis=1,interpolation=‘higher’)
1,99 是索引的比例位置
higher lower,是取索引的小值还是大值

官方文档入口：https://docs.scipy.org/doc/numpy/reference/generated/numpy.percentile.html

关注博主即可阅读全文