How to use pd.cut() in pandas

Updated: 2026-04-02 03:18:42. Published: 1602 days ago.

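import pandas as pd

# Sample data inferred from the printed output below
test = pd.DataFrame({'days': [0, 31, 45]})
# include_lowest=True makes the first interval include its left edge (0)
test['range'] = pd.cut(test.days, [0, 30, 60], include_lowest=True)
print(test)
   days           range
0     0  (-0.001, 30.0]
1    31    (30.0, 60.0]
2    45    (30.0, 60.0]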

Compare the differences:

import pandas as pd

test = pd.DataFrame({'days': [0, 20, 30, 31, 45, 60]})
# Right-closed bins, with the first bin also including its left edge (0)
test['range1'] = pd.cut(test.days, [0, 30, 60], include_lowest=True)
# 30 value is in [30, 60) group
test['range2'] = pd.cut(test.days, [0, 30, 60], right=False)
# 30 value is in (0, 30] group
test['range3'] = pd.cut(test.days, [0, 30, 60])
print(test)
   days          range1    range2    range3
0     0  (-0.001, 30.0]   [0, 30)       NaN
1    20  (-0.001, 30.0]   [0, 30)   (0, 30]
2    30  (-0.001, 30.0]  [30, 60)   (0, 30]
3    31    (30.0, 60.0]  [30, 60)  (30, 60]
4    45    (30.0, 60.0]  [30, 60)  (30, 60]
5    60    (30.0, 60.0]       NaN  (30, 60]
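The interval labels above can be hard to compare at a glance. As a minimal sketch, passing labels=False to pd.cut returns integer bin positions instead of intervals, which makes it easier to see which values fall outside every bin. The column names used here (include_lowest, right_false, default) are illustrative and not part of the original answer.

```python
import pandas as pd

days = pd.Series([0, 20, 30, 31, 45, 60])
bins = [0, 30, 60]

# labels=False returns the position of the bin instead of an Interval label.
# Values that fall outside every bin become NaN.
codes = pd.DataFrame({
    "days": days,
    "include_lowest": pd.cut(days, bins, include_lowest=True, labels=False),
    "right_false": pd.cut(days, bins, right=False, labels=False),
    "default": pd.cut(days, bins, labels=False),
})
print(codes)
```

With these bins, only include_lowest=True assigns every value to a bin: right=False leaves 60 unassigned, and the default leaves 0 unassigned.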

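Alternatively, use numpy.searchsorted. Note that the array being searched (the bin edges, arr in the code that follows) must be sorted in ascending order; the values in days do not need to be sorted: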

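import numpy as np

arr = np.array([0, 30, 60])
# Default side='left': index of the first edge >= value
test['range1'] = arr.searchsorted(test.days)
# side='right' minus 1: index of the last edge <= value
test['range2'] = arr.searchsorted(test.days, side='right') - 1
print(test)
   days  range1  range2
0     0       0       0
1    20       1       0
2    30       1       1
3    31       2       1
4    45       2       1
5    60       2       2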
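To see how the searchsorted result relates to pd.cut, the following sketch puts the side='right' variant next to pd.cut with right=False and labels=False. The variable names (edges, via_search, via_cut, comparison) are illustrative and not from the original answer.

```python
import numpy as np
import pandas as pd

days = pd.Series([0, 20, 30, 31, 45, 60])
edges = np.array([0, 30, 60])  # bin edges, sorted ascending

# Left-closed bins [0, 30) and [30, 60), computed two ways
via_search = edges.searchsorted(days, side="right") - 1
via_cut = pd.cut(days, edges, right=False, labels=False)

comparison = pd.DataFrame(
    {"days": days, "searchsorted": via_search, "cut": via_cut}
)
print(comparison)
```

The two columns agree for values inside the bins. They differ for 60: searchsorted still returns an index (2, one past the last bin), while pd.cut returns NaN because 60 lies outside [30, 60). If out-of-range values matter, they need to be handled explicitly when using searchsorted.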
