编辑:感谢@WesMcKinney显示了这种更直接的方法:
data[data.groupby('tag').pid.transform(len) > 1]import pandasimport numpy as npdata = pandas.Dataframe( {'pid' : [1,1,1,2,2,3,3,3], 'tag' : [23,45,62,24,45,34,25,62], })bytag = data.groupby('tag').aggregate(np.count_nonzero)tags = bytag[bytag.pid >= 2].indexprint(data[data['tag'].isin(tags)])产量
pid tag1 1 452 1 624 2 457 3 62



