Similar to @locojay's suggestion, you can apply difflib's get_close_matches to df2's index and then apply a join:
    In [23]: import difflib

    In [24]: difflib.get_close_matches
    Out[24]: <function difflib.get_close_matches>

    In [25]: df2.index = df2.index.map(lambda x: difflib.get_close_matches(x, df1.index)[0])

    In [26]: df2
    Out[26]:
          letter
    one        a
    two        b
    three      c
    four       d
    five       e

    In [31]: df1.join(df2)
    Out[31]:
           number letter
    one         1      a
    two         2      b
    three       3      c
    four        4      d
    five        5      e
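The session above does not show how df1 and df2 were built, so here is a minimal self-contained sketch of the same index approach. The sample frames are an assumption, reconstructed from the outputs and from the misspelled labels ('too', 'fours') used in the column example:

```python
import difflib

import pandas as pd

# Assumed sample data: df2's index carries slightly misspelled labels.
df1 = pd.DataFrame({"number": [1, 2, 3, 4, 5]},
                   index=["one", "two", "three", "four", "five"])
df2 = pd.DataFrame({"letter": ["a", "b", "c", "d", "e"]},
                   index=["one", "too", "three", "fours", "five"])

# Replace each df2 label with its closest df1 label.
candidates = list(df1.index)
df2.index = df2.index.map(
    lambda x: difflib.get_close_matches(x, candidates)[0]
)

# The indexes now line up, so a plain join works.
joined = df1.join(df2)
print(joined)
```

Note that get_close_matches uses a default similarity cutoff of 0.6, so the [0] lookup assumes every label has at least one candidate above that threshold.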
If these were columns, in the same vein you could apply it to the column and then merge:
    import difflib
    from pandas import DataFrame

    df1 = DataFrame([[1,'one'],[2,'two'],[3,'three'],[4,'four'],[5,'five']], columns=['number', 'name'])
    df2 = DataFrame([['a','one'],['b','too'],['c','three'],['d','fours'],['e','five']], columns=['letter', 'name'])

    # Map each name in df2 to its closest match in df1, then merge on the shared 'name' column.
    df2['name'] = df2['name'].apply(lambda x: difflib.get_close_matches(x, df1['name'])[0])
    df1.merge(df2)
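One caveat with the [0] lookup: get_close_matches returns an empty list when no candidate reaches the cutoff, so indexing it raises IndexError. A possible guard, as a sketch only (closest_match is a hypothetical helper name, not part of difflib or pandas):

```python
import difflib


def closest_match(name, candidates, cutoff=0.6):
    """Return the candidate most similar to name, or None if none reaches cutoff."""
    # n=1 asks for only the single best match; cutoff=0.6 is difflib's default.
    matches = difflib.get_close_matches(name, list(candidates), n=1, cutoff=cutoff)
    return matches[0] if matches else None


names = ["one", "two", "three", "four", "five"]
fixed = [closest_match(x, names) for x in ["one", "too", "three", "fours", "five", "xyz"]]
print(fixed)
```

With the frames above you could then write df2['name'] = df2['name'].apply(lambda x: closest_match(x, df1['name'])). Names that come back as None have no partner in df1, so the default inner merge would leave those rows out instead of failing.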



