您可以尝试
str.extract和
strip,但最好使用
str.split,因为电影名称中也可以是数字。下一个解决方案是
replace括号的内容
regex以及
strip前导空格和尾随空格:
#convert column to stringdf['movie_title'] = df['movie_title'].astype(str)#but it remove numbers in names of movies toodf['titles'] = df['movie_title'].str.extract('([a-zA-Z ]+)', expand=False).str.strip()df['titles1'] = df['movie_title'].str.split('(', 1).str[0].str.strip()df['titles2'] = df['movie_title'].str.replace(r'([^)]*)', '').str.strip()print df movie_title titles titles1 titles20 Toy Story 2 (1995) Toy Story Toy Story 2 Toy Story 21 GoldenEye (1995) GoldenEye GoldenEye GoldenEye2 Four Rooms (1995) Four Rooms Four Rooms Four Rooms3 Get Shorty (1995) Get Shorty Get Shorty Get Shorty4 Copycat (1995) Copycat Copycat Copycat


