您可以使用expr函数
>>> from pyspark.sql.functions import substring, length, col, expr>>> df = df.withColumn("flower",expr("substring(name, 1, length(name)-5)"))>>> df.show()+--------------+----+---------+| name|year| flower|+--------------+----+---------+| rose_2012|2012| rose|| jasmine_2013|2013| jasmine|| lily_2014|2014| lily|| daffodil_2017|2017| daffodil||sunflower_2016|2016|sunflower|+--------------+----+---------+


