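Leaving the scrolling part and the JSON aside, I managed to read the data. The key is to read all the elements inside the parent (which is already done in the question):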
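
    # Note: recent Selenium 4 releases removed the find_element_by_* helpers;
    # there the equivalent call is driver.find_element(By.XPATH, xpath).
    parent = driver.find_element_by_xpath('//*[@id="pvExplorationHost"]/div/div/div/div[2]/div/div[2]/div[2]/visual-container[4]/div/div[3]/visual/div')
    children = parent.find_elements_by_xpath('.//*')

Then sort them using their positions: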
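
    import numpy as np

    x = [child.location['x'] for child in children]
    y = [child.location['y'] for child in children]
    # lexsort treats the last key as the primary one: sort by y (row), then by x (column)
    index = np.lexsort((x, y))
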
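To see what `np.lexsort` does here without a browser, this is a small sketch with made-up pixel coordinates (the values are hypothetical, not taken from the actual page). It shows that the last key passed to `lexsort` is the primary sort key, so `(x, y)` gives top-to-bottom, then left-to-right order:

```python
import numpy as np

# Hypothetical pixel positions of five elements, listed in DOM order
x = [120, 10, 10, 120, 65]
y = [40, 40, 10, 10, 10]

# The LAST key (y) is the primary sort key; x only breaks ties within the same y
index = np.lexsort((x, y))

# Positions in reading order, as (y, x) pairs
ordered = [(y[i], x[i]) for i in index]
print(index.tolist())  # [2, 4, 3, 1, 0]
print(ordered)         # [(10, 10), (10, 65), (10, 120), (40, 10), (40, 120)]
```

Passing the keys the other way round, `np.lexsort((y, x))`, would sort column by column instead.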
To group what we read into separate rows, this code might help:
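
    rows = []
    row = []
    last_line = y[index[0]]
    for i in index:
        title = children[i].get_attribute('title')
        if y[i] == last_line:
            # Same y as the previous element: still on the current row
            row.append(title)
        else:
            # y changed: close the current row and start a new one
            rows.append(row)
            row = [title]
            last_line = y[i]
    # Do not forget the last row
    rows.append(row)
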
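The loop above only works if elements on the same visual row report exactly the same `y`. If they are off by a pixel or two (an assumption on my side, I did not check this on the page), a variant with a tolerance may be more robust. The sketch below uses plain `(x, y, title)` tuples instead of Selenium elements so it can be run on its own; `group_into_rows` and `tolerance` are hypothetical names:

```python
def group_into_rows(cells, tolerance=2):
    """Group (x, y, title) tuples into rows of titles.

    Cells whose y is within `tolerance` pixels of the first cell
    of the current row are treated as belonging to that row.
    """
    rows = []
    row_y = None
    # Walk the cells from top to bottom
    for x, y, title in sorted(cells, key=lambda c: c[1]):
        if row_y is None or y - row_y > tolerance:
            # Far enough below the current row: start a new one
            rows.append([])
            row_y = y
        rows[-1].append((x, title))
    # Within each row, order the cells left to right and keep only the titles
    return [[title for _, title in sorted(row, key=lambda c: c[0])]
            for row in rows]


# Hypothetical sample: two rows whose y values differ by one pixel
cells = [(120, 41, "B2"), (10, 40, "A2"), (10, 10, "A1"),
         (120, 11, "B1"), (65, 10, "M1")]
print(group_into_rows(cells))  # [['A1', 'M1', 'B1'], ['A2', 'B2']]
```

With real elements, the tuples could be built as `(child.location['x'], child.location['y'], child.get_attribute('title'))` for each child.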


