T296814 image suggestions optimisation
If we write a dataframe to hive and want to use it later, it's quicker to read the data back from hive than to use the original dataframe. Don't know why, but it works
Also, writing search index data one wiki at a time means we end up with 70k partitions in the search_index_* tables, so do it all at once instead