I have a personal wiki with no index to organize it, so I decided to generate one. Pages are grouped by clustering (scikit-learn) over RoBERTa features combined with character-level vectorization, which helps when clustering multilingual pages. The resulting index is written in Markdown to index_output.md.
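To give a rough idea of the character-level part, here is a minimal sketch, not the actual script. It assumes pages are available as a `title -> text` dict, uses `TfidfVectorizer` with character n-grams and `KMeans` as stand-ins for whatever vectorizer and clustering algorithm the project really uses, and leaves out the RoBERTa features entirely. The function names `cluster_pages` and `render_index` are hypothetical.

```python
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer


def cluster_pages(pages, n_clusters):
    """Group wiki pages (title -> text) by character n-gram similarity.

    Assumption: KMeans and char_wb n-grams are illustrative choices only.
    """
    titles = list(pages)
    texts = [pages[title] for title in titles]

    # Character n-grams need no language-specific tokenizer, which is
    # why they are a reasonable fit for multilingual pages.
    vectorizer = TfidfVectorizer(analyzer="char_wb", ngram_range=(2, 4))
    features = vectorizer.fit_transform(texts)

    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=0)
    labels = model.fit_predict(features)

    clusters = {}
    for title, label in zip(titles, labels):
        clusters.setdefault(int(label), []).append(title)
    return clusters


def render_index(clusters):
    """Render clusters as a Markdown index, one heading per cluster."""
    lines = []
    for label in sorted(clusters):
        lines.append(f"## Cluster {label}")
        lines.extend(f"- {title}" for title in sorted(clusters[label]))
        lines.append("")
    return "\n".join(lines)
```

Calling `render_index(cluster_pages(pages, n_clusters=5))` would produce text that could be written to a file such as index_output.md. Sentence embeddings, if used, could be stacked next to the character features before clustering.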
TODO: Add automatic cluster naming.