ssavva05 / stackoverflow-neo4j

:earth_asia: Import stackoverflow in neo4j

Import Stack Exchange into Neo4j

Steps:

  • Download the dump from archive.org: https://archive.org/details/stackexchange
  • Extract the community you want into extracted/<name of the community>/, so that Posts.xml & co. are in that directory
    • you can use dtrx on Linux
  • Install xmltodict: sudo pip3 install xmltodict
  • Run python3 to_csv.py extracted/<name of the community> to generate the CSVs in csvs/
  • Run sh import.sh to import the CSVs into Neo4j
    • this assumes that Neo4j is in the ../neo/ directory
    • the script assumes that you want to remove your old database (at the end)
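
To give an idea of what the XML-to-CSV step involves, here is a minimal sketch. It is not the actual to_csv.py: it uses the standard library's xml.etree.ElementTree instead of xmltodict so that it runs without extra dependencies. The function name rows_to_csv and the column list FIELDS are hypothetical, and the columns are only an assumed subset of the attributes found on each <row .../> element in Posts.xml.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Assumed subset of attributes; the real to_csv.py may export different columns.
FIELDS = ["Id", "PostTypeId", "ParentId", "OwnerUserId", "Score", "Title"]


def rows_to_csv(xml_source, csv_file, fields=FIELDS):
    """Stream <row .../> elements from a dump file into CSV and return the row count."""
    writer = csv.writer(csv_file)
    writer.writerow(fields)
    count = 0
    # iterparse reads the file incrementally instead of loading it all at once
    for _event, elem in ET.iterparse(xml_source, events=("end",)):
        if elem.tag == "row":
            # Missing attributes (e.g. ParentId on a question) become empty cells
            writer.writerow([elem.get(name, "") for name in fields])
            count += 1
            elem.clear()  # drop the row's attributes once they are written
    return count


# Tiny in-memory sample standing in for extracted/<name of the community>/Posts.xml
sample = io.BytesIO(
    b'<posts>'
    b'<row Id="1" PostTypeId="1" Score="5" Title="How?" />'
    b'<row Id="2" PostTypeId="2" ParentId="1" Score="3" />'
    b'</posts>'
)
out = io.StringIO()
n = rows_to_csv(sample, out)
print(out.getvalue())
```

For a real dump, xml_source would be the path to Posts.xml and csv_file a file opened with open(path, "w", newline="").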

Look at the scripts before using them to understand what they do :)

Have fun!
