RabbitJackTrade / duke

duke - Dataset Understanding via Knowledge-base Embeddings


  1. Download the wiki2vec model (English Wikipedia) via the torrent described at https://github.com/idio/wiki2vec, untar it at a location of your choice, and note the path.

  2. Clone this repository, open main.py in the scripts/ subdirectory, and set the embedding_path variable to the wiki2vec model location from step 1.

  3. Using Python 3.5+, install the required libraries listed in requirements.txt with pip3.

  4. Install the Duke package with pip:

pip3 install git+https://github.com/NewKnowledge/duke

  5. Identify a dataset of interest and specify its path in main.py (via the dataset_path variable).

  6. Finally, run it as follows, and enjoy the magic:

python3 main.py
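
Since the steps above depend on two hand-edited paths, a quick pre-flight check can catch a typo before main.py starts loading the model. The snippet below is only a sketch and is not part of this repository: embedding_path and dataset_path are the variable names mentioned above, while the path values and the check_paths helper are hypothetical placeholders.

```python
import os

# Hypothetical placeholder values; replace them with the paths you set in main.py.
embedding_path = "/path/to/wiki2vec/model"
dataset_path = "/path/to/dataset.csv"


def check_paths(paths):
    """Return the names of the entries in `paths` whose path does not exist."""
    return [name for name, path in paths.items() if not os.path.exists(path)]


if __name__ == "__main__":
    missing = check_paths(
        {"embedding_path": embedding_path, "dataset_path": dataset_path}
    )
    if missing:
        print("Fix these variables in main.py before running it: " + ", ".join(missing))
    else:
        print("Both paths exist.")
```

This only confirms that the paths exist on disk; it does not verify that the model file is a valid wiki2vec model or that the dataset is in a format Duke accepts.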

License: MIT License


Languages

Python 94.8%, Dockerfile 5.0%, Shell 0.2%