AlexMikhalev / cord19redisknowledgegraph

This is a repo to build Knowledge Graph out of CORD19 kaggle data using Redis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

cord19redisknowledgegraph

This is a repo to build Knowledge Graph out of CORD19 kaggle data using Redis

This is a next phase of original Kaggle submission

Redis 'Beyond Cache' Hackathon Submission

Submission Summary

Todo

Documentation

Communication: Twist and Discord

Installation

mkdir ./input
pip install kaggle 
cd input
kaggle datasets download allen-institute-for-ai/CORD-19-research-challenge
unzip CORD-19-research-challenge.zip
docker run -d --name rgcluster -p 30001:30001 -p 30002:30002 -p 30003:30003 redislabs/rgcluster:latest
git clone https://github.com/AlexMikhalev/cord19redisknowledgegraph 
cd cord19redisknowledgegraph
pip install gears-cli
sh cluster_pipeline_events.sh

Start Redis Gears cluster

docker run -d -v $PWD/conf/docker-config.sh:/cluster/config.sh --name rgcluster -p 30001:30001 -p 30002:30002 -p 30003:30003 redislabs/rgcluster:latest

Rebuild Manually using create-cluster

./create-cluster clean
./create-cluster start
 echo "yes" | ./create-cluster create
 redis-trib.py execute --addr 10.144.17.211:30001 RG.REFRESHCLUSTER
 redis-trib.py execute --addr 10.144.17.211:30001 RG.CONFIGSET ExecutionMaxIdleTime 300000
 redis-trib.py execute --addr 10.144.17.211:30001 CONFIG SET proto-max-bulk-len 2048mb

Look at submissionredishack.md

References

Original Inspiration

About

This is a repo to build Knowledge Graph out of CORD19 kaggle data using Redis

License:Apache License 2.0


Languages

Language:Python 45.1%Language:TypeScript 28.1%Language:SCSS 7.8%Language:HTML 7.6%Language:Jupyter Notebook 6.1%Language:Shell 2.8%Language:JavaScript 2.5%