glebmikha / real-world-data-analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Real World Data Analysis

To start

  1. Copy this repo

Create a new directory, cd into it, and then run

git init
git pull https://github.com/glebmikha/data-science-project-template.git

Or you can just download it as a zip and use it without git.

  1. Add your favorite Python modules to ./docker/jupyter/requirements.txt. For example:
xgboost
tensorflow==1.6.0

Or use pip install right in jupyter (don't forget ! in front of the command)

!pip install your_package
  1. Start containers
docker-compose up
  1. Copy a jupyter url from terminal and open it in your browser.

  2. Find an examples.ipynb notebook in ipynb folder. Create your notebooks.

  3. Copy your data into ./data and read it in Jupyter. You can also upload data into PostgreSQL, which is running in it's own container along with Jupyter (see examples notebook for details)

  4. Stop containers

docker-compose down
  1. Clean Docker's mess
docker rmi -f $(docker images -qf dangling=true)

Sometimes it is useful to remove all docker's data.

docker system prune

About


Languages

Language:Jupyter Notebook 99.9%Language:Dockerfile 0.0%Language:Python 0.0%