WSCBS2022

Development

We use two submodules for the data_processing package and visualization package of this repository. To clone the repository, run:

git clone --recurse-submodules https://github.com/Han-Linnn/WSCBS2022.git

or alternatively, run

git submodule init
git submodule update

Build Brane Package and Pipline

The Python scripts (data_processing.py, visualization.py, test_visualization.py, test_preprocessing.py), the corresponding container.yml files and the train.csv and test.csv are initially required to get started.

First run:

brane login http://127.0.0.1 --username lin

Then run :

 make build

To build and push all packages

Running the pipeline

At that point start an ide (make start-ide whereever brane's make file is) and upload your local Pipeline.ipynb file to jupyters and run the cells in Pipeline.ipynb in jupyterlab to get started. Then this should take like max a minute and then return the processed data preprocessing.csv and the visualization outputs.

Project Structure

├── File_export

│ ├── data

│ ├── container.yml

│ ├── file_export.py

│ └── README.md

├── data

│ ├── test.csv

│ └── train.csv

├── data_processing

│ ├── data_processing.py

│ ├── test_preprocessing.py

│ ├── data

│ │ ├──── test.csv

│ │ └──── train.csv

│ ├── container.yml

│ ├── README.md

│ └── brane

├── visualization

│ ├── brane_visualization.py

│ ├── test_visualization.py

│ ├── data

│ │ ├──── test.csv

│ │ └──── train.csv

│ ├── container.yml

│ ├── brane

│ └── README.md

├── makefile

├── README.md

├── test.py

└── pipeline.ipynb

About

Contribution

Languages

Language:Jupyter Notebook 68.0%Language:Makefile 20.9%Language:Python 11.2%