asifr / notebooks

Notebooks and scripts

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Asif Rahman
2022

Index

  1. Physionet 2019 preprocessing
  2. Physionet 2012 preprocessing
  3. UCR time series classification data
  4. Tabular classifier with embeddings
  5. Polars cookbook
  6. Neural adaptive regression splines

Setup

Some methods use Spark for data processing. To install Spark on MacOS, run brew install apache-spark. You will need to install the python modules pyspark and findspark. All required python modules are in requirements.txt.

You can load (and reload) the modules like utils.py using the snippet below:

# Add the parent notebook folder to the modules search path
parent = '../'
if parent not in sys.path: sys.path.append(parent)

import importlib
import utils
importlib.reload(utils)

Utilities

utils.py includes generic utility functions to read/write parquet datasets, impute missing values, summarize dataframes, pad sequences, add styles to plots, initialize spark, and transform design matrices.

About

Notebooks and scripts


Languages

Language:Jupyter Notebook 61.1%Language:Python 38.6%Language:Dockerfile 0.4%