skr3178 / ikarus

ikarus

ikarus is a stepwise machine learning pipeline for distinguishing tumor cells from normal cells. It leverages multiple annotated single-cell datasets to define a gene set specific to tumor cells. This gene set is first used to rank cells and then to train a logistic classifier for robust classification of tumor and normal cells. Finally, sensitivity is increased by propagating the cell labels through a custom cell-cell network. ikarus has been tested on multiple single-cell datasets to confirm that it achieves high sensitivity and specificity across multiple experimental contexts. More information is available in our preprint.
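
The three steps described above (gene-set scoring, logistic classification, label propagation) can be sketched on toy data. This is a minimal illustration in plain Python and not the ikarus implementation: the gene names, the mean-expression score, the hand-rolled gradient descent, the graph, and the neighbor-averaging rule are all assumptions chosen to keep the example short.

```python
import math

# Hypothetical toy signatures; real gene lists would come from annotated datasets.
TUMOR_GENES = ["T1", "T2"]
NORMAL_GENES = ["N1", "N2"]

def signature_score(cell, genes):
    """Mean expression of the signature genes that are present in the cell."""
    values = [cell[g] for g in genes if g in cell]
    return sum(values) / len(values) if values else 0.0

def feature(cell):
    """Assumed feature: tumor score minus normal score."""
    return signature_score(cell, TUMOR_GENES) - signature_score(cell, NORMAL_GENES)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=0.1, epochs=500):
    """Fit p(tumor) = sigmoid(w * x + b) by batch gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            p = sigmoid(w * x + b)
            grad_w += (p - y) * x
            grad_b += p - y
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b

def propagate(probs, neighbors, alpha=0.5, steps=10):
    """Blend each cell's own probability with the mean of its neighbors."""
    current = list(probs)
    for _ in range(steps):
        current = [
            (1 - alpha) * probs[i]
            + alpha * sum(current[j] for j in neighbors[i]) / len(neighbors[i])
            for i in range(len(probs))
        ]
    return current

# Labeled training cells (1 = tumor, 0 = normal), as gene -> expression dicts.
train = [
    ({"T1": 5, "T2": 4, "N1": 1, "N2": 0}, 1),
    ({"T1": 4, "T2": 5, "N1": 0, "N2": 1}, 1),
    ({"T1": 0, "T2": 1, "N1": 5, "N2": 4}, 0),
    ({"T1": 1, "T2": 0, "N1": 4, "N2": 5}, 0),
]
w, b = fit_logistic([feature(c) for c, _ in train], [y for _, y in train])

# Unlabeled cells; the third one is ambiguous on its own.
test_cells = [
    {"T1": 5, "T2": 5, "N1": 1, "N2": 1},
    {"T1": 4, "T2": 4, "N1": 1, "N2": 1},
    {"T1": 2, "T2": 2, "N1": 3, "N2": 2},
    {"T1": 0, "T2": 1, "N1": 5, "N2": 4},
    {"T1": 1, "T2": 0, "N1": 4, "N2": 5},
]
# Hypothetical cell-cell network: indices of each cell's neighbors.
neighbors = [[1, 2], [0, 2], [0, 1], [4], [3]]

raw = [sigmoid(w * feature(c) + b) for c in test_cells]
smoothed = propagate(raw, neighbors)
labels = ["Tumor" if p > 0.5 else "Normal" for p in smoothed]
```

In this toy setup the third cell falls just below the decision threshold on its own score, and propagation over its tumor-like neighbors moves it above the threshold, which is the sense in which propagation can raise sensitivity.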

Installation

ikarus currently supports Python 3.8 and can be installed from PyPI:

pip install ikarus

Alternatively, one can install the master branch of ikarus directly from GitHub:

python -m pip install git+https://github.com/BIMSBbioinfo/ikarus.git

Usage

The easiest option to get started is to use the provided Tumor/Normal gene lists and the pretrained model:

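from ikarus import classifier

# signatures_path: path to the provided Tumor/Normal gene lists (GMT file)
# model_path: path to the provided pretrained model
# test_adata: the single-cell dataset to classify (presumably an AnnData object)
model = classifier.Ikarus(signatures_gmt=signatures_path)
model.load_core_model(model_path)
predictions = model.predict(test_adata, 'test_name')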
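
The `signatures_gmt` argument points to a gene-list file. As a rough sketch, assuming that file follows the common tab-separated GMT layout (set name, description, then one gene per field), it could be inspected with a small reader like the one below. The `read_gmt` helper, the set names, and the gene names are hypothetical and are not part of ikarus.

```python
import io

def read_gmt(handle):
    """Parse GMT-style lines into {set_name: [genes]} (hypothetical helper)."""
    signatures = {}
    for line in handle:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        name, _description, *genes = fields
        signatures[name] = [g for g in genes if g]
    return signatures

# In-memory stand-in for a real file; names here are placeholders.
example = io.StringIO("Tumor\tdemo\tGENE_A\tGENE_B\nNormal\tdemo\tGENE_C\n")
signatures = read_gmt(example)
```

With a real file, the same function could be called on `open(signatures_path)` to check which gene sets the file contains before passing the path to the classifier.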

More information on how to train a model or create your own gene lists is provided in the tutorial notebook.

Example notebooks
Data preparation and basic prediction

License: MIT License

