yutanagano / tidytcells

Standardise TR/MH data

Home Page:https://tidytcells.readthedocs.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Tests Docs License

tidytcells is a lightweight python package that cleans and standardizes T cell receptor (TR) and Major Histocompatibility (MH) data to be IMGT-compliant. The main purpose of the package is to solve the problem of parsing and collating together non-standardized TR datasets. It is often difficult to compile TR data from multiple sources because the formats/nomenclature of how each dataset encodes TR and MH gene names are slightly different, or even inconsistent within themselves. tidytcells can ameliorate this issue by auto-correcting and auto-standardizing your data! Check out the documentation page.

Installation

Via PyPI (recommended)

tidytcells can be installed using pip:

$ pip install tidytcells

From source

The source code for the package is available on Github. To install from source, clone the git repository, and run:

$ pip install .

from inside the project root directory.

Useful links