alam-shahul / GLUE

Graph-linked unified embedding for single-cell multi-omics data integration

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GLUE (Graph-Linked Unified Embedding)

license-badge pypi-badge conda-badge docs-badge build-badge coverage-badge

Graph-linked unified embedding for single-cell multi-omics data integration

Model architecture

For more details, please check out our preprint.

Directory structure

.
├── scglue                  # Main Python package
├── data                    # Data files
├── evaluation              # Method evaluation pipelines
├── experiments             # Experiments and case studies
├── tests                   # Unit tests for the Python package
├── docs                    # Documentation files
├── custom                  # Customized third-party packages
├── packrat                 # Reproducible R environment via packrat
├── env.yaml                # Reproducible Python environment via conda
├── setup.py                # Setup script for the Python package
├── LICENSE
└── README.md

Package installation

The scglue package can be installed either via conda:

conda install -c defaults -c pytorch -c bioconda -c conda-forge -c scglue scglue

Or via pip:

pip install scglue

Installing within a conda environment is recommended.

Package usage

Please checkout the documentations and tutorials at scglue.readthedocs.io.

Package development

Install scglue in editable form via flit (first install flit via conda or pip if not installed already):

flit install -s

Run unit tests:

pytest --cov="scglue" --cov-report="term-missing" tests [--cpu-only]

Build documentation:

sphinx-build -a -b html docs docs/_build

Reproduce results

  1. Checkout the repository to v0.1.1:

    git checkout tags/v0.1.1
  2. Create a local conda environment using the env.yaml file, and then install scglue:

    conda env create -p conda -f env.yaml && conda activate ./conda
    flit install -s
  3. Set up a project-specific R environment:

    packrat::restore()  # Packrat should be automatically installed if not available.
    install.packages("data/download/Saunders-2018/DropSeq.util_2.0.tar.gz", repos = NULL)
    install.packages("custom/Seurat_4.0.2.tar.gz", lib = "packrat/custom", repos = NULL)

    R 4.0.2 was used during the project, but any version above 4.0.0 should be compatible.

  4. Follow instructions in data to prepare the necessary data.

  5. Follow instructions in evaluation for method evaluation.

  6. Follow instructions in experiments for case studies.

About

Graph-linked unified embedding for single-cell multi-omics data integration

License:MIT License


Languages

Language:Jupyter Notebook 48.1%Language:Python 44.1%Language:R 7.1%Language:Shell 0.7%