JonathanShor / DoubletDetection

Doublet detection in single-cell RNA-seq data.

Home Page:https://doubletdetection.readthedocs.io/en/stable/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Input types for count matrix

adamgayoso opened this issue · comments

We should at least accept:

  • Sparse matrices
  • Pandas DataFrames
  • 10x mtx format

For sparse matrices, we will probably just instantly make them dense.

If it seems worthwhile to fully support sparse in and out, there are some options for getting "around" PCA densifying the matrix, such as a random subsample PCA.
I'm not familiar with these approaches, so we'd have to look into them further.

Likely use sklearn.utils.check_array.