scalign Documentation¶
scalign is a package for querying and mapping single cell RNA sequencing data
onto a reference atlas. One may examine the spatial mapping directly onto the
UMAP embeddings supplied by the atlas reference.
You should only perform basic quality control measures on raw UMI counts, filtering
out cells of low quality and presumes to be doublets, and then use the UMI matrix
to perform the querying step without any further integration. Since scalign will
automatically read your specified batch key and correct batch effect using scVI.
Install¶
This package is distributed on Python package index (PyPI). It is tested only on Linux environment by now.
The package have two installation option, the basic installation includes the
capability to load non-parametric models only (A trained UMAP model provided
by the atlas, and can only support embedding prediction). And the full installation
can load the parametric models based on neural networks. It supports re-training
and adapting the atlas to better fit the query data, thus providing more accurate
results. This is dependent on local installation of Tensorflow framework and
Keras. Note that neural network models may run very slowly on CPU-only machines.
However, it surely runs successfully.
You can install the basic package using pip:
pip install scalign
Or the full installation using:
pip install scalign[parametric]
This package goes with scalign-umap package, which is a fork from umap-learn.
But the original package contains some bugs so I modified it a bit. This package
is maintained by me independently.