Welcome to MaskGraphene’s documentation!
MaskGraphene: Advancing joint embedding, clustering, and batch correction for spatial transcriptomics using graph-based self-supervised learning
- Installation
- Data availability
- Tutorial 1: Generate hard-links for consecutive ST slices
- Tutorial 1b: Generate hard-links using PASTE
- Tutorial 1c: Generate hard-links using SPACEL
- Tutorial 2: Integrate DLPFC data with MaskGraphene
- Tutorial 3: Integrate MHypo data with MaskGraphene
- Tutorial 4: Integrate MB data with MaskGraphene
- Tutorial 6: Integrate horizontal slices with MaskGraphene
Overview
With the rapid advancement and generation of spatial transcriptomics (ST), integrating data from multiple ST slices is increasingly crucial for joint slice analysis. Nevertheless, the tasks of learning joint embeddings and identifying shared and unique cell/domain types across ST slices remain challenging. To address this, we introduce MaskGraphene, for the purpose of better aligning and integrating different ST slices using both self-supervised and contrastive learning. MaskGraphene learns the high-quality joint embeddings to capture the geometric information efficiently. MaskGraphene further facilitates spatial aware data integration and simultaneous identification of shared and unique cell/domain types across different slices. We have applied MaskGraphene to conduct integrative analyses on various types of ST datasets, including human cortex slices, mouse hypothalamus data, mouse sagittal brain sections, and mouse embryo developmental data. Across datasets, MaskGraphene successfully optimized joint embedding by introducing inter-slice connections, effectively performing batch correction and capturing shared tissue structures across different slices, and tracking spatiotemporal changes during mouse embryonic development. Our experimental results demonstrate that the batch effect is significantly mitigated in the data, validated by the embedding mixture metrics iLISI, while also exhibiting an ideal distribution of the joint embedding in a manner that preserves geometric information and ensures good clustering results and more interpretable downstream analyses.
Citation
@article{hu2024maskgraphene,
title={MaskGraphene: an advanced framework for interpretable latent representation for multi-slice, multi-condition spatial transcriptomics},
author={Yunfei Hu, Zhenhan Lin, Manfei Xie, Weiman Yuan, Yikang Li, Mingxing Rao, Yichen Henry Liu, Wenjun Shen, Lu Zhang, and Xin Maizie Zhou},
journal={bioRxiv},
pages={2024--02},
year={2024},
publisher={Cold Spring Harbor Laboratory}
}