Abstract
Single-cell technologies have enabled major advances in cell-type discovery, cell-state identification, lineage tracing, and the understanding of disease, among others. Moreover, single-cell multi-omics data generated using modern technologies provide several omics views of the same set of cells. Analysis of these views is hindered by their high dimensionality. One effective approach is dimensionality reduction, and thereby visualization (in 2D or 3D space), of the integrated views of multi-omics data. However, dimension reduction and visualization of these datasets remain challenging, since it is difficult to obtain a low-dimensional embedding that preserves both the local and global structure of the data. Combining the views obtained from each omics layer to interpret the underlying biology is even more challenging. In this work, we introduce NeuroMDAVIS, a novel unsupervised deep neural network model for joint visualization of biological datasets having multiple modalities. NeuroMDAVIS transforms the feature space of each modality and integrates them into a shared latent embedding that supports visualization of the multi-modal dataset, capturing both modality-specific information and information common across all modalities. The model learns both local and global relationships within the data, providing a meaningful low-dimensional representation for downstream analysis. In terms of visualization capability, NeuroMDAVIS is competitive with state-of-the-art methods such as t-distributed Stochastic Neighbor Embedding (t-SNE), Uniform Manifold Approximation and Projection (UMAP), fast interpolation-based t-SNE (FIt-SNE), the Siamese-network-based visualization method IVIS, and MultiMAP, a manifold-learning-based generalization of UMAP. Downstream analyses demonstrate effective classification and clustering performance across all datasets, in terms of accuracy, precision, recall, F1 score, and several cluster validity indices. To the best of our knowledge, NeuroMDAVIS is the first model to offer joint visualization of multi-modal biological datasets, providing a robust and efficient approach for understanding complex multi-omics data.
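To make the described architecture concrete, the following is a minimal sketch of the kind of multi-branch network the abstract outlines, written in PyTorch. The class name JointVisualizer, the layer widths, the 2D latent size, and the concatenation-based fusion are illustrative assumptions for exposition, not the published NeuroMDAVIS design.

```python
# Hypothetical sketch of a joint-visualization network: one encoder per
# modality, a shared low-dimensional latent, and per-modality decoders.
# All names and hyperparameters here are illustrative assumptions.
import torch
import torch.nn as nn

class JointVisualizer(nn.Module):
    def __init__(self, modality_dims, latent_dim=2, hidden_dim=128):
        super().__init__()
        # One encoder per omics modality (e.g. RNA, protein), each
        # producing a modality-specific embedding.
        self.encoders = nn.ModuleList(
            nn.Sequential(nn.Linear(d, hidden_dim), nn.ReLU())
            for d in modality_dims
        )
        # Fusion: concatenate the modality embeddings and project them to
        # a shared low-dimensional latent used for 2D visualization.
        self.fuse = nn.Linear(hidden_dim * len(modality_dims), latent_dim)
        # One decoder per modality reconstructs its view from the shared
        # latent, so the embedding retains modality-specific information.
        self.decoders = nn.ModuleList(
            nn.Sequential(nn.Linear(latent_dim, hidden_dim), nn.ReLU(),
                          nn.Linear(hidden_dim, d))
            for d in modality_dims
        )

    def forward(self, views):
        parts = [enc(v) for enc, v in zip(self.encoders, views)]
        z = self.fuse(torch.cat(parts, dim=1))
        return z, [dec(z) for dec in self.decoders]

# Usage: training would minimize the summed reconstruction error across
# modalities; the 2D latent z is then plotted as the joint visualization.
model = JointVisualizer(modality_dims=[2000, 500])  # e.g. RNA + protein
views = [torch.randn(64, 2000), torch.randn(64, 500)]
z, recons = model(views)
loss = sum(nn.functional.mse_loss(r, v) for r, v in zip(recons, views))
```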
