AIscEA: unsupervised integration of single-cell gene expression and chromatin accessibility via their biological consistency

dc.contributor.authorJafari, Elham
dc.contributor.authorJohnson, Travis
dc.contributor.authorWang, Yue
dc.contributor.authorLiu, Yunlong
dc.contributor.authorHuang, Kun
dc.contributor.authorWang, Yijie
dc.contributor.departmentBiostatistics and Health Data Science, School of Medicine
dc.date.accessioned2024-03-22T14:14:50Z
dc.date.available2024-03-22T14:14:50Z
dc.date.issued2022
dc.description.abstractMotivation: The integrative analysis of single-cell gene expression and chromatin accessibility measurements is essential for revealing gene regulation, but it is one of the key challenges in computational biology. Gene expression and chromatin accessibility are measurements from different modalities, and no common features can be directly used to guide integration. Current state-of-the-art methods lack practical solutions for finding heterogeneous clusters. However, previous methods might not generate reliable results when cluster heterogeneity exists. More importantly, current methods lack an effective way to select hyper-parameters under an unsupervised setting. Therefore, applying computational methods to integrate single-cell gene expression and chromatin accessibility measurements remains difficult. Results: We introduce AIscEA-Alignment-based Integration of single-cell gene Expression and chromatin Accessibility-a computational method that integrates single-cell gene expression and chromatin accessibility measurements using their biological consistency. AIscEA first defines a ranked similarity score to quantify the biological consistency between cell clusters across measurements. AIscEA then uses the ranked similarity score and a novel permutation test to identify cluster alignment across measurements. AIscEA further utilizes graph alignment for the aligned cell clusters to align the cells across measurements. We compared AIscEA with the competing methods on several benchmark datasets and demonstrated that AIscEA is highly robust to the choice of hyper-parameters and can better handle the cluster heterogeneity problem. Furthermore, AIscEA significantly outperforms the state-of-the-art methods when integrating real-world SNARE-seq and scMultiome-seq datasets in terms of integration accuracy. Availability and implementation: AIscEA is available at https://figshare.com/articles/software/AIscEA_zip/21291135 on FigShare as well as {https://github.com/elhaam/AIscEA} onGitHub.
dc.eprint.versionFinal published version
dc.identifier.citationJafari E, Johnson T, Wang Y, Liu Y, Huang K, Wang Y. AIscEA: unsupervised integration of single-cell gene expression and chromatin accessibility via their biological consistency. Bioinformatics. 2022;38(23):5236-5244. doi:10.1093/bioinformatics/btac683
dc.identifier.urihttps://hdl.handle.net/1805/39435
dc.language.isoen_US
dc.publisherOxford University Press
dc.relation.isversionof10.1093/bioinformatics/btac683
dc.relation.journalBioinformatics
dc.rightsPublisher Policy
dc.sourcePMC
dc.subjectChromatin
dc.subjectComputational biology
dc.subjectGene expression
dc.subjectSingle-cell analysis
dc.titleAIscEA: unsupervised integration of single-cell gene expression and chromatin accessibility via their biological consistency
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
btac683.pdf
Size:
827.93 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: