Clustering individuals using INMTD: a novel versatile multi-view embedding framework integrating omics and imaging data

dc.contributor.authorLi, Zuqi
dc.contributor.authorWindels, Sam F. L.
dc.contributor.authorMalod-Dognin, Noël
dc.contributor.authorWeinberg, Seth M.
dc.contributor.authorMarazita, Mary L.
dc.contributor.authorWalsh, Susan
dc.contributor.authorShriver, Mark D.
dc.contributor.authorFardo, David W.
dc.contributor.authorClaes, Peter
dc.contributor.authorPržulj, Nataša
dc.contributor.authorVan Steen, Kristel
dc.contributor.departmentBiology, School of Science
dc.date.accessioned2025-05-13T09:42:12Z
dc.date.available2025-05-13T09:42:12Z
dc.date.issued2025
dc.description.abstractMotivation: Combining omics and images can lead to a more comprehensive clustering of individuals than classic single-view approaches. Among the various approaches for multi-view clustering, nonnegative matrix tri-factorization (NMTF) and nonnegative Tucker decomposition (NTD) are advantageous in learning low-rank embeddings with promising interpretability. Besides, there is a need to handle unwanted drivers of clusterings (i.e. confounders). Results: In this work, we introduce a novel multi-view clustering method based on NMTF and NTD, named INMTD, which integrates omics and 3D imaging data to derive unconfounded subgroups of individuals. According to the adjusted Rand index, INMTD outperformed other clustering methods on a synthetic dataset with known clusters. In the application to real-life facial-genomic data, INMTD generated biologically relevant embeddings for individuals, genetics, and facial morphology. By removing confounded embedding vectors, we derived an unconfounded clustering with better internal and external quality; the genetic and facial annotations of each derived subgroup highlighted distinctive characteristics. In conclusion, INMTD can effectively integrate omics data and 3D images for unconfounded clustering with biologically meaningful interpretation. Availability and implementation: INMTD is freely available at https://github.com/ZuqiLi/INMTD.
dc.eprint.versionFinal published version
dc.identifier.citationLi Z, Windels SFL, Malod-Dognin N, et al. Clustering individuals using INMTD: a novel versatile multi-view embedding framework integrating omics and imaging data. Bioinformatics. 2025;41(4):btaf122. doi:10.1093/bioinformatics/btaf122
dc.identifier.urihttps://hdl.handle.net/1805/48016
dc.language.isoen_US
dc.publisherOxford University Press
dc.relation.isversionof10.1093/bioinformatics/btaf122
dc.relation.journalBioinformatics
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.sourcePMC
dc.subjectAlgorithms
dc.subjectCluster analysis
dc.subjectComputational biology
dc.subjectGenomics
dc.subjectSoftware
dc.titleClustering individuals using INMTD: a novel versatile multi-view embedding framework integrating omics and imaging data
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Li2025Clustering-CCBY.pdf
Size:
3.5 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.04 KB
Format:
Item-specific license agreed upon to submission
Description: