category
bioRxiv
date
Feb 24, 2026
slug
status
Published
summary
提出协同信息得分(SIS)量化跨模态信息增益,揭示对齐目标仅能检测线性冗余而无法捕捉非线性协同状态,证明复杂任务需要协同整合而非简单对齐,并通过空间转录组学基准验证了协同整合的必要性。
tags
空间组学
type
Post

📄 原文题目

Beyond alignment: synergistic integration is required for multimodal cell foundation models

🔗 原文链接

💡 AI 核心解读

提出协同信息得分(SIS)量化跨模态信息增益,揭示对齐目标仅能检测线性冗余而无法捕捉非线性协同状态,证明复杂任务需要协同整合而非简单对齐,并通过空间转录组学基准验证了协同整合的必要性。

📝 英文原版摘要

The vision of a "virtual cell" - a computational model that simulates biological function across modalities and scales - has become a defining goal in computational biology. While powerful unimodal foundation models exist, the lack of large-scale paired data prohibits the joint training of multimodal approaches. This scarcity favors compositional foundation models (CFMs): architectures that fuse frozen unimodal experts via a learned interface. However, it remains unclear when this multimodal fusion adds task-relevant information beyond the strongest unimodal representation and when it merely aggregates redundant signal. Here, we introduce the Synergistic Information Score (SIS), a metric grounded in partial information decomposition (PID), that quantifies the information gain achievable only through cross-modal interactions. Extending theoretical results from self-supervised learning, we show that standard alignment-based fusion objectives on frozen encoders inherently collapse to detecting linear redundancies, limiting their ability to capture nonlinear synergistic states. This distinction is directly relevant for tasks aiming to link tissue morphology and gene expression. Benchmarking ten fusion methods on spatial transcriptomics datasets, we use SIS to demonstrate that tasks dominated by linear redundancies are sufficiently served by unimodal baselines, whereas complex niche definitions benefit from synergy-aware integration objectives that enable cross-modal interactions beyond linear alignment. Finally, we perform a scaling analysis which highlights that fine-tuning a dominant unimodal expert is the most sample-efficient path for standard tasks, suggesting that the benefits of multimodal frameworks only emerge when tasks depend on information distributed across mod
alities. Together, these results establish that building towards a virtual cell will require a fundamental shift from alignment objectives that emphasize shared structure to synergy-maximizing integration that preserves and exploits complementary cross-modal signal.
系统性识别肿瘤类型特异性DNA甲基化生物标志物人工智能工具可设计基因组:它们会颠覆生命的进化方式吗?
Loading...