category
bioRxiv
date
Feb 6, 2026
slug
status
Published
summary
1. 提出首个结合原型嵌入与自解释机制的单细胞注释模型;2. 通过原型相似性量化机制实现错误标注细胞的自动修正;3. 采用数据增强策略显著提升稀有细胞类型识别能力;4. 可解释性分析可发现新型细胞标记基因。
tags
单细胞测序
type
Post

📄 原文题目

ProtoCloud: a Prototypical Self-explaining Model for Single-cell Analysis

🔗 原文链接

💡 AI 核心解读

1. 提出首个结合原型嵌入与自解释机制的单细胞注释模型;2. 通过原型相似性量化机制实现错误标注细胞的自动修正;3. 采用数据增强策略显著提升稀有细胞类型识别能力;4. 可解释性分析可发现新型细胞标记基因。

📝 英文原版摘要

Cell type annotation is a fundamental task in single-cell genomics. Although various methods have been developed for automatic cell type annotation, they often function as black-box models, making predictions without explaining their reasoning and lacking proper uncertainty estimation for their predictions. Furthermore, they often struggle to annotate rare cell types. We introduce ProtoCloud, a self-explaining deep generative model trained end-to-end to embed cells into a structured, low-dimensional space organized around cell type-specific prototypes. Coupled with a specifically-designed data augmentation strategy, it matches or outperforms existing methods in cell type annotation across 11 large-scale datasets, particularly for rare cell types. Moreover, ProtoCloud improves data annotation quality by identifying and re-annotating misannotated training cells through a built-in certainty quantification mechanism based on cell-prototype similarity. Finally, ProtoCloud provides interpretable predictions by identifying key genes that drive its classifications, facilitating the discovery of both known and novel cell type marker genes. Applied to a time-course dataset of post-injury retinal neurons, ProtoCloud successfully annotates previously unassigned cells; on the esophageal cell atlas, it identifies rare but potentially important cell populations and their marker genes relevant to esophageal inflammation.
抗菌蛋白功能的从头起源与进化转录因子结合的位置语法在植物中划分发育与应激反应调控
Loading...