category
bioRxiv
date
Feb 6, 2026
slug
status
Published
summary
1. 提出首个结合原型嵌入与自解释机制的单细胞注释模型;2. 通过原型相似性量化机制实现错误标注细胞的自动修正;3. 采用数据增强策略显著提升稀有细胞类型识别能力;4. 可解释性分析可发现新型细胞标记基因。
tags
单细胞测序
type
Post
📄 原文题目
ProtoCloud: a Prototypical Self-explaining Model for Single-cell Analysis
🔗 原文链接
💡 AI 核心解读
1. 提出首个结合原型嵌入与自解释机制的单细胞注释模型;2. 通过原型相似性量化机制实现错误标注细胞的自动修正;3. 采用数据增强策略显著提升稀有细胞类型识别能力;4. 可解释性分析可发现新型细胞标记基因。
📝 英文原版摘要
Cell type annotation is a fundamental task in single-cell genomics. Although various methods have been developed for automatic cell type annotation, they often function as black-box models, making predictions without explaining their reasoning and lacking proper uncertainty estimation for their predictions. Furthermore, they often struggle to annotate rare cell types. We introduce ProtoCloud, a self-explaining deep generative model trained end-to-end to embed cells into a structured, low-dimensional space organized around cell type-specific prototypes. Coupled with a specifically-designed data augmentation strategy, it matches or outperforms existing methods in cell type annotation across 11 large-scale datasets, particularly for rare cell types. Moreover, ProtoCloud improves data annotation quality by identifying and re-annotating misannotated training cells through a built-in certainty quantification mechanism based on cell-prototype similarity. Finally, ProtoCloud provides interpretable predictions by identifying key genes that drive its classifications, facilitating the discovery of both known and novel cell type marker genes. Applied to a time-course dataset of post-injury retinal neurons, ProtoCloud successfully annotates previously unassigned cells; on the esophageal cell atlas, it identifies rare but potentially important cell populations and their marker genes relevant to esophageal inflammation.
- 作者:NotionNext
- 链接:https://tangly1024.com/article/30048bd6-1f96-81a3-a109-fd26a24ebbb3
- 声明:本文采用 CC BY-NC-SA 4.0 许可协议,转载请注明出处。
相关文章
