category
bioRxiv
date
Feb 5, 2026
slug
status
Published
summary
提出两种深度学习评分模型:Model 1通过回归预测MSA与参考比对的距离,Model 2通过相对排名预测识别最优比对。两种模型均优于传统SP评分,其中Model 2在比对选择和系统发育重建准确性方面表现更优。
tags
蛋白质进化
type
Post
📄 原文题目
A deep-learning-based score to evaluate multiple sequence alignments
🔗 原文链接
💡 AI 核心解读
提出两种深度学习评分模型:Model 1通过回归预测MSA与参考比对的距离,Model 2通过相对排名预测识别最优比对。两种模型均优于传统SP评分,其中Model 2在比对选择和系统发育重建准确性方面表现更优。
📝 英文原版摘要
Multiple sequence alignment (MSA) inference is a central task in molecular evolution and comparative genomics, and the reliability of downstream analyses, including phylogenetic inference, depends critically on alignment quality. Despite this importance, most widely used MSA methods optimize the sum-of-pairs (SP) score, and relatively little attention has been paid to whether this objective function accurately reflects alignment accuracy. Here, we evaluate the performance of the SP score using simulated and empirical benchmark alignments. For each dataset, we compare alternative MSAs derived from the same unaligned sequences and quantify the relationship between their SP scores and their distances from a reference alignment. We show that the alignment with the optimal SP score often does not correspond to the most accurate alignment. To address this limitation, we develop deep-learning-based scoring functions that integrate a collection of MSA features. We first introduce Model 1, a regression model that predicts the distance of a given MSA from the reference alignment. Across simulated and empirical datasets, this learned score correlates more strongly with true alignment accuracy than the SP score. However, Model 1 is less effective at identifying the best alignment among alternatives. We therefore develop Model 2, which takes as input a set of alternative MSAs generated from the same sequences and predicts their relative ranking. Model 2 more accurately identifies the top-ranking MSA than the SP score, Model 1, and several widely used alignment programs. Using simulations, we show that selecting MSAs based on our approach leads to more accurate phylogenetic reconstructions.
- 作者:NotionNext
- 链接:https://tangly1024.com/article/2fe48bd6-1f96-814d-a55f-cca1e872085c
- 声明:本文采用 CC BY-NC-SA 4.0 许可协议,转载请注明出处。
相关文章
