category
NAR
date
Feb 16, 2026
slug
status
Published
summary
创新性提出无需实验数据的原核生物顺式调控元件设计模型PromoGen2,实现跨物种启动子强度预测相关性提升(Spearman相关性从0.27到0.50),开发Promoter-Factory框架实现未注释基因组启动子设计,并构建基于分类的PromoGen2-proka模型,实验验证显示在多个原核生物中成功率达100%。
tags
合成生物学
type
Post

📄 原文题目

Design prokaryotic cis-regulatory elements using language model

🔗 原文链接

💡 AI 核心解读

创新性提出无需实验数据的原核生物顺式调控元件设计模型PromoGen2,实现跨物种启动子强度预测相关性提升(Spearman相关性从0.27到0.50),开发Promoter-Factory框架实现未注释基因组启动子设计,并构建基于分类的PromoGen2-proka模型,实验验证显示在多个原核生物中成功率达100%。

📝 英文原版摘要

<span class="paragraphSection"><div class="boxTitle">Abstract</div>Deep learning has successfully been applied to design <span style="font-style: italic;">cis-</span>regulatory elements (CREs) for a few species, but a broadly applicable platform for generating functional promoters for thousands of prokaryotes remains lacking. In this study, we introduce a language model for prokaryotic CREs, referred to as PromoGen2, to design CREs without prior experimental data. PromoGen2 was pretrained on CREs derived from 17 000 prokaryotic genomes. It achieved the highest zero-shot prediction correlation of promoter strength across species, improving the average Spearman correlation from 0.27 to 0.50 compared to the best baseline, while reducing the number of parameters by 10<sup>3</sup>. Artificial CREs designed with PromoGen2 demonstrated a 100% success rate in <span style="font-style: italic;">Escherichia coli, Bacillus subtilis, Bacillus licheniformis</span>, and <span style="font-style: italic;">Agrobacterium tumefaciens</span>. Based on PromoGen2, we developed the Promoter-Factory framework to design promoters from unannotated genomes. Experimental validation showed that most of the promoters designed for <span style="font-style: italic;">Jejubacter sp</span>. L23, a newly isolated halophilic bacterium with no available CREs, were active and capable of driving lycopene overproduction. Additionally, we introduced PromoGen2-proka, a taxonomy-aware model for CRE design based on prokaryotic genera. Experimental validation confirmed its reliable success rate. The combined use of PromoGen2-proka and Promoter-Factory offers a broadly applicable tool for designing CREs for prokaryotes, fulfilling the needs of synthetic biology and microbiology research.</span>
设计依赖性缺陷揭示DNA折纸中的复杂协同作用基于机器学习的Toehold-VISTA:解析可编程RNA传感器-靶标相互作用的方法
Loading...