category
bioRxiv
date
Mar 20, 2026
slug
status
Published
summary
创新点包括:1) 提出统一框架直接从基因组文件构建泛转录组索引;2) 开发无对齐映射方法结合多路Cuckoo哈希表存储间隙k-mer;3) 在混合菌株样本中实现比传统工具更高的映射精度和基因表达量化准确性;4) 索引体积更小且分析速度更快。
tags
测序技术
type
Post
📄 原文题目
PanXpress: Gene expression quantification with a pan-transcriptomic gapped k-mer index
🔗 原文链接
💡 AI 核心解读
创新点包括:1) 提出统一框架直接从基因组文件构建泛转录组索引;2) 开发无对齐映射方法结合多路Cuckoo哈希表存储间隙k-mer;3) 在混合菌株样本中实现比传统工具更高的映射精度和基因表达量化准确性;4) 索引体积更小且分析速度更快。
📝 英文原版摘要
Motivation: Most existing workflows for quantifying bacterial gene expression from RNA-seq data rely on mapping reads to a (single) reference transcriptome, typically ignoring strain-level variation. When samples contain unknown or mixed strains, these workflows may introduce reference bias and fail to accurately capture strain-specific gene expression. Pan-transcriptomic approaches address this issue by using pan-transcriptomes as references, but existing solutions require multiple steps for pan-transcriptome construction, indexing, and expression quantification. Results: We introduce PanXpress, a unified framework for bacterial pan-transcriptomics that performs pan-transcriptome construction and indexing directly from genomic FASTA and GFF annotation files, alignment-free mapping of reads to genes from FASTQ samples, and gene expression quantification. The index, a multi-way Cuckoo hash table storing gapped k-mers with associated genes, preserves diversity on the k-mer level. Using simulated RNA-seq data from a mixture of Pseudomonas aeruginosa strains, PanXpress achieves mapping recall comparable to alignment-based methods such as Bowtie2 with higher precision and obtains accurate gene expression and log fold change estimates. On real P. aeruginosa RNA-seq data, using PanXpress' pan-transcriptomic reference increases the proportion of mapped reads and discovered expressed genes. The index of PanXpress is smaller than that of other tools and it provides faster analysis with consistent results, compared to other tools (Salmon, Kallisto, Bowtie2). PanXpress is thus an accurate and efficient method for bacterial gene expression analysis in complex samples.
- 作者:NotionNext
- 链接:https://tangly1024.com/article/32a48bd6-1f96-81fc-87a6-e012abf46ec2
- 声明:本文采用 CC BY-NC-SA 4.0 许可协议,转载请注明出处。
相关文章
