EARRINGS: An efficient and accurate adapter trimmer entails no a priori adapter sequences

Ting Hsuan Wang, Cheng Ching Huang, Jui Hung Hung*

*Corresponding author for this work

Research output: Article (peer-reviewed)

Abstract

Motivation: Cross-sample comparisons or large-scale meta-analyses based on the next generation sequencing (NGS) involve replicable and universal data preprocessing, including removing adapter fragments in contaminated reads (i.e. adapter trimming). While modern adapter trimmers require users to provide candidate adapter sequences for each sample, which are sometimes unavailable or falsely documented in the repositories (such as GEO or SRA), large-scale meta-analyses are therefore jeopardized by suboptimal adapter trimming. Results: Here we introduce a set of fast and accurate adapter detection and trimming algorithms that entail no a priori adapter sequences. These algorithms were implemented in modern C++ with SIMD and multithreading to accelerate its speed. Our experiments and benchmarks show that the implementation (i.e. EARRINGS), without being given any hint of adapter sequences, can reach comparable accuracy and higher throughput than that of existing adapter trimmers. EARRINGS is particularly useful in meta-analyses of a large batch of datasets and can be incorporated in any sequence analysis pipelines in all scales. Availability and implementation: EARRINGS is open-source software and is available at https://github.com/jhhung/EARRINGS.
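To make the term "adapter trimming" concrete, here is a minimal Python sketch of the conventional case the abstract contrasts against, in which the adapter sequence is supplied by the user. It is not the EARRINGS algorithm, which detects adapters without being given them. The function name `trim_adapter`, the `min_overlap` parameter, and the example reads are hypothetical, and the sketch only handles exact matches (no sequencing errors).

```python
def trim_adapter(read: str, adapter: str, min_overlap: int = 3) -> str:
    """Remove a known 3' adapter from a read (exact matching only).

    Illustrative sketch: real trimmers tolerate mismatches and use
    quality scores; this one does neither.
    """
    # Case 1: the full adapter occurs in the read. Everything from its
    # first occurrence onward is adapter contamination, so cut there.
    pos = read.find(adapter)
    if pos != -1:
        return read[:pos]

    # Case 2: the read ends partway through the adapter, so only a
    # prefix of the adapter is present at the read's 3' end.
    # Try the longest possible overlap first.
    longest = min(len(adapter) - 1, len(read))
    for overlap in range(longest, min_overlap - 1, -1):
        if read.endswith(adapter[:overlap]):
            return read[:len(read) - overlap]

    # No adapter evidence found: return the read unchanged.
    return read


# Hypothetical example inputs; the adapter is a commonly seen Illumina prefix.
ADAPTER = "AGATCGGAAGAGC"
full = trim_adapter("ACGTACGTAC" + ADAPTER + "TTT", ADAPTER)
partial = trim_adapter("ACGTACGTAC" + "AGATC", ADAPTER)
clean = trim_adapter("ACGTACGTAC", ADAPTER)
```

The `min_overlap` threshold is a design choice in this sketch: very short matches at the read end are likely to occur by chance, so they are left untrimmed.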

Original language: English
Pages (from-to): 1846-1852
Number of pages: 7
Journal: Bioinformatics
Volume: 37
Issue number: 13
DOIs
Publication status: Published - 1 Jul 2021
