EARRINGS: An efficient and accurate adapter trimmer entails no a priori adapter sequences

Ting Hsuan Wang, Cheng Ching Huang, Jui Hung Hung*


Research output: Article, peer-reviewed

3 citations (Scopus)


Motivation: Cross-sample comparisons and large-scale meta-analyses based on next-generation sequencing (NGS) require replicable and universal data preprocessing, including the removal of adapter fragments from contaminated reads (i.e. adapter trimming). Modern adapter trimmers require users to provide candidate adapter sequences for each sample, but these sequences are sometimes unavailable or incorrectly documented in repositories such as GEO or SRA. Large-scale meta-analyses are therefore jeopardized by suboptimal adapter trimming. Results: Here we introduce a set of fast and accurate adapter detection and trimming algorithms that require no a priori adapter sequences. The algorithms are implemented in modern C++ and accelerated with SIMD and multithreading. Our experiments and benchmarks show that the implementation (i.e. EARRINGS), given no hint of the adapter sequences, achieves accuracy comparable to that of existing adapter trimmers at higher throughput. EARRINGS is particularly useful in meta-analyses of large batches of datasets and can be incorporated into sequence analysis pipelines at any scale. Availability and implementation: EARRINGS is open-source software and is available at https://github.com/jhhung/EARRINGS.

Pages (from-to): 1846-1852
Publication status: Published - 1 Jul 2021

