EARRINGS: An efficient and accurate adapter trimmer entails no a priori adapter sequences

Ting Hsuan Wang, Cheng Ching Huang, Jui Hung Hung*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Motivation: Cross-sample comparisons or large-scale meta-analyses based on the next generation sequencing (NGS) involve replicable and universal data preprocessing, including removing adapter fragments in contaminated reads (i.e. adapter trimming). While modern adapter trimmers require users to provide candidate adapter sequences for each sample, which are sometimes unavailable or falsely documented in the repositories (such as GEO or SRA), large-scale meta-analyses are therefore jeopardized by suboptimal adapter trimming. Results: Here we introduce a set of fast and accurate adapter detection and trimming algorithms that entail no a priori adapter sequences. These algorithms were implemented in modern Cþþ with SIMD and multithreading to accelerate its speed. Our experiments and benchmarks show that the implementation (i.e. EARRINGS), without being given any hint of adapter sequences, can reach comparable accuracy and higher throughput than that of existing adapter trimmers. EARRINGS is particularly useful in meta-analyses of a large batch of datasets and can be incorporated in any sequence analysis pipelines in all scales. Availability and implementation: EARRINGS is open-source software and is available at https://github.com/jhhung/ EARRINGS.

Original languageEnglish
Pages (from-to)1846-1852
Number of pages7
JournalBioinformatics
Volume37
Issue number13
DOIs
StatePublished - 1 Jul 2021

Fingerprint

Dive into the research topics of 'EARRINGS: An efficient and accurate adapter trimmer entails no a priori adapter sequences'. Together they form a unique fingerprint.

Cite this