Ghent University Academic Bibliography

Advanced

Evaluation of the impact of Illumina error correction tools on de novo genome assembly

Mahdi Heydari UGent, Giles Miclotte UGent, Piet Demeester UGent, Yves Van de Peer UGent and Jan Fostier UGent (2017) BMC BIOINFORMATICS. 18.
abstract
Background: Recently, many standalone applications have been proposed to correct sequencing errors in Illumina data. The key idea is that downstream analysis tools such as de novo genome assemblers benefit from a reduced error rate in the input data. Surprisingly, a systematic validation of this assumption using state-of-the-art assembly methods is lacking, even for recently published methods. Results: For twelve recent Illumina error correction tools (EC tools) we evaluated both their ability to correct sequencing errors and their ability to improve de novo genome assembly in terms of contig size and accuracy. Conclusions: We confirm that most EC tools reduce the number of errors in sequencing data without introducing many new errors. However, we found that many EC tools suffer from poor performance in certain sequence contexts such as regions with low coverage or regions that contain short repeated or low-complexity sequences. Reads overlapping such regions are often ill-corrected in an inconsistent manner, leading to breakpoints in the resulting assemblies that are not present in assemblies obtained from uncorrected data. Resolving this systematic flaw in future EC tools could greatly improve the applicability of such tools.
Please use this url to cite or link to this publication:
author
organization
year
type
journalArticle (original)
publication status
published
subject
keyword
Next-generation sequencing, Error correction, Illumina, Genome assembly, ACCURATE CORRECTION, SEQUENCING ERRORS, MEMORY-EFFICIENT, BRUIJN GRAPHS, READS, ALIGNMENT
journal title
BMC BIOINFORMATICS
BMC Bioinformatics
volume
18
article number
374
pages
13 pages
Web of Science type
Article
Web of Science id
000408003200002
ISSN
1471-2105
DOI
10.1186/s12859-017-1784-8
project
Bioinformatics: from nucleotids to networks (N2N)
language
English
UGent publication?
yes
classification
A1
copyright statement
Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)
id
8530189
handle
http://hdl.handle.net/1854/LU-8530189
date created
2017-09-04 12:05:28
date last changed
2017-09-11 07:01:17
@article{8530189,
  abstract     = {Background: Recently, many standalone applications have been proposed to correct sequencing errors in Illumina data. The key idea is that downstream analysis tools such as de novo genome assemblers benefit from a reduced error rate in the input data. Surprisingly, a systematic validation of this assumption using state-of-the-art assembly methods is lacking, even for recently published methods. 
Results: For twelve recent Illumina error correction tools (EC tools) we evaluated both their ability to correct sequencing errors and their ability to improve de novo genome assembly in terms of contig size and accuracy. 
Conclusions: We confirm that most EC tools reduce the number of errors in sequencing data without introducing many new errors. However, we found that many EC tools suffer from poor performance in certain sequence contexts such as regions with low coverage or regions that contain short repeated or low-complexity sequences. Reads overlapping such regions are often ill-corrected in an inconsistent manner, leading to breakpoints in the resulting assemblies that are not present in assemblies obtained from uncorrected data. Resolving this systematic flaw in future EC tools could greatly improve the applicability of such tools.},
  articleno    = {374},
  author       = {Heydari, Mahdi and Miclotte, Giles and Demeester, Piet and Van de Peer, Yves and Fostier, Jan},
  issn         = {1471-2105},
  journal      = {BMC BIOINFORMATICS},
  keyword      = {Next-generation sequencing,Error correction,Illumina,Genome assembly,ACCURATE CORRECTION,SEQUENCING ERRORS,MEMORY-EFFICIENT,BRUIJN GRAPHS,READS,ALIGNMENT},
  language     = {eng},
  pages        = {13},
  title        = {Evaluation of the impact of Illumina error correction tools on de novo genome assembly},
  url          = {http://dx.doi.org/10.1186/s12859-017-1784-8},
  volume       = {18},
  year         = {2017},
}

Chicago
Heydari, Mahdi, Giles Miclotte, Piet Demeester, Yves Van de Peer, and Jan Fostier. 2017. “Evaluation of the Impact of Illumina Error Correction Tools on De Novo Genome Assembly.” Bmc Bioinformatics 18.
APA
Heydari, M., Miclotte, G., Demeester, P., Van de Peer, Y., & Fostier, J. (2017). Evaluation of the impact of Illumina error correction tools on de novo genome assembly. BMC BIOINFORMATICS, 18.
Vancouver
1.
Heydari M, Miclotte G, Demeester P, Van de Peer Y, Fostier J. Evaluation of the impact of Illumina error correction tools on de novo genome assembly. BMC BIOINFORMATICS. 2017;18.
MLA
Heydari, Mahdi, Giles Miclotte, Piet Demeester, et al. “Evaluation of the Impact of Illumina Error Correction Tools on De Novo Genome Assembly.” BMC BIOINFORMATICS 18 (2017): n. pag. Print.