The age of data-driven proteomics : how machine learning enables novel workflows
- Author
- Robbin Bouwmeester (UGent) , Ralf Gabriels (UGent) , Tim Van Den Bossche (UGent) , Lennart Martens (UGent) and Sven Degroeve (UGent)
- Organization
- Project
- Abstract
- A lot of energy in the field of proteomics is dedicated to the application of challenging experimental workflows, which include metaproteomics, proteogenomics, data independent acquisition (DIA), non-specific proteolysis, immunopeptidomics, and open modification searches. These workflows are all challenging because of ambiguity in the identification stage; they either expand the search space and thus increase the ambiguity of identifications, or, in the case of DIA, they generate data that is inherently more ambiguous. In this context, machine learning-based predictive models are now generating considerable excitement in the field of proteomics because these predictive models hold great potential to drastically reduce the ambiguity in the identification process of the above-mentioned workflows. Indeed, the field has already produced classical machine learning and deep learning models to predict almost every aspect of a liquid chromatography-mass spectrometry (LC-MS) experiment. Yet despite all the excitement, thorough integration of predictive models in these challenging LC-MS workflows is still limited, and further improvements to the modeling and validation procedures can still be made. In this viewpoint we therefore point out highly promising recent machine learning developments in proteomics, alongside some of the remaining challenges.
- Keywords
- Machine Learning, Deep Learning, Data-Driven Modeling, Proteomics, Mass Spectrometry, machine learning, CROSS-SECTION VALUES, MASS-SPECTROMETRY, CHROMATOGRAPHIC RETENTION, PEPTIDE IDENTIFICATION, PROTEIN IDENTIFICATION, SHOTGUN PROTEOMICS, PREDICTION, SEARCH, SPECTRA, FRAGMENTATION
Downloads
-
(...).pdf
- full text (Published version)
- |
- UGent only
- |
- |
- 569.83 KB
Citation
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-8656489
- MLA
- Bouwmeester, Robbin, et al. “The Age of Data-Driven Proteomics : How Machine Learning Enables Novel Workflows.” PROTEOMICS, vol. 20, no. 21–22, 2020, doi:10.1002/pmic.201900351.
- APA
- Bouwmeester, R., Gabriels, R., Van Den Bossche, T., Martens, L., & Degroeve, S. (2020). The age of data-driven proteomics : how machine learning enables novel workflows. PROTEOMICS, 20(21–22). https://doi.org/10.1002/pmic.201900351
- Chicago author-date
- Bouwmeester, Robbin, Ralf Gabriels, Tim Van Den Bossche, Lennart Martens, and Sven Degroeve. 2020. “The Age of Data-Driven Proteomics : How Machine Learning Enables Novel Workflows.” PROTEOMICS 20 (21–22). https://doi.org/10.1002/pmic.201900351.
- Chicago author-date (all authors)
- Bouwmeester, Robbin, Ralf Gabriels, Tim Van Den Bossche, Lennart Martens, and Sven Degroeve. 2020. “The Age of Data-Driven Proteomics : How Machine Learning Enables Novel Workflows.” PROTEOMICS 20 (21–22). doi:10.1002/pmic.201900351.
- Vancouver
- 1.Bouwmeester R, Gabriels R, Van Den Bossche T, Martens L, Degroeve S. The age of data-driven proteomics : how machine learning enables novel workflows. PROTEOMICS. 2020;20(21–22).
- IEEE
- [1]R. Bouwmeester, R. Gabriels, T. Van Den Bossche, L. Martens, and S. Degroeve, “The age of data-driven proteomics : how machine learning enables novel workflows,” PROTEOMICS, vol. 20, no. 21–22, 2020.
@article{8656489,
abstract = {{A lot of energy in the field of proteomics is dedicated to the application of challenging experimental workflows, which include metaproteomics, proteogenomics, data independent acquisition (DIA), non-specific proteolysis, immunopeptidomics, and open modification searches. These workflows are all challenging because of ambiguity in the identification stage; they either expand the search space and thus increase the ambiguity of identifications, or, in the case of DIA, they generate data that is inherently more ambiguous. In this context, machine learning-based predictive models are now generating considerable excitement in the field of proteomics because these predictive models hold great potential to drastically reduce the ambiguity in the identification process of the above-mentioned workflows. Indeed, the field has already produced classical machine learning and deep learning models to predict almost every aspect of a liquid chromatography-mass spectrometry (LC-MS) experiment. Yet despite all the excitement, thorough integration of predictive models in these challenging LC-MS workflows is still limited, and further improvements to the modeling and validation procedures can still be made. In this viewpoint we therefore point out highly promising recent machine learning developments in proteomics, alongside some of the remaining challenges.}},
articleno = {{1900351}},
author = {{Bouwmeester, Robbin and Gabriels, Ralf and Van Den Bossche, Tim and Martens, Lennart and Degroeve, Sven}},
issn = {{1615-9853}},
journal = {{PROTEOMICS}},
keywords = {{Machine Learning,Deep Learning,Data-Driven Modeling,Proteomics,Mass Spectrometry,machine learning,CROSS-SECTION VALUES,MASS-SPECTROMETRY,CHROMATOGRAPHIC RETENTION,PEPTIDE IDENTIFICATION,PROTEIN IDENTIFICATION,SHOTGUN PROTEOMICS,PREDICTION,SEARCH,SPECTRA,FRAGMENTATION}},
language = {{eng}},
number = {{21-22}},
pages = {{6}},
title = {{The age of data-driven proteomics : how machine learning enables novel workflows}},
url = {{http://doi.org/10.1002/pmic.201900351}},
volume = {{20}},
year = {{2020}},
}
- Altmetric
- View in Altmetric
- Web of Science
- Times cited: