Advanced search
1 file | 996.20 KB Add to list

Investigating time series classification techniques for rapid pathogen identification with single-cell MALDI-TOF mass spectrum data

Author
Organization
Abstract
Matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry (MALDI-TOF-MS) is a well-known technology, widely used in species identification. Specifically, MALDI-TOF-MS is applied on samples that usually include bacterial cells, generating representative signals for the various bacterial species. However, for a reliable identification result, a significant amount of biomass is required. For most samples used for diagnostics of infectious diseases, the sample volume is extremely low to obtain the required amount of biomass. Therefore, amplification of the bacterial load is performed by a culturing phase. If the MALDI process could be applied to individual bacteria, it would be possible to circumvent the need for culturing and isolation, accelerating the whole process. In this paper, we briefly describe an implementation of a MALDI-TOF MS procedure in a setting of individual cells and we demonstrate the use of the produced data for the application of pathogen identification. The identification of pathogens (bacterial species) is performed by using machine learning algorithms on the generated single-cell signals. The high predictive performance of the machine learning models indicates that the produced bacterial signatures constitute an informative representation, helpful in distinguishing the different bacterial species. In addition, we reformulate the bacterial species identification problem as a time series classification task by considering the intensity sequences of a given spectrum as time series values. Experimental results show that algorithms originally introduced for time series analysis are beneficial in modelling observations of single-cell MALDI-TOF MS.
Keywords
MALDI-TOF MS, Single-cell spectrum, Single-ionization-event, Classification, Machine learning methods, SPECTROMETRY Bacterial species identification, Time series

Downloads

  • Investigating Time Series.pdf
    • full text (Accepted manuscript)
    • |
    • open access
    • |
    • PDF
    • |
    • 996.20 KB

Citation

Please use this url to cite or link to this publication:

MLA
Papagiannopoulou, Christina, et al. “Investigating Time Series Classification Techniques for Rapid Pathogen Identification with Single-Cell MALDI-TOF Mass Spectrum Data.” Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019, vol. 11908, Springer, 2020, pp. 416–31, doi:10.1007/978-3-030-46133-1_25.
APA
Papagiannopoulou, C., Parchen, R., & Waegeman, W. (2020). Investigating time series classification techniques for rapid pathogen identification with single-cell MALDI-TOF mass spectrum data. Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019, 11908, 416–431. https://doi.org/10.1007/978-3-030-46133-1_25
Chicago author-date
Papagiannopoulou, Christina, René Parchen, and Willem Waegeman. 2020. “Investigating Time Series Classification Techniques for Rapid Pathogen Identification with Single-Cell MALDI-TOF Mass Spectrum Data.” In Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019, 11908:416–31. Springer. https://doi.org/10.1007/978-3-030-46133-1_25.
Chicago author-date (all authors)
Papagiannopoulou, Christina, René Parchen, and Willem Waegeman. 2020. “Investigating Time Series Classification Techniques for Rapid Pathogen Identification with Single-Cell MALDI-TOF Mass Spectrum Data.” In Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019, 11908:416–431. Springer. doi:10.1007/978-3-030-46133-1_25.
Vancouver
1.
Papagiannopoulou C, Parchen R, Waegeman W. Investigating time series classification techniques for rapid pathogen identification with single-cell MALDI-TOF mass spectrum data. In: Machine Learning and Knowledge Discovery in Databases ECML PKDD 2019. Springer; 2020. p. 416–31.
IEEE
[1]
C. Papagiannopoulou, R. Parchen, and W. Waegeman, “Investigating time series classification techniques for rapid pathogen identification with single-cell MALDI-TOF mass spectrum data,” in Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019, Würzburg, Germany, 2020, vol. 11908, pp. 416–431.
@inproceedings{8647039,
  abstract     = {{Matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry (MALDI-TOF-MS) is a well-known technology, widely used in species identification. Specifically, MALDI-TOF-MS is applied on samples that usually include bacterial cells, generating representative signals for the various bacterial species. However, for a reliable identification result, a significant amount of biomass is required. For most samples used for diagnostics of infectious diseases, the sample volume is extremely low to obtain the required amount of biomass. Therefore, amplification of the bacterial load is performed by a culturing phase. If the MALDI process could be applied to individual bacteria, it would be possible to circumvent the need for culturing and isolation, accelerating the whole process. In this paper, we briefly describe an implementation of a MALDI-TOF MS procedure in a setting of individual cells and we demonstrate the use of the produced data for the application of pathogen identification. The identification of pathogens (bacterial species) is performed by using machine learning algorithms on the generated single-cell signals. The high predictive performance of the machine learning models indicates that the produced bacterial signatures constitute an informative representation, helpful in distinguishing the different bacterial species. In addition, we reformulate the bacterial species identification problem as a time series classification task by considering the intensity sequences of a given spectrum as time series values. Experimental results show that algorithms originally introduced for time series analysis are beneficial in modelling observations of single-cell MALDI-TOF MS.}},
  author       = {{Papagiannopoulou, Christina and Parchen, René and Waegeman, Willem}},
  booktitle    = {{Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019}},
  isbn         = {{9783030461324}},
  issn         = {{0302-9743}},
  keywords     = {{MALDI-TOF MS,Single-cell spectrum,Single-ionization-event,Classification,Machine learning methods,SPECTROMETRY Bacterial species identification,Time series}},
  language     = {{eng}},
  location     = {{Würzburg, Germany}},
  pages        = {{416--431}},
  publisher    = {{Springer}},
  title        = {{Investigating time series classification techniques for rapid pathogen identification with single-cell MALDI-TOF mass spectrum data}},
  url          = {{http://dx.doi.org/10.1007/978-3-030-46133-1_25}},
  volume       = {{11908}},
  year         = {{2020}},
}

Altmetric
View in Altmetric
Web of Science
Times cited: