Classification of collagens via peptide ambiguation, in a paleoproteomic LC-MS/MS-based taxonomic pipeline
- Author
- Ian Engels (UGent) , Alexandra Burnett (UGent) , Prudence Robert (UGent) , Camille Pironneau (UGent) , Grégory Abrams (UGent) , Robbin Bouwmeester (UGent) , Peter Van Der Plaetsen, Kévin Di Modica, Marcel Otte, Lawrence Guy Straus, Valentin Fischer, Fabrice Bray, Bart Mesuere (UGent) , Isabelle De Groote (UGent) , Dieter Deforce (UGent) , Simon Daled (UGent) and Maarten Dhaenens (UGent)
- Organization
- Project
- Abstract
- Liquid chromatography–mass spectrometry (LC-MS/MS) extends the matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) Zooarcheology by Mass Spectrometry (ZooMS) “mass fingerprinting” approach to species identification by providing fragmentation spectra for each peptide. However, ancient bone samples generate sparse data containing only a few collagen proteins, rendering target–decoy strategies unusable and increasing uncertainty in peptide annotation. To ameliorate this issue, we present a ZooMS/MS data pipeline that builds on a manually curated Collagen database and comprises two novel algorithms: isoBLAST and ClassiCOL. isoBLAST first extends peptide ambiguity by generating all “potential peptide candidates” isobaric to the annotated precursor. The exhaustive set of candidates created is then used to retain or reject different potential paths at each taxonomic branching point from superkingdom to species, until the greatest possible specificity is reached. Uniquely, ClassiCOL allows for the identification of taxonomic mixtures, including contaminated samples, as well as suggesting taxonomies not represented in sequence databases, including extinct taxa. All considered ambiguity is then graphically represented with clear prioritization of the potential taxa in the sample. Using public as well as in-house data acquired on different instruments, we demonstrate the performance of this universal postprocessing and explore the identification of both genetic and sample mixtures. Diet reconstruction from 40,000-year-old cave hyena coprolites illustrates the exciting potential of this approach.
- Keywords
- Palaeoproteomics, Bioinformatics, Mass spectrometry, Archaeology, Belgium, isoBLAST, ClassiCOL, paleoproteomics, proteomics, archeology, bioinformatics, ZooMS, ZooMS/MS, DECOY SEARCH STRATEGY, MASS-SPECTROMETRY, IDENTIFICATION, BONE, PROTEINS, PLATFORM, REVEALS, ART
Downloads
-
(...).pdf
- full text (Published version)
- |
- UGent only
- |
- |
- 15.49 MB
Citation
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-01JVMF1D6MHQWMB87V0WDZ5ABK
- MLA
- Engels, Ian, et al. “Classification of Collagens via Peptide Ambiguation, in a Paleoproteomic LC-MS/MS-Based Taxonomic Pipeline.” JOURNAL OF PROTEOME RESEARCH, vol. 24, no. 4, 2025, pp. 1907–25, doi:10.1021/acs.jproteome.4c00962.
- APA
- Engels, I., Burnett, A., Robert, P., Pironneau, C., Abrams, G., Bouwmeester, R., … Dhaenens, M. (2025). Classification of collagens via peptide ambiguation, in a paleoproteomic LC-MS/MS-based taxonomic pipeline. JOURNAL OF PROTEOME RESEARCH, 24(4), 1907–1925. https://doi.org/10.1021/acs.jproteome.4c00962
- Chicago author-date
- Engels, Ian, Alexandra Burnett, Prudence Robert, Camille Pironneau, Grégory Abrams, Robbin Bouwmeester, Peter Van Der Plaetsen, et al. 2025. “Classification of Collagens via Peptide Ambiguation, in a Paleoproteomic LC-MS/MS-Based Taxonomic Pipeline.” JOURNAL OF PROTEOME RESEARCH 24 (4): 1907–25. https://doi.org/10.1021/acs.jproteome.4c00962.
- Chicago author-date (all authors)
- Engels, Ian, Alexandra Burnett, Prudence Robert, Camille Pironneau, Grégory Abrams, Robbin Bouwmeester, Peter Van Der Plaetsen, Kévin Di Modica, Marcel Otte, Lawrence Guy Straus, Valentin Fischer, Fabrice Bray, Bart Mesuere, Isabelle De Groote, Dieter Deforce, Simon Daled, and Maarten Dhaenens. 2025. “Classification of Collagens via Peptide Ambiguation, in a Paleoproteomic LC-MS/MS-Based Taxonomic Pipeline.” JOURNAL OF PROTEOME RESEARCH 24 (4): 1907–1925. doi:10.1021/acs.jproteome.4c00962.
- Vancouver
- 1.Engels I, Burnett A, Robert P, Pironneau C, Abrams G, Bouwmeester R, et al. Classification of collagens via peptide ambiguation, in a paleoproteomic LC-MS/MS-based taxonomic pipeline. JOURNAL OF PROTEOME RESEARCH. 2025;24(4):1907–25.
- IEEE
- [1]I. Engels et al., “Classification of collagens via peptide ambiguation, in a paleoproteomic LC-MS/MS-based taxonomic pipeline,” JOURNAL OF PROTEOME RESEARCH, vol. 24, no. 4, pp. 1907–1925, 2025.
@article{01JVMF1D6MHQWMB87V0WDZ5ABK,
abstract = {{Liquid chromatography–mass spectrometry (LC-MS/MS) extends the matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) Zooarcheology by Mass Spectrometry (ZooMS) “mass fingerprinting” approach to species identification by providing fragmentation spectra for each peptide. However, ancient bone samples generate sparse data containing only a few collagen proteins, rendering target–decoy strategies unusable and increasing uncertainty in peptide annotation. To ameliorate this issue, we present a ZooMS/MS data pipeline that builds on a manually curated Collagen database and comprises two novel algorithms: isoBLAST and ClassiCOL. isoBLAST first extends peptide ambiguity by generating all “potential peptide candidates” isobaric to the annotated precursor. The exhaustive set of candidates created is then used to retain or reject different potential paths at each taxonomic branching point from superkingdom to species, until the greatest possible specificity is reached. Uniquely, ClassiCOL allows for the identification of taxonomic mixtures, including contaminated samples, as well as suggesting taxonomies not represented in sequence databases, including extinct taxa. All considered ambiguity is then graphically represented with clear prioritization of the potential taxa in the sample. Using public as well as in-house data acquired on different instruments, we demonstrate the performance of this universal postprocessing and explore the identification of both genetic and sample mixtures. Diet reconstruction from 40,000-year-old cave hyena coprolites illustrates the exciting potential of this approach.}},
author = {{Engels, Ian and Burnett, Alexandra and Robert, Prudence and Pironneau, Camille and Abrams, Grégory and Bouwmeester, Robbin and Van Der Plaetsen, Peter and Di Modica, Kévin and Otte, Marcel and Straus, Lawrence Guy and Fischer, Valentin and Bray, Fabrice and Mesuere, Bart and De Groote, Isabelle and Deforce, Dieter and Daled, Simon and Dhaenens, Maarten}},
issn = {{1535-3893}},
journal = {{JOURNAL OF PROTEOME RESEARCH}},
keywords = {{Palaeoproteomics,Bioinformatics,Mass spectrometry,Archaeology,Belgium,isoBLAST,ClassiCOL,paleoproteomics,proteomics,archeology,bioinformatics,ZooMS,ZooMS/MS,DECOY SEARCH STRATEGY,MASS-SPECTROMETRY,IDENTIFICATION,BONE,PROTEINS,PLATFORM,REVEALS,ART}},
language = {{eng}},
number = {{4}},
pages = {{1907--1925}},
title = {{Classification of collagens via peptide ambiguation, in a paleoproteomic LC-MS/MS-based taxonomic pipeline}},
url = {{http://doi.org/10.1021/acs.jproteome.4c00962}},
volume = {{24}},
year = {{2025}},
}
- Altmetric
- View in Altmetric
- Web of Science
- Times cited: