Panako: a scalable acoustic fingerprinting system handling time-scale and pitch modification

Joren Six (UGent) and Marc Leman (UGent)
(2014) International Society for Music Information Retrieval, Proceedings. Proceedings of the 15th Conference of the International Society for Music Information Retrieval (ISMIR 2014).
Abstract
In this paper a scalable granular acoustic fingerprinting system robust against time and pitch scale modification is presented. The aim of acoustic fingerprinting is to identify identical, or recognize similar, audio fragments in a large set using condensed representations of audio signals, i.e. fingerprints. A robust fingerprinting system generates similar fingerprints for perceptually similar audio signals. The new system, presented here, handles a variety of distortions well. It is designed to be robust against pitch shifting, time stretching and tempo changes, while remaining scalable. After a query, the system returns the start time in the reference audio, and the amount of pitch shift and tempo change that has been applied. The design of the system that offers this unique combination of features is the main contribution of this research. The fingerprint itself consists of a combination of key points in a Constant-Q spectrogram. The system is evaluated on commodity hardware using a freely available reference database with fingerprints of over 30.000 songs. The results show that the system responds quickly and reliably on queries, while handling time and pitch scale modifications of up to ten percent.
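The abstract states that a query returns the start time in the reference audio together with the applied pitch shift and tempo change. The following is a minimal sketch of how such values could be derived from matched key points. It is not the Panako implementation: the function name `estimate_modification`, the tuple layout of `matches`, and the use of medians over consecutive pairs are all assumptions made for illustration.

```python
import math
from statistics import median


def estimate_modification(matches):
    """Estimate alignment parameters from matched key points.

    Hypothetical input: a list of (t_query, f_query, t_ref, f_ref) tuples,
    with times in seconds and frequencies in Hz. This layout is an assumption
    for illustration and is not taken from the paper.

    Returns (start_in_ref, time_factor, pitch_shift_cents).
    """
    ordered = sorted(matches, key=lambda m: m[0])

    # Time factor: ratio of elapsed query time to elapsed reference time
    # between consecutive matched key points.
    ratios = []
    for (tq1, _, tr1, _), (tq2, _, tr2, _) in zip(ordered, ordered[1:]):
        if tq2 != tq1 and tr2 != tr1:
            ratios.append((tq2 - tq1) / (tr2 - tr1))
    if not ratios:
        raise ValueError("need at least two matches at distinct times")
    time_factor = median(ratios)

    # Pitch shift in cents: 1200 * log2 of the frequency ratio.
    pitch_shift_cents = median(
        1200.0 * math.log2(fq / fr) for _, fq, _, fr in ordered
    )

    # Reference time that corresponds to query time zero.
    start_in_ref = median(tr - tq / time_factor for tq, _, tr, _ in ordered)

    return start_in_ref, time_factor, pitch_shift_cents
```

With matches taken from a query that starts 10 seconds into the reference, is slowed down by 5 percent and is raised by one semitone, the function returns values close to `(10.0, 1.05, 100.0)`. Medians are used here only so that a few spurious matches do not dominate the estimate.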
Keywords
Music Information Retrieval, Acoustic Fingerprinting, Content Based Music Search, Information Retrieval

Citation

Chicago
Six, Joren, and Marc Leman. 2014. “Panako: a Scalable Acoustic Fingerprinting System Handling Time-scale and Pitch Modification.” In International Society for Music Information Retrieval, Proceedings, ed. Hsin-Min Wang, Yi-Hsuan Yang, and Yin Ha Lee. Vol. Proceedings of the 15th Conference of the International Society for Music Information Retrieval (ISMIR 2014).
APA
Six, J., & Leman, M. (2014). Panako: a scalable acoustic fingerprinting system handling time-scale and pitch modification. In H.-M. Wang, Y.-H. Yang, & Y. H. Lee (Eds.), International Society for Music Information Retrieval, Proceedings (Vol. Proceedings of the 15th Conference of the International Society for Music Information Retrieval (ISMIR 2014)). Presented at the 15th International Society for Music Information Retrieval Conference (ISMIR - 2014).
Vancouver
1. Six J, Leman M. Panako: a scalable acoustic fingerprinting system handling time-scale and pitch modification. In: Wang H-M, Yang Y-H, Lee YH, editors. International Society for Music Information Retrieval, Proceedings. 2014.
MLA
Six, Joren, and Marc Leman. “Panako: a Scalable Acoustic Fingerprinting System Handling Time-scale and Pitch Modification.” International Society for Music Information Retrieval, Proceedings. Ed. Hsin-Min Wang, Yi-Hsuan Yang, & Yin Ha Lee. Proceedings of the 15th Conference of the International Society for Music Information Retrieval (ISMIR 2014). 2014. Print.
@inproceedings{5754913,
  abstract     = {In this paper a scalable granular acoustic fingerprinting system robust against time and pitch scale modification is presented. The aim of acoustic fingerprinting is to identify identical, or recognize similar, audio fragments in a large set using condensed representations of audio signals, i.e. fingerprints. A robust fingerprinting system generates similar fingerprints for perceptually similar audio signals. The new system, presented here, handles a variety of distortions well. It is designed to be robust against pitch shifting, time stretching and tempo changes, while remaining scalable. After a query, the system returns the start time in the reference audio, and the amount of pitch shift and tempo change that has been applied. The design of the system that offers this unique combination of features is the main contribution of this research. The fingerprint itself consists of a combination of key points in a Constant-Q spectrogram. The system is evaluated on commodity hardware using a freely available reference database with fingerprints of over 30.000 songs. The results show that the system responds quickly and reliably on queries, while handling time and pitch scale modifications of up to ten percent.},
  author       = {Six, Joren and Leman, Marc},
  booktitle    = {International Society for Music Information Retrieval, Proceedings},
  editor       = {Wang, Hsin-Min and Yang, Yi-Hsuan and Lee, Yin Ha},
  keyword      = {Music Information Retrieval,Acoustic Fingerprinting,Content Based Music Search,Information Retrieval},
  language     = {eng},
  location     = {Taipei, Taiwan},
  pages        = {6},
  title        = {Panako: a scalable acoustic fingerprinting system handling time-scale and pitch modification},
  url          = {http://www.terasoft.com.tw/conf/ismir2014/proceedings/T048\_122\_Paper.pdf},
  volume       = {Proceedings of the 15th Conference of the International Society for Music Information Retrieval (ISMIR 2014)},
  year         = {2014},
}