Advanced search
1 file | 1.92 MB Add to list

Exploring annoyance in a soundscape context by joint prediction of sound source and annoyance

Author
Organization
Project
Abstract
Soundscape, the sonic environment as perceived and understood by people, is a conglomerate of different sounds. It has been established that its appraisal by instantaneous annoyance is not solely determined by its calculated loudness, but also by recognised sounds. Hence, most previous research on annoyance has focused on single-source environments. Audio analytics aims at detecting and classifying sound sources, but does not explore human perception of these. This paper proposes a dual-input model to simultaneously perform sound source classification (SSC) and human annoyance rating prediction (ARP). The model takes mel features and root-mean-square value (rms) features as input, and uses convolutional blocks to extract high-level acoustic features. These are used to predict sound source classes and to estimate the human annoyance rating for the whole fragment. Experiments on the DeLTA dataset show that: 1) models using mel features and rms features outperform models using only one of them; 2) The proposed model achieves a SSC accuracy of 90.06%, and an ARP (scale 1 to 10) root mean square error of 1.05.

Downloads

  • ACUS 652.pdf
    • full text (Published version)
    • |
    • open access
    • |
    • PDF
    • |
    • 1.92 MB

Citation

Please use this url to cite or link to this publication:

MLA
Hou, Yuanbo, et al. “Exploring Annoyance in a Soundscape Context by Joint Prediction of Sound Source and Annoyance.” Forum Acusticum 2023 : 10th Convention of the European Acoustics Association, Proceedings, 2023, pp. 1–5.
APA
Hou, Y., Mitchell, A., Ren, Q., Aletta, F., Kang, J., & Botteldooren, D. (2023). Exploring annoyance in a soundscape context by joint prediction of sound source and annoyance. Forum Acusticum 2023 : 10th Convention of the European Acoustics Association, Proceedings, 1–5.
Chicago author-date
Hou, Yuanbo, Andrew Mitchell, Qiaoqiao Ren, Francesco Aletta, Jian Kang, and Dick Botteldooren. 2023. “Exploring Annoyance in a Soundscape Context by Joint Prediction of Sound Source and Annoyance.” In Forum Acusticum 2023 : 10th Convention of the European Acoustics Association, Proceedings, 1–5.
Chicago author-date (all authors)
Hou, Yuanbo, Andrew Mitchell, Qiaoqiao Ren, Francesco Aletta, Jian Kang, and Dick Botteldooren. 2023. “Exploring Annoyance in a Soundscape Context by Joint Prediction of Sound Source and Annoyance.” In Forum Acusticum 2023 : 10th Convention of the European Acoustics Association, Proceedings, 1–5.
Vancouver
1.
Hou Y, Mitchell A, Ren Q, Aletta F, Kang J, Botteldooren D. Exploring annoyance in a soundscape context by joint prediction of sound source and annoyance. In: Forum Acusticum 2023 : 10th Convention of the European Acoustics Association, Proceedings. 2023. p. 1–5.
IEEE
[1]
Y. Hou, A. Mitchell, Q. Ren, F. Aletta, J. Kang, and D. Botteldooren, “Exploring annoyance in a soundscape context by joint prediction of sound source and annoyance,” in Forum Acusticum 2023 : 10th Convention of the European Acoustics Association, Proceedings, Turin, Italy, 2023, pp. 1–5.
@inproceedings{01HHHRMB1CFBX7E12YQXJ0G9P2,
  abstract     = {{Soundscape, the sonic environment as perceived and understood by people, is a conglomerate of different sounds.
It has been established that its appraisal by instantaneous
annoyance is not solely determined by its calculated loudness, but also by recognised sounds. Hence, most previous
research on annoyance has focused on single-source environments. Audio analytics aims at detecting and classifying sound sources, but does not explore human perception
of these. This paper proposes a dual-input model to simultaneously perform sound source classification (SSC) and
human annoyance rating prediction (ARP). The model
takes mel features and root-mean-square value (rms) features as input, and uses convolutional blocks to extract
high-level acoustic features. These are used to predict
sound source classes and to estimate the human annoyance rating for the whole fragment. Experiments on the
DeLTA dataset show that: 1) models using mel features
and rms features outperform models using only one of
them; 2) The proposed model achieves a SSC accuracy
of 90.06%, and an ARP (scale 1 to 10) root mean square
error of 1.05.}},
  author       = {{Hou, Yuanbo and Mitchell, Andrew and Ren, Qiaoqiao and Aletta, Francesco and Kang, Jian and Botteldooren, Dick}},
  booktitle    = {{Forum Acusticum 2023 : 10th Convention of the European Acoustics Association, Proceedings}},
  isbn         = {{9788888942674}},
  issn         = {{2221-3767}},
  language     = {{eng}},
  location     = {{Turin, Italy}},
  pages        = {{1--5}},
  title        = {{Exploring annoyance in a soundscape context by joint prediction of sound source and annoyance}},
  year         = {{2023}},
}