Ghent University Academic Bibliography

Advanced

Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels

Youri Maryn UGent, Paul Corthals UGent, Paul Van Cauwenberge UGent, Nelson Roy and Marc De Bodt UGent (2010) JOURNAL OF VOICE. 24(5). p.540-555
abstract
To improve ecological validity, perceptual and instrumental assessment of disordered voice, including overall voice quality, should ideally sample both sustained vowels and continuous speech. This investigation assessed the utility of combining both voice contexts for the purpose of auditory-perceptual ratings as well as acoustic measurement of overall voice quality. Sustained vowel and continuous speech samples from 251 subjects with (n = 229) or without (n = 22) various voice disorders were concatenated and perceptually rated on overall voice quality by five experienced voice clinicians. After removing the nonvoiced segments within the continuous speech samples, the concatenated samples were analyzed using 13 acoustic measures based on fundamental frequency perturbation, amplitude perturbation, spectral and cepstral analyses. Stepwise multiple regression analysis yielded a six-variable acoustic model for the multiparametric measurement of overall voice quality of the concatenated samples (with a cepstral measure as the main contributor to the prediction of overall voice quality). The correlation of this model with mean ratings of overall voice quality resulted in r(s) = 0.78. A cross-validation approach involving the iterated internal cross-correlations with 30 subgroups of 100, 50, and 10 samples confirmed a comparable degree of association. Furthermore, the ability of the model to distinguish voice-disordered from vocally normal participants was assessed using estimates of diagnostic precision including receiver operating characteristic (ROC) curve analysis, sensitivity, and specificity, as well as likelihood ratios (LRs), which adjust for base-rate differences between the groups. Depending on the cutoff criteria employed, the analyses revealed an impressive area under ROC = 0.895 as well as respectable sensitivity, specificity, and LR. The results support the diagnostic utility of combining voice samples from both continuous speech and sustained vowels in acoustic and perceptual analysis of disordered voice. The findings are discussed in relation to the extant literature and the need for further refinement of the acoustic algorithm.
Please use this url to cite or link to this publication:
author
organization
year
type
journalArticle (original)
publication status
published
subject
keyword
DYSPHONIC VOICE, AERODYNAMIC MEASUREMENTS, PERCEPTUAL EVALUATION, VISUAL ANALOG SCALE, BREATHY VOCAL QUALITY, CEPSTRAL PEAK PROMINENCE, Continuous speech, Sustained vowel, Harmonics-to-noise ratio, Spectral measure, Cepstral measure, Perceptual rating, Overall voice quality, Multivariate acoustic measurement, PATHOLOGICAL VOICE, NORMAL SPEAKERS, GRBAS SCALE, RELIABILITY, Amplitude perturbation
journal title
JOURNAL OF VOICE
J. Voice
volume
24
issue
5
pages
540 - 555
Web of Science type
Article
Web of Science id
000281710100004
JCR category
OTORHINOLARYNGOLOGY
JCR impact factor
1.108 (2010)
JCR rank
20/41 (2010)
JCR quartile
2 (2010)
ISSN
0892-1997
DOI
10.1016/j.jvoice.2008.12.014
language
English
UGent publication?
yes
classification
A1
copyright statement
I have transferred the copyright for this publication to the publisher
id
780271
handle
http://hdl.handle.net/1854/LU-780271
date created
2009-11-12 17:42:03
date last changed
2012-04-18 14:57:58
@article{780271,
  abstract     = {To improve ecological validity, perceptual and instrumental assessment of disordered voice, including overall voice quality, should ideally sample both sustained vowels and continuous speech. This investigation assessed the utility of combining both voice contexts for the purpose of auditory-perceptual ratings as well as acoustic measurement of overall voice quality. Sustained vowel and continuous speech samples from 251 subjects with (n = 229) or without (n = 22) various voice disorders were concatenated and perceptually rated on overall voice quality by five experienced voice clinicians. After removing the nonvoiced segments within the continuous speech samples, the concatenated samples were analyzed using 13 acoustic measures based on fundamental frequency perturbation, amplitude perturbation, spectral and cepstral analyses. Stepwise multiple regression analysis yielded a six-variable acoustic model for the multiparametric measurement of overall voice quality of the concatenated samples (with a cepstral measure as the main contributor to the prediction of overall voice quality). The correlation of this model with mean ratings of overall voice quality resulted in r(s) = 0.78. A cross-validation approach involving the iterated internal cross-correlations with 30 subgroups of 100, 50, and 10 samples confirmed a comparable degree of association. Furthermore, the ability of the model to distinguish voice-disordered from vocally normal participants was assessed using estimates of diagnostic precision including receiver operating characteristic (ROC) curve analysis, sensitivity, and specificity, as well as likelihood ratios (LRs), which adjust for base-rate differences between the groups. Depending on the cutoff criteria employed, the analyses revealed an impressive area under ROC = 0.895 as well as respectable sensitivity, specificity, and LR. The results support the diagnostic utility of combining voice samples from both continuous speech and sustained vowels in acoustic and perceptual analysis of disordered voice. The findings are discussed in relation to the extant literature and the need for further refinement of the acoustic algorithm.},
  author       = {Maryn, Youri and Corthals, Paul and Van Cauwenberge, Paul and Roy, Nelson and De Bodt, Marc},
  issn         = {0892-1997},
  journal      = {JOURNAL OF VOICE},
  keyword      = {DYSPHONIC VOICE,AERODYNAMIC MEASUREMENTS,PERCEPTUAL EVALUATION,VISUAL ANALOG SCALE,BREATHY VOCAL QUALITY,CEPSTRAL PEAK PROMINENCE,Continuous speech,Sustained vowel,Harmonics-to-noise ratio,Spectral measure,Cepstral measure,Perceptual rating,Overall voice quality,Multivariate acoustic measurement,PATHOLOGICAL VOICE,NORMAL SPEAKERS,GRBAS SCALE,RELIABILITY,Amplitude perturbation},
  language     = {eng},
  number       = {5},
  pages        = {540--555},
  title        = {Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels},
  url          = {http://dx.doi.org/10.1016/j.jvoice.2008.12.014},
  volume       = {24},
  year         = {2010},
}

Chicago
Maryn, Youri, Paul Corthals, Paul Van Cauwenberge, Nelson Roy, and Marc De Bodt. 2010. “Toward Improved Ecological Validity in the Acoustic Measurement of Overall Voice Quality: Combining Continuous Speech and Sustained Vowels.” Journal of Voice 24 (5): 540–555.
APA
Maryn, Y., Corthals, P., Van Cauwenberge, P., Roy, N., & De Bodt, M. (2010). Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. JOURNAL OF VOICE, 24(5), 540–555.
Vancouver
1.
Maryn Y, Corthals P, Van Cauwenberge P, Roy N, De Bodt M. Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. JOURNAL OF VOICE. 2010;24(5):540–55.
MLA
Maryn, Youri, Paul Corthals, Paul Van Cauwenberge, et al. “Toward Improved Ecological Validity in the Acoustic Measurement of Overall Voice Quality: Combining Continuous Speech and Sustained Vowels.” JOURNAL OF VOICE 24.5 (2010): 540–555. Print.