Abstract
Pupillometry has recently been introduced as a method to evaluate cognitive workload of synthetic speech. Prior research conducted on English speech indicates that in noisy listening conditions, pupil dilation is significantly
higher for synthetic speech as compared to natural speech. In a lab-based listening experiment, we evaluated participants' (n=16) pupil responses to Japanese speech (natural vs. synthetic) at three different signal-to-noise levels (-1dB, -3dB and -5dB). Our research expands on previous work by evaluating pupillary responses both in terms of temporal changes in pupil size and degree of pupil oscillations. We observe statistically significant differences in pupil sizes at the recall stage between each type of speech. For pupil oscillations, we register statistically significant differences in frequency power spectrum densities (PSDs). Our investigation proposes an expansion of the current synthetic speech evaluation methods that are based on pupillary responses and outlines possible avenues for future research that arise from the findings of this work.
higher for synthetic speech as compared to natural speech. In a lab-based listening experiment, we evaluated participants' (n=16) pupil responses to Japanese speech (natural vs. synthetic) at three different signal-to-noise levels (-1dB, -3dB and -5dB). Our research expands on previous work by evaluating pupillary responses both in terms of temporal changes in pupil size and degree of pupil oscillations. We observe statistically significant differences in pupil sizes at the recall stage between each type of speech. For pupil oscillations, we register statistically significant differences in frequency power spectrum densities (PSDs). Our investigation proposes an expansion of the current synthetic speech evaluation methods that are based on pupillary responses and outlines possible avenues for future research that arise from the findings of this work.
Original language | English |
---|---|
Pages | 335-342 |
Number of pages | 8 |
DOIs | |
Publication status | Published - 11 Feb 2021 |
Event | BIOSIGNALS 2021: 14th International Conference on Bio-Inspired Systems and Signal Processing - Online Event Duration: 11 Feb 2021 → 13 Feb 2021 http://www.biosignals.biostec.org/Home.aspx |
Conference
Conference | BIOSIGNALS 2021 |
---|---|
Period | 11/02/21 → 13/02/21 |
Internet address |
Keywords
- speech synthesis
- eye tracking
- signal processing
- cognitive workload