Included here are downloadable pre-prints of published papers arising from this project: (1) Roberts, B., Summers, R.J., and Bailey, P.J. (2010). “The perceptual organization of sine-wave speech under competitive conditions,” Journal of the Acoustical Society of America, 128, 804-817. Pre-print
(2) Summers, R.J., Bailey, P.J., and Roberts, B. (2010). “Effects of differences in fundamental frequency on across-formant grouping in speech perception,” Journal of the Acoustical Society of America, 128, 3667-3677. Pre-print
(3) Roberts, B., Summers, R.J., and Bailey, P.J. (2011). “The intelligibility of noise-vocoded speech: Spectral information available from across-channel comparison of amplitude envelopes,” Proceedings of the Royal Society of London Series B: Biological Sciences, 278, 1595-1600. Pre-print
(4) Summers, R.J. Bailey, P.J. Roberts, B. (2012) Effects of the Rate of Formant-Frequency Variation on the Grouping of Formants in Speech Perception, Journal of the Association for Research in Otolaryngology, 13(2), 269-280. Pre-print
(5) Roberts, B., Summers, R.J., and Bailey, P.J. (2014). “Formant-frequency variation and informational masking of speech by extraneous formants: Evidence against dynamic and speech-specific acoustical constraints," Journal of Experimental Psychology: Human Perception & Performance. Online First Publication, 19th May, 2014. Online First (Open Access)
Included here are posters containing material from the project which has not yet appeared in published journal articles. See also "papers under submission and in preparation."
(1) Poster by Roberts, Summers, and Bailey (presented in September 2010). The perceptual organization of noise-vocoded speech under competitive conditions
(2) Poster by Roberts, Summers, and Bailey (presented in May 2011). The role of formant-frequency contours in the perceptual grouping of speech formants
Included here are posters containing material from the part of the project relating directly to Marcin Stachurski's research towards the PhD. None of this material has yet appeared in the form of published journal articles. Marcin is currently writing up his thesis.
(1) Poster by Stachurski, Summers, and Roberts (presented in September 2009). Grouping and the Verbal Transformation Effect - The influence of fundamental frequency, ear of presentation, and interaural time-difference cues
(2) Poster by Stachurski, Summers, and Roberts (presented in May 2011). Grouping and the Verbal Transformation Effect - The influence of formant transitions
Our approach was to generate artificial speech-like stimuli with precisely controlled properties, particularly the spectral prominences called formants. These are important because they arise as a result of resonances in the air-filled cavities of the talker’s vocal tract. Variation in the frequency and amplitude of a formant is an inevitable consequence of change in the size of its associated cavity as the tongue, lips, and jaw move when the talker produces speech. Hence, knowledge of formant frequencies and their change over time is of great benefit to listeners trying to understand a spoken message, and so choosing the right set of formants from a mixture is critical for intelligibility. Simplified versions of target sentences were synthesised and then mixed with carefully designed “competitors” offering alternative grouping possibilities for the formants in the target sentence. The impact of these competitors on listeners’ recognition of the target sentence in the mixture was measured as the properties of the competitors were manipulated.
The key findings of the project are: (a) Modulation of the formant-frequency contour, but not the amplitude contour, is critical for across-formant grouping; (b) The ability of listeners to reject a competitor formant declines as either the rate or depth of modulation of its frequency contour increases, relative to that of the target sentence; (c) The impact of a competitor does not depend on whether its pattern of variation in formant frequency is plausibly speech-like; (d) The ability of listeners to reject a competitor increases as the pitch difference between target and competitor formants increases; (e) Formant-frequency variation conveys information important for speech intelligibility even in contexts often regarded as conveying information about speech-sound identity mainly through other cues. In summary, the results of this project have shown that our ability to segregate a talker’s speech from a sound mixture depends heavily on general-purpose grouping principles and rather less on speech-specific principles than has been suggested by some researchers. The results also suggest approaches by which engineers and computer scientists might improve the performance of devices such as hearing aids and automatic speech recognizers when they are operating in noisy environments.
Last updated 20 June 2014