An Introduction to Sine-Wave Speech

Matt Davis

MRC Cognition and Brain Sciences Unit
Chaucer Road
Cambridge CB2 7EF.

1) Introduction:

Sine-wave speech is a form of artificially degraded speech first developed at Haskins Laboratories. Several seminal experiments on the perception of sine-wave speech are described here:

Remez, R.E., Rubin, P.E., Pisoni, D.B., Carrell, T.D. (1981) Speech perception without traditional speech cues. Science, 212, 947-9.

In this work, Remez and colleagues demonstrated a dramatic change in the way sine-wave speech sentences are perceived, depending on the listener's specific prior knowledge. For instance, listen to this sound:

Most naive listeners hear this as a set of simultaneous whistles, or science fiction sounds. However, for listeners who have previously heard this sound:

Listening to the sine-wave speech sound again produces a very different percept of a fully intelligible spoken sentence. This dramatic change in perception is an example of "perceptual insight" or pop-out. We have argued that this form of pop-out is an example of a top-down perceptual process produced by higher-level knowledge and expectations concerning sounds that can potentially be heard as speech:

Davis, M.H., Johnsrude, I.S. (2007) "Hearing speech sounds: Top-down influences on the interface between audition and speech perception." Hearing Research, 229(1-2), 132-147.

There are four more example pairs of sine-wave and clear speech in the table below:

[Table of four audio example pairs: Sine-Wave Speech | Clear Speech]

As you listen to these four examples, you may find that you get better at understanding the sine-wave speech first time around. This is an example of perceptual learning. Having heard several examples of sine-wave speech, your perceptual system has tuned into this form of distortion, so as to be able to perceive new sine-wave speech sentences more clearly.

To my knowledge, no one has done controlled experiments to demonstrate that pop-out helps with learning sine-wave speech. However, for another form of distortion (noise-vocoded speech), we have shown that pop-out enhances perceptual learning, so that people more rapidly learn to understand new distorted sentences. These experiments with vocoded speech suggest that perceptual learning is also a top-down process:

Davis, M.H., Johnsrude, I.S., Hervais-Adelman, A., Taylor, K. & McGettigan, C.M. (2005) Lexical information drives perceptual learning of distorted speech: Evidence from the comprehension of noise-vocoded sentences. Journal of Experimental Psychology: General, 134(2), 222-241.

2) Generating Sine-Wave Speech:

Sine-wave speech is generated by using a formant tracker to detect the formant frequencies found in an utterance, and then synthesising sine waves that track the centre of these formants. This is illustrated in the figures below:

[Figures illustrating formant tracking and sine-wave synthesis]

A number of pieces of software exist for generating sine-wave versions of utterances. The sentences above were generated using Praat software and a script written by Chris Darwin. There's also Matlab code to generate sine-wave speech written by Dan Ellis.
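The synthesis half of the procedure described in this section can be sketched in a few lines of Python. This is only an illustrative sketch, not the Praat script or the Matlab code mentioned above: it assumes a formant tracker has already produced one frequency and one amplitude value per analysis frame for each formant, and the function names, the frame rate of 100 frames per second, the sample rate and the toy formant tracks are all hypothetical choices made for the example.

```python
import math

def interpolate_track(track, frame_rate, sample_rate):
    """Linearly interpolate one value per frame to one value per audio sample.

    Assumes the track holds at least two frames.
    """
    n_samples = int(round((len(track) - 1) * sample_rate / frame_rate)) + 1
    out = []
    for n in range(n_samples):
        pos = n * frame_rate / sample_rate        # position measured in frames
        i = min(int(pos), len(track) - 2)         # index of the frame to the left
        frac = pos - i
        out.append((1.0 - frac) * track[i] + frac * track[i + 1])
    return out

def synthesise_sine_wave_speech(freq_tracks, amp_tracks,
                                frame_rate=100, sample_rate=16000):
    """Sum one time-varying sinusoid per formant track (hypothetical helper).

    freq_tracks, amp_tracks: one list per formant, each with one value per frame.
    """
    signal = None
    for freqs, amps in zip(freq_tracks, amp_tracks):
        f = interpolate_track(freqs, frame_rate, sample_rate)
        a = interpolate_track(amps, frame_rate, sample_rate)
        if signal is None:
            signal = [0.0] * len(f)
        phase = 0.0
        for n in range(len(f)):
            signal[n] += a[n] * math.sin(phase)
            # Accumulating phase keeps the sinusoid continuous as frequency changes.
            phase += 2.0 * math.pi * f[n] / sample_rate
    peak = max(abs(s) for s in signal)
    return [s / peak for s in signal] if peak > 0 else signal

# Toy formant tracks (made-up values, 50 frames = roughly half a second).
n_frames = 50
f1 = [300 + 400 * i / (n_frames - 1) for i in range(n_frames)]
f2 = [2200 - 1000 * i / (n_frames - 1) for i in range(n_frames)]
f3 = [2500.0] * n_frames
amps = [[1.0] * n_frames, [0.5] * n_frames, [0.25] * n_frames]
audio = synthesise_sine_wave_speech([f1, f2, f3], amps)
```

The phase is accumulated sample by sample rather than computed as sin(2πft), because the frequency changes over time; computing the phase directly from the instantaneous frequency would introduce discontinuities in the waveform.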

3) Other forms of perceptual insight:

A number of examples of perceptual insight have been documented in the visual domain. For instance, turning grey-scale images into high-contrast, black-and-white images can produce a similar phenomenon to sine-wave speech. This manipulation was originally described by Craig Mooney (1957).
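The two-tone manipulation described above can be illustrated with a minimal Python sketch. It shows only a plain thresholding step, assuming the image is given as rows of grey levels between 0 and 255 and using the median grey level as the default threshold; the function name and the threshold choice are assumptions for illustration, and published two-tone images may involve further processing that is not described here.

```python
def two_tone(image, threshold=None):
    """Convert rows of grey levels (0-255) into pure black (0) and white (255).

    If no threshold is given, the median grey level is used (an assumption
    made for this sketch, not a documented part of any published procedure).
    """
    if threshold is None:
        pixels = sorted(p for row in image for p in row)
        threshold = pixels[len(pixels) // 2]
    return [[255 if p >= threshold else 0 for p in row] for row in image]

# A tiny made-up 3x3 grey-scale "image".
grey = [
    [10, 60, 120],
    [200, 90, 30],
    [250, 140, 70],
]
mooney = two_tone(grey)
# mooney == [[0, 0, 255], [255, 255, 0], [255, 255, 0]]
```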

[Image: high-contrast, black-and-white example image]

Click on the image to receive a visual hint about the content of this image. This form of visual perceptual insight is discussed in greater detail by Nava Rubin in this paper:

Rubin, N., Nakayama, K. and Shapley, R. (2002), The role of insight in perceptual learning: evidence from illusory contour perception. In: Perceptual Learning, Fahle, M. and Poggio, T. (Eds.), MIT Press.

The example image above comes from this paper. I'd be keen to hear of forms of perceptual insight in other sensory modalities.

4) Media reports of this work:

New Scientist, Mind Hacks, Boing Boing

This page was last updated on 24th November 2007. Comments and suggestions to matt.davis @ mrc-cbu.cam.ac.uk.

