Abstract
A key component in the processing of speech is the division of longer input sounds into a number of smaller sections. For speech interpretation it is generally easier to classify single sections. Similarly, when processing speech for other purposes (e.g. speech filtering), it can be easier and more relevant to process individual phonemes. Here, we propose a biologically inspired speech segmentation technique that filters the speech into multiple bandpassed channels using a Gammatone filterbank, and then uses an essentially energy-based spike coding technique in order to find the onsets and offsets present in an audio signal. These onsets and offsets are then processed using leaky integrate-and-fire neurons, and the spikes from these used to determine the speech segmentation. We evaluate this new system using a quantitative evaluation metric, and the promising results of segmentation of both clean speech and speech in noise demonstrate the effectiveness of this technique.
| Original language | English |
|---|---|
| Title of host publication | 2015 International Joint Conference on Neural Networks (IJCNN) |
| Place of Publication | piscataway, NJ |
| Publisher | IEEE |
| Number of pages | 8 |
| ISBN (Electronic) | 9781479919604 |
| DOIs | |
| Publication status | Published - 1 Oct 2015 |
| Event | International Joint Conference on Neural Networks, IJCNN 2015 - Killarney, Ireland Duration: 12 Jul 2015 → 17 Jul 2015 |
Publication series
| Name | Proceedings of the International Joint Conference on Neural Networks |
|---|---|
| ISSN (Print) | 2161-4393 |
| ISSN (Electronic) | 2161-4407 |
Conference
| Conference | International Joint Conference on Neural Networks, IJCNN 2015 |
|---|---|
| Country/Territory | Ireland |
| City | Killarney |
| Period | 12/07/15 → 17/07/15 |
Funding
The authors would like to thank the UK EPSRC (award EP/G06260911) for funding this work.
Keywords
- biology
- signal resolution
- speech
Fingerprint
Dive into the research topics of 'A biologically inspired onset and offset speech segmentation approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver