A biologically inspired onset and offset speech segmentation approach

Andrew K. Abel, Dean Hunter, Leslie S. Smith

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

3 Citations (Scopus)

Abstract

A key component in the processing of speech is the division of longer input sounds into a number of smaller sections. For speech interpretation it is generally easier to classify single sections. Similarly, when processing speech for other purposes (e.g. speech filtering), it can be easier and more relevant to process individual phonemes. Here, we propose a biologically inspired speech segmentation technique that filters the speech into multiple bandpassed channels using a Gammatone filterbank, and then uses an essentially energy-based spike coding technique in order to find the onsets and offsets present in an audio signal. These onsets and offsets are then processed using leaky integrate-and-fire neurons, and the spikes from these used to determine the speech segmentation. We evaluate this new system using a quantitative evaluation metric, and the promising results of segmentation of both clean speech and speech in noise demonstrate the effectiveness of this technique.

Original languageEnglish
Title of host publication2015 International Joint Conference on Neural Networks (IJCNN)
Place of Publicationpiscataway, NJ
PublisherIEEE
Number of pages8
ISBN (Electronic)9781479919604
DOIs
Publication statusPublished - 1 Oct 2015
EventInternational Joint Conference on Neural Networks, IJCNN 2015 - Killarney, Ireland
Duration: 12 Jul 201517 Jul 2015

Publication series

NameProceedings of the International Joint Conference on Neural Networks
ISSN (Print)2161-4393
ISSN (Electronic)2161-4407

Conference

ConferenceInternational Joint Conference on Neural Networks, IJCNN 2015
Country/TerritoryIreland
CityKillarney
Period12/07/1517/07/15

Keywords

  • biology
  • signal resolution
  • speech

Fingerprint

Dive into the research topics of 'A biologically inspired onset and offset speech segmentation approach'. Together they form a unique fingerprint.

Cite this