
Cognitively Inspired Audiovisual Speech Filtering: Towards an Intelligent, Fuzzy Based, Multimodal, Two-Stage Speech Enhancement System

Andrew Abel*, Amir Hussain

*Corresponding author for this work

Research output: Book/Report › Book

Abstract

This book summarises the cognitively inspired basis of multimodal speech enhancement, covering the relationship between the audio and visual modalities in speech and recent research into audiovisual speech correlation. It also discusses a number of audiovisual speech filtering approaches that make use of this relationship. A novel multimodal speech enhancement system that uses both visual and audio information to filter speech is presented, and the book explores extending this system with fuzzy logic to demonstrate an initial implementation of an autonomous, adaptive, and context-aware multimodal system. The work also discusses the challenges of testing such a system, the limitations of many current audiovisual speech corpora, and a suitable approach to developing a corpus designed to test this novel, cognitively inspired speech filtering system.

Original language: English
Publisher: Springer International Publishing AG
Number of pages: 121
Volume: 5
ISBN (Electronic): 9783319135090
ISBN (Print): 9783319135083
Publication status: Published - 7 Aug 2015

Keywords

  • audiovisual speech processing
  • speech enhancement
  • fuzzy logic
  • hearing and listening devices
