A deep learning method for pathological voice detection using convolutional deep belief networks

Research output: Contribution to conference, Paper, peer-review


Abstract

Automatically detecting pathological voice disorders such as vocal cord paralysis or Reinke’s edema is a challenging and important medical classification problem. While deep learning techniques have achieved significant progress in the speech recognition field, there has been less research on the detection of pathological voice disorders. This work presents a novel system for pathological voice detection that uses a convolutional neural network (CNN) as its basic architecture. The system takes spectrograms of normal and pathological speech recordings as the input to the network. Initially, a convolutional deep belief network (CDBN) is used to pre-train the weights of the CNN. This acts as a generative model that explores the structure of the input data using statistical methods. The CNN is then trained with a supervised back-propagation learning algorithm to fine-tune the weights. It will be shown that a small amount of data can be used to achieve good classification results with this deep learning approach. A performance analysis of the novel method is provided using real data from the Saarbrücken Voice Database.
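
The abstract does not state how the spectrograms are computed, so the following is only a minimal sketch of the general idea of turning a recording into a time-frequency image that a CNN could consume. The function names, the frame length, the hop size, and the synthetic test tone are all assumptions made for illustration and are not taken from the paper.

```python
import cmath
import math


def hann(n):
    """Hann window of length n (symmetric form)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def magnitude_spectrogram(signal, frame_len=64, hop=32):
    """Return a list of frames, each holding frame_len // 2 + 1 magnitudes.

    frame_len and hop are illustrative values, not the paper's settings.
    """
    window = hann(frame_len)
    n_bins = frame_len // 2 + 1
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        chunk = [signal[start + i] * window[i] for i in range(frame_len)]
        # Naive DFT, O(frame_len^2): fine for a sketch, use an FFT in practice.
        spectrum = [
            abs(sum(chunk[t] * cmath.exp(-2j * math.pi * k * t / frame_len)
                    for t in range(frame_len)))
            for k in range(n_bins)
        ]
        frames.append(spectrum)
    return frames


# Synthetic stand-in for a voice recording: a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * n / sample_rate) for n in range(512)]

spec = magnitude_spectrogram(tone)
# Index of the strongest frequency bin in the first frame.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
```

Each row of `spec` is one time frame and each column one frequency bin, so the result can be treated as a single-channel image. With these assumed settings the bin spacing is 8000 / 64 = 125 Hz, which places the 1 kHz test tone in bin 8.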
Original language: English
Number of pages: 5
Publication status: Published - 2 Sept 2018
Event: Interspeech 2018 - Hyderabad, India
Duration: 2 Sept 2018 to 6 Sept 2018
http://interspeech2018.org/

Conference

Conference: Interspeech 2018
Country/Territory: India
City: Hyderabad
Period: 2/09/18 to 6/09/18

Keywords

  • pathological voice detection
  • convolutional neural network (CNN)
  • convolutional deep belief network (CDBN)
  • deep learning
