Gumelar, Agustinus Bimo and Sugiarto, Indar and Yuniarno, Eko Mulyanto and Mahindara, Vincentius Raki and Anggraeni, Wiwik and Purnomo, Mauridhi Hery (2020) Enhancing Detection of Pathological Voice Disorder Based on Deep VGG-16 CNN. In: The 3rd International Conference on Biomedical Engineering (iBioMed 2020), 08-10-2020 - 08-10-2020, Yogyakarta - Indonesia.
![]() | PDF Download (407Kb) |
![]() | PDF Download (2063Kb) |
Abstract
As a matter of fact, the system of human voice production is a sophisticated biological device that can modulate pitch and loudness. The essentials of internal and external factors often damage the vocal folds and change the vocal voice as a result. Thus, the consequences are well-portrayed in the function of the body and stand of emotion. Consequently, it is primary to identify voice changes at an early stage, deliver an opportunity to overcome any consequence, and enhance the patient’s quality of life. In this case, voice disorder can be detected automatically by using Machine Learning (ML) techniques, which is, indeed, serves as a critical role. In this experiment, we specifically employ the Convolutional Neural Network (CNN), and a robust CNN model: the VGG-16. In investigating the performance of CNN in detecting disordered speech, we used the particular Pathological Voice Disorder (PVD) dataset, named the Respiratory Sound Database, which comprises hundreds of sampled PVD sound files. The experiment showed the accuracy of voice pathology detection arouses to 92.03%.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | Pathological Voice Disorder; CNN, VGG-16, LSTM; VTLP Method |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Industrial Technology > Electrical Engineering Department |
Depositing User: | Admin |
Date Deposited: | 21 Oct 2020 15:17 |
Last Modified: | 26 Aug 2021 22:32 |
URI: | https://repository.petra.ac.id/id/eprint/21815 |
Actions (login required)
View Item |