Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 장준혁 | - |
dc.date.accessioned | 2021-05-13T02:09:53Z | - |
dc.date.available | 2021-05-13T02:09:53Z | - |
dc.date.issued | 2020-03 | - |
dc.identifier.citation | SENSORS, v. 20, no. 7, article no. 1883 | en_US |
dc.identifier.issn | 1424-8220 | - |
dc.identifier.uri | https://www.mdpi.com/1424-8220/20/7/1883 | - |
dc.identifier.uri | https://repository.hanyang.ac.kr/handle/20.500.11754/162020 | - |
dc.description.abstract | In this paper, we propose joint optimization of deep neural network (DNN)-supported dereverberation and beamforming for convolutional recurrent neural network (CRNN)-based sound event detection (SED) in multi-channel environments. First, short-time Fourier transform (STFT) coefficients are computed from multi-channel audio signals captured in noisy and reverberant environments and are then enhanced by DNN-supported weighted prediction error (WPE) dereverberation using estimated masks. Next, the STFT coefficients of the dereverberated multi-channel signals are passed to a DNN-supported minimum variance distortionless response (MVDR) beamformer, in which beamforming is carried out with source and noise masks estimated by the DNN. The resulting single-channel enhanced STFT coefficients are then fed to the CRNN-based SED system, and the three modules are jointly trained with a single loss function designed for SED. Furthermore, to mitigate the difficulty of training a deep learning model for SED caused by class imbalance in the training data, the focal loss is used as the loss function. Experimental results show that joint training of DNN-supported dereverberation and beamforming with the SED model under the supervision of the focal loss significantly improves performance in noisy and reverberant environments. | en_US |
dc.description.sponsorship | This research was supported by Projects for Research and Development of Police science and Technology under Center for Research and Development of Police science and Technology and Korean National Police Agency funded by the Ministry of Science and ICT (PA-J000001-2017-101). | en_US |
dc.language.iso | en | en_US |
dc.publisher | MDPI | en_US |
dc.subject | sound event detection | en_US |
dc.subject | dereverberation | en_US |
dc.subject | acoustic beamforming | en_US |
dc.subject | convolutional recurrent neural network | en_US |
dc.subject | joint optimization | en_US |
dc.title | Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments | en_US |
dc.type | Article | en_US |
dc.relation.no | 7 | - |
dc.relation.volume | 20 | - |
dc.identifier.doi | 10.3390/s20071883 | - |
dc.relation.page | 1-13 | - |
dc.relation.journal | SENSORS | - |
dc.contributor.googleauthor | Noh, Kyoungjin | - |
dc.contributor.googleauthor | Chang, Joon-Hyuk | - |
dc.relation.code | 2020053568 | - |
dc.sector.campus | S | - |
dc.sector.daehak | COLLEGE OF ENGINEERING[S] | - |
dc.sector.department | DEPARTMENT OF ELECTRONIC ENGINEERING | - |
dc.identifier.pid | jchang | - |
dc.identifier.orcid | https://orcid.org/0000-0003-2610-2323 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
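The DNN-supported MVDR step described in the abstract follows the general mask-based beamforming recipe, which can be summarized as follows. This is a sketch of the standard formulation (spatial covariances estimated from DNN masks, reference-microphone steering), not necessarily the paper's exact variant:

```latex
% Mask-weighted spatial covariance of the source, from the multi-channel
% STFT vector y(t,f) and the DNN-estimated source mask m_s(t,f):
\hat{\mathbf{\Phi}}_{s}(f) =
  \frac{\sum_{t} m_{s}(t,f)\,\mathbf{y}(t,f)\,\mathbf{y}^{\mathsf{H}}(t,f)}
       {\sum_{t} m_{s}(t,f)},
\qquad
\hat{\mathbf{\Phi}}_{n}(f) =
  \frac{\sum_{t} m_{n}(t,f)\,\mathbf{y}(t,f)\,\mathbf{y}^{\mathsf{H}}(t,f)}
       {\sum_{t} m_{n}(t,f)}

% MVDR filter (trace-normalized form), with u a one-hot vector selecting
% the reference microphone:
\mathbf{w}(f) =
  \frac{\hat{\mathbf{\Phi}}_{n}^{-1}(f)\,\hat{\mathbf{\Phi}}_{s}(f)}
       {\operatorname{tr}\!\left\{\hat{\mathbf{\Phi}}_{n}^{-1}(f)\,
        \hat{\mathbf{\Phi}}_{s}(f)\right\}}\,\mathbf{u}

% Single-channel enhanced STFT coefficients passed on to the SED module:
\hat{X}(t,f) = \mathbf{w}^{\mathsf{H}}(f)\,\mathbf{y}(t,f)
```

Because every step here is differentiable in the mask estimates, the beamformer can sit between the WPE front end and the CRNN back end and be trained end-to-end under the single SED loss, as the abstract describes.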
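The focal loss mentioned in the abstract down-weights easy, well-classified examples so that training is not dominated by over-represented classes. A minimal per-sample sketch for the binary (per-class, per-frame) case is below; the `gamma` and `alpha` defaults are the commonly used values from the original focal loss paper, not values reported in this article:

```python
import math

def focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Binary focal loss for a single prediction.

    p: predicted probability of the positive class (0 < p < 1)
    y: ground-truth label, 0 or 1
    gamma: focusing parameter; larger values suppress easy examples more
    alpha: class-balancing weight for the positive class
    """
    # Probability assigned to the true class.
    p_t = p if y == 1 else 1.0 - p
    # Class-balancing weight for the true class.
    a_t = alpha if y == 1 else 1.0 - alpha
    # (1 - p_t)^gamma is the modulating factor: near 0 for confident,
    # correct predictions, so easy examples contribute little loss.
    return -a_t * (1.0 - p_t) ** gamma * math.log(p_t)
```

With `gamma = 0` and `alpha = 1` the expression reduces to plain cross-entropy, which makes the effect of the modulating factor easy to verify: a confidently correct prediction incurs far less focal loss than cross-entropy loss, while a hard misclassified example keeps most of its loss. In the multi-label SED setting of the abstract, this term would be summed over classes and time frames.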