Repository at Hanyang University: On using Parameterized Multi-channel Non-causal Wiener Filter-Adapted Convolutional Neural Networks for Distant Speech Recognition

Browse

My Repository

Repository at Hanyang UniversityCOLLEGE OF ENGINEERING[S](공과대학)ELECTRONIC ENGINEERING(융합전자공학부)Articles

394 0

Full metadata record

DC Field	Value	Language
dc.contributor.author	장준혁	-
dc.date.accessioned	2017-11-01T05:34:08Z	-
dc.date.available	2017-11-01T05:34:08Z	-
dc.date.issued	2016-01	-
dc.identifier.citation	International Conference on Electronics, Information, and Communications, ICEIC 2016, Article number 7562963, Page. 1-4	en_US
dc.identifier.issn	978-1-4673-8016-4	-
dc.identifier.uri	http://ieeexplore.ieee.org/document/7562963/	-
dc.identifier.uri	http://hdl.handle.net/20.500.11754/30389	-
dc.description.abstract	Recently, the convolutional neural network (CNN) with multiple microphones was proposed to use the delay-sum (DS) beamformer for distant speech recognition (DSR) and compared to the direct use of multiple acoustic channels as a parallel input to the CNN [1]. We explore the parameterized multi-channel non-causal Wiener filter (PMWF) as the front-end to train the CNN, which is applied to acoustic modeling for DSR. For this, we first present a concise description of the basic PMWF as well as its advantages and then explain how to organize the PMWF into the CNN with a novel architecture. Experimental results on the TIMIT dataset show that the proposed PMWF-based CNN approach outperforms the cross-channel CNN and the DS beamformer when evaluating the word error rate (WER) in various DSR environments. © 2016 IEEE.	en_US
dc.description.sponsorship	This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. 2014R1A2A1A10049735). This work was also supported by the ICT R&D program of MISP/IITP. [R0126-15-1119. Development of a solution for situation-awareness based on the analysis of speech and environmental sounds]	en_US
dc.language.iso	en	en_US
dc.publisher	IEEE, IEIE	en_US
dc.subject	beamforming	en_US
dc.subject	Convolutional neural networks	en_US
dc.subject	distant speech recognition	en_US
dc.subject	PMWF	en_US
dc.title	On using Parameterized Multi-channel Non-causal Wiener Filter-Adapted Convolutional Neural Networks for Distant Speech Recognition	en_US
dc.type	Article	en_US
dc.identifier.doi	10.1109/ELINFOCOM.2016.7562963	-
dc.relation.page	1-4	-
dc.contributor.googleauthor	Lee, Jeehye	-
dc.contributor.googleauthor	Chang, Joon-Hyuk	-
dc.contributor.googleauthor	Sohn, Jinho	-
dc.sector.campus	S	-
dc.sector.daehak	COLLEGE OF ENGINEERING[S]	-
dc.sector.department	DEPARTMENT OF ELECTRONIC ENGINEERING	-
dc.identifier.pid	jchang	-

Appears in Collections:: COLLEGE OF ENGINEERING[S](공과대학) > ELECTRONIC ENGINEERING(융합전자공학부) > Articles

Files in This Item:

Export: RIS (EndNote); XLS (Excel); XML

Show simple item record

한양대학교 리포지터리는 국립중앙도서관 OAK 보급사업으로 구축되었습니다. Feedback 개인정보처리방침

Hanyang University repository

Browse

My Repository

BROWSE