Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 장준혁 | - |
dc.date.accessioned | 2017-11-01T05:34:08Z | - |
dc.date.available | 2017-11-01T05:34:08Z | - |
dc.date.issued | 2016-01 | - |
dc.identifier.citation | International Conference on Electronics, Information, and Communications, ICEIC 2016, Article number 7562963, Page. 1-4 | en_US |
dc.identifier.issn | 978-1-4673-8016-4 | - |
dc.identifier.uri | http://ieeexplore.ieee.org/document/7562963/ | - |
dc.identifier.uri | http://hdl.handle.net/20.500.11754/30389 | - |
dc.description.abstract | Recently, the convolutional neural network (CNN) with multiple microphones was proposed to use the delay-sum (DS) beamformer for distant speech recognition (DSR) and compared to the direct use of multiple acoustic channels as a parallel input to the CNN [1]. We explore the parameterized multi-channel non-causal Wiener filter (PMWF) as the front-end to train the CNN, which is applied to acoustic modeling for DSR. For this, we first present a concise description of the basic PMWF as well as its advantages and then explain how to organize the PMWF into the CNN with a novel architecture. Experimental results on the TIMIT dataset show that the proposed PMWF-based CNN approach outperforms the cross-channel CNN and the DS beamformer when evaluating the word error rate (WER) in various DSR environments. © 2016 IEEE. | en_US |
dc.description.sponsorship | This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. 2014R1A2A1A10049735). This work was also supported by the ICT R&D program of MISP/IITP. [R0126-15-1119. Development of a solution for situation-awareness based on the analysis of speech and environmental sounds] | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE, IEIE | en_US |
dc.subject | beamforming | en_US |
dc.subject | Convolutional neural networks | en_US |
dc.subject | distant speech recognition | en_US |
dc.subject | PMWF | en_US |
dc.title | On using Parameterized Multi-channel Non-causal Wiener Filter-Adapted Convolutional Neural Networks for Distant Speech Recognition | en_US |
dc.type | Article | en_US |
dc.identifier.doi | 10.1109/ELINFOCOM.2016.7562963 | - |
dc.relation.page | 1-4 | - |
dc.contributor.googleauthor | Lee, Jeehye | - |
dc.contributor.googleauthor | Chang, Joon-Hyuk | - |
dc.contributor.googleauthor | Sohn, Jinho | - |
dc.sector.campus | S | - |
dc.sector.daehak | COLLEGE OF ENGINEERING[S] | - |
dc.sector.department | DEPARTMENT OF ELECTRONIC ENGINEERING | - |
dc.identifier.pid | jchang | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.