Repository at Hanyang University: On using acoustic environment classification for statistical model-based speech enhancement

Browse

My Repository

Repository at Hanyang UniversityCOLLEGE OF ENGINEERING[S](공과대학)ELECTRONIC ENGINEERING(융합전자공학부)Articles

229 0

Full metadata record

DC Field	Value	Language
dc.contributor.author	장준혁	-
dc.date.accessioned	2018-04-16T04:16:36Z	-
dc.date.available	2018-04-16T04:16:36Z	-
dc.date.issued	2012-03	-
dc.identifier.citation	Speech Communication, Vol.54, No.3 [2012], p477-490	en_US
dc.identifier.issn	0167-6393	-
dc.identifier.uri	http://www.sciencedirect.com/science/article/pii/S0167639311001579?via%3Dihub	-
dc.identifier.uri	http://hdl.handle.net/20.500.11754/67806	-
dc.description.abstract	In this paper, we present a statistical model-based speech enhancement technique using acoustic environment classification supported by a Gaussian mixture model (GMM). In the data training stage, the principal parameters of the statistical model-based speech enhancement algorithm such as the weighting parameter in the decision-directed (DD) method, the long-term smoothing parameter of the noise estimation, and the control parameter of the minimum gain value are uniquely set as optimal operating points according to the given noise information to ensure the best performance for each noise. These optimal operating points, which are specific to the different background noises, are estimated based on the composite measures, which are the objective quality measures representing the highest correlation with the actual speech quality processed by noise suppression algorithms. In the on-line environment-aware speech enhancement step, the noise classification is performed on a frame-by-frame basis using the maximum likelihood (ML)-based Gaussian mixture model (GMM). The speech absence probability (SAP) is used to detect the speech absence periods and to update the likelihood of the GMM. According to the classified noise information for each frame, we assign the optimal values to the aforementioned three parameters for speech enhancement. We evaluated the performances of the proposed methods using objective speech quality measures and subjective listening tests under various noise environments. Our experimental results showed that the proposed method yields better performances than does a conventional algorithm with fixed parameters.	en_US
dc.description.sponsorship	This work was supported by the IT R&D program of MKE/KEIT [2009-S-036-01, Development of New Virtual Machine Specification and Technology]. And, this work was supported by National Research Foundation of Korea (NRF) grant funded by the Korean Government (MEST) (NRF-2011-0009182). This work was supported by the research fund of Hanyang University (HY-2011-201100000000210)	en_US
dc.language.iso	en	en_US
dc.publisher	Elsevier Science B.V., Amsterdam.	en_US
dc.subject	Speech enhancement	en_US
dc.subject	Noise classification	en_US
dc.subject	Gaussian mixture model	en_US
dc.subject	DFT	en_US
dc.title	On using acoustic environment classification for statistical model-based speech enhancement	en_US
dc.type	Article	en_US
dc.relation.no	3	-
dc.relation.volume	54	-
dc.identifier.doi	10.1016/j.specom.2011.10.009	-
dc.relation.page	477-490	-
dc.relation.journal	SPEECH COMMUNICATION	-
dc.contributor.googleauthor	Choi, J. H.	-
dc.contributor.googleauthor	Chang, J. H.	-
dc.relation.code	2012208861	-
dc.sector.campus	S	-
dc.sector.daehak	COLLEGE OF ENGINEERING[S]	-
dc.sector.department	DEPARTMENT OF ELECTRONIC ENGINEERING	-
dc.identifier.pid	jchang	-
dc.identifier.researcherID	34969012900	-

Appears in Collections:: COLLEGE OF ENGINEERING[S](공과대학) > ELECTRONIC ENGINEERING(융합전자공학부) > Articles

Files in This Item:

Export: RIS (EndNote); XLS (Excel); XML

Show simple item record

한양대학교 리포지터리는 국립중앙도서관 OAK 보급사업으로 구축되었습니다. Feedback 개인정보처리방침

Hanyang University repository

Browse

My Repository

BROWSE