
Full metadata record

DC Field | Value | Language
dc.contributor.advisor | 김태욱 | -
dc.contributor.author | 박민수 | -
dc.date.accessioned | 2023-05-11T11:47:46Z | -
dc.date.available | 2023-05-11T11:47:46Z | -
dc.date.issued | 2023-02 | -
dc.identifier.uri | http://hanyang.dcollection.net/common/orgView/200000651793 | en_US
dc.identifier.uri | https://repository.hanyang.ac.kr/handle/20.500.11754/179599 | -
dc.description.abstract | Automatic speech recognition (ASR) systems perform well on native English (L1) but poorly on non-native English (L2), because recent state-of-the-art ASR systems are trained primarily on native English. Reducing the performance gap between L1 and L2 English requires training data from non-native speakers, yet both labeled and unlabeled L2 data are hard to find in publicly available datasets. Speech synthesis (text-to-speech, TTS) can be used to build ASR training datasets and alleviate this low-resource problem, even though traditional speech synthesis systems focus on generating native speech. In this paper, we present a novel way to synthesize non-native speech by combining transliteration with a native TTS system, and we investigate the influence of synthetic L2 English and synthetic L1 English data on L2 English ASR performance (a minimal sketch of this pipeline follows the record below). Our best model, trained on synthetic L2 and authentic L2 data, achieves a ~53.34% relative word error rate (WER) reduction compared to the baseline ASR system. In a few-shot setting, the model trained with additional synthetic L2 English shows a ~31.45% relative WER reduction compared to a model trained on only 10 minutes of authentic L2 English. | -
dc.publisher | 한양대학교 (Hanyang University) | -
dc.title | TTS-driven Data Augmentation with transliteration for non-native English ASR | -
dc.type | Theses | -
dc.contributor.googleauthor | 박민수 | -
dc.sector.campus | S | -
dc.sector.daehak | 인공지능융합대학원 (Graduate School of Artificial Intelligence Convergence) | -
dc.sector.department | 인공지능시스템학과 (Department of Artificial Intelligence Systems) | -
dc.description.degree | Master | -
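
The abstract's core idea is that an unmodified native-language TTS voice can be made to produce foreign-accented (L2) English simply by feeding it English text respelled in the native script. The Python below is a minimal sketch of that pipeline, not the thesis code: the native language (Korean) is inferred from the record's metadata rather than stated in the abstract, and the word-level Hangul table, the names `transliterate`, `synthesize_l2`, and `relative_wer_reduction`, and all numbers in the demo are hypothetical, chosen only to show the pipeline shape and the relative-WER metric the abstract reports.

```python
"""Minimal sketch of the transliteration-driven L2 TTS idea from the abstract.

Everything here is illustrative: the thesis does not publish this code, the
word table is a toy stand-in for a real transliteration model, and the WER
figures in the demo are made up to show the metric's arithmetic.
"""
from typing import Callable

# Toy word-level English -> Hangul transliteration table. A real system would
# use a grapheme- or phoneme-level transliteration model instead of a lookup.
HANGUL = {
    "hello": "헬로",
    "world": "월드",
    "speech": "스피치",
}


def transliterate(english_text: str) -> str:
    """Respell English words in Hangul so a Korean TTS will read them aloud."""
    return " ".join(HANGUL.get(word.lower(), word) for word in english_text.split())


def synthesize_l2(english_text: str, korean_tts: Callable[[str], bytes]) -> bytes:
    """Produce Korean-accented (L2) English speech from an unmodified native
    Korean TTS by handing it the transliterated prompt."""
    return korean_tts(transliterate(english_text))


def relative_wer_reduction(baseline_wer: float, new_wer: float) -> float:
    """Relative WER reduction in percent: 100 * (baseline - new) / baseline."""
    return 100.0 * (baseline_wer - new_wer) / baseline_wer


if __name__ == "__main__":
    print(transliterate("hello speech world"))  # -> 헬로 스피치 월드
    # Hypothetical WERs chosen only to demonstrate the metric: dropping from
    # 30.0 to 14.0 WER is a ~53.3% relative reduction, the same kind of
    # figure (~53.34%) the abstract reports for its best model.
    print(f"{relative_wer_reduction(30.0, 14.0):.1f}% relative WER reduction")
```

The design point this makes concrete is that no new TTS model needs to be trained: the accent comes entirely from forcing the native voice to pronounce transliterated text, so any off-the-shelf native TTS can be reused to mass-produce L2 training audio for the ASR system.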

