Full metadata record

DC Field | Value | Language
dc.contributor.author | 장준혁 | -
dc.date.accessioned | 2020-01-20T06:07:41Z | -
dc.date.available | 2020-01-20T06:07:41Z | -
dc.date.issued | 2019-01 | -
dc.identifier.citation | ICEIC 2019 - International Conference on Electronics, Information, and Communication, 8706390 | en_US
dc.identifier.isbn | 978-899500444-9 | -
dc.identifier.uri | https://ieeexplore.ieee.org/document/8706390 | -
dc.identifier.uri | https://repository.hanyang.ac.kr/handle/20.500.11754/122095 | -
dc.description.abstract | In this paper, multi-speaker speech synthesis using speaker embedding is proposed. The proposed model is based on the Tacotron network, but its post-processing network is modified with dilated convolution layers, as used in the WaveNet architecture, to make it more adaptive to speech. The model can generate multiple speakers' voices with a single neural network by feeding auxiliary input data, a speaker embedding, to the network. The model successfully generates two speakers' voices without significant deterioration of speech quality. © 2019 Institute of Electronics and Information Engineers (IEIE). | en_US
dc.description.sponsorship | This work was supported by an Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No. 2017-0-00474, Intelligent Signal Processing for AI Speaker Voice Guardian). This research was supported by Projects for Research and Development of Police science and Technology under Center for Research and Development of Police science and Technology and Korean National Police Agency funded by the Ministry of Science and ICT (PA-J000001-2017-101). | en_US
dc.language.iso | en | en_US
dc.publisher | IEEE/ICEIC | en_US
dc.subject | Deep learning | en_US
dc.subject | Sequence to sequence | en_US
dc.subject | Speech synthesis | en_US
dc.subject | Multi speaker speech synthesis | en_US
dc.title | DNN based multi-speaker speech synthesis with temporal auxiliary speaker ID embedding | en_US
dc.type | Article | en_US
dc.identifier.doi | 10.23919/ELINFOCOM.2019.8706390 | -
dc.relation.page | 61-64 | -
dc.contributor.googleauthor | Lee, Junmo | -
dc.contributor.googleauthor | Song, Kwangsub | -
dc.contributor.googleauthor | Noh, Kyoungjin | -
dc.contributor.googleauthor | Park, Tae-Jun | -
dc.contributor.googleauthor | Chang, Joon-Hyuk | -
dc.sector.campus | S | -
dc.sector.daehak | COLLEGE OF ENGINEERING[S] | -
dc.sector.department | DEPARTMENT OF ELECTRONIC ENGINEERING | -
dc.identifier.pid | jchang | -
dc.identifier.orcid | https://orcid.org/0000-0003-2610-2323 | -
Appears in Collections:
COLLEGE OF ENGINEERING[S](공과대학) > ELECTRONIC ENGINEERING(융합전자공학부) > Articles
Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
