INTERSPEECH 2020, v. 2020-Oct, Page. 1013.0-1014.0
Abstract
We introduce an open-source Python library, VCTUBE, which can automatically generate ˂audio, text˃ pair of speech data from a given Youtube URL. We believe VCTUBE is useful for collecting, processing, and annotating speech data easily toward developing speech synthesis systems.