Download PDFOpen PDF in browser

sprocket: Open-Source Voice Conversion Software

EasyChair Preprint no. 64

8 pagesDate: April 15, 2018


Statistical voice conversion (VC) is a technique to convert specific non- or paralinguistic information while keeping linguistic information unchanged, and speaker conversion has been studied as a typical application of VC for a few decades.
To better understand various VC techniques using a freely available common dataset, the Voice Conversion Challenge (VCC) was launched in 2016 and the 2nd challenge was held in 2018.
As one of the baseline systems for VCC 2018, we developed open-source VC software called "sprocket", in which not only conventional techniques, such as a trajectory-based conversion method using a Gaussian mixture model (GMM) and a vocoder-based conversion framework but also recently developed techniques, such as a vocoder-free VC framework, have been implemented.
Using sprocket, it is possible to 1) easily reproduce converted voices using the VCC datasets and 2) develop VC systems using other parallel speech datasets with fundamental VC functions, such as acoustic feature extraction, time alignment between the source and target features, GMM training, feature conversion, and waveform generation.
In this paper, we describe 1) the technical details and use of sprocket, 2) the development of the baseline systems for the HUB and SPOKE tasks of VCC 2018 using sprocket, and 3) the performance of sprocket as a VC system by demonstrating results for our developed baseline systems in VCC 2018.

Keyphrases: Gaussian Mixture Model, Open Source Software, Sprocket, voice conversion, Voice Conversion Challenge

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
  author = {Kazuhiro Kobayashi and Tomoki Toda},
  title = {sprocket: Open-Source Voice Conversion Software},
  howpublished = {EasyChair Preprint no. 64},
  doi = {10.29007/s4t1},
  year = {EasyChair, 2018}}
Download PDFOpen PDF in browser