On a Voice Conversion by using Prosodic Control
Jongkuk Kim, Min Cheol Hong, Hernsoo Hahn
Available Online August 2013.
- https://doi.org/10.2991/icacsei.2013.117How to use a DOI?
- PSOLA, Voice conversion, Prosodic, DTW, Mapping, Pitch, Modification
- Voice conversion is a method that aims to transform the input speech signal such that the output signal will be perceived as produced by another speaker .Speech synthesizers using voice conversion technologies allow developers to create more voices from a single database and users to personalize the synthesizer to speak with any desired voice after a training period. In this paper, we present the method that converts time and pitch scaling using spectral mapping and PSOLA technique with OLA. This new synthesis scheme allows very flexible modifications of the pitch-scale, the time-scale and the spectral envelope characteristics while producing high-quality speech output. This synthesis scheme is thus well suited to voice conversion. Further work will be conducted on a matching method to correspond well with each phonetic information, and larger corpora to assess the robustness of the method.
- Open Access
- This is an open access article distributed under the CC BY-NC license.
Cite this article
TY - CONF AU - Jongkuk Kim AU - Min Cheol Hong AU - Hernsoo Hahn PY - 2013/08 DA - 2013/08 TI - On a Voice Conversion by using Prosodic Control BT - 2013 International Conference on Advanced Computer Science and Electronics Information (ICACSEI 2013) PB - Atlantis Press SP - 477 EP - 481 SN - 1951-6851 UR - https://doi.org/10.2991/icacsei.2013.117 DO - https://doi.org/10.2991/icacsei.2013.117 ID - Kim2013/08 ER -