On a Voice Conversion by using Prosodic Control

Jongkuk Kim; Min Cheol Hong; Hernsoo Hahn

doi:10.2991/icacsei.2013.117

<Previous Article In Volume

Next Article In Volume>

On a Voice Conversion by using Prosodic Control

Authors

Jongkuk Kim, Min Cheol Hong, Hernsoo Hahn

Corresponding Author

Jongkuk Kim

Available Online August 2013.

DOI: 10.2991/icacsei.2013.117 How to use a DOI?
Keywords: PSOLA, Voice conversion, Prosodic, DTW, Mapping, Pitch, Modification
Abstract: Voice conversion is a method that aims to transform the input speech signal such that the output signal will be perceived as produced by another speaker .Speech synthesizers using voice conversion technologies allow developers to create more voices from a single database and users to personalize the synthesizer to speak with any desired voice after a training period. In this paper, we present the method that converts time and pitch scaling using spectral mapping and PSOLA technique with OLA. This new synthesis scheme allows very flexible modifications of the pitch-scale, the time-scale and the spectral envelope characteristics while producing high-quality speech output. This synthesis scheme is thus well suited to voice conversion. Further work will be conducted on a matching method to correspond well with each phonetic information, and larger corpora to assess the robustness of the method.
Copyright: © 2013, the Authors. Published by Atlantis Press.
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the 2013 International Conference on Advanced Computer Science and Electronics Information (ICACSEI 2013)
Series: Advances in Intelligent Systems Research
Publication Date: August 2013
ISBN: 978-90-78677-74-1
ISSN: 1951-6851
DOI: 10.2991/icacsei.2013.117 How to use a DOI?
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

ris enw bib

TY  - CONF
AU  - Jongkuk Kim
AU  - Min Cheol Hong
AU  - Hernsoo Hahn
PY  - 2013/08
DA  - 2013/08
TI  - On a Voice Conversion by using Prosodic Control
BT  - Proceedings of the 2013 International Conference on Advanced Computer Science and Electronics Information (ICACSEI 2013)
PB  - Atlantis Press
SP  - 477
EP  - 481
SN  - 1951-6851
UR  - https://doi.org/10.2991/icacsei.2013.117
DO  - 10.2991/icacsei.2013.117
ID  - Kim2013/08
ER  -

download .riscopy to clipboard