Proceedings of the Positive Technology International Conference 2023 Positive Technology: Possible Synergies between Emerging Technologies and Positive Psychology (PT 2023)

Human Voice Analysis and Virtual Teacher for Speech Therapy

Authors
Man-Ching Yuen1, *, Chi-Wai Yung1, Linjing Zhang1, Jiaer Song1, Xingzi Li1, Yinlin Li1
1Department of Applied Data Science, Hong Kong Shue Yan University, Hong Kong, China
*Corresponding author. Email: mcyuen@hksyu.edu
Corresponding Author
Man-Ching Yuen
Available Online 19 February 2024.
DOI
10.2991/978-94-6463-378-8_7How to use a DOI?
Keywords
Speech therapy; Human voice analysis; Waveform analysis
Abstract

Based on the literature review, researchers reported that at most, 24.6% of young children in the world were estimated to have speech delay or speech sound disorder (SSD). Once children with SSD are identified, speech-language pathologists (SLPs) select initial therapy programs for children with regular review and adjustments on therapy. The success of therapy highly relies on the effectiveness of long-term home training. In this project, we carry out human voice analysis and design and implement a virtual teacher for home training in speech therapy. For the first part of this project, we conduct sound analysis research to see if children’s Cantonese pronunciation is correct. Once the children’s voices are captured, human voices can be automatically transferred for waveform analysis, allowing a large number of tasks to be completed quickly. The created waveform is compared to the standard waveform. If the majority of the waveform is inconsistent, it suggests that the pronunciation of children is not standard. As a result, it points out children’s pronunciation problems and generates feedback quickly. Through the waveform diagram, our system can accurately process and analyze the sound, as well as eliminate the inaccuracy caused by varied timbres of children, making the analysis more accurate and effective. For the second part of this project, we implement a virtual teacher by using Blender and Audio2Face technology. Blender technology is often useful in areas such as live streaming and business, but it also has great potential in education. Therefore, it provides a more convenient way to conduct speech imitation and language learning for implementation of a virtual teacher. It can achieve low-cost popularization, timely correction of children’s pronunciation problems.

Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the Positive Technology International Conference 2023 Positive Technology: Possible Synergies between Emerging Technologies and Positive Psychology (PT 2023)
Series
Atlantis Highlights in Social Sciences, Education and Humanities
Publication Date
19 February 2024
ISBN
10.2991/978-94-6463-378-8_7
ISSN
2667-128X
DOI
10.2991/978-94-6463-378-8_7How to use a DOI?
Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Man-Ching Yuen
AU  - Chi-Wai Yung
AU  - Linjing Zhang
AU  - Jiaer Song
AU  - Xingzi Li
AU  - Yinlin Li
PY  - 2024
DA  - 2024/02/19
TI  - Human Voice Analysis and Virtual Teacher for Speech Therapy
BT  - Proceedings of the Positive Technology International Conference 2023 Positive Technology: Possible Synergies between Emerging Technologies and Positive Psychology (PT 2023)
PB  - Atlantis Press
SP  - 98
EP  - 110
SN  - 2667-128X
UR  - https://doi.org/10.2991/978-94-6463-378-8_7
DO  - 10.2991/978-94-6463-378-8_7
ID  - Yuen2024
ER  -