Paper
8 December 2023 The prefabs lexicon of Lhasa Tibetan in continuous speech
Author Affiliations +
Proceedings Volume 12943, International Workshop on Signal Processing and Machine Learning (WSPML 2023); 1294306 (2023) https://doi.org/10.1117/12.3014046
Event: International Workshop on Signal Processing and Machine Learning (WSPML 2023), 2023, Hangzhou, ZJ, China
Abstract
In this study, we propose a method for constructing a Lhasa Tibetan prosodic lexicon based on a continuous speech database, which leads to significant improvements in speech synthesis performance for low-resource and complex languages. The experiment begins by utilizing a 3.95-hour speech database of a Lhasa Tibetan speaker, focusing on the prosodic feature of “tone sandhi” to investigate the phonological features and grammatical functions of Lhasa Tibetan. Drawing inspiration from the “Usage-Based Theory” in cognitive linguistics, we extract prefabs (prefabricated chunks) from 2,526 utterances. According to the prosodic features and grammatical structure of these prefabs, we construct a Prefabs Lexicon consisting of 175 thousand entries. In the comparative experiment, we employ a sequence-to-sequence speech synthesis approach and automatically segment the input sequence using both the Prefabs Lexicon and the conventional Tibetan lexicon. To evaluate the performance, a 56-minute dataset from another professional Lhasa broadcaster is used as a test set. Compared to the conventional Tibetan lexicon, the Prefabs Lexicon achieves an improved 𝐹1 − 𝑠𝑐𝑜𝑟𝑒 of 0.92. Additionally, in the synthesis experiment for the toneless Amdo Tibetan, the Mean Opinion Score (MOS) increases to 4.17, indicating the universal applicability of the Prefabs Lexicon across dialects.
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Chen Lu, Yiqing Zu, Ronghua Zhu, Chenning Liu, Pengfei Shao, Xiao Zhang, and 'Bum Thr Klu "The prefabs lexicon of Lhasa Tibetan in continuous speech", Proc. SPIE 12943, International Workshop on Signal Processing and Machine Learning (WSPML 2023), 1294306 (8 December 2023); https://doi.org/10.1117/12.3014046
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
Back to Top