In this paper, we compared the differences of acoustic characteristics between synthetic speech and emotional speech under the same text from the perspective of the lack of emotional expression of synthetic speech by using Praat software for a single phoneme /ei/. Analyzing the results, it was concluded that the differences of the emotional information were mainly in the small dispersion of synthetic speech fundamental frequency, the dispersion of synthetic speech intensity was much smaller than that of real speech with large emotional fluctuations, the harmonic waves in narrowband spectrograms were nearly straight without bending and jittering, the formant center frequencies interlacing degree is small. The common differences between synthetic speech and neutral and emotional speech were shown by the absence of harmonic waves at frequencies above 3000 Hz and the obvious difference in the direction of the tail end of the second formant.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.