Emotional speech from machine
Authors
Date
2020Copyright
This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.
Emotional speech is the expressiveness in speech that is transmitted through changes in pitch, loudness, timbre, speech rate and pauses that convey emotion. Although the current TTS technology is capable of converting a given text into speech, they sound monotonous and lack emotion and naturalness. In order to improve artificial voices, application of emotion is highly evaluated. In this thesis, we will be creating a system that makes use of speech mark-up language to produce emotion in speech by analysing the tone of given text. For this purpose, we combine IBM tone analyser with TTS that accepts the speech mark-up language. In this research, we perform empirical study on two experimental implementation using two TTS and two speech mark-up language. The first combination involves IBM TTS and SSML and the second combination includes MARY TTS and EmotionML. The mark-ups are predefined in EmotionML for four major emotions namely anger, fear, joy and sadness and for SSML prosody value from previous study is used. Therefore, this study describes the two implementations and evaluate their output emotional speech synthesis which is then compares with human voice to define its perfection.
...


Keywords
Metadata
Show full item recordCollections
- Pro gradu -tutkielmat [24931]
Related items
Showing items with similar title or keywords.
-
Politicisation as a Speech Act : A Repertoire for Analysing Politicisation in Parliamentary Plenary Debates
Palonen, Kari (Palgrave Macmillan, 2022)This chapter analyses the actual speech acts of politicisation among parliamentarians, which also makes it possible to set the current debates among European Union (EU) scholars in a wider context. My aim is to sketch a ... -
Part-of-speech tagging in written slang
Korolainen, Valtteri (2014)Erilaiset kieliteknologiasovellukset ovat olleet jo vuosikymmeniä arkipäiväises-sä käytössä. Esimerkiksi ennustava tekstinsyöttö ja automaattinen korjaus ovat olleet käytössä jo vuosikymmeniä. Puheen tunnistus ja kielen ... -
Spoken foreign language anxiety during English lessons among junior high school students
Härmälä, Jesse (2022)Vieraan kielen puhuminen voi aiheuttaa ihmisissä jännityksen tunteita. Vieraan kielen puhumisesta aiheutuvaa jännitystä ja kieliahdistusta on tutkittu jo 1980-luvun loppupuolelta lähtien. Kieliahdistukseen voi liittyä ... -
Top-Down Predictions of Familiarity and Congruency in Audio-Visual Speech Perception at Neural Level
Kolozsvári, Orsolya B.; Xu, Weiyong; Leppänen, Paavo H. T.; Hämäläinen, Jarmo A. (Frontiers Media, 2019)During speech perception, listeners rely on multimodal input and make use of both auditory and visual information. When presented with speech, for example syllables, the differences in brain responses to distinct stimuli ... -
Associations Between Sympathetic Nervous System Synchrony, Movement Synchrony, and Speech in Couple Therapy
Tourunen, Anu; Nyman-Salonen, Petra; Muotka, Joona; Penttonen, Markku; Seikkula, Jaakko; Kykyri, Virpi-Liisa (Frontiers Media SA, 2022)Background: Research on interpersonal synchrony has mostly focused on a single modality, and hence little is known about the connections between different types of social attunement. In this study, the relationship between ...