Emotional speech from machine
Tekijät
Päivämäärä
2020Tekijänoikeudet
Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.
Emotional speech is the expressiveness in speech that is transmitted through changes in pitch, loudness, timbre, speech rate and pauses that convey emotion. Although the current TTS technology is capable of converting a given text into speech, they sound monotonous and lack emotion and naturalness. In order to improve artificial voices, application of emotion is highly evaluated. In this thesis, we will be creating a system that makes use of speech mark-up language to produce emotion in speech by analysing the tone of given text. For this purpose, we combine IBM tone analyser with TTS that accepts the speech mark-up language. In this research, we perform empirical study on two experimental implementation using two TTS and two speech mark-up language. The first combination involves IBM TTS and SSML and the second combination includes MARY TTS and EmotionML. The mark-ups are predefined in EmotionML for four major emotions namely anger, fear, joy and sadness and for SSML prosody value from previous study is used. Therefore, this study describes the two implementations and evaluate their output emotional speech synthesis which is then compares with human voice to define its perfection.
...
Asiasanat
Metadata
Näytä kaikki kuvailutiedotKokoelmat
- Pro gradu -tutkielmat [28121]
Samankaltainen aineisto
Näytetään aineistoja, joilla on samankaltainen nimeke tai asiasanat.
-
Politicisation as a Speech Act : A Repertoire for Analysing Politicisation in Parliamentary Plenary Debates
Palonen, Kari (Palgrave Macmillan, 2022)This chapter analyses the actual speech acts of politicisation among parliamentarians, which also makes it possible to set the current debates among European Union (EU) scholars in a wider context. My aim is to sketch a ... -
Part-of-speech tagging in written slang
Korolainen, Valtteri (2014)Erilaiset kieliteknologiasovellukset ovat olleet jo vuosikymmeniä arkipäiväises-sä käytössä. Esimerkiksi ennustava tekstinsyöttö ja automaattinen korjaus ovat olleet käytössä jo vuosikymmeniä. Puheen tunnistus ja kielen ... -
Smart Educational Process Based on Personal Learning Capabilities
Gavriushenko, Mariia; Lindberg, Renny S. N.; Khriyenko, Oleksiy (IATED Academy, 2017)Personalized learning is increasingly gaining popularity, especially with the development of information technology and modern educational resources for learning. Each person is individual and has different knowledge ... -
Analysis of Somatosensory Cortical Responses to Different Electrotactile Stimulations as a Method Towards an Objective Definition of Artificial Sensory Feedback Stimuli : An MEG Pilot Study
Liu, Jia; Piitulainen, Harri; Vujaklija, Ivan (IEEE, 2022)Sensory feedback is a critical component in many human-machine interfaces (e.g., bionic limbs) to provide missing sensations. Specifically, electrotactile stimulation is a popular feedback modality able to evoke configurable ... -
Periodic, aperiodic, and phase-locked brain activity in the perception of continuous speech with different levels of difficulty and intelligibility
Ekroos, Tuike (2023)Jatkuva puhe on tärkeä osa ihmisten välistä jokapäiväistä kommunikaatiota. Tähänastisessa tutkimuksessa on esitetty, että puheen havaintoon liittyy niin periodista ja aperiodista aivotoimintaa kuin aivotoiminnan lukkiutumista ...
Ellei toisin mainittu, julkisesti saatavilla olevia JYX-metatietoja (poislukien tiivistelmät) saa vapaasti uudelleenkäyttää CC0-lisenssillä.