Publications

Public deliverables

Internship reports

 

Academic publications

Show all

József Domokos, Adriana Stan, Mircea Giurgiu (2014): An Approach to Lexical Stress Detection from Transcribed Continuous Speech Using Acoustic Features. In: Proc. Telfor2014, 2014. (Type: Inproceeding | BibTeX | Tags: lexical stress, Speech synthesis)
A. Gallardo-Antolín, J.M. Montero, S. King (2014): A Comparison of Open-Source Segmentation Architectures for Dealing with Imperfect Data from the Media in Speech Synthesis. In: Proc. Interspeech 2014, 2014. (Type: Inproceeding | Abstract | BibTeX | Tags: expressive speech synthesis, speaker diarization, speaking styles, Speech synthesis)
Ling-Hui Chen, Tuomo Raitio, Cassia Valentini-Botinhao, Junichi Yamagishi, Zhen-Hua Ling (2014): DNN-based stochastic postfilter for HMM-based speech synthesis. In: Proc. Interspeech 2014, pp. 1954-1958, Singapore, 2014. (Type: Inproceeding | Links | BibTeX | Tags: DNN, HMM, modulation spectrum, postfilter, segmental quality, Speech synthesis)
Thomas Merritt, Tuomo Raitio, Simon King (2014): Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis. In: Proc. Interspeech 2014, pp. 1509-1513, Singapore, 2014. (Type: Inproceeding | Links | BibTeX | Tags: GlottHMM, hidden Markov modelling, source filter interaction, source filter model, Speech synthesis)
Tuomo Raitio, Antti Suni, Lauri Juvela, Martti Vainio, Paavo Alku (2014): Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort. In: Proc. Interspeech 2014, pp. 1969-1973, Singapore, 2014. (Type: Inproceeding | Links | BibTeX | Tags: Deep neural network, DNN, glottal flow, Speech synthesis, Vocal effort, voice source modelling)
S. Lebai Lutfi, F. Fernández-Martínez, J. Lorenzo-Trueba, R. Barra-Chicote, J. M. Montero (2013): I Feel You: The Design and Evaluation of a Domotic Affect- Sensitive Spoken Conversational Agent. In: Sensors, 13 (8), pp. 10519-10538, 2013, ISSN: 1424-8220. (Type: Article | Abstract | Links | BibTeX | Tags: affective agent, emotional speech, Speech synthesis, spoken conversational agents; evaluation)
Bajibabu Bollepalli, Tuomo Raitio, Paavo Alku (2013): Effect of MPEG Audio Compression on HMM-based Speech Synthesis. In: Proc. Interspeech 2013, 2013. (Type: Inproceeding | Abstract | BibTeX | Tags: GlottHMM, HMM, MP3, Speech synthesis)
Tuomo Raitio, Antti Suni, Jouni Pohjalainen, Manu Airaksinen, Martti Vainio, Paavo Alku (2013): Analysis and Synthesis of Shouted Speech. In: Proc. Interspeech 2103, 2013. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: shouting, speech analysis, Speech synthesis)
Tuomo Raitio, John Kane, Thomas Drugman, Christer Gobl (2013): HMM-based synthesis of creaky voice. In: Proc. Interspeech 2013, 2013. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Contextual Factors, Creaky voice, excitation modeling, F0 estimation, Speech synthesis)
Antti Suni, Reima Karhila, Tuomo Raitio, Mikko Kurimo, Martti Vainio, Paavo Alku (2013): Lombard Modified Text-to-Speech Synthesis for Improved Intelligibility: Submission for the Hurricane Challenge 2013. In: Proc. Interspeech 2013, 2013. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: GlottHMM, Hurricane challenge, intelligibility, Lombard speech, Speech synthesis)
Karhila, Reima, Remes, Ulpu, Kurimo, Mikko (2013): HMM-Based Speech Synthesis Adaptation Using Noisy Data: Analysis and Evaluation Methods. In: Proceedings of ICASSP-13, 2013, (Accepted to ICASSP 2013). (Type: Inproceeding | Abstract | BibTeX | Tags: Adaptation, Evaluation, Feature extraction, Noise robustness, Speech synthesis)
Drugman, Thomas, Kane, John, Raitio, Tuomo, Gobl, Christer (2013): Prediction of Creaky Voice from Contextual Factors. In: Proc. ICASSP 2013, 2013. (Type: Inproceeding | Abstract | BibTeX | Tags: Contextual Factors, Creaky voice, Expressive Speech, Speech synthesis)
Ruben San-Segundo, Juan M. Montero, Veronica Lopez-Ludeña, Simon King (2012): Detecting Acronyms from Capital Letter Sequences in Spanish. In: Proc. Interspeech 2012, 2012, ISSN: 1990-9772. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: Abbreviations, Acronyms, Capital letter sequence pronunciation, Spanish, Speech synthesis, Spelling)
Anna C. Janska, Erich Schröger, Thomas Jacobsen, Robert A. J. Clark (2012): Asymmetries in the perception of synthesized speech. In: Proc. Interspeech 2012, 2012, ISSN: 1990-9770. (Type: Inproceeding | Abstract | Links | BibTeX | Tags: perceptual evaluation, Speech synthesis)
Reima Karhila, Rama Sanand Doddipatla, Mikko Kurimo, Peter Smit (2012): Creating synthetic voices for children by adapting adult average voice using stacked transformations and VTLN. In: Proc. ICASSP 2012, IEEEE, 2012, ISSN: 1520-6149. (Type: Inproceeding | Links | BibTeX | Tags: speaker adaptation, Speech synthesis)