Prosodic Data-Driven Modelling of Narrative Style in FESTIVAL TTS

Conference Paper

Publication Date:

2004

abstract:

A general data-driven procedure for creating new prosodic modules for the Italian FESTIVAL Text-To-Speech (TTS) [1] synthesizer is described. These modules are based on the "Classification and Regression Trees" (CART) theory. The prosodic factors taken into consideration are: duration, pitch and loudness. Loudness control has been implemented as an extension to the MBROLA diphone concatenative synthesizer. The prosodic models were trained using two speech corpora with different speaking style, and the effectiveness of the CART-based prosody was assessed with a set of evaluation tests.

Iris type:

04.01 Contributo in Atti di convegno

Keywords:

Data-Driven; Prosody Model; Narrative Style; TTS

List of contributors:

Tisato, Graziano; Drioli, Carlo; Tesser, Fabio; Cosi, Piero

Handle:

https://iris.cnr.it/handle/20.500.14243/60745

Book title:

5th ISCA Speech Synthesis Workshop

Overview

URL

http://www.pd.istc.cnr.it/Papers/PieroCosi/ft-SSW2004.pdf

Prosodic Data-Driven Modelling of Narrative Style in FESTIVAL TTS

Tisato, Graziano; Drioli, Carlo; Tesser, Fabio; Cosi, Piero

Overview

URL