View article

[PDF] from ed.ac.uk

Modelling acoustic feature dependencies with artificial neural networks: Trajectory-RNADE

Authors

Benigno Uria, Iain Murray, Steve Renals, Cassia Valentini-Botinhao, John Bridle

Publication date

2015/4/19

Conference

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pages

4465-4469

Publisher

IEEE

Description

Given a transcription, sampling from a good model of acoustic feature trajectories should result in plausible realizations of an utterance. However, samples from current probabilistic speech synthesis systems result in low quality synthetic speech. Henter et al. have demonstrated the need to capture the dependencies between acoustic features conditioned on the phonetic labels in order to obtain high quality synthetic speech. These dependencies are often ignored in neural network based acoustic models. We tackle this deficiency by introducing a probabilistic neural network model of acoustic trajectories, trajectory RNADE, able to capture these dependencies.

Total citations

Cited by 38

2015201620172018201920202021202220235 9 6 6 2 4 4 1

Scholar articles

Modelling acoustic feature dependencies with artificial neural networks: Trajectory-RNADE

B Uria, I Murray, S Renals, C Valentini-Botinhao… - 2015 IEEE International Conference on Acoustics …, 2015