Simple4All technology can be viewed and downloaded here.

Blizzard 2014 Annotations

Annotations generated for the 2014 Blizzard Challenge

Text Normalisation Datasets

Text datasets in three languages: English, Spanish, and Romanian

Romanian Broadcast News

A speech and text dataset of Romanian broadcast news

Romanian Parliamentary Speeches

Speech and text of Romanian parliamentary speeches

User Feedback Data

Synthetic samples and the spoken user feedback


LDA-based language identification tool