Interspeech 2016 - Experiment results for paper "webASR 2 - Improved cloud based speech technology"

dataset

posted on 2016-06-22, 09:25 authored by Oscar Saz TorralbaOscar Saz Torralba, Thomas HainThomas Hain, Salil Deena, Mortaza Doulaty BashkandMortaza Doulaty Bashkand, Madina HasanMadina Hasan, Wai Man NgWai Man Ng, Rosanna MilnerRosanna Milner, Yulan LiuYulan Liu

The files in the dataset correspond to results that have been generated for the Interspeech 2016 article: "webASR 2 - Improved cloud based speech technology" DOI: 10.21437/Interspeech.2016-700.

The files included are of several types:
- .ctm, which correspond to the output of an automatic speech recognition system.
- .rttm, which correspond to the output of a speaker diarisation system.
- .moses which correspond to the output of a machine translation system
- .sys, which correspond to the scoring results of the corresponding system.

The following is a description about the naming convention of the files:

TableX-LineY: This is the output and scoring results corresponding to Line Y of Table X in the article.

All file types are standard outputs that are recognised by the speech technology community and can be opened using any text editor.

Funding

EPSRC Programme Grant EP/I031022/1 (Natural Speech Technology)

History

Ethics

There is no personal data or any that requires ethical approval

Policy

The data complies with the institution and funders' policies on access and sharing

Sharing and access restrictions

The data can be shared openly

Data description

The file formats are open or commonly used

Methodology, headings and units

Headings and units are explained in the files

Usage metrics

Keywords

Automatic Speech Recognition Web API Lightly Supervised Alignment Speaker Diarisation Machine Translation Interspeech 2016 Signal Processing Natural Language Processing Artificial Intelligence and Image Processing Web Technologies (excl. Web Search)

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM