Computer, Speech and Language - Experiment results for paper "Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations"

dataset

posted on 2016-07-04, 08:56 authored by Oscar Saz TorralbaOscar Saz Torralba, Thomas HainThomas Hain

The files in the dataset correspond to results that have been generated for the Computer, Speech and Language article: "Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations" http://dx.doi.org/10.1016/j.csl.2016.06.008.

The files in the zip file are of three types:

- .ctm, which correspond to the output of the automatic speech recognition system and the columns include segment information as well as transcripts of the recognition.

- .sys, which correspond to scoring of the automatic speech recognition system and includes the overall word error rate as well as the number of insertions, deletions and substitutions of the overall system.

- .lur, which provides a more detailed decomposition of the word error rate across different tags.

The following is a description about the naming convention of the files:

TableX-LineY: This is the recognition and scoring output corresponding to Line Y of Table X in the article.

Figure X-BarY: This is the recognition and scoring output corresponding to Bar Y (starting on the left hand side) of Figure X in the article.

All three file types are standard outputs that are recognised by the automatic speech recognition community and can be opened using any text editor.

Funding

EPSRC Programme Grant EP/I031022/1 (Natural Speech Technology)

History

Ethics

There is no personal data or any that requires ethical approval

Policy

The data complies with the institution and funders' policies on access and sharing

Sharing and access restrictions

The data can be shared openly

Data description

The file formats are open or commonly used

Methodology, headings and units

Headings and units are explained in the files

Usage metrics

Keywords

Automatic Speech Recognition Spoken Language Technologies Multimedia Data Natural Language Processing Pattern Recognition and Data Mining Computer-Human Interaction Signal Processing Artificial Intelligence and Image Processing

Licence

CC BY 4.0