The University of Sheffield
Browse
TEXT
Figure2a-Bar1.ctm (844.67 kB)
TEXT
Figure2a-Bar1.sys (3.71 kB)
TEXT
Figure2a-Bar2.ctm (856.57 kB)
TEXT
Figure2a-Bar2.sys (3.71 kB)
TEXT
Figure2a-Bar3.ctm (850.74 kB)
TEXT
Figure2a-Bar3.sys (3.71 kB)
TEXT
Figure2a-Bar4.ctm (857.38 kB)
TEXT
Figure2a-Bar4.sys (3.71 kB)
TEXT
Figure2a-Bar5.ctm (850.18 kB)
TEXT
Figure2a-Bar5.sys (3.71 kB)
TEXT
Figure2a-Bar6.ctm (836.28 kB)
TEXT
Figure2a-Bar6.sys (3.71 kB)
TEXT
Figure2a-Bar7.ctm (849.12 kB)
TEXT
Figure2a-Bar7.sys (3.71 kB)
TEXT
Figure2a-Bar8.ctm (842.15 kB)
TEXT
Figure2a-Bar8.sys (3.71 kB)
TEXT
Figure2a-Bar9.ctm (851.01 kB)
TEXT
Figure2a-Bar9.sys (3.71 kB)
TEXT
Figure2a-Bar10.ctm (842.18 kB)
TEXT
Figure2a-Bar10.sys (3.71 kB)
1/0
99 files

Computer, Speech and Language - Experiment results for paper "Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations"

dataset
posted on 2016-07-04, 08:56 authored by Oscar Saz TorralbaOscar Saz Torralba, Thomas HainThomas Hain
The files in the dataset correspond to results that have been generated for the Computer, Speech and Language article: "Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations" http://dx.doi.org/10.1016/j.csl.2016.06.008.

The files in the zip file are of three types:
- .ctm, which correspond to the output of the automatic speech recognition system and the columns include segment information as well as transcripts of the recognition.
- .sys, which correspond to scoring of the automatic speech recognition system and includes the overall word error rate as well as the number of insertions, deletions and substitutions of the overall system.
- .lur, which provides a more detailed decomposition of the word error rate across different tags.

The following is a description about the naming convention of the files:

TableX-LineY: This is the recognition and scoring output corresponding to Line Y of Table X in the article.
Figure X-BarY: This is the recognition and scoring output corresponding to Bar Y (starting on the left hand side) of Figure X in the article.

All three file types are standard outputs that are recognised by the automatic speech recognition community and can be opened using any text editor.

Funding

EPSRC Programme Grant EP/I031022/1 (Natural Speech Technology)

History

Ethics

  • There is no personal data or any that requires ethical approval

Policy

  • The data complies with the institution and funders' policies on access and sharing

Sharing and access restrictions

  • The data can be shared openly

Data description

  • The file formats are open or commonly used

Methodology, headings and units

  • Headings and units are explained in the files