TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
TEXT
1/2
Interspeech 2016 - Experiment results for paper "Error correction in lightly supervised alignment of broadcast subtitles"
dataset
posted on 2016-06-22, 09:37 authored by Oscar Saz TorralbaOscar Saz Torralba, Thomas HainThomas Hain, Julia Olcoz MartinezThe files in the dataset correspond to results that have been generated
for the Interspeech 2016 article: "Error correction in lightly supervised alignment of broadcast subtitles" DOI: 10.21437/Interspeech.2016-56.
The files in the zip file are of two types:
- .ctm, which correspond to the output of the lightly supervised alignment system.
- .sys, which correspond to scoring of the lightly supervised alignment system and includes the overall F measure as well as the precision and recall of the overall system and of each individual show.
The following is a description about the naming convention of the files:
TableX-LineY: This is the alignment and scoring output corresponding to Line Y of Table X in the article.
TableX-LineY-[dev|eval]: This is the alignment and scoring output corresponding to Line Y of Table X for development (dev) or evaluation (eval) in the article.
All two file types are standard outputs that are recognised by the speech technology community and can be opened using any text editor.
The files in the zip file are of two types:
- .ctm, which correspond to the output of the lightly supervised alignment system.
- .sys, which correspond to scoring of the lightly supervised alignment system and includes the overall F measure as well as the precision and recall of the overall system and of each individual show.
The following is a description about the naming convention of the files:
TableX-LineY: This is the alignment and scoring output corresponding to Line Y of Table X in the article.
TableX-LineY-[dev|eval]: This is the alignment and scoring output corresponding to Line Y of Table X for development (dev) or evaluation (eval) in the article.
All two file types are standard outputs that are recognised by the speech technology community and can be opened using any text editor.
Funding
EPSRC Programme Grant EP/I031022/1 (Natural Speech Technology)
History
Ethics
- There is no personal data or any that requires ethical approval
Policy
- The data complies with the institution and funders' policies on access and sharing
Sharing and access restrictions
- The data can be shared openly
Data description
- The file formats are open or commonly used
Methodology, headings and units
- Headings and units are explained in the files