This dataset contains the results generated for the article "Lightly supervised alignment of subtitles on multigenre broadcasts" in Multimedia Tools and Applications (Springer, ISSN: 1380-7501 / 1573-7721).<br>
<br>
The zip file contains files of three types:<br>
- .ctm, which contain the output of the automatic speech recognition system or the lightly supervised alignment system.<br>
- .rttm, which contain the output of the speech segmentation system.<br>
- .sys, which contain the scoring of the speech segmentation, automatic speech recognition or lightly supervised alignment system.<br>
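For readers who want to process the .ctm files programmatically, the following is a minimal Python sketch. It assumes the files follow the conventional CTM column layout (recording, channel, start time in seconds, duration in seconds, word, optional confidence) and that lines starting with `;;` are comments; the dataset description does not confirm these details, so check a file before relying on them. The names `CtmWord` and `parse_ctm` are illustrative only.

```python
from typing import List, NamedTuple, Optional


class CtmWord(NamedTuple):
    """One time-aligned word, assuming the conventional CTM column order."""
    recording: str
    channel: str
    start: float                 # start time in seconds
    duration: float              # duration in seconds
    word: str
    confidence: Optional[float]  # None when the sixth column is absent


def parse_ctm(text: str) -> List[CtmWord]:
    """Parse CTM-style text into a list of CtmWord records."""
    words = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and ';;' comment lines (assumed convention).
        if not line or line.startswith(";;"):
            continue
        fields = line.split()
        if len(fields) < 5:
            continue  # not a complete word record
        confidence = float(fields[5]) if len(fields) > 5 else None
        words.append(
            CtmWord(fields[0], fields[1], float(fields[2]),
                    float(fields[3]), fields[4], confidence)
        )
    return words
```

To use it, read a file's contents and pass them in, for example `parse_ctm(open(path, encoding="utf-8").read())`, where `path` points to one of the .ctm files in the archive.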
<br>The files are named according to the following convention:<br>
<br>TableX-LineY-[ser|wer|f1]: the output and scoring results corresponding to Line Y of Table X in the article, reported in terms of SER, WER or F1 score.<br>
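The naming convention above can be used to map each file back to the article. The sketch below is one possible way to do this in Python. It assumes that X and Y are integers, that the metric is written in lower case, and that the file type follows as a dot-separated extension (for example, a hypothetical `Table2-Line3-wer.ctm`); none of these details are stated in the description, so adjust the pattern to the actual file names. `NAME_PATTERN` and `parse_result_name` are illustrative names.

```python
import re
from typing import Dict, Optional

# Assumed shape: Table<int>-Line<int>-<metric>[.<extension>]
NAME_PATTERN = re.compile(
    r"^Table(?P<table>\d+)-Line(?P<line>\d+)-(?P<metric>ser|wer|f1)"
    r"(?:\.(?P<ext>ctm|rttm|sys))?$"
)


def parse_result_name(name: str) -> Optional[Dict[str, object]]:
    """Return the table, line, metric and extension encoded in a file name.

    Returns None when the name does not follow the assumed convention.
    """
    match = NAME_PATTERN.match(name)
    if match is None:
        return None
    return {
        "table": int(match["table"]),
        "line": int(match["line"]),
        "metric": match["metric"],
        "ext": match["ext"],  # None when no extension is present
    }
```

Returning `None` for non-matching names lets the function be applied to every entry in the archive without special-casing unrelated files.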
<br>
All three file types are standard plain-text outputs recognised by the speech technology community and can be opened with any text editor.
Funding
EPSRC Programme Grant EP/I031022/1 (Natural Speech Technology)