Task #3855
Task #3680 (closed): RA4a - Automatic error prediction
Task #3698: Experiment with one-class classification for join cost enhancements
More data for artefacts collection
Status:
Postponed
Priority:
Normal
Assignee:
Target version:
Start date:
06.04.2016
Due date:
10.04.2016
% Done:
0%
Estimated time:
Description
We need more data for the listening tests. In particular, we need to increase the coverage of rare vowels. The current counts per phone are:
phone | total | OK | artefact |
a | 78 | 60 | 18 |
e | 82 | 46 | 36 |
i | 49 | 30 | 19 |
o | 92 | 50 | 42 |
u | 23 | 22 | 1 |
A | 123 | 17 | 104 |
E | 4 | 4 | 0 |
I | 23 | 17 | 6 |
O | 0 | 0 | 0 |
U | 4 | 4 | 0 |
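To see which phones most need additional data, the counts can be checked with a short script. This is only a sketch: the threshold `MIN_TOTAL` is an assumed value chosen for illustration and is not part of the task, and the function names are hypothetical.

```python
# Counts copied from the table above: phone -> (total, OK, artefact).
COUNTS = {
    "a": (78, 60, 18),
    "e": (82, 46, 36),
    "i": (49, 30, 19),
    "o": (92, 50, 42),
    "u": (23, 22, 1),
    "A": (123, 17, 104),
    "E": (4, 4, 0),
    "I": (23, 17, 6),
    "O": (0, 0, 0),
    "U": (4, 4, 0),
}

# Hypothetical minimum number of items per phone (an assumption, not from the task).
MIN_TOTAL = 30


def under_covered(counts, min_total=MIN_TOTAL):
    """Return the phones whose total count is below min_total."""
    return [phone for phone, (total, _ok, _art) in counts.items() if total < min_total]


def inconsistent_rows(counts):
    """Return the phones whose total differs from OK + artefact."""
    return [phone for phone, (total, ok, art) in counts.items() if total != ok + art]


if __name__ == "__main__":
    print("Under-covered phones:", under_covered(COUNTS))
    print("Rows where total != OK + artefact:", inconsistent_rows(COUNTS))
```

With the counts above and the assumed threshold of 30, the under-covered phones are u, E, I, O and U. The consistency check also flags the row for A, since 17 + 104 = 121 rather than 123, so that row may need to be rechecked.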
We can either try to find additional words in the corpus (although they will be shorter), or build "artificial" words by joining two halves of words (or word transitions) taken from the corpus.
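The second option could be prototyped on the symbolic level roughly as follows. This is a minimal sketch under several assumptions: words are represented as lists of phone symbols, the join is placed at the first occurrence of the target vowel, and the function name `artificial_words` and the example words are hypothetical. Where exactly the audio would be cut is not specified in this task.

```python
from itertools import permutations


def artificial_words(words, vowel):
    """Combine the left part of one word with the right part of another.

    Each word is a list of phone symbols. For every ordered pair of
    different words that both contain `vowel`, keep everything up to and
    including the vowel from the first word and everything after the
    vowel from the second word.
    """
    results = []
    for left, right in permutations(words, 2):
        if vowel in left and vowel in right:
            i = left.index(vowel)   # first occurrence in the left word
            j = right.index(vowel)  # first occurrence in the right word
            results.append(left[: i + 1] + right[j + 1:])
    return results


if __name__ == "__main__":
    # Made-up phone sequences, for illustration only.
    corpus_words = [["k", "O", "t"], ["m", "O", "s"], ["p", "a", "s"]]
    for word in artificial_words(corpus_words, "O"):
        print(" ".join(word))
```

For the made-up input above, this yields the sequences `k O s` and `m O t`. Each of them contains the target vowel with a join inside it, which is the kind of item the listening tests would need.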