Task #3855: More data for artefacts collection - HQSYN16 - Projects of Department of Cybernetics & NTIS P1 - Cybernetic Systems, University of West Bohemia

Actions

Copy link

Task #3855

closed

Task #3680: RA4a - Automatic error prediction

Task #3698: Experiment with one-class clasification for join cost enhancements

More data for artefacts collection

Added by Tihelka Dan about 10 years ago. Updated over 9 years ago.

Status:

Postponed

Priority:

Normal

Assignee:

Grůber Martin

Target version:

RA4: Automatic error prediction and signal modification

Start date:

06.04.2016

Due date:

10.04.2016

% Done:

Estimated time:

Description

We need more data for listening tests. Especially we need to increase the coverage of rare vowels. Currently we have:

phone	total	OK	artefact
a	78	60	18
e	82	46	36
i	49	30	19
o	92	50	42
u	23	22	1
A	123	17	104
E	4	4	0
I	23	17	6
O	0	0	0
U	4	4	0

We can either try to find additional words in the corpus (shorter, though), or build "artificial" words by joining two halves of words (or words transitions) from the corpus.

Files

Download all files

prepare_words.py (26.8 KB) prepare_words.py	Script to select words of the appropriate length from ASF	Tihelka Dan, 06.04.2016 13:39
asf2json_mix.py (5.06 KB) asf2json_mix.py		Tihelka Dan, 09.08.2016 15:15

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

HQSYN16

Custom queries

Task #3855

More data for artefacts collection

Updated by Tihelka Dan about 10 years ago

Updated by Matoušek Jindřich about 10 years ago

Updated by Grůber Martin almost 10 years ago

Updated by Tihelka Dan almost 10 years ago

Updated by Grůber Martin over 9 years ago