Hungarian Telephone Speech Test Database (Tesztel)
Project coordinator: Klara Vicsi
Staff: Cs. Teleki, Gy. Szaszák, Z. Valyon (Budapest University of Technology and Economics)
The aim of this project was to create a mobile phone voice based Hungarian speech database recorded in noisy environments for testing purposes (also called Tesztel). The database contains voices of 100 speakers, recorded through mobile telephone in noisy environments.
The main goal of creating this database was to test phoneme based recognizers, which have been already trained, so the corpus must have been compact and had to cover as good as possible the specific character of the Hungarian language. The text that the speaker had to tell was designed to contain at least one of every Hungarian phoneme, taking in consideration the statistics of phonemes, diphones, triphones and syllables in Hungarian language.
The corpus contains not only continuously told sentences, but command words, spelled forenames, numbers, dates, different currency types, city names, questions with yes/no answer, phonetically rich words. The database contains mostly spontaneous speech.
For further informations, contact us!