This paper describes an attempt to use Estonian statistical parametric speech synthesis to provide audio pronunciations of words and word forms in online dictionaries. Two new HTS voices were created and compared for this purpose. The paper gives an overview of the design and evaluation process for these voices. Several types of error were detected, including quantity errors, poor sound quality, accent errors, and gemination at the boundary of compound word components. The level of correctness and sound quality for the two parametric speech synthesisers ranged from 69% to 76%. The paper demonstrates that the voice Eva-2, which accepts text with diacritics as input, produces fewer errors. Still, the error rate of both new voices is too high to meet the criteria of orthoepy for learner's dictionaries.
IOS Press, Inc.