Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: The case of Mo Piu
Author(s):
Caelen-haumont, Genevieve; Sam, S.; Celi - Language And Information Technology; European Media Laboratory Gmbh (eml); Immi; Meta; Nuance; Quaero
Editor(s):
Dogan M.U.; Mariani J.; Moreno A.; Goggi S.; Choukri K.; Calzolari N.; Odijk J.; Declerck T.; Maegaard B.; Piperidis S.; Mazo H.; Hamon O.
Format:
Conference presentation
Publisher:
European Language Resources Association (ELRA), 2012.
Language:
English
Abstract:
This paper aims at assessing the automatic labeling of an undocumented, unknown, unwritten and under-resourced language (Mo Piu) of the North Vietnam, by an expert phonetician. In the previous stage of the work, 7 sets of languages were chosen among Mandarin, Vietnamese, Khmer, English, French, to compete in order to select the best models of languages to be used for the phonetic labeling of Mo Piu isolated words. Two sets of languages (1Mandarin + French, 2° Vietnamese + French) which got the best scores showed an additional distribution of their results. Our aim is now to study this distribution more precisely and more extensively, in order to statistically select the best models of languages and among them, the best sets of phonetic units which minimize the wrong phonetic automatic labeling.