Publication information
Text Corpus with Errors
Authors | |
---|---|
Year of publication | 2003 |
Type | Article in proceedings |
Conference | Text, Speech and Dialogue: Sixth International Conference, TSD 2003 |
MU Faculty / Unit | |
Citation | |
Web | http://nlp.fi.muni.cz/publications/tsd2003_pala_smrz_pary/ |
Field | Informatics |
Keywords | error detection |
Description | This paper describes a Czech text corpus (Chyby) containing various kinds of errors: spelling, typographical, grammatical, stylistic, and lexical. We explain how Chyby was built and how the errors in it were discovered, marked, and annotated. We present the classification of the errors, give statistics on the error types, and describe the tools used for annotating the errors. To the best of our knowledge, this is the first text corpus of this sort prepared for Czech. |