Značkování a status některých gramatických kategorií v ČNK (syntetické futurum, stupňování adjektiv, neurčité číslovky a příslovce míry)

Osolsobě,  Klára

Publication details

Značkování a status některých gramatických kategorií v ČNK (syntetické futurum, stupňování adjektiv, neurčité číslovky a příslovce míry)

Title in English	Tagging and classification of selected grammatical categiries in the Czech National Corpus (synthetic future, comparative forms of adjectives, indefinite numerals and measure adverbs)
Authors	OSOLSOBĚ Klára
Year of publication	2008
Type	Article in Proceedings
Conference	Grammar & Corpora / Gramatika a korpus 2007
MU Faculty or unit	Faculty of Arts
Citation
Field	Linguistics
Keywords	Corpus; tagging; synthetic future; gradation; undefinite numeral
Description	The aim of this paper is to present how a corpus can be used as a device (source) to improve the description of chosen grammatical phenomena in dictionary and grammar on one hand and in morpholigical taggers on the other hand. Two automatic morphological taggers used for tagging of Czech language corpora (Hajič, 2004 and Sedláček, 2005) will be compared. We shall analyze how three phenomena: a) synthetic future in Czech, b) comparison of adjectiv and c) word class transposition of words like hodně, mnoho, moc, are annotated in CNK and how are they described in Czech dictionaries (Slovník spisovného jazyka českého and Slovník spisovné češtiny pro školu a veřejnost) and grammars (Mluvnice češtiny, 1986, Česká mluvnice, 1989, Příruční mluvnice češtiny, 1996, Čeština, řeč a jazyk, 1996). We shall discuss how the analysis of corpus mined data can be used for detecting of gaps in examined materials and how can it contribute to filling them in.
Related projects:	Czech Language in Linguistic Terms (a dictionary)

10 reasons why you will fall in love with MU

Ask our ambassador

Read about research at MU

Značkování a status některých gramatických kategorií v ČNK (syntetické futurum, stupňování adjektiv, neurčité číslovky a příslovce míry)