Project information
Czech language in the era of computers. Text corpora and lexical and grammatical base for the development of Czech in the 21 st century
- Project Identification
- GV405/96/K214
- Project Period
- 1/1996 - 1/2001
- Investor / Pogramme / Project type
-
Czech Science Foundation
- Complex Projects
- MU Faculty or unit
- Faculty of Informatics
- Cooperating Organization
-
Institute of the Czech Language of the ASCR, v. v. i.
- Responsible person RNDr. Jan Králík, CSc.
- Responsible person prof. PhDr. Eva Hajičová, DrSc.
The aim of the submitted complex project is to create a base for a versatile processing of Czech language, which is a primary cultural heritage of the nation, and with the use of the contemporary information technologies to build its computerized treasur y for the 2lst century. The first task of the project is to complete successfully the already started building of Czech National Corpus (designed as a structured corpus of Czech texts containing tens of millions of words) and make it accessible for the w ide community of experts and researchers: this can bn done through the stabilization of the staff of the Institute of Czech National Corpus. The second natural task of the project is a research of the present-day Czech based on contemporary theoretical methods and techniques of computational linguistics and lexicography: this covers examination of the grammatical structure of the present-day Czech and building formal descriptions of the language phenomena belonging to the particular language levels tak
Publications
Total number of publications: 15
2009
-
Kajícný a nevěřícný - adjektiva na -cí/-cný: slovníky, gramatiky, korpusy
After Half a Century of Slavonic Natural Language Processing., edition: Vyd. 1., year: 2009, number of pages: 250 s.
-
Semantic Classes of Czech Verbs
Proceedings of the Conference on Intelligent Information Systems 2009, year: 2009
2008
-
K jednomu typu vyjadřování stupně v češtině
Year: 2008, type: Conference abstract
2007
-
Korpus jako zdroj dat pro opravy chyb automatické morfologické analýzy
Grammar & Corpora, 2nd International Conference, Abstracts, year: 2007
2006
-
Korpus soukromé korespondence (KSK) z hlediska morfologického značkování
Linguistica Brunensia, year: 2006, volume: A 54, edition: 1
2002
-
Mluvnice versus korpus, několik poznámek k problémům dubletních a variantních koncovek českých substantiv
Čeština - univerzália a specifika 4, year: 2002
2001
-
Příprava elektronických korpusů češtiny
Přednášky a besedy z XXXIV. běhu LŠSS, year: 2001
2000
-
GCQP -- Multiplatform Graphical User Interface to the CQP corpus manager
Proceedings of the Ninth EURALEX International Congress, year: 2000
1999
-
Document Multiplicity Elimination and Corpora Management
Proceedings of ISAS'99, year: 1999
-
Morfologické značkování složených slovesných tvarů v korpusu.
SPFFBU, year: 1999, volume: 48, edition: A 47