Building Big Czech Corpus : Collecting and Converting Czech Corpora
Authors | |
---|---|
Year of publication | 2008 |
Type | Article in Proceedings |
Conference | RASLAN 2008 |
MU Faculty or unit | |
Citation | HANČAR, Pavel. Building Big Czech Corpus : Collecting and Converting Czech Corpora. In RASLAN 2008. Masaryk University, Brno: Masaryk University, Brno, 2008, p. 94-97, 100 pp. ISBN 978-80-210-4741-9. |
web | https://nlp.fi.muni.cz/raslan/2008/papers/11.pdf |
Field | Linguistics |
Keywords | corpus; desamb; vertjoin; |
Description | This paper describes a creating of a big Czech corpus from many Czech corpora kept on the NLP Centre server. It describes new tools developed for this purpose, difficulties which may come up and a way how solve them. |
Related projects: |