Publication details

Building Big Czech Corpus : Collecting and Converting Czech Corpora

Authors

HANČAR Pavel

Year of publication 2008
Type Article in Proceedings
Conference RASLAN 2008
MU Faculty or unit

Faculty of Informatics

Citation
Web https://nlp.fi.muni.cz/raslan/2008/papers/11.pdf
Field Linguistics
Keywords corpus; desamb; vertjoin;
Description This paper describes a creating of a big Czech corpus from many Czech corpora kept on the NLP Centre server. It describes new tools developed for this purpose, difficulties which may come up and a way how solve them.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.

More info