Publication details

Building Big Czech Corpus : Collecting and Converting Czech Corpora

Authors

HANČAR Pavel

Year of publication 2008
Type Article in Proceedings
Conference RASLAN 2008
MU Faculty or unit

Faculty of Informatics

Citation HANČAR, Pavel. Building Big Czech Corpus : Collecting and Converting Czech Corpora. In RASLAN 2008. Masaryk University, Brno: Masaryk University, Brno, 2008, p. 94-97, 100 pp. ISBN 978-80-210-4741-9.
web https://nlp.fi.muni.cz/raslan/2008/papers/11.pdf
Field Linguistics
Keywords corpus; desamb; vertjoin
Description This paper describes the creation of a big Czech corpus from the many Czech corpora kept on the NLP Centre server. It describes new tools developed for this purpose, difficulties that may arise, and ways to solve them.