RNDr. Vít Suchomel, Ph.D.
Researcher, Centre for Natural Language Processing
correspondence Address:
Botanická 554/68a, 602 00 Brno
e‑mail: |
---|
Total number of publications: 51
2012
-
Detecting Spam in Web Corpora
6th Workshop on Recent Advances in Slavonic Natural Language Processing, year: 2012
-
Efficient Web Crawling for Large Text Corpora
Proceedings of the seventh Web as Corpus Workshop (WAC7), year: 2012
-
Large Corpora for Turkic Languages and Unsupervised Morphological Analysis
Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12), year: 2012
-
POS Annotated 50M Corpus of Tajik Language
Proceedings of the Workshop on Language Technology for Normalisation of Less-Resourced Languages (SALTMIL 8/AfLaT 2012), year: 2012
-
Recent Czech Web Corpora
6th Workshop on Recent Advances in Slavonic Natural Language Processing, year: 2012
-
SpiderLing
Year: 2012
-
Towards 100M Morphologically Annotated Corpus of Tajik
Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2012, year: 2012
2011
-
Building a 50M Corpus of Tajik Language
Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2011, year: 2011
-
Chared
Year: 2011
-
chared: Character Encoding Detection with a Known Language
RASLAN 2011, year: 2011