Publication details

The Sketch Engine

Authors

KILGARRIFF Adam, RYCHLÝ Pavel, SMRŽ Pavel, TUGWELL David

Year of publication 2004
Type Article in Proceedings
Conference Proceedings of the Eleventh EURALEX International Congress
MU Faculty or unit

Faculty of Informatics

Citation
Web http://nlp.fi.muni.cz/publications/euralex2004_kilgarriff_pary_smrz_tugwell/
Field Informatics
Keywords corpora; corpus management; statistics; word sketches
Description Word sketches are one-page, automatic, corpus-based summaries of a word's grammatical and collocational behaviour. They were first used in the production of the Macmillan English Dictionary and were presented at Euralex 2002. At that point, they existed only for English. Now, we have developed the Sketch Engine, a corpus tool which takes as input a corpus of any language and corresponding grammar patterns, and which generates word sketches for the words of that language. It also generates a thesaurus and sketch differences, which specify similarities and differences between near-synonyms. We briefly present a case study investigating the applicability of the Sketch Engine to free word-order languages. The results show that word sketches could facilitate lexicographic work in Czech as they have for English.
