Publication details

Pattern Generation Revisited

Authors

ANTOŠ David SOJKA Petr

Year of publication 2001
Type Article in Proceedings
Conference Proceedings of the Twelfth European TeX Conference
MU Faculty or unit

Faculty of Informatics

Citation
web
Field Use of computers, robotics and its application
Keywords patterns;UNICODE;finite automata;natural language processing;language engineering
Description The program PATGEN, being nearly twenty years old, doesn't suit today's needs, as a) it is nearly impossible to make changes, b) it is limited to eight-bit encodings, c) it uses static data structures, d) reuse of the pattern technique and packed trie data structure for problems other than hyphenation (context dependent ligature handling, spell checking Thai syllabification, etc) is cumbersome. Those and other reasons explained further in the paper led us to the decision to reimplement PATGEN from scratch in an object-oriented manner (like NTS) and to create the PATtern LIBrary PATLIB and the (hyphenation) pattern generator based on it. We argue that this general aproach allows the code to be used in many applications in computer typesetting area, in addition to those of pattern recognition, which include various language processing, optical character recognition, and others.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.

More info