Classification of Primary Medical Records with RUBRYX-2: First Experience
Authors | |
---|---|
Year of publication | 2012 |
Type | Chapter of a book |
MU Faculty or unit | |
Citation | KRAUROVA, Olga, Mikhail ALEXANDROV and Aleš BOUREK. Classification of Primary Medical Records with RUBRYX-2: First Experience. Online. In Artificial Intelligence Methods and Techniques for Business and Engineering Aplications. 1st ed. Rzesow, Sofia: ITHEA, 2012, p. 56-70. ITHEA IBS ISC No.:26. ISBN 978-954-16-0058-0. |
Description | RUBRYX is a document classifier developed in 2000s for processing large volumes of Web information. RUBRYX uses weighted sum of n-grams (n=1,2,3) extracted from a very limited number of samples (about 5-10) and takes into account their mutual position in a given text. This sophisticated algorithm proves to be very effective in classifying primary medical records presented in a free text form. In the paper we study possibilities of RUBRYX (version 2.2) on a limited document set in Spanish. These documents are medical histories related to stomach diseases. Such area should be considered as a narrow subset of medical records. The high quality of archived results (accuracy 80%-90%) allows us to recommend RUBRYX for similar applications. |