
Annotating Health Records: Does Ground Truth Even Exist?
Authors | |
---|---|
Year of publication | 2024 |
Type | Article in proceedings |
Conference | Proceedings of the Eighteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2024 |
MU Faculty or unit | |
Citation | |
www | https://nlp.fi.muni.cz/raslan/2024/paper12.pdf |
Keywords | Czech; Electronic health records; EHR; annotation; named entity recognition; NER; medical concept mining |
Description | This paper introduces a new ground truth subset of CSEHR, a dataset of Czech health records. The subset is annotated using a schema of 14 classes adapted from the Apache cTAKES Core Clinical Element types. The paper details the considerations involved in (re)defining individual annotation classes in an attempt to maximize their utility for computational understanding of medical text. |
Related projects: |