Informace o publikaci

Development of HAMOD: a High Agreement Multi-lingual Outlier Detection dataset

Autoři

JAKUBÍČEK Miloš ROMANI Emma RYCHLÝ Pavel HERMAN Ondřej

Rok publikování 2021
Druh Článek ve sborníku
Konference Recent Advances in Slavonic Natural Language Processing (RASLAN 2021)
Fakulta / Pracoviště MU

Fakulta informatiky

Citace
www
Klíčová slova HAMOD; Distributional thesaurus; Outlier detection; Word embeddings; Sketch Engine
Popis In this paper we describe further development of a High Agreement Multi- lingual Outlier Detection dataset (HAMOD) outlier that is used for the purpose of evaluation of automatic distributional thesauri. We briefly introduce the task and methodological motivation for developing such a dataset, then we present the current status of the dataset and related tools as well as results measured on the dataset so far (both in terms of agreement rates and thesauri eveluation). Finally we discuss future developments of HAMOD.
Související projekty:

Používáte starou verzi internetového prohlížeče. Doporučujeme aktualizovat Váš prohlížeč na nejnovější verzi.

Další info