Zde se nacházíte:
Informace o publikaci
Data Gathered with Automatic Tools from European Parliamentary Chambers
Autoři | |
---|---|
Rok publikování | 2023 |
Druh | Článek ve sborníku |
Konference | Recent Advances in Slavonic Natural Language Processing, RASLAN 2023 |
Fakulta / Pracoviště MU | |
Citace | |
www | Článek ve sborníku |
Klíčová slova | parliamentary protocols, continuous downloading, corpus processing, automatic tools, corpus development, automatic maintenance of tools |
Přiložené soubory | |
Popis | This paper reflects on the set of tools developed in my bachelor’s thesis, titled ”Continuous Automatic Development of European Parliamentary Corpora.” Despite the existence of numerous corpora offering speeches from the parliaments of the European Union, the developed toolset is designed to gather and build such corpora with minimal human intervention. With nine months of practical application, this paper presents insights into the faced challenges and their respective solutions, providing an overview since the initial release of the toolset. |