Findings of the shared task on multilingual coreference resolution
Zdeněk Žabokrtský and
Miloslav Konopík and
Anna Nedoluzhko and
Michal Novák and
Maciej Ogrodniczuk and
Martin Popel and
Ondřej Pražák and
Jakub Sido and
Daniel Zeman and
Yilun Zhu
CRAC (2022)
Abstract
This paper presents an overview of the shared task on multilingual coreference resolution associated with the CRAC 2022 workshop. Shared task participants were supposed to develop trainable systems capable of identifying mentions and clustering them according to identity coreference. The public edition of CorefUD 1.0, which contains 13 datasets for 10 languages, was used as the source of training and evaluation data. The CoNLL score used in previous coreference-oriented shared tasks was used as the main evaluation metric. There were 8 coreference prediction systems submitted by 5 participating teams; in addition, there was a competitive Transformer-based baseline system provided by the organizers at the beginning of the shared task. The winner system outperformed the baseline by 12 percentage points (in terms of the CoNLL scores averaged across all datasets for individual languages).
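For reference, the CoNLL score mentioned above is conventionally defined as the arithmetic mean of the MUC, B³, and CEAF-e F1 scores, and the overall ranking is based on a macro-average over datasets. A minimal illustrative sketch (function names are hypothetical; this is not the official CorefUD scorer):

```python
def conll_score(muc_f1: float, b_cubed_f1: float, ceaf_e_f1: float) -> float:
    """CoNLL score: average of the MUC, B-cubed, and CEAF-e F1 scores (in percentage points)."""
    return (muc_f1 + b_cubed_f1 + ceaf_e_f1) / 3.0


def macro_average(per_dataset_conll: dict[str, float]) -> float:
    """Macro-average of per-dataset CoNLL scores, roughly how the overall ranking is aggregated."""
    return sum(per_dataset_conll.values()) / len(per_dataset_conll)
```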
BibTeX
@inproceedings{zabokrtsky-etal-2022-findings,
title = "Findings of the Shared Task on Multilingual Coreference Resolution",
author = "{\v{Z}}abokrtsk{\'y}, Zden{\v{e}}k and
Konop{\'\i}k, Miloslav and
Nedoluzhko, Anna and
Nov{\'a}k, Michal and
Ogrodniczuk, Maciej and
Popel, Martin and
Pra{\v{z}}{\'a}k, Ond{\v{r}}ej and
Sido, Jakub and
Zeman, Daniel and
Zhu, Yilun",
booktitle = "Proceedings of the CRAC 2022 Shared Task on Multilingual Coreference Resolution",
month = oct,
year = "2022",
address = "Gyeongju, Republic of Korea",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.crac-mcr.1",
pages = "1--17",
abstract = "This paper presents an overview of the shared task on multilingual coreference resolution associated with the CRAC 2022 workshop. Shared task participants were supposed to develop trainable systems capable of identifying mentions and clustering them according to identity coreference. The public edition of CorefUD 1.0, which contains 13 datasets for 10 languages, was used as the source of training and evaluation data. The CoNLL score used in previous coreference-oriented shared tasks was used as the main evaluation metric. There were 8 coreference prediction systems submitted by 5 participating teams; in addition, there was a competitive Transformer-based baseline system provided by the organizers at the beginning of the shared task. The winner system outperformed the baseline by 12 percentage points (in terms of the CoNLL scores averaged across all datasets for individual languages).",
}