NLP group

UWBA at SemEval-2024 Task 3: Dialogue Representation and Multimodal Fusion for Emotion Cause Analysis

Josef Baloun and Jiří Martínek and Ladislav Lenc and Pavel Král
SemEval2024 (2024)

Research topics:

Document Classification | Neural Networks

Abstract

In this paper, we present an approach for solving SemEval-2024 Task 3: The Competition of Multimodal Emotion Cause Analysis in Conversations. The task includes two subtasks that focus on emotion-cause pair extraction using text, video, and audio modalities. Our approach is composed of encoding all modalities (MFCC and Wav2Vec for audio, 3D-CNN for video, and transformer-based models for text) and combining them in an utterance-level fusion module. The model is then optimized for link and emotion prediction simultaneously. Our approach achieved 6th place in both subtasks. The full leaderboard can be found at https://codalab.lisn.upsaclay.fr/competitions/16141#results

Authors

Ing. Josef Baloun

PhD student

balounj@kiv.zcu.cz
More details

Ing. Jiří Martínek, Ph.D.

Researcher

jimar@kiv.zcu.cz
More details

Ing. Ladislav Lenc, Ph.D.

Researcher

llenc@kiv.zcu.cz
More details

Doc.Ing. Pavel Král, Ph.D.

Team leader

pkral@kiv.zcu.cz
More details

BibTex

@inproceedings{baloun-etal-2024-uwba, title = "{UWBA} at {S}em{E}val-2024 Task 3: Dialogue Representation and Multimodal Fusion for Emotion Cause Analysis", author = "Baloun, Josef and Martinek, Jiri and Lenc, Ladislav and Kral, Pavel and Zeman, Mat{\v{e}}j and Vl{\v{c}}ek, Luk{\'a}{\v{s}}", editor = {Ojha, Atul Kr. and Do{\u{g}}ru{\"o}z, A. Seza and Tayyar Madabushi, Harish and Da San Martino, Giovanni and Rosenthal, Sara and Ros{\'a}, Aiala}, booktitle = "Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)", month = jun, year = "2024", address = "Mexico City, Mexico", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2024.semeval-1.49", doi = "10.18653/v1/2024.semeval-1.49", pages = "316--325", abstract = "In this paper, we present an approach for solving SemEval-2024 Task 3: The Competition of Multimodal Emotion Cause Analysis in Conversations. The task includes two subtasks that focus on emotion-cause pair extraction using text, video, and audio modalities. Our approach is composed of encoding all modalities (MFCC and Wav2Vec for audio, 3D-CNN for video, and transformer-based models for text) and combining them in an utterance-level fusion module. The model is then optimized for link and emotion prediction simultaneously. Our approach achieved 6th place in both subtasks. The full leaderboard can be found at https://codalab.lisn.upsaclay.fr/competitions/16141{\#}results", }

NLP group

Research & development

UWBA at SemEval-2024 Task 3: Dialogue Representation and Multimodal Fusion for Emotion Cause Analysis

Research topics:

Abstract

Authors

Ing. Josef Baloun

PhD student

Ing. Jiří Martínek, Ph.D.

Researcher

Ing. Ladislav Lenc, Ph.D.

Researcher

Doc.Ing. Pavel Král, Ph.D.

Team leader

BibTex

Contact Us

NLP group

We offer