
dc.rights.license: open (en_US)
dc.contributor.author: PEÑA, Diego
dc.contributor.author: AGUILERA, Ana
hal.structure.identifier: ESTIA INSTITUTE OF TECHNOLOGY
dc.contributor.author: DONGO, Irvin
dc.contributor.author: HEREDIA, Juanpablo
dc.contributor.author: CARDINALE, Yudith
dc.date.accessioned: 2024-11-02T10:48:34Z
dc.date.available: 2024-11-02T10:48:34Z
dc.date.issued: 2023
dc.identifier.issn: 2169-3536 (en_US)
dc.identifier.uri: https://oskar-bordeaux.fr/handle/20.500.12278/203107
dc.description.abstractEn: Multimodal methods for emotion recognition consider several sources of data to predict emotions; thus, a fusion method is needed to aggregate the individual results. In the literature, there is a wide variety of fusion methods to perform this task, but they are not suitable for all scenarios. In particular, there are two relevant aspects that can vary from one application to another: (i) in many scenarios, individual modalities can have different levels of data quality or even be absent, which demands fusion methods able to discriminate non-useful from relevant data; and (ii) in many applications, there are hardware restrictions that limit the use of complex fusion methods (e.g., a deep learning model), which can be quite computationally intensive. In this context, developers and researchers need metrics, guidelines, and a systematic process to evaluate and compare different fusion methods that fit their particular application scenarios. In response to this need, this paper presents a framework that establishes a base for a comparative evaluation of fusion methods, to demonstrate how they adapt to the quality differences of individual modalities and to evaluate their performance. The framework provides equivalent conditions for a fair assessment of fusion methods. Based on this framework, we evaluate several fusion methods for multimodal emotion recognition. Results demonstrate that, for the architecture and dataset selected, the best-fitting methods are Self-Attention and Weighted when all modalities are available, and Self-Attention and EmbraceNet+ when a modality is missing. Concerning execution time, the best results correspond to the Multilayer Perceptron (MLP) and Self-Attention models, due to their small number of operations. Thus, the proposed framework provides insights for researchers in this area to identify which fusion methods best fit their requirements, and thus to justify their selection.
dc.language.iso: EN (en_US)
dc.rights: Attribution 3.0 United States
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/us/
dc.subject.en: Emotion recognition
dc.subject.en: Fusion methods
dc.subject.en: Multimodality
dc.title.en: A Framework to Evaluate Fusion Methods for Multimodal Emotion Recognition
dc.type: Journal article (en_US)
dc.identifier.doi: 10.1109/ACCESS.2023.3240420 (en_US)
dc.subject.hal: Computer Science [cs] (en_US)
bordeaux.journal: IEEE Access (en_US)
bordeaux.page: 10218-10237 (en_US)
bordeaux.volume: 11 (en_US)
bordeaux.hal.laboratories: ESTIA - Recherche (en_US)
bordeaux.institution: Université de Bordeaux (en_US)
bordeaux.peerReviewed: yes (en_US)
bordeaux.inpress: no (en_US)
bordeaux.import.source: hal
hal.identifier: hal-04745555
hal.version: 1
hal.popular: no (en_US)
hal.audience: International (en_US)
hal.export: false
workflow.import.source: hal
dc.rights.cc: CC BY (en_US)
bordeaux.COinS: ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=IEEE%20Access&rft.date=2023&rft.volume=11&rft.spage=10218-10237&rft.epage=10218-10237&rft.eissn=2169-3536&rft.issn=2169-3536&rft.au=PE%C3%91A,%20Diego&AGUILERA,%20Ana&DONGO,%20Irvin&HEREDIA,%20Juanpablo&CARDINALE,%20Yudith&rft.genre=article
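
For readers who want a concrete picture of the fusion step described in the abstract above, the short Python sketch below illustrates a quality-weighted late fusion of per-modality class probabilities that simply skips absent modalities. It is a minimal illustration under assumed names and inputs (the function weighted_fusion, the dictionary arguments, and the example reliability weights are hypothetical), not the implementation or the exact Weighted method evaluated in the article.

import numpy as np

def weighted_fusion(modality_probs, weights):
    """Aggregate per-modality class probabilities into one fused prediction.

    modality_probs: dict of modality name -> probability vector, or None if absent.
    weights:        dict of modality name -> non-negative reliability weight.
    """
    total, weight_sum = None, 0.0
    for name, probs in modality_probs.items():
        if probs is None:                     # missing modality: skip it
            continue
        w = weights.get(name, 0.0)
        contribution = w * np.asarray(probs, dtype=float)
        total = contribution if total is None else total + contribution
        weight_sum += w
    if total is None or weight_sum == 0.0:
        raise ValueError("No usable modality available for fusion.")
    return total / weight_sum                 # renormalised fused probabilities

# Hypothetical example: the audio channel is missing, so the fused result
# relies on the face and text modalities only.
fused = weighted_fusion(
    {"face": [0.7, 0.2, 0.1], "audio": None, "text": [0.5, 0.3, 0.2]},
    {"face": 0.6, "audio": 0.3, "text": 0.4},
)
print(fused)  # probabilities over three emotion classes

Renormalising by the summed weights of the modalities that are actually present mirrors the missing-modality scenario the abstract highlights: the fused output remains a valid probability distribution even when a source drops out.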

