Modality encoded latent dataset for emotion recognition

dc.authorid 0000-0003-4236-3646
dc.contributor.author Mert, Ahmet
dc.date.accessioned 2026-02-12T21:05:29Z
dc.date.available 2026-02-12T21:05:29Z
dc.date.issued 2023
dc.department Bursa Teknik Üniversitesi
dc.description.abstract A variational autoencoder (VAE) is an unsupervised learning model that maps high-dimensional input data into a normally distributed latent space. Multi-channel physiological signals, namely EEG and peripheral signals, are widely used in affective computing. In this study, the DEAP dataset is converted into a multimodal latent dataset for emotion recognition. The 40-channel recordings of 32 participants are encoded as separate modalities: the peripheral channels and the 32-channel EEG. First, the short-time Fourier transform (STFT) is used to extract the time-frequency (TF) distribution for training the VAE. The localized components in each channel of the modalities are then converted to a 100-dimensional space using the VAE. The proposed method is applied to each participant's recordings to obtain a new latent-encoded dataset. Within-subject and between-subject classification results on the latent dataset are compared with those on the original data for the peripheral, 32-channel EEG, and peripheral-with-EEG modalities. A naive Bayes (NB) classifier is used to evaluate the encoding performance of the 100-dimensional modalities. Leave-one-participant-out cross-validation (LOPO CV) yields error rates of 0.3322 and 0.3327 for high/low arousal and valence states, respectively, whereas the original data yield 0.349 and 0.382.
dc.identifier.doi 10.1016/j.bspc.2022.104140
dc.identifier.issn 1746-8094
dc.identifier.issn 1746-8108
dc.identifier.scopus 2-s2.0-85138440573
dc.identifier.scopusquality Q1
dc.identifier.uri https://doi.org/10.1016/j.bspc.2022.104140
dc.identifier.uri https://hdl.handle.net/20.500.12885/6977
dc.identifier.volume 79
dc.identifier.wos WOS:000883344100001
dc.identifier.wosquality Q2
dc.indekslendigikaynak Web of Science
dc.indekslendigikaynak Scopus
dc.language.iso en
dc.publisher Elsevier Sci Ltd
dc.relation.ispartof Biomedical Signal Processing and Control
dc.relation.publicationcategory Article - International Peer-Reviewed Journal - Institutional Faculty Member
dc.rights info:eu-repo/semantics/closedAccess
dc.snmz KA_WoS_20260212
dc.subject Variational autoencoder
dc.subject Emotion recognition
dc.subject Multimodal data fusion
dc.subject Latent space
dc.title Modality encoded latent dataset for emotion recognition
dc.type Article
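
The evaluation protocol named in the abstract, a naive Bayes classifier scored with leave-one-participant-out cross-validation on 100-dimensional latent features, can be illustrated with a minimal sketch. This is not the author's code: the function names (`fit_gaussian_nb`, `predict_gaussian_nb`, `lopo_error_rate`) are hypothetical, a Gaussian likelihood is assumed for the NB classifier, the number of trials per participant is an assumption, and the features and labels are synthetic stand-ins rather than DEAP data, so the printed error rate is not comparable to the reported 0.3322 and 0.3327.

```python
import numpy as np

def fit_gaussian_nb(X, y):
    """Estimate class priors and per-feature means/variances (Gaussian NB)."""
    classes = np.unique(y)
    priors = np.array([np.mean(y == c) for c in classes])
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    # A small constant keeps every variance strictly positive.
    variances = np.array([X[y == c].var(axis=0) for c in classes]) + 1e-9
    return classes, priors, means, variances

def predict_gaussian_nb(model, X):
    """Pick the class with the highest log posterior for each row of X."""
    classes, priors, means, variances = model
    diff = X[:, None, :] - means[None, :, :]            # (samples, classes, features)
    log_lik = -0.5 * np.sum(
        np.log(2 * np.pi * variances)[None] + diff**2 / variances[None], axis=2
    )
    return classes[np.argmax(np.log(priors)[None, :] + log_lik, axis=1)]

def lopo_error_rate(X, y, groups):
    """Hold out each participant once; train on all remaining participants."""
    errors = 0
    for g in np.unique(groups):
        test = groups == g
        model = fit_gaussian_nb(X[~test], y[~test])
        errors += np.sum(predict_gaussian_nb(model, X[test]) != y[test])
    return errors / len(y)

# Synthetic stand-in for the latent dataset (assumption, not DEAP data):
# 32 participants, 40 trials each (assumed), 100 latent dimensions, binary labels.
rng = np.random.default_rng(0)
n_participants, n_trials, latent_dim = 32, 40, 100
y = rng.integers(0, 2, size=n_participants * n_trials)
X = rng.normal(size=(len(y), latent_dim)) + 0.2 * y[:, None]
groups = np.repeat(np.arange(n_participants), n_trials)

print(f"LOPO error rate on synthetic data: {lopo_error_rate(X, y, groups):.4f}")
```

Grouping the folds by participant rather than by trial is what makes this a between-subject estimate: no recording from the held-out participant is seen during training.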
