Angular Margin Softmax Loss and Its Variants for Double Compressed AMR Audio Detection
dc.authorid | 0000-0002-6404-1499 | en_US |
dc.authorid | 0000-0002-9174-0367 | en_US |
dc.authorscopusid | 57215422993 | en_US |
dc.authorscopusid | 35781455400 | en_US |
dc.contributor.author | Büker, Aykut | |
dc.contributor.author | Hanilçi, Cemal | |
dc.date.accessioned | 2022-04-01T12:04:13Z | |
dc.date.available | 2022-04-01T12:04:13Z | |
dc.date.issued | 2021 | en_US |
dc.department | BTÜ, Faculty of Engineering and Natural Sciences, Department of Electrical and Electronics Engineering | en_US |
dc.description.abstract | Double compressed (DC) adaptive multi-rate (AMR) audio detection is an important but challenging audio forensic task that has received considerable attention over the last decade. Although the majority of existing studies extract hand-crafted features and classify them using traditional pattern matching algorithms such as support vector machines (SVM), a convolutional neural network (CNN) based DC AMR audio detection system was recently proposed that yields very promising detection performance. Like any traditional CNN-based classification system, the CNN-based DC AMR recognition system uses the standard softmax loss as its training criterion. In this paper, we propose using the angular margin softmax loss and its variants for the DC AMR detection problem. Although angular margin softmax was originally proposed for face recognition, we adapt it to the CNN-based end-to-end DC audio detection system. The angular margin softmax introduces a margin between the two classes so that the system can learn more discriminative embeddings for the problem. Experimental results show that adding an angular margin penalty to the traditional softmax loss increases the average DC AMR audio detection rate from 95.83% to 100%. The angular margin softmax loss functions are also found to boost DC AMR audio detection performance when there is a mismatch between the training and test datasets. | en_US |
dc.identifier.doi | 10.1145/3437880.3460414 | en_US |
dc.identifier.endpage | 50 | en_US |
dc.identifier.isbn | 978-145038295-3 | |
dc.identifier.scopusquality | N/A | en_US |
dc.identifier.startpage | 45 | en_US |
dc.identifier.uri | https://hdl.handle.net/20.500.12885/1843 | |
dc.indekslendigikaynak | Scopus | en_US |
dc.institutionauthor | Büker, Aykut | |
dc.institutionauthor | Hanilçi, Cemal | |
dc.language.iso | en | en_US |
dc.publisher | Association for Computing Machinery, Inc | en_US |
dc.relation.ispartof | IH and MMSec 2021 - Proceedings of the 2021 ACM Workshop on Information Hiding and Multimedia Security | en_US |
dc.relation.publicationcategory | Conference Item - International - Institutional Faculty Member | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | angular margin softmax loss | en_US |
dc.subject | cnn | en_US |
dc.subject | double compressed amr audio detection | en_US |
dc.title | Angular Margin Softmax Loss and Its Variants for Double Compressed AMR Audio Detection | en_US |
dc.type | Conference Object | en_US |